Is it possible to blog in the AI era?

I write short stories every now and then and I throw them online. I also have a tech blog, where I moan about the decisions software I use make and with my “infinite wisdom”, I tell them what they should be doing instead.

I used to host both on Medium, but Medium got greedy. Then it was WordPress, but now even they’re trying to be greedy bastards and use my shit for training AI.

Some would argue that WordPress paid hosting will exempt me from the AI training, but for less than 100 visitors a year, it’s not really worth the expense.

So what is the solution? I ask the greater minds of this community for suggestions.

  • @Grimy@lemmy.world
    link
    fedilink
    0
    edit-2
    6 months ago

    If it’s about having your blog serve as AI training, it doesn’t really matter where you host it, it’s going to get scrapped and included in the data.

  • kamen
    link
    fedilink
    26 months ago

    Host your own stuff. With this little load you can do it on your own hardware with very little resources.

    • @sabreW4K3@lazysoci.alOP
      link
      fedilink
      06 months ago

      Yeah, I was thinking about throwing something on my Raspberry Pi, but didn’t know if I’d open the door to more issues.

      • fmstrat
        link
        fedilink
        16 months ago

        You could also spin up a $5 a month VPS somewhere like Linode.

      • @pacmondo@sh.itjust.works
        link
        fedilink
        06 months ago

        It can be pretty secure if you host it behind a cloudflare tunnel. Then you don’t have to open any ports to the wild west

        • @sabreW4K3@lazysoci.alOP
          link
          fedilink
          06 months ago

          Thank you. I’ve heard so much about CloudFlare tunnels, but don’t know how they work. Do I just point it at an IP and port or is it much more complicated than that?

          • @pacmondo@sh.itjust.works
            link
            fedilink
            26 months ago

            Basically you have to run a mini server (I use a docker container) called a cloudflare endpoint. From there you just enter the IPs and keys that your cloudflare account tells you to in the tunnel creation menu, and it all pretty much connects from there.

            Then, on the cloudflare side, you make different subdomains point to local ports. So, for example, for connecting to qbittorrent web client, in the cloudflare menus I can make qbit.domain.example point to localhost:8080. In this case, it means “localhost” relative to the cloudflare access point you’ve made (which in my case can use localhost because its hosted on the same machine as my other docker containers, but if they are on different machines you can use local IP addresses).

            I use their free plan, which is all you need if you’re just serving web content to a small number of users. You might need a domain to do this, but I don’t recall.

            My layman’s understanding is you basically make cloudflare be the router, so their server/ports are what is exposed to the open internet rather than your local router.

            • @sabreW4K3@lazysoci.alOP
              link
              fedilink
              16 months ago

              Thank you. I run like a million Docker containers and haven’t ever gotten around to looking into this and you’ve just enlightened me perfectly. I appreciate it.

              • @constantokra@lemmy.one
                link
                fedilink
                16 months ago

                If you’re already running a million docker containers then just get a vps somewhere to host your blog. Cheapest reliable one I found last I looked was vultr. I think mine is $15 a year.

  • @eveninghere@beehaw.org
    link
    fedilink
    16 months ago

    I think you should clarify the problem first.

    Privacy? You lose your privacy the moment you publish your blog anyway.

    Is it visibility? You never expected Google to show your blog in most cases.

    AI training? You could self-host and hope companies respect your robot.txt. But what’s the actual problem if you released your blog to the public in the first place? Anybody could’ve copy & pasted your blog also before this AI era.

    • @sabreW4K3@lazysoci.alOP
      link
      fedilink
      16 months ago

      Privacy?

      Privacy is of course my major concern, hence posting to this community. But not tinfoil hat level.

      visibility?

      I’m happy to have my stuff indexed by Google, in fact, I want it to be.

      AI training?

      I’ll take that for 500!

      Anybody could’ve copy & pasted your blog also before this AI era.

      Plagiarism has been an issue since before Confucius was copied by Baffledus. But the cream still rose to the top. However in this AI era, everything is buried as its all just considered a part of the source data.

      robot.txt.

      Stories keep popping up about AI ignoring robots.

        • @sabreW4K3@lazysoci.alOP
          link
          fedilink
          16 months ago

          I guess it comes down to what your definition of privacy is. I’m setting the bar low, I just don’t want to be used to train a large language model

    • @ReversalHatchery@beehaw.org
      link
      fedilink
      English
      1
      edit-2
      6 months ago

      Privacy? You lose your privacy the moment you publish your blog anyway.

      Oh, right, I’m gonna just reinstall facebook on the phone because I’ve lost everything… Oh and we have lost all of privacy by commenting on the internet and stepping out of the house! All resistance is futile! We need to close this community before people waste more of their time!

      This is not at all how it works. How would you lose privacy if you only publish what you want to publish? It’s entirely your decision what to include in your blog post.

  • @Zerush@lemmy.ml
    link
    fedilink
    1
    edit-2
    6 months ago

    Maybe https://bearblog.dev, very simple but I think enough for writing stories, it’s free, OpenSource and private.

    • A privacy-first, no-nonsense, super-fast blogging platform
    • No trackers, no javascript, no stylesheets. Just your words.
    • This is a blogging platform where words matter most.
    • Shun the bloat of the current web, embrace the bear necessities.
    • Looks great on any device
    • Tiny (~2.7kb), optimized, and awesome pages
    • No trackers, ads, or scripts
    • Seconds to sign up
    • Connect your custom domain
    • Free themes
    • RSS & Atom feeds
    • Built to last forever
    • @sabreW4K3@lazysoci.alOP
      link
      fedilink
      16 months ago

      So I’ve been looking at this and my only issues are that it’s not connected to the Fediverse and that it feels too sparse. I would like embedded image support. But thank you for the suggestion.

      • @Zerush@lemmy.ml
        link
        fedilink
        2
        edit-2
        6 months ago

        You can embed images, using Markdown ![](image URL), same as here in Lemmy, also several themes. But yes, it isn’t part of the fediverse.

        Another possibility, if you have an own host, is using the Blue Velvet Editor, its a complete wordprozessor which works right in your browser (PC and Mobile), it saves the documents locally in Html format by default. So you can upload it as is to an host, as webpage. Free, private, no account and blazing fast.

        https://bluevelvet.ssuiteoffice.com

        Screenshot Blue Velvet Editor

  • andrew_bidlaw
    link
    fedilink
    06 months ago

    In spite of you saying it’s not for you, I think that finding a cheap hosting for your blog is the easiest solution there. With some effort you can export your Wordpress.com site onto an actual free WP engine they sit above. Then, with plugins, here come autoreposts to other social media and whatever you want. Low traffic means you can choose an options with the lowest price.

    As a bonus you can also host your portfolio page, get personalized email addresses, a VPN server to wherever it’s hosted, and basically an environment you can put anything to, even your own Lemmy instance.

    On the latter - the population of federated platform is very small but super loyal, and also lacking content. So I feel that even if you won’t consider making or renting your own server, establishing your blog here can get you a lot of interactions. Probably, some admins won’t mind if you make your own /c/ommunity for that as long as it’s not abandoned.

    • @sabreW4K3@lazysoci.alOP
      link
      fedilink
      26 months ago

      Sorry it took me so long to come back to this. Because it was such a valuable reply, I kept it in my inbox so I wouldn’t lose it. Thank you.

      • andrew_bidlaw
        link
        fedilink
        2
        edit-2
        6 months ago

        You are welcome. Hope you’d get the most from whatever you choose (:

  • Autonomous User
    link
    fedilink
    English
    -1
    edit-2
    6 months ago

    Libre software, WordPress, can’t bans us from removing malicous source code. Medium is service as a software substitute. If you don’t want AI reading your text, don’t share it.

  • @rrobin@lemmy.world
    link
    fedilink
    16 months ago

    So lets be clear - there is no way to prevent others from crawling your website if they really want to (AI or non AI).

    Sure you can put up a robots.txt or reject certain user agents (if you self host) to try and screen the most common crawlers. But as far as your hosting is concerned the crawler for AI is not too different from e.g. the crawler from google that takes piece of content to show on results. You can put a captcha or equivalent to screen non-humans, but this does not work that well and might also prevent search engines from finding your site (which i don’t know if you want?).

    I don’t have a solution for the AI problem, as for the “greed” problem, I think most of us poor folks do one of the following:

    • github pages (if you don’t like github then codeberg or one of the other software forges that host pages)
    • self host your own http server if its not too much of an hassle
    • (make backups, yes always backups)

    Now for the AI problem, there are no good solutions, but there are funny ones:

    • write stories that seem plausible but hold high jinx in there - if there ever was a good reason for being creative it is “I hope AI crawls my story and the night time news reports that the army is now using trained squirrels as paratroopers”
    • double speak - if it works for fictional fascist states it works for AI too - replace all uses of word/expression with another, your readers might be slightly confused but such is life
    • turn off your web site at certain times of the day, just show a message showing that it only works outside of US work hours or something

    I should point out that none of this will make you famous or raise your SEO rank in search results.

    PS: can you share your site, now i’m curious about the stories