The Internet AI

Study Finds a Third of New Websites Are AI-Generated

alternative_right shares a report from 404 Media: Researchers working with data from the Internet Archive have discovered that a third of websites created since 2022 are AI-generated. The team of researchers -- which includes people from Stanford, Imperial College London, and the Internet Archive -- published their findings online in a paper titled "The Impact of AI-Generated Text on the Internet." The research also found that all this AI-generated text is making the web more cheery and less verbose.

"The proliferation of AI-generated and AI-assisted text on the internet is feared to contribute to a degradation in semantic and stylistic diversity, factual accuracy, and other negative developments," the researchers write in the paper. "We find that by mid-2025, roughly 35% of newly published websites were classified as AI-generated or AI-assisted, up from zero before ChatGPT's launch in late 2022."

"I find the sheer speed of the AI takeover of the web quite staggering," Jonas Dolezal, an AI researcher at Stanford and co-author of the paper, told 404 Media. "After decades of humans shaping it, a significant portion of the internet has become defined by AI in just three years. We're witnessing, in my opinion, a major transformation of the digital landscape in a fraction of the time it took to build in the first place."

Maty Bohacek, a student researcher at Stanford and one of the co-authors of the paper, added: "As AI-generated content spreads, the challenge is finding a role for these models that doesn't just result in a sanitized, repetitive web," he said. "Rather than forcing models to be perfectly compliant and agreeable, allowing them to have a more distinct personality or 'friction' might help them act as a creative partner rather than a replacement for human voice."


  • by oldgraybeard ( 2939809 ) on Monday April 27, 2026 @07:15PM (#66115510)
    The Internet is being buried with AI generated slop being created, indexed, summarized and regurgitated as even more AI slop to be consumed by AI bots to generate even more AI slop. With anything really creative, innovative, informative and true being the needle in the proverbial haystack and effectively hidden.
    • My website was king for 10 years. Now it's buried under 50 rudimentary AI clones churning out blog posts about how their software is better, whilst they blanket Reddit with new accounts promoting themselves and spreading misinformation about mine. Reddit seems to ban a lot but not all. Impossible to compete. Still have my core users but they're dwindling. Was a fun run.
  • Just as bad when more than a 1/3 of the web was WordPress or whatever the CMS du jour is.

    It's an attention grabbing headline until you think about how many small business websites there are trying to keep the effort trimmed down, like the local barber, dog groomer, or mohel.

    • "Just as bad when more than a 1/3 of the web was WordPress or whatever the CMS du jour is." You may be underestimating the sheer volume of AI slop being generated today.
      • by Himmy32 ( 650060 )

        Yeah, there's a lot of AI slop and it's a problem. But I am not going to pretend that most web content has ever been artisanal and well curated. Well, maybe back in the day when my GeoCities page for my dog was part of the most well-regarded Webring. At least then Tom was still my friend.

    • by karmawarrior ( 311177 ) on Monday April 27, 2026 @08:42PM (#66115676) Journal

      Wordpress is a framework for publishing web content. It's not really relevant here. You can publish slop using AI too. And the fact a website is Wordpress based does not mean the content is good or bad.

      This is idiotic snobbery, and you should know better.

      • by Himmy32 ( 650060 )

        Of course using WordPress or even GeoCities doesn't mean the content is instantly bad, just easy to deploy. But that's the same for sites classified as AI-assisted, which would include a technical blog post by a bilingual person using an LLM to fix improper idiom use. Knowing that 48% was just one framework, which enables the publishing of web content of which a significant percentage is low-effort or even human slop, contextualizes the meaning of per-site statistics.

        There's many wonderful WordPress sites, there were some g

  • Somebody is writing things as if they expected something different to happen.

    Yeah I am sure there are still people out there that hand code web sites.

    Probably many more that use template-based tools that have been developed over years and they are familiar with them. Many of those.

    For myself, if I started a new web site now I would use AI to do it because it is a better tool than any other. It is just a tool. You can get good or bad results from it like any other tool, depending on how skilled you

    • Did you even read the summary? I know we're on Slashdot though, so ... fair enough. It's not about hand-coding websites, it's about the *content*. Imagine that somebody wants to share something with the world via the internet. Now, this sharing is at 66% efficiency, with the added bonus that the content will be stolen, rehashed and added to the already crowded competition for user attention. It's a bit grim, imo.

      • Good? cut out the middle man. we can chat with a bot ALL the time and stop surfing the web. why bother to fake websites and fake loaded search results? just have the bot do everything directly! add in some randomized tone/attitude by topic for variety like multiple sources do... conspiracy crazy people and idiot big mouths can be replaced with hallucinations, possibly with a lower occurrence.

        new chat. repeat. foobar.

  • I thought people choosing to hide the truth (Fox News, Moms for Liberty [to restrict others' education]) would cause the idiocracy. It seems the idiocracy will come from corporations using AI to echo sane-washed propaganda to other AIs.
  • Like uBlock Origin will add something to block the AI-generated websites. I ran across a few myself and noticed it too, after opening a link only to be presented with clickbait content.
    • by Himmy32 ( 650060 )
      uBlock Origin itself could do it. Just need someone to make a list that we can subscribe to that blocks all of the slop sites.
  • We had GPT-2 and other ways to generate sloppy websites before 2022.
    • by ebunga ( 95613 )

      Not to mention the affiliate marketing SEO spammers had their own little AI-based respinners going back a little further, once the simple string-substitution respinners were getting clobbered in Google search results.

  • "Rather than forcing models to be perfectly compliant and agreeable, allowing them to have a more distinct personality or 'friction' might help them act as a creative partner rather than a replacement for human voice."

    It would also make AI more difficult for humans to detect. Being able to spot AI is the reason that key parts of the internet are still usable and human trust in it hasn't been broken yet.

  • If you don't feel like writing things, then I'm not going to read them myself. We will plug AI into AI as a big circular human centipede ouroboros.

    I guess us humans will have to do things that don't involve mass media consumption.

    R.I.P. late stage capitalism

  • The problem is not that there are a lot of AI slop pages being generated every day. The problem is that the search results you get back from Google are increasingly dominated by these AI slop pages. It's becoming difficult to avoid them and the misinformation they spew.

  • GIGO (Score:4, Insightful)

    by Registered Coward v2 ( 447531 ) on Monday April 27, 2026 @07:46PM (#66115606)
    So AI will be training itself on stuff generated by itself, the ultimate self-licking ice cream cone. Reminds me of the game of putting a paragraph repeatedly through translation software back and forth between two languages until you got gibberish.
  • by Arrogant-Bastard ( 141720 ) on Monday April 27, 2026 @08:03PM (#66115632)
    The damage that it's going to do to the Internet, and to society, and to education, government, and all the other components of society, is staggering. An enormous amount of work done by dedicated people over decades will be swamped by the flood of AI slop, and I don't think we'll know what we've lost until it's gone.

    Many readers of this site are likely familiar with various sci-fi stories that deal with nanobots which have begun reproducing without limit, eventually consuming all resources and reducing their planet to "gray goo". This is the information equivalent: it will expand to occupy everything that it possibly can, overwhelming everything generated by humans. And when that happens, it will impact our shared view of reality, which is based on a (mostly) common set of facts.

    And when nothing is real, anything can be real. This will not escape the attention of would-be fascists and dictators.
  • I find the repetitive, flowery crap that claims to be a website today to be quite useless. Multiple sites on a subject have carbon copied content (or at least lead-ins). It is the quintessential enshittification of the web.

    Curious what this brings us next...

  • Is this why websites have all gotten so slow? They all need to use at least 100 libraries just to put up a blog page.
    • Not all.

      I edit my business's webpage with vim. It's plain html and has only two graphics (stored in the same directory).

      And I get people complimenting me on its design, which I find amazing.

  • The www. The open web is finished. The www is for logged in services only now. The things you have to do, booking air tickets, banking, taxes.
