Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×
The Internet Technology

The Internet Is Rotting (theatlantic.com) 106

Too much has been lost already. The glue that holds humanity's knowledge together is coming undone. From a report: It turns out that link rot and content drift are endemic to the web, which is both unsurprising and shockingly risky for a library that has "billions of books and no central filing system." Imagine if libraries didn't exist and there was only a "sharing economy" for physical books: People could register what books they happened to have at home, and then others who wanted them could visit and peruse them. It's no surprise that such a system could fall out of date, with books no longer where they were advertised to be -- especially if someone reported a book being in someone else's home in 2015, and then an interested reader saw that 2015 report in 2021 and tried to visit the original home mentioned as holding it. That's what we have right now on the web.

[...] People tend to overlook the decay of the modern web, when in fact these numbers are extraordinary -- they represent a comprehensive breakdown in the chain of custody for facts. Libraries exist, and they still have books in them, but they aren't stewarding a huge percentage of the information that people are linking to, including within formal, legal documents. No one is. The flexibility of the web -- the very feature that makes it work, that had it eclipse CompuServe and other centrally organized networks -- diffuses responsibility for this core societal function.

This discussion has been archived. No new comments can be posted.

The Internet Is Rotting

Comments Filter:
  • Oh my! (Score:5, Informative)

    by laejoh ( 648921 ) on Friday July 02, 2021 @09:52AM (#61543958)
    • by crgrace ( 220738 )

      Netcraft confirms it.

    • Well, depening on your definition of death, it already happened.

      It's like temperature: How many Kelvin can be left for it to be dead for you? A signal/noise ratio of 0.1? 0.1E-6? 0.1E-12?
      For me, it is 401 Kelvin. (Leetspeak for "AOL", as in: Eternal September.) ;)

      • by Anonymous Coward
        Today is Friday, September 10167th, 1993
    • by shanen ( 462549 )

      I guessed that your link was going to point at the Jargon file equivalent of this one:

      https://en.wikipedia.org/wiki/... [wikipedia.org]

      But usenet actually has died. It sort of still exists, but the current pulse is on the order of one beat per month.

  • Really? (Score:4, Insightful)

    by CantankerousCoot ( 8313012 ) on Friday July 02, 2021 @09:56AM (#61543970)
    "Imagine if libraries didn't exist..." Why would I do that? Libraries have existed for *at least* 2600 years. They're still here. It's going to be okay.
    • You have aphantasia, don't you?
      • No, I just remember a time before the WWW. Somehow, knowledge managed to accumulate and be passed-on without it.
        • No, I just remember a time before the WWW. Somehow, knowledge managed to accumulate and be passed-on without it.

          Apple and Oranges. Libraries store curated knowledge, not "all" information as some internet advocates imagine the internet should do.

          The internet is forgetting some things. Forgetting is not necessarily bad, forgetting some things is also considered by some to be learning. One is bombarded with information, more successful individuals are better at selecting what to forget. Forgetting is part of the learning process.

          • Show me a site on the internet which does NOT curate knowledge. It doesn't exist, because any place which tries instantly piles up with illegal content and gets used as a free storage dump.
            • by drnb ( 2434720 )

              Show me a site on the internet which does NOT curate knowledge. It doesn't exist, because any place which tries instantly piles up with illegal content and gets used as a free storage dump.

              "Curate" in the library sense is not simply removing the illegal, there are also quality and importance considerations.

      • You have aphantasia, don't you?

        That was a pretty cool movie, Wasn't it?

    • ""Imagine if libraries didn't exist..." Why would I do that? Libraries have existed for *at least* 2600 years. They're still here. It's going to be okay."

      Yes, in another 2600 years there will be a billion LitRPG books and no science books and all the links point to Nirvana.

    • by PPH ( 736903 )

      Libraries have existed for *at least* 2600 years.

      But they end up containing prayer books instead of important scientific works [wikipedia.org]. Convince me that librarians don't have their own political/social agendas. Or just ignorance.

  • What a load of BS. (Score:5, Insightful)

    by luis_a_espinal ( 1810296 ) on Friday July 02, 2021 @09:58AM (#61543980)

    The glue that holds humanity's knowledge together is coming undone...

    Stop with the histrionics. Link rot is part of the internet. Rot is part of any system. Systems grow around them or fall (and something else take their place.) This is just bullshit first-world-problems seeking for attention.

    • by Ritz_Just_Ritz ( 883997 ) on Friday July 02, 2021 @10:02AM (#61544008)

      It's The Atlantic. Histrionics is what they do.

    • I think the real story is someone was able to write over 6,000 words about link rot. Either The Atlantic really needed to fill some space, or they need a new editor.
      • I think the real story is someone was able to write over 6,000 words about link rot. Either The Atlantic really needed to fill some space, or they need a new editor.

        TFA talks about other related things, like how things are more malleable in the digital age. For example, I thought this was interesting about how an e-book version of "War and Peace" had every instance of the word "kindle" changed to "nook":

        Ebooks don’t have those limitations, both because of how readily new editions can be created and how simple it is to push “updates” to existing editions after the fact. Consider the experience of Philip Howard, who sat down to read a printed edition of War and Peace in 2010. Halfway through reading the brick-size tome, he purchased a 99-cent electronic edition for his Nook e-reader:

        As I was reading, I came across this sentence: “It was as if a light had been Nookd in a carved and painted lantern ” Thinking this was simply a glitch in the software, I ignored the intrusive word and continued reading. Some pages later I encountered the rogue word again. With my third encounter I decided to retrieve my hard cover book and find the original (well, the translated) text.

        For the sentence above I discovered this genuine translation: “It was as if a light had been kindled in a carved and painted lantern ”

        A search of this Nook version of the book confirmed it: Every instance of the word kindle had been replaced by nook, in perhaps an attempt to alter a previously made Kindle version of the book for Nook use.

        • TFA talks about other related things, like how things are more malleable in the digital age.

          Honestly, I never made it that far. After the third anecdote about link rot, I was done. I quickly scrolled through the rest looking for anything interesting. I must have missed the section on Ebooks.

          I'm just not a fan of the author's writing style, and I couldn't push through it any longer.

    • by ahoffer0 ( 1372847 ) on Friday July 02, 2021 @10:44AM (#61544240)

      I don't think you read the article. Did you get to the part about digital subscriptions instead of hard copy periodicals? Or the part where authors change the content of digital books without the purchasers' ever knowing?

    • by ArchieBunker ( 132337 ) on Friday July 02, 2021 @10:46AM (#61544244)

      It's a real problem when so many people get knowledge from Wikipedia and you go to read the sources and none of them work anymore.

      • by Coren22 ( 1625475 ) on Friday July 02, 2021 @11:21AM (#61544434) Journal

        I wonder if it would work better if Wikipedia made all their links run through the internet archive. That way they would have a link that doesn't rot, as the information is backed up. They could even point to a specific version of the article, so that later changes don't cause the information to be invalid.

        I suppose that would take some more funding for the internet archive, but perhaps the cooperation would make both of them better for it.

        • I wonder if it would work better if Wikipedia made all their links run through the internet archive.

          Aren’t they already doing that? They’ve had a partnership since 2016, with the IA archiving outbound links and then fixing them when/if they break.

          https://diff.wikimedia.org/201... [wikimedia.org]

      • by ebvwfbw ( 864834 )

        It's a real problem when so many people get knowledge from Wikipedia and you go to read the sources and none of them work anymore.

        That's because the original source was probably BS to begin with. They don't listen to experts and listen to idiots. I've seen many world experts just say FU to them because they're too stupid to accept the help of some of the best people in the world that know what they're talking about. It's like a leftist click.

    • The glue that holds humanity's knowledge together is coming undone...

      Stop with the histrionics. Link rot is part of the internet. Rot is part of any system. Systems grow around them or fall (and something else take their place.) This is just bullshit first-world-problems seeking for attention.

      People just thought the intertoobz was something different than it actually was. Of course links go away. Of course it is trivial to change documents.

      People just tried to bend it to what they thought it should be. And they all found out that it bends to no one. The internet simply reflects humanity.

    • Systems grow around them

      Exactly.

      That miss on the author's part, along with the absurd library analogy, turns this whole thing into an eye roll. Yeah, if a physical book was lost in some guy's house that is a shame. This is the internet, where everyone who wanted one got their own actual copy of the book for $0 marginal cost, so if Bob lost his under his couch no one other than Bob was impacted.

      And assuming Bob has a bittorrent client, he doesn't even have to bother looking under his couch.

      • by jythie ( 914043 )
        This kinda highlights the problem of the rot though. Things only stay on the internet if _someone_ is actively keeping them there. Works great for big things a lot of people know about, but can drop off rapidly, esp when talking about smaller things that might be captured in things forum posts or even news articles. In the past you could go to a library and look through newspaper archives going back a century or more as well as tons of other information that people access infrequently. Depending on 'ot
      • by vux984 ( 928602 )

        This isn't about books. This isn't about copies, this certainly isn't about marginal costs, and this isn't solved with a bittorrent client. You missed the plot completely.

        The library analogy is precisely that the internet has no capable equivalent. If I give you book or magazine name, article title, page number, date published, etc, that citation is a stable reference to an identifiable and (relatively) immutable physical body of work that can be located (relatively) easily.

        Maybe the local library has a cop

    • by Subm ( 79417 )

      Easy to call other's problems first-world, but will you speak so glibly when your favorite porn site goes down?

    • While I agree this is just slow news day hysterics, it's important to point out that one of the main reasons there are "1st world" countries us because they put considerable effort into education and preservation of knowledge.
    • 100 years ago most of the humans on the planet had three major worries that exceeded any other of their worries by at least an order of magnitude: 1)worrying about gathering and hunting enough food so they don't go hungry or worse, starve 2)hoping the untreated water they drink doesn't give them explosive dysentery, and 3) having a warm dry safe place to sleep for the night.

      Today in 2021, people without real problems of the past, can write an article about broken web links as if it is one of the largest p

  • by oldgraybeard ( 2939809 ) on Friday July 02, 2021 @10:00AM (#61543990)
    maybe I was the only one seeing information getting harder to find under the massive pile of obsolete, old and just junk info on today's commercialized web. And we have not even gotten to the massive stinking piles of advertising being pushed out on the internet.
    • Not to forget search engine amnesia, especially Google. Sometimes it feels like they have de-listed anything technical older than 5 years, especially datasheets.
    • maybe I was the only one seeing information getting harder to find under the massive pile of obsolete, old and just junk info on today's commercialized web. And we have not even gotten to the massive stinking piles of advertising being pushed out on the internet.

      Which is why forgetting can be an important part of learning. The problem is selecting what to remember and what to forget.

      Of course having to replicate data in order to share it does have its advantages due to redundancy. With only one copy of the data and sharing being done by links we do have some fragility. Sort of a poor backup policy problem.

    • maybe I was the only one seeing information getting harder to find under the massive pile of obsolete, old and just junk info

      Maybe you are. I for one think it's too hard to find old and obsolete info when you need it. Just because it's old and obsolete doesn't mean it's not valuable. History often helps with the context of today's discussion.

      And today's discussion is polluted with 1000 clones of each other. Seriously a current story tends to produce search results of an echo chamber literally all saying the same stuff in some cases verbatim and even worse linking to each other in a circular fashion as a citation. If they make a c

  • they represent a comprehensive breakdown in the chain of custody for facts

  • by MBGMorden ( 803437 ) on Friday July 02, 2021 @10:07AM (#61544028)

    It's simply a fact of existence that information is lost over time. Do any amount of genealogical research and you'll quickly realize how often "records lost in a fire" crops up. Servers going offline is simply another version of that. And even if a website goes offline if it was a truly popular site you'll find often times that someone has archived it.

    Yes, despite digital copying being prevalent, in 500 years every single forum conversation we ever had will not likely be available. This very comment I'm typing will not likely be stored anywhere, but the amount of information that IS still available will likely dwarf what we've been able to piece together from any prior time in history.

    • It's simply a fact of existence that information is lost over time.

      Yes, but pre-www sharing involved replication. Post-www sharing heavily relied on linking. Perhaps making data more vulnerable due to the sort of occasional natural or human disasters you refer to.

      Think of it as a backup problem. Perhaps there needs to be some effort to automatically back up things using something analogous to the original Google page rank algorithm. Lots of links to information, it gets backed up. OK, maybe some selectivity is needed to separate the scientific data from the advertisemen

    • Re: (Score:2, Interesting)

      by PPH ( 736903 )

      you'll quickly realize how often "records lost in a fire" crops up

      Very common for African American and Palestinian property records.

  • Ummm...What? (Score:5, Insightful)

    by jfdavis668 ( 1414919 ) on Friday July 02, 2021 @10:09AM (#61544042)
    There is lots of content. No one is organizing it. Why would someone be in charge of that? This article is pointless.
  • by bettersheep ( 6768408 ) on Friday July 02, 2021 @10:10AM (#61544044)

    The Internet is more humanity's drool than its knowledge.

    • The knowlegde is in there. It's just buried in an ocean of drool, and when you reach it, you notce it is badly explained.
      (Case in point: Try finding a video on YouTube, that *actually* explains how magnets work. You can tell, when they are mentioning spin and relativistic effects.)

      We need a web portal for "distillers". But not like Wikipedia, but with strict structuring into semantic graphs. That would be a game changer. (There would be a tool that displays it in an easy way for everyone.)

  • Any reliable, curated, objective sources of information will remain accessible or otherwise archived. It should be acknowledged that this composes a very tiny percentage of the Internet as a whole.

    All the rest of the Internet is memes, blogs and social media which get buried and crushed under their own weight. Humanity will not miss them if they aren't preserved.

    • "Objective" is a weasel word though. It implies a popsci view of science, where reality is assumed to be absolute. Which just isn't the case, and even if it was, it would be unobtainable for a human anyway, as we have known for at least a 100 years. Not with our neural nets that only works by detecting patterns via biasing input based on past input. Not with our model of reality having to be mostly hearsay out of sheer necessity (time constraints) anyway. Not with reality being realtive anyway.

      Your intentio

      • by Tarlus ( 1000874 )

        That's fair. I tend to use the word "objective" to mean "unattached to any bias or agenda" which isn't accurate.

  • by TomGreenhaw ( 929233 ) on Friday July 02, 2021 @10:24AM (#61544132)
    The Internet is largely evolving on its own. There are guardrails that the technology enforces and large organizations have their own defacto standards, but the Internet's content is basically unstructured data.

    One of the human mind's features is selective forgetfulness. If we remember every detail of our existence, we would become overloaded with useless information. The Internet needs the same thing and what we are seeing is a natural consequence of its usage.

    What we are seeing on the internet is content being "forgotten" because nobody deems it useful to maintain or access. Is this really a bad thing? Should browsers disable broken links and search engines deprecate useless pages?
    • I see your point, but I would say that the internet is a beautiful, wonderful wildflower; rather than calling it a weed.

    • by PPH ( 736903 )

      What we are seeing on the internet is content being "forgotten" because nobody deems it useful to maintain or access.

      And also so that they can patent it all over again. There is a lot of stuff out there that people are taking (or given) credit for 'on the Internet' that isn't their original creation.

      Anecdote: A bunch of us were sitting around discussing the origins of certain phrases or quotes. Specifically when and in which movies they first appeared in. One of my favorites "Come in, Rangoon" returns the answer of "Beyond Rangoon", released in 1995*. Except that I can vaguely recall having seen it, and used it when I wa

    • What we are seeing on the internet is content being "forgotten" because nobody deems it useful to maintain or access.

      No, it's being forgotten because the person who published the information no longer sees value in keeping it up (regardless of what possible consumers of that information, now or in the future, think). Or because a company reorganizes its website and is too obsessed with new-and-shiny to care about the old customer support forum which contained the kind of information only a few customers would need in a year, but which will take days to rediscover now that the forum post that summarized the solution has be

  • by iggymanz ( 596061 ) on Friday July 02, 2021 @10:26AM (#61544146)

    Haha, computer based things have the shortest lifespan of all for retaining information. Most the information that was on say 1970s computers is lost forever. No surprise info on the internet would undergo same fate, and we have the added hilarity of young people revising information in places such as Wikipedia for politicial correctness, virtual signalling, social agenda... or not allowing articles at all because they couldn't find internet article on it rather than getting their ass to libraries. What a farce.

    The situation in other realms is some better but not as good as people imagine.

    Even acid free paper will only last 200 years under normal conditions, that 1000 year claim is for special expensive storage where the AC will last 1000 years (hint, in much less time than that it's gonna be fucked)

    Your really old books and documents aren't on paper at all, ink on vellum (the real stuff, skin of calf) might last more than that in a cave in the middle east or Mediterranean monastery, but we normally don't use that now.

    • Wikipedia has always been a farce pretending to not be a farce.
    • Haha, computer based things have the shortest lifespan of all for retaining information. Most the information that was on say 1970s computers is lost forever. No surprise info on the internet would undergo same fate, and we have the added hilarity of young people revising information in places such as Wikipedia for politicial correctness, virtual signalling, social agenda

      That's getting a little harder to do, fortunately.

      Your really old books and documents aren't on paper at all, ink on vellum (the real stuff, skin of calf) might last more than that in a cave in the middle east or Mediterranean monastery, but we normally don't use that now.

      While looking at ways to archive materials, it was difficult to come to any other conclusion than yours. In fact, we had to come up with "active archiving", which is a royal PITA, because you just don't archive something once, you continue re-archiving the same material over time.

      The closest thing to "permanent" digital archiving we have is actual punched holes in huge reels of acid free paper tape, printed and read on a machine that is simply constructed,

      • That's getting a little harder to do, fortunately

        How has it been getting harder? Did they enact new policies?

        • That's getting a little harder to do, fortunately

          How has it been getting harder? Did they enact new policies?

          Wikipedia has editing tools and guidelines that deal pretty well with what they call "vandals" and Drama Queens, and if you go in there to post stupid woke stuff, thay can revert it back pretty quickl

      • oh the "word processors" of the 1950 and 1960s had those paper punch tapes, my mom at a school worked one, being fed pairs of tapes, one of letters or records, another of names and addresses.

        You couldn't even store 1Mbyte of information on a single tape though, too unwieldy to carry and prevent breakage when using.

        The tape won't last if it's not cared for properly though, it's making an assumption the air condition will work for decades or centuries. Just as your active archiving always assumes someone wil

    • Haha, computer based things have the shortest lifespan of all for retaining information.

      That's why I use Jordanian Salt Cave LLC to back up all my important data to papyrus. "The magic is in the jars."

  • by Pollux ( 102520 )

    Links come and links go. Welcome to the web. Reminds me of when I was cleaning out my old HS papers when moving out after graduating college. Found my senior research paper with three 4-year-old links. None of them worked anymore. That was back in 2003.

    This makes me reminisce about my first webpage I made in college, which included the perfunctory "Links Page" that's all but died from the modern internet. Of all the links I had to corners of the internet I thought were cool back then, one still surviv

  • This is exactly why.

    Also, PROTIP from Slashdot to The Atlantic: The web is not the Internet. Nowadays, it's just that part of the Internet where we herd those that still print out the Internet or use iPads. ... Like you. ;)

    I'm already working on something better though, so I just treat it as a lost cause.

  • Before, links rot because websites go down. Now link rot because the social platform companies ban people and remove all their posts. Or they lock writings / photo behind account login such that auto web crawling / archiving no longer works.
    • Hey, don't blame the game but the player. But you can play too. Create a bunch of sock puppet accounts, find a target, peruse their posting history and find stuff that you can label as "hate speech". Have all your sock puppets complain and get that person kicked.

      With some effort you can take over social media platformss that way.

  • I'm sure the rot stopped when they took down rotten.com in 2012. No? :smirk:
  • by virtig01 ( 414328 ) on Friday July 02, 2021 @11:06AM (#61544360)

    IPFS [ipfs.io], Arweave [arweave.org], and other projects are being used to build the permaweb. The Atlantic writes 6k words about link rot, but somehow failed to Google around for possible solutions.

    • IPFS, Arweave, and other projects are being used to build the permaweb. The Atlantic writes 6k words about link rot, but somehow failed to Google around for possible solutions.

      IPFS is unfortunately no real solution as it does not provide for automated data replication. Arweave might be more popular if it didn't depend on Erlang23.

  • Some of what they're talking about here is unintentional.
    But some of it is very much intentional.
    If corporations don't want you having something digital, or knowing something, they can easily make it disappear off the Internet.
    That is the real problem with so many things being in 'The Cloud', everything being on 'streaming services', and being herded away from buying physical media like CDs, DVDs, Bluray, and even printed books.
    Some of you have experienced this already with 'e-books': you don't actuall
    • In the near future, owning physical things will become prohibitively expensive due to the collapse of trading with China. We're approaching an end of an era of cheap stuff packed into wally world, and back to the pre-20th century standard where the average person in the middle class has only a few precious personal possessions.

      • I'd also add that housing is getting so crazy expensive everywhere that we are all about to have to live in a van or 1-bedroom studio apartment. So, we won't have room for anything but the most precious physical items. I agree that the cheap Chinese crap-flow is about to hit some turbulence, but keep in mind that the Chinese haven't fully developed any domestic demand for their shitty stuff. So, it'd hurt them severely if trade flows slowed or stopped. I think that's why they help prop up the dollar, rather
      • I don't think you get what I mean. They don't want people to 'own' ANYTHING. They're RENT SEEKERS. They want you to RENT everything. They'd make you pay for the air you breathe if they could figure out a way. I'm trying to tell a cautionary tale here.
      • Our supply of cheap goods isn't going anywhere. India and other countries in the area are increasingly stepping up with labor that's as cheap or cheaper because of China's growing labor costs, and Africa can step up after them. In a century or two when there is no more 3rd world with cheap labor... but then robots will have entirely replaced humans for menial jobs like assembly. Beyond that, labor is only a tiny percentage of cost for a massive array of goods... tons of stuff could pay developed nation wage
    • I own books, have a couple print magazine subscriptions, watch DVD/BR for most movies, and still have ebook and Netflix subscriptions. The crappy paperbacks I don't buy because I e-read those books doesn't bother me, but not owning LOTR in print or something, now that would bother me. I watch crappy movies and anime on Netflix but the truly great movies I love I own on disc. So, the bottom line is that I use the ephemeral mediums for craptasic consumables and physical media for the good shit. Kinda like wh
  • Anything as large and rich as the internet can't ever be completely organized, tied down, or defined. The library analogy can obviously be applied to particular (small) parts of the internet, but is a very misguided way of thinking about it as a whole. It's more like the ocean, vast, teeming with life, evolving, flowing, with hidden currents, producing crazy creatures and vast interlocking ecosystems. Humans never had a prayer of controlling, cataloging, stopping, or completely understanding the ocean, and
  • Well for a lot of stuff it's not that critical (for other than historians in the future), think about it , yhe n/th article about how buggy Fallout76 was at launch, or how to work around an obscure bug in .Net framework v2.0, or to 10000th plogpost about wout som big tittted reality stralet did whith whom. Is it really critical that this stuff stays online forever? Important stuff, well at least what traditionally print and tv covers get archived anyway.
  • My brother has a folder on his computer called OldComputr. This has an image of the disk on his previous computer. That has an image of the disk of the computer before that. all they way back to the first computer he owned, and then it contained files from other computers. As long as storage gets cheaper per byte this is likely to continue.

    In https://web.archive.org/ [archive.org] there lies the Wayback machine. This does not archive everything. It does not archive everything that people push at it, so their version c

  • Well, that could be a problem. (Sees article is from The Atlantic) Whew! Nevermind!
  • "Link rot" is not due to the architecture of the Internet. It's because at some point, people don't want to pay their hosting fees anymore. As the Internet has become more commercialized and profit-focused, things may disappear a bit faster than they did in the past than when universities were in charge. That's... just the way things are.

    It's sad how many people believe that cloud centralization or that reworking the usability of a web browser URL bar will fix all this. You can't force people to host th

  • The Internet shall never forget Harambe. That's all that matters. A mighty alpha beast lord taken too soon. RIP.

"The great question... which I have not been able to answer... is, `What does woman want?'" -- Sigmund Freud

Working...