Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
The Internet

Cloudflare and the Wayback Machine, Joining Forces For a More Reliable Web (archive.org) 17

Cloudflare and the Internet Archive are now working together to help make the web more reliable. Websites that enable Cloudflare's Always Online service will now have their content automatically archived, and if by chance the original host is not available to Cloudflare, then the Internet Archive will step in to make sure the pages get through to users. From a report: Cloudflare has become core infrastructure for the Web, and we are glad we can be helpful in making a more reliable web for everyone."The Internet Archive's Wayback Machine has an impressive infrastructure that can archive the web at scale," said Matthew Prince, co-founder and CEO of Cloudflare. "By working together, we can take another step toward making the Internet more resilient by stopping server issues for our customers and in turn from interrupting businesses and users online."

For more than 20 years the Internet Archive's Wayback Machine has been archiving much of the public Web, and making those archives available to journalists, researchers, activists, academics and the general public, in total to hundreds of thousands of people a day. To date more than 468 billion Web pages are available via the Wayback Machine and we are adding more than 1 billion new archived URLs/day. We archive URLs that are identified via a variety of different methods, such as "crawling" from lists of millions of sites, as submitted by users via the Wayback Machine's "Save Page Now" feature, added to Wikipedia articles, referenced in Tweets, and based on a number of other "signals" and sources, such multiple feeds of "news" stories. An additional source of URLs we will preserve now originates from customers of Cloudflare's Always Online service. As new URLs are added to sites that use that service they are submitted for archiving to the Wayback Machine. In some cases this will be the first time a URL will be seen by our system and result in a "First Archive" event.

This discussion has been archived. No new comments can be posted.

Cloudflare and the Wayback Machine, Joining Forces For a More Reliable Web

Comments Filter:
  • Comment removed based on user account deletion
  • Archive.is not .org blocks cloudflare and browsers the owner dosen’t like. By partnering with it’s competitor cloudflare is sending a message that they are the real organisation in charge of the internet.
    • --- cloudflare is sending a message that they are the real organisation in charge of the internet. --- .... Good, google needs some competition in that space. What we need now is a cloudflare browser. Though that may not be necessary, as cloudflare seems to be gaining its foothold in the lower levels of the network stack.
      • Good, google needs some competition in that space.

        Definitely agree, Google needs more competition in all the markets they are in.

        What we need now is a cloudflare browser.

        Oh! I would love to see what a cloudfare browser would be like. Probably just another Chromium browser, but hey, who knows!

        • by aitikin ( 909209 )

          Oh! I would love to see what a cloudfare browser would be like. Probably just another Chromium browser, but hey, who knows!

          Or another fork of Firefox. Neither of which I feel is really necessary today...

    • This can't be good, despite how it looks on the surface.

      It isn't too hard to guess who will be 'wearing the pants' in this relationship (Hint: It won't be Internet Archive).

      With that said, please tell me that somebody is backing up Google's Usenet archive. Google has started to fuck around with that as well ("Fresh new look! Blah blah blah") and that is a good sign that it may end up on the chopping block soon.

  • WTG, Brewster. Keep it going... it's working.

  • When they say "websites that enable Cloudflare's Always Online service", they mean websites for which someone has enabled Cloudflare's Always Online service. I have never had anything to do with Cloudflare, yet their web crawler came by to copy my site. BANNED.
  • Step 1: team up with the Internet Archive
    Step 2: make it so every time cloudflare goes down, traffic quietly reroutes to the most recent snapshot at the Internet Archive
    Step 3: save face

    • Probably. Still feels like they got turds in our peanut butter, though, especially since Crimeflare is already pulling a MITM on half the web.

We all agree on the necessity of compromise. We just can't agree on when it's necessary to compromise. -- Larry Wall

Working...