Internet Archive's Wayback Machine under severe threat by publisher blocks
Originality AI found 23 news sites blocking the Internet Archive’s crawler, while more than 100 journalists signed a letter backing the archive.
7 Articles
With over a trillion pages stored, the Wayback Machine is the largest archive of the World Wide Web. It provides insight into the web's history and serves as a "memory" of the Internet, and it is also used to uncover changes to and deletions from websites and to hold those responsible to account. Recently, however, numerous media outlets of all kinds have begun blocking the archive. Now journalists are protesting: they see the Wayback Machine in danger.
The Internet Archive Is Increasingly Restricted by Publishers – Pixel Envy
Kate Knibbs, Wired: A number of other major journalism organizations have also recently moved to restrict the Wayback Machine from archiving their stories, including The New York Times. According to analysis by the artificial-intelligence-detection startup Originality AI, 23 major news sites are currently blocking ia_archiverbot, the web crawler commonly used by the Internet Archive for the Wayback project. The social platform Reddit is too. Ot…
Mainstream US Media Block Internet Archive’s Wayback Machine to Prevent AI Abuse
Several major American media outlets have taken steps to block access to the Internet Archive’s “Wayback Machine” in a move aimed at preventing the tool from being exploited for AI training purposes. This decision comes amid growing concerns over unauthorized data scraping and the potential misuse of archived web content in the development of artificial intelligence models. The Internet Archive’s Wayback Machine has long been a valuable resource…
Internet Archive’s Wayback Machine under severe threat by publisher blocks
The Internet Archive's Wayback Machine is one of the web's most valuable resources, enabling us to access earlier versions of webpages and websites. It performs an invaluable role in preserving information that would otherwise be lost when websites go offline, as well as providing a practical tool to track updates made to a web page. However, the organization says that it is now under severe threat thanks to media organizations blocking access t…
Coverage Details
Bias Distribution
- 50% of the sources lean Left, 50% of the sources are Center


