Reddit is restricting its availability to the Internet Archive's Wayback Machine
Reddit has imposed new restrictions on the Internet Archive's Wayback Machine, sharply limiting its ability to preserve content from the platform. The Wayback Machine, a project run by the nonprofit Internet Archive, will now be able to crawl only Reddit's homepage and will no longer have access to comments, subreddit pages, post details, profiles, or other data. The move is part of Reddit's effort to stop AI companies from using its data to train large language models without paying licensing fees. Reddit has struck multimillion-dollar deals with companies such as OpenAI and Google that allow them to train their AI models on Reddit posts. At the same time, it has taken a hard line against companies that use its data without such an arrangement, including suing Anthropic over alleged data scraping. The restrictions raise concerns about the long-term preservation of online information.
Note: This is an AI-generated summary of the original article. For the full story, please visit the source link below.