Reddit Files Lawsuit Against AI Firm Anthropic for Data Misuse

Reddit has filed a comprehensive lawsuit against AI company Anthropic. The social media platform has accused the firm of systematically scraping platform content without permission to train its Claude chatbot technology.

Reddit Accuses Anthropic of Systematic Data Theft

Reddit’s court filing asserts mass unauthorized use of its service by Anthropic’s bots. The social media company alleges Anthropic is employing web crawlers to scrape users’ content from thousands of subreddits without obtaining proper licensing deals.

The complaint states that Anthropic began scraping Reddit posts as far back as December 2021 and scraping posts and comments from popular communities like explainlikeimfive, WritingPrompts, todayilearned, and AskHistorians.

Despite the fact that Reddit’s robots.txt file clearly prohibits such automated access, Anthropic continued scraping to build training datasets for its Claude AI model.

Complaint filed by Reddit against Anthropic

Reddit argues Anthropic’s ClaudeBot spidered the site over 100,000 times even after the firm publicly declared it blocked Reddit from spidering on its site in July 2024. This, the complaint argues, is a clear violation of Reddit’s User Agreement, which prohibits commercial exploitation of the site’s content without written consent.

The report further states Anthropic’s unwillingness to negotiate the licenses, unlike other rivals such as OpenAI and Google, which entered into official collaborations with Reddit.

This unauthorized data collection has allowed Anthropic to build a multi-billion dollar business without offering Reddit and its community any compensation or privacy protection that is included in legitimate licensing agreements.

Legal Claims Seek Billions in Damages and Injunctive Relief

Reddit’s suit also identifies five independent legal bases of action against Anthropic. They also want monetary damages and court injunctions to prevent further abuse of data.

The complaint filed in the San Francisco Superior Court is demanding specific performance, compensatory damages, lost profits, and disgorgement of Anthropic’s profits due to benefiting from Reddit content.

The breach-of-contract action is based on Anthropic’s violation of Reddit’s User Agreement that prohibits in writing the commercial exploitation of platform content without written consent. Reddit also brings an action for unjust enrichment, alleging Anthropic constructed a billion-dollar business on stolen information without paying a penny to content creators or the site.

Image showing Claude’s admission on Reddit data training

Other claims brought include trespass to chattels on unauthorized server access, tortious interference with user contracts of Reddit, and unfair business competition under California business code. The suit claims Anthropic’s conduct harmed Reddit’s capacity to make privacy promises to users and undermined the site’s model for licensing revenue.

Reddit also requests that Reddit-derived material be banned from use by Anthropic and that Claude technology trained on platform content be removed from commercial use.

The firm also seeks punitive damages, claiming Anthropic’s actions were malicious and willful, conducted in reckless disregard of user privacy rights and contractual obligations. Legal fees and additional damages will be determined by the court.

AI Training Data Ownership Battle Sets Industry Precedent

This legal confrontation highlights growing tensions over intellectual property rights in artificial intelligence development. Reddit’s case against Anthropic shows the overall conflict between social media platforms seeking to monetize user-generated content and AI companies that need vast datasets for model training.

The lawsuit raises fundamental questions about data ownership and fair compensation in the AI industry. While Reddit has established licensing partnerships with major tech companies like OpenAI and Google, Anthropic’s alleged refusal to negotiate formal agreements creates a stark contrast in approaches to content acquisition.

Reddit’s Public Content Policy specifically addresses AI training concerns and requires licensees to honor user deletion requests and implement privacy safeguards through the platform’s Compliance API. The policy prohibits the use of deleted content and restricts access to sensitive material, protections that unauthorized scraping cannot provide.

The case may influence how courts interpret existing copyright and contract law in the context of AI development.

Source: https://www.thecoinrepublic.com/2025/06/05/reddit-files-lawsuit-against-ai-firm-anthropic-for-data-misuse/