As of May 18, 2026, the preprint repository arXiv has implemented a formal policy to penalize researchers who submit work containing unverified machine-generated output. Under the new protocol, authors found to have published papers with "incontrovertible evidence" of negligent Large Language Model (LLM) usage face a mandatory one-year suspension from the platform.
Core mandates for compliance include:
Total Accountability: Authors retain sole liability for all submitted text, regardless of whether a human or machine generated it.
Verification Threshold: Papers displaying LLM-specific errors—such as phantom references, plagiarized fragments, or incoherent fabrications—trigger immediate review.
Procedural Checks: A penalty requires identification by site moderators followed by secondary validation from a section chair.
Appeals Process: The platform permits authors to contest flagging decisions, acknowledging the potential for human error in the enforcement process.
The Mechanism of Enforcement
The policy is not a blanket ban on automation, but a targeted reaction to the degradation of repository standards. Thomas Dietterich, chair of arXiv’s computer science section, stated that the existence of unverified LLM output fundamentally compromises the trustworthiness of a submission. The site effectively functions as a primary artery for scientific dissemination, and moderators are increasingly tasked with distinguishing between tool-assisted writing and automated carelessness.
| Feature | Pre-May 2026 Policy | New Policy Enforcement |
|---|---|---|
| Accountability | Vague | Strict/Personal |
| Penalty | Ad-hoc removal | 12-month ban |
| Evidence | Self-policing | Verified by section chair |
Contextualizing the Archive
For decades, arXiv has served as an open-access gateway for pre-publication research, particularly in mathematics, physics, and computer science. Unlike peer-reviewed journals, the site relies on a balance of community trust and administrative curation to manage high volumes of submissions.
Read More: Bun's Rust Code Change: What Developers Need to Know
The rise of generative software has flooded academic spaces with content that mimics formal research patterns without grounding them in empirical rigor. By imposing a "one-strike" rule, the repository signals that the cost of computational negligence has shifted from academic embarrassment to a tangible loss of access to the scientific public square. As of today, the mandate focuses on the integrity of the input rather than the nature of the tool, placing the burden of fact-checking squarely on the researcher.