arXiv, the widely utilized preprint repository for scientific literature, has introduced a punitive enforcement policy targeting the submission of unedited or fully automated research papers. Under the new protocol, authors who submit content containing "incontrovertible evidence" of unchecked Large Language Model output face a one-year suspension from the platform.
The primary metric for rejection is a failure of responsibility: if an author cannot verify their own results or includes meta-commentary from an LLM, the submission is deemed untrustworthy.
Enforcement and Due Process
The policy operates on a verification model to minimize false positives, requiring a dual-layer confirmation process:
Documentation: A moderator must first document evidence of unvetted generation—such as fabricated references, non-human meta-text, or hallucinations.
Confirmation: A Section Chair must formally review the findings before the penalty is finalized.
Appeals: Authors retain the right to contest the decision through an established internal appeals process.
Summary of Policy Implications
| Category of Use | Status | Requirement |
|---|---|---|
| Proofreading | Permitted | Maintain clear communication |
| Code Assistance | Permitted | Disclosure and reproducibility |
| Full Generation | Banned | Grounds for 1-year suspension |
| Hallucinated Data | Banned | Breach of academic trust |
Context and Escalation
The shift toward active banning follows a protracted period of administrative strain. In late 2025, arXiv restricted the submission of un-peer-reviewed "survey" and "position" papers in the Computer Science category after the platform experienced a high volume of low-quality, automated content.
Read More: New Earbuds May See and Remember Your World
As Thomas Dietterich, chair of the computer science section, indicated, the repository is not prohibiting the use of artificial intelligence tools; rather, it is enforcing a standard of human accountability. The core issue remains the "evaluation bottleneck"—where the rapid scalability of AI-generated output threatens to overwhelm the volunteer-driven moderation systems that maintain the repository’s integrity. This recent crackdown signals an intensification of the struggle to define the boundary between automated assistance and the fundamental expectations of scientific research.