arXiv Bans AI Content for One Year if Not Checked

Starting today, May 18, 2026, arXiv will ban researchers for one year if they submit AI-generated content without checking it. This is a new rule to keep scientific papers accurate.

As of May 18, 2026, the preprint repository arXiv has implemented a formal policy to penalize researchers who submit work containing unverified machine-generated output. Under the new protocol, authors found to have published papers with "incontrovertible evidence" of negligent Large Language Model (LLM) usage face a mandatory one-year suspension from the platform.

Core mandates for compliance include:

  • Total Accountability: Authors retain sole liability for all submitted text, regardless of whether a human or machine generated it.

  • Verification Threshold: Papers displaying LLM-specific errors—such as phantom references, plagiarized fragments, or incoherent fabrications—trigger immediate review.

  • Procedural Checks: A penalty requires identification by site moderators followed by secondary validation from a section chair.

  • Appeals Process: The platform permits authors to contest flagging decisions, acknowledging the potential for human error in the enforcement process.

The Mechanism of Enforcement

The policy is not a blanket ban on automation, but a targeted reaction to the degradation of repository standards. Thomas Dietterich, chair of arXiv’s computer science section, stated that the existence of unverified LLM output fundamentally compromises the trustworthiness of a submission. The site effectively functions as a primary artery for scientific dissemination, and moderators are increasingly tasked with distinguishing between tool-assisted writing and automated carelessness.

FeaturePre-May 2026 PolicyNew Policy Enforcement
AccountabilityVagueStrict/Personal
PenaltyAd-hoc removal12-month ban
EvidenceSelf-policingVerified by section chair

Contextualizing the Archive

For decades, arXiv has served as an open-access gateway for pre-publication research, particularly in mathematics, physics, and computer science. Unlike peer-reviewed journals, the site relies on a balance of community trust and administrative curation to manage high volumes of submissions.

Read More: Bun's Rust Code Change: What Developers Need to Know

The rise of generative software has flooded academic spaces with content that mimics formal research patterns without grounding them in empirical rigor. By imposing a "one-strike" rule, the repository signals that the cost of computational negligence has shifted from academic embarrassment to a tangible loss of access to the scientific public square. As of today, the mandate focuses on the integrity of the input rather than the nature of the tool, placing the burden of fact-checking squarely on the researcher.

Frequently Asked Questions

Q: What new rule did arXiv start on May 18, 2026?
arXiv started a new rule on May 18, 2026. Researchers who use AI-generated text in their papers must check it carefully. If they don't, they can be banned for one year.
Q: Why did arXiv make this new rule about AI content?
arXiv made this rule because unchecked AI content can have errors like fake references or copied text. This makes scientific papers less trustworthy. The rule aims to keep the information shared on arXiv accurate and reliable.
Q: What happens if a researcher is found to have used unchecked AI content?
If a researcher is found to have used AI content without checking it, they will be banned from submitting papers to arXiv for one full year. This penalty is given after moderators identify the issue and it is confirmed.
Q: Does this ban apply to all AI use in research papers?
No, this ban is not for all AI use. It is for using AI-generated content without checking it for errors or false information. Authors are still responsible for all text in their papers, whether written by them or an AI.