ArXiv, the preprint server that has served as the de facto first stop for computer science and mathematics research for more than three decades, will start banning authors for a year if their submissions show clear signs of unreviewed AI-generated content. The policy, posted late Thursday by Thomas Dietterich, chair of arXiv's computer science section editorial committee, is the platform's most aggressive move yet against a rising tide of chatbot-written manuscripts that moderators have struggled to triage.
What the policy actually punishes
The new rule does not ban AI-assisted writing. Instead, it targets papers carrying what Dietterich described as "incontrovertible evidence that the authors did not check the results of LLM generation." He gave concrete examples: hallucinated references — citations to papers that do not exist — and meta-comments from the model itself, such as "here is a 200 word summary; would you like me to make any changes?" or "the data in this table is illustrative, fill it in with the real numbers from your experiments."
Penalties are steep. A first offense triggers a one-year submission ban for every author listed on the paper. After the ban lifts, those authors must clear an additional hurdle: their next arXiv submission has to be accepted by a "reputable peer-reviewed venue" before the preprint server will take it. The decision is appealable, but a section chair must personally confirm the penalty before it takes effect. Dietterich emphasized that the rule will only apply where evidence is unambiguous, framing it as a one-strike policy with a deliberately high bar.
A platform under pressure
The crackdown extends a pattern set last year, when arXiv paused acceptance of computer science review and position papers after moderators were swamped by AI-written manuscripts dressed up as rigorous surveys. The platform's quality-control framework — endorsements from established authors, light moderator review — was never designed for the volume or the kind of fluent, plausible-looking output that modern LLMs can produce in seconds.
The timing matters. ArXiv is in the middle of spinning out as an independent nonprofit on July 1 after more than two decades hosted by Cornell University, and its leadership has framed the transition partly around the need to handle AI-era moderation at scale. Recent studies have documented a sharp rise in fabricated citations across academic literature, a trend researchers have attributed in large part to authors pasting unverified LLM output into manuscripts.
Implications for researchers
For the working scientist, the practical message is narrow but firm: use AI tools if you want, but read the output. The policy effectively treats unreviewed LLM artifacts the way arXiv has long treated plagiarism — as a research-integrity violation that gets the author, not the model, sanctioned.
Whether the line holds in practice is another question. Detecting AI involvement remains difficult unless the model has been sloppy enough to leave behind a literal giveaway, and Dietterich's framing sets a deliberately high evidentiary bar. The deterrent is the ban itself, and the implicit promise that arXiv now treats AI-generated slop as something worth chasing.



