Logo
FrontierNews.ai

ArXiv Cracks Down on AI-Generated Papers: What Scientists Need to Know

ArXiv.org, one of the world's most popular repositories of free scientific research, is cracking down on AI-generated content and those who fail to verify results before submission. The policy targets papers containing hallucinated references, unchecked language model outputs, and other signs that authors didn't properly review AI-generated text. The stakes are significant: researchers who violate the policy face a one-year ban from the platform, followed by a requirement that any future submissions must first be accepted at a reputable peer-reviewed venue.

Why Is ArXiv Taking This Action Now?

The move comes as AI-generated content has become a serious problem in academic publishing. At the 2026 International Conference on Learning Representations (ICLR), one of the world's most prestigious AI conferences, researchers found that 21% of peer reviews were fully AI-generated, and more than half showed signs of AI use. The situation with submitted papers was less severe but still concerning: approximately 1%, or 199 manuscripts, were fully AI-generated, while 9% contained more than 50% AI-generated text.

Thomas G. Dietterich, chair of the Computer Science Section of ArXiv and a professor at Oregon State University, explained the reasoning behind the strict enforcement. He noted that when authors fail to check language model outputs, the entire paper becomes untrustworthy.

"If a submission contains incontrovertible evidence that the authors did not check the results of LLM generation, this means we can't trust anything in the paper," said Thomas G. Dietterich.

Thomas G. Dietterich, Chair of Computer Science Section, ArXiv

Examples of violations include hallucinated references, citations to papers that don't exist, and what Dietterich called "meta-comments from the LLM," such as sections reading "Here is a 200-word summary; would you like me to make any changes?". These are telltale signs that an author simply copied and pasted AI output without reviewing it.

What Counts as a Violation, and How Will ArXiv Enforce It?

ArXiv's Code of Conduct makes clear that authors bear full responsibility for their papers regardless of how the content was generated. The enforcement process includes safeguards: a moderator must first document the problem, and then the Section Chair must confirm the violation before imposing penalties.

Steinn Sigurdsson, an astrophysics professor at Penn State and scientific director at ArXiv, emphasized that the platform receives far more problematic submissions than the public sees. He noted that some submissions are "really, really egregious" and that the additional consequences are designed to discourage bad actors from repeatedly trying to submit low-quality AI-generated work.

"You don't see the stuff we reject; some of it is really, really egregious. The decision to impose additional consequences is largely to throttle that stuff so n00bs and bad actors don't trash us trying repeatedly," noted Steinn Sigurdsson.

Steinn Sigurdsson, Astrophysics Professor at Penn State and Scientific Director, ArXiv

Dietterich clarified that appeals will be possible if bans are issued, ensuring that researchers have recourse if they believe a ban was imposed in error.

How Should Researchers Use AI Tools Responsibly?

  • Verify All Results: Check every fact, citation, and claim generated by AI tools before submitting. Language models frequently hallucinate references and create plausible-sounding but false information.
  • Review for Meta-Comments: Remove any text that appears to be instructions or commentary from the AI itself, such as "Here is a draft; please review" or similar phrases that indicate unedited output.
  • Cross-Check Citations: Ensure all references actually exist and accurately represent the cited work. Hallucinated citations are a common red flag for ArXiv moderators.
  • Maintain Author Responsibility: Remember that as an author, you are legally and professionally responsible for every word in your paper, regardless of whether AI tools were used to generate it.

The academic community has largely supported ArXiv's approach. Ethan Mollick, a Wharton professor studying AI, tweeted that the policy seems "incredibly reasonable at least in the short term". Ash Jogalekar, a senior program manager of agentic AI for science at Microsoft, agreed, stating that expecting scientists to use AI tools responsibly while verifying results is not just reasonable but the standard for good science.

"Expecting high standards from scientists and telling them that they can use AI tools but need to check and recheck the results before publishing is not just reasonable but is the way good science should always be done," said Ash Jogalekar.

Ash Jogalekar, Senior Program Manager of Agentic AI for Science, Microsoft

Lucas Beyer, a former OpenAI researcher now at Meta, praised the policy as "very good" and called for the restrictions to be "strongly enforced".

Enforcing these measures will be a significant undertaking. ArXiv.org handles an enormous volume of submissions, reaching 2 million total submissions by the end of 2021, with roughly 24,000 articles being submitted monthly as of November 2024. Despite the scale of the challenge, the platform's leadership appears committed to maintaining scientific integrity in the age of AI.