> The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue.
This is incredibly good for science. arXiv is free, but it's a privilege not a right!
I'm not seeing this clearly listed on https://info.arxiv.org/help/policies/index.html so it's possible this is planned but not live yet - or perhaps I'm not digging deeply enough?
As a certain doctor once said: the whole point of the doomsday machine is lost if you keep it a secret!
> This is incredibly good for science.
I disagree. It's just one darn hallucinated citation for heaven's sake, not fraud or something. It doesn't account for the substance or quality of their work at all. A one-year ban seems plenty sufficient for a minor first time mistake like this. People make mistakes and a good fraction of them can learn from those mistakes. There's no need to permanently cripple someone's ability to progress their life or contribute to humanity just because an AI hallucinated a reference one time in their life. That's punitive instead of rehabilitative.
> It's just one darn hallucinated citation for heaven's sake, not fraud or something.
It is fraud.
> It doesn't account for the substance or quality of their work at all.
References are part of the work. If you're making up the references, what else are you making up?
> People make mistakes and a good fraction of them can learn from those mistakes. There's no need to permanently cripple someone's ability to progress their life or contribute to humanity just because an AI hallucinated a reference one time in their life.
A one-year ban is not permanent. Having a negative consequence for making poor decisions seems like an inducement to learn from the mistake?
In an ideal world, one would keep notes on the references used while doing the research that led to writing the paper. Choosing not to do that is one poor decision.
Taking a more charitable view: if one asks an AI to suggest references that may have been missed, one should at least verify that the references exist and are relevant. Choosing not to do that is also a poor decision, even if one did keep notes on references while researching.
It's very silly, but not a big deal. Arxiv is becoming irrelevant these days anyway.
In fact it would be better if they just banned AI, so we could just get off the luddite platforms.
Automated research is the future, end of story. And really it couldn't have come at a better time, given the increasingly diminishing returns on human-powered research.
Poe's law striking hard.
> It is fraud.
No, fraud requires intent to deceive.
> A one year ban is not permanent.
...what text are you reading? Nobody was against the one-year ban. I was literally in favor of it in my comment. I explicitly said it is already plenty sufficient. What I said is there's no need to go beyond that. My entire gripe was that they very much are going beyond that with a permanent penalty. Did you completely miss where they said "...followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue"?
If you cannot be bothered to check your references when writing academic quality papers then you have no place writing them in the first place. The punishment is not chopping off a finger, it is a polite reminder to do the bare minimum.
A "mistake" would be a typo in a real citation. A hallucinated citation is evidence of just plain laziness and negligence, which taints the entire submission.
No it is not. Seriously. All you need for this to happen is for your lab partner to ask AI to add a missing citation that they are already familiar with at the last minute before a midnight submission deadline, and for the AI to hallucinate something else, and for them to honestly miss this. It does not even imply any involvement on your part, let alone that either of you were lazy or negligent on the actual research or substance of the paper. The lack of any sympathy or imagination here is astounding.
There are no deadlines for journal submissions. Even if you felt you were running close to your revisions being due, an email to the editor would probably fix this for you. And what you described is still negligent: not verifying that the garbage-outputting bot did not, in fact, output garbage.
The lack of understanding that you are responsible for the content you create, no matter what tools you use, is what's astounding.
A citation is where you derived knowledge... If you haven't checked it and you are submitting something that should represent a ton of labour (and which will consume labour to review), you don't understand what you're doing. It is not just crossing T's and dotting I's.
Your being set behind is less important than the fact that your publishing is setting everyone else behind.
Such a banned person is being helped to "step out of the way", and someone more competent will assuredly step forward to consume the limited maintenance labour more thoughtfully
> Your being set behind is less important than the fact that your publishing is setting everyone else behind
One hallucinated citation does not in any way imply anyone is being set behind. All it means is that nobody checked that particular line of the manuscript after it was written. The rest of the paper could still be solid and should be treated accordingly. If you find evidence to the contrary, by all means act on it, but this is so obviously not that.
You clearly misunderstand. You cite a work in your paper because you have read that work, and build upon it or want to refer to it to back up a specific claim. Generating references is fraud, period, because you are implying that you have read a work when in fact you just asked an AI "please insert some reference-shaped text here" to make it look like a proper paper. It is sadly not a necessary, but certainly a VERY sufficient, reason to conclude a paper is fraudulent.
No. It's fraud.
Yes, it is fraud
It's not the kind of mistake that is possible unless you're engaging in fraud anyway.
> It's not the kind of mistake that is possible unless you're engaging in fraud anyway.
Seriously? You can't fathom an honest researcher asking for AI to find a citation they know exists, and the AI inserting or modifying a citation incorrectly without them realizing?
If you find evidence of fraud by all means lay down the hammer. Using a single hallucinated citation like it's some kind of ironclad proxy just because you think they must be committing fraud is insane.
If you're not checking citations in the paper you're publishing AND trusting a non-SOTA, hallucination-prone AI model to come up with sources for it, it's probably for the best of everyone that the paper isn't published.
Yes, there will be rare exceptions, but in general I feel like this is a really good addition.
Why would you ask the ai to find a citation you know exists? Just reach for that citation.
You are being ironic right?
Don't use AI? Problem solved?
https://xcancel.com/tdietterich/status/2055000956144935055
> Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated (Dietterich, T. G.)
There needs to be careful vetting before such adverse actions. If somebody adds a coauthor's name and submits without that person's express permission, does everyone get the ban? I agree that, implemented the right way, this is good.
Had a colleague submit a paper with literal AI slop left in the text, got hit with a nasty revision request. Check your drafts before you submit, people. The reviewers will find it.
Also check your LaTeX comments; arXiv makes those publicly visible!!!
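One way to scrub those before uploading is to strip `%` comments from the source. A minimal sketch (it deliberately leaves a bare `%` in place so LaTeX's end-of-line behaviour is unchanged, and it does NOT handle verbatim environments, the `comment` package, or `\\%` escaped-backslash edge cases, so review the output before submitting):

```python
import re

def strip_latex_comments(tex: str) -> str:
    """Remove % comments from LaTeX source while keeping \\% literals.

    A trailing % also suppresses the newline in LaTeX, so we replace
    the comment with a bare % rather than deleting it, preserving
    line-ending behaviour. Simplified: ignores verbatim environments.
    """
    cleaned = []
    for line in tex.splitlines():
        # Drop everything from the first unescaped % to end of line,
        # leaving a lone % behind.
        cleaned.append(re.sub(r'(?<!\\)%.*$', '%', line))
    return "\n".join(cleaned)
```

This is a heuristic, not a substitute for reading your own submission tarball the way arXiv will serve it.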
I'm a screen reader user and usually read papers as raw TeX. I've seen everything: slurs, demeaning comments towards reviewers and professors, admissions of fraud, instructions to coauthors to commit further fraud before paper submission to mask the earlier fraud... it's all there. There's far less of it than I would think, definitely <1% of papers, but it's there.
I think it would be useful to run an LLM anti-fraud pass on the TeX source of all new arxiv papers. It wouldn't catch everything, but it would catch some of the dumbest fraudsters.
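Short of a full LLM pass, even a crude keyword scan over extracted comments would flag some of the dumbest cases. A toy sketch, where the red-flag list is entirely invented for illustration (a real screening pass would use a tuned list or a classifier):

```python
import re

# Hypothetical red-flag substrings; purely illustrative.
RED_FLAGS = ("fabricat", "made up", "fake data", "don't tell")

def scan_tex_comments(tex: str) -> list[tuple[int, str]]:
    """Return (line_number, comment_text) pairs whose % comment
    contains a red-flag substring (case-insensitive)."""
    hits = []
    for lineno, line in enumerate(tex.splitlines(), start=1):
        m = re.search(r'(?<!\\)%(.*)$', line)  # first unescaped %
        if m:
            comment = m.group(1).strip()
            if any(flag in comment.lower() for flag in RED_FLAGS):
                hits.append((lineno, comment))
    return hits
```

Anything this flags would of course need human review; it can only surface candidates, not judge them.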
On the positive side, you can also find stronger claims that didn't survive review, additional explanations that didn't make the cut due to the conference's page limit, as well as experimental results that the authors felt weren't really worth including. Those need to be approached with an abundance of caution, but are genuinely useful sometimes.
Sad the suggestion here is to just disguise the slop to make it harder for reviewers to spot rather than not submitting slop to begin with.
Good; academic literature is in crisis because of all of the slop. Forcing some consequences on easily-detectable hallucinations can only be a good thing
It's not just AI, though. I did a doctorate in physics about 40 years back, and bad references were a problem back then.
Doesn't matter if it is AI hallucinations or entirely human scientific fraud, the problem is the same, and the solution works fine for both cases.
If you can't validate that your bibliography is full of real articles, you shouldn't get published.
LLMs have just poured gasoline on the fire.
"Bad", like, you literally just made them up? I hope that would have been a problem.
In what way? Surely something like the source not quite saying what was cited, or mixing up citations, rather than inventing them outright?
That, and mixing reference details from multiple sources and messing it up.
Let's say you read a paper on Arxiv but cite the version that was submitted to a journal or conference, without realizing that the authors made changes to the version they submitted and forgot to upload them to Arxiv.
Imagine how bad they are now then.
Yes and ffs arrows kill people too but we don't bring that up every time we talk about what to do with guns.
It seems a good idea to ban cheating, but how hard is it, especially in new reasoning/agent contexts, to validate references?
The deeper question is whether legitimate AI generated results are allowed or not? Test - In the extreme - think proof of Riemann Hypothesis autonomously generated (end to end) formally proven - is it allowed or not?
This is not about banning cheating, it’s about banning inaccurate information.
It is allowed as long as it’s verified.
The thread specifically points out that if authors can't be arsed to simply proofread their text, the rest cannot be trusted either.
It’s a simple heuristic against low quality submissions, not an anti-ai measure.
If you use AI correctly, nobody should be able to tell that it was used at all.
You don’t need to solve everything, catching a few thousand non existent citations with such a policy is on its own a net benefit.
There already exist multiple tools for automatically verifying references. This measure will likely only filter out the laziest and most incompetent of AI slop submissions. It's a very modest raising of the bar, but it comes at zero cost to honest researchers.
I expect arXiv will still have problems with slop submissions but, at least, their references should actually exist going forward.
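As an illustration of how cheap such a check is, one could extract DOI-shaped strings from a bibliography and look each one up against the public Crossref API, which returns HTTP 200 for known works and 404 otherwise. The regex is a common heuristic rather than an official DOI grammar, and the function names are mine, not from any specific tool:

```python
import re
import urllib.request

# DOI-shaped strings, e.g. 10.1000/182. Heuristic, not a formal grammar.
DOI_RE = re.compile(r'\b10\.\d{4,9}/[^\s"<>},]+')

def extract_dois(bibtex: str) -> list[str]:
    """Pull DOI-shaped strings out of a BibTeX or .bbl blob,
    trimming a trailing sentence-final period."""
    return [d.rstrip('.') for d in DOI_RE.findall(bibtex)]

def doi_resolves(doi: str) -> bool:
    """Ask the public Crossref API whether the DOI names a known work."""
    url = f"https://api.crossref.org/works/{doi}"
    try:
        with urllib.request.urlopen(url, timeout=10) as resp:
            return resp.status == 200
    except Exception:
        return False
```

Note this only confirms a DOI exists, not that the cited work says what the paper claims it says; hallucinated-but-real-looking references to real papers would still slip through.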
It isn't "cheating" they're concerned with, it's sloppiness. This dictum isn't some sort of AI ban; it simply says that if there is evidence the work was so low-effort that it includes such blatant problems, it's just adding noise.
> think proof of Riemann Hypothesis autonomously generated (end to end) formally proven - is it allowed or not?
Sorry to be rude, but this seems like a dumb question. I want science to progress. A primary purpose of these journals is to progress science. A full proof of the Riemann Hypothesis progresses science. I don't care how it was produced, if Hitler is coauthor, etc, I just care that it is correct. Whether the authors should be rewarded for whatever methods they used can be a separate question.
Terence Tao had a nice talk from the Future of Mathematics conference posted yesterday [0] that shapes a lot of my own feelings on this matter.
The short of it is that he argues being first to correctness shouldn't be the only goal and isn't a great optimisation incentive. Presentation and digestibility of correct results is the missing third once you've finished generation and verification. I completely agree with him. You don't just need an AI-generated proof of the Riemann Hypothesis. You would really like it to be intentional and structured for others to understand.
A really beautiful quote I learned of in the talk is this:
> "We are not trying to meet some abstract production quota of definitions, theorems, and proofs. The measure of our success is whether what we do enables people to understand and think more clearly and effectively about math." - William Thurston
[0] https://www.youtube.com/watch?v=Uc2zt198U_U