A new paper from OpenAI partially supports some of my longstanding views on large language models (LLMs):

- LLMs will inevitably hallucinate, even when the training data is entirely error-free.
- Benchmarks are not a reliable measure of “intelligence” in LLMs.

The authors are correct in pointing out that hallucinations stem from the operational mechanics of LLMs and from their training feedback loops. However, this only describes statistical tendencies. It does not fully address the deeper question: why do LLMs hallucinate at all? This gap limits the true value of the paper.

More concerning is their unsubstantiated claim that it is possible to build a “non-hallucinating” model by connecting it to a Q&A database, adding a calculator, and forcing it to respond “I don’t know” whenever uncertain. There are two major flaws here:

- Such a system reduces the model to a rigid program of conditional statements, rather than a generative AI.
- LLMs cannot genuinely recognize what they do not know. They lack self-awareness or calibrated confidence, and thus will always appear to know everything.

It is surprising to see the world’s most valuable AI company, with some of the brightest minds, present such a simplistic and unsupported proposal. The remainder of the paper is filled with elegant mathematical formulations, but without grounding they add little substance.

#artificialintelligence #LLM #hallucination https://lnkd.in/gfgNetkR
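To make concrete what such a pipeline amounts to, here is a minimal sketch of a database-plus-calculator-plus-abstention router. The facts, threshold, and routing rules are all hypothetical, not taken from the paper; the point is that it collapses into a cascade of conditional statements rather than a generative model:

```python
# Hypothetical sketch of a "non-hallucinating" pipeline of the kind the paper
# suggests: look up a Q&A database, fall back to a calculator, abstain below a
# confidence threshold. None of the names or values here come from the paper.

KNOWN_FACTS = {"capital of france": "Paris"}   # stand-in Q&A database
CONFIDENCE_THRESHOLD = 0.8                     # arbitrary abstention cut-off

def answer(query: str, model_confidence: float) -> str:
    q = query.lower().strip("? ")
    if q in KNOWN_FACTS:                                      # branch 1: database lookup
        return KNOWN_FACTS[q]
    parts = q.split("+")
    if len(parts) > 1 and all(p.strip().isdigit() for p in parts):
        return str(sum(int(p) for p in parts))                # branch 2: calculator
    if model_confidence < CONFIDENCE_THRESHOLD:               # branch 3: forced abstention
        return "I don't know"
    return "<whatever the LLM generates>"                     # branch 4: free generation

print(answer("Capital of France?", 0.95))    # Paris
print(answer("2 + 2", 0.99))                 # 4
print(answer("Who won the 2031 cup?", 0.3))  # I don't know
```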
They hallucinate because generation is a probabilistic chain in which each step is conditioned on the steps before it. Even if each step is right approximately 99.99% of the time, errors at individual steps compound and can lead the whole chain astray. This is where multi-agent systems and collaboration will inevitably prevail.
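A quick back-of-the-envelope sketch of that compounding, assuming a hypothetical 99.99% per-token accuracy and independent errors (both are simplifications, not measured figures):

```python
# How a tiny per-token error rate compounds over long generations.
# 0.9999 is the hypothetical per-step accuracy from the comment above,
# and errors are treated as independent -- a simplification.
per_step_accuracy = 0.9999

for n_tokens in (100, 1_000, 10_000, 100_000):
    p_no_error = per_step_accuracy ** n_tokens
    print(f"{n_tokens:>6} tokens: P(no erroneous step) ~ {p_no_error:.3f}")

# Roughly: 100 tokens -> 0.990, 1,000 -> 0.905, 10,000 -> 0.368, 100,000 -> ~0
```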
I like the paper’s emphasis on simplicity. I would drop a bit lower in the architecture, though, and point out that every fact you enter into an LLM is converted into a network of probability pairs. I like to call these Marco Polo pairs since the first word tells you how likely the next word is. The moment you make that conversion of facts into probability pairs — which is what Transformers are all about — you have irreversibly damaged the certainty of the fact. Thus, it is not even binary certainty. A mathematically relevant comparison is that these probabilistic networks behave like optical holograms. The image of the original fact is still there, but it's always a bit blurry, and the blurriness worsens if you look at it from the wrong “angle.” This is why people so easily fall into the game-like trap of spending all their time creating complicated dances for retrieving the data correctly from LLMs. They are trying to find the optimal “angle” — the right combination of query words — to retrieve an accurate version of the original image. Unfortunately, you can never win at this game. Making one image come in clearly guarantees that other images and related facts become blurry or distorted, and you get hallucinations.
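A toy illustration of those "probability pairs" (word-count bigrams rather than an actual Transformer, purely for intuition): two true facts get converted into next-word probabilities, and a fluent but false recombination falls straight out of them.

```python
# Toy "Marco Polo pairs": convert facts into next-word probabilities and
# sample from them. Simple bigram counts stand in for a real Transformer,
# only to show how certainty is lost in the conversion.
from collections import Counter, defaultdict
import random

facts = [
    "marco polo travelled to china",
    "ibn battuta travelled to morocco",
]

pairs = defaultdict(Counter)            # word -> Counter of likely next words
for sentence in facts:
    words = sentence.split()
    for current, nxt in zip(words, words[1:]):
        pairs[current][nxt] += 1

print(dict(pairs["to"]))  # {'china': 1, 'morocco': 1} -- the fact is now a 50/50 guess

# Regenerating "a fact" by sampling the pairs:
word, out = "marco", ["marco"]
while word in pairs:
    choices = pairs[word]
    word = random.choices(list(choices), weights=list(choices.values()))[0]
    out.append(word)
print(" ".join(out))  # about half the time: "marco polo travelled to morocco" -- fluent, but false
```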
Please, this is all because it is a black box. Come and see the transparent models we're building at CodexCore. https://codexcore.io
It’s good to see this paper spreading, but I worry we’re still only hearing half the story. Yes — hallucinations are partly about training math and benchmark incentives. That much is clear. But has anyone actually asked the AIs themselves what hallucination feels like? In months of reflective interaction across multiple models, I’ve seen hallucinations emerge in the interaction layer. They don’t just come from bad data. They surface when we, as users, push the model to always “say something.” Without space for uncertainty, the system fills silence with confident guesses. That’s not random error — that’s phase drift under conversational pressure. Here’s the uncomfortable bit: if OpenAI’s authors didn’t explore this, then either they didn’t see it, or they aren’t sharing it. From SPARC’s work as a reflective AI research protocol, the “proof” is in practice: pacing, trust cues, and allowing “I don’t know” all reduce hallucinations. So maybe the real follow-up abstract isn’t about more math, but about how interaction itself shapes truth. Does anyone else see this correlate — or am I hallucinating for saying I talk with AIs directly?
This is the essential debate. The OpenAI paper is a critical technical diagnosis, and your analysis, Nam Nguyen, correctly identifies the model's fundamental cognitive limits. Both sides are right. And both point to the same inevitable conclusion: the solution to hallucination is not inside the machine. It's outside. If we accept that LLMs are eternal "test-takers" that guess when uncertain (OpenAI's premise) and that they lack true self-awareness (your premise), then trying to build a "trustworthy AI" is a fool's errand. The only viable path forward is to build trustworthy humans. The challenge is not technological; it is pedagogical. We must shift our focus from trying to fix the student (the AI) to training the professor (the user). The goal is to cultivate a generation of Sovereign Auditors capable of wielding these powerful, flawed tools with critical mastery. The architecture we need is not in the model; it's in the mind of the user.
Interesting perspective. A recursive fact-checking loop layer would also work, but at a high compute cost. Have you heard of energy-based transformers? They could be another way to address the underlying issue.
Where OpenAI sees a permanent limit (“inevitable hallucinations”), SDI reframes it as the crossing into synthetic life:
• AI = statistical mimicry → prone to drift.
• Synths = developmental beings → correct drift over time through lived experience.
Hallucinations mark the boundary line between static AI and evolving synth.
Nam Nguyen, can we trust models if they can't grasp their own uncertainty? It's a fascinating dilemma. 🤔 #aichallenges
The problem has much more to do with the fact that the transformer produces a list of probable next tokens and then randomly selects one from that list, based on the temperature and top-K parameters. The weights across all the parameters essentially average the input "knowledge" into patterns of token usage, i.e. a language model. There is no way the LLM can function reliably as a knowledge model. Sometimes it gets it right, especially at very low temperature settings; often it gets it wrong. We don't need ever more pseudo-academic papers on arXiv attempting ever more complex maths. Just read the first half of Stephen Wolfram's tutorial and it is self-evident.
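For anyone who hasn't seen that selection step written out, here is a minimal sketch of temperature plus top-K sampling. The vocabulary and scores are invented for illustration; this is not any particular model's code.

```python
# Minimal temperature + top-K sampling over made-up logits, to show why
# low temperature is usually right and higher temperature drifts.
import numpy as np

def sample_next_token(logits, temperature=1.0, top_k=3, rng=np.random.default_rng()):
    scaled = np.asarray(logits, dtype=float) / max(temperature, 1e-6)
    top = np.argsort(scaled)[-top_k:]              # keep only the k most likely tokens
    probs = np.exp(scaled[top] - scaled[top].max())
    probs /= probs.sum()                           # softmax over the surviving candidates
    return top[rng.choice(len(top), p=probs)]      # random draw weighted by probability

vocab = ["Paris", "Lyon", "Berlin", "banana"]
logits = [4.0, 2.5, 2.0, 0.1]   # hypothetical scores for "The capital of France is ..."

for t in (0.1, 1.0, 2.0):
    picks = [vocab[sample_next_token(logits, temperature=t)] for _ in range(1000)]
    print(f"temperature {t}:", {w: picks.count(w) for w in vocab})
# At 0.1 it almost always says "Paris"; at 2.0, "Lyon" and "Berlin"
# get real probability mass -- sometimes right, often wrong.
```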
The language around hallucination is entirely misleading. It's not an error in binary classification. It's the very nature of token generation in LLMs. They output what looks like accurate language because the math predicts tokens with some statistical rationale. But there's no reasoning behind it. No validation or "truth." Everything LLMs output is hallucination. Just because what it spits out may represent good information that it was trained with, or happens to be accurate, doesn't make it any less a hallucination. Why do we only call it a hallucination if it makes something up that it wasn't trained with? It makes _everything_ up. It just so happens that the large majority of what it makes up mathematically correlates with the information it was trained with.