dopetalk does not endorse any advertised product nor does it accept any liability for it's use or misuse


Our Discord Notification Server invitation link is https://discord.gg/jB2qmRrxyD

Author Topic: Hallucinations and Failure Modes  (Read 9 times)

Offline Chip (OP)

  • Server Admin
  • Hero Member
  • *****
  • Administrator
  • *****
  • Join Date: Dec 2014
  • Location: Australia
  • Posts: 7149
  • Reputation Power: 0
  • Chip has hidden their reputation power
  • Gender: Male
  • Last Login:Yesterday at 11:27:06 PM
  • Deeply Confused Learner
  • Profession: IT Engineer now retired
Hallucinations and Failure Modes
« on: Yesterday at 09:57:33 PM »
AI will hallucinate if it doesn't have the right training data or if you ask it something it does not know !



Hallucinations and Failure Modes

A “hallucination” in an LLM is not a bug in the human sense. 
It is the natural result of a system that generates the most statistically likely continuation of text without an internal truth-checking mechanism.

---

1. Probabilistic generation

LLMs do not retrieve facts.

They predict the next token based on probability:

Code: [Select]
P(next token | context)

At each step, the model selects from likely continuations.

This means:

  • It is sampling from learned distributions
  • It does not “know” truth
  • It optimizes for plausibility, not correctness

---

2. Why confidence ≠ correctness

A model can sound extremely confident while being wrong.

Reason:

  • Language fluency is learned separately from factual grounding
  • High-probability text often resembles correct explanations
  • Training rewards coherence, not truth verification

So:

Code: [Select]
Fluent answer ≠ accurate answer

Confidence is just statistical smoothness in output.

---

3. Distribution gaps

The model is trained on a finite dataset distribution.

When it encounters something outside that distribution:

Code: [Select]
Unknown / rare / ambiguous input

It still must produce an output.

So it:

  • Interpolates from similar patterns
  • Blends unrelated concepts
  • Generates plausible but incorrect content

This is a major source of hallucinations.

---

4. Mode collapse (generation biasing)

Mode collapse happens when the model over-favours certain “safe” or common patterns.

Effects:

  • Repetitive answers
  • Generic explanations
  • Loss of diversity in responses

In extreme cases:

Code: [Select]
Many different inputs → same type of answer

It reduces variability but can harm accuracy.

---

5. Confabulation

Confabulation is when the model invents details to maintain coherence.

Example behaviour:

  • Fills missing facts with plausible ones
  • Creates citations, names, or numbers that look real
  • Maintains narrative consistency over truth

It is essentially:

Code: [Select]
"Make something that fits the pattern"

rather than:

Code: [Select]
"Verify what is true"

---

6. Context poisoning

Context poisoning occurs when incorrect or misleading information enters the context window.

Once inside:

Code: [Select]
Model treats it as ground truth

Causes include:

  • User-provided false data
  • Incorrect retrieved documents (RAG failures)
  • Earlier hallucinated output reused as context

Since the model has no built-in truth filter, it can fully propagate errors.

---

7. Prompt injection

Prompt injection is a security failure mode in instruction-following models.

It happens when malicious or unintended instructions are embedded inside input data.

Example:

  • Hidden instructions in retrieved documents
  • User tries to override system rules
  • Data contains “ignore previous instructions” style payloads

If not properly isolated, the model may:

Code: [Select]
Follow injected instructions instead of intended system behaviour

This is especially dangerous in RAG systems.

---

Key Insight

Hallucinations are not random mistakes.

They are structural outcomes of the system:

Code: [Select]
No grounded truth system + probabilistic language generation = plausible fiction

The model is optimising for:

  • Coherence
  • Fluency
  • Statistical likelihood

Not for:

  • Truth
  • Verification
  • External consistency checking

That gap is where all hallucinations and failure modes arise.
friendly
0
funny
0
informative
0
agree
0
disagree
0
like
0
dislike
0
No reactions
No reactions
No reactions
No reactions
No reactions
No reactions
No reactions
Our Discord Server invitation link is https://discord.gg/jB2qmRrxyD

Tags:
 

Related Topics

  Subject / Started by Replies Last post
29 Replies
57516 Views
Last post February 17, 2016, 01:53:55 PM
by shoybs
1 Replies
590 Views
Last post July 25, 2017, 09:39:14 PM
by gnossos
0 Replies
25747 Views
Last post May 09, 2019, 03:56:33 PM
by Chip
0 Replies
20130 Views
Last post June 01, 2019, 07:44:16 AM
by Chip
0 Replies
21783 Views
Last post July 06, 2019, 12:19:10 PM
by Chip
0 Replies
23022 Views
Last post July 07, 2019, 11:26:33 AM
by Chip
0 Replies
24154 Views
Last post October 22, 2019, 04:05:58 AM
by Chip
0 Replies
21007 Views
Last post October 01, 2021, 12:46:19 PM
by Chip
0 Replies
18745 Views
Last post February 08, 2025, 09:20:52 AM
by smfadmin
0 Replies
15119 Views
Last post September 10, 2025, 02:35:59 AM
by smfadmin


dopetalk does not endorse any advertised product nor does it accept any liability for it's use or misuse





TERMS AND CONDITIONS

In no event will d&u or any person involved in creating, producing, or distributing site information be liable for any direct, indirect, incidental, punitive, special or consequential damages arising out of the use of or inability to use d&u. You agree to indemnify and hold harmless d&u, its domain founders, sponsors, maintainers, server administrators, volunteers and contributors from and against all liability, claims, damages, costs and expenses, including legal fees, that arise directly or indirectly from the use of any part of the d&u site.


TO USE THIS WEBSITE YOU MUST AGREE TO THE TERMS AND CONDITIONS ABOVE


Founded December 2014
SimplePortal 2.3.6 © 2008-2014, SimplePortal