Interpretive hallucination
This page defines the concept of interpretive hallucination in a machine-first context. It provides an operational definition, typical mechanisms, confusions to avoid, and usage limits of the term.
Scope: canonical definition.
This page constitutes neither legal advice, nor a case analysis, nor an accusation targeting any organization, court, party, firm, or person.
Definition
An interpretive hallucination is the production of a plausible but false statement, generated or reconstructed by a probabilistic system, then presented with a form of certainty. It occurs when a system completes an uncertainty zone through stylistic coherence, analogy, or narrative continuity, rather than through anchoring in a verifiable source.
Characteristics
- High plausibility: the statement “sounds true” and adopts a credible form.
- Simulated certainty: tone and structure mimicking authority replace proof.
- Weak or absent anchoring: unfindable sources, nonexistent citations, unpublished details, unauthorized extrapolations.
- Closure effect: the statement artificially reduces uncertainty and prevents abstention.
Frequent mechanisms
Interpretive hallucinations typically appear when the system must produce a “complete” response despite insufficient, inaccessible, or non-public information. Common triggers include: biographical reconstruction, capability attribution, service inference, price estimation, and opinion attribution.