LLMs Don't Hallucinate – They Drift

Key Facts

  • A new framework challenges the common use of the term "hallucination" for AI errors, proposing "semantic drift" as a more accurate descriptor.
  • The framework introduces a method for measuring "fidelity decay," which quantifies how a model's output deviates from expected meaning over time.
  • This conceptual shift provides a structured diagnostic tool for analyzing and addressing reliability issues in large language models.
  • The approach reframes AI errors as predictable outcomes of complex processing rather than random, inexplicable failures.
  • The framework has been detailed in a recent conference contribution, signaling a move toward more rigorous evaluation metrics in AI research.

Quick Summary

The terminology surrounding artificial intelligence errors is undergoing a significant transformation. A new framework challenges the widely used term "hallucination" when describing large language model (LLM) failures, proposing a more precise alternative: semantic drift.

This conceptual shift is detailed in a recent conference contribution that introduces a method for measuring fidelity decay within AI systems. The framework provides a structured way to diagnose how and why model outputs deviate from expected or factual information, moving beyond anecdotal descriptions toward quantifiable metrics.

Redefining AI Errors

The term "hallucination" has become a catch-all for when AI models generate incorrect or nonsensical information. However, this metaphor is criticized for being imprecise and anthropomorphic. The new framework argues that what is often called a hallucination is better understood as a form of semantic drift—a gradual or sudden departure from intended meaning or factual grounding.

This reframing is more than a change of label; it has practical implications for diagnosis and improvement. By viewing errors as drift, developers can trace the degradation of information through the model's processing pipeline. The framework provides a method to measure this decay, offering a clearer lens through which to analyze model behavior.

  • Shifts from vague "hallucination" to measurable "semantic drift"
  • Introduces "fidelity decay" as a quantifiable metric
  • Provides a diagnostic framework for model errors

Measuring Fidelity Decay

At the core of the new framework is the concept of fidelity decay. This metric allows researchers to quantify how much a model's output drifts from a source of truth or a given prompt over time or through successive processing steps. It transforms a subjective observation into an objective measurement.

The framework establishes a systematic approach to tracking this decay. Instead of labeling an output as simply "wrong," analysts can now measure the degree of deviation. This enables more nuanced comparisons between different models, prompts, or architectural changes, focusing on the stability of semantic meaning rather than just factual accuracy.
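
The source does not spell out how fidelity decay is actually computed, but the general idea can be sketched. The minimal Python example below treats fidelity as similarity between a reference text and each successive output in a processing chain, and decay as the drop in that similarity from one step to the next. The token-overlap similarity, the function names, and the example strings are all stand-ins chosen for illustration, not the framework's actual method.

    # Rough illustration only: token overlap stands in for a real
    # semantic-similarity measure (e.g. embedding cosine similarity).
    def similarity(a: str, b: str) -> float:
        ta, tb = set(a.lower().split()), set(b.lower().split())
        return len(ta & tb) / len(ta | tb) if (ta or tb) else 1.0

    def fidelity_decay(reference: str, outputs: list[str]) -> list[float]:
        # Fidelity of each successive output relative to the reference,
        # and the drop ("decay") from one step to the next.
        fidelities = [similarity(reference, out) for out in outputs]
        return [round(max(0.0, prev - cur), 3)
                for prev, cur in zip([1.0] + fidelities, fidelities)]

    reference = "The Eiffel Tower was completed in 1889 in Paris."
    steps = [
        "The Eiffel Tower was finished in 1889 in Paris.",   # light paraphrase
        "A tower in Paris was finished in the late 1800s.",  # details fading
        "A famous structure was built in Europe long ago.",  # heavy drift
    ]
    print(fidelity_decay(reference, steps))

Under this toy measure, each step loses a little more overlap with the reference, so the decay values grow as the output drifts further from the original fact.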

Implications for AI Development

Adopting the language of semantic drift and fidelity decay could reshape AI development and evaluation. It moves the conversation from blaming a model for "making things up" to understanding the systemic factors that cause information to degrade. This perspective encourages a more engineering-focused approach to reliability.

For developers, this means new tools for debugging and improving model performance. For users, it offers a more transparent understanding of AI limitations. The framework suggests that errors are not random failures but predictable outcomes of complex processing, which can be measured, monitored, and potentially mitigated through targeted interventions.

  • Enables precise tracking of information degradation
  • Facilitates comparison between different model architectures
  • Shifts focus to systemic causes of errors
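
To make the comparison idea in the list above concrete, the hedged sketch below scores two hypothetical sets of model outputs against the same reference and reports which configuration drifts less. The similarity function is the same crude token-overlap stand-in as in the earlier sketch, and the model names and outputs are invented for the example; a real evaluation would use a proper semantic-similarity measure.

    # Hypothetical comparison of two model configurations by mean drift
    # (0 = outputs match the reference, 1 = completely unrelated).
    def similarity(a: str, b: str) -> float:
        ta, tb = set(a.lower().split()), set(b.lower().split())
        return len(ta & tb) / len(ta | tb) if (ta or tb) else 1.0

    def mean_drift(reference: str, outputs: list[str]) -> float:
        return sum(1.0 - similarity(reference, o) for o in outputs) / len(outputs)

    reference = "Water boils at 100 degrees Celsius at sea level."
    candidates = {
        "model_a": ["Water boils at 100 degrees Celsius at sea level.",
                    "At sea level, water boils at 100 degrees Celsius."],
        "model_b": ["Water boils at around 90 degrees.",
                    "Liquids boil when heated enough."],
    }
    for name, outputs in candidates.items():
        print(name, round(mean_drift(reference, outputs), 3))

The lower a configuration's mean drift, the more stably it preserves the meaning of the reference, which is the kind of head-to-head comparison the framework is meant to enable.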

A New Diagnostic Lens

The proposed framework serves as a diagnostic tool for the AI community. By categorizing and measuring different types of drift, it helps identify specific failure modes within large language models. This structured analysis is crucial as these models become more integrated into critical applications where reliability is paramount.

The discussion around this framework has already begun within technical communities, highlighting a growing demand for more rigorous methods to assess AI performance. As the field matures, the ability to precisely measure and describe model behavior will be essential for building more trustworthy and effective AI systems.

Looking Ahead

The move from "hallucination" to "semantic drift" represents a maturation in the discourse surrounding artificial intelligence. It reflects a deeper understanding of how these complex systems operate and fail. This framework provides the vocabulary and methodology needed for more productive conversations about AI safety and reliability.

As research continues to build on this foundation, the concepts of fidelity decay and semantic drift will likely become standard in the evaluation of large language models. This evolution in terminology is a critical step toward developing AI that is not only more powerful but also more predictable and transparent in its operation.
