Generative artificial intelligence systems present information with a powerful and often convincing air of certainty, yet that confidence can mask outright fabrication, a phenomenon popularly known as “hallucination.” This tendency for AI to confidently invent facts when it lacks sufficient information is not merely a quirky bug but a fundamental obstacle preventing its reliable integration into critical decision-making processes across industries. The prevalent belief is that the solution lies in developing more advanced, knowledgeable models, but this overlooks a more immediate and powerful approach. The key to fostering trustworthy AI does not rest solely on the shoulders of its creators but also on the users who interact with it. A profound shift in communication strategy is necessary, one that moves beyond simply asking for an output and instead begins to meticulously define the rules of engagement, thereby teaching the machine the crucial art of admitting ignorance.
The Root of Unearned Confidence
The confident yet incorrect outputs generated by AI are not random system errors but are, in fact, a predictable consequence of their core design. These models are engineered to provide fluent, comprehensive, and stylistically coherent responses by predicting the next most likely word in a sequence. When faced with an ambiguous prompt or a gap in its training data, an AI’s primary directive is to maintain that fluency and complete the task, even if it requires fabricating information to fill the voids. This creates an inherent tension between the model’s default tendency for fluency and the more desirable, but less natural, behavior of restraint. The AI is not malfunctioning when it hallucinates; it is performing as intended under instructions that are too broad, forcing it to make inferences to satisfy the request. Just as asking for a “cookie recipe” without specifying dietary restrictions or available ingredients will yield a generic and possibly unusable result, giving an AI an open-ended task without clear constraints invites it to make assumptions that can lead to significant errors.
The real-world consequences of this default to fluency over factual restraint are becoming increasingly apparent and costly. In a notable case, the professional services firm Deloitte was required to repay a substantial sum after an AI-assisted report prepared for the Australian government was found to contain fabricated citations and a misattributed court quote. This incident served as a powerful illustration that the primary risk is not the AI itself but the unmanaged application of its default behaviors in high-stakes environments. The lesson is not that the technology should be abandoned, but that it must be actively constrained. The critical challenge for users and developers is to implement frameworks that compel the AI to prioritize accuracy and transparency. This requires teaching the system to recognize the limits of its knowledge and to understand that pausing, qualifying a statement, or declining to answer altogether is a more successful outcome than generating a polished but false response. By establishing these boundaries, the operational priorities of the AI can be realigned from simply providing an answer to providing a trustworthy one.
A Proactive Approach to AI Interaction
Much of the conventional advice for “writing better prompts” offers only superficial improvements and fails to address the underlying cause of AI fabrications. Techniques such as requesting a specific tone, defining a format, or asking the model to adopt a particular persona can certainly refine the style of an output, but they do little to prevent the generation of incorrect information. Instructions like “be accurate,” “cite your sources,” or “use only verified information” are fundamentally ambiguous from the model’s perspective. These commands sound sensible to a human user but leave critical details, such as what qualifies as a “verified” source or how to proceed when encountering conflicting data, entirely up to the AI’s interpretation. This approach of prompting with desired outcomes rather than with explicit operational rules is an open invitation for the model to fall back on its default programming, where completing the task smoothly and quickly often takes precedence over the user’s unstated demand for factual integrity. This effectively outsources critical judgment to the machine, creating a system that relies on hope instead of clear directives.

The most effective strategy for mitigating this risk involves a fundamental shift from describing a desired product to defining a reliable process through a method known as rubric-based prompting. Drawing an analogy from the scoring guides used in education, an AI rubric’s purpose is not to grade a finished response but to shape the decision-making process during generation. It transforms the AI’s operation from one of inference to one of explicit instruction. A well-designed rubric establishes a clear hierarchy of goals, ensuring that factual accuracy can take precedence over narrative completeness or speed when necessary. It formally clarifies what is required, what is optional, and what is unacceptable in a response, effectively building a system of checks and balances that governs the AI’s behavior. Most critically, it provides the model with a clear protocol for what to do when it cannot meet the required criteria. This gives the AI permission to stop, return a partial response, acknowledge missing information, or defer to the user rather than defaulting to its ingrained habit of guessing.
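To make this concrete, the minimal sketch below shows one way a rubric might be attached to a task in a single query. It is an illustration only: the function name, section labels, and example rules are assumptions for demonstration, not a required format for any particular model or API.

```python
def build_rubric_prompt(task: str, rubric_rules: list[str]) -> str:
    """Combine a task description with explicit rubric rules in a single query."""
    rules = "\n".join(f"- {rule}" for rule in rubric_rules)
    return (
        "TASK:\n"
        f"{task}\n\n"
        "RUBRIC (rules of performance; these take precedence over completeness):\n"
        f"{rules}"
    )


# Hypothetical example: a generic task with rules that make uncertainty an acceptable outcome.
prompt = build_rubric_prompt(
    task="Summarize the attached quarterly report for an executive audience.",
    rubric_rules=[
        "Use only figures that appear in the supplied document.",
        "If a figure is missing or ambiguous, say so instead of estimating.",
        "A partial summary is preferred to filling gaps with guesses.",
    ],
)
print(prompt)
```

The separation matters more than the labels: the model receives the rules of performance alongside the task rather than having to infer them from it.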
The Practical Application of Rubrics
An effective AI rubric does not need to be an exhaustive, overly engineered document; its power lies in a concise set of enforceable criteria that directly address the primary risks of hallucination. The essential components begin with clear accuracy requirements, which specify what information must be supported by evidence, what qualifies as valid evidence, and whether approximations are permissible. This is complemented by explicit source expectations, guiding the model on whether sources are required, if they must come from a supplied corpus of materials, and how to handle contradictory information from different sources. Additionally, a rubric should include constraints on the model’s tone to prevent it from presenting speculative or conditional answers with unearned certainty. However, the single most important component is the definition of failure behavior. By explicitly instructing the model to stop, qualify its answers, or state what is unknown, we directly teach it the value of admitting ignorance.
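As a rough illustration of how these components might be organized before being written into a prompt, the following sketch groups them into a small structure. The class name, field names, and rendered format are illustrative assumptions, not a prescribed schema.

```python
from dataclasses import dataclass


@dataclass
class Rubric:
    """Hypothetical container for the rubric components described above."""
    accuracy_requirements: list[str]  # what must be evidence-backed and what counts as evidence
    source_expectations: list[str]    # whether sources are required and how to handle conflicts
    tone_constraints: list[str]       # e.g., no unearned certainty on speculative answers
    failure_behavior: list[str]       # what to do when the criteria cannot be met

    def render(self) -> str:
        """Write the components out as plain-text rules to embed alongside a task."""
        sections = [
            ("Accuracy requirements", self.accuracy_requirements),
            ("Source expectations", self.source_expectations),
            ("Tone constraints", self.tone_constraints),
            ("Failure behavior", self.failure_behavior),
        ]
        lines = []
        for title, rules in sections:
            lines.append(f"{title}:")
            lines.extend(f"- {rule}" for rule in rules)
        return "\n".join(lines)
```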
In practice, rubrics do not replace prompts; they augment them in a symbiotic relationship where the prompt defines the task and the rubric defines the rules of performance. The rubric is typically embedded within the same query, clearly separated from the task description, to create a self-contained instruction set that minimizes ambiguity. For instance, a vague prompt such as, “Evaluate why [competitor] is outranking us for [specific topic] and recommend changes,” is an open invitation for the AI to hallucinate about traffic data and search engine features it cannot actually access. A revised, rubric-based approach would separate the instructions. The task would remain: “Analyze why [competitor] may be outperforming our site for [topic]. Provide insights and recommendations.” However, it would be followed by a clear rubric: “Do not claim rankings, traffic, or SERP features unless explicitly provided in the prompt. If required data is missing, state what cannot be determined and list the inputs needed. Frame recommendations as conditional when evidence is incomplete. Avoid definitive language without supporting data. If analysis cannot be completed reliably, return a partial response rather than guessing.” This structure fundamentally transforms the model’s approach by treating uncertainty as a hard constraint, compelling it to operate strictly within the bounds of known information.
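Assembled as plain text, the revised query from this example might look like the standalone sketch below. The “TASK” and “RUBRIC” labels are an illustrative convention, and the bracketed placeholders would be filled in with real details before the query is sent.

```python
# Task and rubric are kept separate, then joined into one self-contained query.
task = (
    "Analyze why [competitor] may be outperforming our site for [topic]. "
    "Provide insights and recommendations."
)

rubric_rules = [
    "Do not claim rankings, traffic, or SERP features unless explicitly provided in the prompt.",
    "If required data is missing, state what cannot be determined and list the inputs needed.",
    "Frame recommendations as conditional when evidence is incomplete.",
    "Avoid definitive language without supporting data.",
    "If analysis cannot be completed reliably, return a partial response rather than guessing.",
]

prompt = "TASK:\n" + task + "\n\nRUBRIC:\n" + "\n".join(f"- {rule}" for rule in rubric_rules)
print(prompt)
```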
Forging a New Path Toward Reliability
The journey toward creating a more dependable artificial intelligence is not about waiting for a hypothetical, flawless future model but about fundamentally rethinking the nature of the human-machine interaction. Achieving trustworthy outputs requires a strategic pivot from merely describing a desired product to meticulously defining a reliable process. Rubric-based frameworks are a superior tool for this purpose because they directly address the root cause of hallucinations: the model’s compulsion to infer when faced with ambiguity. By making rules explicit, especially those governing uncertainty and failure, users make an AI’s decision-making process more transparent and its outputs more dependable. For many workflows, these rubrics can be standardized and reused across similar tasks, systematically reducing error rates and building a foundation of trust over time. Ultimately, professional interaction with AI becomes an exercise in anticipating where a model will be forced to guess and proactively constraining its choices. By instructing models to slow down, qualify their statements, or stop entirely when information is missing, users can leverage the immense power of AI while ensuring the results are accurate and truly valuable.
