KM and AI: An Overview of Opportunities and Challenges – Live expert session with Prof. Dr. Andreas Brandner →
Knowledge Management in the Age of AI: Keeping Humans at the Centre – Expert workshop with Stephanie Barnes →

Trust, Not Retrieval: Why AI Assistants Still Fail at Real Knowledge Work

Enterprise AI search has gotten fast. Ask a modern AI assistant a question and it will return an answer in seconds, synthesized, confident, and formatted like it came from someone who knew exactly what they were talking about. The retrieval problem, in the narrow sense of pulling matching content quickly, is largely solved. And yet knowledge workers still don’t fully trust what these systems tell them, and the gap between “it answered fast” and “I believe this answer” turns out to be the actual bottleneck, not the technology underneath it.

The numbers say trust, not speed, is still the open problem

Gartner’s 2024 Digital Worker Survey, cited in its own Market Guide for Enterprise AI Search, found that 34 percent of employees have difficulty finding information at work. What’s more telling is what happens once AI enters the picture: among the employees who already use tools like Microsoft 365 Copilot or Google Gemini specifically to find information, 36 percent still struggle to access what they need, even with AI assistance switched on. Speed didn’t fix the underlying problem. It just made the wrong or incomplete answer arrive faster.

The trust gap shows up even more starkly in a separate Gartner survey on AI maturity: among organizations with high AI maturity, 57 percent said business units trust and are ready to use new AI solutions. In low-maturity organizations, that figure drops to just 14 percent. Gartner’s own analysts put it plainly: trust is one of the differentiators between an AI initiative that succeeds and one that quietly fails, because adoption depends on it, and value depends on adoption.

Why “it sounds right” isn’t the same as “it is right”

Part of the problem is structural, not a matter of user skepticism to be trained away. Research tracking model confidence found that AI systems tend to use more confident, assertive language precisely when they’re generating incorrect information, not less. There’s no reliable tell in the tone of an answer that separates a well-grounded response from a fabricated one, which means the instinct most people rely on to judge whether to trust something (does this sound like someone who knows what they’re talking about) actively fails them here.

The business consequences of that gap are already measurable. Deloitte’s Global AI Survey found that 47 percent of enterprise AI users had made at least one major business decision based on content that was later found to be hallucinated. Separately, a 2025 EY Responsible AI Pulse Survey of nearly a thousand C-suite leaders found that 99 percent of organizations had experienced some form of AI-related financial loss, with a majority above a million dollars. These aren’t edge cases from careless deployments. They’re the expected result of treating a fluent answer as a verified one.

The real bottleneck isn’t the model, it’s what the model is reading

This is the point most conversations about AI accuracy miss, and it’s the one that matters most for anyone in knowledge management specifically: Gartner’s Market Guide is direct about where the failure actually originates. Effective information governance underpins effective enterprise search, and unmanaged or what Gartner terms “ROT” content, meaning redundant, obsolete, or trivial information, actively degrades AI search performance. The report states clearly that current RAG-based AI assistants and agents often underperform at scale specifically because of poor data source quality and weak retrieval relevance, not because the underlying model is incapable.

In other words, an AI assistant pointed at a fragmented, duplicated, outdated knowledge base will confidently retrieve fragmented, duplicated, outdated information, and present it with the same fluent authority it would use for something accurate. Gartner’s own framing for what AI search actually needs to work well is content that is Accurate, Pertinent, and Trusted, a standard most enterprise knowledge repositories were never built or maintained to meet. The AI isn’t the trust problem. It’s a very fast, very articulate mirror of whatever trust problem already existed in the knowledge base underneath it.

What actually closes the trust gap

The instinct in a lot of organizations is to solve this with a better model, a newer AI search vendor, or more prompt tuning. The evidence points somewhere less exciting and more foundational:

  • Knowledge governance has to happen before AI search deployment, not after. An AI assistant layered on top of an ungoverned repository inherits every duplicate, every outdated page, and every unclear source of truth already present, and then answers questions from all of it with equal confidence.
  • Provenance matters more than fluency. Systems and workflows that let a user trace an AI-generated answer back to its specific source build trust faster than systems that simply assert an answer, because they let the verification instinct that fluent language otherwise disables actually function.
  • Verification needs to be designed in, not left to individual judgment. Since confident tone doesn’t reliably signal accuracy, organizations that build a habit of spot-checking AI-sourced answers against original documents, particularly for anything decision-relevant, catch errors that a purely tone-based read would miss.
  • Trust has to be earned incrementally, and measured. High AI-maturity organizations didn’t reach 57 percent business-unit trust by accident. It tracks with sustained, deliberate AI programs rather than a single tool rollout, which suggests trust is a byproduct of consistent governance and communication over time, not a feature that ships with the software.

The takeaway for KM leaders

Every AI knowledge assistant is only as trustworthy as the knowledge management practice sitting beneath it. Organizations that treat AI search as a plug-in productivity fix, without first addressing duplicate content, unclear ownership, and stale documentation, end up with a faster way to surface the same unreliable information, delivered with more confidence than it deserves. The uncomfortable but useful reframe is that improving AI assistant trust is not primarily an AI problem. It’s a knowledge management problem that AI has simply made impossible to ignore.