Two Hospitals Are Tackling AI's Biggest Problem: Proving It Actually Works
A major gap exists between AI tools that look promising in development and those that actually improve patient care in hospitals. UCLA Health has launched a new research center specifically designed to bridge that divide by rigorously testing artificial intelligence in real-world clinical settings, from early usability testing through full-scale implementation studies.
Why Are Hospitals Struggling to Validate AI Tools?
The rapid expansion of AI across clinical settings has outpaced the infrastructure needed to properly evaluate whether these tools are safe, effective, and genuinely useful for patients. Computerized scribes that draft notes during patient visits, diagnostic systems that help interpret X-rays and MRIs, and countless other AI applications are now deployed in hospitals nationwide, yet many lack rigorous real-world validation.
This validation gap represents one of the most important unresolved challenges in healthcare AI. Without systematic evaluation across diverse patient populations and clinical workflows, hospitals cannot confidently determine whether an AI tool will actually improve care or simply add complexity to their operations.
"This new center will help address one of the most important gaps in health care AI, knowing whether these tools are safe, effective and useful in real-world clinical practice," said Johnese Spisso, president of UCLA Health and CEO of the UCLA Hospital System.
Johnese Spisso, President of UCLA Health and CEO of the UCLA Hospital System
What Is UCLA's New AI Validation Center?
The Innovations and Outcomes Validation of AI (INOVAi) Center, launched by UCLA Health in June 2026, represents one of the first programs in the nation focused specifically on evaluating and implementing AI in healthcare. The center operates as a Center of Excellence in AI evaluation and implementation science, working in alignment with UCLA's broader institutional approach to responsible health AI.
Rather than simply testing whether AI tools work in controlled laboratory conditions, INOVAi evaluates AI across the full lifecycle of real-world deployment. This includes early usability and feasibility testing, workflow integration studies, prospective clinical trials, and pragmatic implementation studies that measure outcomes in actual patient care environments.
How Does the Center Evaluate AI Tools?
- Early-Stage Testing: The center begins with usability and feasibility assessments to understand how AI tools interact with existing clinical workflows before broader deployment.
- Clinical Trials: Prospective clinical trials measure whether AI tools produce meaningful improvements in patient outcomes, physician efficiency, or both.
- Real-World Implementation: Pragmatic implementation studies evaluate how AI performs across diverse clinical settings and patient populations, not just in ideal conditions.
- Continuous Refinement: The center uses evaluation findings to refine AI models and implementation strategies, creating a feedback loop that improves tools over time.
This comprehensive approach addresses a critical need in healthcare AI. Many tools are deployed based on promising early results, but without systematic real-world evaluation, hospitals cannot determine whether those results hold up when the AI encounters the messy complexity of actual clinical practice.
What Early Evidence Shows About AI Scribes?
One of INOVAi's first major findings comes from a randomized trial examining AI scribes, which are software systems that automatically draft clinical notes during patient visits. The research, led by Dr. Paul Lukac, UCLA Health's chief AI officer, and Dr. John Mafi, associate professor of medicine, found that AI scribes significantly reduced the time physicians spent writing clinical notes.
Beyond time savings, the study revealed broader benefits for physician well-being. Physicians using AI scribes reported improvements in cognitive load, work exhaustion, and overall well-being. They also reported enhanced patient engagement, noting that they could spend more time connecting with patients rather than typing notes.
"The results of our research on AI's capacity to improve clinical workflow and enhance patient care have been promising thus far. While there still is considerable work to be done, our new program has the potential to further hone the discipline the field needs as we continue to build a shared language for what counts as evidence," said Dr. Paul Lukac, UCLA Health's chief AI officer.
Dr. Paul Lukac, Chief AI Officer at UCLA Health
These findings illustrate why systematic evaluation matters. AI scribes were not simply reducing administrative burden; they were improving physician well-being and patient interaction quality, outcomes that might not have been measured without rigorous research design.
How Does This Fit Into Broader Healthcare AI Trends?
UCLA's initiative arrives as healthcare AI is transitioning from experimental pilots to mainstream deployment. In China, for example, the National Healthcare Security Administration formally classified "AI-assisted diagnosis" as a recognized diagnostic service in December 2025, incorporating it into the official pricing framework for pathological diagnostic services. This regulatory recognition signals that AI in healthcare is moving beyond proof-of-concept toward standardized clinical practice.
However, this expansion brings new challenges. As AI tools proliferate across clinical settings, questions about data compliance, algorithmic transparency, and the boundaries of medical practice become increasingly urgent. Healthcare AI companies must navigate complex regulatory requirements around patient data protection, algorithmic fairness, and unlicensed medical practice.
UCLA's validation center addresses these challenges by establishing rigorous evaluation standards that can inform both clinical practice and regulatory policy. By systematically measuring whether AI tools are safe and effective in real-world settings, the center helps create the evidence base that hospitals, regulators, and patients need to make informed decisions about AI adoption.
"INOVAi represents an important Center of Excellence in AI evaluation and implementation science, and an integral part of UCLA's broader institutional approach to responsible health AI," said Dr. Steven Dubinett, dean of the David Geffen School of Medicine at UCLA.
Dr. Steven Dubinett, Dean of the David Geffen School of Medicine at UCLA
The center's work is particularly important because it addresses a fundamental gap in healthcare AI adoption. Many hospitals face pressure to implement AI tools quickly, but without clear evidence of real-world effectiveness, they risk deploying systems that fail to deliver promised benefits or inadvertently disrupt clinical workflows. INOVAi's rigorous evaluation approach provides a pathway for hospitals to adopt AI confidently, knowing that the tools have been tested in conditions similar to their own clinical environments.