Evaluating the AI App "POSOS" for Detecting Drug-Induced Iatrogenesis (A Clinical Vignettes Study) — a clinical trial (ClinicalTrials.gov)
A multicenter observational study using simulated cases (clinical vignettes) to test how much the AI app "POSOS" helps physicians detect drug-induced iatrogenesis.
Trial overview (primary data)
- StatusCompleted
- ConditionsIatrogenesis, Clinical Vignettes, Diagnostic Performance Study, Medical Information Search, Artificial Intelligence
- SponsorCentre Hospitalier Universitaire, Amiens
- Target enrollment85 participants
- Period2023-09-11 〜 2025-07-18
Key points
- An observational study testing the diagnostic performance of the AI app "POSOS" for detecting drug-induced iatrogenesis, using simulated cases (clinical vignettes).
- Physicians are randomly assigned to use or not use the app and grouped by years of experience for comparison.
- They answer vignettes mixing complex, simple, and non-iatrogenesis cases, with responses recorded at 5 and 15 minutes.
- Time taken, number of medical-search apps used, and user experience are also assessed.
Iatrogenesis means harm caused by medical care itself, and among its forms, drug side effects and drug-drug interactions are especially easy to overlook. When a patient takes several medications, or when symptoms resemble other illnesses, recognizing in a busy emergency setting that "this symptom might be caused by a drug" is far from straightforward. Software that can rapidly cross-check vast amounts of medication information is seen as a potential aid in reducing such oversights.
What makes this study notable is its design: the AI app is evaluated not on real patients but on "clinical vignettes" (simulated cases), and physicians are randomly split into groups that do or do not use the app for comparison. Rather than trying a new tool directly in live care, this approach seeks to isolate, in a safe simulated environment, how having the tool changes a clinician's judgment — a careful way to evaluate medical AI. Responses are recorded at two time points, after 5 and 15 minutes, alongside the time taken, the number of medical-search apps used, and the overall user experience. Note that the app's effectiveness, accuracy, and safety are not established; the study aims to test its usefulness.
The challenge of medication safety is broadly shared across countries and health systems. In the United States as well, drug interactions and adverse events are an important public-health topic, and the question of how to measure the performance of such detection-support tools carries cross-cutting lessons for how medical AI itself should be evaluated. A randomized comparison built on simulated cases is a framework that could be applied to evaluating other diagnostic-support tools too.
Why it matters
An effort to measure the performance of software that supports detection of drug-related adverse events, using a randomized comparison on simulated cases. A useful example for those interested in how medical AI is evaluated and in developing medication-safety tools.
FAQ
Was this study conducted on real patients?
Has the AI app been proven effective?
Sources (primary)
Source: ClinicalTrials.gov (U.S. NIH/NLM, public domain). This site does not provide medical advice. Verify the latest and exact details with the official source. This site is not endorsed or certified by the NIH/NLM.
- ClinicalTrials.gov (study record, original)
- NCT ID: NCT05952193