AITRICS Publishes Study on the Impact of Clinician-Driven Missing Data on AI Model Performance

2025-07-02

AI model performance is more influenced by missing data reflecting clinical judgment than by the sheer volume of data

[%EC%82%AC%EC%A7%84]%20%EC%9D%98%EB%A3%8C%EC%A7%84%20%ED%8C%90%EB%8B%A8%20%EB%B0%98%EC%98%81%20%EA%B2%B0%EC%B8%A1%EA%B0%92%EA%B3%BC%20AI%20%EB%AA%A8%EB%8D%B8%20%EC%84%B1%EB%8A%A5%20%EA%B4%80%EA%B3%84%20%EB%B6%84%EC%84%9D%20%EB%85%BC%EB%AC%B8%20%EB%B0%9C%ED%91%9C_44579.jpg

AITRICS (CEO Kwang Joon Kim), a company specializing in artificial intelligence(AI) technology announced on July 2 that its research paper, which explores how missing clinical data can reflect medical decision-making rather than merely indicating information gaps, has been published in the SCIE-indexed international journal Journal of Clinical Medicine.

The study retrospectively analyzed clinical data from 24,359 adult patients admitted to general internal medicine and surgical wards at Jesus Hospital. It validated the concept of Informative Presence — the idea that missing data itself can represent the outcome of clinical judgment.

Building on previous research, the team hypothesized that data collection patterns and missing rates would vary based on patient severity. Patients were stratified into high-risk (Charlson Comorbidity Index, CCI > 3) and low-to-moderate-risk (CCI ≤ 3) groups. The study then compared missing data rates and AI model performance between these two cohorts.

The results showed that high-risk patients underwent more tests and thus had lower missing data rates. Conversely, the low-to-moderate-risk group had higher missing rates due to less frequent testing. However, within both groups, patients who experienced clinical deterioration events had consistently lower missing rates than those who did not. This suggests that clinicians tend to order more tests when they suspect potential deterioration, regardless of the patient’s baseline risk level.

Despite differences in test frequency and missing data rates across patient groups, the AI model's predictive performance remained stable — with AUROC scores of 0.86 for the overall cohort, 0.86 for the high-risk group, and 0.85 for the low-to-moderate-risk group. This indicates that the clinical context behind whether a test was performed plays a more critical role in model performance than the amount of data alone.

CEO Kwang Joon Kim of AITRICS stated, “Since test frequencies and missing patterns vary depending on patient condition, AI models must be able to recognize and interpret such clinical behavior patterns based on severity. Rather than relying solely on numerical data, models that incorporate clinical judgment will be more trusted and effectively applied in real-world medical settings.”

Previoius

AITRICS Publishes Study Demonstrating the Clinical Significance of Missing Data

AITRICS Successfully Hosts the 2025 Rapid Response System Symposium

COMMUNICATION

AITRICS Publishes Study on the Impact of Clinician-Driven Missing Data on AI Model Performance