A Preliminary Comparative Study on the Diagnostic Accuracy of Machine Learning AI Systems in Medical Diagnosis

Prashant Kumar Jha

doi:10.62502/ijmi/v3i1art5

A Preliminary Comparative Study on the Diagnostic Accuracy of Machine Learning AI Systems in Medical Diagnosis

Authors : Prashant Kumar Jha

DOI : 10.62502/ijmi/v3i1art5

Volume : 3

Issue : 1

Year : 2026

Page No : 21-24

Background: Artificial intelligence (AI) and machine learning (ML) systems are increasingly being explored in medical imaging to support radiological diagnosis. Aim: This study aimed to perform a preliminary comparative assessment of the diagnostic accuracy of an ML-based ChatGPT reporting system versus manual radiologist interpretation in general radiography. Materials and Methods: A prospective study was conducted on 30 radiographic examinations (n = 30), including chest X-rays (PA view), spine (AP and lateral), upper extremity, and lower extremity radiographs, performed using an X-Tech 500 mA X-ray machine over a period from 2nd January 2026 to 16th January 2026. Each image was first reported by a radiologist, then independently analyzed by a ChatGPT-based ML system. Both reports were finally reviewed by a senior radiologist as the reference standard. Results: Manual radiologist interpretation showed higher diagnostic accuracy (96.7%, n = 29/30) compared to the ML system (80.0%, n = 24/30), with a statistically significant difference (p < 0.05). The ML system performed better in chest radiographs but showed reduced sensitivity in musculoskeletal imaging. It frequently failed to detect subtle findings such as hairline fractures and non-displaced fractures, with significantly lower sensitivity (33.3% vs 100%, p < 0.01). Conclusion: The ML-based ChatGPT system demonstrated moderate diagnostic performance but was inferior to manual radiologist interpretation, particularly for subtle skeletal injuries. Keywords: Machine Learning, Artificial Intelligence, ChatGPT, General Radiography

Citation Data

To evaluate the functional outcomes in patella rim cauterized total knee arthroplasty

Ashwin Shetty, Nikhil Manvi, Sanath Kumar Shetty

Syphilis in blood donors: Pre-transfusion serological screening by Rapid Plasma Reagin (RPR) Test at the blood bank of a Teaching Medical Institute in North Gujarat, India

Dipakkumar R. Prajapati, B. H. Parmar

“Histopathological Analysis of Appendix in Clinically diagnosed and operated Acute Appendicitis cases- A retrospective Study”

Basavanandaswamy CH, Sinhasan S.P,

A systematic review of the performance of Artificial Intelligence for automated DWI/FLAIR mismatch evaluation on MRI in ischemic stroke

Zahra Soltanali, Alireza Pourrahim, Chelsea Ruth-Ann Williams, Mohammad Hossain Ekvan, Iraj Ahmadi, Omid Raiesi

Xboom utilities Pvt. Ltd: Leading self defence industry

Kiran Desai, Kumar Ashutosh, Shubham Pasricha, Aadya Singh, Samyak Shah

A Preliminary Comparative Study on the Diagnostic Accuracy of Machine Learning AI Systems in Medical Diagnosis

Citation Data

Related Articles

To evaluate the functional outcomes in patella rim cauterized total knee arthroplasty

Syphilis in blood donors: Pre-transfusion serological screening by Rapid Plasma Reagin (RPR) Test at the blood bank of a Teaching Medical Institute in North Gujarat, India

Paediatric airway and challenges during covid era

Role of spirometry for preoperative evaluation of stable COPD patients before elective laparoscopic cholecystectomy

Trend and pattern of international capital flows in India

Perioperative management of thalassemia intermedia patient posted for major spine surgery – A case report

“Histopathological Analysis of Appendix in Clinically diagnosed and operated Acute Appendicitis cases- A retrospective Study”

A systematic review of the performance of Artificial Intelligence for automated DWI/FLAIR mismatch evaluation on MRI in ischemic stroke

Xboom utilities Pvt. Ltd: Leading self defence industry