Open Access. Powered by Scholars. Published by Universities.®
![Digital Commons Network](http://assets.bepress.com/20200205/img/dcn/DCsunburst.png)
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
Articles 1 - 1 of 1
Full-Text Articles in Physical Sciences and Mathematics
Analysis Of Predictive Performance And Reliability Of Classifiers For Quality Assessment Of Medical Evidence Revealed Important Variation By Medical Area, Simon Šuster, Timothy Baldwin, Karin Verspoor
Analysis Of Predictive Performance And Reliability Of Classifiers For Quality Assessment Of Medical Evidence Revealed Important Variation By Medical Area, Simon Šuster, Timothy Baldwin, Karin Verspoor
Natural Language Processing Faculty Publications
Objectives: A major obstacle in deployment of models for automated quality assessment is their reliability. To analyze their calibration and selective classification performance. Study Design and Setting: We examine two systems for assessing the quality of medical evidence, EvidenceGRADEr and RobotReviewer, both developed from Cochrane Database of Systematic Reviews (CDSR) to measure strength of bodies of evidence and risk of bias (RoB) of individual studies, respectively. We report their calibration error and Brier scores, present their reliability diagrams, and analyze the risk–coverage trade-off in selective classification. Results: The models are reasonably well calibrated on most quality criteria (expected calibration error …