Statistical combination schemes of repeated diagnostic test data.


Kelly H Zou, Jui G Bhagwat, and John A Carrino. 2006. “Statistical combination schemes of repeated diagnostic test data.” Acad Radiol, 13, 5, Pp. 566-72. Copy at


RATIONALE AND OBJECTIVES: When diagnostic tests are repeated and combined, a number of schemes may be adopted. Guidelines for their interpretations are required. MATERIALS AND METHODS: Three combination schemes, "and" (A), "or" (O), and "majority" (M), are considered. To evaluate these schemes, dependency by specifying kappa values quantifying repeated test agreement was structured. In a pilot study, the combined accuracies of magnetic resonance imaging using six different pulse sequences of medial collateral ligaments of the elbows of 28 cadavers, with eight having lesions artificially created surgically, were examined. Images were evaluated simultaneously by using a five-point ordinal scale. For each pulse sequence, individuals for whom the diagnosis varied from once to three repetitions were considered. RESULTS: Scheme M improves diagnostic accuracy when sensitivity and specificity of a single test exceed 0.5, with maximal improvement at 0.79. Under scheme A, sensitivity decreases to 0.38-0.59. Under scheme O, sensitivity increases to 0.53-0.79. Scheme M yields a small improvement, reaching 0.50-0.71. Under scheme A, specificity increases to 0.95-0.98. Under scheme O, specificity decreases to 0.91-0.98. Scheme M also yields a small improvement, reaching 0.94-0.98. CONCLUSION: Scheme A is recommended for ruling in diagnoses, scheme O is recommended for ruling out diagnoses, and scheme M is neutral. Consequently, different schemes may be used to optimize the target diagnostic accuracy.