Reliability of various observers in determining common radiographic parameters of adult hip structural anatomy.
Academic Article
Overview
abstract
BACKGROUND: Radiographic evaluation of the hip is extremely important in the diagnosis and treatment decisionmaking process for pre-arthritic hip disease. Many different radiographic measurements have been described as indicators of underlying structural hip deformity. The purpose of this study was to determine the interobserver and intraobserver reliability of various musculoskeletal physicians in performing selected measurements of adult structural hip anatomy. METHODS: A blinded review of 45 sets of radiographs from patients with developmental dysplasia, femoro-acetabular impingement, and normal anatomy was performed. Data points included the lateral center-edge angle (LCEA), vertical-center-anterior angle (VCA), head-neck offset ratio (UNO), alpha angle, Tönnis angle, Tönnis osteoarthritis grade and a radiographic diagnosis. One orthopaedic fellow, two orthopaedic residents, and two attending musculoskeletal physiatrists analyzed radiographs on two separate occasions. One sports medicine orthopaedic attending physician completed a single analysis of the image sets. Intraobserver and interobserver reliability was established using intra-class correlation coefficients (ICC) for continuous variables. Agreement regarding categorical variables was performed using the kappa coefficient RESULTS: Excellent intraobserver reliability was found for the following: LCEA (ICC = 0.88), VCA (0.88), Tönnis angle (0.83), HNO on the frog lateral (0.78), alpha angle on the frog lateral (0.76), HNO on the cross-table lateral (0.75), and angle alpha on the cross-table lateral (0.76). Intraobserver reliability for osteoarthritis grade was poor (weighted kappa = 0.57). For all data points, interobserver reliability was considerably worse, with 95% confidence intervals spanning below 0.55. CONCLUSIONS: While the described measurements of adult structural hip anatomy provide excellent reliability for a given reader, these measurements are less reliable across readers. Taken in isolation, these measurements, as performed by observers with varied clinical experience and clinical backgrounds, are limited in determining a consistent radiographic diagnosis.