Suboptimal reliability of liver biopsy evaluation has implications for randomized clinical trials
Autor: | Gary G. Koch, Howard C. Dittrich, Gad Cotter, Beth A. Davison, Jerry R. Colca, Arun J. Sanyal, Julie Iwashita, Christopher L. Edwards, Naim Alkhouri, Stephen A. Harrison |
---|---|
Rok vydání: | 2020 |
Předmět: |
Liver Cirrhosis
Male 0301 basic medicine medicine.medical_specialty Biopsy Risk Assessment law.invention 03 medical and health sciences 0302 clinical medicine Randomized controlled trial Non-alcoholic Fatty Liver Disease law Fibrosis Internal medicine medicine Humans Hypoglycemic Agents Reliability (statistics) Hepatology medicine.diagnostic_test business.industry Acetophenones Reproducibility of Results Middle Aged Prognosis medicine.disease Clinical trial 030104 developmental biology Clinical research Diabetes Mellitus Type 2 Liver Research Design Liver biopsy Disease Progression Female Thiazolidinediones 030211 gastroenterology & hepatology Insulin Resistance Steatohepatitis Steatosis business |
Zdroj: | Journal of Hepatology. 73:1322-1332 |
ISSN: | 0168-8278 |
Popis: | Background & Aims Liver biopsies are a critical component of pivotal studies in non-alcoholic steatohepatitis (NASH), constituting inclusion criteria, risk stratification factors and endpoints. We evaluated the reliability of NASH Clinical Research Network scoring of liver biopsies in a NASH clinical trial. Methods Digitized slides of 678 biopsies from 339 patients with paired biopsies randomized into the EMMINENCE study – examining a novel insulin sensitizer (MSDC-0602K) in NASH – were read independently by 3 hepatopathologists blinded to treatment code and scored using the NASH CRN histological scoring system. Various endpoints were computed from these scores. Results Inter-reader linearly weighted kappas were 0.609, 0.484, 0.328, and 0.517 for steatosis, fibrosis, lobular inflammation, and ballooning, respectively. Inter-reader unweighted kappas were 0.400 for the diagnosis of NASH, 0.396 for NASH resolution without worsening fibrosis, and 0.366 for fibrosis improvement without worsening NASH. In the current study, 46.3% of the patients included in the study based on 1 hepatopathologist's qualifying reading were deemed not to meet the study's histologic inclusion criteria by at least 1 of the 3 hepatopathologists. The MSDC-0602K treatment effect was lowest for those histologic features with lower inter-reader reliability. Simulations show that the lack of reliability of endpoints and inclusion criteria can drastically reduce study power – from >90% in a well-powered study to as low as 40%. Conclusions The reliability of hepatopathologists' liver biopsy evaluation using currently accepted criteria is suboptimal. This lack of reliability may affect NASH pivotal studies by introducing patients who do not meet NASH study entry criteria, misclassifying fibrosis subgroups, and attenuating apparent treatment effects. Lay summary Since liver biopsy analysis plays such an important role in clinical studies of non-alcoholic steatohepatitis, it is important to understand the reliability of hepato-pathologist readings. We examined both inter- and intra-reader variability in a large data set of paired liver biopsies from a clinical trial. We found very poor inter-reader and modest intra-reader variability. This result has important implications for entry criteria, fibrosis stratification, and the ability to measure a treatment effect in clinical trials. |
Databáze: | OpenAIRE |
Externí odkaz: |