-Author name in bold denotes the presenting author
-Asterisk * with author name denotes a Non-ASH member
Clinically Relevant Abstract denotes an abstract that is clinically relevant.

PhD Trainee denotes that this is a recommended PHD Trainee Session.

Ticketed Session denotes that this is a ticketed session.

2313 High Concordance between Investigator-Verified Diagnosis and Manual Data Abstraction for Sickle Cell Diagnosis Type: An ASH Research Collaborative Data Hub Validation Study

Program: Oral and Poster Abstracts
Session: 904. Outcomes Research: Hemoglobinopathies: Poster I
Hematology Disease Topics & Pathways:
Sickle Cell Disease, Clinical Practice (Health Services and Quality), Hemoglobinopathies, Diseases
Saturday, December 7, 2024, 5:30 PM-7:30 PM

Alexis Thompson, MD, MPH1, Ashima Singh, PhD, MS2, Donna S. Neuberg, ScD3, Amanda M. Brandow, DO, MS4, Allison A. King, MD, PhD, MPH5, Sophie Lanzkron, MD6, Emily A. Semmel, MS7*, Sam Walters, MBA8,9*, Susan Rogers7*, Kathleen Hewitt, DNP, RN10* and William A. Wood, MD, MPH11

1Division of Hematology, Children’s Hospital of Philadelphia, Phildelphia, PA
2Department of Pediatrics, Section of Hematology/Oncology/Bone Marrow Transplantation, Medical College of Wisconsin, Pewaukee, WI
3Department of Data Science, Dana-Farber Cancer Institute, Boston, MA
4Department of Pediatrics, Section of Hematology/Oncology/Bone Marrow Transplantation, Medical College of Wisconsin, Milwaukee, WI
5School of Medicine, Washington University, Clayton, MO
6Thomas Jefferson University, Philadelphia, PA
7American Society of Hematology, Washington, DC
8Breakthrough Healthcare, Baltimore, MD
9American Society of Hematology, Ellicott City, MD
10ASH Research Collaborative, Gainesville, VA
11Division of Hematology, Department of Medicine, University of North Carolina At Chapel Hill Hospital, Chapel Hill, NC

Background: Sickle cell disease (SCD) is the most common inherited blood disorder in the United States. The ASH Research Collaborative (ASH RC) is building an SCD Data Hub (DH) to facilitate research and quality improvement using real-world data. The DH ingests multi-site electronic health record (EHR) data, which are rich in clinical context and longitudinal assessments. SCD diagnosis can be ascertained in the EHR from any encounter across the health system. Diagnostic accuracy is an essential data quality element in EHR-based research. We report here the first analyses from the SCD DH, focusing on the demographics of the cohort and accurate identification of SCD diagnosis type.

Methods: SCD DH sites have a data use agreement with the ASH RC, and either a reliance agreement with the Western-Copernicus Group Institutional Review Board (IRB) or local IRB approval to extract and transfer EHR data at least quarterly using the Observational Medical Outcomes Partnership (OMOP) common data model. The SCD DH was supplemented with a site principal investigator (PI) attestation of SCD diagnosis based on existing data sources (e.g., local registries, clinic notes, problem lists) deemed sufficient by the PI. Such cases are designated Investigator-Verified. A random sample of 25 patients (per site) with investigator verified SCD types and at least one encounter in 2022, was evaluated further at 10 sites in a Data Validation Pilot Project. SCD type and other variables were manually abstracted by site personnel and reported to the DH through REDCap. For each data item abstracted, source locations were recorded. The investigator-verified and manually abstracted SCD diagnosis types were compared. Concordance was calculated as the percentage of cases with the same SCD type determined by the Investigator-Verified and abstracted methods.

Results: Of the 56 sites enrolled in the DH, 20 are currently submitting data. As of 01/31/2024, the DH included data for 24,581 unique individuals with SCD. The geographic distribution of sites aligns well with the national distribution of the SCD population. Sites were urban (95%) and suburban (5%); all were academic medical centers; and were self-designated as pediatric (45%), adult (40%), or combined (15%) programs. The median patient age was 23 years (range: birth to 75 years); 55.4% were female.

The SCD DH contained 9,484 patients across 12 sites with Investigator-Verified SCD diagnosis types. SCD types were reported as HbSS (67.2%), HbSC (22.2%), HbSß0thalassemia (2.1%), HbSß+thalassemia (6.1%), or HbS/Other (2.4%). In the Data Validation Pilot Project, SCD type was manually abstracted in a random sample of 243 patients at 10 sites. Thirty-six patients, for whom the Investigator-Verified diagnosis was “unable to determine”, were excluded from the concordance analysis.

Of the 207 evaluable cases, 193 had concordant SCD diagnosis types, with an overall concordance rate of 93%. Five of nine sites had 100% concordance. There were 14 cases (7%) where the manually-abstracted SCD type was discordant with the Investigator-Verified SCD diagnosis. Discordant types were most often reported as HbSS in the Investigator-Verified Diagnosis Project, and discordance was higher for patients with less common SCD types, such as HbSß+thalassemia. Source locations reported for manual abstraction of SCD diagnosis included hemoglobin electrophoresis, HPLC, DNA testing, patient registry, newborn screening, and clinical documentation. Determination of concordance between other automated extracted and manually abstracted data elements is now underway for this validation study subset, the larger investigator-verified cohort, and the full DH SCD population to generate EHR-derived computable phenotype algorithms.

Conclusions: The ASH RC DH is one of the largest SCD data resources in the United States. Investigator-Verified diagnosis had high concordance with a manual chart abstraction in a random sample of patients across 10 sites. This concordance increases confidence in the validity of both existing and future SCD DH data. Well-curated and transformed data on a large cohort of individuals with SCD will be leveraged to conduct observational studies and inform the design and feasibility of prospective interventional studies to generate high-quality, granular real-world evidence and improve outcomes for this complex patient population.

Disclosures: Thompson: Editas: Consultancy, Research Funding; Beam Therapeutics: Consultancy, Research Funding; Novartis: Research Funding; CRISPR/Vertex: Consultancy, Research Funding; Global Blood Therapeutics: Divested equity in a private or publicly-traded company in the past 24 months; bluebird bio: Consultancy, Research Funding. Neuberg: Madrigal Pharmaceutical: Current equity holder in publicly-traded company. Brandow: Pfizer: Other: Adjudication committee for clinical trial . King: UpToDate: Patents & Royalties; Cigna: Consultancy. Lanzkron: Pfizer: Other: Stock in Family trust; Merck: Consultancy; PCORI: Research Funding; GMI: Consultancy; Takeda: Research Funding; Agios: Membership on an entity's Board of Directors or advisory committees; Novo Nordisk: Membership on an entity's Board of Directors or advisory committees; HRSA: Research Funding; Pfizer: Consultancy; GBT: Research Funding; Novartis: Consultancy; CSL-Behring: Research Funding; Bluebird BIo: Membership on an entity's Board of Directors or advisory committees; Teva: Other: Stock in Family trust. Wood: Pfizer: Research Funding; Genetech: Research Funding; ASH Research Collaborative: Consultancy; Koneksa Health: Consultancy, Current equity holder in publicly-traded company; Teledoc Health: Consultancy.

<< Previous Abstract | Next Abstract
*signifies non-member of ASH