-Author name in bold denotes the presenting author
-Asterisk * with author name denotes a Non-ASH member
Clinically Relevant Abstract denotes an abstract that is clinically relevant.

PhD Trainee denotes that this is a recommended PHD Trainee Session.

Ticketed Session denotes that this is a ticketed session.

3602 Machine Learning Classification of Spatial Patterns of Malignant Cells Reveals Implications in Prognosis and Tumor Microenvironment Composition in Lymphoma

Program: Oral and Poster Abstracts
Session: 803. Emerging Tools, Techniques, and Artificial Intelligence in Hematology: Poster II
Hematology Disease Topics & Pathways:
Research, Translational Research, Lymphomas, Non-Hodgkin lymphoma, Diseases, Lymphoid Malignancies, Computational biology, Technology and Procedures, Machine learning
Sunday, December 8, 2024, 6:00 PM-8:00 PM

Shruti Sridhar1*, Michal Marek Hoppe1*, Patrick Jaynes1*, Gayatri Kumar2*, Siddham Jasoria3*, Ziwei Meng4*, Yanfen Peng1*, Sanjay De Mel, BSc, FRCPath, MRCP5,6*, Limei Poon5,6*, Esther Hian Li Chan5,6*, Wee-Joo Chng1,5,6,7, Soo Yong Tan5,8*, Susan Swee-Shan Hue5,8*, Siok-Bian Ng1,5,8*, Chandramouli Nagarajan9*, Nicholas Francis Grigoropoulos9*, Shaoying Li10*, Joseph D. Khoury11, Pedro Farinha12, Anja Mottok12*, David W. Scott, MBChB, PhD12, Xiaofei Ye13*, Qiang Pan-Hammarström14*, Vaibhav Rajan4*, Kasthuri Kannan2* and Anand Devaprasath D. Jeyasekharan, MBBS, MRCP, PhD1,5,6,7*

1Cancer Science Institute of Singapore, National University of Singapore, Singapore, Singapore
2Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX
3Birla Institute of Technology & Science, Pilani, Pilani, India
4Department of Information Systems and Analytics, School of Computing, National University of Singapore, Singapore, Singapore
5NUS Centre for Cancer Research, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
6Department of Haematology-Oncology, National University Cancer Institute, Singapore, National University Health System, Singapore, Singapore
7Department of Medicine, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
8Department of Pathology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
9Department of Haematology, Singapore General Hospital, Singapore, Singapore
10Department of Hematopathology, Division of Pathology and Laboratory Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX
11Department of Pathology and Microbiology, University of Nebraska Medical Center, Omaha, NE
12BC Cancer Research Centre, Vancouver, BC, Canada
13Kindstar Global Precision Medicine Institute, Wuhan, China
14Division of Immunology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden

Background

Malignancies exhibit variable cellular distribution patterns and the relationship between these topographic variations, underlying biological processes, and clinical outcomes remain poorly understood. Point process analyses, widely used in ecology, can elucidate the spatial distribution of points in complex systems but have rarely been applied to tumor heterogeneity. We recently demonstrated (Hoppe et al, Cancer Discovery 2023), that cells co-expressing high MYC and BCL2 but lacking BCL6 (M+2+6-), in Diffuse Large B Cell Lymphoma (DLBCL) are consistently correlated with poor survival compared to other MYC/BCL2/BCL6 combinations. Machine learning approaches can be applied to understand nuances of cellular point patterns and help with correlating with clinicopathological variables. Here, we developed a code frame that can be generalized across tissue regions accounting for heterogeneity to quantitatively study spatial patterns of M+2+6- cells in DLBCL.

Methods

We developed a scalable automated workflow for generating spatial point patterns. The individual steps of the pipeline have been consolidated into a standalone package that can be executed in a facile manner for any image type, without dependencies on any proprietary software. Using multiplexed fluorescent immunohistochemistry (mfIHC) in four cohorts of DLBCL (n=449), spatial point patterns were derived, and Geyer's point process model was applied. Machine learning classification models were benchmarked for spatial statistics derived from the point process model. A multi-omic analysis, including single-cell transcriptomic analyses of 22 DLBCL samples, was then conducted. Spatial transcriptomic technique, Stereoseq, was also conducted on 2 DLBCL samples.

Results

The workflow consists of the following parts: 1) The python script using the OpenCV package to manipulate the kernel size and intensity of the spatial coordinates overlaid on the images; 2) A QuPath groovy script to automate the import and export of the images and the parameter thresholds for the pixel classifier; 3) An R script to build the spatial point patterns from coordinates and save different oncogene co-expression as marks within the accurate geojson annotations; and 4) an R script to obtain measures of quality in terms of minimizing the number of points excluded while generating accurate spatial point pattern windows.

After applying the pipeline, we see that patients could be divided into two: one group showed “clustered” spatial organization, while the other displayed a “dispersed” M+2+6- cell distribution. We achieved an accuracy of 98% in classifying patients as “dispersed“ and “clustered” across four cohorts, through the random forest model. Cases with "dispersed" M+2+6- cells had shorter overall survival across all analyzed cohorts (P < 0.05 in 4/4 cohorts). Patients enriched in the “dispersed” phenotype, predominantly belonged to the ABC cell of origin subtype. We derived a “dispersed” pattern gene signature through multi-omic analyses which expressed genes implicated in cell migration and adhesion. Validation of the dispersed signature was conducted using Stereoseq, where M+2+6- cells enriched in the signature displayed greater values of L function across distances. Patients enriched in the dispersed phenotype displayed lower infiltration of immune subtypes though deconvolution hinting at a possible immunologically cold microenvironment.

Conclusion

This study demonstrates the clinical relevance studying the spatial distribution of malignant cell subpopulations through point pattern analysis. We anticipate that this machine learning pipeline can be developed for clinical use, enabling the classification of spatial phenotypes in DLBCL biopsies for patient stratification.

Disclosures: De Mel: Pfizer: Other: advisory board ; Amgen: Other: advisory board. Chng: Amgen: Honoraria; J&J: Honoraria, Research Funding; Celgene: Honoraria, Research Funding; BMS: Honoraria; Abbvie: Honoraria; Novartis: Honoraria; Takeda: Honoraria; Hummingbird: Research Funding. Scott: Veracyte: Consultancy, Honoraria; Roche: Consultancy, Honoraria; Roche/Genentech: Research Funding; AstraZenenca: Consultancy, Honoraria; Genmab: Consultancy, Honoraria; Abbvie: Consultancy, Honoraria; Nanostring: Patents & Royalties: use of gene expresssion to subtype aggressive lymphoma. Jeyasekharan: BeiGene: Consultancy; Roche: Consultancy; Gilead Sciences: Consultancy; Turbine: Consultancy; AstraZeneca: Consultancy, Research Funding; Antengene Corp: Consultancy; Janssen: Consultancy, Research Funding; MSD: Consultancy; IQVIA: Consultancy.

*signifies non-member of ASH