Session: 803. Emerging Tools, Techniques, and Artificial Intelligence in Hematology: Poster II
Hematology Disease Topics & Pathways:
Research, Translational Research, Lymphomas, Non-Hodgkin lymphoma, Diseases, Lymphoid Malignancies, Computational biology, Technology and Procedures, Machine learning
Malignancies exhibit variable cellular distribution patterns and the relationship between these topographic variations, underlying biological processes, and clinical outcomes remain poorly understood. Point process analyses, widely used in ecology, can elucidate the spatial distribution of points in complex systems but have rarely been applied to tumor heterogeneity. We recently demonstrated (Hoppe et al, Cancer Discovery 2023), that cells co-expressing high MYC and BCL2 but lacking BCL6 (M+2+6-), in Diffuse Large B Cell Lymphoma (DLBCL) are consistently correlated with poor survival compared to other MYC/BCL2/BCL6 combinations. Machine learning approaches can be applied to understand nuances of cellular point patterns and help with correlating with clinicopathological variables. Here, we developed a code frame that can be generalized across tissue regions accounting for heterogeneity to quantitatively study spatial patterns of M+2+6- cells in DLBCL.
Methods
We developed a scalable automated workflow for generating spatial point patterns. The individual steps of the pipeline have been consolidated into a standalone package that can be executed in a facile manner for any image type, without dependencies on any proprietary software. Using multiplexed fluorescent immunohistochemistry (mfIHC) in four cohorts of DLBCL (n=449), spatial point patterns were derived, and Geyer's point process model was applied. Machine learning classification models were benchmarked for spatial statistics derived from the point process model. A multi-omic analysis, including single-cell transcriptomic analyses of 22 DLBCL samples, was then conducted. Spatial transcriptomic technique, Stereoseq, was also conducted on 2 DLBCL samples.
Results
The workflow consists of the following parts: 1) The python script using the OpenCV package to manipulate the kernel size and intensity of the spatial coordinates overlaid on the images; 2) A QuPath groovy script to automate the import and export of the images and the parameter thresholds for the pixel classifier; 3) An R script to build the spatial point patterns from coordinates and save different oncogene co-expression as marks within the accurate geojson annotations; and 4) an R script to obtain measures of quality in terms of minimizing the number of points excluded while generating accurate spatial point pattern windows.
After applying the pipeline, we see that patients could be divided into two: one group showed “clustered” spatial organization, while the other displayed a “dispersed” M+2+6- cell distribution. We achieved an accuracy of 98% in classifying patients as “dispersed“ and “clustered” across four cohorts, through the random forest model. Cases with "dispersed" M+2+6- cells had shorter overall survival across all analyzed cohorts (P < 0.05 in 4/4 cohorts). Patients enriched in the “dispersed” phenotype, predominantly belonged to the ABC cell of origin subtype. We derived a “dispersed” pattern gene signature through multi-omic analyses which expressed genes implicated in cell migration and adhesion. Validation of the dispersed signature was conducted using Stereoseq, where M+2+6- cells enriched in the signature displayed greater values of L function across distances. Patients enriched in the dispersed phenotype displayed lower infiltration of immune subtypes though deconvolution hinting at a possible immunologically cold microenvironment.
Conclusion
This study demonstrates the clinical relevance studying the spatial distribution of malignant cell subpopulations through point pattern analysis. We anticipate that this machine learning pipeline can be developed for clinical use, enabling the classification of spatial phenotypes in DLBCL biopsies for patient stratification.
Disclosures: De Mel: Pfizer: Other: advisory board ; Amgen: Other: advisory board. Chng: Amgen: Honoraria; J&J: Honoraria, Research Funding; Celgene: Honoraria, Research Funding; BMS: Honoraria; Abbvie: Honoraria; Novartis: Honoraria; Takeda: Honoraria; Hummingbird: Research Funding. Scott: Veracyte: Consultancy, Honoraria; Roche: Consultancy, Honoraria; Roche/Genentech: Research Funding; AstraZenenca: Consultancy, Honoraria; Genmab: Consultancy, Honoraria; Abbvie: Consultancy, Honoraria; Nanostring: Patents & Royalties: use of gene expresssion to subtype aggressive lymphoma. Jeyasekharan: BeiGene: Consultancy; Roche: Consultancy; Gilead Sciences: Consultancy; Turbine: Consultancy; AstraZeneca: Consultancy, Research Funding; Antengene Corp: Consultancy; Janssen: Consultancy, Research Funding; MSD: Consultancy; IQVIA: Consultancy.