Activities

2026

POSTER
Locally Deployed Agentic LLMS for Structured Data Extraction From Complex Oncology EHRS

Event:

AI for Oncology and Cancer Research. May 7th - 8th, 2026. Milan, Italy

Abstract:

Background: Large language models (LLMs) have potential for structuring clinical data, yet extracting structured variables from lengthy EHRs, especially in non-English settings, remains challenging. We evaluated the reliability of open-source LLMs for this task using complex oncological EHRs while ensuring data privacy through on-premise deployment.

Methods: We studied a dataset from St. Olavs University Hospital, including 62 patients treated for stage IV colorectal cancer from 2010 to 2020, with EHRs ranging from 180,000 to 1.7 million characters. We manually extracted fifty variables related to patient characteristics and treatments to establish a gold standard, benchmarking extraction strategies with models under 100B parameters.

Results: The sequential approach achieved an F1 score of 0.74 and an accuracy of 0.73, with categorical variables like BRAF mutation status (F1 = 0.983) performing best, while temporal and numerical variables were more challenging. We developed a retrieval framework using MCP search, resulting in roughly 50-fold faster processing and improved accuracy for temporal variables.

Conclusions: Locally deployed LLMs can effectively extract structured oncology variables from complex, non-English EHRs. Improved retrieval architectures demonstrate potential for enhancing speed and performance, with ongoing work aimed at optimizing methods and expanding the dataset.

Presenter:

Andreas Eikså
Thomas Falkeid Hagland
Leo Quentin Thorbjørnsen Bækholt

2026

POSTER
Model-Guided Discovery of Synergistic Drug Combinations in Colorectal Cancer

Event:

Computational Systems Biology of Cancer 8th edition. Sept. 22nd - 26th, 2025. Institut Curie, Paris

Abstract:

Nearly half of colorectal cancer (CRC) patients have druggable molecular alterations, but single-target therapies often fail due to resistance. To improve treatment strategies, we used a Boolean modeling approach to identify effective drug combinations for CRC cell lines. From 880 candidate combinations, we validated 176, with 119 confirmed as synergistic, achieving a 68% success rate—significantly higher than past efforts.

Our framework demonstrated strong predictive performance with 72% accuracy, 76% precision, and 87% recall, correctly identifying almost 90% of observed synergies. Most successful combinations involved mTOR or AKT inhibitors, while combinations with CHEK1 inhibitors were less effective. We discovered two novel synergies: mTOR with ERK and mTOR with BCL2, both confirmed across 16 cell lines and supported by analysis suggesting mechanisms of action.

These findings highlight the potential of our modeling approach to enhance drug discovery efficiency, prioritize effective combinations, and deepen our understanding of CRC biology.

Presenter:

Viviam Bermúdez Paiva