AliBERT achieves it first successful application in oncology!

2023, 18 October

| 2 min read|

quinteninfra

Thumbnail for the article entitled “NLP: Quinten achieves first successful application of AliBERT in oncology!”

A successful proof-of-concept for AliBERT, the first French-language model specialized in the biomedical field. In partnership with a major French Cancer Fighting Institute, Quinten’s datalab team has developed a first concrete use case : the extraction of concepts and structured information from medical reports in oncology. A Natural Language Processing [NLP] task which, until now, has been highly complex, given the technical nature, diversity and specialization of these medical reports.

In medical research, structured data is crucial for reconstructing a patient’s history. However, some of these data are poorly or incompletely captured in hospital information systems. AliBERT, a pre-trained language model, enables structured data to be generated directly from medical reports, speeding up the costly, error-prone and time-consuming task of building databases based on chart reviews.

Using nearly 700 cancer patient files annotated by experts in the field, Quinten’s teams have trained AliBERT to recognize around ten key concepts in the follow-up of breast and lung cancer patients. In particular, it is now possible to efficiently detect – with performances (Accuracy) ranging from 80 to 95% – these concepts in several types of oncology reports (e.g., consultation, anatomopathology reports, Réunion de Concertation Pluridisciplinaire (RCP)).

Until now, traditional methods such as regular expression search or non-specialized neural networks were unable to process and exploit these complex and technical documents. The AliBERT tool, specialized in biomedical language, now automatically extracts all this information from a large number of heterogeneous documents.

A scientific publication is currently being written, reporting state of the art and progress achieved through this project. In the future, AliBERT will make it possible to reconstruct a patient’s care pathway, based on medical report databases set up in health data warehouses. An instrumental solution to accelerate research in the fight against cancer.

latest articles

Quinten Health joins SME Climate Hub Comunity - 2024

News

2024, 24 September

Proud to Join the SME Climate Hub: Committing to A...

We’re thrilled to join the SME Climate Hub community, committing to halve our emissions by 2030. Together, we're taking authentic action towards ...

1 min readquinteninfra

#Climate #SME Climate Hub

Find out more

News

2024, 28 March

Gender equality index 2023

The French law on the freedom to choose one’s professional future of September 5, 2018 makes equal pay for men and women an obligation of result...

1 min readquinteninfra

#HR #Index

Find out more

Thumbnail of the article about AliBERT available in open source

News

2024, 21 February

The Open-Source Revolution of AliBERT in French Bi...

Developed by Quinten in 2022, AliBERT is a specialized model focusing on the French biomedical language. One version has been released as an open-s...

2 min readquinteninfra

#AliBERT #Biomedical #NLP

Find out more

See all insights