Natural language processing in cancer treatment identification based on medical reports

Ismaeel, Zaid Ali; A. Alsumaiday, Mustafa Rabee

doi:10.30772/qjes.2026.168230.1856

Natural language processing in cancer treatment identification based on medical reports

Articles in Press

Document Type : Research Paper

Authors

Zaid Ali Ismaeel

Mustafa Rabee A. Alsumaiday

Department of Vocational Education-Nineveh, Ministry of Education, Mosul, Iraq.

https://doi.org/10.30772/qjes.2026.168230.1856

Abstract

Cancer is still a major health concern, particularly in areas like Iraq with inadequate healthcare systems, where survival rates depend on early and precise diagnosis. Using clinical text data from radiology reports in Mosul, Iraq, this study examines the use of Natural Language Processing (NLP) and Machine Learning (ML) models for cancer diagnosis and classification. In order to categories cancer cases into benign, malignant, stable, progress, and improvement groups, three machine learning classifiers—Support Vector Machine (SVM), XGBoost, and LightGBM—were trained using TF-IDF features on a balanced dataset of 12,923 labelled radiological reports. XGBoost outperformed the other models and showed the highest accuracy (97.25%). This study examines the useful implications for improving diagnostic efficiency and demonstrates the efficacy of NLP-driven machine learning models in healthcare settings with limited resources. The results imply that these ML-NLP models can increase accuracy, decrease the need for manual diagnostic procedures, and possibly offer a scalable solution for healthcare systems with limited funding.

Keywords

Cancer Diagnosis

Machine Learning

Natural Language Processing

Clinical Text Classification

XGBoost

LightGBM

Iraqi Healthcare

Radiology Reports

Subjects

Computer Engineering

Al-Qadisiyah Journal for Engineering Sciences

Articles in Press, Accepted Manuscript
Available Online from 01 June 2026

XML

Receive Date 16 January 2026
Revise Date 31 May 2026
Accept Date 01 June 2026

Article View

120

Advanced Search

Al-Qadisiyah Journal for Engineering Sciences

Natural language processing in cancer treatment identification based on medical reports

Articles in Press, Accepted Manuscript
Available Online from 01 June 2026

Submit Manuscript

Guide for Authors

Article Processing Charges (APC)

Reviewers

Call for Reviewers

Contact Us

Al-Qadisiyah Journal for Engineering Sciences

Natural language processing in cancer treatment identification based on medical reports

Articles in Press, Accepted Manuscript Available Online from 01 June 2026

Files

History

Share

How to cite

Statistics

Submit Manuscript

Browse

Journal Info

Guide for Authors

Article Processing Charges (APC)

Reviewers

Call for Reviewers

Contact Us

Articles in Press, Accepted Manuscript
Available Online from 01 June 2026