Application of Rough Sets to Predict the Breast Cancer Risk Association with Routine Blood Analyses
Amr H. Abdel Haliem1, Mohammed A. Atiea2, Mohammed E. Wahed3, Mohammed S. Metwally4

1Amr H. AbdelHaliem *, Faculty of Science, Department of, Computer Science, Suez, Egypt.
2Mohammed A. Atiea, Faculty of Computers and Informatics, Suez University, Suez, Egypt.
3Mohammed E. Wahed, Faculty of Computers and Informatics, Suez University Ismailia, Egypt.
4Mohammed S.Metwally, Faculty of Science, Department of Mathematics, Suez University, Suez, Egypt.

Manuscript received on November 01, 2020. | Revised Manuscript received on n November 14, 2020. | Manuscript published on January 10, 2021. | PP: 67-72 | Volume-10 Issue-3, January 2021 | Retrieval Number: 100.1/ijitee.B82351210220| DOI: 10.35940/ijitee.B8235.0110321
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (

Abstract: For women around the globe, breast cancer has been a significant cause of mortality. Around the same time, early diagnosis and high cancer prediction precision are critical to improving the quality of care and the recovery rate of the patient. Expert systems and machine learning techniques are gaining prominence in this area as a result of efficient classification and high diagnostic ability. This paper introduces a model of hybrid prediction (RS QA) based on a rough set theoryand a quasi-optimal (AQ) rule induction algorithm. To find a minimal set of attributes that completely define the results, a rough set tool is used. The selected characteristics were collected, ensuring the high standard of the classification. Then to produce the decision rules, we use the quasi-optimal (AQ) rule induction algorithm. These hybrid prediction models allow expert systems to be built based on the conceptual rules of the IF THEN sort. The suggested experiment is performed using the Coimbra Breast Cancer Dataset (BCCD) based on sets of measures that can be obtained in routine blood tests. Using classification precision, sensitivity, specificity, and receiver operating characteristics (ROC) curves, the efficiency of our suggested approach was assessed. Experimental results indicate the highest classification accuracy (91.7 percent), sensitivity (83.3 percent), and precision (94.3) obtained by the proposed (RS_QA) model. 
Keywords: Rough set, Breast cancer, Prediction system, AQ algorithm, Rule induction.