DATA SCIENCE WORKSHOP: Cervical Cancer Classification and Prediction Using Machine Learning and Deep Learning with Python GUI

By Vivian Siahaan and Rismon Hasiholan Sianipar

About 11,000 new cases of invasive cervical cancer are diagnosed each year in the U.S., although the number of new cases has declined steadily over the past decades. Cervical cancer is the most preventable type of cancer, yet each year it kills about 4,000 women in the U.S. and about 300,000 women worldwide. Numerous studies report that high poverty levels are linked with low screening rates. In addition, lack of health insurance, limited transportation, and language difficulties hinder poor women's access to screening services. Human papillomavirus (HPV) is the main risk factor for cervical cancer. In adults, the most important risk factor for HPV is sexual activity with an infected person. Women most at risk for cervical cancer are those with a history of multiple sexual partners, sexual intercourse at age 17 or younger, or both. A woman who has never been sexually active has a very low risk of developing cervical cancer. Sexual activity with multiple partners also increases the likelihood of many other sexually transmitted infections (chlamydia, gonorrhea, syphilis). Studies have found an association between chlamydia and cervical cancer risk, including the possibility that chlamydia may prolong HPV infection.
Because the disease is so preventable, early detection of cervical cancer using machine learning and deep learning models can be of great help. The dataset used in this project was obtained from the UCI Machine Learning Repository, which is gratefully acknowledged. It contains a list of risk factors for cervical cancer leading to a biopsy examination. The models used in this project are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, AdaBoost, LGBM classifier, Gradient Boosting, XGB classifier, MLP classifier, and 1D CNN.
Finally, you will develop a GUI using PyQt5 to plot decision boundaries, ROC curves, feature distributions, feature importance, cross-validation scores, predicted versus true values, the confusion matrix, the learning curve, model performance, model scalability, training loss, and training accuracy.

Details

Publication date
Mar 31, 2023
Language
English
Category
Computers & Technology
Copyright
All rights reserved - Standard copyright license
Contributors
By (author): Vivian Siahaan, By (author): Rismon Hasiholan Sianipar

Specifications

Format
PDF
