BreastCancerPrediction

Introduction

In this project, I build a logistic regression model with scikit-learn to classify breast tumors as either malignant or benign. I use the Breast Cancer Wisconsin (Diagnostic) Data Set from Kaggle. The goal is to see how well a simple logistic regression classifier separates malignant from benign cases.

Data description

Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. Each record contains an ID number, a diagnosis (M = malignant, B = benign), and ten real-valued features computed for each cell nucleus:

radius (mean of distances from center to points on the perimeter)
texture (standard deviation of gray-scale values)
perimeter
area
smoothness (local variation in radius lengths)
compactness (perimeter^2 / area - 1.0)
concavity (severity of concave portions of the contour)
concave points (number of concave portions of the contour)
symmetry
fractal dimension ("coastline approximation" - 1)
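
To make two of the definitions above concrete, here is a minimal sketch that computes the radius and compactness features for a synthetic contour. The function names and the use of the centroid of the boundary points as the "center" are assumptions made for illustration; they are not taken from the dataset's original extraction code.

```python
import math


def mean_radius(boundary_points):
    """Mean distance from the centroid to each boundary point (assumed 'center')."""
    n = len(boundary_points)
    cx = sum(x for x, _ in boundary_points) / n
    cy = sum(y for _, y in boundary_points) / n
    return sum(math.hypot(x - cx, y - cy) for x, y in boundary_points) / n


def compactness(perimeter, area):
    """Compactness as defined in the feature list: perimeter^2 / area - 1.0."""
    return perimeter ** 2 / area - 1.0


# Synthetic contour: a circle of radius 3 sampled at 360 evenly spaced points.
circle = [
    (3 * math.cos(math.radians(d)), 3 * math.sin(math.radians(d)))
    for d in range(360)
]

circle_radius = mean_radius(circle)
circle_compactness = compactness(2 * math.pi * 3, math.pi * 3 ** 2)
square_compactness = compactness(4 * 1.0, 1.0 ** 2)  # unit square

print(circle_radius, circle_compactness, square_compactness)
```

A circle gives the smallest possible compactness (4π - 1), so larger values indicate a less regular outline.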

Implementation

Load & explore the dataset
Encode the diagnosis labels with LabelEncoder
Split the data into independent (X) and dependent (y) sets; perform feature scaling
Build logistic regression classifier
Evaluate the performance of the model
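
The steps above can be sketched roughly as follows. This is not the notebook's code: to stay self-contained it loads scikit-learn's bundled copy of the Wisconsin diagnostic data instead of data.csv, and the 75/25 split, random_state, and max_iter values are assumptions chosen for illustration.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder, StandardScaler

# 1. Load & explore the dataset (bundled copy, assumed equivalent to data.csv).
dataset = load_breast_cancer()
X = dataset.data
print("Feature matrix shape:", X.shape)

# The bundled copy stores 0 = malignant, 1 = benign; rebuild the M/B strings
# so the label-encoding step can be shown as it would apply to the CSV.
diagnosis = np.where(dataset.target == 0, "M", "B")

# 2. Encode the diagnosis labels (classes are sorted, so B -> 0 and M -> 1).
encoder = LabelEncoder()
y = encoder.fit_transform(diagnosis)

# 3. Hold out a test set, then scale using statistics from the training set only.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)

# 4. Build the logistic regression classifier.
classifier = LogisticRegression(max_iter=1000)
classifier.fit(X_train_scaled, y_train)

# 5. Evaluate the model on the held-out data.
y_pred = classifier.predict(X_test_scaled)
accuracy = accuracy_score(y_test, y_pred)
cm = confusion_matrix(y_test, y_pred)
print("Accuracy:", accuracy)
print("Confusion matrix:\n", cm)
```

Fitting the scaler on the training split alone keeps information from the test set out of the preprocessing step.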

Results

I evaluated the trained model on the held-out test data and labels. The accuracy is about 96.5%, so I concluded that a logistic regression classifier is a suitable and effective choice for this task.
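
As a reminder of what the accuracy figure measures, the sketch below derives accuracy, precision, and recall from confusion-matrix counts. The counts are hypothetical round numbers picked for illustration, not the results of this project, and the `summarize` helper is a made-up name.

```python
def summarize(tn, fp, fn, tp):
    """Derive common metrics from confusion-matrix counts (malignant = positive)."""
    total = tn + fp + fn + tp
    return {
        "accuracy": (tp + tn) / total,   # share of all cases classified correctly
        "precision": tp / (tp + fp),     # share of malignant predictions that are right
        "recall": tp / (tp + fn),        # share of malignant cases that are caught
    }


# Hypothetical counts for a 100-sample test set (NOT this project's results).
metrics = summarize(tn=60, fp=2, fn=3, tp=35)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

For a diagnostic task, recall on the malignant class is worth checking alongside accuracy, because a missed malignant case is usually the more costly error.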

This project was implemented as part of the Breast Cancer Prediction Using Machine Learning guided project on Coursera.

Course certificate
