Data exploration and visualization.

Preprocessing the training data: feature engineering, outlier removal, handling missing values and categorical features, data scaling, and dimensionality reduction.
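A minimal sketch of how such preprocessing steps might be chained with Scikit-Learn. The toy columns (`num_a`, `num_b`, `cat`), the 1.5 × IQR outlier rule, median imputation, and the choice of 3 PCA components are all assumptions made for illustration; they are not taken from the notebook or from `train.csv`. Feature engineering is left out because it depends on the actual columns.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.decomposition import PCA
from sklearn.impute import SimpleImputer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Hypothetical toy data; the real train.csv columns are not known here.
rng = np.random.default_rng(0)
df = pd.DataFrame({
    "num_a": rng.normal(size=200),
    "num_b": rng.normal(size=200),
    "cat": rng.choice(["x", "y", "z"], size=200),
})
df.loc[::25, "num_a"] = np.nan  # inject missing values
df.loc[3, "num_b"] = 50.0       # inject an obvious outlier

# Outlier removal (assumed rule): keep rows within 1.5 * IQR of num_b.
q1, q3 = df["num_b"].quantile([0.25, 0.75])
iqr = q3 - q1
clean = df[df["num_b"].between(q1 - 1.5 * iqr, q3 + 1.5 * iqr)]

# Numeric columns: impute missing values with the median, then standardize.
numeric = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
])

# Categorical column: one-hot encode; sparse_threshold=0.0 forces dense output for PCA.
preprocess = ColumnTransformer(
    [
        ("num", numeric, ["num_a", "num_b"]),
        ("cat", OneHotEncoder(handle_unknown="ignore"), ["cat"]),
    ],
    sparse_threshold=0.0,
)

# Dimensionality reduction on the 5 preprocessed features (2 numeric + 3 one-hot).
pipeline = Pipeline([("preprocess", preprocess), ("pca", PCA(n_components=3))])
X = pipeline.fit_transform(clean)
```

Wrapping the steps in a single `Pipeline` means the same fitted transformations can later be applied to the test data with `pipeline.transform`, which avoids leaking test statistics into the imputation and scaling steps.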

Training 4 machine learning algorithms on the training data, using the validation set to choose the hyper-parameters that prevent overfitting, and evaluating the models with K-fold cross-validation and ROC curves.
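The selection and evaluation loop could look roughly like the sketch below. It uses synthetic data and a single decision tree with `max_depth` as the tuned hyper-parameter; both are assumptions chosen for illustration, since the four algorithms and their hyper-parameters are not named here.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import roc_auc_score, roc_curve
from sklearn.model_selection import StratifiedKFold, cross_val_score, train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the preprocessed training data (assumption).
X, y = make_classification(
    n_samples=600, n_features=10, n_informative=5, random_state=0
)
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Choose the hyper-parameter by AUC on the held-out validation set.
# Shallow trees underfit and unrestricted trees tend to overfit.
val_auc = {}
for depth in [1, 2, 4, 8, None]:
    model = DecisionTreeClassifier(max_depth=depth, random_state=0)
    model.fit(X_train, y_train)
    val_auc[depth] = roc_auc_score(y_val, model.predict_proba(X_val)[:, 1])
best_depth = max(val_auc, key=val_auc.get)

# Evaluate the chosen configuration with 5-fold cross-validation.
best_model = DecisionTreeClassifier(max_depth=best_depth, random_state=0)
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
cv_auc = cross_val_score(best_model, X_train, y_train, cv=cv, scoring="roc_auc")

# ROC curve points on the validation set (ready to plot as tpr against fpr).
best_model.fit(X_train, y_train)
fpr, tpr, thresholds = roc_curve(y_val, best_model.predict_proba(X_val)[:, 1])
```

Repeating the same loop for each candidate algorithm and comparing the mean of `cv_auc` gives one way to pick the model with the best results.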

Using the model with the best results to predict the test data.
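As a rough illustration of this last step, the sketch below refits a model on all labeled rows and scores an unlabeled test frame. The column names (`f1`, `f2`, `f3`, `target`), the synthetic data, and the use of logistic regression are hypothetical; the real files are `train.csv` and `test_without_target.csv`, whose schema is not shown here.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression

# Hypothetical stand-ins for train.csv and test_without_target.csv.
rng = np.random.default_rng(0)
train = pd.DataFrame(rng.normal(size=(100, 3)), columns=["f1", "f2", "f3"])
train["target"] = (train["f1"] + train["f2"] > 0).astype(int)
test = pd.DataFrame(rng.normal(size=(20, 3)), columns=["f1", "f2", "f3"])

# Fit on every labeled row, then predict the unlabeled test rows.
features = [c for c in train.columns if c != "target"]
model = LogisticRegression().fit(train[features], train["target"])
test_pred = pd.DataFrame(
    {"pred_proba": model.predict_proba(test[features])[:, 1]}
)
```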

Project done in Python, in a Jupyter notebook, with Scikit-Learn.

Files (17 commits):
Machine Learning Project Report.pdf
Machine Learning Project.ipynb
README.md
test_without_target.csv
train.csv

EladGashri/Machine_Learning_Project

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages