Bank-Mareting-Data-Analysis

Requirements

Python 2.7
Numpy >= 1.14.2
Matplotlib >= 2.2.0
Pandas >= 0.22.0
Scikit-Learn >= 0.19.1

The data was collected as a marketing campaign to predict if a customer would make a term deposit in the bank.

The dataset considered for the project is 10% of the UCI bank Marketing dataset available online. The dataset has 4119 rows with 19 features.

The issues in the dataset were as follows: -> The features had missing values which had to be imputed. -> Preprocessing involved handling categorical data. -> The dataset was imbalanaced. Number of class 1 (yes) labels were low compared to number of class 0 (no) labels.

Preprocessing

Preprocessing work done on the data included:

Outlier removal
Label and one hot encoding
Handling missing data by mode imputation
Handling imbalanced data by oversampling using SMOTE,
Dimensionality reduction
Normalization and standardization

Models

Classsifiers used:

Support Vector Machine (SVM)
Naive Bayes
K Nearest Neighbors
Random Forest
Perceptron

Results

Performance Evaluation Metric used:

F1 score
AUC score
Training and test accuracy
Confusion matrix
ROC plots

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Code.py		Code.py
Code_pdf.pdf		Code_pdf.pdf
Project_Assignment.pdf		Project_Assignment.pdf
README.md		README.md
Report.pdf		Report.pdf
bank-additional.csv		bank-additional.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bank-Mareting-Data-Analysis

Requirements

Preprocessing

Models

Results

About

Releases

Packages

Languages

deepikakanade/Bank-Marketing-Data-Analysis

Folders and files

Latest commit

History

Repository files navigation

Bank-Mareting-Data-Analysis

Requirements

Preprocessing

Models

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages