Machine Learning Project 3 - Credit Card Fraud Detection
Aim: The goal of this project is to automatically identify fraudulent credit card transactions using Machine Learning. This is a binary classification problem. My approach is explained below: Workflow: 1. Check the distribution of the classes in the response variable - whether it is an imbalanced or a balanced dataset. 2. Create a baseline model (LogisticRegression), and check recall value for fraudulent transaction class : a. if low recall value, then solve for the imbalanced data. i. Under-sampling ii. Over-sampling methods. b. if high recall value, move forward. 3. Model Selection: train with other models and select the best one. 4. Feature Selection: apply feature selection techniques to select the best features. 5. Final model Model Evaluation methods: 1. Recall Values 2. Precison Values 3. Area under curve 1. Class Distribution: This...