Uploaded by Pulkit Dubey
Credit Card Fraud Detection

For this project, I worked on building a machine learning model to detect fraudulent credit card transactions. The dataset had 1,000 entries, and the target column is_fraud indicated whether a transaction was fraudulent or legitimate.

Data Preprocessing
The dataset included both numerical and categorical columns. I used one-hot encoding for
categorical features like gender, transaction category, and state, while I applied frequency
encoding for city names to avoid a high number of dummy variables. I also removed columns like
latitude, longitude, and credit card number, which I felt were either too specific or irrelevant for
the model. After encoding, I checked for imbalance and used SMOTE to oversample the minority
(fraud) class to help models learn better.
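The encoding steps above can be sketched in pandas. The column names and toy values here are illustrative stand-ins for the real transaction data, not the actual schema:

```python
import pandas as pd

# Toy frame standing in for the transaction data (column names are illustrative)
df = pd.DataFrame({
    "gender": ["M", "F", "F", "M"],
    "category": ["grocery", "travel", "grocery", "gas"],
    "city": ["Delhi", "Mumbai", "Delhi", "Pune"],
    "is_fraud": [0, 1, 0, 0],
})

# One-hot encode the low-cardinality categoricals
df = pd.get_dummies(df, columns=["gender", "category"])

# Frequency-encode the high-cardinality city column instead of dummying it:
# each city is replaced by its share of all transactions
df["city_freq"] = df["city"].map(df["city"].value_counts(normalize=True))
df = df.drop(columns=["city"])

# Class imbalance could then be addressed with imbalanced-learn's SMOTE, e.g.:
#   from imblearn.over_sampling import SMOTE
#   X_res, y_res = SMOTE().fit_resample(X, y)
```

Frequency encoding keeps one numeric column per high-cardinality feature, which is why it is preferable to one-hot encoding for something like city names.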

Model Selection and Evaluation


I tried three models: Logistic Regression, Random Forest, and XGBoost. I chose these because
they’re commonly used for classification tasks and work well on tabular data. For evaluation, I
used accuracy, precision, recall, and F1-score — since fraud detection is an imbalanced
classification problem, precision and recall matter more than just accuracy.
Here’s what I found:
- Random Forest gave the best result overall with an F1-score of ~0.28.
- XGBoost followed closely.
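The point about accuracy being misleading on imbalanced data can be shown with a small worked example, computed by hand with no libraries. The label counts here are made up for illustration:

```python
# Precision, recall and F1 from raw predictions (toy imbalanced data)
y_true = [0] * 90 + [1] * 10                  # 10% fraud
y_pred = [0] * 88 + [1] * 2 + [1] * 3 + [0] * 7  # 2 false alarms, 3 caught, 7 missed

tp = sum(p == 1 and t == 1 for p, t in zip(y_pred, y_true))  # true positives
fp = sum(p == 1 and t == 0 for p, t in zip(y_pred, y_true))  # false positives
fn = sum(p == 0 and t == 1 for p, t in zip(y_pred, y_true))  # false negatives

precision = tp / (tp + fp)                    # 3 / 5  = 0.6
recall = tp / (tp + fn)                       # 3 / 10 = 0.3
f1 = 2 * precision * recall / (precision + recall)

accuracy = sum(p == t for p, t in zip(y_pred, y_true)) / len(y_true)
```

Here accuracy comes out at 0.91 even though the model misses 70% of the fraud cases, which is exactly why F1 is the headline metric for this problem.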

Visualizations and Insights


I created several graphs during EDA. One important plot was the correlation heatmap, which shows how features are correlated with one another. Interestingly, most features weren’t strongly correlated, suggesting the model has to learn from patterns across multiple weak signals. I also visualized transaction trends by day of the week and the fraud distribution, which helped guide some of the preprocessing steps.
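The heatmap itself is just a rendering of the pairwise correlation matrix. A minimal sketch, using a toy numeric frame in place of the real encoded data:

```python
import pandas as pd

# Toy numeric frame; in the project this would be the encoded transaction data
df = pd.DataFrame({
    "amount": [10.0, 200.0, 15.0, 180.0],
    "hour": [9, 2, 11, 3],
    "is_fraud": [0, 1, 0, 1],
})

# Pairwise Pearson correlations between all numeric columns
corr = df.corr()

# seaborn renders this matrix as a heatmap, e.g.:
#   import seaborn as sns
#   sns.heatmap(corr, annot=True, cmap="coolwarm")
```

Scanning the is_fraud row of this matrix is a quick way to see which individual features carry a (weak) signal on their own.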

Challenges and Learnings


One of the biggest challenges was handling the imbalanced dataset: many models performed poorly on fraud cases despite decent accuracy. I also had to clean and encode the data carefully to avoid errors such as unencoded string columns crashing the models.
In just a week, I learned a lot. Even though the metrics weren’t perfect, this was a valuable starting point, and I’m excited to keep improving it.
