0% found this document useful (1 vote)

662 views4 pages

Geldium Task2 Model Plan

The document outlines a predictive model plan for credit delinquency using a binary classification approach with logistic regression. It details the model logic, justification for model choice, and an evaluation strategy that includes key metrics, bias detection, and ethical considerations. The model aims to classify customers at risk of delinquency to support proactive decision-making for reducing default rates.

Uploaded by

omhire2007

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (1 vote)

662 views4 pages

Geldium Task2 Model Plan

Uploaded by

omhire2007

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Predictive Model Plan – Student

Template
1. Model Logic (Generated with GenAI)
The predictive model for credit delinquency will be a binary classification model. The goal is
to predict the `Delinquent_Account` (1 for delinquent, 0 for not delinquent) based on
various customer features.

Pseudo-code/Step-by-Step Process:

- Load Data:
Read the customer dataset containing features like Age, Income, Credit_Score,
Credit_Utilization, Missed_Payments, Loan_Balance, Debt_to_Income_Ratio,
Employment_Status, Account_Tenure, Credit_Card_Type, Location, and Month_1 to Month_6
payment statuses.

- Data Preprocessing:
- Handle missing values using median or mean for numerical features.
- Encode categorical variables:
- One-Hot Encoding for Employment_Status, Credit_Card_Type, and Location.
- Map ordinal values for Month_1 to Month_6 statuses (On-time=0, Late=1, Missed=2).

- Feature Engineering:
- Derive new features such as the total number of missed payments over 6 months and the
proportion of late payments.

- Feature Scaling:
- Use StandardScaler to normalize numerical values.

- Train-Test Split:
- Divide the dataset into 80% training and 20% testing with stratified sampling.

- Model Selection:
- Logistic Regression.

- Model Training:
- Train the logistic regression model to learn optimal coefficients.

- Prediction:
- Predict delinquency probability; classify customers as delinquent if probability > 0.5.
- Evaluation:
- Use metrics such as Precision, Recall, F1 Score, and AUC-ROC.
- Conduct fairness analysis and bias detection across customer groups.

What the model is designed to do:

This model classifies customers into two groups: those at risk of delinquency and those not
at risk. It supports early identification and helps Geldium make informed, proactive
decisions to reduce default rates.

2. Justification for Model Choice

I selected Logistic Regression as the preferred model for predicting credit delinquency due
to the following reasons:

- Accuracy: Logistic Regression is well-suited for binary classification tasks and performs
well when relationships in the data are linear or can be linearized.

- Transparency: This model allows for direct interpretation of feature importance using
coefficients. It provides clear explanations for why a customer is classified as delinquent,
which is critical for:
- Regulatory compliance,
- Business stakeholder trust, and
- Delivering actionable insights.

- Ease of Use: It requires minimal computational power, simple implementation, and less
tuning compared to complex models.

- Financial Relevance: Logistic regression is widely used in credit scoring due to its
interpretability and the ability to estimate risk probabilities.

- Fit for Geldium: For a financial institution like Geldium, where clarity, fairness, and
explainability are vital, logistic regression balances predictive capability with business
requirements. Alternatives like decision trees may suffer from overfitting, and neural
networks, while powerful, act as black boxes—limiting interpretability and raising fairness
concerns.

3. Evaluation Strategy
To ensure robust and ethical performance of the model, the following evaluation strategy
will be implemented:
Key Metrics:

- Precision: Measures the proportion of correctly predicted delinquents out of all

delinquency predictions. High precision reduces unnecessary customer interventions.

- Recall (Sensitivity): Measures the proportion of actual delinquents correctly identified.

High recall helps avoid missing high-risk customers.

- F1 Score: The harmonic mean of precision and recall. Especially useful in imbalanced
datasets where both false positives and false negatives are costly.

- AUC-ROC Curve: Assesses the model's ability to distinguish between delinquent and non-
delinquent customers across thresholds.

Bias Detection and Fairness Checks:

- Data Bias Review: Check dataset for demographic representation imbalances (e.g., age,
employment status, location).

- Disparate Impact Analysis: Evaluate whether model predictions differ across subgroups in
terms of false positives or false negatives.

- Equal Opportunity Checks: Confirm whether the model performs equally across all
demographic groups in terms of true positive rates.

Bias Mitigation Techniques (if needed):

- Pre-processing: Apply sampling or re-weighting to improve representation.

- In-training Adjustments: Use fairness-aware objectives if bias is detected.

- Post-processing: Adjust classification thresholds to equalize outcomes across sensitive

groups.

Ethical Considerations:

- Transparency: Maintain clear justifications for all predictions.

- Fairness: Avoid proxy discrimination through careful feature selection and fairness audits.

- Data Privacy: Ensure compliance with GDPR/local regulations.

- Human Oversight: Model decisions should be reviewed by analysts to avoid sole reliance
on AI.

- Customer Impact: Monitor for and minimize harm from false predictions. Establish
feedback channels.

- Ongoing Monitoring: Regularly check for data drift and model performance degradation
over time. Retrain or recalibrate when needed.

Task 2 Model Plan Example Answer
No ratings yet
Task 2 Model Plan Example Answer
1 page
Task 2 ModelPlan Template
No ratings yet
Task 2 ModelPlan Template
3 pages
EDA Report
No ratings yet
EDA Report
6 pages
EDA SummaryReport Filled
No ratings yet
EDA SummaryReport Filled
4 pages
Filled EDA Summary Report
No ratings yet
Filled EDA Summary Report
3 pages
Task3 Business Summary Report Formatted
No ratings yet
Task3 Business Summary Report Formatted
2 pages
EDA Example Answer
No ratings yet
EDA Example Answer
3 pages
AIPowered Collec
No ratings yet
AIPowered Collec
3 pages
T1 Eda
No ratings yet
T1 Eda
1 page
Updated Business Summary Report Template
No ratings yet
Updated Business Summary Report Template
3 pages
Task 4 - Model Presentation (With Talk Track)
No ratings yet
Task 4 - Model Presentation (With Talk Track)
11 pages
Sukanya Linear LogisticRegression Report
100% (1)
Sukanya Linear LogisticRegression Report
23 pages
Marketing Analytics Unit 1
No ratings yet
Marketing Analytics Unit 1
50 pages
Credit EDA Case Study
100% (3)
Credit EDA Case Study
16 pages
Capstone Project Final Report
No ratings yet
Capstone Project Final Report
37 pages
Credit EDA Case Study Insights
100% (2)
Credit EDA Case Study Insights
17 pages
Predictive Modeling Guide
No ratings yet
Predictive Modeling Guide
29 pages
Thera Bank Loan Campaign Analysis
100% (1)
Thera Bank Loan Campaign Analysis
21 pages
Long Quiz FRA - Finance and Risk Analytics - Great Learning
100% (1)
Long Quiz FRA - Finance and Risk Analytics - Great Learning
8 pages
Credit Card Churn Prediction Model
No ratings yet
Credit Card Churn Prediction Model
12 pages
Novel Covid-19 Dataset Analysis: A Project Report
100% (3)
Novel Covid-19 Dataset Analysis: A Project Report
68 pages
House Price Prediction Using Data Science
No ratings yet
House Price Prediction Using Data Science
8 pages
Predicting Credit Risk of Financial Firms in India Using AI-based ML Approaches A Study of Nifty 50 Firms
No ratings yet
Predicting Credit Risk of Financial Firms in India Using AI-based ML Approaches A Study of Nifty 50 Firms
11 pages
Banking Credit Risk Analysis With Naive Bayes Approach and Cox Proportional Hazard
No ratings yet
Banking Credit Risk Analysis With Naive Bayes Approach and Cox Proportional Hazard
6 pages
PAM All Files
No ratings yet
PAM All Files
90 pages
Question Paper-Baiscs of Fintech (S-6-301)
No ratings yet
Question Paper-Baiscs of Fintech (S-6-301)
1 page
FRA Project Report - Chilla Nagaraju
100% (1)
FRA Project Report - Chilla Nagaraju
66 pages
Credit Card Default Prediction: Final Project Report
No ratings yet
Credit Card Default Prediction: Final Project Report
28 pages
Machine Learning Guided Project
No ratings yet
Machine Learning Guided Project
23 pages
Credit Card Default Risk Analysis
100% (1)
Credit Card Default Risk Analysis
16 pages
King County House Price Prediction Model
No ratings yet
King County House Price Prediction Model
15 pages
House Price Prediction Using Machine Learning
No ratings yet
House Price Prediction Using Machine Learning
6 pages
FoodHub Data Insights for Growth
No ratings yet
FoodHub Data Insights for Growth
20 pages
Credit Risk Sas
No ratings yet
Credit Risk Sas
152 pages
Loan Prediction System Overview
No ratings yet
Loan Prediction System Overview
5 pages
Credit EDA Case Study Doc 1
100% (1)
Credit EDA Case Study Doc 1
16 pages
Creditcard Fraud Detection
No ratings yet
Creditcard Fraud Detection
26 pages
Market Segmentation Statistics Project
100% (5)
Market Segmentation Statistics Project
14 pages
M4 Data Mining W4 Business Report
No ratings yet
M4 Data Mining W4 Business Report
22 pages
Applications and Benefits of Fuzzy Logic
No ratings yet
Applications and Benefits of Fuzzy Logic
3 pages
ML1+Project+ (Coded) + +Sample+Business+Report
No ratings yet
ML1+Project+ (Coded) + +Sample+Business+Report
56 pages
House Price Prediction Models Analysis
No ratings yet
House Price Prediction Models Analysis
27 pages
Predictive Analytics Guide
No ratings yet
Predictive Analytics Guide
17 pages
Credit Risk Management at SBI Explained
No ratings yet
Credit Risk Management at SBI Explained
104 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
19 pages
Churn Prediction Analysis in Telecom
No ratings yet
Churn Prediction Analysis in Telecom
57 pages
Naive Bayes Algorithm
No ratings yet
Naive Bayes Algorithm
46 pages
GDP Forecasting Using Time Series Analysis
No ratings yet
GDP Forecasting Using Time Series Analysis
15 pages
ML-2 Guided Project Report
No ratings yet
ML-2 Guided Project Report
63 pages
India Credit Risk Model Report
No ratings yet
India Credit Risk Model Report
18 pages
Tour Insurance Claim Prediction Models
0% (1)
Tour Insurance Claim Prediction Models
16 pages
Answer Report (Preditive Modelling)
100% (1)
Answer Report (Preditive Modelling)
29 pages
Credit Rating's Impact on Card Balance
No ratings yet
Credit Rating's Impact on Card Balance
26 pages
EDA for Risk Analysis in Lending
100% (1)
EDA for Risk Analysis in Lending
19 pages
Loan Prediction
No ratings yet
Loan Prediction
37 pages
Neo Special Credit Opportunities Fund II - Nov 2025
No ratings yet
Neo Special Credit Opportunities Fund II - Nov 2025
84 pages
PM Guided Project Sample Business Report
100% (1)
PM Guided Project Sample Business Report
52 pages
Bankruptcy Prediction Model Overview
No ratings yet
Bankruptcy Prediction Model Overview
16 pages
Predictive Model Plan
No ratings yet
Predictive Model Plan
4 pages
Task 2 ModelPlan Template
No ratings yet
Task 2 ModelPlan Template
3 pages
MNM2615 Assignment01 Complete Humanized LongVersion
No ratings yet
MNM2615 Assignment01 Complete Humanized LongVersion
7 pages
Physiology Dept Pedagogy Flyer - 2025
No ratings yet
Physiology Dept Pedagogy Flyer - 2025
2 pages
John Deere 6020 Serie SCV 300
No ratings yet
John Deere 6020 Serie SCV 300
6 pages
NTPC Tandwa Township Overview
No ratings yet
NTPC Tandwa Township Overview
20 pages
UCSC012 Internet Programming: Dr.S.Sumathi, Assistant Professor - Senior Grade Sri Ramakrishna Institute of Technology
No ratings yet
UCSC012 Internet Programming: Dr.S.Sumathi, Assistant Professor - Senior Grade Sri Ramakrishna Institute of Technology
38 pages
Steps of Document Verification For Ug
No ratings yet
Steps of Document Verification For Ug
7 pages
4100 ES Manual
No ratings yet
4100 ES Manual
76 pages
Wallet Finder Balance Check - C Users AP Downloads FEBCC Check
No ratings yet
Wallet Finder Balance Check - C Users AP Downloads FEBCC Check
2 pages
Computer Network and Information Security
No ratings yet
Computer Network and Information Security
33 pages
2013 Camry ATF Exchange Guide
No ratings yet
2013 Camry ATF Exchange Guide
18 pages
Java Map Interface and HashMap Overview
No ratings yet
Java Map Interface and HashMap Overview
34 pages
Box 16 and 17 Financial Assistance To Individual in Crisis
No ratings yet
Box 16 and 17 Financial Assistance To Individual in Crisis
6 pages
D5N Track-Type Tractor Specs Overview
No ratings yet
D5N Track-Type Tractor Specs Overview
2 pages
Linux Networking Commands
100% (1)
Linux Networking Commands
11 pages
Timepix4: Specs, Features & Plans
No ratings yet
Timepix4: Specs, Features & Plans
16 pages
Integrating Factors in Linear ODEs
No ratings yet
Integrating Factors in Linear ODEs
10 pages
02 - Decision Constructs Loops
No ratings yet
02 - Decision Constructs Loops
45 pages
Avendus Spark Research 08 July 2025 - Data Center Ecosystem
100% (1)
Avendus Spark Research 08 July 2025 - Data Center Ecosystem
66 pages
Arjes Broschuere Impaktor 250-En
No ratings yet
Arjes Broschuere Impaktor 250-En
12 pages
VTSM
No ratings yet
VTSM
4 pages
Flexible OLED Displays: Seminar On
100% (1)
Flexible OLED Displays: Seminar On
20 pages
WH Dde
No ratings yet
WH Dde
21 pages
Equity 2023-2024
No ratings yet
Equity 2023-2024
2 pages
2020EHE BrandGuidelines
No ratings yet
2020EHE BrandGuidelines
38 pages
Trendline With Signals (ATP?) PineScript Code v.5
No ratings yet
Trendline With Signals (ATP?) PineScript Code v.5
6 pages
Accessing GeoWebFace FTP Data
No ratings yet
Accessing GeoWebFace FTP Data
1 page
Layout Kubikel Sirimau Pix Schneider - Po110-201 - 01
No ratings yet
Layout Kubikel Sirimau Pix Schneider - Po110-201 - 01
11 pages
Application ID 8394A00221 Do Id 8394A002211: Customer Name Nischay Nischay
No ratings yet
Application ID 8394A00221 Do Id 8394A002211: Customer Name Nischay Nischay
2 pages
Hindustan Times 27-11-2025
No ratings yet
Hindustan Times 27-11-2025
28 pages
01.0 Flyer - DLF
No ratings yet
01.0 Flyer - DLF
2 pages

Geldium Task2 Model Plan

Uploaded by

Geldium Task2 Model Plan

Uploaded by

Predictive Model Plan – Student

What the model is designed to do:

2. Justification for Model Choice

- Precision: Measures the proportion of correctly predicted delinquents out of all

- Recall (Sensitivity): Measures the proportion of actual delinquents correctly identified.

Bias Detection and Fairness Checks:

Bias Mitigation Techniques (if needed):

- Pre-processing: Apply sampling or re-weighting to improve representation.

- In-training Adjustments: Use fairness-aware objectives if bias is detected.

- Post-processing: Adjust classification thresholds to equalize outcomes across sensitive

- Transparency: Maintain clear justifications for all predictions.

- Data Privacy: Ensure compliance with GDPR/local regulations.

You might also like