Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2022, IRJET
Due to the advancements in the domain of Artificial Intelligence and Data Science, its utilization is becoming more common in every possible domain. Nowadays, the majority of the industries make use of AI and its applications in some or the other way. Taking the advantage of the field of Data Science results in creating effective and modern applications, products irrespective of the domain. One of the industries where the application of AI and Data Science is proving to be effective is the Finance Industry commonly known as the Banking Sector. Banks face severe losses due to the loan defaults made by the client and hence to overcome this problem, there lies a need to create a credit risk scoring model which can analyze and predict the loan defaults. Hence, with the help of Machine Learning, we aim to create a Loan Default Analysis model which could predict the loan defaults and integrate the model into a web application for the user for easy usability.
The International Journal of Science & Technoledge
Business firms and households sometimes seek for extra-funding to fulfill certain needs. The demand which arises from the need of extra funds is fulfilled by the credit market. Banks and others financial lending institutions are the key players in this market (Gaigaliene and Cesnys, 2018). Loan is one of the most important products of most financial institutions. All financial lenders try to find effective business strategies for persuading customers to apply for loans. However, there are some borrowers who default in loan payments (Begum and Deniz, 2019). During a loan term, default may occur when the borrower fails to make required payments. Therefore, an assessment of a borrower's default risk over time is essential to enable timely risk management. Credit officers determine whether borrowers can fulfill their requirements using manually analysis of borrower's credit history. In the last decade, this trend has changed over time with technological advancement (Rehman, 2017). In recent years, financial lending institutions are using automated loan default models as credit risk scoring tools when granting loans to potential borrowers (Bao et al., 2019). Machine Learning (ML) algorithms have been applied to assess the credit risk of borrowers in financial lending institutions (Djeundj and Crook, 2018). Reliable models for credit risks play an important role in loss control and revenue maximization (Luo and Nie, 2016). Earlier research treated loan default prediction as a binary classification problem, where a loan is classified as either creditworthy or non-creditworthy (Rosenberg and Gleit, 1994). Linear Discriminant Analysis (LDA) and logistic regression (LR) are two most popular tools for constructing credit scoring models (Wiginton, 1980). Subsequently, other classification algorithm such as, Artificial neural networks (ANN) Gulsoy and Kulluk (2019) support vector machines (SVM) Alaka et al. (2018), decision trees (DT) Liu et al. (2015), and Bayesian classifier (BC) Carta et al. (2020), have been used to estimate borrowers' probability of default. Recently, time-to-default modeling has attracted increasing research interest (Dirick et al., 2017). Time-to-default data fall into the category of lifetime data in general, which is commonly analyzed by survival analysis (SA) (Malekipirbazari and Aksakalli, 2015). In loan prediction, two types of errors inevitably lead to inefficiency in prediction
IRJET, 2023
Loan business is one of the major income sources for bank. Loan default problem is a major issue for loan business. Loans, specifically whether borrowers repay the loan or default on it, have a significant impact on a bank's profitability. By anticipating loan defaulters, the bank is able to reduce its non-performing assets. Three primary predictive analytics techniques-I Data Collection, II Data Cleaning, and III Performance Assessment-are used to research the prediction of loan defaulters. Experimental investigations reveal that when it comes to loan forecasting, the KNN model performs better than the Decision tree model.
Research Square (Research Square), 2023
Machine Learning is an AI technique, empowers organizations globally to gain insights from their data and achieve success. This study focuses on improving risk management of banking in Ethiopia by employing machine learning techniques to detect and predict loan defaulters, ultimately mitigating bad loans. The research aims to distinguish borrowers who repay loans promptly from those who don't, forecast potential defaults, and assess the creditworthiness of prospective customers without overhauling existing systems and data. To showcase the study's results, classi cation models are constructed to reveal signi cant patterns among customer attributes. The primary objective is to apply cutting-edge machine learning algorithms to classify bank loan defaulters. Several machine learning algorithms, including Random Forest, Decision Tree, Gradient Boosting (GB), XGBoost, and Multi-Layer Perceptron (MLP), are employed in the process. Various data split ratios, such as 70/30, 60/40, and 90/10, are explored, alongside the 80/20 data split, to build classi cation models for each classi er algorithm. Based on the experiment's outcomes, the 70/30 split is selected for its superior accuracy. The study evaluates the performance of these algorithms using metrics like accuracy, precision, recall, and F1score. The results indicate that the Gradient Boosting classi cation technique surpasses other algorithms, achieving a training accuracy of 98.7% and a testing accuracy of 97.8%.
2018
Every lender’s organization such as banks and credit card companies use credit score system to determining the creditworthiness of their clients. Currently, they are using numerical scoring system in where the score determined by the compering new customer vs. existing customer profile. This does not capture the exact behavior of certain individual entities or more optimal ways to segment scoring models for which few loan trends to classify in a result organization are deprive of profit and lead to the loss. Now it analyzed that the problem can be optimized using Machine Learning technique and possible to forecast the behavior of the customer. In this study, we applied various machine learning technique to predict the classified loans, minimize credit risk and maximize the profit of the lender’s organization. Hence, this study intended to find the best modeling with best performance and accuracy by the comparing their results. iv ©Daffodil International University
International journal of innovative technology and exploring engineering, 2019
Extending credits to corporates and individuals for the smooth functioning of growing economies like India is inevitable. As increasing number of customers apply for loans in the banks and non-banking financial companies (NBFC), it is really challenging for banks and NBFCs with limited capital to device a standard resolution and safe procedure to lend money to its borrowers for their financial needs. In addition, in recent times NBFC inventories have suffered a significant downfall in terms of the stock price. It has contributed to a contagion that has also spread to other financial stocks, adversely affecting the benchmark in recent times. In this paper, an attempt is made to condense the risk involved in selecting the suitable person who could repay the loan on time thereby keeping the bank's non-performing assets (NPA) on the hold. This is achieved by feeding the past records of the customer who acquired loans from the bank into a trained machine learning model which could yield an accurate result. The prime focus of the paper is to determine whether or not it will be safe to allocate the loan to a particular person. This paper has the following sections (i) Collection of Data, (ii) Data Cleaning and (iii) Performance Evaluation. Experimental tests found that the Naïve Bayes model has better performance than other models in terms of loan forecasting.
International Journal For Multidisciplinary Research
Loans are a major prerequisite in the present-day world. By this, as it were, Banks get a major portion of the entire benefit. It is advantageous for understudies to oversee their instruction and living costs, and for individuals to purchase any kind of extravagance like houses, cars, etc. But when it comes to choosing whether the applicant's profile is pertinent to be allowed a credit or not. Banks must see many aspects. Giving credit is the most commerce of banks. A noteworthy parcel of the bank's income comes straightforwardly from benefits from credits. Indeed, on the off chance that the bank favors the advance after the confirmation and certification relapses prepare, it cannot be beyond any doubt that the chosen candidate is the genuine candidate. Doing this handle physically requires unused time. Able to anticipate whether. a specific candidate is secure, and the whole confirmation handle is automatized with the machine. fashion. After the presentation of innovation t...
Journal of Autonomous Intelligence
In order to evaluate a person’s or a company’s creditworthiness, financial institutions must use credit scoring. Traditional credit scoring algorithms frequently rely on manual and rule-based methods, which can be tedious and inaccurate. Recent developments in artificial intelligence (AI) technology have opened up possibilities for creating more reliable and effective credit rating systems. The data are pre-processed, including scaling using the 0–1 normalization method and resolving missing values by imputation. Information gain (IG), gain ratio (GR), and chi-square are three feature selection methodologies covered in the study. While GR normalizes IG by dividing it by the total entropy of the feature, IG quantifies the reduction in total entropy by adding a new feature. Based on chi-squared statistics, the most vital traits are determined using chi-square. This research employs different ML models to develop a hybrid model for credit score prediction. The ML algorithms support vec...
Most banks or lending institutions are plagued by defaulting loans growing over the years. Credit risk analysis is the principal and most crucial step in granting of loans as money or assets. This is done to prevent of mitigate the damage caused when a loan defaults. With the advent of new technology associated with Machine Learning these institutions are upgrading their business models. In this work, we build multiple machine learning models that increase the efficiency and sensitivity of credit risk analysis using descriptive and predictive analytics. These models are applied on a set of test data and the resulting features re implemented on a separate set of trainer data to gain insight about the models. R language has been used to carry out the operations. We observe and find the best model which delivers the most balance combination of accuracy, sensitivity and efficiency among all the models.
2021
In our banking system, banks have many products to sell but main source of income of any banks is on its credit line. So, they can earn from interest of those loans which they credit. A bank’s profit or a loss depends to a large extent on loans i.e., whether the customers are paying back the loan or defaulting. By predicting the loan defaulters, the bank can reduce its Non-Performing Assets. This makes the study of this phenomenon very important. Previous research in this era has shown that there are so many methods to study the problem of controlling loan default. But as the right predictions are very important for the maximization of profits, it is essential to study the nature of the different methods and their comparison. A very important approach in predictive analytic is used to study the problem of predicting loan defaulters: The Logistic regression model. The data is collected from the Kaggle for studying and prediction. Logistic Regression models have been performed and the...
Applied Artificial Intelligence, 2018
Credit scoring and monitoring are the two important dimensions of the decision-making process for the loan institutions. In the first part of this study, we investigate the role of machine learning for applicant reassessment and propose a complementary screening step to an existing scoring system. We use a real data set from one of the prominent loan companies in Turkey. The information provided by the applicants form the variables in our analysis. The company's experts have already labeled the clients as bad and good according to their ongoing payments. Using this labeled data set, we execute several methods to classify the bad applicants as well as the significant variables in this classification. As the data set consists of applicants who have passed the initial scoring system, most of the clients are marked as good. To deal with this imbalanced nature of the problem, we employ a set of different approaches to improve the performance of predicting the applicants who are likely to default. In the second part of this study, we aim to predict the payment behavior of clients based on their static (demographic and financial) and dynamic (payment) information. Furthermore, we analyze the effect of the length of the payment history and the staying power of the proposed prediction models.
Banber Hayastani petakan tntesagitakan hamalsarani, 2022
There are many problems in each credit institution. The most important of them is the risk of possible losses in lending. Within the framework of the topic, the studies conducted by other researchers were investigated, from which it was concluded that machine learning tools are often used to optimally solve the above-mentioned problem. Real data on credits were used as a basis for modeling in the work. In this work, based on the available data, several machine learning models were developed, from which the best one was selected, which can contribute to the improvement of the credit risk management process. During the work, the logical connections between data and their interaction with each other were revealed. Then, based on the work done, the appropriate models were built, the quality of which was checked using various tools. The obtained models were compared and the best one was selected. The obtained results are practically applicable and show that each bank and credit organization can develop a better solution based on the large databases they have, which will contribute to curbing credit risk and reducing costs.
PERSPEKTIF
Financial organizations such as banks have experienced an increase in demand for loans from borrowers over the years. These organizations are highly interested in knowing whether a borrower can pay back if granted the loan requested. Granting loans to defaulters can cripple the business, hence, these financial organizations are compelled to evaluate credit worthiness of clients using the vast volume of historical data related to financial position of borrowers. Like other prediction models, credit scoring is a technique used in predicting the probability that a loan applicant, existing borrower, or counterparty will default. Machine learning technique has ability to solve these challenges faced by credit analyst by automating the processing and extraction of knowledge from data. This research focuses on the development of a credit scoring model using Rough Set Theory (RST) and Multi-Layer Perceptron (MLP) Neural Network. RST was used for feature selection while ANN trained with back...
Accurately identifying credit risk is essential for the successful operation and growth of any business. By accurately predicting an applicant's loan status, businesses can better understand the drivers of credit risk and develop informed market strategies to promote business expansion. The goal of this research is to develop a machine learning model that can reliably predict the loan status (approved or rejected) of applicants to the Commercial Bank of Ethiopia (CBE). The study utilized a dataset of 32,285 applicants with 10 attributes. To evaluate the performance of different classifiers, the overall accuracy was used as the primary metric. Supervised machine learning techniques including Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), Support Vector Machine (SVM), and Deep Neural Network (DNN) were applied to the applicant loan status prediction task, based on their widespread use in prior literature. Feature importance and correlation matrix analysis were used for feature selection. Additionally, the SMOTE technique was employed to balance the dataset. The results show that the RF classifier achieved the best overall performance, with an accuracy of 93.62%, a recall of 94.72%,
BIG DATA MINING AND ANALYTICS, 2023
Every real-world scenario is now digitally replicated in order to reduce paperwork and human labor costs. Machine Learning (ML) models are also being used to make predictions in these applications. Accurate forecasting requires knowledge of these machine learning models and their distinguishing features. The datasets we use as input for each of these different types of ML models, yielding different results. The choice of an ML model for a dataset is critical. A loan risk model is used to show how ML models for a dataset can be linked together. The purpose of this study is to look into how we could use machine learning to quantify or forecast mortgage credit risk. This phrase refers to the process of evaluating massive amounts of data in order to derive useful information for making decisions in a variety of fields. If credit risk is considered, a method based on an examination of what caused and how mortgage credit risk affected credit defaults during the still-current economic crisis of 2021 will be tried. Various approaches to credit risk calculation will be examined, ranging from the most basic to the most complex. In addition, we will conduct a case study on a sample of mortgage loans and compare the results of three different analytical approaches, logistic regression, decision tree, and gradient boost to see which one produced the most commercially useful insights.
2020
Prior PCs was simply sorted as a need of an individual yet now it turns into a need of a person. AI fills in as a significant part in field of PC. Machine can't thoroughly consider various circumstances however it can draw diverse kind of connections between various highlights and qualities. The significant piece of our life is to stay away from false exercises yet till now we can't authority over it. Credit business is one of the significant organizations of business banks. Deceitful exercises can be handle through installing AI calculations in our everyday life. In this venture we will utilize directed AI and for that we need to give named information to the AI calculation. This paper centers around anticipating SME client status for time of a half year by using application scoring extra to client conduct highlights. By using Neural Networks, Support Vector Machines and Inclination Boosting, execution examination and furthermore highlight investigation for client conduct are directed.
It can be easily observed that the general public is putting in more and more loan requests in the banking system recently, which can be regarded as a positive development for the banks, while at the same time presenting a considerable risk. Accurate risk management in the banking and finance sector is related to efficient and optimized use of the current resources, assessment of possible risks and taking timely precautions. It is of utmost importance for the banks to predict the problematic loans in terms of long-term stability. Giving credits to the applicants is one of the fundamental activities of the banks, however; the same activity brings significant risks. As part of their founding purpose, the banks do not avoid taking risks, and they choose to manage them. The banks should perform their risk management in the way to keep the damages resulting from the amount of loans they give to a minimum. Considering the above and in order to speed up the lending procedures in banks while making advantageous decisions, different algorithmic models and classifications, machine learning techniques such as artificial neural networks were started to be used lately, data mining being at the first place. In this study, the accuracy of the applicants' eligibility status for loans was determined by making use of several machine learning techniques. The open-access dataset from the German Credit Data UCI was employed. Based on the 1000 customers in this study's dataset, a 75,60% success rate was achieved in the XGBoost classifier, which has the best success rate among the studies conducted with the XGBoost classifier previously. In addition, the success rate is the highest among the other algorithms used in various studies made.
IJCSMC, 2021
In the Egyptian banking industry, loan officers use pure judgment to make personal loan approval decisions. In this paper, we develop a new predictive method for default customers' loans using machine learning. The new predictive method uses the available personal data and historical credit data to evaluate the credit trust-worthiness of customers to obtain loans. We used the ABE dataset for training and testing, as we used 10 features from the application form and i-score report class that could give great help to credit officers for taking the right decision through avoiding customer selection using random techniques. The collected dataset was analysed by using various machine learning classifiers based on important selected features, to obtain high accuracy. We compared the performance of several machine learning classifiers before and after feature selection. We have found that in terms of high accuracy, the most important features are (activity-income-loan) and in terms of better performance the decision tree classifier has surpassed any other machine learning classifier with significant prediction accuracy of almost 94.85%.
Thesis, 2024
Accurately identifying credit risk is essential for the successful operation and growth of any business. By accurately predicting an applicant's loan status, businesses can better understand the drivers of credit risk and develop informed market strategies to promote business expansion. The goal of this research is to develop a machine learning model that can reliably predict the loan status (approved or rejected) of applicants to the Commercial Bank of Ethiopia (CBE). The study utilized a dataset of 32,285 applicants with 10 attributes. To evaluate the performance of different classifiers, the overall accuracy was used as the primary metric. Supervised machine learning techniques including Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), Support Vector Machine (SVM), and Deep Neural Network (DNN) were applied to the applicant loan status prediction task, based on their widespread use in prior literature. Feature importance and correlation matrix analysis were used for feature selection. Additionally, the SMOTE technique was employed to balance the dataset. The results show that the RF classifier achieved the best overall performance, with an accuracy of 93.62%, a recall of 94.72%,
IRJET, 2021
In our banking system, banks have many products to sell but main source of income of any banks is on its credit line. So they can earn from interest of those loans which they credits. A bank's profit or a loss depends to a large extent on loans i.e. whether the customers are paying back the loan or defaulting. By predicting the loan defaulters, the bank can reduce its Non-performing Assets. This makes the study of this phenomenon very important. Previous research in this era has shown that there are so many methods to study the problem of controlling loan default. But as the right predictions are very important for the maximization of profits, it is essential to study the nature of the different methods and their comparison. A very important approach in predictive analytics is used to study the problem of predicting loan defaulters (i) Collection of Data, (ii) Data Cleaning and (iii) Performance Evaluation. Experimental tests found that the Naïve Bayes model has better performance than other models in terms of loan forecasting.
International Journal of Innovative Technology and Exploring Engineering, 2020
Data mining is the key tools for discoveries of knowledge from large data set. Nowadays, most of the organizations using this technology to maintain their data. This paper focuses on the Bank sector in Risk management specifically, detecting Bank loan defaulters through the data mining application to examine the patterns of different attribute which would contribute for detecting and predicting defaulters thus preventing wrong loans. This process can be done without change the current systems and the data. Then it helps to distinguish borrowers who repay loans promptly from those who don’t and avoid wrong loan allotment. In order to show the results of the study Classification model is implemented in order to find interesting patterns among attributes of customer. A total of 20461 sample data were taken by data base admin randomly from 3 consecutive years from the Bank database to build and test the model. In this research we used Classification model of decision tree and Naïve Baye...
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.