Fintech Credit Scoring Techniques for Evaluating P2P Loan Applications – A Python Machine Learning Ensemble Approach

AUTHORS

Rekha Ramesh Shenoy,Department of Computer Science, Lakehead University, ON, Canada
Sabah Mohammed,Department of Computer Science, Lakehead University, ON, Canada
Jinan Fiaidhi,Department of Computer Science, Lakehead University, ON, Canada

ABSTRACT

Financial Technology (fintech) has been widely recognized as one of the most important innovations in the financial industry and is seen to evolving very rapidly. It holds the promise of reshaping the financial industry by creating a diverse financial landscape by providing stability, improving quality, and most importantly reducing costs. One such fintech tool is the "Peer to Peer Lending" (also known as "P2P Lending"), which refers to companies that match lenders and borrowers without the use of the traditional banking systems. They are intermediaries that are usually online investment platforms that offer identity verification, proprietary credit models, loan approval, loan servicing, and legal and compliance. This can be an attractive alternative for a borrower as loans can be applied online, anonymously, and in a timely fashion. It is also beneficial for borrowers that do not have any previous credit history to be shown. Fintech develops a credit scoring model based on credit risk evaluation. This model establishes itself in the use of online data sources, alternative credit models, and a variety of machine learning and data analytics techniques to estimate risks involved in the lending process and to minimize operating costs. In this paper, we propose a stacking ensemble of machine learning classifiers that combines data preprocessing with different learning algorithms. We then compare the results of the bare bone classifiers with our stacking ensemble classifier. The ensemble model developed gives a better performance than each of the single classifiers that constitute the process of credit scoring.

 

KEYWORDS

Fintech tools, Credit scoring, Machine learning algorithms, Feature reduction, Outliers, Scikit-learn, Regression, Clustering, Bayesian, Neural networks, Forests, ensembles, Bagging, Boosting, Stacking.

REFERENCES

[1] A. Chandra and X. Yao, “Evolving hybrid ensembles of learning machines for better generalization,” Neurocomputing, vol.69, no.7-9, pp.686-700, March (2006)
[2] D. Browne, C. Manna, and S. Prestwich. “Relevance-redundancy dominance: A threshold-free approach to filter-based feature selection," In 24th Irish Conference on Artificial Intelligence and Cognitive Science, Sun SITE Central Europe/RWTH Aachen University, (2016)
[3] C. L. Huang, M. C. Chen, and C. J. Wang, “Credit scoring with a data mining approach based on support vector machines,” Expert Systems with Applications, vol.33, no.4, pp.847-856, November (2007)
[4] F. N. Koutanaei, H. Sajedi, and M. Khanbabaei, “A hybrid data mining model of feature selection algorithms and ensemble learning classifiers for credit scoring,” Journal of Retailing and Consumer Services, vol.27, pp.1-23, November (2015)
[5] S. V. Ha, H. N. Nguyen, and T. B. H. Nguyen, “A hybrid feature selection method for credit scoring,” EAI Endorsed Transactions on Context-aware Systems and Applications, vol.4, (2017) DOI: 10.4108/eai.6-3-2017.152335
[6] H. Wiryanto, “Credit scoring machine Learning with Keras  - R,” medium.com blog, Feb 11, (2018), Available Online: https://medium.com/@heruwiryanto/credit-scoring-machine-learning-with-keras-r-502fc6eb451d
[7] I. Brown and C. Mues, “An experimental comparison of classification algorithms for imbalanced credit scoring data sets,” Expert Systems with Applications, vol.39, no.3, pp.3446-3453, 15 February (2012)
[8] J. Kruppa, A. Schwarz, G. Arminger, and A. Ziegler, “Consumer credit risk: Individual probability estimates using machine learning,” Expert Systems with Applications, vol.40, no.13, pp.5125-5131, October 1, (2013)
[9] M. Ala’raj and M. F. Abbod, “A new hybrid ensemble credit scoring model based on classifiers consensus system approach,” Expert Systems with Applications, vol.64, pp.36-55, December, 1 (2016)
[10] N. C. Hsieh and L. P. Hung, “A data-driven ensemble classifier for credit scoring analysis,” Expert Systems with Applications, vol.37, no.1, pp.534-545, January (2010)
[11] C. Serrano-Cinca, B. Gutiérrez-Nieto, and L. López-Palacios. “Determinants of default in P2P lending,” PloS one 10, no.10 (2015): e0139427
[12] D. Opitz and R. Maclin. “Popular ensemble methods: An empirical study,” J. Artif. Intell. Res. (JAIR) 11, pp.169-198, (1999)

CITATION

  • APA:
    Shenoy,R.R.& Mohammed,S.& Fiaidhi,J.(2018). Fintech Credit Scoring Techniques for Evaluating P2P Loan Applications – A Python Machine Learning Ensemble Approach. International Journal of Smart Business and Technology, 6(1), 49-68. 10.21742/IJSBT.2018.6.1.05
  • Harvard:
    Shenoy,R.R., Mohammed,S., Fiaidhi,J.(2018). "Fintech Credit Scoring Techniques for Evaluating P2P Loan Applications – A Python Machine Learning Ensemble Approach". International Journal of Smart Business and Technology, 6(1), pp.49-68. doi:10.21742/IJSBT.2018.6.1.05
  • IEEE:
    [1] R.R.Shenoy, S.Mohammed, J.Fiaidhi, "Fintech Credit Scoring Techniques for Evaluating P2P Loan Applications – A Python Machine Learning Ensemble Approach". International Journal of Smart Business and Technology, vol.6, no.1, pp.49-68, Jun. 2018
  • MLA:
    Shenoy Rekha Ramesh, Mohammed Sabah and Fiaidhi Jinan. "Fintech Credit Scoring Techniques for Evaluating P2P Loan Applications – A Python Machine Learning Ensemble Approach". International Journal of Smart Business and Technology, vol.6, no.1, Jun. 2018, pp.49-68, doi:10.21742/IJSBT.2018.6.1.05

ISSUE INFO

  • Volume 6, No. 1, 2018
  • ISSN(p):2288-8969
  • ISSN(e):2207-516X
  • Published:Jun. 2018

DOWNLOAD