Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables

Last update: Jan 29, 2022

Overview

Mortgage-Application-Analysis

Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables: age, income level, occupancy type, accepted, and debt-income ratio, Eliminating all the demographic bias except for age We picked 5 attributes from the Mortgage data set provided and created a separate *.csv file to avoid extra data loss from the null values of the attributes which we neglect in our model. We preprocessed the data to drop any null values of the applicants which might skew our datasets using the pandas library For the processing part, we had some classification data with controlled intervals. We used Ordinal encoding to convert those into numeric discrete data for training and testing our model. We also had one, unique string data attribute, which was encoded using One-hot encoding to extract numeric values for processing. With this clean data, we divided the data into two groups, 80% for validation and 20%, and trained our model to establish a correlation between mortgage application acceptance.

Using Matlab plot, we carried out data/representation/ visualization and found out, other than debt-to-income ratio, there isn’t any significant correlation between acceptance and other non-demographic factors After this visualization to establish our hypothesis, we trained our model using the data set we created., and evaluate the model we created we applied 4 types of algorithms to test it out: We used the Logistic Regression model to create a line the best fit for log-odds values to calculate the acceptance rate for the mortgage application. The F1 score, precision score, and recall score for this testing were very high, which suggested that the non-demographic factor which we accounted for didn’t have many roles in the application being accepted or rejected. Similarly, we carried out a random forest model, Decision Tree, and Support Vector machine algorithm and each of those evaluations had really high precision, recall, and F1 score supporting the evidence from data visualization.

Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables

Related tags

Overview

Mortgage-Application-Analysis

Owner

Pipelines de datos, 2021.

NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT

A demo of chinese asr

Python module (C extension and plain python) implementing Aho-Corasick algorithm

A python script that will use hydra to get user and password to login to ssh, ftp, and telnet

Perform sentiment analysis on textual data that people generally post on websites like social networks and movie review sites.

VD-BERT: A Unified Vision and Dialog Transformer with BERT

Open-source offline translation library written in Python. Uses OpenNMT for translations

The ability of computer software to identify words and phrases in spoken language and convert them to human-readable text

Training code of Spatial Time Memory Network. Semi-supervised video object segmentation.

Turkish Stop Words Türkçe Dolgu Sözcükleri

Dé op-de-vlucht Pieton vertaler. Wereldwijd gebruikt door meer dan 1.000+ succesvolle bedrijven!

In this project, we compared Spanish BERT and Multilingual BERT in the Sentiment Analysis task.

Honor's thesis project analyzing whether the GPT-2 model can more effectively generate free-verse or structured poetry.

Python package for performing Entity and Text Matching using Deep Learning.

A simple implementation of N-gram language model.

A 10000+ hours dataset for Chinese speech recognition

Code for Emergent Translation in Multi-Agent Communication

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.