Sign Language Prediction with MobileNet
In this exercise we again apply transfer learning, this time to recognise numeric sign language. We use the MobileNet model, modify its classification head, and then fine-tune it to suit our requirements. Click here to view the code.
TensorFlow, Transfer Learning, CNN, Google Colab Notebook
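The transfer-learning recipe described above can be sketched in Keras roughly as follows. This is a minimal illustration, not the project's actual code: the number of classes, head layers, and input size are assumptions, and `weights=None` is used only to keep the sketch runnable offline (in a real transfer-learning run you would load `weights="imagenet"`).

```python
from tensorflow.keras import layers, models
from tensorflow.keras.applications import MobileNet

NUM_CLASSES = 10  # assumption: digits 0-9 in the sign-language data set

# weights=None keeps the sketch offline; loading weights="imagenet" is what
# makes this transfer learning in practice.
base = MobileNet(input_shape=(224, 224, 3), include_top=False, weights=None)
base.trainable = False  # freeze the convolutional base before fine-tuning

# Replace the original classifier with a small head for our classes.
model = models.Sequential([
    base,
    layers.GlobalAveragePooling2D(),
    layers.Dense(128, activation="relu"),
    layers.Dense(NUM_CLASSES, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```

Fine-tuning would then unfreeze some of the top base layers and continue training at a low learning rate.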
MNIST Digit Recognizer using CNN
MNIST (“Modified National Institute of Standards and Technology”) is the de-facto “hello world” data set of computer vision. Since its release in 1999, this classic data set of handwritten images has served as the basis for benchmarking classification algorithms. As new machine learning techniques emerge, MNIST remains a reliable resource for researchers and learners alike. In this exercise, our goal is to correctly identify digits from a data set of tens of thousands of handwritten images. Click here to view the code.
TensorFlow, CNN, Google Colab Notebooks
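A CNN of the kind typically used for this task can be defined in a few lines of Keras. The layer sizes here are illustrative assumptions, not the project's exact architecture; MNIST images are 28×28 grayscale, which fixes the input shape and the 10-way softmax output.

```python
from tensorflow.keras import layers, models

# A small convolutional network for 28x28 grayscale digit images.
model = models.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Conv2D(32, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dense(10, activation="softmax"),  # one output per digit 0-9
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
# Training (not run here): model.fit(x_train, y_train, epochs=5)
```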
Exploratory Data Analysis - Titanic Data Set (Kaggle)
The objective of this exercise was to clean the data and then apply feature engineering to generate meaningful insights from the Titanic data set available on Kaggle. A training data set with 891 rows was used for this exercise. The data set has interesting features such as age, gender, and fare of the passengers, used to predict whether a passenger survived the Titanic disaster. Click here to view the code.
Missing Value Treatment, Feature Engineering and Exploratory Data Analysis, Google Colab Notebooks
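The cleaning and feature-engineering steps above can be sketched with pandas. The column names match the real Kaggle Titanic schema, but the rows below are made up for illustration, and the specific imputation and features are assumptions, not the project's exact choices.

```python
import pandas as pd

# Toy rows mimicking the Kaggle Titanic schema (data made up for illustration).
df = pd.DataFrame({
    "Age": [22.0, None, 38.0, None],
    "Sex": ["male", "female", "female", "male"],
    "Fare": [7.25, 71.28, 53.10, 8.05],
    "Survived": [0, 1, 1, 0],
})

# Missing-value treatment: impute Age with the median.
df["Age"] = df["Age"].fillna(df["Age"].median())

# Feature engineering: a binned fare and an encoded gender.
df["FareBand"] = pd.qcut(df["Fare"], 2, labels=[0, 1])
df["IsFemale"] = (df["Sex"] == "female").astype(int)

# Exploratory aggregate: survival rate by gender.
print(df.groupby("Sex")["Survived"].mean())
```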
Creation of an India Credit Risk Default Model Using Logistic Regression
The project involved developing a credit risk default model using a given data set, which had to be checked for outliers, missing values, multicollinearity, etc. Univariate and bivariate analysis was conducted, and the model was built using logistic regression on the most important variables. Model performance measures were undertaken, including assessing the accuracy of the model on held-out data. Click here to view the code.
Logistic Regression, Univariate & Bivariate Analysis, Outlier Treatment, Model Performance Measures
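The pipeline above (outlier treatment, then a logistic regression with performance measurement) can be sketched with scikit-learn. The data here is synthetic and the percentile capping is one common outlier treatment; both are assumptions standing in for the project's real financial variables and choices.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Synthetic stand-in for the credit data (two numeric ratios; the real
# project used many more financial variables).
X = rng.normal(size=(500, 2))
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=0.5, size=500) > 0).astype(int)

# Outlier treatment: cap each feature at its 1st/99th percentiles.
lo, hi = np.percentile(X, [1, 99], axis=0)
X = np.clip(X, lo, hi)

# Fit and measure performance on a held-out split.
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)
clf = LogisticRegression().fit(X_tr, y_tr)
acc = accuracy_score(y_te, clf.predict(X_te))
print("test accuracy:", acc)
```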
Visualizing Car Insurance Claims using Tableau
This project explored the art of problem-solving with the aid of visual analytics. Tableau’s data visualization tools were used to create interactive dashboards providing high-level insights to an insurance company, to help drive its car insurance schemes.
Data Visualization, Tableau, Business Intelligence
Build a forecasting model to predict monthly gas production
The project involved developing an ARIMA model to forecast the monthly Australian gas production level for the next 12 months. Click here to view the code.
ARIMA, Time Series Forecasting, ADF Test
Choosing preferable mode of transport by employees
The project involved predicting the mode of transport that employees prefer when commuting to the office. For this, multiple models such as KNN, Naive Bayes, and logistic regression were created and compared on their performance metrics. Bagging and boosting procedures were also applied to build ensemble models. Click here to view the code.
Bagging and Boosting, KNN, Naive Bayes, Logistic Regression
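Comparing these model families side by side can be sketched with scikit-learn's cross-validation. The data is a synthetic stand-in for the employee-transport data, and the default hyperparameters are assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, BaggingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in for the employee commute data.
X, y = make_classification(n_samples=400, n_features=6, random_state=0)

# One entry per model family named in the project.
models = {
    "KNN": KNeighborsClassifier(),
    "Naive Bayes": GaussianNB(),
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Bagging": BaggingClassifier(random_state=0),
    "Boosting": AdaBoostClassifier(random_state=0),
}

# 5-fold cross-validated accuracy as the comparison metric.
scores = {name: cross_val_score(m, X, y, cv=5).mean()
          for name, m in models.items()}
for name, s in scores.items():
    print(f"{name}: {s:.3f}")
```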
Predicting Customer Churn in the Telecom Industry
The primary objective was to investigate the parameters contributing to customer churn (attrition) in the telecom industry. A logistic regression model was developed and validated on test data to predict customer churn.
Logistic Regression, Model Comparison, Predictive Analytics
Building a Supervised Model to Cross-Sell Personal Loans
The objective of this exercise was to build a model using a supervised learning technique to identify profitable segments to target for cross-selling personal loans. Pilot campaign data on 20,000 customers was used, which included several demographic and behavioral variables. The model was further validated and a deployment strategy was recommended.
Random Forest, Data Mining, Pruning, Model Performance Measures
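A random forest of the kind named above can be sketched with scikit-learn. The data is a synthetic, class-imbalanced stand-in for the 20,000-customer campaign data, and the depth/leaf settings (which act as pruning controls on the trees) are illustrative assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the campaign data: few positives, like a
# cross-sell response variable.
X, y = make_classification(n_samples=2000, n_features=8,
                           weights=[0.9, 0.1], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

# max_depth and min_samples_leaf restrain tree growth (pruning-style control).
rf = RandomForestClassifier(n_estimators=200, max_depth=5,
                            min_samples_leaf=10, random_state=0)
rf.fit(X_tr, y_tr)

# Performance on held-out data, plus variable importances for picking
# segments worth targeting.
print("test accuracy:", rf.score(X_te, y_te))
print("importances:", rf.feature_importances_.round(3))
```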