Ajay Sharma

Yellow Taxi Demand Prediction NYC

Predict the pickup density of yellow cabs at a given time and location in New York City using Linear Regression, Random Forest, XGBoost, Time Series Forecasting and Fourier Transformation.
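As a rough illustration of how a Fourier transform can expose periodic demand, the sketch below finds the dominant cycle in a series of pickup counts. The data is synthetic and the function names (`dft_amplitudes`, `dominant_period`) are hypothetical; this is not the project's actual code, only a minimal standard-library example of the idea.

```python
import cmath
import math


def dft_amplitudes(series):
    """Return the amplitude of each DFT frequency bin from 0 to n // 2."""
    n = len(series)
    mean = sum(series) / n
    centered = [x - mean for x in series]  # remove the constant (DC) component
    amplitudes = []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(centered)
        )
        amplitudes.append(abs(coeff))
    return amplitudes


def dominant_period(series):
    """Return the period, in samples, of the strongest non-constant frequency."""
    amplitudes = dft_amplitudes(series)
    k = max(range(1, len(amplitudes)), key=lambda i: amplitudes[i])
    return len(series) / k


# Synthetic (assumed) hourly pickup counts over 4 days with a 24-hour cycle.
pickups = [100 + 40 * math.sin(2 * math.pi * t / 24) for t in range(96)]
period = dominant_period(pickups)
```

The strongest amplitudes and their frequencies could then be passed to a regression model as extra features alongside lagged pickup counts.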

Cancer Diagnosis Using Medical Records

Classify genetic variations/mutations based on evidence from text-based clinical literature using Logistic Regression, Random Forest, TF-IDF and Feature Engineering.

Stackoverflow Tag Predictor

Suggest tags based on the content of a question posted on Stack Overflow. Techniques used: Logistic Regression (One-vs-Rest multilabel classifier).
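In a One-vs-Rest multilabel setup, each tag becomes its own yes/no problem, and one binary classifier is trained per tag. The sketch below shows only the label preparation step in plain Python. The helper name `binarize_tags` and the sample tags are assumptions for illustration, not taken from the project.

```python
def binarize_tags(tag_lists):
    """Turn per-question tag lists into one binary label column per tag."""
    vocabulary = sorted({tag for tags in tag_lists for tag in tags})
    columns = {
        tag: [1 if tag in tags else 0 for tags in tag_lists]
        for tag in vocabulary
    }
    return vocabulary, columns


# Hypothetical tags for three questions.
question_tags = [["python", "pandas"], ["java"], ["python"]]
vocabulary, columns = binarize_tags(question_tags)

# Each column is the target for one binary classifier, for example a
# logistic regression that answers "does this question carry the tag?".
```

At prediction time, every per-tag classifier scores the question, and the tags whose classifiers answer "yes" are suggested.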

Quora Question Similarity

Identify which questions asked on Quora are duplicates of questions that have already been asked, so that existing answers can be served instantly. The task is to predict whether a pair of questions are duplicates. Techniques used: Logistic Regression, Linear SVM and XGBoost.
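Classifiers for question pairs need numeric features that describe how similar the two questions are. One simple candidate is the share of words the questions have in common. The function below is a hypothetical example of such a feature, not necessarily one used in the project.

```python
def word_overlap(question1, question2):
    """Jaccard similarity between the lowercase word sets of two questions."""
    words1 = set(question1.lower().split())
    words2 = set(question2.lower().split())
    if not words1 or not words2:
        return 0.0  # no words to compare
    return len(words1 & words2) / len(words1 | words2)


# Hypothetical question pair.
score = word_overlap(
    "How do I learn Python quickly",
    "How can I learn Python fast",
)
```

A score near 1 means the questions use almost the same words. A classifier such as logistic regression would combine this value with other features before deciding whether the pair is a duplicate.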

Netflix Movie Recommendation System

Netflix provided a large set of anonymous rating data and set a prediction accuracy bar that is 10% better than what Cinematch achieves on the same training data set. (Accuracy measures how closely predicted movie ratings match subsequent actual ratings.) Techniques used: XGBoost, SVD++.
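The Netflix Prize scored accuracy as root mean squared error (RMSE) between predicted and actual ratings. The sketch below computes RMSE and the relative improvement over a baseline. The function names and the sample ratings are assumptions for illustration only.

```python
import math


def rmse(predicted, actual):
    """Root mean squared error between two equal-length rating lists."""
    if not actual or len(predicted) != len(actual):
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared_errors) / len(actual))


def relative_improvement(baseline_rmse, model_rmse):
    """Fraction by which the model's RMSE is lower than the baseline's."""
    return (baseline_rmse - model_rmse) / baseline_rmse


# Made-up ratings on a 1 to 5 scale.
actual = [4, 3, 5, 2]
baseline_predictions = [3, 3, 4, 3]
model_predictions = [4, 3, 4, 2]

baseline_error = rmse(baseline_predictions, actual)
model_error = rmse(model_predictions, actual)
gain = relative_improvement(baseline_error, model_error)
```

A `gain` of 0.10 or more would correspond to the 10% bar described above.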

Microsoft Malware Detection

Identify whether a given file or piece of software is malware. Techniques used: KNN, Logistic Regression, Random Forest and XGBoost.

Amazon Fashion Discovery Engine

Build a recommendation engine that suggests products (apparel) similar to a given product using the amazon.com dataset. Techniques used: VGG-16 CNN, TensorFlow, TFIDF-AvgWord2Vec model.

Amazon Fine Food Reviews

Predict whether a review in the Amazon Fine Food dataset is positive or negative by training models with several classification algorithms and comparing their results against each other.

Scrabble Word Game

A complete Scrabble game written in Python 3 with the following features: challenge mode, time limit, point limit, multiplayer on a single computer, multiplayer over LAN and playing against the computer.

Programmer | Mathematician | Data Science
