Feature Engineering

Predicting Rider Retention

Predicted rider retention for a taxi service and identified most significant factors that contributed to it. Achieved an 80% accuracy with a catboost model, which was chosen for its interpretability.

Predicting User Adoption

Identified the most important factors contributing to user adoption for a product. Achieved a 96% accuracy with an SVM model, and found user length and opting into the mailing list as the most significant predictors.

Lightfm Recommendation System

Recommendation engine with a .97 AUC achieved using clustering techniques to create user features. Data represents Olist marketplace transactions and was retrieved from kaggle.com.

Predicting Hotel Cancellations

Created prediction algorithm for determining if a customer will cancel at the moment of booking. After eliminating numerous data leakage sources, we achieved a 90% AUROC with a catboost classification model.