Data Leakage

Predicting User Adoption

Identified the most important factors contributing to user adoption for a product. Achieved a 96% accuracy with an SVM model, and found user length and opting into the mailing list as the most significant predictors.

Predicting Hotel Cancellations

Created prediction algorithm for determining if a customer will cancel at the moment of booking. After eliminating numerous data leakage sources, we achieved a 90% AUROC with a catboost classification model.