Titanic Dataset Analysis with Scikit-Learn Pipeline

less than 1 minute read

Titanic Dataset Analysis with Scikit-Learn Pipeline

The goal is to carry out the Titanic data classification analysis in Python using Scikit-Learn library. We will still use the Titanic dataset available on Kaggle web site. The goal here is to use the Scikit-Learn Pipeline API to automate the pre-processing and feature engineering tasks. After setting up the Pipeline, we test a few models: Logistic Regression and Random Forest.

Titanic Dataset