Titanic Dataset Analysis with Scikit-Learn Pipeline

less than 1 minute read

Titanic Dataset Analysis with Scikit-Learn Pipeline

The goal is to carry out the Titanic data classification analysis in Python using Scikit-Learn library. We will still use the Titanic dataset available on Kaggle web site. The goal here is to use the Scikit-Learn Pipeline API to automate the pre-processing and feature engineering tasks. After setting up the Pipeline, we test a few models: Logistic Regression and Random Forest.

The Jupyter Notebook

Titanic Dataset

Share on

Twitter Facebook LinkedIn

José Lise

Titanic Dataset Analysis with Scikit-Learn Pipeline

Titanic Dataset Analysis with Scikit-Learn Pipeline

Share on

You may also enjoy

Neural machine translation with attention

Neural characters language models

N-gram language models or how to write scientific papers

Prohibited Comments Classification