Exploring the Titanic Dataset

“The way to get started is to quit talking and start doing.”

Walt Disney

Okay, so we have some data, hopefully cleaned up and enhanced (just a little). Let’s see what we can find out about it.

Let’s load some libraries for Machine Learning:
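
The post does not name the libraries at this point, so the following is only a guess at a reasonable set for the techniques mentioned below. It assumes pandas, NumPy and scikit-learn are installed; none of these imports are confirmed by the original.

```python
# Assumed imports: the post does not list its libraries here.
import numpy as np
import pandas as pd

# scikit-learn models matching the techniques named in this post
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB

# Helpers for splitting the data and scoring a model
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
```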

My plan for this dataset is to start by building a Random Forest classifier and seeing what we can learn from it. After that, I think we will look at other Machine Learning techniques, such as Naive Bayes and Logistic Regression. To be quite honest, it has been years since I was at university, so I am going to need to reacquaint myself with statistics.
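
To make the first step of that plan concrete, here is a minimal sketch of fitting a Random Forest with scikit-learn. It does not use the real Titanic data: the DataFrame is synthetic, the column names (Pclass, Sex, Age, Fare, Survived) and the 0/1 encoding of Sex are assumptions, and the rule that generates the label is made up purely so the model has something to learn.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in data, NOT the real Titanic dataset.
rng = np.random.default_rng(0)
n = 200
df = pd.DataFrame({
    "Pclass": rng.integers(1, 4, size=n),   # classes 1, 2, 3
    "Sex": rng.integers(0, 2, size=n),      # hypothetical 0/1 encoding
    "Age": rng.uniform(1, 80, size=n),
    "Fare": rng.uniform(5, 250, size=n),
})
# Made-up label rule, used only for illustration.
df["Survived"] = ((df["Sex"] == 1) | (df["Pclass"] == 1)).astype(int)

# Separate the features from the target and hold out 25% for testing.
X = df.drop(columns="Survived")
y = df["Survived"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Fit the forest and check accuracy on the held-out rows.
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)
accuracy = model.score(X_test, y_test)

# Feature importances hint at which columns the forest relied on.
importances = pd.Series(
    model.feature_importances_, index=X.columns
).sort_values(ascending=False)

print(f"accuracy: {accuracy:.2f}")
print(importances)
```

With the real data, the synthetic DataFrame would be replaced by the cleaned Titanic table, and the feature importances are one place to start looking for what the model has learned.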