Exercise: Pandas and SQL
1. Pandas Exercise
Using the data in the Titanic example, describe a significant factor that could be used to predict survival rates. Focusing on one variable at a time is probably the easiest way to approach it.
Note that there are many possible answers. This doesn't have to be exhaustive, but try to find at least one thing expressing a correlation with surviving the wreck of the Titanic. Going from a simple hypothesis ("Maybe people from Cherbourg are unlucky?") to a more complex formula incorporating multiple variables, with the goal of a more accurate prediction, is the path of data analysis. This is our first step on that journey.
2. SQL Exercise
Using the music track company example from this session, Corporate has tasked you with identifying the three least popular genres of music ranked by the number of tracks purchased.