- Project Owner: @dark-teal-coder
- First Published Date: 2023-02-08
- Title: Python Data Analysis of Titanic Survival with Pandas
- Difficulty:
- Beginner
- Intermediate
- Advanced
- Scale:
- Small
- Medium
- Large
The project uses Python Pandas library to analyze Titanic data from Kaggle to answer the question "What categories of passengers were most likely to survive the Titanic disaster?". The data set can be obtained at Titanic - Machine Learning from Disaster.
- Python 3
- Python Package Installer/Manager
pip
- If you installed Python from python.org, you should already have
pip
. If it is not installed, you can use the commandpy -m ensurepip --default-pip
to bootstrap it from the standard library. If you are using Linux, you will have to install the package manager separately. You can find out more about thepip
tool here.
- If you installed Python from python.org, you should already have
- Text Editor and Integrated Development Environment (IDE)
- Command-line interface (CLI)
- You can install the open-source PowerShell on Windows, Linux and macOS if you do not have or want to use a pre-installed CLI on your local machine.
Check if you have Python installed using the command python --version
, or simply, python version
, in the CLI. Git-clone the project repository from Github to the local machine. Use the command py -m pip install package_name
to install the necessary Python libraries. Check out pip documentation to learn more about pip install
. Check the top part of the .py
script file for the list of libraries required. For example, you may need requests
and beautifulsoup4
libraries if you see the following lines in the top part of the script file:
import requests
from bs4 import BeautifulSoup
If pip
fails to locate the relevant packages, you may find it at Python Package Index (PyPI). Use python file_name.py
to run the script in a CLI. Or, use an IDE, such as VS Code, to run the script. There will usually be a [Run] button in the top right corner of the opened script file.
- pandas.DataFrame.describe documentation
- pandas.DataFrame.groupby
- Binning Data with Pandas qcut() and cut()
- pandas.DataFrame.any documentation
- pandas.DataFrame.dropna documentation
1st Completion Date: Feb 08, 2023