This repository contains scripts for extracting, transforming, loading, and processing Pokemon data from the PokéAPI. Follow the instructions below to set up the SQLite database, install dependencies, and run each script.
- Clone the Repository:
git clone https://github.com/tsankotsanev/pokemon_data_processing.git
- Navigate to the project directory:
cd pokemon_data_processing
- Create and activate a Python virtual environment:
Make sure you have Python 3.11.x installed.
python -m venv venv
source venv/bin/activate
- Install Dependencies:
pip install -r requirements.txt
The scripts must be run in the order listed, because each one works on the output of the previous one.
This script fetches Pokemon data from the PokéAPI, prints it in the terminal, and saves it to raw_data.json.
- Run the script:
python data_extraction.py
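As a rough illustration of what an extraction step like this can look like, here is a minimal sketch using only the standard library. It assumes the public PokéAPI v2 `pokemon` endpoint; the helper names (`select_fields`, `fetch_pokemon`, `extract`), the chosen fields, and the `User-Agent` value are hypothetical and not taken from `data_extraction.py`.

```python
import json
import urllib.request

# Assumption: the public PokéAPI v2 endpoint. The repository's script may
# use a different URL or HTTP client.
API_URL = "https://pokeapi.co/api/v2/pokemon/{id}"


def select_fields(payload: dict) -> dict:
    """Keep only the fields needed later for the BMI calculation."""
    return {
        "id": payload["id"],
        "name": payload["name"],
        "height": payload["height"],  # decimetres, as returned by PokéAPI
        "weight": payload["weight"],  # hectograms, as returned by PokéAPI
    }


def fetch_pokemon(pokemon_id: int) -> dict:
    """Download one Pokemon record and reduce it to the selected fields."""
    request = urllib.request.Request(
        API_URL.format(id=pokemon_id),
        headers={"User-Agent": "pokemon-data-example"},  # hypothetical client name
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return select_fields(json.load(response))


def extract(ids, path="raw_data.json"):
    """Fetch the given ids, print the records, and save them as JSON."""
    records = [fetch_pokemon(i) for i in ids]
    print(json.dumps(records, indent=2))
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(records, fh, indent=2)
    return records
```

Calling `extract(range(1, 21))` would fetch the first 20 Pokemon; the range is only an example, and the actual script may fetch a different set.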
This script transforms the raw Pokemon data and calculates the BMI for each Pokemon. It then prints the result in the terminal and exports the transformed data to transformed_data.json.
- Run the script:
python data_transformation.py
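The BMI calculation can be sketched as follows. This assumes the raw records keep PokéAPI's native units (height in decimetres, weight in hectograms) and uses the standard formula, weight in kilograms divided by height in metres squared. The function names, output field names, and rounding are hypothetical, and `data_transformation.py` may differ.

```python
def bmi(height_dm: float, weight_hg: float) -> float:
    """Return BMI = weight (kg) / height (m) squared, rounded to 2 decimals."""
    height_m = height_dm / 10   # PokéAPI heights are in decimetres
    weight_kg = weight_hg / 10  # PokéAPI weights are in hectograms
    return round(weight_kg / height_m ** 2, 2)


def transform(records):
    """Convert raw records to metric units and attach the BMI."""
    return [
        {
            "name": r["name"],
            "height_m": r["height"] / 10,
            "weight_kg": r["weight"] / 10,
            "bmi": bmi(r["height"], r["weight"]),
        }
        for r in records
    ]


# Pikachu is listed by PokéAPI with height 4 (0.4 m) and weight 60 (6.0 kg).
print(transform([{"name": "pikachu", "height": 4, "weight": 60}]))
```

For Pikachu this gives 6.0 / 0.4² = 37.5.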
This script loads the transformed Pokemon data into an SQLite database named pokemon.db.
- Run the script:
python data_loading.py
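A loading step of this kind can be written with the standard `sqlite3` module. The sketch below is illustrative only: the table name `pokemon`, its columns, and the `load` helper are assumptions, not the schema used by `data_loading.py`.

```python
import sqlite3


def load(records, db_path="pokemon.db"):
    """Create the table if needed and insert the transformed records."""
    conn = sqlite3.connect(db_path)
    with conn:  # commits on success, rolls back on error
        conn.execute(
            """
            CREATE TABLE IF NOT EXISTS pokemon (
                name      TEXT PRIMARY KEY,
                height_m  REAL,
                weight_kg REAL,
                bmi       REAL
            )
            """
        )
        # INSERT OR REPLACE keeps the script safe to re-run on the same data.
        conn.executemany(
            "INSERT OR REPLACE INTO pokemon (name, height_m, weight_kg, bmi) "
            "VALUES (:name, :height_m, :weight_kg, :bmi)",
            records,
        )
    return conn
```

Passing `":memory:"` as `db_path` creates a throwaway in-memory database, which is handy for trying the function without touching pokemon.db.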
This script exports the data from the SQLite database to a CSV file, then processes it to find the average BMI and the Pokemon with the highest and lowest BMI, and prints the results in the terminal.
- Run the script:
python advanced_data_processing.py
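The export and summary can be sketched with the standard library alone. This assumes a `pokemon` table with `name` and `bmi` columns; the table layout and the helper names `export_csv` and `summarize` are hypothetical, and the actual script may use other tools.

```python
import csv
import sqlite3
import statistics


def export_csv(conn: sqlite3.Connection, csv_path: str) -> None:
    """Write the name and BMI of every row in the pokemon table to a CSV file."""
    cursor = conn.execute("SELECT name, bmi FROM pokemon ORDER BY name")
    with open(csv_path, "w", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh)
        writer.writerow([col[0] for col in cursor.description])  # header row
        writer.writerows(cursor)


def summarize(csv_path: str) -> dict:
    """Read the CSV back and compute the average, highest, and lowest BMI."""
    with open(csv_path, newline="", encoding="utf-8") as fh:
        rows = [(row["name"], float(row["bmi"])) for row in csv.DictReader(fh)]
    return {
        "average_bmi": round(statistics.mean(value for _, value in rows), 2),
        "highest": max(rows, key=lambda r: r[1]),
        "lowest": min(rows, key=lambda r: r[1]),
    }
```

With a connection opened via `sqlite3.connect("pokemon.db")`, calling `export_csv(conn, "pokemon.csv")` followed by `print(summarize("pokemon.csv"))` would reproduce the kind of output described above; the CSV file name is only an example.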
Pokemon Histogram presenting the BMI of every Pokemon in ascending order
Pokemon Scatter Plot presenting the relationship between height and weight