The design and implementation of a database to store the (unreal) data of a vaccine distribution. The logical database is designed based on relational schema and UML. The real database is built with Python and PostgreSQL on top of the theoretical relational schema and UML. In addition, there is also a data analysis which is done in Python with Pandas and Matplotlib.
The important files of the database is divided into different folders as following:
code/:flush.sqlandflush.txtare used for emptying the database (useful in testing and implementation phase)query_creation.sqlcontains some queries to test the databaserequirements.txtcontains list of external librariestable_creation.pyconnects to PostgreSQL server and executestable_creation.sqlto create the schemas in the databasetable_creation.sqlcontains the actual code to create the schemastable_population.pyconnects to PostgreSQL server and populating the tables using pandastest_postgresql_conn.pyfor testing connection to PostgreSQl server
data/:modified_excel_tables.py: helper files to modify different sheets in Excel file to corresponding CSV files (for faster data population)- Excel files for data
data_analysis/: Jupyter notebook contains the code and answers to some data analysis questiondocumentation/: Final report of the project which contains detailed breakdown of the UML design, the database implementation, as well as some explanation about the coding logic