Before starting, ensure you have the following installed:
- Python 3.8+: Download Python
- Verify:
python3 --version
- Verify:
- R 4.0+: Download R
- Verify:
R --version
- Verify:
- SQLite 3: Download SQLite
- Verify:
sqlite3 --version - Note: SQLite is pre-installed on most macOS/Linux systems
- Verify:
Clone the repository and navigate to the project directory:
git clone https://github.com/linkedin/ghc25-ds-workshop.git
cd ghc25-ds-workshop# Run the automated setup script
# This will:
# - Create the virtual environment
# - Install all dependencies
# - Verify the database setup
# - Keep the environment activated and ready to use
source setup.sh
# To exit the virtual environment later: deactivate# Create a virtual environment named 'ghc_workshop'
python -m venv ghc_workshop
# Install dependencies
ghc_workshop\Scripts\python.exe -m pip install -r requirements.txt
# Verify database setup
ghc_workshop\Scripts\python.exe scripts/setup.pyOpen R and install required packages:
# In R console or RStudio
source("rshiny/salesstore-dashboard/requirements.R")
# Type `q()` and press Enter to quit R
# When prompted "Save workspace image?", type `n` and press Enter- Open
sql/sample_query.sql - Press
Cmd+Shift+P(macOS) orCtrl+Shift+P(Windows/Linux) - Type "Tasks: Run Task" and select "Run Current SQL File"
This demonstrates how to query the sales store database.
- Open
notebook/sample_analysis.ipynb - Make sure your virtual environment is activated
- Run all cells in the notebook
Expected output:
- ✅ Data loaded successfully with 10 rows, 7 columns
- 📊 Available tables list showing
orderstable - Sample data preview from the orders table
Fekihmea. (n.d.). Sales Store Overview [Data set]. Kaggle.
Retrieved from https://www.kaggle.com/datasets/fekihmea/sales-store-overview/data
Licensed under the MIT License.
This project is licensed under the BSD 2-Clause License. See LICENSE.txt for details.