linkedin/ghc25-ds-workshop

GHC 25 Data Science Workshop

Prerequisites

Before starting, ensure you have the following installed:

  • Python 3.8+: Download Python
    • Verify: python3 --version
  • R 4.0+: Download R
    • Verify: R --version
  • SQLite 3: Download SQLite
    • Verify: sqlite3 --version
    • Note: SQLite is pre-installed on most macOS/Linux systems
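
The three checks above can also be scripted. The following is a minimal sketch, not part of the repository: the function name `check_prerequisites` is hypothetical, and it only confirms that the `R` and `sqlite3` executables are on your PATH, not that their versions meet the minimums listed.

```python
import shutil
import sys

# Minimum Python version taken from the prerequisites list above.
MIN_PYTHON = (3, 8)


def check_prerequisites(tools=("R", "sqlite3")):
    """Return a dict mapping each requirement to True (found) or False (missing).

    Hypothetical helper for illustration; it does not check R or SQLite versions.
    """
    results = {"python": sys.version_info >= MIN_PYTHON}
    for tool in tools:
        # shutil.which returns None when the executable is not on PATH.
        results[tool] = shutil.which(tool) is not None
    return results


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'found' if ok else 'MISSING'}")
```

If a tool is reported as missing, run the matching `--version` command from the list above to see the exact error from your shell.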

Getting Started

Clone the repository and navigate to the project directory:

git clone https://github.com/linkedin/ghc25-ds-workshop.git
cd ghc25-ds-workshop

Setup Instructions

macOS/Linux

# Run the automated setup script
# This will:
# - Create the virtual environment
# - Install all dependencies
# - Verify the database setup
# - Keep the environment activated and ready to use
source setup.sh

# To exit the virtual environment later: deactivate

Windows

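# Create a virtual environment named 'ghc_workshop'
python -m venv ghc_workshop

# Install dependencies
ghc_workshop\Scripts\python.exe -m pip install -r requirements.txt

# Verify database setup
ghc_workshop\Scripts\python.exe scripts/setup.py

# Activate the environment for the steps below
# (Command Prompt; in PowerShell run ghc_workshop\Scripts\Activate.ps1)
ghc_workshop\Scripts\activate

# To exit the virtual environment later: deactivate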

Install R Packages (required for the R Shiny dashboard)

Open R and install the required packages:

# In R console or RStudio
source("rshiny/salesstore-dashboard/requirements.R")

# Type `q()` and press Enter to quit R
# When prompted "Save workspace image?", type `n` and press Enter

Test Your Setup

1) Run sample SQL query

  1. Open sql/sample_query.sql in VS Code
  2. Press Cmd+Shift+P (macOS) or Ctrl+Shift+P (Windows/Linux) to open the Command Palette
  3. Type "Tasks: Run Task", then select "Run Current SQL File"

This demonstrates how to query the sales store database.
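
If you prefer to run a query from Python rather than through the editor task, the sketch below shows the general pattern with the standard-library `sqlite3` module. It is an illustration under assumptions, not the contents of sql/sample_query.sql: it builds an in-memory stand-in database, and the column names (`order_id`, `category`, `amount`) and rows are made up. Only the table name `orders` comes from this README.

```python
import sqlite3

# In-memory stand-in so the sketch runs without the workshop database file.
conn = sqlite3.connect(":memory:")
# Hypothetical columns; the real orders table may look different.
conn.execute("CREATE TABLE orders (order_id INTEGER, category TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "Books", 12.5), (2, "Books", 7.5), (3, "Toys", 30.0)],  # made-up rows
)


def total_by_category(connection):
    """Sum order amounts per category, largest total first."""
    query = """
        SELECT category, SUM(amount) AS total
        FROM orders
        GROUP BY category
        ORDER BY total DESC
    """
    return connection.execute(query).fetchall()


rows = total_by_category(conn)
for category, total in rows:
    print(category, total)
conn.close()
```

To try this against the workshop data, replace `":memory:"` with the path to the project's database file and drop the `CREATE TABLE` and `INSERT` statements.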

2) Run the sample analysis notebook

  1. Open notebook/sample_analysis.ipynb
  2. Make sure your virtual environment is activated
  3. Run all cells in the notebook

Expected output:

  • ✅ Data loaded successfully with 10 rows, 7 columns
  • 📊 Available tables list showing orders table
  • Sample data preview from the orders table
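
For orientation, the checks behind this output can be approximated with the standard library alone. This is a rough sketch, not the notebook's actual code: `in_virtualenv` and `summarize` are hypothetical helpers, and the stand-in table uses made-up columns and rows, so its row and column counts will not match the numbers listed above.

```python
import sqlite3
import sys


def in_virtualenv():
    """True when Python runs inside a venv (prefix differs from the base install)."""
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


def summarize(connection, table):
    """Return (all table names, column names of `table`, row count of `table`)."""
    tables = [
        row[0]
        for row in connection.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        )
    ]
    cursor = connection.execute(f'SELECT * FROM "{table}"')
    columns = [desc[0] for desc in cursor.description]
    return tables, columns, len(cursor.fetchall())


# Stand-in database with made-up data; the real one ships with the workshop.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (order_id INTEGER, amount REAL)")
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 9.99), (2, 24.0)])

tables, columns, n_rows = summarize(conn, "orders")
print("Virtual environment active:", in_virtualenv())
print("Available tables:", tables)
print(f"Data loaded with {n_rows} rows, {len(columns)} columns")
conn.close()
```

If the first line prints `False`, activate the virtual environment before running the notebook.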

References

Fekihmea. (n.d.). Sales Store Overview [Data set]. Kaggle.
Retrieved from https://www.kaggle.com/datasets/fekihmea/sales-store-overview/data
Licensed under the MIT License.


License

This project is licensed under the BSD 2-Clause License. See LICENSE.txt for details.

About

This repository is for the Grace Hopper 2025 Data Science (DS) Workshop.
