# Grade the Grader runbook
This runbook describes a utility that reads the grade configuration file (grade_config.yaml), loads the referenced data files from the input folder, and then grades a model's responses to grading prompts, each of which produces a grade from a given and an expected value. Grades are written to the output folder as yyyy-mm-ddThh:mm:ss-grades.json.
When you run the utility you will specify folders to mount for /config, /input, and /output:
```
/📁 config
├── 📝 grade_config.yaml                   # Grade configuration
/📁 input
├── 📁 grader                              # Simple LLM message list with grading prompts and keys
│   ├── ✏️ grader1.csv                     # Grade prompts
│   └── ✏️ grader_key.csv                  # Grading keys (given, expected, min, max)
/📁 output
└── 📀 yyyy-mm-ddThh:mm:ss-grades.json     # Grades from running the evaluation
```
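For reference, each row of the grading key pairs a given value with an expected value and min/max values (presumably the range the grade should fall within). A minimal sketch of grader_key.csv with hypothetical rows; the exact semantics depend on your grading prompts:

```csv
given,expected,min,max
"Hello, world!","Hello, world!",0.9,1.0
"Goodbye","Hello, world!",0.0,0.2
```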
Adjust the command below to use appropriate values for your Echo Bot project, and add it to your pipenv scripts; a sketch of a Pipfile [scripts] entry follows the command. See the [test_data] folder for sample files. Grades will be written to a file called {datetime}-grades.json in the output folder when you run the tool.
```bash
docker run --rm \
  -v ./my_model/input:/input \
  -v ./my_model/config:/config \
  -v ./my_model/output:/output \
  ghcr.io/agile-learning-institute/stage0-echo-grade:latest
```
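To register the command as a pipenv script, it can go in the [scripts] section of your project's Pipfile. A minimal sketch, assuming a script name of `grade` and the mount paths shown above:

```toml
[scripts]
# Hypothetical script name; the docker command is collapsed onto a single line.
grade = "docker run --rm -v ./my_model/input:/input -v ./my_model/config:/config -v ./my_model/output:/output ghcr.io/agile-learning-institute/stage0-echo-grade:latest"
```

With that entry in place, running `pipenv run grade` from the project root invokes the container.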
Ensure the following tools are installed:
- Docker
- pipenv
All testing uses config/input/output folders in ./test_data.
- `pipenv install`
- `pipenv run grade`
- `pipenv run debug` - runs locally with the logging level set to DEBUG
- `pipenv run model` - see `Gary.modelfile`: from `llama3.2:latest`, with the temperature turned all the way down to 0
- `pipenv run build`
- `pipenv run container`
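One possible local testing pass, assuming the local model described by `Gary.modelfile` needs to be created before grading:

```bash
pipenv install      # install dependencies
pipenv run model    # create the local test model from Gary.modelfile
pipenv run debug    # run the grader locally with logging at DEBUG
```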