From 11767d7b9771c3012eac783dadadbef31f6ba22d Mon Sep 17 00:00:00 2001 From: geoffreyjing Date: Sat, 15 Nov 2025 13:49:55 -0800 Subject: [PATCH 1/4] Update README.md --- README.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/README.md b/README.md index 47020253..f34cf194 100644 --- a/README.md +++ b/README.md @@ -7,6 +7,8 @@ + +
## Agent Behavior Monitoring (ABM) @@ -62,6 +64,7 @@ Our mission is to unlock the power of production data for agent development, ena | Online ABM | [Research Agent](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/monitoring/Research_Agent_Online_Monitoring.ipynb) | Monitor agent behavior in production | | Custom Scorers | [HumanEval](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/custom_scorers/HumanEval_Custom_Scorer.ipynb) | Build custom evaluators for your agents | | Offline Testing | [Get Started For Free] | Compare how different prompts, models, or agent configs affect performance across ANY metric | +| Prompt Comparisons | [Customer Support Agent](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/JudgmentCustomScorer.ipynb) | Test and score customer support prompts to see which style performs best | You can access our [repo of cookbooks](https://github.com/JudgmentLabs/judgment-cookbook). From 663ece3b5cb202dc03eb772cc5aaafeb9ce990de Mon Sep 17 00:00:00 2001 From: Joseph S Camyre <68767176+JCamyre@users.noreply.github.com> Date: Sat, 15 Nov 2025 13:59:09 -0800 Subject: [PATCH 2/4] Clean up blank lines in README.md Removed extra blank lines in README.md --- README.md | 2 -- 1 file changed, 2 deletions(-) diff --git a/README.md b/README.md index f34cf194..019be8a3 100644 --- a/README.md +++ b/README.md @@ -7,8 +7,6 @@ - -
## Agent Behavior Monitoring (ABM) From c18b98b4c8d1526ed597ab2dd544a807a6865956 Mon Sep 17 00:00:00 2001 From: geoffreyjing Date: Sat, 15 Nov 2025 14:36:20 -0800 Subject: [PATCH 3/4] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 019be8a3..f55c563a 100644 --- a/README.md +++ b/README.md @@ -62,7 +62,7 @@ Our mission is to unlock the power of production data for agent development, ena | Online ABM | [Research Agent](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/monitoring/Research_Agent_Online_Monitoring.ipynb) | Monitor agent behavior in production | | Custom Scorers | [HumanEval](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/custom_scorers/HumanEval_Custom_Scorer.ipynb) | Build custom evaluators for your agents | | Offline Testing | [Get Started For Free] | Compare how different prompts, models, or agent configs affect performance across ANY metric | -| Prompt Comparisons | [Customer Support Agent](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/JudgmentCustomScorer.ipynb) | Test and score customer support prompts to see which style performs best | +| Prompt Comparisons | [Customer Support Agent](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/prompt_comparisons/Customer_Support_Agent.ipynb) | Test and score customer support prompts to see which style performs best | You can access our [repo of cookbooks](https://github.com/JudgmentLabs/judgment-cookbook). From b58d8ff5e67328492ae94c78b5a1b727eac3f146 Mon Sep 17 00:00:00 2001 From: geoffreyjing Date: Sat, 15 Nov 2025 19:16:13 -0800 Subject: [PATCH 4/4] Update README.md commented out offline testing --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index f55c563a..2370acb3 100644 --- a/README.md +++ b/README.md @@ -61,7 +61,9 @@ Our mission is to unlock the power of production data for agent development, ena | RL | [Wikipedia Racer](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/rl/WikiRacingAgent_RL.ipynb) | Train agents with reinforcement learning | | Online ABM | [Research Agent](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/monitoring/Research_Agent_Online_Monitoring.ipynb) | Monitor agent behavior in production | | Custom Scorers | [HumanEval](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/custom_scorers/HumanEval_Custom_Scorer.ipynb) | Build custom evaluators for your agents | + | Prompt Comparisons | [Customer Support Agent](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/prompt_comparisons/Customer_Support_Agent.ipynb) | Test and score customer support prompts to see which style performs best | You can access our [repo of cookbooks](https://github.com/JudgmentLabs/judgment-cookbook).