Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -61,7 +61,10 @@ Our mission is to unlock the power of production data for agent development, ena
| RL | [Wikipedia Racer](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/rl/WikiRacingAgent_RL.ipynb) | Train agents with reinforcement learning |
| Online ABM | [Research Agent](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/monitoring/Research_Agent_Online_Monitoring.ipynb) | Monitor agent behavior in production |
| Custom Scorers | [HumanEval](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/custom_scorers/HumanEval_Custom_Scorer.ipynb) | Build custom evaluators for your agents |
<!--

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[Documentation]

The standalone HTML comment lines interrupt the Markdown table, so the new “Prompt Comparisons” row renders outside the table. Please move the comment onto a single line with the row (e.g., <!-- | Offline Testing ... | -->) or remove it so the table remains contiguous.

Context for Agents
[**Documentation**]

The standalone HTML comment lines interrupt the Markdown table, so the new “Prompt Comparisons” row renders outside the table. Please move the comment onto a single line with the row (e.g., `<!-- | Offline Testing ... | -->`) or remove it so the table remains contiguous.

File: README.md
Line: 64

| Offline Testing | [Get Started For Free] | Compare how different prompts, models, or agent configs affect performance across ANY metric |
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Two things:

  1. Comment the Offline testing row out - please make a Linear task to make one, assign to yourself for now
  2. Make your Customer_Support_Agent.ipynb notebook public (i.e. share -> anyone with link can view)

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the notebook should already be public, just tried opening it in an incognito tab and was able to see it

-->
| Prompt Comparisons | [Customer Support Agent](https://colab.research.google.com/github/JudgmentLabs/judgment-cookbook/blob/main/prompt_comparisons/Customer_Support_Agent.ipynb) | Test and score customer support prompts to see which style performs best |

You can access our [repo of cookbooks](https://github.com/JudgmentLabs/judgment-cookbook).

Expand Down
Loading