pamelafox
Contributor

No description provided.

@pamelafox
Contributor Author

/evaluate

Starting evaluation! Check the Actions tab for progress, or wait for a comment with the results.

| metric | stat | baseline | pr105 |
| --- | --- | --- | --- |
| gpt_groundedness | pass_rate | 1.0 | 1.0 |
| | mean_rating | 5.0 | 5.0 |
| gpt_relevance | pass_rate | 1.0 | 1.0 |
| | mean_rating | 5.0 | 5.0 |
| answer_length | mean | 978.9 | 927.6 |
| latency | mean | 2.51 | 2.07 |
| citation_match | rate | 1.0 | 1.0 |
| num_questions | total | 10 | 10 |
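
To make the comparison easier to read, here is a minimal sketch that computes the per-metric difference between the two columns. The numbers are copied from the results above; the `RESULTS` layout and the `metric_deltas` helper are hypothetical names used for illustration and are not part of the evaluation workflow.

```python
# Values copied from the evaluation results in this conversation.
# The dict layout and helper name are illustrative assumptions only.
RESULTS = {
    ("gpt_groundedness", "pass_rate"): (1.0, 1.0),
    ("gpt_groundedness", "mean_rating"): (5.0, 5.0),
    ("gpt_relevance", "pass_rate"): (1.0, 1.0),
    ("gpt_relevance", "mean_rating"): (5.0, 5.0),
    ("answer_length", "mean"): (978.9, 927.6),
    ("latency", "mean"): (2.51, 2.07),
    ("citation_match", "rate"): (1.0, 1.0),
    ("num_questions", "total"): (10, 10),
}


def metric_deltas(results):
    """Return {(metric, stat): pr - baseline}, rounded to 2 decimals."""
    return {key: round(pr - base, 2) for key, (base, pr) in results.items()}


deltas = metric_deltas(RESULTS)

# Print only the rows where the PR differs from the baseline.
for (metric, stat), delta in deltas.items():
    if delta != 0:
        print(f"{metric} {stat}: {delta:+.2f}")
```

With these numbers, only `answer_length` and `latency` differ from the baseline; both are lower on the PR, and every quality metric is unchanged.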

@pamelafox changed the title from "Test eval" to "Evaluation workflow should check out PR" on Oct 22, 2024
@pamelafox merged commit 654c712 into main on Oct 22, 2024
1 check passed
@pamelafox deleted the testeval4 branch on October 22, 2024 20:47