make empirical accuracy calculation more robust #109

jbloom · 2025-10-24T14:29:51Z

This pull request addresses this issue, which was originally pointed out by @fc-jian.

The key point is that the computational of the empirical accuracy (consensus.empirical_accuracy) ran into numerical issues if the numbers were large as it involved computing very large numbers and then taking their logs. In #106, @fc-jian originally proposed using Stirling's approximation, and made a draft pull request #108 to fix that.

However, in looking more I discovered that the built-in python gammaln function is even a better way to do this. I also updated the docs to describe the new math being done.

This pull request therefore solves #106 and is in lieu of #108, as I think it is a better solution.

@fc-jian, thanks so much for noting and flagging all of this!

@fc-jian

This pull request addresses [this issue](#106), which was originally pointed out by @fc-jian. The key point is that the computational of the empirical accuracy (`consensus.empirical_accuracy`) ran into numerical issues if the numbers were large as it involved computing very large numbers and then taking their logs. In #106, @fc-jian originally proposed using Stirling's approximation, and made a draft pull request #108 to fix that. However, in looking more I discovered that the built-in python `gammaln` function is even a better way to do this. I also updated the docs to describe the new math being done. This pull request therefore solves #106 and is in lieu of #108, as I think it is a better solution. @fc-jian, thanks so much for noting and flagging all of this!

fc-jian · 2025-10-24T15:09:00Z

Thanks for the discussion and fixing. Sorry I am not familiar with the format requirements of this repo haha.

jbloom linked an issue Oct 24, 2025 that may be closed by this pull request

Overflow during the estimation of empirical accuracy #106

Closed

This was referenced Oct 24, 2025

Overflow during the estimation of empirical accuracy #106

Closed

Fix potential overflow in empirical accuracy calculation #108

Closed

jbloom merged commit 17e38a9 into master Oct 24, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

make empirical accuracy calculation more robust #109

make empirical accuracy calculation more robust #109

Uh oh!

jbloom commented Oct 24, 2025

Uh oh!

Uh oh!

fc-jian commented Oct 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

make empirical accuracy calculation more robust #109

make empirical accuracy calculation more robust #109

Uh oh!

Conversation

jbloom commented Oct 24, 2025

Uh oh!

Uh oh!

fc-jian commented Oct 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants