Skip to content

Do not store/compute probabilities#12

Merged
ravwojdyla merged 1 commit intomainfrom
rav-rm-probs-inf
Jun 20, 2025
Merged

Do not store/compute probabilities#12
ravwojdyla merged 1 commit intomainfrom
rav-rm-probs-inf

Conversation

@ravwojdyla
Copy link
Copy Markdown
Contributor

They are not immediately useful (for our current effort), and we can easily compute them adhoc from the logits. The major downside of storing them, is that they take quite a bit of space/IO, especially for full hg38. For reference, using the current code, for full hg38, the probabilities zarr group would require more than 100GB on disk. Obviously there's many ways to reduce that size, but I would argue, that's not that interesting and useful for us ATM.

@ravwojdyla ravwojdyla requested a review from yonromai June 20, 2025 05:17
@ravwojdyla ravwojdyla changed the title Do not store probabilities from inference step Do not store/compute probabilities Jun 20, 2025
Copy link
Copy Markdown
Contributor

@yonromai yonromai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sounds good!

@ravwojdyla ravwojdyla merged commit 24d9e8a into main Jun 20, 2025
2 checks passed
@ravwojdyla ravwojdyla deleted the rav-rm-probs-inf branch June 20, 2025 18:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants