Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FT] Single token completion loglikelihood auto-detection #355

Open
hynky1999 opened this issue Oct 10, 2024 · 0 comments
Open

[FT] Single token completion loglikelihood auto-detection #355

hynky1999 opened this issue Oct 10, 2024 · 0 comments
Labels
feature request New feature/request low prio

Comments

@hynky1999
Copy link
Collaborator

Issue encountered

  • If all choices for loglikehood task are exactly one token, one can only run single pass to compute their logprobs. This is the case of MCF formulation (A/B/C) of tasks, which is the most used. However currently if one wants to leverage this fast evaluation, he needs to use special metric variant (metric_single_token). This is not only annoying to maintain, but many users don't know about this and don't benefit from potential speed-up.

Solution/Feature

We could detect the single token case automatically during loglikehood requests computation.

  1. Group loglikehood requests by context
  2. From each group select those that have exactly one token
  3. Run the single_token requests from each group using the single-token workflow.

Benefits

Huge speed-up all mcq tasks. Easier to maintain models and metrics (no need to create and handle single token variants of metrics)

@hynky1999 hynky1999 added feature request New feature/request low prio labels Oct 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature/request low prio
Projects
None yet
Development

No branches or pull requests

1 participant