Description
Currently each training loop includes an evaluation loop, but it has not been debugged or used so far.
It needs to be generalised so that it can also be launched outside of training, and it should support metrics specific to language modelling.
It would also be useful to generate a report highlighting the performance achieved, including a comparison with other models.
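As a rough illustration of the standalone direction, below is a minimal sketch of an evaluation pass that computes loss and perplexity. It assumes a HuggingFace-style causal LM that returns `.loss` when `labels` are passed, and a DataLoader yielding dicts of token tensors; the name `evaluate_perplexity` and all argument names are illustrative, not existing code in this repo.

```python
import math
import torch


@torch.no_grad()
def evaluate_perplexity(model, eval_loader, device="cpu"):
    """Standalone evaluation pass returning mean loss and perplexity.

    Sketch only: assumes a HuggingFace-style causal LM (returns `.loss`
    when `labels` are provided) and batches shaped as dicts of tensors.
    """
    model.eval()
    total_loss, total_tokens = 0.0, 0

    for batch in eval_loader:
        input_ids = batch["input_ids"].to(device)
        attention_mask = batch["attention_mask"].to(device)

        outputs = model(
            input_ids=input_ids,
            attention_mask=attention_mask,
            labels=input_ids,
        )
        # Weight the mean batch loss by the number of non-padding tokens
        # so batches of different sizes contribute proportionally.
        n_tokens = attention_mask.sum().item()
        total_loss += outputs.loss.item() * n_tokens
        total_tokens += n_tokens

    mean_loss = total_loss / max(total_tokens, 1)
    return {"loss": mean_loss, "perplexity": math.exp(mean_loss)}
```

Because the function takes the model and dataloader as arguments rather than reading trainer state, the same code could be called both from inside the training loop and as a separate evaluation entry point.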
TODO
Investigate whether libraries such as openai/evals or FastChat can be adapted for use as evaluation tools.
Debug the evaluation of the model.
Collect and compute relevant metrics.
Make the evaluation loop launchable outside of training.
Produce a meaningful report comparing the performance of one or more models (see the sketch after this list).
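For the comparison report, one possible shape is a small helper that renders per-model metrics as a plain-text table. `format_report` and the example numbers below are placeholders, not existing code; the input is assumed to be the metrics dict returned by an evaluation pass like the one sketched above.

```python
def format_report(results):
    """Render a plain-text table comparing metrics across models.

    `results` maps a model name to a metrics dict,
    e.g. {"loss": 1.98, "perplexity": 7.24}.
    """
    metrics = sorted({k for m in results.values() for k in m})
    header = ["model"] + metrics
    rows = [header] + [
        [name] + [f"{m.get(k, float('nan')):.4f}" for k in metrics]
        for name, m in results.items()
    ]
    # Pad each column to its widest cell for alignment.
    widths = [max(len(row[i]) for row in rows) for i in range(len(header))]
    return "\n".join(
        "  ".join(cell.ljust(w) for cell, w in zip(row, widths)) for row in rows
    )


# Example usage with placeholder numbers:
# print(format_report({
#     "baseline": {"loss": 2.31, "perplexity": 10.07},
#     "finetuned": {"loss": 1.98, "perplexity": 7.24},
# }))
```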