adding eval through mxeval #3

tarsur909 · 2023-10-29T21:07:25Z

No description provided.

shubhamugare · 2023-10-29T21:13:24Z

llm_cfg/core/evaluation.py

@@ -1,5 +1,5 @@
 import time
-from human_eval.data import write_jsonl, read_problems
+from mxeval.data import write_jsonl, read_problems, get_data


Can you update requirements.txt?

shubhamugare · 2023-10-29T21:14:40Z

llm_cfg/infer.py

@@ -25,10 +25,12 @@
    p.add_argument("--quantize", type=bool, default=True)
    p.add_argument("--gpu", type=int, default=1)
    p.add_argument("--num_samples", type=int, default=1)
+    p.add_argument("--language", choices = ["python", "go"], default = "python", help = "language")
+    p.add_argument("--dataset", choices = ["mbxp", "multi-humaneval", "mathqa-x"], default = "mbxp", help = "dataset")


let's just call it it humaneval and make that the default

Also, just mathqa

shubhamugare · 2023-10-29T21:37:00Z

llm_cfg/evaluation_mxeval.py

@@ -0,0 +1,35 @@
+import sys


We already have a file doing this called evaluation.py, can we just modify that file? That file additionally computes types of errors while doing the evaluation.

You don't need language, dataset and k as arguments. language and dataset information is present in the filename.

and all pass@k are computed for no extra cost when n>k?

tarsur909 and others added 2 commits October 29, 2023 21:07

adding eval through mxeval

e4321ef

Merge branch 'main' into mxeval

a19814a

shubhamugare reviewed Oct 29, 2023

View reviewed changes

tarsur909 added 2 commits October 29, 2023 21:27

update requirements and parse_args

8c23f13

update requirements and parse_args

cf12037

shubhamugare reviewed Oct 29, 2023

View reviewed changes

tarsur909 added 2 commits October 30, 2023 01:22

merge evaluations

6211db9

merge evaluations

d616153

shubhamugare approved these changes Oct 30, 2023

View reviewed changes

shubhamugare merged commit 5783d24 into main Oct 30, 2023
1 check failed

shubhamugare deleted the mxeval branch February 24, 2024 22:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adding eval through mxeval #3

adding eval through mxeval #3

tarsur909 commented Oct 29, 2023

shubhamugare Oct 29, 2023

shubhamugare Oct 29, 2023

shubhamugare Oct 29, 2023

shubhamugare Oct 29, 2023

adding eval through mxeval #3

adding eval through mxeval #3

Conversation

tarsur909 commented Oct 29, 2023

shubhamugare Oct 29, 2023

Choose a reason for hiding this comment

shubhamugare Oct 29, 2023

Choose a reason for hiding this comment

shubhamugare Oct 29, 2023

Choose a reason for hiding this comment

shubhamugare Oct 29, 2023

Choose a reason for hiding this comment