|
Wandb Run: https://wandb.ai/eleutherai/pythia-rlhf/runs/gy2g8jj1 |
|
|
|
Model Evals: |
|
| Tasks |Version|Filter| Metric |Value | |Stderr| |
|
|--------------|-------|------|----------|-----:|---|-----:| |
|
|arc_challenge |Yaml |none |acc |0.2253|± |0.0122| |
|
| | |none |acc_norm |0.2278|± |0.0123| |
|
|arc_easy |Yaml |none |acc |0.2551|± |0.0089| |
|
| | |none |acc_norm |0.2567|± |0.0090| |
|
|lambada_openai|Yaml |none |perplexity| NaN|± | NaN| |
|
| | |none |acc |0.0016|± |0.0005| |
|
|logiqa |Yaml |none |acc |0.2028|± |0.0158| |
|
| | |none |acc_norm |0.2028|± |0.0158| |
|
|piqa |Yaml |none |acc |0.4946|± |0.0117| |
|
| | |none |acc_norm |0.4924|± |0.0117| |
|
|sciq |Yaml |none |acc |0.0140|± |0.0037| |
|
| | |none |acc_norm |0.0140|± |0.0037| |
|
|winogrande |Yaml |none |acc |0.5036|± |0.0141| |
|
|wsc |Yaml |none |acc |0.6346|± |0.0474| |