jon-tow committed on
Commit 7a8af0a · 1 Parent(s): b4759d5

Update README.md

Files changed (1)
  1. README.md +1 -0
README.md CHANGED
@@ -5,6 +5,7 @@ language:
 ---
 
 GPT-J (with value head weights) trained on HH with PPO following @reciprocate's `trlx` example https://github.com/CarperAI/trlx/blob/2f90ba0ecd640ae18cd62adb5e934a4b779f534b/examples/hh/ppo_hh.py
+
 Logs: https://wandb.ai/jon-tow/trlx/reports/hh-gpt-j--VmlldzozODE1NjAw
 
 Usage: