August Moharrami
August4293
·
AI & ML interests
None yet
Recent Activity
updated
a dataset
7 days ago
August4293/tldr-preference-sft-trl-style-sample
updated
a collection
13 days ago
RL Fine-tuning Reasoning
liked
a model
14 days ago
trl-internal-testing/tiny-LlamaForCausalLM-3.2