-
mistralai/Mistral-7B-Instruct-v0.2
Text Generation • Updated • 4.43M • • 2.6k -
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation • Updated • 3.85M • • 4.25k -
mistralai/Mixtral-8x7B-v0.1
Text Generation • Updated • 3.56M • 1.66k -
PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Paper • 2403.10704 • Published • 57
Molone Laveh PRO
molonelaveh
·
AI & ML interests
convergence, multi-modality, multi-agent, LLM, research
Recent Activity
liked
a Space
19 days ago
fallenshock/FlowEdit
liked
a Space
21 days ago
argilla/synthetic-data-generator-argilla-reviewer
liked
a Space
21 days ago
autotrain-projects/autotrain-advanced
Organizations
Collections
2
models
None public yet
datasets
None public yet