"A Closer Look at the Limitations of Instruction Tuning" is a new paper that explores the efficacy and limitations of Instruction Tuning (IT) in Large Language Models (LLMs) for conversational agents. The authors conduct a series of experiments using both LoRA fine-tuning (LFT) and standard full-parameter fine-tuning (SFT) across various LLMs and IT datasets.
The key findings are:

* LoRA fine-tuning (LFT) preserves the pre-training token distribution, while SFT does not. This indicates that after LoRA fine-tuning the model still relies heavily on its pre-training knowledge and does not acquire new information (one rough way to probe this is sketched below).
* Dataset scaling is ineffective for LFT: experiments show that scaling the dataset size 52x or even 326x does not improve performance.
* LoRA fine-tuning mainly enhances response initiation and style, without substantial knowledge enhancement.
* Full-parameter fine-tuning tends to degrade the LLM's knowledge base and increase hallucination occurrences.
* Other popular methods and adjustments fail to significantly outperform simple LoRA fine-tuned models in terms of conversational quality and accuracy.
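To make the first finding concrete, one homegrown way to probe token-distribution shift is to compare the next-token distributions of the base and fine-tuned models on the same prompt. This is an illustrative check, not the paper's exact analysis; the model name and checkpoint path below are hypothetical placeholders:

```python
# Compare next-token distributions of base vs. fine-tuned model.
# A small KL divergence suggests the fine-tuned model still samples
# from (roughly) the pre-training token distribution.
import torch
import torch.nn.functional as F
from transformers import AutoTokenizer, AutoModelForCausalLM

name = "meta-llama/Llama-2-7b-hf"  # placeholder base model
tok = AutoTokenizer.from_pretrained(name)
base = AutoModelForCausalLM.from_pretrained(name)
tuned = AutoModelForCausalLM.from_pretrained("path/to/fine-tuned")  # hypothetical checkpoint

inputs = tok("Instruction: explain photosynthesis.\nResponse:", return_tensors="pt")
with torch.no_grad():
    p = F.log_softmax(base(**inputs).logits[0, -1], dim=-1)   # base log-probs
    q = F.log_softmax(tuned(**inputs).logits[0, -1], dim=-1)  # tuned log-probs

# F.kl_div(input, target, log_target=True) computes KL(target || input),
# so this is KL(tuned || base) over the vocabulary.
kl = F.kl_div(p, q, log_target=True, reduction="sum")
print(f"KL(tuned || base): {kl.item():.4f}")
```

Averaging this quantity over many prompts and token positions would give a more faithful picture than a single next-token comparison.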
Congrats to @Sreyan88 and the other authors for their work!