omarcevi
's Collections
Papers4Reading
updated
CLEAR: Character Unlearning in Textual and Visual Modalities
Paper
•
2410.18057
•
Published
•
200
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation
Generation
Paper
•
2410.23090
•
Published
•
54
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A
Gradient Perspective
Paper
•
2410.23743
•
Published
•
59
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM
Quantization
Paper
•
2411.02355
•
Published
•
46
Benchmarking and Dissecting the Nvidia Hopper GPU Architecture
Paper
•
2402.13499
•
Published
Balancing Pipeline Parallelism with Vocabulary Parallelism
Paper
•
2411.05288
•
Published
•
19
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Paper
•
2411.04905
•
Published
•
113
Add-it: Training-Free Object Insertion in Images With Pretrained
Diffusion Models
Paper
•
2411.07232
•
Published
•
63
BERT: Pre-training of Deep Bidirectional Transformers for Language
Understanding
Paper
•
1810.04805
•
Published
•
16
Paper
•
2401.04088
•
Published
•
158