Artur Daveyan's picture

67 575

Artur Daveyan

ArturD

·

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

Kijai/HunyuanVideo_comfy

liked a model 8 days ago

NexaAIDev/OmniAudio-2.6B

liked a model 11 days ago

JeffreyXiang/TRELLIS-image-large

View all activity

Organizations

None yet

ArturD's activity

upvoted an article about 1 month ago

Article

EuroLLM-9B

By

•

Dec 2, 2024

• 105

upvoted a collection about 1 month ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 17 items • Updated 17 days ago • 67

upvoted a collection 2 months ago

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 19 days ago • 96

upvoted a collection 3 months ago

Pangea

A Fully Open Multilingual Multimodal LLM for 39 Languages • 18 items • Updated Nov 2, 2024 • 18

upvoted a paper 3 months ago

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21, 2024 • 44

upvoted 2 collections 3 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated about 10 hours ago • 149

INTELLECT-1 Dataset

INTELLECT-1 Training dataset • 5 items • Updated Oct 8, 2024 • 21

upvoted an article 3 months ago

Article

License to Call: Introducing Transformers Agents 2.0

May 13, 2024

• 122

upvoted 3 papers 3 months ago

Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning

Paper • 2410.00255 • Published Sep 30, 2024 • 5

From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging

Paper • 2410.01215 • Published Oct 2, 2024 • 30

TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices

Paper • 2410.00531 • Published Oct 1, 2024 • 30

upvoted 3 collections 3 months ago

Yi-Coder

4 items • Updated Sep 4, 2024 • 32

BRAG-v0.1

BRAG is a series of SLMs (Small Language Models) specifically trained for RAG tasks. We release models with size 1.5b, 7b and 8b. • 4 items • Updated Aug 4, 2024 • 13

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated about 1 month ago • 551

upvoted a paper 3 months ago

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24, 2024 • 42

upvoted an article 3 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 180

upvoted a paper 3 months ago

Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts

Paper • 2409.16040 • Published Sep 24, 2024 • 12

upvoted an article 4 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 215

upvoted 2 papers 4 months ago

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 108

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18, 2024 • 37