zijie tian

zijie-tian

https://zijie-tian.github.io

Zijie-Tian

AI & ML interests

Storage for AI

zijie-tian's activity

upvoted an article 4 days ago

Fast, High-Fidelity LLM Decoding with Regex Constraints

Article • Feb 23, 2024 • 6

upvoted a paper 11 days ago

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4, 2024 • 90

liked a dataset about 2 months ago

mit-han-lab/pile-val-backup

Viewer • Updated Aug 21, 2023 • 215k • 21k • 14

upvoted a paper 2 months ago

InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference

Paper • 2409.04992 • Published Sep 8, 2024 • 2

liked a Space 2 months ago

⚡ paper-central

Space • Running • 203

upvoted 3 papers 3 months ago

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16, 2024 • 41

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 605

Selective Attention Improves Transformer

Paper • 2410.02703 • Published Oct 3, 2024 • 24

liked a model 3 months ago

openbmb/MiniCPM3-4B

Text Generation • Updated Nov 30, 2024 • 41.3k • 397

upvoted an article 3 months ago

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Article • Sep 18, 2024 • 215

liked a Space 4 months ago

🏆 Zero Bubble Pipeline Parallellism

Space • Running

upvoted a paper 5 months ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 257

liked a dataset 6 months ago

stas/openwebtext-10k

Viewer • Updated Sep 15, 2021 • 10k • 7.42k • 25

liked a model almost 2 years ago

EleutherAI/gpt-j-6b

Text Generation • Updated Jun 21, 2023 • 189k • 1.46k

liked a dataset almost 2 years ago

laion/relaion2B-en-research-safe

Viewer • Updated Jul 2, 2024 • 2.1B • 1.34k • 191

liked a model over 2 years ago

EleutherAI/gpt-neox-20b

Text Generation • Updated Jan 31, 2024 • 16.9k • 548