
Lim

liminalism

zhiQ

AI & ML interests

Day 1

Recent Activity

liked a model 21 days ago

Datou1111/shou_xin



liminalism's activity

upvoted a paper 8 months ago

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published Apr 30, 2024 • 73

upvoted 2 papers 9 months ago

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published Apr 25, 2024 • 52

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22, 2024 • 126

upvoted a collection 9 months ago

OpenELM Instruct Models

Collection

4 items • Updated Oct 4, 2024 • 115

upvoted 3 papers 9 months ago

FlowMind: Automatic Workflow Generation with LLMs

Paper • 2404.13050 • Published Mar 17, 2024 • 34

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Paper • 2404.05961 • Published Apr 9, 2024 • 65

RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

Paper • 2403.05313 • Published Mar 8, 2024 • 9

upvoted an article 9 months ago

CodeGemma - an official Google release for code LLMs

Article • Published Apr 9, 2024 • 99

upvoted a paper 9 months ago

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4, 2024 • 91

upvoted 2 collections 9 months ago

🤖 Agents

Collection

21 items • Updated 8 days ago • 86

MoEs papers reading list

Collection

60 items • Updated Nov 4, 2024 • 137

upvoted a paper 9 months ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 104

upvoted 2 papers 10 months ago

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Paper • 2403.15042 • Published Mar 22, 2024 • 25

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 125

upvoted a paper 12 months ago

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8, 2024 • 158

upvoted a paper about 1 year ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 257