1 62 19

js

rldy

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

katanemo/Arch-Function-3B

upvoted a paper 10 days ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

upvoted a paper 15 days ago

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

View all activity

Organizations

rldy's activity

liked a model 4 days ago

katanemo/Arch-Function-3B

Text Generation • Updated Dec 2, 2024 • 14.9k • 73

upvoted a paper 10 days ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published 13 days ago • 34

upvoted a paper 15 days ago

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Paper • 2407.21787 • Published Jul 31, 2024 • 12

upvoted a paper 16 days ago

Solving math word problems with process- and outcome-based feedback

Paper • 2211.14275 • Published Nov 25, 2022 • 7

liked a Space 16 days ago

Running

430

📈

Scaling test-time compute

upvoted 2 papers 17 days ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 54

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published 20 days ago • 91

liked a dataset 18 days ago

HuggingFaceTB/finemath

Viewer • Updated 14 days ago • 48.3M • 31k • 224

upvoted a paper 20 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published 24 days ago • 82

upvoted a collection 27 days ago

EXAONE-3.5

Collection

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. • 10 items • Updated 27 days ago • 85

upvoted 2 papers about 1 month ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 56

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 48

liked a model about 1 month ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Nov 29, 2024 • 94.7k • • 1.5k

upvoted 2 papers about 1 month ago

Patience Is The Key to Large Language Model Reasoning

Paper • 2411.13082 • Published Nov 20, 2024 • 7

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 58

upvoted 5 papers about 2 months ago

Loss-to-Loss Prediction: Scaling Laws for All Datasets

Paper • 2411.12925 • Published Nov 19, 2024 • 5

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 28

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 71

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 19

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 48