Ashiq Rahman

TangoDJ

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

The Unbearable Slowness of Being: Why do we live at 10 bits/s?

updated a collection 19 days ago

Papers

updated a collection 26 days ago

Papers

View all activity

Organizations

TangoDJ's activity

upvoted a paper 17 days ago

The Unbearable Slowness of Being: Why do we live at 10 bits/s?

Paper • 2408.10234 • Published Aug 3, 2024 • 1

upvoted 2 papers 26 days ago

An Evolved Universal Transformer Memory

Paper • 2410.13166 • Published Oct 17, 2024 • 3

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published 27 days ago • 66

upvoted 2 papers 27 days ago

Flow Matching Guide and Code

Paper • 2412.06264 • Published 28 days ago • 1

Discovering Preference Optimization Algorithms with and for Large Language Models

Paper • 2406.08414 • Published Jun 12, 2024 • 14

upvoted a paper 28 days ago

Reinforcement Learning: An Overview

Paper • 2412.05265 • Published about 1 month ago • 4

upvoted 6 papers about 2 months ago

Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations

Paper • 2411.00640 • Published Nov 1, 2024 • 3

upvoted a collection about 2 months ago

Cosmos Tokenizer

Collection

A suite of image and video tokenizers • 12 items • Updated about 10 hours ago • 29

upvoted a collection 4 months ago

LLM Reasoning Papers

Collection

Papers to improve reasoning capabilities of LLMs • 17 items • Updated 14 days ago • 93

upvoted 2 papers 9 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 253

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 104

upvoted 2 papers 10 months ago

Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22, 2024 • 23

Humanoid Locomotion as Next Token Prediction

Paper • 2402.19469 • Published Feb 29, 2024 • 26

upvoted 2 papers 11 months ago

Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Paper • 2402.14083 • Published Feb 21, 2024 • 47

Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20, 2024 • 95