Sugato Ray's picture

Sugato Ray

sugatoray

·

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

updated a collection about 22 hours ago

LLM Training Datasets

updated a collection about 22 hours ago

liked a Space about 22 hours ago

davidberenstein1957/transformers-pipeline-playground

View all activity

Organizations

sugatoray's activity

commented a paper 2 days ago

Xmodel-2 Technical Report

Paper • 2412.19638 • Published 8 days ago • 18 •

commented a paper about 1 month ago

Stream of Search (SoS): Learning to Search in Language

Paper • 2404.03683 • Published Apr 1, 2024 • 29 •

commented 2 papers 3 months ago

Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models

Paper • 2406.04806 • Published Jun 7, 2024 • 1 •

Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

Paper • 2405.06682 • Published May 5, 2024 • 3 •

New activity in dvilasuero/img-prefs-distilabel 4 months ago

Update README.md with process-howto information

#2 opened 4 months ago by

commented 3 papers 5 months ago

Probabilistic Programming with Programmable Variational Inference

Paper • 2406.15742 • Published Jun 22, 2024 • 2 •

Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows

Paper • 2406.16218 • Published Jun 23, 2024 • 2 •

TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON

Paper • 2407.15734 • Published Jul 22, 2024 • 1 •

commented 3 papers 6 months ago

Grokfast: Accelerated Grokking by Amplifying Slow Gradients

Paper • 2405.20233 • Published May 30, 2024 • 6 •

HyperZ$\cdot$Z$\cdot$W Operator Connects Slow-Fast Networks for Full Context Interaction

Paper • 2401.17948 • Published Jan 31, 2024 • 2 •

Extreme Compression of Large Language Models via Additive Quantization

Paper • 2401.06118 • Published Jan 11, 2024 • 12 •

New activity in sugatoray/DeepSeek-Coder-V2-Lite-Instruct-Q4_K_M-GGUF 7 months ago

Add banner image to README.md

#2 opened 7 months ago by

Upload llama.png

#1 opened 7 months ago by

commented 2 papers 7 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 12 •

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 34 •

commented 2 papers 8 months ago

Zero-Shot Tokenizer Transfer

Paper • 2405.07883 • Published May 13, 2024 • 5 •

Automating the Enterprise with Foundation Models

Paper • 2405.03710 • Published May 3, 2024 • 1 •

New activity in unalignment/toxic-dpo-v0.2 9 months ago

Update README.md

#2 opened 9 months ago by

New activity in HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1 9 months ago

Update config.json

#11 opened 9 months ago by

New activity in mlx-community/stable-code-instruct-3b-4bit 9 months ago

Update config.json

#1 opened 9 months ago by