Nikita Balagansky's picture

5 3

Nikita Balagansky

elephantmipt

·

AI & ML interests

None yet

Recent Activity

updated a model 5 days ago

elephantmipt/sae_Qwen_Qwen2.5-7B_resid_pre_layer_24_size_16384_batchtopk_reg_coeff_0.0018

updated a model 5 days ago

elephantmipt/sae_Qwen_Qwen2.5-7B_resid_pre_layer_18_size_16384_batchtopk_reg_coeff_0.0018

updated a model 5 days ago

elephantmipt/sae_Qwen_Qwen2.5-7B_resid_pre_layer_12_size_16384_batchtopk_reg_coeff_0.0018

View all activity

Organizations

elephantmipt's activity

upvoted a paper 3 months ago

Mechanistic Permutability: Match Features Across Layers

Paper • 2410.07656 • Published Oct 10, 2024 • 17

upvoted 2 papers 7 months ago

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13, 2024 • 87

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31, 2024 • 64

upvoted 2 papers 9 months ago

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2, 2024 • 104

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 82