haruka's picture

19 5

haruka

harukafika

·

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

uer/sbert-base-chinese-nli

liked a model 6 days ago

deepseek-ai/DeepSeek-V3-Base

upvoted a paper 26 days ago

APOLLO: SGD-like Memory, AdamW-level Performance

View all activity

Organizations

None yet

harukafika's activity

upvoted 4 papers 26 days ago

APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published about 1 month ago • 38

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 28 days ago • 72

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published 28 days ago • 71

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published 27 days ago • 66

upvoted 14 papers 3 months ago

FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors

Paper • 2410.16271 • Published Oct 21, 2024 • 81

A Comparative Study on Reasoning Patterns of OpenAI's o1 Model

Paper • 2410.13639 • Published Oct 17, 2024 • 16

PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment

Paper • 2410.13785 • Published Oct 17, 2024 • 19

Mechanistic Permutability: Match Features Across Layers

Paper • 2410.07656 • Published Oct 10, 2024 • 17

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation

Paper • 2410.09009 • Published Oct 11, 2024 • 14

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Paper • 2410.09008 • Published Oct 11, 2024 • 17

Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining

Paper • 2410.08102 • Published Oct 10, 2024 • 20

PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness

Paper • 2410.07035 • Published Oct 9, 2024 • 17

From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning

Paper • 2410.06456 • Published Oct 9, 2024 • 36

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Paper • 2410.08815 • Published Oct 11, 2024 • 44

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14, 2024 • 18

Tree of Problems: Improving structured problem solving with compositionality

Paper • 2410.06634 • Published Oct 9, 2024 • 8

Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations

Paper • 2410.10792 • Published Oct 14, 2024 • 29

Toward General Instruction-Following Alignment for Retrieval-Augmented Generation

Paper • 2410.09584 • Published Oct 12, 2024 • 47

upvoted a paper 5 months ago

Better Alignment with Instruction Back-and-Forth Translation

Paper • 2408.04614 • Published Aug 8, 2024 • 14