LLM in a flash: Efficient Large Language Model Inference with Limited Memory (arXiv:2312.11514, published Dec 12, 2023)
Accelerating LLM Inference with Staged Speculative Decoding (arXiv:2308.04623, published Aug 8, 2023)