JiHwanYoon's picture

19 42

JiHwanYoon

ohilikeit

·

ohilikeit

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

dunzhang/stella_en_1.5B_v5

View all activity

Organizations

ohilikeit's activity

upvoted a paper 7 months ago

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 55

upvoted a paper 8 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 118

upvoted a collection 8 months ago

Quantized-FT-Orca-Math

Models trained during quantization aware fine-tuning experiments using PyTorch's FSDP. • 8 items • Updated Aug 20, 2024 • 7

upvoted 2 papers 9 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 253

Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models

Paper • 2402.14714 • Published Feb 22, 2024 • 4

upvoted 7 papers 10 months ago

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15, 2024 • 57

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 605

Gemma: Open Models Based on Gemini Research and Technology

Paper • 2403.08295 • Published Mar 13, 2024 • 47

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5, 2024 • 93

AtP*: An efficient and scalable method for localizing LLM behaviour to components

Paper • 2403.00745 • Published Mar 1, 2024 • 12

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 126

Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23, 2024 • 70

upvoted 5 papers 11 months ago

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

Paper • 2402.12226 • Published Feb 19, 2024 • 41

L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects

Paper • 2402.09052 • Published Feb 14, 2024 • 17

Premise Order Matters in Reasoning with Large Language Models

Paper • 2402.08939 • Published Feb 14, 2024 • 27

Efficient Tool Use with Chain-of-Abstraction Reasoning

Paper • 2401.17464 • Published Jan 30, 2024 • 17

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model

Paper • 2401.16420 • Published Jan 29, 2024 • 55

upvoted 2 papers 12 months ago

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Paper • 2401.10774 • Published Jan 19, 2024 • 54

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Paper • 2401.10891 • Published Jan 19, 2024 • 60