vuha14
's Collections
reviewing
updated
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue
Summarization
Paper
•
2402.13249
•
Published
•
11
The FinBen: An Holistic Financial Benchmark for Large Language Models
Paper
•
2402.12659
•
Published
•
17
Instruction-tuned Language Models are Better Knowledge Learners
Paper
•
2402.12847
•
Published
•
25
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for
Language Models
Paper
•
2402.13064
•
Published
•
47
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Paper
•
2402.13753
•
Published
•
114
User-LLM: Efficient LLM Contextualization with User Embeddings
Paper
•
2402.13598
•
Published
•
19
Coercing LLMs to do and reveal (almost) anything
Paper
•
2402.14020
•
Published
•
12
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting
Paper
•
2402.13720
•
Published
•
6
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language
Models
Paper
•
2402.10986
•
Published
•
77
Speculative Streaming: Fast LLM Inference without Auxiliary Models
Paper
•
2402.11131
•
Published
•
42
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Paper
•
2402.12226
•
Published
•
41
Paper
•
2402.12219
•
Published
•
16
OneBit: Towards Extremely Low-bit Large Language Models
Paper
•
2402.11295
•
Published
•
23
LongAgent: Scaling Language Models to 128k Context through Multi-Agent
Collaboration
Paper
•
2402.11550
•
Published
•
16
CoLLaVO: Crayon Large Language and Vision mOdel
Paper
•
2402.11248
•
Published
•
20
GLoRe: When, Where, and How to Improve LLM Reasoning via Global and
Local Refinements
Paper
•
2402.10963
•
Published
•
10
Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning
Paper
•
2402.11690
•
Published
•
8
Linear Transformers with Learnable Kernel Functions are Better
In-Context Models
Paper
•
2402.10644
•
Published
•
79
In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs
Miss
Paper
•
2402.10790
•
Published
•
41
SPAR: Personalized Content-Based Recommendation via Long Engagement
Attention
Paper
•
2402.10555
•
Published
•
34
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM
Workflows
Paper
•
2402.10379
•
Published
•
30
LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video
Editing
Paper
•
2402.10294
•
Published
•
24
LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large
Language Models
Paper
•
2402.10524
•
Published
•
22
Large Language Models as Zero-shot Dialogue State Tracker through
Function Calling
Paper
•
2402.10466
•
Published
•
17
OpenCodeInterpreter: Integrating Code Generation with Execution and
Refinement
Paper
•
2402.14658
•
Published
•
82
AgentScope: A Flexible yet Robust Multi-Agent Platform
Paper
•
2402.14034
•
Published
•
12
OmniPred: Language Models as Universal Regressors
Paper
•
2402.14547
•
Published
•
12
TinyLLaVA: A Framework of Small-scale Large Multimodal Models
Paper
•
2402.14289
•
Published
•
19
Scaling Up LLM Reviews for Google Ads Content Moderation
Paper
•
2402.14590
•
Published
•
8
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Paper
•
2402.14261
•
Published
•
10
Linear Transformers are Versatile In-Context Learners
Paper
•
2402.14180
•
Published
•
6