arxiv:2501.02506
Siyu Yuan
siyuyuan
AI & ML interests
Knowledge generation
Recent Activity
Authored a paper 2 days ago:
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
Organizations
Papers: 1
Models: None public yet
Datasets: None public yet