TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video Paper • 2411.18671 • Published Nov 27, 2024 • 20
F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions Paper • 2407.12435 • Published Jul 17, 2024 • 14
MotionLLM: Understanding Human Behaviors from Human Motions and Videos Paper • 2405.20340 • Published May 30, 2024 • 20
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection Paper • 2405.10300 • Published May 16, 2024 • 26
Grounding DINO 1.5 IDEA Research's Most Capable Open-Set Object Detection Model • 27
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection Paper • 2303.05499 • Published Mar 9, 2023
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models Paper • 2305.15023 • Published May 24, 2023
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy Paper • 2403.14610 • Published Mar 21, 2024 • 3
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks Paper • 2401.14159 • Published Jan 25, 2024 • 1
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model Paper • 2404.19759 • Published Apr 30, 2024 • 24
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated about 1 month ago • 698