Feng Ji

talkative

AI & ML interests

large models and their applications

Organizations

None yet

talkative's activity

updated a collection 13 days ago

daily_review

28 items • Updated 13 days ago

updated a collection 24 days ago

daily_review

28 items • Updated 13 days ago

liked a model 4 months ago

speechbrain/sepformer-libri2mix

Audio-to-Audio • Updated Feb 25, 2024 • 284 • 6

liked a model 5 months ago

stabilityai/stable-fast-3d

Image-to-3D • Updated 19 days ago • 5.12k • 569

liked a dataset 6 months ago

HuggingFaceM4/Docmatix

Viewer • Updated Aug 26, 2024 • 2.55M • 11.5k • 238

updated a collection 7 months ago

daily_review

28 items • Updated 13 days ago

liked a Space 7 months ago

Daily Papers

Complete list of past Daily Papers

reacted to merve's post with 👍 7 months ago

Post

Real-time DEtection Transformer (RT-DETR) has landed in transformers 🤩 with an Apache 2.0 license 😍

🔖 models: https://huggingface.co/PekingU
🔖 demo: merve/RT-DETR-tracking-coco
📝 paper: DETRs Beat YOLOs on Real-time Object Detection (2304.08069)
📖 notebook: https://github.com/merveenoyan/example_notebooks/blob/main/RT_DETR_Notebook.ipynb

YOLO models are known to be super fast for real-time computer vision, but their downside is that results are sensitive to NMS (non-maximum suppression) 🥲

Transformer-based models, on the other hand, are not as computationally efficient 🥲

Isn't there something in between? Enter RT-DETR!

The authors combine a CNN backbone and a multi-stage hybrid encoder (mixing convs and attn) with a transformer decoder. In the paper, they also claim that speed can be adjusted by changing the number of decoder layers, without retraining.
The authors report that the model outperforms the previous state of the art in both speed and accuracy. 🤩
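
The post's remark about YOLO and NMS can be made concrete with a small sketch. This is not RT-DETR or YOLO code and does not come from the post or the paper; it is a plain-Python greedy non-maximum suppression with made-up boxes and scores, and the names `iou` and `nms` are hypothetical helpers. It only shows that the set of surviving detections can change when the IoU threshold changes, which is the kind of post-processing sensitivity an end-to-end detector aims to avoid.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), illustrative format


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: List[Box], scores: List[float], iou_threshold: float) -> List[int]:
    """Greedy NMS: visit boxes by descending score and keep a box only if
    its IoU with every already-kept box is at most iou_threshold."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


# Made-up detections: two heavily overlapping boxes and one separate box.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]

strict = nms(boxes, scores, iou_threshold=0.5)  # overlapping box is suppressed
loose = nms(boxes, scores, iou_threshold=0.7)   # overlapping box survives
print(strict, loose)
```

The two overlapping boxes have an IoU of 81/119 (about 0.68), so they collapse into one detection at a threshold of 0.5 but are both kept at 0.7.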

updated a collection 7 months ago

vlm

1 item • Updated Jul 2, 2024

updated 2 collections 9 months ago

vlm

1 item • Updated Jul 2, 2024

daily_review

28 items • Updated 13 days ago

liked a model 10 months ago

coqui/XTTS-v2

Text-to-Speech • Updated Dec 11, 2023 • 2.22M • 2.26k

updated a Space 10 months ago

JupyterLab

updated a collection 12 months ago

daily_review

28 items • Updated 13 days ago