Feng Ji

talkative

AI & ML interests

large models and their applications

Recent Activity

updated a collection 13 days ago
daily_review
updated a collection 24 days ago
daily_review
liked a model 4 months ago
speechbrain/sepformer-libri2mix

Organizations

None yet

talkative's activity

reacted to merve's post with 👍 7 months ago
Real-time DEtection Transformer (RT-DETR) landed in transformers 🤩 with Apache 2.0 license 😍

🔖 models: https://huggingface.co/PekingU
🔖 demo: merve/RT-DETR-tracking-coco
📝 paper: DETRs Beat YOLOs on Real-time Object Detection (2304.08069)
📖 notebook: https://github.com/merveenoyan/example_notebooks/blob/main/RT_DETR_Notebook.ipynb

YOLO models are known to be super fast for real-time computer vision, but their downside is that results are sensitive to the NMS (non-maximum suppression) post-processing step 🥲
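
To make the NMS point concrete, here is a minimal sketch of greedy non-maximum suppression in plain Python. It is not the post-processing code of any particular YOLO release; the boxes, scores, and thresholds are made-up toy values, and the function names `iou` and `nms` are chosen for illustration only. It shows how the same raw predictions give a different number of detections depending on the IoU threshold.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes, scores, iou_threshold):
    """Greedy NMS: visit boxes by descending score and keep a box only if
    its IoU with every already-kept box is at most iou_threshold."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


# Toy predictions (hypothetical): two heavily overlapping boxes and one separate box.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]

kept_strict = nms(boxes, scores, iou_threshold=0.5)  # overlapping box is suppressed
kept_loose = nms(boxes, scores, iou_threshold=0.7)   # overlapping box survives
print(kept_strict, kept_loose)
```

The two overlapping boxes have an IoU of 81/119 (about 0.68), so moving the threshold from 0.5 to 0.7 changes the output from two detections to three. End-to-end detectors such as RT-DETR predict a fixed set of boxes directly and do not need this step.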

Transformer-based models, on the other hand, are not as computationally efficient 🥲

Isn't there something in between? Enter RT-DETR!

The authors combined a CNN backbone and a multi-stage hybrid encoder (combining convs and attn) with a transformer decoder. In the paper, the authors also claim that one can adjust inference speed by changing the number of decoder layers, without retraining.
The authors report that the model beats the previous state-of-the-art real-time detectors in both speed and accuracy. 🤩
updated a Space 10 months ago