license: apache-2.0 | |
# EgoSpeak Checkpoints | |
This repo contains final checkpoints for the EgoSpeak project. | |
- **Models**: `lstr`, `mamba`, `miniroad` | |
- **Datasets**: `easycom`, `ego4dshuffle`, `ytconv_pretrained` | |
- **Modalities**: `A` (audio), `V` (video), `AV` (audio+video) | |
## File Naming Convention | |
- `mamba_easycom_A.pth` → Mamba model, trained on EasyCom, audio-only | |