Wav2Vec 2.0 - a facebook Collection

Wav2Vec 2.0

updated Jan 16, 2024

A collection for the first release of Wav2Vec 2.0, a speech encoder that learns powerful representations from unlabelled audio data.

facebook/wav2vec2-large-960h-lv60-self

Automatic Speech Recognition • Updated May 23, 2022 • 956k downloads • 143 likes

Note: The Wav2Vec 2.0 "large" model, pre-trained on 53k hours of unlabelled audio from the LibriSpeech and LibriVox (LV) corpora and fine-tuned on 960 hours of labelled LibriSpeech data. This is the best-performing Wav2Vec 2.0 checkpoint from the initial release, obtaining 1.9%/3.9% WER on the LibriSpeech test-clean/test-other subsets, respectively.
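The WER (word error rate) figures quoted above count word-level substitutions, deletions, and insertions against a reference transcript. As a rough illustration only, here is a minimal per-utterance sketch in plain Python. The function name `word_error_rate` and the example sentences are hypothetical, and this is not the evaluation script used for the reported numbers; benchmark WER is normally aggregated over a whole test set.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (Levenshtein distance, row by row).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("sat" -> "sit") and one deletion ("the") over 6 words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

A WER of 1.9% therefore corresponds to roughly two word errors per hundred reference words.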
facebook/wav2vec2-large-960h

Automatic Speech Recognition • Updated Apr 5, 2022 • 80.5k downloads • 28 likes

Note: The Wav2Vec 2.0 "large" model, pre-trained on 960 hours of unlabelled LibriSpeech audio and fine-tuned on the same 960 hours of LibriSpeech ASR data.
facebook/wav2vec2-base-960h

Automatic Speech Recognition • Updated Nov 14, 2022 • 1.42M downloads • 312 likes

Note: The Wav2Vec 2.0 "base" model, pre-trained on 960 hours of unlabelled LibriSpeech audio and fine-tuned on the same 960 hours of LibriSpeech ASR data.
facebook/wav2vec2-base-100h

Automatic Speech Recognition • Updated May 27, 2022 • 1.39k downloads • 6 likes

Note: The Wav2Vec 2.0 "base" model, pre-trained on 960 hours of unlabelled LibriSpeech audio and fine-tuned on 100 hours of labelled LibriSpeech ASR data.
facebook/wav2vec2-large-lv60

Updated Dec 28, 2021 • 7.65k downloads • 8 likes

Note: The Wav2Vec 2.0 "large" model, pre-trained on 53k hours of unlabelled audio from the LibriSpeech and LibriVox (LV) corpora.
facebook/wav2vec2-large

Updated Aug 26, 2022 • 6.22k downloads • 5 likes

Note: The Wav2Vec 2.0 "large" model, pre-trained on 960 hours of unlabelled LibriSpeech audio.
facebook/wav2vec2-base

Updated Dec 28, 2021 • 505k downloads • 91 likes

Note: The Wav2Vec 2.0 "base" model, pre-trained on 960 hours of unlabelled LibriSpeech audio.
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 6 upvotes

Note: The wav2vec 2.0 paper, accepted to NeurIPS 2020.
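The checkpoints in this collection fall into two groups: those fine-tuned for speech recognition and those that are pre-trained only. The sketch below shows one way they might be loaded with the Hugging Face `transformers` library, assuming `transformers` and `torch` are installed and the audio is a mono waveform sampled at 16 kHz. The `CHECKPOINTS` table and the helper names `fine_tuned_checkpoints` and `transcribe` are illustrative, not part of any library.

```python
# Checkpoints listed in this collection.
# True: fine-tuned for ASR; False: pre-trained only.
CHECKPOINTS = {
    "facebook/wav2vec2-large-960h-lv60-self": True,
    "facebook/wav2vec2-large-960h": True,
    "facebook/wav2vec2-base-960h": True,
    "facebook/wav2vec2-base-100h": True,
    "facebook/wav2vec2-large-lv60": False,
    "facebook/wav2vec2-large": False,
    "facebook/wav2vec2-base": False,
}


def fine_tuned_checkpoints():
    """Return the model ids that can transcribe speech without further training."""
    return [model_id for model_id, fine_tuned in CHECKPOINTS.items() if fine_tuned]


def transcribe(model_id, waveform, sampling_rate=16_000):
    """Greedy transcription of one waveform (1-D float array) with a fine-tuned checkpoint."""
    if not CHECKPOINTS.get(model_id, False):
        raise ValueError(f"{model_id} is not a fine-tuned ASR checkpoint in this collection")

    # Imported lazily so the table above can be used without the heavy dependencies.
    import torch
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)

    inputs = processor(waveform, sampling_rate=sampling_rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits

    # Pick the most likely token per frame; batch_decode collapses repeats and blanks.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

The pre-trained-only checkpoints have no transcription head, so they would typically be loaded with `Wav2Vec2Model` for feature extraction or fine-tuned on labelled data first.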