---
tags:
- pyannote
- speaker-diarization
datasets:
- ami
- voxconverse
license: mit
---
# Speaker diarization
Relies on pyannote.audio 2.0, which is currently in development; see the [installation instructions](https://github.com/pyannote/pyannote-audio/tree/develop#installation).
```python
from pyannote.audio import Pipeline

# load the pretrained diarization pipeline
pipeline = Pipeline.from_pretrained("AMITKESARI2000/pyannote_SD1")

# apply it to an audio file
output = pipeline("audio.wav")

for turn, _, speaker in output.itertracks(yield_label=True):
    # speaker speaks between turn.start and turn.end
    ...
```
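The turns yielded by the loop above can be aggregated, for example into total speaking time per speaker. The following is a minimal sketch that assumes the turns have already been collected as plain `(start, end, speaker)` tuples; the helper name `speaking_time` and the sample values are hypothetical and not part of pyannote.audio.

```python
from collections import defaultdict


def speaking_time(turns):
    """Sum the duration of each speaker's turns, in seconds."""
    totals = defaultdict(float)
    for start, end, speaker in turns:
        totals[speaker] += end - start
    return dict(totals)


# hypothetical turns, e.g. gathered from the diarization output with
# [(turn.start, turn.end, speaker)
#  for turn, _, speaker in output.itertracks(yield_label=True)]
turns = [
    (0.0, 2.5, "SPEAKER_00"),
    (2.5, 4.0, "SPEAKER_01"),
    (4.0, 7.0, "SPEAKER_00"),
]
totals = speaking_time(turns)
```

Keeping the aggregation separate from the pipeline call makes it easy to check on hand-written turns before running it on real output.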
## Benchmark
| Dataset | [Diarization error rate](http://pyannote.github.io/pyannote-metrics/reference.html#diarization) |
| --------------------------------------------------------------------------------------------------- | ------ |
| [AMI `only_words` evaluation set](https://github.com/BUTSpeechFIT/AMI-diarization-setup) | 21.4% |
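
For reference, the diarization error rate reported above is commonly defined as the sum of false alarm, missed detection, and speaker confusion durations, divided by the total duration of reference speech. A minimal sketch of that arithmetic follows; the function name and the sample durations are hypothetical, and this is neither the pyannote.metrics implementation nor the data behind the AMI figure.

```python
def diarization_error_rate(false_alarm, missed_detection, confusion, total):
    """Return DER as a fraction, given error durations in seconds.

    `total` is the total duration of reference speech.
    """
    if total <= 0:
        raise ValueError("total reference speech duration must be positive")
    return (false_alarm + missed_detection + confusion) / total


# hypothetical durations in seconds, for illustration only
der = diarization_error_rate(
    false_alarm=30.0,
    missed_detection=60.0,
    confusion=24.0,
    total=600.0,
)
print(f"DER = {der:.1%}")
```

Because false alarms count as errors without adding to the reference duration, the rate can exceed 100% on difficult recordings.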