---
title: PhoWhisper Web
emoji: 🎤
colorFrom: indigo
colorTo: indigo
sdk: static
pinned: true
models:
- huuquyet/PhoWhisper-tiny
- huuquyet/PhoWhisper-base
- huuquyet/PhoWhisper-small
- huuquyet/PhoWhisper-medium
- huuquyet/PhoWhisper-large
- Xenova/whisper-tiny.en
- Xenova/whisper-tiny
- Xenova/whisper-small.en
- Xenova/whisper-small
license: wtfpl
short_description: ML-powered speech recognition for Vietnamese
---


# PhoWhisper Web

ML-powered speech recognition for Vietnamese directly in your browser! Built with [🤗 Transformers.js](https://github.com/xenova/transformers.js) + [Next.js](https://nextjs.org/).

It uses the [🤗 PhoWhisper](https://github.com/VinAIResearch/PhoWhisper) models, each converted to ONNX:
- [PhoWhisper-tiny](https://hf.co/vinai/PhoWhisper-tiny/), converted to an [ONNX model](https://hf.co/huuquyet/PhoWhisper-tiny)
- [PhoWhisper-base](https://hf.co/vinai/PhoWhisper-base/), converted to an [ONNX model](https://hf.co/huuquyet/PhoWhisper-base)
- [PhoWhisper-small](https://hf.co/vinai/PhoWhisper-small/), converted to an [ONNX model](https://hf.co/huuquyet/PhoWhisper-small)
- [PhoWhisper-medium](https://hf.co/vinai/PhoWhisper-medium/), converted to an [ONNX model](https://hf.co/huuquyet/PhoWhisper-medium)
- [PhoWhisper-large](https://hf.co/vinai/PhoWhisper-large/), converted to an [ONNX model](https://hf.co/huuquyet/PhoWhisper-large)
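
The naming pattern in the list above (each `vinai/PhoWhisper-<size>` checkpoint has an ONNX counterpart under `huuquyet/PhoWhisper-<size>`) can be sketched in TypeScript as follows. The helper names `sourceRepoId`, `onnxRepoId`, and `MODEL_TABLE` are hypothetical and only illustrate the mapping; they are not taken from this repository's code.

```typescript
// Model sizes listed in this README.
const PHOWHISPER_SIZES = ["tiny", "base", "small", "medium", "large"] as const;
type PhoWhisperSize = (typeof PHOWHISPER_SIZES)[number];

// Hypothetical helper: ID of the original VinAI checkpoint on the Hugging Face Hub.
function sourceRepoId(size: PhoWhisperSize): string {
  return `vinai/PhoWhisper-${size}`;
}

// Hypothetical helper: ID of the ONNX conversion of the same checkpoint.
function onnxRepoId(size: PhoWhisperSize): string {
  return `huuquyet/PhoWhisper-${size}`;
}

// One row per size, pairing the original checkpoint with its ONNX conversion.
const MODEL_TABLE = PHOWHISPER_SIZES.map((size) => ({
  size,
  source: sourceRepoId(size),
  onnx: onnxRepoId(size),
}));
```

With Transformers.js installed, an ID such as `onnxRepoId("tiny")` is the kind of value one would pass as the model argument of `pipeline("automatic-speech-recognition", ...)`. How this app wires the model selection internally is not described here, so treat that as an assumption.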

Check out the demo sites:
<a href="https://huggingface.co/spaces/huuquyet/PhoWhisper-next">
  <picture>
    <source srcset="https://huggingface.co/datasets/huggingface/badges/resolve/main/open-in-hf-spaces-sm.svg" media="(prefers-color-scheme: light), (prefers-color-scheme: no-preference)">
    <source srcset="https://huggingface.co/datasets/huggingface/badges/resolve/main/open-in-hf-spaces-sm-dark.svg" media="(prefers-color-scheme: dark)">
    <img src="https://huggingface.co/datasets/huggingface/badges/resolve/main/open-in-hf-spaces-sm.svg" alt="Open in Spaces">
  </picture>
</a>
 or [Vercel app](https://pho-whisper-next.vercel.app/).

https://github.com/xenova/whisper-web/assets/26504141/fb170d84-9678-41b5-9248-a112ecc74c27

## Running locally

1. Clone the repo and install dependencies with [bun](https://bun.sh):

  ```bash
  git clone https://github.com/huuquyet/PhoWhisper-next.git
  cd PhoWhisper-next
  bun install
  ```

2. Run the development server:

  ```bash
  bun dev
  ```
  > Firefox users may need to set `dom.workers.modules.enabled` to `true` in `about:config` to enable module Web Workers (ES modules inside workers).
  > See [this issue](https://github.com/xenova/whisper-web/issues/8) for more details.

3. Open the link (e.g., [http://localhost:3000/](http://localhost:3000/)) in your browser.
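
The Firefox note in step 2 is about module Web Workers. As a minimal sketch (not code from this repository), the check below uses a commonly cited heuristic: an engine that supports module workers reads the `type` option when a worker is constructed, while one that does not never touches it. The names `WorkerCtor` and `supportsModuleWorkers` are hypothetical, and the constructor is passed in as a parameter so the function can also be exercised outside a browser.

```typescript
// Minimal shape of a Worker constructor (an assumption made for this sketch).
type WorkerCtor = new (
  url: string,
  options?: { type?: "classic" | "module" }
) => { terminate(): void };

// Returns true when the constructor reads the `type` option.
function supportsModuleWorkers(Ctor: WorkerCtor | undefined): boolean {
  if (!Ctor) return false;
  let readType = false;
  const options = {
    // The getter records that the engine inspected `type`.
    get type(): "module" {
      readType = true;
      return "module";
    },
  };
  try {
    // "blob://" is a throwaway URL; the worker is terminated immediately.
    new Ctor("blob://", options).terminate();
  } catch {
    // Some engines may throw on the dummy URL; the getter result still stands.
  }
  return readType;
}
```

In a browser, passing the global `Worker` constructor to this function gives a quick hint whether the `about:config` change is needed before the app can start its worker.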

## Acknowledgements

- [Whisper-web](https://github.com/xenova/whisper-web)
- [PhoWhisper](https://github.com/VinAIResearch/PhoWhisper)
- [Transformers.js](https://github.com/xenova/transformers.js)
- [Next.js](https://nextjs.org)