Automatic Speech Recognition with Whisper model

This application performs a speech-to-text transcription using OpenAI's Whisper-tiny and Whisper-base model on the Hailo-8/8L/10H AI accelerators.

Prerequisites

Ensure your system matches the following requirements before proceeding:

Platforms tested: x86, Raspberry Pi 5
OS: Ubuntu 22 (x86) or Raspberry OS.
HailoRT 4.20 or 4.21 and the corresponding PCIe driver must be installed. You can download them from the Hailo Developer Zone

ffmpeg and libportaudio2 installed for audio processing.

sudo apt update
sudo apt install ffmpeg
sudo apt install libportaudio2
sudo apt install portaudio19-dev python3-pyaudio
sudo apt install hailo-all

Python 3.10 or 3.11 installed.

Hardware prepare [one of them]

reComputer AI R2140-12

reComputer Industrial R2045-12

Microphone Array

ReSpeaker Mic Array v3.0

Installation - Inference only

Follow these steps to set up the environment and install dependencies for inference:

Clone this repository:
```
https://github.com/Seeed-Projects/STT_hailo_whisper
```
If you have any authentication issues, add your SSH key or download the zip.

Activate the virtual environment from the repository root folder:

python -m venv .env --system-site-packages && source .env/bin/activate

Install necessary model
```
cd app && python download_resources.py
```

Install necessary lib

cd .. && pip install -r requirements.txt

Run this job

Run whisper for real-time STT
```
python hailo_whisper.py --hw-arch hailo8 --variant base --udp-host 0.0.0.0 --udp-port 12345
```
You can also run python hailo_whisper.py --help to check more information.
Run UDP reciver
```
cd test && python recive_message.py --host 0.0.0.0 --port 12345 --stats-interval 5
```
You can also run python recive_message.py --help to check more information.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
app		app
common		common
test		test
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
hailo_whisper.py		hailo_whisper.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automatic Speech Recognition with Whisper model

Prerequisites

Hardware prepare [one of them]

reComputer AI R2140-12

reComputer Industrial R2045-12

Microphone Array

ReSpeaker Mic Array v3.0

Installation - Inference only

Run this job

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Automatic Speech Recognition with Whisper model

Prerequisites

Hardware prepare [one of them]

reComputer AI R2140-12

reComputer Industrial R2045-12

Microphone Array

ReSpeaker Mic Array v3.0

Installation - Inference only

Run this job

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages