This application performs a speech-to-text transcription using OpenAI's Whisper-tiny and Whisper-base model on the Hailo-8/8L/10H AI accelerators.
Ensure your system matches the following requirements before proceeding:
- Platforms tested: x86, Raspberry Pi 5
- OS: Ubuntu 22 (x86) or Raspberry OS.
- HailoRT 4.20 or 4.21 and the corresponding PCIe driver must be installed. You can download them from the Hailo Developer Zone
- ffmpeg and libportaudio2 installed for audio processing.
sudo apt update sudo apt install ffmpeg sudo apt install libportaudio2 sudo apt install portaudio19-dev python3-pyaudio sudo apt install hailo-all - Python 3.10 or 3.11 installed.
Follow these steps to set up the environment and install dependencies for inference:
-
Clone this repository:
https://github.com/Seeed-Projects/STT_hailo_whisper
If you have any authentication issues, add your SSH key or download the zip.
-
Activate the virtual environment from the repository root folder:
python -m venv .env --system-site-packages && source .env/bin/activate
-
Install necessary model
cd app && python download_resources.py
-
Install necessary lib
cd .. && pip install -r requirements.txt
-
Run whisper for real-time STT
python hailo_whisper.py --hw-arch hailo8 --variant base --udp-host 0.0.0.0 --udp-port 12345
You can also run
python hailo_whisper.py --helpto check more information. -
Run UDP reciver
cd test && python recive_message.py --host 0.0.0.0 --port 12345 --stats-interval 5
You can also run
python recive_message.py --helpto check more information.


