Skip to content

Seeed-Projects/STT_hailo_whisper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Automatic Speech Recognition with Whisper model

This application performs a speech-to-text transcription using OpenAI's Whisper-tiny and Whisper-base model on the Hailo-8/8L/10H AI accelerators.

Prerequisites

Ensure your system matches the following requirements before proceeding:

  • Platforms tested: x86, Raspberry Pi 5
  • OS: Ubuntu 22 (x86) or Raspberry OS.
  • HailoRT 4.20 or 4.21 and the corresponding PCIe driver must be installed. You can download them from the Hailo Developer Zone
  • ffmpeg and libportaudio2 installed for audio processing.
    sudo apt update
    sudo apt install ffmpeg
    sudo apt install libportaudio2
    sudo apt install portaudio19-dev python3-pyaudio
    sudo apt install hailo-all
    
  • Python 3.10 or 3.11 installed.

Hardware prepare [one of them]

Microphone Array

Installation - Inference only

Follow these steps to set up the environment and install dependencies for inference:

  1. Clone this repository:

    https://github.com/Seeed-Projects/STT_hailo_whisper

    If you have any authentication issues, add your SSH key or download the zip.

  2. Activate the virtual environment from the repository root folder:

    python -m venv .env --system-site-packages && source .env/bin/activate
  3. Install necessary model

    cd app && python download_resources.py
  4. Install necessary lib

    cd .. && pip install -r requirements.txt

Run this job

  1. Run whisper for real-time STT

    python hailo_whisper.py --hw-arch hailo8 --variant base --udp-host 0.0.0.0 --udp-port 12345

    You can also run python hailo_whisper.py --help to check more information.

  2. Run UDP reciver

    cd test && python recive_message.py --host 0.0.0.0 --port 12345 --stats-interval 5

    You can also run python recive_message.py --help to check more information.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages