Skip to content

Real-Time Voice to Text Translator is a powerful C++ application that captures system or browser audio (like Chrome Live Captions) and instantly converts it into readable, savable text. Perfect for accessibility, meeting notes, interviews, and AI-assisted conversations β€” this tool bridges spoken language and text in real time.

Notifications You must be signed in to change notification settings

arellanojeremypaul/voice-to-text-transcriber

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

6 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ—£οΈ Real-Time Voice-to-Text Transcript Generator

Demo

A powerful C++ application that captures live audio (like Chrome Live Captions) and converts it to text in real time β€” ideal for accessibility, meeting notes, interviews, or AI-assisted conversations.
Translate speech instantly, save transcripts automatically, and connect with GPT-based tools for smart analysis.

πŸ‘‹ Hi, I'm Jeremy Arellano

πŸš€ AI/ML Engineer | Software Developer | .NET / C++ / Java Specialist

I'm passionate about creating scalable, intelligent, and accessible applications that bridge technology and human needs.
Currently working on AI-driven accessibility tools and real-time applications using modern frameworks.

🧠 About Me

  • πŸ’Ό Software Engineer experienced in C++, C#, Java, and .NET Framework
  • πŸ€– Enthusiastic about AI/ML applications, automation, and assistive technology
  • πŸ’‘ Focused on transforming ideas into high-impact software solutions

πŸš€ How It Works

  1. Ensure Live Captions are enabled in Chrome.
  2. Run the C++ app to capture captions from media in Chrome, including GPT-powered interviews.
  3. The app automatically processes the captions and displays them as text in real-time.
  4. Optionally, save the text or copy it for further use!

πŸ’‘ Use Cases

  • Job Interviews: Capture real-time captions from AI-assisted interviews with GPT or other platforms, making it easier to review and follow up on questions.
  • Content Creators: Quickly save captions from online videos for script writing or analysis.
  • Accessibility: Enhance accessibility by converting captions into text for users who need text-based content.
  • Language Learners: Capture and review spoken content to aid in language transcription and learning.
  • Developers & Researchers: Use captured text for speech-to-text projects or natural language processing (NLP) experiments.

πŸ”§ Technologies Used

  • C++: Core language used for high-performance real-time text capture.
  • No OCR: This tool extracts text directly from Chrome’s Live Caption pop-up window without using OCR, making it lightweight, fast, and fully local with no network traffic required.

πŸ“ License

This project is licensed under the MIT License.

🌐 Keywords

C++, Live-Caption, Text-Capture, Job-Interviews, GPT, Speech-to-Text, Chrome-API, Accessibility, Multi-Language-Support, Real-Time, Audio-To-Text, Voice-To-Text, Speech-Recognition, Audio-To-Text, Transcription, caption-generator

🧭 Goals for 2025

  • πŸš€ Launch 3 open-source AI or .NET projects
  • πŸ’¬ Share tech insights via blog posts
  • 🎯 Contribute to accessibility-focused open-source tools

⭐️ If you find my work useful, please consider giving a star!

Β© 2025 Jeremy. All rights reserved.

About

Real-Time Voice to Text Translator is a powerful C++ application that captures system or browser audio (like Chrome Live Captions) and instantly converts it into readable, savable text. Perfect for accessibility, meeting notes, interviews, and AI-assisted conversations β€” this tool bridges spoken language and text in real time.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published