Skip to content

๐ŸŽค Transform any browser tab into an intelligent audio recording system with automated transcription and AI analysis via n8n. Works with Zoom, Meet, Teams, YouTube, Spotify, Discord & more!

Notifications You must be signed in to change notification settings

yamb0x/audio-capture-n8n

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

11 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐ŸŽค Audio Capture for n8n

Transform any browser tab into an intelligent audio recording system with automated transcription and AI analysis

Chrome Extension n8n Integration Node.js License: MIT

โœจ What This Does

Capture audio from any browser tab (Zoom calls, YouTube videos, Spotify, Discord) and automatically send it to n8n for intelligent processing with OpenAI Whisper transcription, GPT-4 analysis, and Notion integration.

๐ŸŽฏ Perfect for: Meeting notes, content analysis, lecture transcription, podcast processing, and any audio-to-text workflow.

๐Ÿš€ Key Features

  • ๐ŸŒ Universal Audio Capture - Works with any website (Zoom, Meet, Teams, YouTube, Spotify, Discord)
  • ๐Ÿค– Smart Chunking - Automatically splits audio into 30-second chunks for optimal processing
  • ๐Ÿ“Š Real-time Dashboard - Monitor recordings with rich metadata and source detection
  • ๐Ÿ”„ n8n Integration - Pre-built workflow for transcription and AI analysis
  • ๐ŸŽฏ Source Intelligence - Detects platform and adds contextual metadata
  • ๐ŸŒ™ Arc Browser Support - Special handling for Arc's permission model
  • โšก Zero Configuration - Works out of the box with sensible defaults

๐Ÿ“ฑ Live Dashboard

Dashboard Preview

Features:

  • ๐Ÿ“ž Source Detection: Google Meet, Zoom, Teams, YouTube, Spotify, Discord
  • ๐Ÿ“Š Real-time Monitoring: Active sessions, chunk processing, forwarding status
  • ๐ŸŽต Rich Metadata: Tab titles, recording types, audio sizes, processing times
  • โžก๏ธ n8n Forwarding: Automatic forwarding with status indicators

๐ŸŽฏ Quick Start

1. Install Chrome Extension

# Clone this repository
git clone https://github.com/yamb0x/audio-capture-n8n.git
cd audio-capture-n8n

# Install dependencies
npm install
  1. Open Chrome: chrome://extensions/
  2. Enable "Developer mode"
  3. Click "Load unpacked"
  4. Select this folder

2. Start Dashboard

# Mac/Linux
./start-dashboard.sh

# Windows
start-dashboard.bat

# Or use npm
npm start

3. Start Recording

  1. Open any website with audio
  2. Click the extension icon
  3. Click "Start Recording"
  4. Monitor at http://localhost:5678

๐Ÿ”ง n8n Integration

Webhook Data Structure

Each audio chunk includes rich metadata:

{
  "audio": "base64-encoded-webm-data",
  "timestamp": "2025-06-02T12:30:45.123Z",
  "duration": 30,
  "format": "webm",
  "recordingSessionId": "session_1733067015123_x3k9m2n1p",
  "meetingId": "team-standup-zoom",
  "meetingUrl": "https://zoom.us/j/123456789",
  "chunkIndex": 0,
  "isFirstChunk": true,
  "isLastChunk": false,
  "source": "zoom",
  "recordingType": "meeting-audio",
  "title": "Team Weekly Standup"
}

Pre-built n8n Workflow

Import n8n-enhanced-workflow.json for complete automation:

  • ๐ŸŽฏ Session Aggregation - Waits for complete recordings
  • ๐Ÿ—ฃ๏ธ OpenAI Whisper - High-quality transcription
  • ๐Ÿง  GPT-4 Analysis - Extract insights, action items, summaries
  • ๐Ÿ“ Notion Integration - Create structured meeting notes
  • ๐Ÿท๏ธ Smart Tagging - Automatic categorization and metadata

๐Ÿ“Š Platform Support

Platform Emoji Recording Type Special Features
Google Meet ๐Ÿ“ž meeting-audio Meeting ID extraction
Zoom ๐ŸŸฆ meeting-audio Session detection
Microsoft Teams ๐Ÿ’ฌ meeting-audio Team context
YouTube ๐Ÿ“บ media-playback Video metadata
Spotify ๐ŸŽต media-playback Track information
Discord ๐ŸŽฎ chat-audio Channel context
Any Website ๐ŸŒ general-audio Universal capture

๐Ÿ› ๏ธ Configuration

Default Webhook URL

http://localhost:5678/webhook/meeting-audio?forward=true

What this does:

  • โœ… Records in dashboard for monitoring
  • โœ… Forwards to n8n for processing
  • โœ… Best of both worlds

Environment Variables

PORT=5678                    # Dashboard port
N8N_WEBHOOK_URL=...         # Your n8n webhook URL
LOG_LEVEL=info              # Logging verbosity

๐Ÿ—๏ธ Architecture

๐ŸŒ Browser Tab โ†’ ๐Ÿงฉ Chrome Extension โ†’ ๐Ÿ“Š Dashboard โ†’ ๐Ÿค– n8n โ†’ ๐Ÿ—ฃ๏ธ Whisper โ†’ ๐Ÿง  GPT-4 โ†’ ๐Ÿ“ Notion
   (Audio)        (Capture)           (Monitor)      (Process)  (Transcribe)  (Analyze)   (Store)

๐Ÿ” Privacy & Security

  • ๐Ÿ  Local Processing - Audio processed locally when possible
  • ๐Ÿšซ No Cloud Storage - Audio chunks are temporary
  • ๐Ÿ”’ Minimal Permissions - Only required Chrome APIs
  • ๐Ÿ” Secure Transfer - HTTPS recommended for production
  • ๐Ÿ—‘๏ธ Auto-cleanup - Temporary files deleted after processing

๐ŸŽจ Dashboard Features

Real-Time Monitoring

  • ๐ŸŸข System Status - Server health and uptime
  • ๐Ÿ“ˆ Recording Stats - Sessions, chunks, data volume
  • ๐ŸŽต Live Sessions - Active recordings with progress
  • โšก Processing Times - n8n forwarding performance

Rich Source Information

  • ๐Ÿท๏ธ Platform Detection - Automatic source identification
  • ๐Ÿ“„ Tab Titles - Full page context
  • ๐ŸŽฏ Recording Types - Meeting, media, chat categorization
  • ๐Ÿ“Š Audio Analytics - Size, duration, chunk count

Modern UI Design

  • ๐ŸŽจ Monospace Aesthetic - Clean, minimal interface with Basis Grotesque and Kalice fonts
  • โšซ Black & White Theme - Professional, distraction-free design
  • ๐Ÿ”ฒ Sharp Borders - Precise, geometric interface elements
  • ๐Ÿšซ No Emojis in UI - Clean icons using CSS shapes instead

๐Ÿงช Advanced Usage

Custom Webhook URLs

// Dashboard only (no forwarding)
http://localhost:5678/webhook/meeting-audio

// Dashboard + custom n8n forwarding  
http://localhost:5678/webhook/meeting-audio?forward=true

// Direct to your n8n instance
https://your-n8n-instance.com/webhook/meeting-audio

Arc Browser Support

Special handling for Arc's unique permission model:

  • ๐Ÿ”„ Auto-detection - Recognizes Arc browser
  • ๐Ÿ“„ Full-page recorder - Alternative interface for permissions
  • โšก Permission recovery - Automatic fallback methods

๐Ÿ“ฆ File Structure

audio-capture-n8n/
โ”œโ”€โ”€ ๐Ÿงฉ Extension Core
โ”‚   โ”œโ”€โ”€ manifest.json          # Chrome Extension Manifest V3
โ”‚   โ”œโ”€โ”€ popup.html/js         # Main extension interface
โ”‚   โ”œโ”€โ”€ background.js         # Service worker
โ”‚   โ”œโ”€โ”€ content.js           # Page integration
โ”‚   โ””โ”€โ”€ recording-page.html  # Arc browser support
โ”œโ”€โ”€ ๐Ÿ“Š Dashboard & Monitoring  
โ”‚   โ”œโ”€โ”€ dashboard-server.js   # Real-time monitoring
โ”‚   โ”œโ”€โ”€ launcher.js          # Application launcher
โ”‚   โ””โ”€โ”€ start-dashboard.*    # Cross-platform scripts
โ”œโ”€โ”€ ๐Ÿค– n8n Integration
โ”‚   โ””โ”€โ”€ n8n-enhanced-workflow.json  # Complete automation workflow
โ”œโ”€โ”€ ๐Ÿ”ง Audio Processing
โ”‚   โ””โ”€โ”€ audio-processor.js   # Modern AudioWorklet implementation
โ””โ”€โ”€ ๐Ÿ“š Documentation
    โ””โ”€โ”€ README.md           # This file

๐Ÿค Contributing

We welcome contributions! Please see our contributing guidelines:

  1. ๐Ÿด Fork the repository
  2. ๐ŸŒŸ Create your feature branch
  3. โœ… Test thoroughly with the dashboard
  4. ๐Ÿ“ Update documentation
  5. ๐Ÿš€ Submit a pull request

๐Ÿ› Troubleshooting

Extension Issues

  • Permission denied: Check Chrome microphone settings
  • No recordings: Verify webhook URL in extension popup
  • Arc browser: Use the full-page recorder interface

Dashboard Issues

  • Empty dashboard: Extension using old webhook URL (auto-fixed)
  • Port conflicts: Dashboard handles port 5678 automatically
  • Missing data: Check extension console logs

n8n Integration

  • No events: Verify webhook URL configuration
  • Processing errors: Check OpenAI API key and credits
  • Workflow failures: Review n8n execution logs

๐Ÿ“ Changelog

v1.1.0 (2025-06-03)

  • ๐ŸŽจ New UI Design - Complete interface redesign with monospace aesthetic
  • ๐Ÿ”ค Custom Fonts - Added Basis Grotesque and Kalice font families
  • ๐Ÿ”ง UTF-8 Fix - Fixed character encoding issues across all interfaces
  • ๐ŸŽฏ Improved Icons - Replaced emojis with clean CSS-based icons
  • ๐Ÿ“ฑ Better Popup - Enhanced popup design with consistent styling

v1.0.0 (Initial Release)

  • ๐ŸŽค Core audio capture functionality
  • ๐Ÿ“Š Real-time dashboard
  • ๐Ÿค– n8n integration
  • ๐ŸŒ Multi-platform support

๐Ÿ“„ License

MIT License - Feel free to use, modify, and distribute!

๐ŸŒŸ Star This Repository

If this project helps you automate your audio workflows, please give it a star! โญ


Built with โค๏ธ for seamless meeting transcription and audio analysis

GitHub stars GitHub forks GitHub issues

About

๐ŸŽค Transform any browser tab into an intelligent audio recording system with automated transcription and AI analysis via n8n. Works with Zoom, Meet, Teams, YouTube, Spotify, Discord & more!

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published