
The AI Video Chat Summarizer App is a cutting-edge tool that enables users to interact with video content through natural language. Powered by Google Gemini 2.0 Flash and the Phi Agent Framework, it allows users to upload videos, ask questions, and extract insights effortlessly. Built with Streamlit, the app is fast, intuitive, and easy to extend.


Aravindbapu/AI_Video_Analyst


🎥 AI Video Chat Summarizer App

An AI-powered web application that lets you chat with your videos! Upload any video file and interact with it using natural language. Powered by Google Gemini 2.0 Flash, Phi Agent, and Streamlit.


📖 Table of Contents

  1. Features
  2. Project Structure
  3. Setup Instructions
  4. Example Prompts
  5. Environment Variables
  6. Technologies Used
  7. Future Improvements
  8. Acknowledgements

🧠 Features

  • 🎬 Upload and preview .mp4, .mov, or .avi videos
  • 💬 Ask natural language questions about the video content
  • ⚡ Gemini 2.0 Flash for fast, multimodal video understanding
  • 🌐 DuckDuckGo tool for live web context (via Phi Agent)
  • 💡 Clear chat history and reset session easily
  • 🧱 Clean modular codebase (easy to extend)
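The upload restriction in the first feature can be illustrated with a small check. This is a minimal sketch only: the helper name `is_supported_video` and the constant `SUPPORTED_EXTENSIONS` are hypothetical and are not taken from the repository; only the three extensions come from the feature list.

```python
from pathlib import Path

# Extensions listed in the Features section; the names below are illustrative.
SUPPORTED_EXTENSIONS = {".mp4", ".mov", ".avi"}


def is_supported_video(filename: str) -> bool:
    """Return True if the filename ends in one of the supported extensions."""
    # Compare case-insensitively so "CLIP.MP4" is treated like "clip.mp4".
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

A check like this only looks at the file name, not the file contents, so it is a convenience filter rather than a validation of the video itself.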

📁 Project Structure

.
├── app.py              # Main Streamlit app
├── agents/
│   └── video_agent.py  # AI Agent logic using Gemini + Phi
├── utils/
│   └── file_utils.py   # File saving utility
├── config/
│   └── settings.py     # API key & env configuration
├── requirements.txt
└── README.md
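As an illustration of the kind of helper `utils/file_utils.py` might contain, here is a minimal sketch of a file-saving utility. The function name `save_video_bytes` and its signature are assumptions made for this example; the actual implementation in the repository may differ.

```python
import tempfile
from pathlib import Path


def save_video_bytes(data: bytes, original_name: str) -> Path:
    """Write uploaded bytes to a temporary file and return its path.

    Hypothetical helper: keeps the original extension so that downstream
    tools can infer the video format from the file name.
    """
    suffix = Path(original_name).suffix
    # delete=False keeps the file on disk after the handle is closed,
    # so the caller is responsible for removing it when done.
    with tempfile.NamedTemporaryFile(delete=False, suffix=suffix) as tmp:
        tmp.write(data)
        return Path(tmp.name)
```

In a Streamlit app, the object returned by `st.file_uploader` exposes the file name as `.name` and the contents via `.read()`, so a call might look like `save_video_bytes(uploaded.read(), uploaded.name)`.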

⚙️ Setup Instructions

1. Clone the repository

# Clone the repository and enter the project directory
git clone https://github.com/your-repo/video-analyst.git
cd video-analyst

2. Create a .env file

Add your Google Generative AI API Key:

GOOGLE_API_KEY=your-api-key-here

3. Install dependencies

pip install -r requirements.txt

4. Run the app

streamlit run app.py

✨ Example Prompts

  1. "What is this video about?"
  2. "Summarize the key points."
  3. "List any people mentioned in the video."
  4. "Does this video mention recent events?"

🔐 Environment Variables

Variable          Description
GOOGLE_API_KEY    Your Google Gemini API key
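To show how the key could be read at startup, here is a minimal sketch using only the standard library. The function name `get_google_api_key` is hypothetical, and the sketch assumes the variable has already been placed in the process environment (for example by a `.env` loader or by exporting it in the shell); it does not parse the `.env` file itself.

```python
import os
from typing import Mapping, Optional


def get_google_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return GOOGLE_API_KEY from the environment, or fail with a clear error."""
    # Accept an explicit mapping so the function is easy to test;
    # fall back to the real process environment otherwise.
    env = os.environ if env is None else env
    key = env.get("GOOGLE_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "GOOGLE_API_KEY is not set; add it to your .env file "
            "or export it in your shell."
        )
    return key
```

Failing early with an explicit message tends to be easier to debug than an authentication error raised later by the model client.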

🛠️ Technologies Used

  • Streamlit: For building the web application.
  • Google Gemini 2.0 Flash: For advanced video understanding.
  • Phi Agent Framework: For natural language interaction.
  • DuckDuckGo Tool: For live web context.
  • Python 3.9+: Programming language.

📌 Future Improvements

  • Support for additional file formats like PDFs, DOCX, and Excel.
  • Multilingual query support for global accessibility.
  • Summarization timeline or highlights for better insights.
  • Deployment options using Streamlit Cloud or Docker.

🙌 Acknowledgements

  • Google Generative AI: For powering the AI capabilities.
  • Phi Agent Framework: For enabling natural language interaction.
  • Streamlit Team: For the amazing web framework.

🔗 Connect

📧 Email: kunamaravind@gmail.com

