An AI-powered web application that lets you chat with your videos! Upload any video file and interact with it using natural language. Powered by Google Gemini 2.0 Flash, Phi Agent, and Streamlit.
- Features
- Project Structure
- Setup Instructions
- Example Prompts
- Environment Variables
- Technologies Used
- Future Improvements
- Acknowledgements
- 🎬 Upload and preview
.mp4,.mov, or.avivideos - 💬 Ask natural language questions about the video content
- ⚡ Gemini 2.0 Flash for fast, multimodal video understanding
- 🌐 DuckDuckGo tool for live web context (via Phi Agent)
- 💡 Clear chat history and reset session easily
- 🧱 Clean modular codebase (easy to extend)
.
├── app.py # Main Streamlit app
├── agents/
│ └── video_agent.py # AI Agent logic using Gemini + Phi
├── utils/
│ └── file_utils.py # File saving utility
├── config/
│ └── settings.py # API key & env configuration
├── requirements.txt
└── README.md
# Clone the repository
https://github.com/your-repo/video-analyst.gitAdd your Google Generative AI API Key:
GOOGLE_API_KEY=your-api-key-herepip install -r requirements.txtstreamlit run app.py- "What is this video about?"
- "Summarize the key points."
- "List any people mentioned in the video."
- "Does this video mention recent events?"
| Variable | Description |
|---|---|
GOOGLE_API_KEY |
Your Google Gemini API key |
- Streamlit: For building the web application.
- Google Gemini 2.0 Flash: For advanced video understanding.
- Phi Agent Framework: For natural language interaction.
- DuckDuckGo Tool: For live web context.
- Python 3.9+: Programming language.
- Support for additional file formats like PDFs, DOCX, and Excel.
- Multilingual query support for global accessibility.
- Summarization timeline or highlights for better insights.
- Deployment options using Streamlit Cloud or Docker.
- Google Generative AI: For powering the AI capabilities.
- Phi Agent Framework: For enabling natural language interaction.
- Streamlit Team: For the amazing web framework.
🔗 Connect
📧 Email: kunamaravind@gmail.com