vision-ai
Here are 60 public repositories matching this topic...
🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the project? Drop a star! ⭐
-
Updated
Feb 9, 2026 - Python
📺 Instill Console for 🔮 Instill Core: https://github.com/instill-ai/instill-core
-
Updated
Feb 4, 2026 - TypeScript
[CVPRW'25] Official Code For "SK-RD4AD: Skip-Connected Reverse Distillation for One-Class Anomaly Detection"
-
Updated
Jul 7, 2025 - Python
This repository demonstrates YOLOv8-based license plate recognition with GCP Vision AI integration, enabling versatile real-world applications like vehicle identification, traffic monitoring, and geospatial analysis while capturing vital media metadata for enhanced insights.
-
Updated
Feb 1, 2024 - Jupyter Notebook
🌀 The world's first emotionally intelligent CLI that thinks, creates, and empathizes with developers. Autonomous AI with Vision, Dream Engine, and Emotional Intelligence.
-
Updated
Aug 15, 2025 - TypeScript
Gemini Vision & Image Generation MCP for Claude Desktop and Claude Code
-
Updated
Jan 19, 2026 - JavaScript
Bidirectional Markdown↔PDF converter with AI-powered vision. MD→PDF with beautiful themes, PDF→MD with LLaVA - open source & privacy-first
-
Updated
Nov 5, 2025 - TypeScript
MCQ_Grading_Bot is an AI-powered tool that grades solved MCQ exam sheets from images using Gemini Vision. It extracts student info, checks answers, calculates score, and displays detailed results—all through a simple Gradio interface in Colab.
-
Updated
Jun 19, 2025 - Jupyter Notebook
MDDenseResNet : Enhanced Malware Detection Using DNNs
-
Updated
Jul 27, 2025 - Jupyter Notebook
Hybrid AI orchestration stack combining local LLMs (Ollama), vector search (Qdrant), and Azure AI Foundry for scalable RAG, Agentic AI, and Vision. Built with .NET 8 and Python.
-
Updated
Oct 12, 2025 - Python
General vision AI defect detection engine for MLops process/simulations
-
Updated
Mar 5, 2025 - Python
Vision Agent Analyst is a professional web application for automatic analysis of visual data (diagrams, interfaces, documents) using multimodal artificial intelligence models.
-
Updated
Dec 8, 2025 - Python
qwen3-vl-2b-instruct performing step by step tasks confirming normalized coordinations usage and tools executions
-
Updated
Jan 3, 2026 - Python
AI-powered health platform with multi-LLM engine (GPT-4o, Claude, Gemini). Workout generation, medication tracking with OCR, vision AI, gamification with leaderboards/rewards. Self-hosted, privacy-first.
-
Updated
Feb 8, 2026 - TypeScript
People detection and notifications based on the Raspberry Pi + AI Camera
-
Updated
Feb 3, 2025 - Python
Backend проекта Pinterest команды OND team
-
Updated
Mar 2, 2024 - Go
Improve this page
Add a description, image, and links to the vision-ai topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the vision-ai topic, visit your repo's landing page and select "manage topics."