An intelligent web accessibility auditor that finds WCAG violations and provides AI-powered suggestions to help developers fix them.
Automatic subtitle generation, content summarization, and chapter segmentation | AI-driven education video analysis using Whisper, BLIP-2, and DeepSeek.
A full-stack AI-powered image captioning app built with ReactJS (using Vite) and Flask. Users can upload images, and the app generates descriptive captions using Hugging Face’s BLIP model. Perfect for showcasing AI integration and web development skills in a mini-project.
This repository contains a small set of Jupyter notebooks demonstrating key computer vision and vision–language tasks using pretrained models. The final notebook integrates these tasks into a real-time webcam application that performs captioning and classification concurrently.
Emotica AI is a compassionate and therapeutic virtual assistant designed to provide empathetic and supportive conversations. It integrates a local LLaMA model for text generation, a vision model for image captioning, a RAG system for information retrieval, and emotion detection to tailor its responses.
Fine-tuned BLIP model on Flickr8k for multimodal image captioning (vision + language).
Fine-tuned the BLIP model to accurately caption images of Tom and Jerry.
A Flask-based API that generates captions for images using a custom deep learning model (BLIP). The API accepts an image or image frames and returns the caption generated by the BLIP model.
LUME is an AI-powered app that turns your images into viral memes. Upload a photo, add an optional trending topic, and let Lume use BLIP and Groq AI to craft witty, high-quality captions with stylish overlays—ready to download and share instantly.
Drone-based Image Descriptor - Toyota Hackathon 2025.
A Visual Question Answering (VQA) Application.
An AI-powered image captioning web app using BLIP model from Hugging Face and Gradio.
A simple web application that generates captions for images using the BLIP model from Hugging Face Transformers and a user-friendly interface created with Gradio.
AI StoryTeller is a multimodal AI application that converts images into creative short stories by combining computer vision and natural language generation. The system uses a pretrained image captioning model to understand visual content and Google Gemini to generate context-aware narratives grounded in the image.
Welcome to the AI-Powered Interactive Learning Assistant! 🚀 An open-source, free project with modest hardware requirements, designed especially for students and educators. Our goal is to bring the power of AI right into your classroom, making learning more interactive, engaging, and accessible for everyone.
This project generates behavioral descriptions from images by combining computer vision and natural language processing. It goes beyond basic scene descriptions to infer human behaviors, intentions, and social contexts.
🎥 Enable real-time image captioning and classification with this Jupyter notebook collection, featuring pretrained models and live webcam applications.