LAVIS - A One-stop Library for Language-Vision Intelligence
Updated Nov 18, 2024 - Jupyter Notebook
A lightweight deep learning model with a web application that answers image-based questions using a non-generative approach, built for the VizWiz Grand Challenge 2023. It relies on a carefully curated answer vocabulary and adds a linear layer on top of OpenAI's CLIP model, which serves as the image and text encoder.
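To illustrate what a non-generative approach means here, the following is a minimal sketch of a classification head that picks an answer from a fixed vocabulary. It is not the repository's code. The embeddings are short placeholder vectors standing in for encoder outputs such as CLIP's, the fusion by concatenation is an assumption, and the names `linear`, `predict_answer`, `ANSWER_VOCAB`, `WEIGHTS`, and `BIAS` are hypothetical.

```python
from typing import List, Sequence


def linear(x: Sequence[float], weights: Sequence[Sequence[float]],
           bias: Sequence[float]) -> List[float]:
    """Apply one linear layer: one output score per row of `weights`."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def predict_answer(image_emb: Sequence[float], text_emb: Sequence[float],
                   weights: Sequence[Sequence[float]], bias: Sequence[float],
                   vocab: Sequence[str]) -> str:
    """Score every answer in a fixed vocabulary and return the best one."""
    # Assumption: image and question embeddings are fused by concatenation.
    fused = list(image_emb) + list(text_emb)
    logits = linear(fused, weights, bias)
    best = max(range(len(logits)), key=logits.__getitem__)
    return vocab[best]


# Toy values for illustration only; a real model would use encoder outputs
# and trained weights.
ANSWER_VOCAB = ["yes", "no", "unanswerable"]
WEIGHTS = [
    [0.5, 0.0, 0.0, 0.1],  # row scoring "yes"
    [0.0, 0.5, 0.0, 0.9],  # row scoring "no"
    [0.1, 0.1, 0.1, 0.1],  # row scoring "unanswerable"
]
BIAS = [0.0, 0.0, 0.0]

image_emb = [1.0, 0.0]  # placeholder for an image embedding
text_emb = [0.0, 1.0]   # placeholder for a question embedding

answer = predict_answer(image_emb, text_emb, WEIGHTS, BIAS, ANSWER_VOCAB)
print(answer)
```

Because the output is an index into a curated answer list rather than generated text, the model can only return answers that appear in that vocabulary, which is why the choice of vocabulary matters.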
A web app for both text-based and visual question answering.