A Python-based tool for analyzing images and generating structured metadata in CSV format. This tool processes images to extract meaningful information and organizes it into a structured format for easy integration with other systems.
- Image Analysis: Processes images to extract visual features and characteristics
- Metadata Generation: Creates structured data including:
- Image titles
- Alt text descriptions
- SEO-optimized descriptions
- Scene classification (indoor, outdoor, abstract, etc.)
- Visual cohesion scores
- Relevant tags and categories
- CSV Export: Saves all generated metadata in a structured CSV format
- Batch Processing: Handles multiple images in a single run
- Progress Tracking: Shows real-time progress of image processing
- Python 3.8+
- PyTorch 2.0+
- CUDA-capable GPU (recommended)
- 8GB+ RAM
- Clone the repository:
git clone https://github.com/tatianathevisionary/poster-analysis.git
cd poster-analysis- Create and activate a virtual environment:
python -m venv .venv
source .venv/bin/activate # On Windows: .venv\Scripts\activate- Install dependencies:
pip install -r requirements.txt- Test the installation:
python test_installation.py- Place your images in the
input/directory - Run the analysis script:
python inference_ram.py- Check the results:
- Generated metadata will be saved in
output/metadata.csv - Logs are available in
poster_analysis.log
The generated CSV file (output/metadata.csv) includes the following columns:
image_path: Path to the processed imagetitle: Generated title for the imagealt_text: Descriptive alt text for accessibilityseo_description: SEO-optimized descriptiontags: Comma-separated list of relevant tagsscene_type: Classification of the scene (indoor, outdoor, abstract, etc.)cohesion_score: Numerical score indicating visual harmony (0-1)
poster-analysis/
├── input/ # Place your images here
├── output/ # Generated metadata and results
├── ram/ # RAM model files
├── pretrained/ # Pretrained models
├── inference_ram.py # Main analysis script
├── test_installation.py
├── requirements.txt
├── README.md
└── LICENSE
- Fork the repository
- Create a feature branch
- Commit your changes
- Push to the branch
- Create a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
For support, please open an issue in the GitHub repository or contact support@metaposters.ai