All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Initial release of Research Paper Extractor
- Command-line interface for searching and downloading arXiv papers
- Support for searching by keywords, topics, and phrases
- Author-based search functionality
- Category filtering for arXiv categories
- Recent papers search (last N days)
- Batch download capabilities
- Specific paper download by arXiv ID
- Preview mode for search results
- Interactive mode for guided searching
- Automatic topic-based folder organization
- Smart file naming and sanitization
- Progress bars for downloads
- Comprehensive error handling and logging
- Configuration file for customizable settings
- Smart Search: Search papers by keywords, topics, or phrases
- Author Search: Find all papers by specific researchers
- Category Filtering: Filter by arXiv categories (AI, ML, Computer Vision, etc.)
- Recent Papers: Find papers published in the last N days
- Batch Download: Download multiple papers at once
- Specific Downloads: Download papers by arXiv ID
- Preview Mode: See search results before downloading
- Interactive Mode: User-friendly interactive interface
- Auto-Organization: Automatic topic-based folder creation and file naming
- Topic Folders: Each search creates its own organized folder
- Built with Python 3.8+
- Uses arXiv's public API
- Respectful rate limiting for server requests
- Cross-platform compatibility
- Proper package structure and setup
- Comprehensive documentation
- requests >= 2.31.0
- feedparser >= 6.0.10
- beautifulsoup4 >= 4.12.2
- lxml >= 4.9.3
- tqdm >= 4.66.1
- click >= 8.1.7
- python-dateutil >= 2.8.2
- Web interface
- Database integration for paper management
- Citation export functionality
- Advanced search filters
- Paper metadata extraction
- Integration with reference managers
- Batch processing capabilities
- API rate limit management
- User preferences and settings
- Paper recommendation system
- Performance optimizations
- Better error handling
- More comprehensive testing
- CI/CD pipeline
- Docker containerization
- Performance monitoring
- User analytics
- Accessibility improvements