Skip to content

web application for comparing two documents and determining their similarity percentage using advanced AI algorithms. Supports PDF, DOC, DOCX, and TXT files with a beautiful glassmorphism UI design.

Notifications You must be signed in to change notification settings

skp3214/Doc-Similarity-Checker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

23 Commits
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ“„ Document Similarity Checker

๐Ÿ” A modern web application for comparing two documents and determining their similarity percentage using advanced AI algorithms. Supports PDF, DOC, DOCX, and TXT files with a beautiful glassmorphism UI design.


โœจ Features

๐Ÿ“„ Document Comparison

โœ… Upload Two Documents - Support for PDF, DOC, DOCX, and TXT files โœ… AI-Powered Analysis - Advanced natural language processing using spaCy โœ… Similarity Scoring - Get accurate percentage similarity between documents โœ… Text Extraction - Automatic text extraction from various file formats โœ… Real-time Results - Instant similarity analysis and scoring

๐ŸŽจ Modern UI/UX

โœ… Glassmorphism Design - Beautiful frosted glass effects โœ… Dark/Light Mode - Automatic theme detection with manual toggle โœ… Responsive Design - Works perfectly on all devices โœ… Smooth Animations - Modern transitions and hover effects โœ… Gradient Cards - Visual differentiation with multiple gradient styles

๐Ÿ‘ค User System

โœ… User Registration & Login - Secure authentication system โœ… Profile Management - User dashboard and settings โœ… Session Management - Secure user sessions


๐Ÿ› ๏ธ Tech Stack

Backend

  • Django - Python web framework
  • spaCy - Natural language processing
  • NLTK - Text processing toolkit
  • scikit-learn - Machine learning algorithms

Frontend

  • HTML5 - Semantic markup
  • CSS3 - Modern styling with custom properties
  • JavaScript - Interactive theme management
  • Font Awesome - Beautiful icons

AI & Analysis

  • TF-IDF Vectorization - Text similarity analysis
  • Cosine Similarity - Document comparison algorithm
  • Text Preprocessing - Tokenization and lemmatization

โœจ ScreenShots

Screenshot_4-9-2025_13420_doc-similarity-checker onrender com Screenshot_4-9-2025_134639_doc-similarity-checker onrender com Screenshot_4-9-2025_134758_doc-similarity-checker onrender com

๐Ÿš€ How It Works

  1. ๐Ÿ“ค Upload Documents - Select two files to compare (PDF, DOC, DOCX, or TXT)
  2. ๐Ÿค– AI Analysis - Advanced algorithms process and analyze the text content
  3. ๐Ÿ“Š Get Results - Receive similarity percentage and detailed analysis
  4. ๐ŸŽจ Modern UI - Enjoy the beautiful interface with theme switching

๐Ÿ“‹ Requirements

  • Python 3.8+
  • Django 4.0+
  • spaCy with English model
  • NLTK data packages
  • PyPDF2 for PDF processing
  • python-docx for Word document processing

โšก Quick Start

Installation

# Clone the repository
git clone https://github.com/skp3214/document-similarity-checker.git
cd document-similarity-checker

# Create virtual environment
python -m venv venv
venv\Scripts\activate  # Windows
# source venv/bin/activate  # Linux/Mac

# Install dependencies
pip install -r requirements.txt

# Download NLTK data
python -c "import nltk; nltk.download('punkt'); nltk.download('stopwords'); nltk.download('wordnet')"

# Download spaCy model
python -m spacy download en_core_web_sm

Setup

# Navigate to project directory
cd Doc_Scanner_Matcher

# Run migrations
python manage.py migrate

# Create superuser (optional)
python manage.py createsuperuser

# Start development server
python manage.py runserver

Access

Open your browser and go to: http://127.0.0.1:8000/


๐Ÿ“– Usage

For Users

  1. ๐Ÿ  Home Page - Landing page with modern design
  2. ๐Ÿ“ค Upload Documents - Select two files to compare
  3. ๐Ÿ“Š View Results - See similarity percentage and analysis
  4. ๐Ÿ‘ค Profile - Manage your account and settings
  5. ๐ŸŒ™ Theme Toggle - Switch between light and dark modes

Supported File Formats

  • ๐Ÿ“„ PDF - Portable Document Format
  • ๐Ÿ“ DOC/DOCX - Microsoft Word documents
  • ๐Ÿ“ƒ TXT - Plain text files

AI Analysis Features

  • ๐Ÿ” Text Extraction - Automatic content extraction
  • ๐Ÿง  Semantic Analysis - Understanding document meaning
  • ๐Ÿ“ˆ Similarity Scoring - Percentage-based comparison
  • ๐ŸŽฏ Content Matching - Advanced text comparison algorithms

๐ŸŽจ UI Features

Modern Design Elements

  • Glassmorphism - Frosted glass effects with backdrop blur
  • Gradient Cards - Multiple gradient styles for visual appeal
  • Smooth Animations - Hover effects and page transitions
  • Responsive Layout - Optimized for all screen sizes

Theme System

  • ๐ŸŒž Light Mode - Clean, bright interface
  • ๐ŸŒ™ Dark Mode - Easy on the eyes with modern aesthetics
  • Auto Detection - Respects system preferences
  • Manual Toggle - One-click theme switching

๐Ÿ“ฌ Contact & Support

For questions, feedback, or contributions:

๐Ÿ“ง Email: spsm1818@gmail.com
๐Ÿ™ GitHub: skp3214

๐Ÿ› Bug Reports & Feature Requests

  • Use GitHub Issues for bug reports
  • Feature requests and UI/UX suggestions are welcome!
  • Pull requests for improvements are encouraged

๐Ÿท๏ธ Project Status

โœ… Current Version: v2.0

  • ๐ŸŽจ Modern UI Complete - Glassmorphism design with dark/light mode
  • ๐Ÿ”ง Fully Functional - Document comparison working perfectly
  • ๐Ÿ“ฑ Mobile Responsive - Optimized for all screen sizes
  • โ™ฟ Accessible - WCAG compliant design
  • ๐Ÿš€ Production Ready - Optimized for deployment

๐Ÿ”ฎ Future Enhancements

  • ๐ŸŒ Multi-language Support (i18n)
  • ๐Ÿ“Š Advanced Analytics with comparison history
  • ๐Ÿ”„ Batch Processing for multiple document pairs
  • ๐Ÿ“ฑ Progressive Web App (PWA) features
  • ๐ŸŽฏ API Endpoints for third-party integrations

๐Ÿš€ Happy Coding! ๐Ÿ˜Š

Built with โค๏ธ using Django, modern CSS, and AI-powered NLP

About

web application for comparing two documents and determining their similarity percentage using advanced AI algorithms. Supports PDF, DOC, DOCX, and TXT files with a beautiful glassmorphism UI design.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published