I design and build end-to-end AI systems that operate under uncertainty β focusing on evaluation, reliability, and real-world constraints rather than demos or toy benchmarks.
I work at the intersection of AI research and production engineering, with a focus on systems that must:
- Handle noisy, imperfect real-world inputs
- Produce deterministic, explainable outputs
- Be evaluated, measured, and stress-tested β not just generated
My work emphasizes architecture, evaluation, and failure modes over surface-level demos.
- Layout-aware OpticalRAG system for legal & procurement documents
- OCR β layout parsing β retrieval β LLM reasoning β deterministic risk scoring
- Designed for noisy PDFs, ambiguous clauses, and explainable outputs
- Includes tests, failure handling, metrics, and end-to-end execution artifacts
Focus: document understanding, evaluation under noise, system reliability
- Agent-based interview simulation and evaluation pipeline
- Generates structured, rubric-driven feedback (clarity, confidence, correctness)
- Emphasizes evaluation logic, scoring consistency, and failure analysis
- Produces machine-readable reports from live or scripted inputs
Focus: human evaluation, subjective signal modeling, scoring systems
- Real-time traffic intelligence pipeline
- Detection β tracking β violation logic with latency, FPS, and memory profiling
- Built with performance constraints and system metrics as first-class concerns
Focus: edge vision, performance engineering, real-time systems
- Pragya-OS: Exploration of agent-native operating systems for long-horizon planning, memory, and reflection
- Animark-AI: Experimental multimodal generation pipelines for visual storytelling and advertising
(Research explorations β not positioned as finished products)
Instead of videos, my projects emphasize reproducible proof:
- End-to-end input β output examples
- Deterministic JSON / report outputs
- Latency and pipeline metrics
- Explicit failure cases and trade-off documentation
These artifacts are available inside each repository under /examples, /docs, and /assets.
Modeling & Research
Python β’ PyTorch β’ TensorFlow β’ Scikit-learn β’ OpenCV β’ Pandas
Systems & MLOps
Docker β’ Linux β’ AWS β’ GCP β’ Git β’ MongoDB β’ MySQL
I write about deep learning fundamentals, optimization, and applied AI:
- CNNs and modern computer vision
- Optimizers, normalization, and training dynamics
- Practical trade-offs in real-world ML systems
π Articles published on Artificial Intelligence in Plain English and Pythonβs Gurus.
- Applied ML / AI Systems Engineering roles
- Evaluation-driven AI and agentic systems
- Multimodal intelligence under real-world constraints
If youβre interested in how systems behave when things go wrong, weβll get along well.


