Skip to content
Change the repository type filter

All

    Repositories list

    • Video-o3

      Public
      Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
      Python
      05800Updated Feb 16, 2026Feb 16, 2026
    • LongVPO

      Public
      [NeurIPS 2025] LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization
      Python
      0200Updated Feb 4, 2026Feb 4, 2026
    • AMD

      Public
      [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models
      Python
      11800Updated Jan 11, 2026Jan 11, 2026
    • HTML
      0200Updated Jan 3, 2026Jan 3, 2026
    • SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
      Python
      3657760Updated Dec 23, 2025Dec 23, 2025
    • UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions
      Python
      24320Updated Dec 16, 2025Dec 16, 2025
    • SAM2-Plus

      Public
      SAM 2++: Tracking Anything at Any Granularity
      Python
      55400Updated Dec 15, 2025Dec 15, 2025
    • UniAVGen

      Public
      HTML
      0400Updated Dec 14, 2025Dec 14, 2025
    • [ICCV 2025] MobileViCLIP: An Efficient Video-Text Model for Mobile Devices
      Python
      11640Updated Dec 11, 2025Dec 11, 2025
    • PixNerd

      Public
      [ICLR 2026] PixNerd: Pixel Neural Field Diffusion
      Python
      517050Updated Dec 10, 2025Dec 10, 2025
    • FlowBack

      Public
      [AAAI 2026] Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment
      Python
      01310Updated Dec 9, 2025Dec 9, 2025
    • RGE

      Public
      Reasoning Guided Embeddings: Leveraging MLLM Reasoning for Improved Multimodal Retrieval
      Python
      01000Updated Nov 29, 2025Nov 29, 2025
    • JavaScript
      0200Updated Nov 25, 2025Nov 25, 2025
    • [NeurIPS 2025 Spotlight] StreamForest: Efficient Online Video Understanding with Persistent Event Memory
      Python
      414850Updated Nov 4, 2025Nov 4, 2025
    • [TPAMI] JointFormer: A Unified Framework with Joint Modeling for Video Object Segmentation
      Python
      01100Updated Oct 21, 2025Oct 21, 2025
    • MeMOTR

      Public
      [ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking
      Python
      1821640Updated Oct 15, 2025Oct 15, 2025
    • MotionRAG

      Public
      [NeurIPS 2025] MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation
      Python
      42230Updated Oct 9, 2025Oct 9, 2025
    • [CVPR 2025] Online Video Understanding: OVBench and VideoChat-Online
      Python
      589100Updated Oct 7, 2025Oct 7, 2025
    • JavaScript
      0000Updated Oct 2, 2025Oct 2, 2025
    • CycleACR

      Public
      [TPAMI-2025] CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection
      Python
      0300Updated Sep 11, 2025Sep 11, 2025
    • DDT

      Public
      DDT: Decoupled Diffusion Transformer
      Python
      1736250Updated Aug 22, 2025Aug 22, 2025
    • MOTIP

      Public
      [CVPR 2025] Multiple Object Tracking as ID Prediction
      Python
      3847590Updated Aug 20, 2025Aug 20, 2025
    • VideoEval

      Public
      VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
      Python
      01400Updated Jul 31, 2025Jul 31, 2025
    • Video-DC

      Public
      Python
      11110Updated Jul 30, 2025Jul 30, 2025
    • CaReBench

      Public
      A Fine-grained Benchmark for Video Captioning and Retrieval
      Python
      12640Updated Jul 16, 2025Jul 16, 2025
    • [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling
      Python
      02110Updated Jul 7, 2025Jul 7, 2025
    • p-MoD

      Public
      [ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
      Python
      24310Updated Jun 26, 2025Jun 26, 2025
    • DEQDet

      Public
      [ICCV 2023] Deep Equilibrium Object Detection
      Jupyter Notebook
      12710Updated Jun 18, 2025Jun 18, 2025
    • SORCE

      Public
      Small Object Retrieval in Complex Environments (SORCE)
      Python
      1500Updated Jun 2, 2025Jun 2, 2025
    • DMM

      Public
      DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
      Python
      44730Updated Apr 27, 2025Apr 27, 2025