NICS-EFC Lab of Tsinghua University

All

32 repositories

C2C
Public
[ICLR'26] The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"
multi-agent kv-cache llm
Python
•
Apache License 2.0
•37•348•0•0•Updated Feb 21, 2026Feb 21, 2026
MARSHAL
Public
MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs
agent reinforcement-learning multi-agent-systems self-play llm
Python
•
Apache License 2.0
•1•38•0•0•Updated Feb 19, 2026Feb 19, 2026
db-SP
Public
This repository contains the official implementation of db-SP, a sparsity-aware sequence parallelism strategy designed to accelerate sparse attention in visual …
Python
•1•2•0•0•Updated Feb 12, 2026Feb 12, 2026
R2R
Public
[NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"
Python
•
Apache License 2.0
•11•78•1•0•Updated Feb 10, 2026Feb 10, 2026
CLAP-triangle-counting
Public
[DATE'23] The official code for paper <CLAP: Locality Aware and Parallel Triangle Counting with Content Addressable Memory>
architecture triangle-counting content-addressable-storage
C++
•
Apache License 2.0
•0•23•0•1•Updated Jan 19, 2026Jan 19, 2026
UniNDP
Public
Github repository of HPCA 2025 paper "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"
Python
•
MIT License
•12•19•0•0•Updated Jan 18, 2026Jan 18, 2026
MoA
Public
[CoLM'25] The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>
model-compression sparse-attention large-language-models
Python
•
MIT License
•8•154•0•0•Updated Jan 14, 2026Jan 14, 2026
FrameFusion
Public
[ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"
video efficient-deep-learning llm lvlm
Python
•
MIT License
•1•69•1•0•Updated Jan 13, 2026Jan 13, 2026
TaH
Public
Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"
Python
•
Apache License 2.0
•10•65•0•0•Updated Jan 13, 2026Jan 13, 2026
AED
Public
an automatic, effective, and diverse vulnerability discovery framework for autonomous driving policies
Python
•
MIT License
•0•0•0•0•Updated Nov 30, 2025Nov 30, 2025
DiTFastAttnV2
Public
Python
•0•9•2•0•Updated Oct 23, 2025Oct 23, 2025
USF
Public
The official code of paper "A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models" (ICLR24)
Jupyter Notebook
•0•3•0•0•Updated Sep 27, 2025Sep 27, 2025
NIPA
Public
Python
•0•0•0•0•Updated Sep 26, 2025Sep 26, 2025
FrameFusion_Project_Page
Public
JavaScript
•3•0•0•0•Updated Aug 16, 2025Aug 16, 2025
PM-KVQ
Public
The official code implementation for paper "PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs"
Python
•3•19•0•0•Updated May 24, 2025May 24, 2025
VGDFR
Public
VGDFR: Diffuison-based Video Generation with Dynamic Frame Rate
Python
•
MIT License
•0•17•1•0•Updated May 16, 2025May 16, 2025
ViDiT-Q
Public
[ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
quantization mixed-precision diffusion-models efficientml
Python
•24•149•24•1•Updated Mar 21, 2025Mar 21, 2025
MBQ
Public
The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"
Python
•
MIT License
•3•76•10•0•Updated Mar 17, 2025Mar 17, 2025
DLFR-VAE
Public
0•11•1•0•Updated Feb 18, 2025Feb 18, 2025
DiTFastAttn
Public
Jupyter Notebook
•
MIT License
•10•190•10•0•Updated Jan 14, 2025Jan 14, 2025
MNSIM-2.0
Public
A Behavior-Level Modeling Tool for Memristor-based Neuromorphic Computing Systems
mnsim-python
Python
•58•195•7•1•Updated Nov 27, 2024Nov 27, 2024
MixDQ
Public
[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
efficient quantization mixed-precision diffusion-models
Python
•5•49•13•0•Updated Nov 27, 2024Nov 27, 2024
Rad-NeRF
Public
[NeurIPS24] Rad-NeRF: Ray-decoupled Training of Neural Radiance Field
Python
•0•7•0•0•Updated Nov 9, 2024Nov 9, 2024
MoA_project_page
Public
JavaScript
•0•0•0•0•Updated Nov 8, 2024Nov 8, 2024
effgenai-workshop.top
Public
SCSS
•0•0•0•0•Updated Oct 19, 2024Oct 19, 2024
MoA_Kernel
Public
The official CUDA kernel implementation for Mixture of Sparse Attention
Cuda
•
Apache License 2.0
•0•6•1•0•Updated Oct 9, 2024Oct 9, 2024
qllm-eval
Public
Code Repository of Evaluating Quantized Large Language Models
Python
•
MIT License
•10•135•5•0•Updated Sep 8, 2024Sep 8, 2024
FlashEval
Public
Python
•1•14•1•0•Updated Aug 9, 2024Aug 9, 2024
RTL_library_of_basic_hardware_units
Public
Here are some mplementations of some basic hardware units in RTL language (verilog for now), which can be used for area/power evaluation and support the hardwar…
MIT License
•9•0•0•0•Updated May 11, 2023May 11, 2023
nicsefc-readme
Public
some docs for rookies in nics-efc
7•22•0•1•Updated Mar 17, 2022Mar 17, 2022