Lei Wang edited this page Feb 6, 2026 · 9 revisions

CUTracer is an NVBit-based CUDA binary instrumentation tool. It cleanly separates lightweight data collection (instrumentation) from host-side processing (analysis). Typical workflows include per-warp instruction histograms between clock-cycle markers and kernel hang detection.

🎯 Start here:

📚 Configuration and reference:

🔧 Architecture and development:

🧪 Examples and help:

If you only have 5 minutes, read Quickstart and Post-processing: IPC Merge.

Welcome to the CUTracer wiki!
