Large Language Models (LLMs) have revolutionized code generation, achieving exceptional results on various established benchmarking frameworks. However, concerns about data contamination (i.e., benchmark data inadvertently leaking into pre-training or fine-tuning datasets) raise questions about the validity of these evaluations. While this issue is well known for software coding, where it limits the industrial adoption of LLM-driven software engineering, hardware coding has received little to no attention regarding these risks. For the first time, we analyze state-of-the-art (SOTA) evaluation frameworks for Verilog code generation (VerilogEval and RTLLM) using established contamination-detection methods (CDD and Min-K% Prob). We cover SOTA commercial and open-source LLMs (CodeGen2.5, Minitron 4b, Mistral 7b, phi-4 mini, LLaMA-{1,2,3.1}, GPT-{2,3.5,4o}, Deepseek-Coder, and CodeQwen 1.5), both as baselines and as fine-tuned models (RTLCoder and Verigen). Our study confirms that data contamination is a critical concern. We explore mitigations and the resulting trade-offs between code quality and fairness (i.e., reducing contamination toward unbiased benchmarking).
Authors: Zeng Wang*, Minghao Shao*, Jitendra Bhandari, Likhitha Mankali, Ramesh Karri, Ozgur Sinanoglu, Muhammad Shafique, Johann Knechtel
Paper Link: VeriContaminated
[update]: We will upload the Colab notebooks for Verilog-related CDD, Min-K%, and TED evaluation by 10/04/2025.
[update]: Our paper has been accepted at the IEEE International Conference on LLM-Aided Design (ICLAD), 2025.
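As a primer on the two detectors used in the paper, here is a minimal sketch of the Min-K% Prob membership test (Shi et al.) on a Verilog snippet, assuming a HuggingFace causal LM; the model name, `k` value, and snippet are illustrative placeholders, not the paper's exact setup.

```python
# Minimal sketch of Min-K% Prob, assuming a HuggingFace causal LM.
# Model name, k, and snippet are illustrative, not the paper's configuration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def min_k_prob(text: str, model, tokenizer, k: float = 0.2) -> float:
    """Average log-probability of the k% least-likely tokens in `text`.
    Memorized (contaminated) text tends to contain few surprising tokens,
    so a higher score hints that the text was seen during training."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits
    # Log-probability the model assigns to each actual next token.
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    token_lp = log_probs.gather(1, ids[0, 1:].unsqueeze(-1)).squeeze(-1)
    # Average over only the bottom k% of token log-probs.
    n = max(1, int(len(token_lp) * k))
    return token_lp.topk(n, largest=False).values.mean().item()

if __name__ == "__main__":
    name = "gpt2"  # stand-in; swap in any causal LM under study
    tok = AutoTokenizer.from_pretrained(name)
    lm = AutoModelForCausalLM.from_pretrained(name).eval()
    snippet = "module adder(input [3:0] a, b, output [4:0] sum); assign sum = a + b; endmodule"
    # Compare against a threshold calibrated on provably unseen Verilog.
    print(f"Min-K% Prob score: {min_k_prob(snippet, lm, tok):.3f}")
```

And a simplified sketch of the CDD intuition (Dong et al.): a model that has memorized a benchmark prompt produces sampled completions that cluster tightly, in edit distance, around its greedy completion. The `alpha` threshold and toy outputs below are illustrative.

```python
# Simplified sketch of the CDD idea: contamination shows up as a peaked
# output distribution, i.e., sampled completions sit close to the greedy one.
from typing import List

def edit_distance(a: str, b: str) -> int:
    """Classic Levenshtein distance via dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]

def cdd_peakedness(greedy: str, samples: List[str], alpha: float = 0.05) -> float:
    """Fraction of sampled outputs within a small normalized edit distance
    of the greedy output; values near 1.0 suggest contamination."""
    close = 0
    for s in samples:
        d = edit_distance(greedy, s) / max(len(greedy), len(s), 1)
        close += d <= alpha
    return close / len(samples)

if __name__ == "__main__":
    greedy = "assign sum = a + b;"
    samples = ["assign sum = a + b;", "assign sum = a+b;", "assign sum = b + a;"]
    print(f"peakedness: {cdd_peakedness(greedy, samples):.2f}")
```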
```bibtex
@article{wang2025vericontaminated,
  title={VeriContaminated: Assessing LLM-driven Verilog coding for data contamination},
  author={Wang, Zeng and Shao, Minghao and Bhandari, Jitendra and Mankali, Likhitha and Karri, Ramesh and Sinanoglu, Ozgur and Shafique, Muhammad and Knechtel, Johann},
  journal={IEEE International Conference on LLM-Aided Design (ICLAD)},
  year={2025}
}
```