Skip to content

DfX-NYUAD/VeriContaminated

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 

Repository files navigation

VeriContaminated: Assessing LLM-Driven Verilog Coding for Data Contamination

Abstract

Large Language Models (LLMs) have revolutionized code generation, achieving exceptional results on various established benchmarking frameworks. However, concerns about data contamination - where benchmark data inadvertently leaks into pre-training or fine-tuning datasets - raise questions about the validity of these evaluations. While this issue is known, limiting the industrial adoption of LLM-driven software engineering, hardware coding has received little to no attention regarding these risks. For the first time, we analyze state-of-the-art (SOTA) evaluation frameworks for Verilog code generation (VerilogEval and RTLLM), using established methods for contamination detection (CCD and Min-K% Prob). We cover SOTA commercial and open-source LLMs (CodeGen2.5, Minitron 4b, Mistral 7b, phi-4 mini, LLaMA-{1,2,3.1}, GPT-{2,3.5,4o}, Deepseek-Coder, and CodeQwen 1.5), in baseline and fine-tuned models (RTLCoder and Verigen). Our study confirms that data contamination is a critical concern. We explore mitigations and the resulting trade-offs for code quality vs fairness (i.e., reducing contamination toward unbiased benchmarking).

Author: Zeng Wang*, Minghao Shao*, Jitendra Bhandari, Likhitha Mankali, Ramesh Karri, Ozgur Sinanoglu, Muhammad Shafique, Johann Knechtel

Paper Link: VeriContaminated

image

Updates

[update]: upload the Colab for Verilog-related CDD, Min-K% and TED evaluation by 10/04/2025.

[update]: Paper has been accepted by IEEE International Conference on LLM-Aided Design (ICLAD), 2025

Citations

@article{wang2025vericontaminated,
  title={VeriContaminated: Assessing LLM-driven Verilog coding for data contamination},
  author={Wang, Zeng and Shao, Minghao and Bhandari, Jitendra and Mankali, Likhitha and Karri, Ramesh and Sinanoglu, Ozgur and Shafique, Muhammad and Knechtel, Johann},
  journal={IEEE International Conference on LLM-Aided Design (ICLAD)},
  year={2025}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published