emx-onnx-cgen compiles ONNX models to portable, deterministic C code for deeply embedded systems. The generated code is designed to run without dynamic memory allocation, operating-system services, or external runtimes, making it suitable for safety-critical and resource-constrained targets.
Key characteristics:
- No dynamic memory allocation (`malloc`, `free`, heap usage)
- Static, compile-time known memory layout for parameters, activations, and temporaries
- Deterministic control flow (explicit loops, no hidden dispatch or callbacks)
- No OS dependencies, using only standard C headers (for example, `stdint.h` and `stddef.h`)
- Single-threaded execution model
- Bitwise-stable code generation for reproducible builds
- Readable, auditable C code suitable for certification and code reviews
- Generated C output format spec: `docs/output-format.md`
- Designed for bare-metal and RTOS-based systems
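As a rough illustration of this style, consider the following hand-written sketch of a tiny fully connected layer compiled to allocation-free C. It is not actual emx-onnx-cgen output; the model, names, and shapes are invented for this example.

```c
/* Hand-written sketch of the generated code style: only standard headers,
   weights as static const data, fixed shapes, explicit loop nests,
   no heap, no OS calls, no callbacks. */
#include <stddef.h>
#include <stdint.h>

#define IN_DIM  8
#define OUT_DIM 4

static const float dense_weight[OUT_DIM][IN_DIM] = {{0.0f}}; /* compiled-in parameters */
static const float dense_bias[OUT_DIM]           = {0.0f};   /* compiled-in parameters */

void model_infer(const float input[IN_DIM], float output[OUT_DIM])
{
    for (size_t o = 0; o < OUT_DIM; ++o) {
        float acc = dense_bias[o];
        for (size_t i = 0; i < IN_DIM; ++i) {
            acc += dense_weight[o][i] * input[i];
        }
        output[o] = acc;
    }
}
```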
For PyTorch models, see the related project emx-pytorch-cgen.
- Correctness-first compilation with outputs comparable to ONNX Runtime.
- Deterministic and reproducible C code generation.
- Clean, pass-based compiler architecture (import → normalize → optimize → lower → emit).
- Minimal C runtime with explicit, predictable data movement.
- Aggressive performance optimizations in generated C.
- Implicit runtime dependencies or dynamic loading.
- Training/backpropagation support.
- CLI for ONNX-to-C compilation and verification.
- Deterministic codegen with explicit tensor shapes and loop nests.
- Minimal C runtime templates in `src/emx_onnx_cgen/templates/`.
- ONNX Runtime comparison for end-to-end validation.
- Official ONNX operator coverage tracking.
- Support for a wide range of ONNX operators (see `SUPPORT_OPS.md`).
- Supported data types (interface conventions are sketched after this list):
  - `bfloat16`, `float16`, `float`, `double`
  - `int8`, `uint8`, `int16`, `uint16`, `int32`, `uint32`, `int64`, `uint64`
  - `bool`
  - `string` (fixed-size `'\0'`-terminated C strings; see `docs/output-format.md`)
  - `optional(<tensor type>)` (optional tensors represented via an extra `_Bool <name>_present` flag; see `docs/output-format.md`)
- Optional support for dynamic dimensions using C99 variable-length arrays (VLAs), when the target compiler supports them.
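The following hypothetical prototypes sketch how the string, optional, and dynamic-dimension cases above can surface in a generated interface. The names and shapes are invented for this illustration; `docs/output-format.md` remains the authoritative description.

```c
/* Hypothetical interface sketch; see docs/output-format.md for the actual
   conventions used by the generated code. */
#include <stddef.h>

/* optional(float tensor) input: tensor data plus an extra _Bool <name>_present
   flag; string output: fixed-size, '\0'-terminated C strings. */
void classify(const float features[16], _Bool features_present,
              char label_out[1][32]);

/* Dynamic leading dimension via a C99 VLA parameter, when the target
   compiler supports VLAs. */
void classify_batch(size_t n, const float features[n][16],
                    char labels_out[n][32]);
```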
The generated C code can be embedded directly into bare-metal C firmware or an application in which all model weights and parameters are compiled into the C source.
Typical characteristics:
- No file system or OS required.
- All weights stored as
static constarrays in flash/ROM. - Deterministic memory usage with no runtime allocation.
- Suitable for:
- Microcontrollers
- Safety-critical firmware
- Systems with strict certification requirements
This scenario is enabled via `--large-weight-threshold 0`, forcing all weights to be embedded directly into the generated C code.
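A minimal sketch of how such a fully self-contained model might be driven from bare-metal firmware follows. `model_infer`, `read_sensors`, `act_on`, and the buffer shapes are hypothetical names for this illustration, not the actual generated API.

```c
/* Hypothetical bare-metal integration: the generated translation unit holds
   all weights as static const data, so the firmware only supplies statically
   allocated I/O buffers and calls the entry point from its main loop. */
#include <stddef.h>

void model_infer(const float input[8], float output[4]); /* from generated model.c */
void read_sensors(float input[8]);                       /* application code */
void act_on(const float output[4]);                      /* application code */

static float input_buf[8];
static float output_buf[4];

void firmware_main_loop(void)
{
    for (;;) {
        read_sensors(input_buf);
        model_infer(input_buf, output_buf); /* deterministic, allocation-free */
        act_on(output_buf);
    }
}
```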
The generated C code can be embedded into C or C++ applications where large model weights are stored externally and loaded from a binary file at runtime.
Typical characteristics:
- Code and control logic compiled into the application.
- Large constant tensors packed into a separate `.bin` file.
- Explicit, generated loader functions handle weight initialization.
- Suitable for:
- Embedded Linux or RTOS systems
- Applications with limited flash but available external storage
- Larger models where code size must be minimized
This scenario is enabled automatically once the cumulative weight size exceeds `--large-weight-threshold` (default: 102400 bytes).
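As a sketch of this flow: the application reads the weight blob and hands it to a loader before running inference. `model_load_weights`, `model_infer`, and the file name are hypothetical placeholders invented for this example, not the actual generated loader interface.

```c
/* Hypothetical external-weights flow on an OS-based target. Whether the blob
   must stay resident during inference depends on the generated loader; here
   it is assumed to be needed only at load time. */
#include <stdio.h>
#include <stdlib.h>

int  model_load_weights(const void *blob, size_t size); /* generated (hypothetical) */
void model_infer(const float *input, float *output);    /* generated (hypothetical) */

int run_model(const float *input, float *output)
{
    FILE *f = fopen("model_weights.bin", "rb");
    if (!f) return -1;
    fseek(f, 0, SEEK_END);
    long size = ftell(f);
    fseek(f, 0, SEEK_SET);

    void *blob = (size > 0) ? malloc((size_t)size) : NULL;
    if (!blob || fread(blob, 1, (size_t)size, f) != (size_t)size) {
        fclose(f);
        free(blob);
        return -1;
    }
    fclose(f);

    if (model_load_weights(blob, (size_t)size) != 0) {
        free(blob);
        return -1;
    }
    model_infer(input, output);
    free(blob);
    return 0;
}
```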
In both of the above scenarios, the generated C code can serve as input to emmtrix source-to-source compilation and optimization tools, enabling target-specific optimizations while preserving functional correctness.
Examples of applied transformations include:
- Kernel fusion and loop restructuring
- Memory layout optimization and buffer reuse
- Reduction of internal temporary memory
- Utilization of SIMD / vector instruction sets
- Offloading of large weights to external memory
- Dynamic loading of weights or activations via DMA
This workflow allows a clear separation between:
- Correctness-first, deterministic ONNX lowering, and
- Target-specific performance and memory optimization,
while keeping the generated C code readable, auditable, and traceable.
The generated C code is intentionally structured to make such transformations explicit and analyzable, rather than relying on opaque backend-specific code generation.
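As a schematic, hand-written illustration of what "kernel fusion" and "reduction of internal temporary memory" can mean at the C level (this is not output of the emmtrix tools):

```c
/* Before: two elementwise kernels with a full-size temporary between them. */
void scale_relu_unfused(const float x[1024], const float s[1024], float y[1024])
{
    static float tmp[1024]; /* internal temporary buffer */
    for (int i = 0; i < 1024; ++i) {
        tmp[i] = x[i] * s[i];
    }
    for (int i = 0; i < 1024; ++i) {
        y[i] = tmp[i] > 0.0f ? tmp[i] : 0.0f;
    }
}

/* After fusion: one loop, and the temporary shrinks to a single scalar. */
void scale_relu_fused(const float x[1024], const float s[1024], float y[1024])
{
    for (int i = 0; i < 1024; ++i) {
        float t = x[i] * s[i];
        y[i] = t > 0.0f ? t : 0.0f;
    }
}
```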
Install the package directly from PyPI (recommended):

```
pip install emx-onnx-cgen
```

Required at runtime (both compile and verify):

- `onnx`
- `numpy`
- `jinja2`

Optional for verification and tests:

- `onnxruntime`
- A C compiler (`cc`, `gcc`, `clang`, or via `--cc`)
Compile an ONNX model into a C source file:

```
emx-onnx-cgen compile path/to/model.onnx build/model.c
```

Verify an ONNX model end-to-end against ONNX Runtime (default):

```
emx-onnx-cgen verify path/to/model.onnx
```

emx-onnx-cgen provides two subcommands: `compile` and `verify`.

These options are accepted by both `compile` and `verify`:
- `--model-base-dir`: Base directory for resolving the model path (and related paths).
- `--color`: Colorize CLI output (`auto`, `always`, `never`; default: `auto`).
- `--large-weight-threshold`: Store weights in a binary file once the cumulative byte size exceeds this threshold (default: `102400`; set to `0` to disable).
- `--large-temp-threshold`: Mark temporary buffers larger than this threshold as static (default: `1024`).
- `--fp32-accumulation-strategy`: Accumulation strategy for float32 inputs (`simple` uses float32, `fp64` uses double; default: `fp64`; see the sketch below).
- `--fp16-accumulation-strategy`: Accumulation strategy for float16 inputs (`simple` uses float16, `fp32` uses float; default: `fp32`).
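As a minimal illustration of what the two float32 accumulation strategies mean for generated reductions, consider the following hand-written sketch; the kernel, its name, and its shape are invented and not actual tool output.

```c
/* Hypothetical reduction kernel illustrating --fp32-accumulation-strategy. */

/* "simple": accumulate in float32. */
float dot_simple(const float a[256], const float b[256])
{
    float acc = 0.0f;
    for (int i = 0; i < 256; ++i) {
        acc += a[i] * b[i];
    }
    return acc;
}

/* "fp64" (default): accumulate in double, round back to float32 at the end. */
float dot_fp64(const float a[256], const float b[256])
{
    double acc = 0.0;
    for (int i = 0; i < 256; ++i) {
        acc += (double)a[i] * (double)b[i];
    }
    return (float)acc;
}
```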
```
emx-onnx-cgen compile <model.onnx> <output.c> [options]
```

Options:

- `--model-name`: Override the generated model name (default: output file stem).
- `--emit-testbench`: Emit a JSON-producing `main()` testbench for validation.
- `--emit-data-file`: Emit constant data arrays into a companion `_data` C file.
- `--no-restrict-arrays`: Disable `restrict` qualifiers on generated array parameters (see the sketch below).
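The following hypothetical signatures sketch what the `restrict` option affects; the function names and shapes are illustrative only, not actual generated code.

```c
/* Hypothetical signatures illustrating --no-restrict-arrays. */

/* Default: restrict-qualified array parameters promise the C compiler that
   input and output buffers do not alias, enabling better optimization. */
void add_restrict(const float a[restrict 64], const float b[restrict 64],
                  float out[restrict 64]);

/* With --no-restrict-arrays: plain parameters, usable when callers may pass
   overlapping buffers. */
void add_plain(const float a[64], const float b[64], float out[64]);
```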
```
emx-onnx-cgen verify <model.onnx> [options]
```

Options:

- `--cc`: Explicit C compiler command for building the testbench binary.
- `--max-ulp`: Maximum allowed ULP distance for floating outputs (default: `100`).
- `--atol-eps`: Absolute tolerance as a multiple of machine epsilon for floating outputs (default: `1.0`).
- `--runtime`: Runtime backend for verification (`onnxruntime` or `onnx-reference`; default: `onnxruntime`).
- `--temp-dir-root`: Root directory in which to create a temporary verification directory (default: system temp dir).
- `--temp-dir`: Exact directory to use for temporary verification files (default: create a temporary directory).
- `--keep-temp-dir`: Keep the temporary verification directory instead of deleting it.
How verification works:
- Compile with a testbench: the compiler is invoked with `--emit-testbench`, generating a C program that runs the model and prints inputs/outputs as JSON.
- Build and execute: the testbench is compiled with the selected C compiler (`--cc`, `CC`, or a detected `cc`/`gcc`/`clang`) and executed in a temporary directory.
- Run the runtime backend: the JSON inputs from the testbench are fed to the selected runtime (`onnxruntime` or `onnx-reference`) using the same model. The compiler no longer ships a Python runtime evaluator.
- Compare outputs: floating-point outputs are compared by maximum ULP distance. Differences up to `--atol-eps` × machine epsilon of the evaluated floating-point type are treated as equal; for larger absolute differences, the ULP distance is computed and the maximum is reported (a sketch of this rule follows the list). Non-floating outputs must match exactly. Missing outputs or mismatches are treated as failures.
- ORT-unsupported models: when using `onnxruntime`, if ORT reports `NOT_IMPLEMENTED`, verification is skipped with a warning (exit code 0).
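The following sketch re-implements the comparison rule described above for the float32 case. It is a simplified illustration, not the tool's actual verification code; NaN handling and other edge cases are omitted.

```c
#include <float.h>
#include <math.h>
#include <stdint.h>
#include <string.h>

/* Map a float's bit pattern onto a scale that is monotonic in its value,
   so that adjacent representable floats differ by exactly 1. */
static int64_t ordered_bits(float x)
{
    int32_t b;
    memcpy(&b, &x, sizeof b);
    return (b < 0) ? (int64_t)INT32_MIN - b : (int64_t)b;
}

static int64_t ulp_distance(float a, float b)
{
    int64_t d = ordered_bits(a) - ordered_bits(b);
    return d < 0 ? -d : d;
}

/* Returns 1 if the pair passes under the documented rule, 0 otherwise. */
static int floats_match(float expected, float actual,
                        double atol_eps, int64_t max_ulp)
{
    /* Small absolute differences (up to atol_eps * machine epsilon) are
       treated as equal before any ULP comparison. */
    if (fabs((double)expected - (double)actual) <= atol_eps * (double)FLT_EPSILON) {
        return 1;
    }
    return ulp_distance(expected, actual) <= max_ulp;
}
```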
See `ONNX_SUPPORT.md` for the generated support matrix.
See `SUPPORT_OPS.md` for operator-level support derived from the expectation JSON files.
- emx-pytorch-cgen: A PyTorch-to-C compiler following the same design principles as emx-onnx-cgen, but operating directly on PyTorch models instead of ONNX graphs.
  https://github.com/emmtrix/emx-pytorch-cgen
- onnx2c: An ONNX-to-C code generator with a different design focus and code generation approach.
  https://github.com/kraiskil/onnx2c
This project is maintained by emmtrix.
