Skip to content

Cuboids Engine: Turing-Complete Voxel Engine with Ternary-Logic – 9.53 GVox/s 3D Spatial Recognition & 9-Param DNA Evolution (In Prod Now, MIT Licensed)

License

Notifications You must be signed in to change notification settings

PrimalNinja/cuboids

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cuboids: A GPU-Accelerated Ternary Spatial Computing Framework

🧠 The Paradigm Shift

Stop moving data. Start evolving perception.

Cuboids demonstrates a fundamental shift in 3D spatial computing. Traditional methods physically transform and write massive 3D datasets (voxels) to memory for each new viewpoint. Cuboids redefines the problem: it evolves compact "Spatial DNA" parameters—a mere 6 numbers defining a 3D view—while the data stays put. This transforms the GPU from a number cruncher into an autonomous spatial reasoning co-processor.

⚡️ Validated Performance at Scale

The core innovation has been tested at production-relevant scale. The latest benchmarks (N=512, 134 million voxels) show the DNA paradigm achieving a consistent 18-20x speedup over an optimized traditional baseline.

Metric Traditional (Transform & Write) Cuboids (Evolve Perception) Advantage
Workload 100 rotation+score cycles on 134M voxels 100 virtual evaluation cycles Identical problem
Time ~19,514 ms ~1,030 ms 18.94x faster
Key Bottleneck Memory Bandwidth (~13.4 GB of writes) Compute / ALU Throughput Operates in a faster hardware regime
Kernel Launches 200 1 Massive overhead eliminated

Why it scales: The performance gap grows with problem size. As data volume increases, the traditional approach hits a memory bandwidth wall, while Cuboids' cost scales with compute—a fundamental advantage on modern hardware.

⚙️ How It Works: The Technical Core

  1. Ternary Logic Substrate: Voxels are int8_t with values -1, 0, 1 (Inhibit/Empty/Excite). This is a dense, efficient representation for spatial correlation, offering a 4x memory saving over float32.
  2. Spatial DNA: A 3D perspective is encoded in just 6 evolvable parameters tx, ty, tz, rx, ry, rz). The algorithm searches for the DNA that yields the optimal "view" of the data.
  3. Persistent GPU-Resident Evolution: The entire evolutionary search—hundreds of generations—runs in persistent GPU kernels. This eliminates costly CPU-GPU synchronization and kernel launch overhead.
  4. GPU Primitive Operations: Kernels use native GPU instructions (multiply, add, compare, trigonometry) to implement spatial operations. This includes fused multiply-add (FFMA), rotations (built from sin/cos), and correlation scoring (using comparison and accumulation primitives).

📚 The Tri-Sword Framework

Cuboids is a reference implementation of the tri-sword architectural framework.

The tri-sword methodology explains the fundamental principles behind Cuboids' performance gains and provides a comprehensive guide for applying these techniques to other GPU computing problems.

What is Tri-Sword?

Tri-sword is a systematic framework for achieving 10-1000x GPU performance improvements through:

  • Ternary logic optimization (int8 vs float32, zero-skipping)
  • Persistent single-kernel execution (eliminating 70ms launch overhead)
  • Branchless control flow (warp coherence, predicate logic)
  • Structural decomposition (O(N³) → O(N²) complexity reduction)
  • Memory-mapped I/O control (constant memory as control ports)

How Cuboids Implements Tri-Sword:

Tri-Sword Principle Cuboids Implementation
Ternary Mathematics int8_t voxels (-1,0,1) = 4x memory reduction
Single Kernel Persistence 1 kernel vs 200 launches = 18.94x speedup
Memory Immutability Data stays stationary, DNA evolves in registers
Primitive Operations Native instruction set
Structural Optimization Parameter space (6 params) vs transformation space (134M voxels)

Learn the Complete Framework

The tri-sword documentation (tri-sword.md in this repository) provides:

  • Complete theoretical foundation and measured performance data
  • 57 benchmark test results (0.8x failures to 1589x successes)
  • Detailed implementation patterns and code examples
  • Real-world case studies including Cuboids
  • When to apply each optimization technique

Read it to understand:

  • Why Cuboids achieves 18.94x speedup (not just "it's faster")
  • How to apply these principles to your own GPU problems
  • The Six Commandments of Silicon Sovereignty
  • Comprehensive test portfolio showing when techniques work/fail

Location: ./tri-sword/tri-sword.md

🛠️ Project State: A Proven Engine, An Open Challenge

This repository contains 79 progressively optimized CUDA files documenting the complete journey from a JavaScript concept to validated hardware performance. It is a proven prototype and an open benchmarking challenge.

  • ✅ What's Validated: The DNA paradigm works and is fast. Files 0001-0077 are fully tested. The architectural advantage is clear and scales to 134M voxels.
  • 🎯 The Open Challenge: We invite the community to determine the absolute performance ceiling. Both the Cuboids ("DNA") code and the "traditional" baseline have optimization headroom. Can expert CUDA optimization close the gap, or does the paradigm's structural advantage hold?
    • Optimize the traditional baseline with expert techniques.
    • Push the DNA implementation further.
    • Run the definitive fair race on A100, H100, and other architectures.

🛠 Getting Started

Prerequisites: A CUDA-capable GPU and toolkit.

Quick Start: Clone the repo and run the early files (e.g., 0001_*) to verify the ternary system works on your hardware.

🔍 Why This Exists

Created to solve problems in AI causality, this project uncovered an unexpected algorithmic pathway. It is released not as a finished solution, but as an invitation for the community to rigorously test a new spatial computing idea.

📄 More Information

visit https://cyborgunicorn.com.au/cuboids

📄 License

MIT Licence

About

Cuboids Engine: Turing-Complete Voxel Engine with Ternary-Logic – 9.53 GVox/s 3D Spatial Recognition & 9-Param DNA Evolution (In Prod Now, MIT Licensed)

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published