Stop moving data. Start evolving perception.
Cuboids demonstrates a fundamental shift in 3D spatial computing. Traditional methods physically transform and write massive 3D datasets (voxels) to memory for each new viewpoint. Cuboids redefines the problem: it evolves compact "Spatial DNA" parameters—a mere 6 numbers defining a 3D view—while the data stays put. This transforms the GPU from a number cruncher into an autonomous spatial reasoning co-processor.
The core innovation has been tested at production-relevant scale. The latest benchmarks (N=512, 134 million voxels) show the DNA paradigm achieving a consistent 18-20x speedup over an optimized traditional baseline.
| Metric | Traditional (Transform & Write) | Cuboids (Evolve Perception) | Advantage |
|---|---|---|---|
| Workload | 100 rotation+score cycles on 134M voxels | 100 virtual evaluation cycles | Identical problem |
| Time | ~19,514 ms | ~1,030 ms | 18.94x faster |
| Key Bottleneck | Memory Bandwidth (~13.4 GB of writes) | Compute / ALU Throughput | Operates in a faster hardware regime |
| Kernel Launches | 200 | 1 | Massive overhead eliminated |
Why it scales: The performance gap grows with problem size. As data volume increases, the traditional approach hits a memory bandwidth wall, while Cuboids' cost scales with compute—a fundamental advantage on modern hardware.
- Ternary Logic Substrate: Voxels are `int8_t` with values `-1, 0, 1` (Inhibit/Empty/Excite). This is a dense, efficient representation for spatial correlation, offering a 4x memory saving over `float32`.
- Spatial DNA: A 3D perspective is encoded in just 6 evolvable parameters (`tx, ty, tz, rx, ry, rz`). The algorithm searches for the DNA that yields the optimal "view" of the data.
- Persistent GPU-Resident Evolution: The entire evolutionary search (hundreds of generations) runs in persistent GPU kernels. This eliminates costly CPU-GPU synchronization and kernel launch overhead.
- GPU Primitive Operations: Kernels use native GPU instructions (multiply, add, compare, trigonometry) to implement spatial operations. This includes fused multiply-add (`FFMA`), rotations (built from `sin`/`cos`), and correlation scoring (using comparison and accumulation primitives).
Cuboids is a reference implementation of the tri-sword architectural framework.
The tri-sword methodology explains the fundamental principles behind Cuboids' performance gains and provides a comprehensive guide for applying these techniques to other GPU computing problems.
Tri-sword is a systematic framework for achieving 10-1000x GPU performance improvements through:
- Ternary logic optimization (int8 vs float32, zero-skipping)
- Persistent single-kernel execution (eliminating 70ms launch overhead)
- Branchless control flow (warp coherence, predicate logic)
- Structural decomposition (O(N³) → O(N²) complexity reduction)
- Memory-mapped I/O control (constant memory as control ports)
| Tri-Sword Principle | Cuboids Implementation |
|---|---|
| Ternary Mathematics | int8_t voxels (-1,0,1) = 4x memory reduction |
| Single Kernel Persistence | 1 kernel vs 200 launches = 18.94x speedup |
| Memory Immutability | Data stays stationary, DNA evolves in registers |
| Primitive Operations | Native instruction set |
| Structural Optimization | Parameter space (6 params) vs transformation space (134M voxels) |
The tri-sword documentation (tri-sword.md in this repository) provides:
- Complete theoretical foundation and measured performance data
- 57 benchmark test results (0.8x failures to 1589x successes)
- Detailed implementation patterns and code examples
- Real-world case studies including Cuboids
- When to apply each optimization technique
Read it to understand:
- Why Cuboids achieves 18.94x speedup (not just "it's faster")
- How to apply these principles to your own GPU problems
- The Six Commandments of Silicon Sovereignty
- Comprehensive test portfolio showing when techniques work/fail
Location: ./tri-sword/tri-sword.md
This repository contains 79 progressively optimized CUDA files documenting the complete journey from a JavaScript concept to validated hardware performance. It is a proven prototype and an open benchmarking challenge.
- ✅ What's Validated: The DNA paradigm works and is fast. Files 0001-0077 are fully tested. The architectural advantage is clear and scales to 134M voxels.
- 🎯 The Open Challenge: We invite the community to determine the absolute performance ceiling. Both the Cuboids ("DNA") code and the "traditional" baseline have optimization headroom. Can expert CUDA optimization close the gap, or does the paradigm's structural advantage hold?
- Optimize the traditional baseline with expert techniques.
- Push the DNA implementation further.
- Run the definitive fair race on A100, H100, and other architectures.
Prerequisites: A CUDA-capable GPU and toolkit.
Quick Start: Clone the repo and run the early files (e.g., 0001_*) to verify the ternary system works on your hardware.
Created to solve problems in AI causality, this project uncovered an unexpected algorithmic pathway. It is released not as a finished solution, but as an invitation for the community to rigorously test a new spatial computing idea.
For more information, visit https://cyborgunicorn.com.au/cuboids
MIT Licence