
🚀 Accelerator Comparison: CUDA vs TPU

Legacy to Modern Migration Demo


Accelerator Comparison is a live, interactive CLI demonstration that compares a legacy NVIDIA A100 GPU workload against a modern Google TPU v5e implementation.

It simulates a real-world Research Computing workflow:

  1. Legacy Code: Taking an existing PyTorch/CUDA training loop (ResNet-50).
  2. AI Refactoring: Automatically converting it to a hardware-agnostic JAX/Flax model (see the sketch after this list).
  3. The Comparison: Provisioning both accelerators simultaneously and running them to convergence.
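
To make step 2 concrete, here is a minimal sketch of the kind of transformation involved, using a toy linear model rather than the repo's actual ResNet-50 scripts (train_legacy.py / train_jax.py). The PyTorch step is eager and CUDA-bound; the JAX version is a pure function that XLA compiles for whatever hardware is available.

# Legacy (GPU-bound): eager PyTorch training step
import torch

model = torch.nn.Linear(10, 1).cuda()
opt = torch.optim.SGD(model.parameters(), lr=1e-3)

def torch_step(x, y):
    opt.zero_grad()
    loss = torch.nn.functional.mse_loss(model(x), y)
    loss.backward()   # mutates .grad buffers in place
    opt.step()
    return loss.item()

# Modern (hardware-agnostic): the JAX equivalent, JIT-compiled by XLA
import jax
import jax.numpy as jnp

params = {"w": jnp.zeros((10, 1)), "b": jnp.zeros((1,))}

def loss_fn(params, x, y):
    pred = x @ params["w"] + params["b"]
    return jnp.mean((pred - y) ** 2)

@jax.jit  # traced once, then runs as a single fused XLA program
def jax_step(params, x, y, lr=1e-3):
    loss, grads = jax.value_and_grad(loss_fn)(params, x, y)
    params = jax.tree_util.tree_map(lambda p, g: p - lr * g, params, grads)
    return params, loss

Note the JAX version has no .cuda() calls and no in-place mutation, which is what lets the same script target a TPU, GPU, or CPU unchanged.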

⚡ Quick Start

You can install and run this tool on your local machine (Linux/macOS) or in Google Cloud Shell.

Prerequisites

  • Google Cloud SDK (gcloud) installed and authenticated.
  • Python 3.10+

Installing uv

If you don't have uv installed:

curl -LsSf https://astral.sh/uv/install.sh | sh

Installation

uv tool install git+https://github.com/ucr-research-computing/demo-cuda-to-tpu

Run the Demo

demo-cuda-to-tpu

🔄 Updating

To get the latest version of the demo:

uv tool upgrade demo-cuda-to-tpu

🏗️ Architecture

The tool acts as an Orchestrator. It does not run the heavy compute locally. Instead, it spins up ephemeral cloud resources to perform the work.

graph TD
    User[💻 CLI / Notebook] -->|Orchestrates| GCP[Google Cloud Platform]
    
    subgraph GCP
        direction TB
        A100[🔥 NVIDIA A100 VM]
        TPU[⚡ Google TPU v5e VM]
    end
    
    User -->|SCP: train_legacy.py| A100
    User -->|SCP: train_jax.py| TPU
    
    A100 -->|Stream Logs| User
    TPU -->|Stream Logs| User
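Conceptually, the orchestration reduces to a handful of gcloud calls. A minimal sketch, assuming hypothetical VM names and a fixed zone (the real tool also provisions the VMs, runs both sides in parallel, and streams logs live):

import subprocess

def sh(*args):
    # Run a gcloud command, raising if it fails
    subprocess.run(args, check=True)

try:
    # Ship each training script to its accelerator VM...
    sh("gcloud", "compute", "scp", "train_legacy.py", "demo-a100:~/", "--zone", "us-central1-a")
    sh("gcloud", "compute", "tpus", "tpu-vm", "scp", "train_jax.py", "demo-tpu:~/", "--zone", "us-central1-a")
    # ...then execute them over SSH, which streams stdout back to the CLI
    sh("gcloud", "compute", "ssh", "demo-a100", "--zone", "us-central1-a", "--command", "python3 train_legacy.py")
    sh("gcloud", "compute", "tpus", "tpu-vm", "ssh", "demo-tpu", "--zone", "us-central1-a", "--command", "python3 train_jax.py")
finally:
    # Tear down the ephemeral VMs (this is the step --cleanup retries)
    sh("gcloud", "compute", "instances", "delete", "demo-a100", "--zone", "us-central1-a", "--quiet")
    sh("gcloud", "compute", "tpus", "tpu-vm", "delete", "demo-tpu", "--zone", "us-central1-a", "--quiet")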

🧠 The Workload: ResNet-50

This benchmark trains a ResNet-50 architecture on synthetic ImageNet data (224x224 RGB).

| Feature      | Legacy (A100)     | Modern (TPU v5e)          |
|--------------|-------------------|---------------------------|
| Framework    | PyTorch (Eager)   | JAX / Flax (XLA Compiled) |
| Precision    | FP32 / AMP        | bfloat16 (Native)         |
| Optimization | Manual CUDA/cuDNN | XLA Compiler              |
| Throughput   | Baseline          | Optimized                 |
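
Here "synthetic ImageNet data" means random batches of the right shape, so the benchmark measures compute rather than disk or network I/O. A sketch of what the TPU side's input might look like (batch size and exact generator are assumptions, not pulled from the repo):

import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(0)
k_img, k_lbl = jax.random.split(key)
# A batch of fake 224x224 RGB images in bfloat16, the TPU's native precision
images = jax.random.normal(k_img, (32, 224, 224, 3), dtype=jnp.bfloat16)
# Random labels drawn from ImageNet's 1000 classes
labels = jax.random.randint(k_lbl, (32,), 0, 1000)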

☁️ Running in Google Colab

You can run this orchestrator from a Google Colab notebook to visualize the comparison without installing anything locally.

  1. Open a new Colab Notebook.
  2. Run the following cells to install the tool, authenticate, configure your project, and launch the demo:
# 1. Install
!pip install uv
!uv tool install git+https://github.com/ucr-research-computing/demo-cuda-to-tpu --force

# 2. Authenticate (Pop-up will appear)
from google.colab import auth
auth.authenticate_user()

# 3. Configure Project
!gcloud config set project ucr-research-computing
!gcloud config set compute/zone us-central1-a

# 4. Run the Comparison!
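# (uv installs tools to /root/.local/bin, which isn't on Colab's PATH by default)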
!/root/.local/bin/demo-cuda-to-tpu

🧹 Cleanup

The tool attempts to clean up automatically at the end of each run. If the run was interrupted or something broke, remove the leftover resources with:

demo-cuda-to-tpu --cleanup

Maintained by UCR Research Computing
