CUDA Vanity Nostr npub Generator

A high-performance CUDA-accelerated vanity address generator for Nostr npub addresses. Searches for Nostr keypairs whose npub (bech32-encoded public key) starts with a specified pattern.

Performance

Benchmarked performance on real hardware (pattern: "cuda"):

GPU	Architecture	Compute Capability	Keys/Second	vs rana CPU
NVIDIA GB10	Blackwell	12.1	4.2M keys/s	3.1x faster ✅
RTX 2060	Turing	7.5	2.1M keys/s	1.6x faster ✅
GTX 1070	Pascal	6.1	532K keys/s	2.5x slower

CPU Comparison:

rana on Intel i9-14900K (24 cores, 32 threads): ~1.35M keys/s

Features

✅ Correct secp256k1 implementation - Generates valid Nostr keypairs
✅ Optimized Jacobian coordinates - No expensive modular inversions during computation
✅ Windowed scalar multiplication - 4-bit windows with precomputed point tables
✅ Architecture-specific tuning - Optimized kernel configuration per GPU generation
✅ Early pattern rejection - Fast rejection before full bech32 encoding
✅ Fast pattern matching - Validates bech32 character set (rejects b, i, o, 1)
✅ Real-time progress - Shows generation speed and keys generated
✅ Efficient batch processing - Quick termination after finding matches
✅ Verified compatibility - Keys tested with nak CLI tool

Installation

Prerequisites

NVIDIA GPU with CUDA support
CUDA Toolkit (tested with 12.9)
Make

Build on Linux/macOS

# Default build (compute capability 7.5 - RTX 20-series and newer)
make

# Build for specific GPU
make COMPUTE_CAP=89 CUDA_HOME=/usr/local/cuda-12.0

# Auto-detect GPU settings
make detect

# See all build options
make help

Build on Windows

Option 1: Visual Studio (Recommended)

# Simple build (auto-detects GPU architecture)
nvcc -o vanity_npub.exe vanity_npub.cu

# Or specify compute capability manually
nvcc -gencode=arch=compute_89,code=sm_89 -o vanity_npub.exe vanity_npub.cu

Option 2: WSL (Windows Subsystem for Linux)

Install WSL2 with CUDA support and follow the Linux build instructions above.

Finding your GPU's compute capability:

# Run in Command Prompt or PowerShell
nvidia-smi --query-gpu=name,compute_cap --format=csv

Usage

Linux/macOS

# Generate vanity npub
./vanity_npub <pattern>

# Example: Find npub starting with "npub1test"
./vanity_npub test

# Example: Find npub starting with "npub1cuda"
./vanity_npub cuda

Windows

# Generate vanity npub
vanity_npub.exe <pattern>

# Example: Find npub starting with "npub1test"
vanity_npub.exe test

# Example: Find npub starting with "npub1cuda"
vanity_npub.exe cuda

Example Output

Searching for npub starting with: npub1cuda
Initializing precomputed point table...
Precomputed table initialization complete.
[3s] Generated: 10245632 keys | Speed: 3415210.67 keys/s
Total: 10245632 keys in 3.00s (avg 3415210.67 keys/s)

Found matching npub!

Private key (hex): 16d8087ec4f5fc3d1e6e379f19058777f8578c7db8f1b9212a38a10f17b748e7
Public key (hex):  5e60b2b4dbe489b199dff9cb03e992847ada4e6b4b52478dfbd129f3c27b3c68

Encode Keys

# Encode with nak CLI
echo "<privkey_hex>" | nak encode nsec
echo "<pubkey_hex>" | nak encode npub

Pattern Constraints

Valid characters: a-z, 0-9 (excluding b, i, o, 1)
Case: Lowercase only (bech32 is case-insensitive but npub uses lowercase)
Position: Currently only supports prefix matching (after "npub1")

Difficulty Estimates

Each additional character increases difficulty by ~32x:

Pattern Length	Estimated Keys	GB10 @ 4.2M	RTX 2060 @ 2.1M	GTX 1070 @ 532K
4 chars	~1M	0.24s	0.48s	1.9s
5 chars	~33M	7.9s	15.7s	62s
6 chars	~1B	4.2 min	8.4 min	31 min
7 chars	~34B	2.2 hours	4.5 hours	18 hours
8 chars	~1T	2.8 days	5.5 days	22 days

Testing

# Run automated test with nak verification
make test

This will:

Build the project
Generate a vanity npub with pattern "q"
Verify the keypair using nak
Confirm the npub matches the pattern

Benchmarking

Run performance benchmarks:

./benchmark.sh <pattern>

# Example
./benchmark.sh q

This will run 5 iterations and show min/max/avg keys/second.

Technical Details

Algorithm

Key Generation: Uses CUDA curand to generate random 256-bit private keys
Point Multiplication: Windowed scalar multiplication (4-bit windows) on secp256k1 using Jacobian coordinates
Precomputed Tables: Uses constant memory table of [1G, 2G, ..., 15G] for fast lookups
Optimization: Only one modular inversion per key (at final affine conversion)
Early Rejection: Quick pattern check before full bech32 encoding
Encoding: Converts public key to bech32 npub format
Matching: Compares against target pattern, rejects invalid bech32 characters

Key Files

vanity_npub.cu - Main CUDA kernel and host code
secp256k1_jacobian.cuh - Optimized secp256k1 curve operations
ptx.cuh - PTX assembly utilities for low-level operations
Makefile - Build configuration

Kernel Configuration

The kernel automatically detects your GPU architecture and optimizes configuration:

Architecture	Compute Cap	Blocks	Threads	Keys/Thread	Total Keys/Batch
Blackwell	sm_120+	2048	256	512	268M
Ada	sm_89	1536	256	768	302M
Ampere	sm_86	1280	256	896	293M
Turing	sm_75	1024	256	1024	268M
Pascal	sm_61	1024	256	1024	268M

Register usage: 111 registers/thread (GTX 1070), 128 registers/thread (GB10)

Known Limitations

Prefix only: No suffix or middle pattern matching yet

Future Improvements

Multi-GPU support
Batch mode (generate multiple vanity addresses)
Suffix and middle pattern matching
Case-insensitive matching
Difficulty estimation and ETA display
Further register optimization for even better occupancy

License

MIT

Acknowledgments

secp256k1: Bitcoin's elliptic curve
Nostr: Decentralized social protocol
rana: CPU-based vanity address generator for performance comparison
nak: Nostr Army Knife CLI tool for verification

Contributing

Contributions welcome! Areas of interest:

Performance optimizations
Multi-GPU support
Additional pattern matching modes
Better occupancy through register optimization

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github/workflows		.github/workflows
contrib		contrib
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
benchmark.sh		benchmark.sh
ptx.cuh		ptx.cuh
secp256k1_jacobian.cuh		secp256k1_jacobian.cuh
vanity_npub.cu		vanity_npub.cu

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CUDA Vanity Nostr npub Generator

Performance

Features

Installation

Prerequisites

Build on Linux/macOS

Build on Windows

Usage

Linux/macOS

Windows

Example Output

Encode Keys

Pattern Constraints

Difficulty Estimates

Testing

Benchmarking

Technical Details

Algorithm

Key Files

Kernel Configuration

Known Limitations

Future Improvements

License

Acknowledgments

Contributing

About

Uh oh!

Languages

v0l/cuda_vanity

Folders and files

Latest commit

History

Repository files navigation

CUDA Vanity Nostr npub Generator

Performance

Features

Installation

Prerequisites

Build on Linux/macOS

Build on Windows

Usage

Linux/macOS

Windows

Example Output

Encode Keys

Pattern Constraints

Difficulty Estimates

Testing

Benchmarking

Technical Details

Algorithm

Key Files

Kernel Configuration

Known Limitations

Future Improvements

License

Acknowledgments

Contributing

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages