this is a very simple bench, inspired by Andrew Kelley's talk on data-oriented programming.
i started with a simple cache-alignment benchmark, which i subsequently deleted. you can see the remnants of that test in metadata_layouts.rs
i wanted to look into this because i noticed const META_SIZE: usize = 73 in bench_hashing.rs in the agave repo.
i tracked it down to accounts_db.rs, lines 4829-4862:
const META_SIZE: usize = 8 /* lamports */ + 1 /* executable */ + 32 /* owner */ + 32 /* pubkey */;
with this layout, pubkey occupies bytes 41..73, so it has to cross from cache line 1 into cache line 2: the whole record is 73 bytes, i.e. 64 bytes + 9 bytes.
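to make the boundary crossing concrete, here's a minimal sketch (field names come from the META_SIZE comments, not the actual agave structs) that just computes the packed offsets:

```rust
// Packed layout implied by META_SIZE: 8 + 1 + 32 + 32 = 73 bytes.
// Offsets: lamports 0..8, executable 8..9, owner 9..41, pubkey 41..73.
fn main() {
    let (lamports, executable, owner, pubkey) = (8usize, 1usize, 32usize, 32usize);
    let pubkey_start = lamports + executable + owner; // 41
    let pubkey_end = pubkey_start + pubkey;           // 73
    let cache_line = 64;
    // pubkey starts inside the first cache line but ends in the second:
    assert!(pubkey_start < cache_line && pubkey_end > cache_line);
    println!("pubkey occupies bytes {pubkey_start}..{pubkey_end}, crossing the {cache_line}-byte boundary");
}
```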
i tried rearranging the fields and padding the structs to see whether optimizing for cache-line alignment would be faster even at the cost of extra memory. it wasn't: no measurable difference at all.
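for reference, the rearrangement was roughly this kind of thing (a sketch, not the exact structs from metadata_layouts.rs): forcing 64-byte alignment pads each record from 73 to 128 bytes, so no field ever straddles a cache-line boundary, at the price of nearly doubling the memory per account.

```rust
use std::mem::{align_of, size_of};

// Hypothetical padded layout: cache-line aligned, so pubkey and owner
// each sit entirely inside one 64-byte line, but the struct is rounded
// up from 73 bytes of payload to 128 bytes of storage.
#[repr(C, align(64))]
struct PaddedMeta {
    pubkey: [u8; 32],  // bytes 0..32  (fully inside cache line 0)
    owner: [u8; 32],   // bytes 32..64 (fills out cache line 0)
    lamports: u64,     // bytes 64..72 (starts cleanly on cache line 1)
    executable: bool,  // byte 72
}

fn main() {
    assert_eq!(align_of::<PaddedMeta>(), 64);
    assert_eq!(size_of::<PaddedMeta>(), 128); // 73 bytes rounded up to alignment
    println!(
        "PaddedMeta: {} bytes, {}-byte aligned",
        size_of::<PaddedMeta>(),
        align_of::<PaddedMeta>()
    );
}
```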
the actual test i ended up with redoes the blake3 hash writing to take a Struct of Arrays (SoA) instead of an Array of Structs (AoS), to see whether that improves cache locality and speeds things up.
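the two layouts look roughly like this (a self-contained sketch: the bench hashes with blake3, but std's DefaultHasher stands in here so the example needs no external crates, and the type names are mine, not the bench's):

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::Hasher;

// AoS: one struct per account, fields interleaved in memory.
struct Account {
    lamports: u64,
    executable: bool,
    owner: [u8; 32],
    pubkey: [u8; 32],
}

// SoA: one Vec per field, each field stored contiguously.
#[derive(Default)]
struct Accounts {
    lamports: Vec<u64>,
    executable: Vec<bool>,
    owner: Vec<[u8; 32]>,
    pubkey: Vec<[u8; 32]>,
}

// Hash every field of every account, AoS-style.
fn hash_aos(accounts: &[Account]) -> u64 {
    let mut h = DefaultHasher::new();
    for a in accounts {
        h.write_u64(a.lamports);
        h.write_u8(a.executable as u8);
        h.write(&a.owner);
        h.write(&a.pubkey);
    }
    h.finish()
}

// Same bytes in the same order, read out of the SoA layout.
fn hash_soa(accounts: &Accounts) -> u64 {
    let mut h = DefaultHasher::new();
    for i in 0..accounts.lamports.len() {
        h.write_u64(accounts.lamports[i]);
        h.write_u8(accounts.executable[i] as u8);
        h.write(&accounts.owner[i]);
        h.write(&accounts.pubkey[i]);
    }
    h.finish()
}

fn main() {
    let aos: Vec<Account> = (0..100u64)
        .map(|i| Account { lamports: i, executable: false, owner: [0; 32], pubkey: [i as u8; 32] })
        .collect();
    let mut soa = Accounts::default();
    for a in &aos {
        soa.lamports.push(a.lamports);
        soa.executable.push(a.executable);
        soa.owner.push(a.owner);
        soa.pubkey.push(a.pubkey);
    }
    // Either layout feeds the hasher identical bytes, so the digests match.
    assert_eq!(hash_aos(&aos), hash_soa(&soa));
    println!("digests match");
}
```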
the three tests, each a simple A/B comparison of SoA vs AoS:
- batch hashing 10,000 accounts
- add up the lamports of 10,000 accounts (i.e. reading a single field)
- creating 10,000 accounts
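the single-field read (the lamports sum) can be sketched like this (type names are mine): with SoA the sum walks one dense Vec<u64>, eight lamports per cache line; with AoS it strides through full 73-byte-plus records to pull 8 bytes out of each.

```rust
// AoS: lamports are interleaved with ~65 bytes of other fields per account.
struct Account {
    lamports: u64,
    executable: bool,
    owner: [u8; 32],
    pubkey: [u8; 32],
}

fn sum_aos(accounts: &[Account]) -> u64 {
    accounts.iter().map(|a| a.lamports).sum()
}

// SoA: lamports are packed contiguously, so every loaded cache line is
// nothing but lamports.
fn sum_soa(lamports: &[u64]) -> u64 {
    lamports.iter().sum()
}

fn main() {
    let aos: Vec<Account> = (1..=10_000u64)
        .map(|i| Account { lamports: i, executable: false, owner: [0; 32], pubkey: [0; 32] })
        .collect();
    let soa_lamports: Vec<u64> = aos.iter().map(|a| a.lamports).collect();
    assert_eq!(sum_aos(&aos), sum_soa(&soa_lamports));
    assert_eq!(sum_soa(&soa_lamports), 50_005_000); // 1 + 2 + ... + 10_000
    println!("sum = {}", sum_soa(&soa_lamports));
}
```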
the results are reliably as follows:
- batch hashing: barely any difference (both layouts have to read every field of every account in its entirety anyway)
- summing lamports: SoA wins 3-4x, thanks to sequential memory access. it's cheaper to traverse one dense vec of lamports (SoA) than to stride through every full account struct (AoS) just to pull one field out of each. this test adds up the lamports of all 10,000 accounts and gets to ignore every other field.
- creating accounts: AoS wins ~1.5-2x. creating an account is one push into a single vec, versus SoA doing four separate pushes (lamports, executable, owner, pubkey) for each account.
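the creation asymmetry boils down to push counts (again a sketch with hypothetical type names, not the bench's actual code): AoS appends one contiguous record, while SoA touches four separate Vecs, each with its own length/capacity check and occasional reallocation.

```rust
struct Account {
    lamports: u64,
    executable: bool,
    owner: [u8; 32],
    pubkey: [u8; 32],
}

#[derive(Default)]
struct Accounts {
    lamports: Vec<u64>,
    executable: Vec<bool>,
    owner: Vec<[u8; 32]>,
    pubkey: Vec<[u8; 32]>,
}

fn main() {
    // AoS: one push per account.
    let mut aos: Vec<Account> = Vec::new();
    aos.push(Account { lamports: 1, executable: false, owner: [0; 32], pubkey: [0; 32] });

    // SoA: four pushes per account, one per field Vec.
    let mut soa = Accounts::default();
    soa.lamports.push(1);
    soa.executable.push(false);
    soa.owner.push([0; 32]);
    soa.pubkey.push([0; 32]);

    assert_eq!(aos.len(), 1);
    assert_eq!(soa.lamports.len(), 1);
    println!("aos: 1 push per account, soa: 4 pushes per account");
}
```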