OV-SCAN: Open-Vocabulary 3D Object Detection

Non-official implementation of the SC-NOD 3D bounding box optimization from OV-SCAN (ICCV 2025).

OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection Adrian Chow et al. | Paper

SAM3 Masks + NuScenes LiDAR → Point-in-Mask → DBSCAN → 3D BBox Optimization → 3D NMS → Submission JSON

Results

Our implementation: BBOX seeker only

Dataset: Nuscenes v1.0-mini

Method	mAP	NDS	car	ped	cone	Speed
OV-SCAN SC-NOD	24.40%	24.70%	28.1%	45.4%	46.4%	~27s/sample
OpenSight	21.53%	23.47%	21.9%	41.0%	55.5%	~1.3s/sample

Pre-computed submission files (.json) are included in results/submissions/.

Paper results: BBOX seeker + Train

Dataset: Nuscenes v1.0-trainval

Method	mAP	NDS
OV-SCAN	31.1%	32.8%
OpenSight	22.9%	23.47%

Setup

# 1. Clone
git clone https://github.com/nautel/OVSCAN.git && cd OVSCAN

# 2. Install dependencies
pip install numpy scipy scikit-learn tqdm numba shapely pyquaternion

# 3. Download NuScenes v1.0-mini (LiDAR only)
#    From: https://www.nuscenes.org/nuscenes
#    Extract so that data/nuscenes/samples/LIDAR_TOP/*.bin exists
#    Info PKL files and SAM3 masks are already included in this repo.

# 4. (Optional) For evaluation
pip install nuscenes-devkit

Quick Start

# SC-NOD PSO optimizer (best accuracy)
python -m Implement_OVSCAN --split train --optimizer pso --verbose

# UltraFast geometric optimizer (fast)
python -m Implement_OVSCAN --split train --optimizer fast

# Process subset
python -m Implement_OVSCAN --split train --start_idx 0 --end_idx 10 --verbose

# Evaluate
python -m Implement_OVSCAN.evaluate \
    --result_path results/submissions/scnod_pso_mAP24.40_NDS24.70.json \
    --version v1.0-mini --eval_set mini_train --verbose

Custom paths: --data_root /path/to/nuscenes --sam3_root /path/to/masks --output_dir /path/to/output

Package Structure

├── Implement_OVSCAN/
│   ├── config.py            # Paths, anchors, thresholds, PSO hyperparameters
│   ├── cost_functions.py    # SC-NOD cost: J_density, J_lshape, J_surface, J_2d
│   ├── pso_optimizer.py     # AdaptivePSOOptimizer (cosine annealing, multi-anchor)
│   ├── fast_optimizer.py    # UltraFastOptimizer (ConvexHull + grid search)
│   ├── data_loader.py       # NuScenes + SAM3 mask loading (.npy/.npz)
│   ├── mask_processor.py    # LiDAR-to-camera projection + object extraction
│   ├── point_clustering.py  # DBSCAN depth clustering
│   ├── nms_3d.py            # 3D NMS (Shapely BEV IoU)
│   ├── output_formatter.py  # NuScenes submission JSON
│   ├── run.py               # CLI entry point
│   └── evaluate.py          # NuScenes evaluation wrapper
├── data/
│   ├── nuscenes/            # Info PKLs (included) + LiDAR bins (download)
│   └── sam3_masks/          # Compressed masks (included, 26MB)
└── results/                 # Pre-computed submission + benchmark

Citation

@inproceedings{chow2025ovscan,
    title={OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection},
    author={Chow, Adrian},
    booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    year={2025}
}

License

MIT License

Disclaimer: This is a non-official implementation. For the official version, see ahtchow/OV-SCAN.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OV-SCAN: Open-Vocabulary 3D Object Detection

Results

Our implementation: BBOX seeker only

Paper results: BBOX seeker + Train

Setup

Quick Start

Package Structure

Citation

License

About

Uh oh!

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
results		results
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
__main__.py		__main__.py
config.py		config.py
cost_functions.py		cost_functions.py
data_loader.py		data_loader.py
evaluate.py		evaluate.py
fast_optimizer.py		fast_optimizer.py
mask_processor.py		mask_processor.py
nms_3d.py		nms_3d.py
output_formatter.py		output_formatter.py
point_clustering.py		point_clustering.py
pso_optimizer.py		pso_optimizer.py
requirements.txt		requirements.txt
run.py		run.py

License

nautel/OVSCAN

Folders and files

Latest commit

History

Repository files navigation

OV-SCAN: Open-Vocabulary 3D Object Detection

Results

Our implementation: BBOX seeker only

Paper results: BBOX seeker + Train

Setup

Quick Start

Package Structure

Citation

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors 2

Uh oh!

Languages