Skip to content

Add Mega-Sam image#89

Merged
wbrown merged 14 commits intomainfrom
rwang.megasam
Apr 11, 2025
Merged

Add Mega-Sam image#89
wbrown merged 14 commits intomainfrom
rwang.megasam

Conversation

@Rexwang8
Copy link
Contributor

Summary

Add the Mega-Sam image built with an update torch-extras image (torch-extras:es-actions-68fbfd1-nccl-cuda12.4.1-ubuntu22.04-nccl2.25.1-1-torch2.6.0-vision0.21.0-audio2.6.0-abi0) that updates the old Cuda 11.8 example to ml-containers. Image Page.

This image should import and install all dependencies needed up to and including the setup step. We exclude the use of conda as shown in the quickstart in the base repo, as well as the installation of transformers. Instead, follow the To Use section to get to the setup.

To Use

You will want to srun the image with a command like srun --container-image="ghcr.io#coreweave/ml-containers/cw-mega-sam:rwang.megasam-366a38a" --pty /bin/bash -i. After that, navigate to /work/mega-sam/base and run python setup.py install.

Patch

A cuda function .type() was depreciated and had to be replaced with .type().scalarType() for the build to function properly. This is automatically included in the .patch file, but you may need to manually run this fix if you make changes to the code.

@Rexwang8 Rexwang8 requested a review from wbrown March 10, 2025 21:44
@Rexwang8 Rexwang8 self-assigned this Mar 10, 2025
@Rexwang8
Copy link
Contributor Author

Let me know if you want me to squash or otherwise combine these commits. A lot of them were very small changes to get the CI/CD to work.

@wbrown wbrown merged commit 85e3f72 into main Apr 11, 2025
1 check passed
@wbrown wbrown deleted the rwang.megasam branch April 11, 2025 01:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants