
# SMOCLIP

Contrastive Language-Image Pretraining (CLIP) learns representations by maximizing the mutual information between the textual and visual modalities. However, the standard contrastive objective pushes the network toward whatever direction most enlarges the gap between correct and incorrect pairings, which leads to overfitting when the training data are too scarce to characterize the full range of sample features. We therefore propose SMOCLIP, which trains CLIP with a label-smoothed contrastive loss. Label smoothing is a regularization strategy that injects noise through soft one-hot targets: it reduces the weight of the true class in the loss computation and thereby suppresses overfitting. Trained on existing datasets such as CC3M and CC12M, our method enhances the generalization of CLIP and improves zero-shot classification accuracy over the baseline at an equal computational budget. We also show that ensembling several models can further enhance comprehension ability.
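
To make the idea concrete, below is a minimal PyTorch sketch of a CLIP-style contrastive loss with label smoothing. The function and parameter names are ours for illustration and are not necessarily this repository's exact implementation; note that PyTorch's `F.cross_entropy` supports a `label_smoothing` argument directly.

```python
import torch
import torch.nn.functional as F

def smoothed_clip_loss(image_features, text_features, logit_scale, smoothing=0.1):
    """CLIP contrastive loss with label smoothing (illustrative sketch).

    Instead of a hard one-hot target for each matched image-text pair,
    each row's target puts (1 - smoothing) on the true pair and spreads
    `smoothing` uniformly over the other pairs in the batch.
    """
    # Cosine-similarity logits between all image/text pairs in the batch
    image_features = F.normalize(image_features, dim=-1)
    text_features = F.normalize(text_features, dim=-1)
    logits_per_image = logit_scale * image_features @ text_features.t()
    logits_per_text = logits_per_image.t()

    # The i-th image matches the i-th text: the targets are the diagonal
    labels = torch.arange(logits_per_image.shape[0], device=logits_per_image.device)

    # label_smoothing replaces the hard one-hot with a soft one-hot target
    loss_i = F.cross_entropy(logits_per_image, labels, label_smoothing=smoothing)
    loss_t = F.cross_entropy(logits_per_text, labels, label_smoothing=smoothing)
    return (loss_i + loss_t) / 2
```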

| Model    | Training data | ImageNet zero-shot acc. |
|----------|---------------|-------------------------|
| ViT-B/32 | CC3M          | 40.91%                  |
| ResNet50 | CC3M          | 32.55%                  |
| ViT-B/32 | CC12M         | 45.15%                  |
| ResNet50 | CC12M         | 49.91%                  |
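
The zero-shot numbers above follow the standard CLIP evaluation recipe: embed one text prompt per class, embed the image, and predict the most similar class. A minimal sketch using the OpenCLIP API (the pretrained tag, class names, and image path are illustrative):

```python
import torch
from PIL import Image
import open_clip

# Illustrative: load a pretrained ViT-B/32 (swap in your own checkpoint)
model, _, preprocess = open_clip.create_model_and_transforms(
    'ViT-B-32', pretrained='laion2b_s34b_b79k')
tokenizer = open_clip.get_tokenizer('ViT-B-32')
model.eval()

classnames = ['dog', 'cat', 'car']  # hypothetical label set
text = tokenizer([f'a photo of a {c}' for c in classnames])
image = preprocess(Image.open('example.jpg')).unsqueeze(0)

with torch.no_grad():
    image_features = model.encode_image(image)
    text_features = model.encode_text(text)
    image_features /= image_features.norm(dim=-1, keepdim=True)
    text_features /= text_features.norm(dim=-1, keepdim=True)
    # Similarity of the image to each class prompt, as probabilities
    probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)

print(classnames[probs.argmax().item()])
```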

## Approach

*(Figure: overview of the SMOCLIP approach.)*

## Data

For downloading and preparing the training datasets (e.g. CC3M and CC12M), refer to https://github.com/mlfoundations/open_clip

## Usage

Create a virtual environment:

```bash
python3 -m venv .env
source .env/bin/activate
pip install -U pip
```

Install OpenCLIP (the PyPI package is `open_clip_torch`, not `openclip`) with the training extras:

```bash
pip install 'open_clip_torch[training]'
```

### Train and evaluation

```bash
cd open_clip/src
# Distributed training on 4 GPUs over CC12M webdataset shards;
# --imagenet-val enables zero-shot ImageNet evaluation during training.
torchrun --nproc_per_node 4 -m open_clip_train.main \
    --train-data '/data/cc12m/cc12m-train-{0000..2175}.tar' \
    --train-num-samples 10968539 \
    --dataset-type webdataset \
    --batch-size 320 \
    --precision amp \
    --workers 4 \
    --imagenet-val /data/imagenet/validation/
```
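
After training, OpenCLIP writes checkpoints under its logs directory; a trained checkpoint can then be loaded for evaluation by passing its path as `pretrained`. A minimal sketch (the run name and epoch number in the path are hypothetical and depend on your run):

```python
import open_clip

# Hypothetical path: open_clip saves checkpoints as
# logs/<run-name>/checkpoints/epoch_<N>.pt
model, _, preprocess = open_clip.create_model_and_transforms(
    'ViT-B-32', pretrained='logs/smoclip-vit-b-32/checkpoints/epoch_32.pt')
tokenizer = open_clip.get_tokenizer('ViT-B-32')
```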
