Confusion about different EVA architectures #175

@t0278611

I read your EVA-CLIP paper and I am a bit confused about the timeline. It seems that you trained EVA-CLIP and EVA at the same time, and that EVA-02 is a successor. The EVA-CLIP paper mentions EVA-02-CLIP but says nothing about which architecture it uses. Looking at the code, EVA-02-CLIP does appear to use the EVA-02 architecture, so I am unsure how the models are related.

In particular, I would like to know which architecture you use for the gigantic 18B version. From your code and your paper, it seems you used the vanilla ViT architecture. However, I see configs in your code for an EVA02-style 16B version, although I can't find anything about it in your paper. Are those experiments still ongoing?
