Thanks for your wonderful work. I am curious how Fig. 4a in the paper was drawn.

```python
# rgb_cams_ = (rgb_cams_ - self.mean.to(device)) / self.std.to(device)  # B*S,3,H,W
# feat_cams_ = self.encoder(rgb_cams_)  # B*S,latent_dim,H/8,W/8
feat_cams_ = rgb_cams_  # bypass the encoder so raw RGB gets projected instead of features
```
I simply set feat_cams_ to rgb_cams_ in the MVDet code, as above. With that I can get a reasonable single-view projection visualization (sketch below), but I cannot get a reasonable multi-view fusion result (tried on the MultiviewX dataset).
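For the single-view check, I display one slice of the projected tensor, roughly like this (a minimal sketch; it assumes feat_mems has shape B,S,3,H,W after projection and that rgb_cams_ was left in the 0-255 range):

```python
import matplotlib.pyplot as plt

single = feat_mems[0, 0].permute(1, 2, 0).cpu().numpy()  # 3,160,250 -> 160,250,3
plt.imshow(single.astype('uint8'))  # assumes 0-255 values; drop the cast if already in 0-1
plt.axis('off')
plt.show()
```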
For the multi-view fusion I average the projected views:

```python
pro_img_all = feat_mems[0].permute(0, 2, 3, 1).cpu().numpy()  # 6,3,160,250 -> 6,160,250,3
pro_img_all = np.mean(pro_img_all, axis=0)  # average over the 6 views -> 160,250,3
```
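My guess is that the plain mean also averages in the zero-filled pixels outside each camera's footprint, which washes out the fused image. Is something like a masked average what Fig. 4a uses? A minimal sketch, assuming out-of-view pixels in the projected tensor are exactly zero (masked_mean is just an illustrative helper name):

```python
import numpy as np

def masked_mean(views, eps=1e-6):
    # views: (S, H, W, 3) projected RGB per view, zero where the view has no coverage
    valid = np.abs(views).sum(axis=-1) > eps         # S,H,W — True where a view covers the cell
    count = valid.sum(axis=0)[..., None]             # H,W,1 — number of views covering each cell
    return views.sum(axis=0) / np.maximum(count, 1)  # H,W,3 — avoid division by zero

fused = masked_mean(feat_mems[0].permute(0, 2, 3, 1).cpu().numpy())
```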
Can you provide ideas for making this figure? I think this is a good way to verify whether the mapping process is correct.