[feat] Introduce cross-modality and multi-modality support; modularize CrossEncoder class
#3554
+10,266
−8,061