Learning Audio-Video Modalities from Image Captions
- Arsha Nagrani*
- Paul Hongsuck Seo
- Bryan Seybold
- Anja Hauth
- Santiago Manen
- Chen Sun
- Cordelia Schmid
*Corresponding author for this work
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
43 citations (Scopus)