Learning Audio-Video Modalities from Image Captions

  • Arsha Nagrani*
  • Paul Hongsuck Seo
  • Bryan Seybold
  • Anja Hauth
  • Santiago Manen
  • Chen Sun
  • Cordelia Schmid

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Fingerprint

Dive into the research topics of 'Learning Audio-Video Modalities from Image Captions'. Together they form a unique fingerprint.
Keyphrases

Earth and Planetary Sciences

Social Sciences

Computer Science