Mit.edu [artificial-intelligence]
-
18:10 Jun 11, 2024
DenseAV, developed at MIT, learns to parse and understand the meaning of language just by watching videos of people talking, with potential applications in multimedia search, language learning, and robotics.