MIT's New AI Bridges the Gap Between Sight and Sound - No Labels Needed! MIT researchers have developed a groundbreaking machine-learning model that learns to link audio and visual data from unlabeled video clips, much like people naturally connect the sight of a cello bow... artificial intelligence audio-visual learning computer vision machine learning multimodal AI robotics unsupervised learning