Computer Vision Meetup: Next Generation of Video Understanding with Twelve Labs

preview_player
Показать описание
The evolution of video understanding has followed a similar trajectory to language and image understanding – with the rise of large pre-trained foundation models trained on a huge amount of data. Given the surge of multimodal research lately, video foundation models are becoming even more powerful to decipher the rich visual information embedded in videos. This talk will explore diverse use cases of video understanding and provide a glimpse of Twelve Labs offerings.

Speaker: James Le is the Head of Developer Experience at Twelve Labs, a startup building multimodal foundation models for video understanding. Previously, he worked at ML Infrastructure startups such as Superb AI and Snorkel AI, while contributing to the popular Full-Stack Deep Learning course series. He is also the host of Datacast, a podcast featuring conversations with founders, investors, and operators in the data and AI infrastructure space to unpack the narrative journeys of their careers.

Not a Meetup member? Sign up to attend the next event:

Recorded on Feb 15, 2024 at the AI, Machine Learning and Data Science Meetup.

#computervision #machinelearning #datascience #ai #artificialintelligence
Рекомендации по теме