Computer Vision Meetup: Next Generation of Video Understanding with Twelve Labs

Показать описание

The evolution of video understanding has followed a similar trajectory to language and image understanding – with the rise of large pre-trained foundation models trained on a huge amount of data. Given the surge of multimodal research lately, video foundation models are becoming even more powerful to decipher the rich visual information embedded in videos. This talk will explore diverse use cases of video understanding and provide a glimpse of Twelve Labs offerings.

Speaker: James Le is the Head of Developer Experience at Twelve Labs, a startup building multimodal foundation models for video understanding. Previously, he worked at ML Infrastructure startups such as Superb AI and Snorkel AI, while contributing to the popular Full-Stack Deep Learning course series. He is also the host of Datacast, a podcast featuring conversations with founders, investors, and operators in the data and AI infrastructure space to unpack the narrative journeys of their careers.

Not a Meetup member? Sign up to attend the next event:

Recorded on Feb 15, 2024 at the AI, Machine Learning and Data Science Meetup.

#computervision #machinelearning #datascience #ai #artificialintelligence

Voxel51

Рекомендации по теме

Computer Vision Meetup: Next Generation of Video Understanding with Twelve Labs

Computer Vision Meetup: Next Generation of Video Understanding with Twelve Labs

Computer Vision Meetup: Storia AI - Next-Generation Image/Video Editor Built with Generative AI

Computer Vision Meetup: GenAI for Video: Diffusion-Based Editing and Generation

Computer Vision Meetup: Drones, Data and One Direction of Computer Vision

Computer Vision Meetup: Using AI to Test Software, Techniques and Tools

Computer Vision Meetup: Adapting to Change: Foundation Models, APIs, Past, Present and Future of AI

Computer Vision Meetup: Performance Optimisation for Multimodal LLMs

Computer Vision Meetup: Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation

Computer Vision Meetup: A Practical Approach to Deep Learning for Computer Vision with Tensorflow 2

Computer Vision Meetup: Learning Robot Perception and Control using Vision with Action

Computer Vision Meetup: Unleashing the Potential of Visual Data: Vector Databases in Computer Vision

Computer Vision Meetup: Deep Dive into Responsible and Unbiased GenAI for Computer Vision

Computer Vision Meetup: Why You Should Evaluate Your End-to-End LLM applications with In-House Data

Computer Vision Meetup: EgoSchema: A Dataset for Truly Long-Form Video Understanding

Data Brain Meetup: Next Generation of AI Intrapreneurship and Entrepreneurship

Meetup Computer Vision Paris #19

Computer Vision Meetup: Evaluating RAG Models for LLMs: Key Metrics and Frameworks

Computer Vision Meetup: Wearable Vision Sensors

Computer Vision Meetup: Unleashing the Potential of Visual Data: Vector Databases in Computer Vision

Computer Vision Meetup: Rise of the Intelligent Data Platform

Computer Vision Meetup: DreamSim - New Dimensions of Human Visual Similarity w/ Synthetic Data

Computer Vision Meetup: Combining Hugging Face Transformer Models and Image Data with FiftyOne

Computer Vision Meetup: Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos

Computer Vision Meetup: Machine Learning for Fast, Motion-Robust MRI