Stanford CS25: V4 I From Large Language Models to Large Multimodal Models

preview_player
Показать описание
May 9, 2024
Speaker: Ming Ding, Zhipu AI

As large language models (LLMs) have made significant advancements over the past five years, there is growing anticipation for seamlessly integrating other modalities of perception (primarily visual) with the capabilities of large language models. This talk will start with the basics of large language models, discuss the academic community's attempts at multimodal models and structural updates over the past one year. We will focus on introducing CogVLM, a powerful open-source multimodal model with 17B parameters (equivalent to a 7B dense model), and CogAgent, a model designed for scenarios involving GUI and OCR. Finally, we will discuss the applications of multimodal models and viable research directions in academia.

About the speaker:
Ming Ding is a research scientist at Zhipu AI based in Beijing. He obtained his bachelor's and doctoral degrees at Tsinghua University, advised by Prof. Jie Tang. His research interests include multimodal, generative models, and pre-training technologies. He has led or participated in the research works about multimodal generative models such as CogView and CogVideo; multimodal understanding models CogVLM and CogAgent; and language models such as GLM and GLM-130B.

Рекомендации по теме
Комментарии
Автор

PLEASE UPLOAD cs231n, cs237a cs237b, cs224r

harshitmeena
Автор

I wish my lecturer university give newest and recent knowledge instead still focusing in OLD MACHINE LEARNING

kingki
Автор

The finding that what matters is your pretraining loss is huge. It means we are wasting so much $ on additional parameters for specific tasks when smaller models are needed only.

tusharkelkar
Автор

Por alguna extraña razón las mujeres le dan más valor a lo que escuchan que a lo que esta escrito.
Y valgan verdades, la sociedad es femenina por definición
Por alguna extraña razón la gente le da más valor a las promesas electorales de aquel que se autoproclama defensor de tus derechos y te pide el voto a cambio de mejorarte la vida.
Sin embargo, luego de unos años, las únicas vidas que han mejorado son las vidas de aquellos que se habían autoproclamado.
Si la política consiste en engañar a la gente y a cuanta más personas engañes mejor.
Pregunto, ¿tu te sientes parte del problema o de la solución?

- IMV P2rtido Polític0 en WordPr3ss
-- El primer partido político con funcionamiento interno verdaderamente democrático en la historia de España.
-- Sin machitos alfa, sin capataces a lomo de caballo blanco, sin tonto pollas, sin chulo playas, sin cuadros, sin mi3rdas

javierluna
Автор

Can you add a voiceover this is almost unintelligible, sorry.

Nnonymus