Leveraging Generative AI for Data Processing by Immanuel Trummer [DSDSD 2023]

DSDSD - THE DUTCH SEMINAR ON DATA SYSTEMS DESIGN:
We hold bi-weekly talks on Fridays from 3:30 PM to 5 PM CET for and by researchers and practitioners designing (and implementing) data systems. The objective is to establish a new forum for the Dutch Data Systems community to come together, foster collaborations between its members, and bring in high-quality international speakers. We invite all researchers working on related topics, especially Ph.D. students, to join the events. It is an excellent opportunity to receive early feedback from researchers in your field.

Abstract:
The year 2022 was marked by several breakthrough results in the domain of generative AI, culminating in the rise of tools like ChatGPT that can solve a variety of language-related tasks without specialized training. In this talk, I outline novel opportunities in the context of data management enabled by these advances. I discuss several recent research projects aimed at exploiting advanced language processing for tasks such as parsing a database manual to support automated tuning, or mining data for patterns described in natural language. Finally, I discuss our recent and ongoing research aimed at synthesizing code for SQL processing in general-purpose programming languages, while enabling customization via natural language commands.

Bio:
Immanuel Trummer is an assistant professor of computer science at Cornell University. His research covers various aspects of large-scale data management with the goal of making data analysis more efficient and more user-friendly. His publications were selected for “Best of VLDB”, for the ACM SIGMOD Research Highlight Award, and for publication in CACM as a CACM Research Highlight. He is a recipient of the Google Faculty Research Award and an alumnus of the German National Academic Foundation.