Want High Performing LLMs? Hint: It is All About Your Data // Vikram Chatterji // LLMs in Prod Con

This LLMs in Production Conference section is proudly sponsored by Galileo.

// Abstract
Building LLMs that work well in production, at scale, can be a slow, iterative, costly, and unpredictable process. New LLMs emerge every day and, as we saw in the Transformers era, models are becoming increasingly commoditized. The differentiator and key ingredient for high-performing models will be the data you feed them.

This talk focuses on the criticality of ensuring data scientists work with high-quality data across the ML workflow, the importance of pre-training, and the common gotchas to avoid in the process.

// Bio
Vikram is the co-founder and CEO of Galileo, the first data-centric platform for model debugging.
Vikram previously led Product Management at Google AI, where he painfully realized the criticality of good-quality data for good-quality model outcomes, as well as the highly manual nature of ML data debugging.