Unlocking Reliable GenAI: Strategies for Assessing LLMs in Real-World Applications

preview_player
Показать описание
Dive into the world of LLM reliability with HoneyHive. In this video, Dhruv uncovers the shortcomings of current evaluation methods and provides practical solutions to boost your GenAI application's performance. Learn innovative strategies for rapid iteration and leveraging human feedback to ensure safer operations. Dhruv also covers how using other models (LLMs) to help with your evaluation pipeline is required to scale your evaluations framework.

ABOUT THE SPEAKER:
Dhruv Singh, Co-founder & CTO, HoneyHive (ex- Microsoft)

ABOUT DATA COUNCIL:
Data Council brings together the brightest minds in data to share industry knowledge, technical architectures and best practices in building cutting edge data & AI systems and tools.

FIND US:
Рекомендации по теме