Hallucinations and Hyperparameters: Navigating the Quirks of LLMs - Yonatan Alexander
In my talk, Hallucinations and Hyperparameters: Navigating the Quirks of LLMs, I’ll start with an overview of the current state of LLMs before moving into the technical details of deployments. I’ll cover key elements such as hyperparameters, batching, GPU utilization, routing strategies, and token summarization—critical factors in optimizing both performance and cost.
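The abstract itself contains no code, but the effect of sampling hyperparameters can be illustrated with a toy sketch. The function below implements temperature scaling and top-p (nucleus) filtering over a small logit table in pure Python; the names and the toy vocabulary are illustrative, not taken from the talk:

```python
import math
import random

def sample_token(logits, temperature=1.0, top_p=1.0, seed=None):
    """Sample one token from {token: logit} using temperature and top-p."""
    # Temperature rescales logits: lower values sharpen the distribution,
    # higher values flatten it toward uniform.
    scaled = {tok: logit / temperature for tok, logit in logits.items()}

    # Softmax over the scaled logits (shifted by the max for stability).
    m = max(scaled.values())
    exps = {tok: math.exp(v - m) for tok, v in scaled.items()}
    total = sum(exps.values())
    probs = {tok: e / total for tok, e in exps.items()}

    # Top-p filtering: keep the smallest set of highest-probability tokens
    # whose cumulative mass reaches top_p, then renormalize and sample.
    kept, cum = {}, 0.0
    for tok, p in sorted(probs.items(), key=lambda kv: -kv[1]):
        kept[tok] = p
        cum += p
        if cum >= top_p:
            break
    norm = sum(kept.values())
    r = random.Random(seed).random() * norm
    for tok, p in kept.items():
        r -= p
        if r <= 0:
            return tok
    return tok  # fallback for floating-point rounding
```

With a very small `top_p`, only the single most likely token survives the filter, which is why low-temperature, low-top-p settings make output nearly deterministic.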
We’ll then explore considerations for companies looking to adopt LLMs, including the decision between platform-based solutions and self-deployment. I’ll explain the differences between fine-tuning, Retrieval-Augmented Generation (RAG), and prefix caching, as well as how to choose the right model based on factors like cost, scalability, and control. Security will be a central focus, with discussions on prompt injection, jailbreak risks, and mitigation strategies.
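To make the RAG option concrete, here is a minimal sketch of the retrieve-then-prompt loop. Word-overlap scoring stands in for the embedding similarity a real pipeline would use, and all function names are hypothetical:

```python
def retrieve(query, documents, k=1):
    # Score each document by word overlap with the query -- a crude
    # stand-in for the vector similarity search used in real RAG systems.
    q = set(query.lower().split())
    scored = sorted(documents, key=lambda d: -len(q & set(d.lower().split())))
    return scored[:k]

def build_prompt(query, documents):
    # Augment the prompt with retrieved context so the model answers
    # from supplied facts rather than (possibly hallucinated) memory.
    context = "\n".join(retrieve(query, documents, k=1))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"
```

The design trade-off the talk points at: RAG injects fresh knowledge per request without retraining, fine-tuning bakes behavior into weights, and prefix caching only reuses computation for a shared prompt prefix rather than adding any new knowledge.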
For end users, I’ll provide best practices for prompt engineering, emphasizing how to maximize the effectiveness of LLMs in tasks like code generation and solving complex problems.
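One widely used prompt-engineering practice is to give the model an explicit role, constraints, and a few worked examples. The template below is a generic illustration of that structure, not a template from the talk:

```python
def make_prompt(task, examples, constraints):
    """Assemble a role + rules + few-shot prompt for a code-generation task."""
    # Few-shot examples show the model the expected input/output format.
    shots = "\n\n".join(f"Input: {i}\nOutput: {o}" for i, o in examples)
    # Explicit constraints reduce ambiguity in what counts as a valid answer.
    rules = "\n".join(f"- {c}" for c in constraints)
    return (
        "You are a careful Python code generator.\n"
        f"Rules:\n{rules}\n\n"
        f"{shots}\n\n"
        f"Input: {task}\nOutput:"
    )
```

The structure, not the exact wording, carries the benefit: the role sets the domain, the rules constrain the output, and the examples fix the format.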
Finally, we’ll discuss how to stay ahead in the rapidly evolving AI landscape.