AI Agents: Looping vs Planning

Today, I want to discuss the ideas around looping versus planning agents. The ReAct paper is well known: you maintain a train of thought, have access to tools, and loop through thinking, calling tools, and reasoning about next steps. While this is cool, it becomes more complicated and less reproducible for real-world applications beyond simple academic examples. Few-shot examples are hard to capture, since the inbound requests, the number of tools, and the feedback signal for improvement are all unclear.
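To make the looping pattern concrete, here is a minimal sketch of a ReAct-style loop. The stubbed model and the calculator tool are illustrative stand-ins, not from the paper or any library; a real agent would call an LLM where `fake_model` appears.

```python
# A minimal ReAct-style loop: think, act, observe, repeat.
# `fake_model` stands in for an LLM; it picks the next action from the transcript.

def fake_model(history):
    """Stand-in for an LLM: decide the next action from the transcript so far."""
    if "Observation: 4" in history:
        return ("finish", "4")          # model decides it has the answer
    return ("calculator", "2 + 2")      # otherwise, call a tool

TOOLS = {"calculator": lambda expr: str(eval(expr))}

def react_loop(question, model, tools, max_steps=5):
    history = f"Question: {question}"
    for _ in range(max_steps):
        action, arg = model(history)    # "thought" collapses into a chosen action
        if action == "finish":
            return arg
        observation = tools[action](arg)  # run the tool, append the observation
        history += f"\nAction: {action}[{arg}]\nObservation: {observation}"
    return None                         # loop budget exhausted without an answer
```

The fragility the text describes lives in `fake_model`: with real requests, the number of loop turns and which tools get called are hard to predict or reproduce.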
Looping may not be perfect, so the goal is to propose solutions that lean more on plans and DAGs while still reasoning about them in a fuzzy way. I think in terms of inputs and outputs, even with Instructor models: with an agent, the output data structure should be a deterministically executable plan. We can then fine-tune a model that takes requests and produces the correct plan.
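As a sketch of what "the output should be a deterministically executable plan" could look like: plain dataclasses below stand in for the Pydantic models a structured-output library like Instructor would actually use, and the field names are assumptions for illustration. The key property is that the plan can be validated as a DAG before anything runs.

```python
from dataclasses import dataclass, field
from graphlib import TopologicalSorter, CycleError

# Hypothetical schema for the plan an agent model would emit as structured
# output; field names are illustrative, not from any library.

@dataclass
class ToolCall:
    id: str
    tool: str                                   # name of a registered tool
    args: dict                                  # arguments for that tool
    depends_on: list = field(default_factory=list)  # DAG edges: prerequisite ids

@dataclass
class Plan:
    calls: list                                 # list of ToolCall

    def validate(self):
        """Reject plans whose dependency edges contain a cycle."""
        graph = {c.id: set(c.depends_on) for c in self.calls}
        try:
            list(TopologicalSorter(graph).static_order())
        except CycleError:
            raise ValueError("plan is not a DAG")
```

Because the plan is data rather than a loop, a bad plan can be rejected or repaired before execution, which is what makes the execution side deterministic.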
Here's how we can do it:
1. Predict all necessary tools given a request, possibly using multiple hops and a recommendation system based on similar and complementary tools. There will be precision and recall trade-offs.
2. Given the request, the retrieved tools, and their descriptions/instructions, generate an execution plan as a DAG. The conversation iteratively refines the plan.
3. Fine-tune a model that takes the inputs and tools and predicts the final plan, assuming later modifications don't change it much.
4. Given the request and tools, retrieve examples of successfully executed plans to hydrate the prompt with few-shot examples of sophisticated plans.
5. If the plan is too complex to generate with fully implemented edges, implement individual edges (transitioning from one node's outputs to another node's inputs) using a ReAct loop and few-shot examples.
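The steps above separate constructing the plan from running it. A deterministic executor for such a plan might look like the following sketch; the plan format (dicts with `id`/`tool`/`args`/`depends_on` keys) and the tool-registry convention are assumptions for illustration, and `graphlib.TopologicalSorter` is from the Python standard library (3.9+).

```python
from graphlib import TopologicalSorter

def execute_plan(plan, registry):
    """Run a plan (a list of step dicts forming a DAG) with no further model calls."""
    by_id = {step["id"]: step for step in plan}
    graph = {step["id"]: set(step.get("depends_on", [])) for step in plan}
    results = {}
    # Visit nodes in dependency order; every step sees its upstream results.
    for node in TopologicalSorter(graph).static_order():
        step = by_id[node]
        upstream = {dep: results[dep] for dep in step.get("depends_on", [])}
        results[node] = registry[step["tool"]](step["args"], upstream)
    return results

# Illustrative usage with a toy registry:
registry = {
    "fetch": lambda args, upstream: args["value"],
    "double": lambda args, upstream: 2 * next(iter(upstream.values())),
}
plan = [
    {"id": "a", "tool": "fetch", "args": {"value": 21}},
    {"id": "b", "tool": "double", "args": {}, "depends_on": ["a"]},
]
```

Only the construction of `plan` involved a model; `execute_plan` is ordinary, reproducible code.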
The idea is to produce the entire plan separately from its execution: the plan's construction is probabilistic, but its execution is not. The goal is to produce artifacts that can be retrieved to create more few-shot examples, leaving a single artifact at the end of each conversation. This allows fine-tuning models to predict the output correctly in a single shot, essentially compiling the system.
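One way to sketch the artifact idea: store each conversation's final (request, plan) pair and, at planning time, retrieve the most similar past requests as few-shot examples. Everything here is hypothetical; the token-overlap similarity is a placeholder for real embedding search.

```python
# Naive artifact store for (request, plan) pairs; token-overlap similarity
# stands in for embedding-based retrieval. All names are illustrative.

def similarity(a, b):
    """Jaccard overlap of lowercase word sets; a placeholder metric."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / max(len(ta | tb), 1)

class PlanStore:
    def __init__(self):
        self.artifacts = []                 # list of (request, plan) pairs

    def save(self, request, plan):
        """Record the single artifact left at the end of a conversation."""
        self.artifacts.append((request, plan))

    def few_shot(self, request, k=2):
        """Return the k most similar past (request, plan) pairs for the prompt."""
        ranked = sorted(self.artifacts,
                        key=lambda item: similarity(request, item[0]),
                        reverse=True)
        return ranked[:k]
```

The same stored pairs double as fine-tuning data, which is what lets the system eventually predict good plans in a single shot.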