Create Multimodal Multi-Agent Apps with Autogen Studio | LLM Text to Speech Tutorial

In this video, we look at the newly released Autogen Studio, a UI for building AI agents that collaborate to solve complex tasks. The idea is to have specialized agents, each with assigned skills, converse with one another while working on a task. For example, one agent might handle planning and reasoning while another handles execution. Research from the Autogen team at Microsoft Research suggests that large language models perform much better when there is a feedback loop. Similar concepts already appear in LangChain's agents and, to an extent, in ChatGPT's Code Interpreter.
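To make the planner/executor feedback loop concrete, here is a minimal sketch in plain Python. It does not use Autogen or call any model: `stub_planner`, `stub_executor`, and `run_feedback_loop` are hypothetical names invented for illustration, and the "agents" are hard-coded stand-ins. It only shows the shape of the loop, where the executor's result is fed back so the planner can revise its next attempt.

```python
from typing import Callable, List, Tuple


def stub_planner(feedback: str) -> str:
    """Hypothetical planner: proposes an expression, revising it after an error."""
    if "error" in feedback:
        return "10 / 2"  # revised attempt after seeing the executor's error
    return "10 / 0"      # first attempt, deliberately broken


def stub_executor(expression: str) -> Tuple[bool, str]:
    """Hypothetical executor: evaluates the expression and reports the outcome."""
    try:
        # Builtins are disabled so only plain arithmetic can be evaluated.
        result = eval(expression, {"__builtins__": {}}, {})
    except Exception as exc:
        return False, f"error: {exc}"
    return True, f"result: {result}"


def run_feedback_loop(
    propose: Callable[[str], str],
    execute: Callable[[str], Tuple[bool, str]],
    max_rounds: int = 3,
) -> List[Tuple[str, str]]:
    """Alternate between planner and executor until success or max_rounds."""
    feedback = ""
    history: List[Tuple[str, str]] = []
    for _ in range(max_rounds):
        attempt = propose(feedback)
        ok, feedback = execute(attempt)
        history.append((attempt, feedback))
        if ok:
            break
    return history


history = run_feedback_loop(stub_planner, stub_executor)
for attempt, feedback in history:
    print(f"{attempt!r} -> {feedback}")
```

In a real setup the planner and executor would be LLM-backed agents, but the control flow is the same: propose, run, feed the result back, and stop once the task succeeds or a round limit is hit.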

We walk through one of their examples, which plots stock prices for a specific year, and then show how to get the agents to summarize research papers in audio format using multiple tools.

📚 Resources

👤 About Me - Ugo Osuji:
Comments
Author

The end of a task is often accompanied by the agents all congratulating one another in an endless loop of thank-yous and praise. This can cost a lot of money over time, so make sure you nip this in the bud when writing the system message for each agent! Otherwise you could be paying for GPT-4 to thank itself 50 times in a row! The system messages also cost money on their own, so read over them a few times and decide whether every part is really needed!

mickelodiansurname
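One way to picture the fix described in the comment above is a termination check plus a hard cap on automatic replies. The sketch below is plain Python with no Autogen dependency. `MAX_AUTO_REPLIES`, `is_termination_msg`, and `run_chat` are illustrative names, and the convention that an agent ends its final message with the word `TERMINATE` is an assumption that you would have to ask for in the system message. As far as I know, Autogen's agent classes expose similar hooks (`is_termination_msg` and `max_consecutive_auto_reply`), but check the version you have installed.

```python
from typing import Dict, List

MAX_AUTO_REPLIES = 5  # hypothetical cap on automatic replies per conversation


def is_termination_msg(message: Dict[str, str]) -> bool:
    """Return True when a message ends with the agreed stop word."""
    content = (message.get("content") or "").strip()
    return content.endswith("TERMINATE")


def run_chat(replies: List[str], max_auto_replies: int = MAX_AUTO_REPLIES) -> List[str]:
    """Deliver replies until the stop word appears or the cap is reached."""
    delivered: List[str] = []
    for reply in replies[:max_auto_replies]:
        delivered.append(reply)
        if is_termination_msg({"content": reply}):
            break
    return delivered


scripted = ["Here is the plot.", "Task complete. TERMINATE", "Thank you!", "You're welcome!"]
print(run_chat(scripted))  # the two trailing pleasantries are never delivered
```

The stop word ends the exchange as soon as the task is done, and the cap bounds the cost even if an agent forgets to emit it.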
Author

Amazing tutorial, please give more, learnt a lot here.

rorydaines
Author

Thanks for the video! Can you do a video showing us how to create these agent skills?

johnbarros
Author

How do I set the API base in the environment? It doesn't work when I set up the API key and base in the agents.

AngusLou
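For the question above, one approach that may be worth trying is to put the whole model configuration, including the base URL, into a single environment variable as JSON. The sketch below uses only the standard library. The variable name `OAI_CONFIG_LIST` is, to my knowledge, the one Autogen's config loader conventionally reads, but whether Autogen Studio's UI honors it is an assumption. The key for the endpoint has also changed between releases (`api_base` in older versions, `base_url` in newer ones, as far as I know), so check the docs for your installed version. The model name, key, and URL are placeholders, and `load_config_list` is a hypothetical helper.

```python
import json
import os

# Placeholder values: replace with your own model, key, and endpoint.
config_list = [
    {
        "model": "gpt-4",
        "api_key": "sk-PLACEHOLDER",
        # Assumed key name; older Autogen releases used "api_base" instead.
        "base_url": "http://localhost:8000/v1",
    }
]

# Export the configuration so any process started from here can read it.
os.environ["OAI_CONFIG_LIST"] = json.dumps(config_list)


def load_config_list(env_var: str = "OAI_CONFIG_LIST") -> list:
    """Read a JSON-encoded config list back from the environment."""
    return json.loads(os.environ.get(env_var, "[]"))


loaded = load_config_list()
print(loaded[0]["base_url"])
```

Setting the variable in the shell before launching the app has the same effect as the `os.environ` assignment here.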