Testing Llama 3: Did it Pass the Coding and Reasoning Test?

preview_player
Показать описание
Hey everyone, welcome back to another exciting tutorial where we test the powerful Llama 3 AI model live! Today, we’re diving deep into coding tests with Python, tackling logical reasoning questions, and even creating a classic snake game from scratch! 🌟

👇 What We Cover in This Video:
In this detailed walkthrough, we use the 70 billion parameter model from Hugging Face to handle various tasks from simple math functions to more complex coding challenges like creating an identity matrix. See how Llama 3 performs in real-time with our coding tests and logical reasoning problems. 🧠💻

Despite facing a tough challenge with the ECG sequence function, Llama 3 impressively passes numerous other difficult tests, showcasing its capabilities beyond many open-source models. 🏆

🎮 Snake Game Creation: We conclude with a fun coding session where we develop a snake game in Python, demonstrating both the potential and limits of AI in game development.

🔗 Resources:

✅ Subscribe for more AI-related content: Make sure to hit the like button to help more people discover this video. Stay tuned for more tutorials on utilising AI in real-world applications!

Timestamps:
0:00 Introduction
0:05 Overview of Llama 3
0:51 Start of Python Coding Tests
3:20 Logical and Reasoning Tests
4:57 Snake Game Creation in Python

#Llama3 #CodingTest #LogicalReasoning #Test #Llama3Testing #Llama3Test #Llama3 #MetaLlama3Testing #MetaLlama3Test #Llama3Testing #Llama3Test #MetaLlama3Testing #MetaLlama3Test
Рекомендации по теме
Комментарии
Автор

This is amazing! I tested it for quite complex calculation and it surely matching GPT4 capabilities, if not surpassing it. (70 billion one)

Cingku
Автор

I'm excited to see whether the 8B model will be good enough to handle tool usage in crewai

PrinzMegahertz
Автор

Thanks for not using clickbait.
Recommendation:
The sound at like 1:29 and all other instances of it, is way too loud compared to your voice.
Either make your voice twice as loud as that ding/ring sound or reduce that ding/ring sound by 2 4 times.

FusionDeveloper
Автор

4:34 did you read the derivation steps? it computes 5/6 first which gives 0.83 (approximately) then times 12 giving $9.96 so technically is not wrong.

Rcky
Автор

Is available via Ollama and lm studio?

Augmented_AI
Автор

i think llama 3 was trained to do snake game with a specific dataset or instruction, i tested here and generated the same exactly snake game copy, but what a impressive AI, very good content, keep the good work.

Fratex-gy
Автор

Woooohooo!! 👏👏👏👏 for you and for Meta as well! ❤❤

MeinDeutschkurs
Автор

Can you fine tune for a language other than Python or JS and see how it does with a less common or even esoteric language? I’m thinking Elixer or Haskell, or gleam would be a great candidate as it’s so new.

VastCNC
Автор

👋Thanks for the insight you provide us from the world of LLM.
I would be very happy if you include tests of how these models handle text in different natural languages. (Chinese, Arabic, Indian, some European and Cyrillic. I'm having a problem with my native language and some of the patterns right now. I have to translate first to English and then to my native language. It's a bit awkward..🌱

skarloti
Автор

Just think what will happen in 3 years 👍🔥

Автор

I really wish YouTubers would try a different test other than snake game

yellowboat
Автор

Which LLM passed the Expert level test?

firstlast
Автор

Asked it make snake in js, total fail

MavVRX
Автор

We are all busy testing those models… I wonder how many humans can actually pass those tests 😅

nickbobrowski
Автор

Please do Wizard llm the new one next if possible

bgNinjashows
Автор

why everyone love to create snake game? how about a maze game? At least I failed to use it to create maze game... (near success but still failed)

fenix
Автор

Is snake game become the industry standard to test the coding / gaming ability now?

dkirk