Testing Llama 3: Did it Pass the Coding and Reasoning Test?

Показать описание

Hey everyone, welcome back to another exciting tutorial where we test the powerful Llama 3 AI model live! Today, we’re diving deep into coding tests with Python, tackling logical reasoning questions, and even creating a classic snake game from scratch! 🌟

👇 What We Cover in This Video:
In this detailed walkthrough, we use the 70 billion parameter model from Hugging Face to handle various tasks from simple math functions to more complex coding challenges like creating an identity matrix. See how Llama 3 performs in real-time with our coding tests and logical reasoning problems. 🧠💻

Despite facing a tough challenge with the ECG sequence function, Llama 3 impressively passes numerous other difficult tests, showcasing its capabilities beyond many open-source models. 🏆

🎮 Snake Game Creation: We conclude with a fun coding session where we develop a snake game in Python, demonstrating both the potential and limits of AI in game development.

🔗 Resources:

✅ Subscribe for more AI-related content: Make sure to hit the like button to help more people discover this video. Stay tuned for more tutorials on utilising AI in real-world applications!

Timestamps:
0:00 Introduction
0:05 Overview of Llama 3
0:51 Start of Python Coding Tests
3:20 Logical and Reasoning Tests
4:57 Snake Game Creation in Python

#Llama3 #CodingTest #LogicalReasoning #Test #Llama3Testing #Llama3Test #Llama3 #MetaLlama3Testing #MetaLlama3Test #Llama3Testing #Llama3Test #MetaLlama3Testing #MetaLlama3Test

Рекомендации по теме

Комментарии

This is amazing! I tested it for quite complex calculation and it surely matching GPT4 capabilities, if not surpassing it. (70 billion one)

Cingku

I'm excited to see whether the 8B model will be good enough to handle tool usage in crewai

PrinzMegahertz

Thanks for not using clickbait.
Recommendation:
The sound at like 1:29 and all other instances of it, is way too loud compared to your voice.
Either make your voice twice as loud as that ding/ring sound or reduce that ding/ring sound by 2 4 times.

FusionDeveloper

4:34 did you read the derivation steps? it computes 5/6 first which gives 0.83 (approximately) then times 12 giving $9.96 so technically is not wrong.

Rcky

Is available via Ollama and lm studio?

Augmented_AI

i think llama 3 was trained to do snake game with a specific dataset or instruction, i tested here and generated the same exactly snake game copy, but what a impressive AI, very good content, keep the good work.

Fratex-gy

Woooohooo!! 👏👏👏👏 for you and for Meta as well! ❤❤

MeinDeutschkurs

Can you fine tune for a language other than Python or JS and see how it does with a less common or even esoteric language? I’m thinking Elixer or Haskell, or gleam would be a great candidate as it’s so new.

VastCNC

👋Thanks for the insight you provide us from the world of LLM.
I would be very happy if you include tests of how these models handle text in different natural languages. (Chinese, Arabic, Indian, some European and Cyrillic. I'm having a problem with my native language and some of the patterns right now. I have to translate first to English and then to my native language. It's a bit awkward..🌱

skarloti

Just think what will happen in 3 years 👍🔥

I really wish YouTubers would try a different test other than snake game

yellowboat

Which LLM passed the Expert level test?

firstlast

Asked it make snake in js, total fail

MavVRX

We are all busy testing those models… I wonder how many humans can actually pass those tests 😅

nickbobrowski

Please do Wizard llm the new one next if possible

bgNinjashows

why everyone love to create snake game? how about a maze game? At least I failed to use it to create maze game... (near success but still failed)

fenix

Is snake game become the industry standard to test the coding / gaming ability now?

dkirk

Testing Llama 3: Did it Pass the Coding and Reasoning Test?

Testing Llama 3: Did it Pass the Coding and Reasoning Test?

Zuck's new Llama is a beast

LLaMA 3 Tested!! Yes, It’s REALLY That GREAT

LLaMA 3 UNCENSORED 🥸 It Answers ANY Question

How Did Llama-3 Beat Models x200 Its Size?

Llama 3.1 is ACTUALLY really good! (and open source)

Llama-3.3: The BEST Opensource LLM EVER! Beats GPT-4o! (Fully Tested)

Meta Llama 3.1 is Game Over for GPT 4o ❓

LLAMA 3.3 70B Fully Tested ( Coding / Logic and reasoning / Math ) #LLAMA3.3

Llama-3.3 (Fully Tested) : The BEST OPEN LLM is HERE! (+O1 Pro Thoughts)

Llama 3.2 VISION Tested - Shockingly Censored! 🤬

Llama 3.3 70B is Here! EXPERTS Are Raving About This Open Model

Testing Llama 3: Evaluating Performance With Coding and Reasoning! Better Than GPT-4?

This Llama 3 is powerful and uncensored, let’s run it

Meta llama 3 unexpected results! 100 prompt test

Llama 405b BEAST already exploited | Here’s how

Shocked at the results Meta Llama 3 vs. Microsoft Phi-3 vs. OpenAI ChatGPT 3.5

Llama3.3 with Ollama with GUI Locally - Llama 70B Instruct Testing

Llama 405b: Full 92 page Analysis, and Uncontaminated SIMPLE Benchmark Results

LLaMA 405b Fully Tested - Open-Source WINS!

Meta's New Llama 3.2 is here - Run it Privately on your Computer

Llama 3.3 70B - THE BEST LOCAL AI YET!

New Llama 3.3 Shocks the AI World - Crushes GPT-4 and Costs Almost Nothing

Meta's NEW Llama 3.3 70B Update Tested LIVE…🤯