GROK 2 vs. LLAMA 3.1 - Cloud vs Home Server Ai Testing

Показать описание

GROK 2 is a surprisingly good beta mini version model that X released this week. I run a battery of tests with xAI's latest, Grok 2.0, against Meta's cutting-edge Llama 3.1 70B that is running locally. We do a variety of user centric tasking tests as well as some coding.

Be sure to 👍✅Subscribe✅👍 for more content like this!

Please share this video to help spread the word and drop a comment below with your thoughts or questions. Thanks for watching!

Chapters
0:00 Meta LLAMA 3.1 vs xAI GROK 2
1:45 Setup VS Code and Python for Ai
4:20 GROK 2 Python Game Creation
6:37 LLAMA 3.1 Python Game Creation
8:17 GROK 2 Creative Storytelling
10:42 LLAMA 3.1 Creative Storytelling
12:02 GROK 2 Deductive Reasoning
13:10 LLAMA 3.1 Deductive Reasoning
14:26 LLAMA 3.1 Count Words in Sentence
15:22 GROK 2 Count Words in Sentence
16:22 GROK 2 Decimal Precision and Counting
16:35 LLAMA 3.1 Decimal Precision and Counting
17:27 GROK 2 Recipe from Ingredients
18:47 LLAMA 3.1 Recipe from Ingredients
20:55 LLAMA 3.1 Food Budgeting and Planning
23:16 GROK 2 Food Budgeting and Planning
26:06 GROK 2 Fitness Coach and Planner
28:13 LLAMA 3.1 Fitness Coach and Planner
30:43 Home Ai Server Privacy vs Cloud Ai Provider
32:14 Conclusion

*****
As an Amazon Associate I earn from qualifying purchases.

When you click on links to various merchants on this site and make a purchase, this can result in this site earning a commission. Affiliate programs and affiliations include, but are not limited to, the eBay Partner Network.
*****

Рекомендации по теме

Комментарии

🎯 Key points for quick navigation:

00:00:00 *🥅 Testing Overview*
- Introduction of AI models: Grok 2.0 vs. Llama 3.1,
- Emphasis on practical usability in testing,
- Mention of the beta status of Grok 2.0.
00:02:11 *💻 Visual Studio Setup*
- Instructions for installing Visual Studio on Windows,
- Setup guidance for Python projects and PyGame installation.
00:04:18 *🎮 Game Development Test*
- Grok and Llama 3.1 tasked to create a game called "Flappy Block",
- Grok showed better performance in producing functional Python code,
- Initial outputs required debugging from both models.
00:08:29 *📖 Creative Storytelling*
- Models tasked with creating a humorous story,
- Output was verbose and not particularly funny,
- Neither model excelled in succinctness or humorous storytelling.
00:12:49 *🐱 Deductive Reasoning Test*
- Scenario involving a cat and a carrier tested models' deductive reasoning,
- Neither model provided a correct or insightful solution,
- Highlights challenges in AI's deductive abilities.
00:14:42 *🧮 Numerical Reasoning*
- Basic numerical comparison challenge,
- Both Grok and Llama 3.1 correctly identified the larger number.
00:17:19 *🍳 Recipe Creation*
- Models tasked with creating a recipe from limited ingredients,
- Both models successfully generated recipe instructions,
- Llama 3.1 provided additional context with serving size and tips.
00:20:59 *🛒 Budget Meal Planning*
- Creation of a grocery shopping list and weekly meal plan,
- Emphasis on budget and caloric needs,
- Pricing and specifics of the meal plan noted to be inaccurate or outdated.
22:41 *📊 Inaccuracies in Nutritional Guidelines*
- Discussion of caloric intake recommendations for a 40-year-old male.
- Assessment of budget shopping lists and recipe nutritional information.
25:57 *🏋️ Analysis of Fitness Plans*
- Evaluation of exercise instructions and plans for a year-end weight target.
- Critique of calorie burn estimates and effectiveness of workout plans.
31:02 *🔒 Privacy Concerns with AI Models*
- Consideration of privacy implications with cloud-based AI.
- Comparison between Grok 2.0 and Llama 3.1 regarding data privacy.

Made with HARPA AI

EDGSV-fm

10:40 - models often struggle with multi-turn conversations. I highly recommend starting a new conversation, especially when changing subjects dramatically (from coding to storytelling).

KastanDay

Messed up "video position shortcut" in description (Home Ai Server Privacy vs Cloud Ai Provider)

dllsmartphone

Do you have a guide on how to host Llama 70B like in this video? I couldn't find it in your previous videos.

nufh

Nice rig, how did you put it together

mikeygomes

can you share soucre code ui for LLAMA 3.1

termino

Run locally? not both
Uncensored? not both
No comparison points.

大支爺

GROK 2 vs. LLAMA 3.1 - Cloud vs Home Server Ai Testing

GROK 2 vs. LLAMA 3.1 - Cloud vs Home Server Ai Testing

Is Elon’s Grok 3 the new AI king?

Zuck's new Llama is a beast

GROK 2 Just Dropped - Is It Worth the Hype?

How Did Llama-3 Beat Models x200 Its Size?

Elon's New Grok-3 Just CRUSHED OpenAI O1 and Deepseek R1

Meta Llama 3.1 is Game Over for GPT 4o ❓

GROK 3 | First Impression and TESTS - Best AI On Earth?

Robots with HUMAN Skin, LLaMA 3 405b, Grok 2, Gen3 Video, Figure Robot, Meta AI Glasses

All You Need To Know About Running LLMs Locally

This new AI is powerful and uncensored… Let’s run it

Why Are Programmers Switching from ChatGPT to Claude 3.5

GPT 4.5 vs Claude 3.7 vs Grok 3 - Which Is Better?

Alibaba Qwen 2.5 - Max: Can It Beat ChatGPT , DeepSeek V3 & Llama 3.1 #AlibabaQwen #QwenMax #t...

Claude has taken control of my computer...

Grok-2 (Fully Tested) : The BEST & UNCENSORED MODEL is here? (Beats Claude-3.5 Sonnet, GPT-4O!?)

Wake up babe, a dangerous new open-source AI model is here

Sam Altman's new $200 ChatGPT has a big Elon problem...

Build Anything with Llama 3 Agents, Here’s How

Did xAI Cheat? The Truth About Grok-3’s Benchmarks!

Getting Started With Meta Llama 3.2 And its Variants With Groq And Huggingface

Elon Musk STUNS The Industry With GROK 2

Elon Musk's xAI introduces the new Grok-2 and Grok-2 Mini AI models #Grok2 #Grok #xAI #ElonMusk

Grok 3 vs DeepSeek R1 vs ChatGPT o3-mini with critical prompts #grok #deepseek #chatgpt #openai