I tested Mistral AI 7B vs ChatGPT (GPT 3.5 TURBO) on 20 Questions!!!

Показать описание

❤️ If you want to support the channel ❤️
Support here:

🧭 Follow me on 🧭

Рекомендации по теме

Комментарии

the one issue i noted is when you check llm please clear the chat history before new question, don't append new question, due to token issues that affect quality of response, chat gpt got high context window, but mistral doesnt

proflicxx

🎯 Key Takeaways for quick navigation:

00:00 🤖 The video introduces a comparison between Mistral AI's 7 billion parameter model and GPT 3.5 Turbo (ChatGPT) to evaluate their responses to various prompts.
01:18 🛡️ The presenter mentions concerns about data contamination and the difficulty of comparing Mistral AI's 7 billion parameter model with GPT-4 due to differences in model sizes and architecture.
03:37 💼 The comparison involves testing both models on a set of 20 questions covering categories like reflection, knowledge, code, and more, to observe and compare their responses in different contexts.
04:19 ⚖️ The comparison begins by evaluating responses to a reflection question regarding the use of Kubernetes, showcasing both Mistral AI and GPT 3.5 Turbo's generated arguments for and against Kubernetes.
05:17 🧮 The testing includes diverse questions, such as math problems and language puzzles, testing the models' ability to understand and provide accurate solutions, revealing variations in their performance.
06:51 🌍 The models are tested on questions related to general knowledge, including understanding family relations, countries' status, scientific concepts, and more, showcasing their comprehension and accuracy in different topics.
08:13 🗺️ The comparison includes political questions, testing the models' ability to provide neutral and accurate information regarding sensitive political topics like Taiwan's status.
09:46 🇫🇷 Both models are tasked with translating English text to French, showcasing variations in their translations and understanding of the given English phrases.
10:42 💻 The models are challenged with explaining code, finding bugs, and generating new code, revealing differences in their programming knowledge and capabilities.
12:47 📝 Mistral AI impresses with its detailed understanding and bug-spotting ability in code, showcasing its potential for code-related tasks.
13:39 🖥️ ChatGPT demonstrates accuracy in generating a Python function for finding leap years, highlighting its programming capabilities.
14:20 🎶 Both models attempt to generate a 12 Bar Blues chord progression in the key of E, showcasing their ability to generate musical content.
15:04 🌌 Mistral AI provides a JSON response for the five planets closest to the Sun, demonstrating its ability to understand and follow specific formatting instructions.
16:42 😃 ChatGPT impresses with an SVG code for a smiley face, showcasing its creativity and ability to generate specific design elements.
19:07 🛍️ Both models provide compelling product descriptions for a 100W wireless charger, demonstrating their capacity to generate marketing content.
20:05 🎤 ChatGPT excels in crafting a persuasive pitch to encourage YouTube viewers to subscribe, highlighting its ability to generate engaging and promotional content.
21:43 👏 The comparison concludes with an overall positive impression of Mistral AI's 7 billion parameter model, emphasizing its impressive performance throughout the test.

Made with HARPA AI

Devjwal

The distance from earth is given in AU (astronomical unit)
It is a valid distance unit in astronomy when dealing with intra solar system distances as it's too large for kilometers but too small for light years.
1 AU = Avg distance of earth from the sun.

vishalpanjeta

yep, the context issue is one thing, the overall chat settings like temp, top_K/P, rep penlty etc... it all can have possible influence on output [like not following the no-explain instruction].... still, thanks for your effort

yngeneer

Munch-House-En Tri-lemma (trilemma as in a triple dilemma) 👍 Great video, thanks for creating and sharing.

Serifinity

So early that even 720p is not available.

mathematicalninja

Look at the OpenOrca trained version of Mistral 7B next 🙂

IslandDave

I’ve been using Mistral 7B model for a while now and I feel it’s not as good as GPT3.5; now I prefer to use Orca2 13B model instead of Mistral 7B

ViewpointsVortex

The svg had colour of white, it may be working specially on a colour background. Maybe check it by changing colour. BTW mistral app is really fast compared to Chatgpt. That’s a pro.

eric

Sally can have either 0, 1, 2 or more sisters, if we consider she can have half sisters and so can her brothers

cuentadeyoutube

Re: Blue Progression: You don't have to know anything about music to see:
Mistral is just going back and forth between two chords (2 chords back and forth)
Chat-GPT3.5 - Gets all fancy and adds a B7 to the E7-A7 toggle. (All 1 chord, then toggle, then tri)
I don't even have to know what this sounds like to know GPT3.5 Won this round.

leafdriving

Can you implement Mistral AI on tabular dataset Q&A (likeusing langchain agents etc.)

anuvratshukla

I tested mistral on python function for leap years it did get it right even better than chatgpt

faaz

Coding tasks are bit shallow. Try to add some more - like converting the code from language to language, modifying / refactoring code by verbal instructions, asking llm to optimize the code or finding edge cases, writing tests

alx

Can you make a video on how to add knoeledge to Mistral 7B ?

Techonsapevole

I mean shoot that’s really good, if you train it with a gpt3.5 dataset it would preform exactly like it.

Nick_With_A_Stick

French translation was better in chatgpt, difference is so small but it is a tiny bit more poetic. But both are equivalent.

gui-zxdi

7:02 GPT 3.5 is actually right it says "including herself"

Somebodythatoverthinks

The audio feels muted 😅 thought it was my phone but I checked with other videos too

HB-klik

Munchousen syndrome is where a captive empathizes with their captor. There was a bank robbery in that town i believe where the hostages began to help the person who took them hostage.

NOFX

I tested Mistral AI 7B vs ChatGPT (GPT 3.5 TURBO) on 20 Questions!!!

I tested Mistral AI 7B vs ChatGPT (GPT 3.5 TURBO) on 20 Questions!!!

Mistral 7B -The Most Powerful 7B Model Yet 🚀 🚀

Get Started with Mistral 7B Locally in 6 Minutes

Mistral 7b - the best 7B model to date (paper explained)

NEW Mixtral 8x22b Tested - Mistral's New Flagship MoE Open-Source Model

NEW Mistral AI Update is INSANE (FREE!)

Fine-Tuning Mistral AI 7B for FREEE!!! (Hint: AutoTrain)

Local Low Latency Speech to Speech - Mistral 7B + OpenVoice / Whisper | Open Source AI

Install Ollama on Windows 11 WSL - Run Llama 3 & Mistral Locally (2025 Guide)

NEW Mistral-7B v0.3 🇫🇷 TESTED: Uncensored, Function Calling, faster than llama3 8b?!

Mistral 7B Dolphin Uncensored - Is This The New SMALL KING? 👑

Mistral 7B 🖖 Beats LLaMA2 13b AND Can Run On Your Phone??

UNCENSORED Mistral v0.2 Dolphin LLM - Won't Refuse Anything!

100% kostenlose KI und besser als ChatGPT? Mistral im Test!

🔥🚀 Inferencing on Mistral 7B LLM with 4-bit quantization 🚀 - In FREE Google Colab

You're Prompting Mistral WRONG!

Create an AI Clone of yourself using MISTRAL 7b! credits: @zorothewiz

Mistral AI - je fais tourner un LLM sur mon macbook avec Mistral 7b

Mistral 7B: The BEST Tiny Model EVER! Beats LLAMA 2 (Installation Tutorial)

Mistral AI 7B LLM: The Game-Changer for AI Conversations #mistral #openai #anthropic

Codestral-Mamba (7B) : Testing the NEW Mamba Coding LLM by Mistral (Beats DeepSeek-V2, Qwen2?)

Fine-tuning a CRAZY Local Mistral 7B Model - Step by Step - together.ai

Install Mistral 7B Locally - Best OpenSource LLM Yet !! Testing and Review

Mistral AI: Free Tier & Lower Prices