Open Source LLMs Score Again! Introducing MPT-7B, Commercially Usable and Free of Charge

In this video, I introduce MPT-7B, the latest release in the MosaicML Foundation Series. It is a commercially usable, open-source LLM trained on 1 trillion tokens of text and code.
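
If you want to try it yourself, here is a minimal sketch using the Hugging Face transformers library; it assumes the base checkpoint is published on the Hub as mosaicml/mpt-7b and that MPT's custom architecture needs trust_remote_code=True:

```python
# A minimal sketch of loading and sampling from MPT-7B with transformers.
# Assumes the "mosaicml/mpt-7b" checkpoint on the Hugging Face Hub.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("mosaicml/mpt-7b")
model = AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-7b",
    trust_remote_code=True,  # MPT ships a custom model class
)

inputs = tokenizer("MosaicML trained MPT-7B on", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```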

Watch as we compare MPT-7B to popular open-source LLMs: the LLaMA series from Meta, the Pythia series from EleutherAI, the StableLM series from Stability AI, and the OpenLLaMA model from Berkeley AI Research. See how MPT-7B surpasses these models in commercial usability, training data volume, long-input handling, training speed, and inference optimization.

The blog post can be found here:
Comments

I've experimented with it and it's not amazing, but ALiBi sure is.

Looking forward to seeing that tech make it into a better 13B model.

priestesslucy
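
For anyone curious, ALiBi (Attention with Linear Biases) is the trick MPT-7B uses instead of positional embeddings, and it is what enables the long-input handling mentioned above. Here is a minimal sketch of the core idea, written from the ALiBi paper rather than MosaicML's actual code:

```python
# ALiBi in a nutshell: skip positional embeddings and instead add a
# distance-proportional penalty to each attention logit, with a fixed
# per-head slope. Illustrative sketch, not MosaicML's implementation.
import torch

def alibi_bias(n_heads: int, seq_len: int) -> torch.Tensor:
    """Bias of shape (n_heads, seq_len, seq_len) added to attention logits."""
    # Per-head slopes: geometric sequence 2^(-8h/n_heads) for h = 1..n_heads
    slopes = torch.tensor([2.0 ** (-8.0 * (h + 1) / n_heads) for h in range(n_heads)])
    pos = torch.arange(seq_len)
    # distance[i, j] = i - j: how far key j lies behind query i
    # (0 for future positions, which a causal mask removes anyway)
    distance = (pos[:, None] - pos[None, :]).clamp(min=0).float()
    # Farther keys get a larger penalty, so attention decays with distance
    return -slopes[:, None, None] * distance
```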

Thanks!! It seems like the model was introduced on May 5. Why does the video give me the impression that it was introduced today? :) Maybe the upload date doesn't match the date the video was prepared?

nctamer

I wish someone would discuss the cost to run this model versus it being "free". From what I can tell, cloud GPUs that aren't painfully slow run about $1.66/hr USD. How many of us who are just trying to get something started can afford, or are willing to spend, $1,200/month USD for one GPU? I'm a bit of a noob … am I wrong?

dbriand
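
For what it's worth, the arithmetic in the comment above roughly checks out for a GPU left running around the clock; note that the $1.66/hr rate is the commenter's figure, not a quoted price:

```python
# Back-of-the-envelope monthly cost of one always-on cloud GPU.
hourly_rate = 1.66           # USD per GPU-hour (commenter's figure)
hours_per_month = 24 * 30    # assuming 24/7 uptime
print(f"${hourly_rate * hours_per_month:,.0f}/month")  # ~$1,195/month
```

An instance billed only while it is actually serving requests would, of course, cost proportionally less.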

How do I make this... work? Like, if I wanted to use the MPT-7B-Chat model like ChatGPT, how do I set that up? Sorry for being braindead.

pawekozowski
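
One way to get started, sketched with the Hugging Face transformers pipeline API; it assumes the chat-tuned checkpoint is published on the Hub as mosaicml/mpt-7b-chat, and a GPU with roughly 16 GB of memory for the fp16 weights:

```python
# A minimal local chat loop around MPT-7B-Chat; a sketch, not an official
# setup. device_map="auto" needs the accelerate package installed.
import torch
from transformers import pipeline

chat = pipeline(
    "text-generation",
    model="mosaicml/mpt-7b-chat",
    torch_dtype=torch.float16,
    trust_remote_code=True,  # MPT ships a custom model class
    device_map="auto",       # place the weights on a GPU if one is available
)

while True:
    prompt = input("You: ")
    reply = chat(prompt, max_new_tokens=128, do_sample=True, top_p=0.9)
    print("Bot:", reply[0]["generated_text"])  # includes the prompt as a prefix
```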

I am sorry to say MPT is horrible. I tried to use it, fine-tuned it with LoRA, and it was still garbage. Do not waste your time.

timothymaggenti
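
For anyone who wants to repeat that experiment, here is a minimal sketch of attaching LoRA adapters to MPT-7B with the PEFT library. The target module name "Wqkv" is an assumption about MPT's fused attention projection; inspect model.named_modules() to confirm it before training:

```python
# Attach low-rank LoRA adapters to MPT-7B via PEFT; only the adapter
# weights are trained, the base model stays frozen. Sketch only.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("mosaicml/mpt-7b", trust_remote_code=True)
config = LoraConfig(
    r=8,                      # rank of the low-rank update matrices
    lora_alpha=16,            # scaling factor for the update
    target_modules=["Wqkv"],  # assumption: MPT's fused QKV projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # only a tiny fraction is trainable
```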