Deploy your LLaMA-2 model to Google Cloud

preview_player
Показать описание
This is part of the "Deploy LLaMA-2 models to Google Cloud" series.
Learn how to make your own LLaMA-2 models and deploy them to Google Cloud. The tutorial will cover converting LLaMA GGML models to GGUF and deploying them to Google Cloud Run, Compute Engine and Kubernetes Engine. Use the $300 trial given by Google for new accounts for an almost free deployment of AI models.

Some of our AI products -
Or

#kubernetes #compute #engine #google #cloud #free #run #artifact #registery #trial #googlecloud
Рекомендации по теме
Комментарии
Автор

Bro Please provide a tutorial for uploading a docker image to the artifact registry repository. I'm confused about how to do it, hopefully next time you will be given a tutorial for this, thank you

MuhammadDzakiFakhrezi
Автор

Thanks bro ! What about pricing ? For example on a cold gpu.

beethoven
Автор

Hi, how did you uploaded docker image there in artifactory registry

i have downloaded 5gb GGUF file. and kept in models and all i have done but i am not able to dockerize locally as some certificate issues
github also supports only 2gb file

lakanavarapunagamanikantaa
Автор

hey i need a help to deploy llama model, can you please help me

MahadevAjagalla