Supercharge eCommerce Search: OpenAI's CLIP, BM25, and Python

Показать описание

We build a multi-modal hybrid search engine for ecommerce using OpenAI's CLIP, BM25, Pinecone vector database, and Python. The search engine processes text and image-based queries and can produce better results than traditional methods.

The search engine allows users to search and retrieve data using both text and visual queries, which is especially useful in e-commerce domains where users have a range of search queries, from specific product searches to image-based searches for related items.

By using CLIP and BM25, the search engine can process both text and image-based queries, providing users with a comprehensive search experience. Additionally, Pinecone vector database and Python allow for easy indexing, storage, and retrieval of data, making it possible to handle large volumes of data in real time.

📌 Example notebook:

🎙️ AI Dev Studio:

👾 Discord:

🤖 70% Discount on the NLP With Transformers in Python course:

🎉 Subscribe for Article and Video Updates!

00:00 Multi-modal hybrid search
01:05 Multi-modal hybrid search in e-commerce
05:14 How do we construct multi-modal embeddings
07:05 Difference between sparse and dense vectors
09:43 E-commerce search in Python
11:11 Connect to Pinecone vector db
12:04 Creating a Pinecone index
13:45 Data preparation
16:32 Creating BM25 sparse vectors
19:33 Creating dense vectors with sentence transformers
20:26 Indexing everything in Pinecone
24:41 Making hybrid queries
26:01 Mixing dense vs sparse with alpha
32:11 Adding product metadata filtering
34:13 Final thoughts on search

Рекомендации по теме

Комментарии

A demo of what we are about to learn in the beginning of the video would greatly help an infant such as myself in this field.

yamani

This channel is shockingly good for its subscriber count. Lucky I found you. Thanks!

iknowsolittle

very nice, the sparse and dense vector mix can apply to many sceanrios.

adamswang

This video is great! Instead of running on Colab, could you make a video that shows an up and down connection from an html front end to the Pinecone database, specifically uploading a PDF, vectoring it, querying, and displaying the results back through html? I also emailed you for some consulting work on a project. Thanks for the videos!

JasonMelanconEsq

I'm using s1 pod and trying to create an hybrid index with 10k vectors.
Will there any pricing difference between using a dense vector index alone and using a dense+sparse vector index from pinecone side?

gowthamkrish

This demo is fascinating. I would love to learn what technology to add to extend the demo, to maintain context between queries.

chrismaley

Amazing content as always. I was wondering, is it recommended to use embeddings such as the ones form Openai or cohere instead of BM25?

JuanLopez-ocyv

Hello James, great content. I have 1 query. How do we handle the query "show me blue jeans under $50", this "under $50" value while building a search engine. If you can guide me, would much appreciate it, thank you.

hemanshupan

Is there a reason why you didn't use CLIP to generate both image and text embeddings?

JohnKing

Hi thanks for sharing the video it is really useful. For this type of usage, other the Pinecone are there any other vector DB that run offline on local machine?

atomhero

Supercharge eCommerce Search: OpenAI's CLIP, BM25, and Python

Fast intro to multi-modal ML with OpenAI's CLIP

OpenAI CLIP Explained | Multi-modal ML

Text to Image Search AI App using Haystack and CLIP

How To Build A Personal Search Engine #coding #programming #computerscience

Using Vectors and @SingleStore to create an Image Matching and Search application

LLM use case: product search and filtering

Wie zit er achter OpenAI en ChatGPT?

SPLADE: the first search model to beat BM25

E251: HOW AI IS DRIVING RADICAL CHANGE IN ECOMMERCE SEARCH, MERCHANDISING & PERSONALISATION

Waymark AI: Creative on tap

Image search - Daily Elastic Byte S05E10

Build Semantic-Search with Elastic search and BERT vector embeddings. ( From scratch )

Deep Semantic Product Search

Building Multi-Modal Search with Vector Databases

The Only ChatGPT, Leonardo AI Prompt You'll Need | Copy & Paste

How to Use OpenAI Whisper to Fix YouTube Search

Cohere AI's LLM for Semantic Search in Python

Pinecone's New *Hybrid* Search - the future of search?

3 Vector-based Methods for Similarity Search (TF-IDF, BM25, SBERT)

NER Powered Semantic Search in Python

AI thiết kế ảnh sản phẩm dành cho người kinh doanh online #aiacademy #hocnhanhai #chatgpt #photoroom...

RAG Architecture - Elastic Daily Bytes S05E11

ChatGPT Plugins: Build Your Own in Python!

Haystack US 2021 - Semantic Product Search – Vector Search for E-Commerce - Simon Hughes

Pinecone's New Hybrid Search - the future of search?