The Best Way to do Topic Modeling in Python - Top2Vec Introduction and Tutorial

preview_player
Показать описание
Join this channel to get access to perks:

If you enjoy this video, please subscribe.

If there's a specific video you would like to see or a tutorial series, let me know in the comments and I will try and make it.

You can follow me at:
Рекомендации по теме
Комментарии
Автор

Thank you very much for this video man, Is there a way to use pyLDAvis visualizations with top2vec?

abasisadegh
Автор

Your killing it lately with these videos. Keep up the great work.

justinhuang
Автор

hi! great video. is the dataset publicly available? if yes, could you please link it in the description?

pratikaraut
Автор

This is such a great find! What I am wondering is: Can you train a BERT sentencetransformer on a large set of documents spanning several projects, then have top2vec use these embeddings to make a topic model for each project (so basically, for each subset of the larger corpus)?

sjoerdbraaksma
Автор

can you show how to compare it to lda with topic information gain? or coherence score? something i’m curious to see

dankchan
Автор

Dude, this video is awesome. Breaking things down seems to be your super power.... 👌

Adrian_Marmy
Автор

I am a new subscriber and this .. was .. simply .. great! I wish there were more Top2Vec videos (ranging from beginner to advanced) . Keep up the excellent work. *hint* *hint* 🙂

dankchan
Автор

How do you filter stop words and how does this compare to Bartopic

RedCloudServices
Автор

Hi there, how would this work if there's multiple topics tagged to one line? Is it all mutually exclusive?

lukechen
Автор

Nice video. But when I listed the words for each topic it shows stop words only - isn't it supposed to remove them in preprocessing stage?

cuneyttyler
Автор

Great video! Do you know if top2vec has options for when you have a dataset too large to fit into RAM? I have a dataset that is something like 9gb of text that I've been trying to topic model with different methods, so I'd be curious to try this out. But I probably can't just load the whole thing into a list and pass it in

fetchthebattleaxe
Автор

Great video, very clear!
Do you know how Top2Vec deals with outliers? there is no 'outlier topic' at the end and all the documents seem to be assigned a topic. (I have BERTopic in mind where there is a -1 topic with the outliers)

juanmanuelaguiar
Автор

4:20
which is it!?
"Each word in that document, type, all th the items of that vector, all the documents.."

jordoobodi
Автор

I'm getting this error:
from top2vec import Top2Vec
ImportError: cannot import name 'Top2Vec' from partially initialized module 'top2vec' (most likely due to a circular import)

any ideas are appreciated.

I'm using an M1 Mac

tonyberber
Автор

Hi sir I checked out your NER Playlist and had a doubt . How can we calculate accuracy of a ner model ?

JayShankarpure
Автор

so do you suggest working with Top2Vec rather than LDA? I mean do you think doing those manual changes in implementing LDA and data preprocessing worth it? or let's stick to Top2Vec. by the way your videos are awesome and I am really interested to go deep into Topic Modeling.

sinabaghaei
Автор

How did you force the code to run on CPU?

patrykkoakowski
Автор

Great vid, as always. I'm interested in creating my own WordNet dataset, any ideas where I should start?

kosemekars
Автор

Thanks for a great video ! I installed top2vec and tried importing it it. I get following error 'No module named Any ideas

jshmo
Автор

Thanks, wish you my best to your channel, and CONGRATULATIONS,
How can we evaluate the topic modeling algo like top2vec or BerTopic
Thanks in advanced

amrmoursi
visit shbcf.ru