Identify Stocks on Reddit with SpaCy (NER in Python)

preview_player
Показать описание
We will learn how to process unstructured text data from Reddit and extract organization names so that any further analysis is automatically classified and results assigned to the correct stocks.

Organizations are mentioned in each subreddit in a variety of formats. Typically we will find two formats:

- Organization name, eg Tesla/Tesla Motors
- Ticker symbol, eg TSLA, tsla, or $TSLA

So, we need a reasonable competent NER process to accurately classify our data.

🤖 70% Discount on the NLP With Transformers in Python course:

Thanks for watching!
Рекомендации по теме
Комментарии
Автор

One of the best Youtube video. Thanks James!

AlgoTradingX
Автор

What's the advantage of using spacy as oppose to having a csv of ticker names and comparing it to the data scraped from the Reddit API? Is spacy faster and/or efficient?

egomalego