filmov
tv
Identify Stocks on Reddit with SpaCy (NER in Python)

Показать описание
We will learn how to process unstructured text data from Reddit and extract organization names so that any further analysis is automatically classified and results assigned to the correct stocks.
Organizations are mentioned in each subreddit in a variety of formats. Typically we will find two formats:
- Organization name, eg Tesla/Tesla Motors
- Ticker symbol, eg TSLA, tsla, or $TSLA
So, we need a reasonable competent NER process to accurately classify our data.
🤖 70% Discount on the NLP With Transformers in Python course:
Thanks for watching!
Organizations are mentioned in each subreddit in a variety of formats. Typically we will find two formats:
- Organization name, eg Tesla/Tesla Motors
- Ticker symbol, eg TSLA, tsla, or $TSLA
So, we need a reasonable competent NER process to accurately classify our data.
🤖 70% Discount on the NLP With Transformers in Python course:
Thanks for watching!
Комментарии