Python pandas - Text Feature Engineering - How to Extract a Substring

preview_player
Показать описание
How to do text feature engineering in Python pandas on the Titanic Kaggle Competition dataset.

Titanic Kaggle Competition

Regular Expressions in Python Cheat Sheet
Рекомендации по теме
Комментарии
Автор

I have a column of unstructured data which contains comments and feedback. Do you know of a library I can use to extract common keywords from these columns to find patterns? For example:

Comments
1. “Could not purchase due to budget”
2. “Budget wasn’t enough to purchase”
3. “Tool capabilities aren’t there for me”

Target column results from regex function
1. Budget
2. Budget
3. Capabilities

anthony