Python Tutorial: Text Munging with regular expressions

Показать описание

---
Regular expressions are a powerful tool for processing text.

We will use them for matching messages against known patterns,
for extracting key phrases, and for transforming sentences grammatically. These are the core pieces we need to create our ELIZA style bot.

Much of the magic of the ELIZA system relied on giving the _impression_ that the bot
had understood you, even though the underlying logic was extremely simple.
For example, asking ELIZA "do you remember when you ate strawberries in the garden?",
she would respond: "How could I forget when I ate strawberries in the garden?".

Part of what makes this example so compelling is the subject. We are asking about _memories_,
which we associate with our conscious minds and our sense of self. The memory itself,
of eating strawberries in the garden, invokes powerful emotions. But if we pick apart how the
response is generated, we see that it's actually quite simple.

To build an ELIZA-like system you need a few key components. The first is a simple
pattern matcher. This consist of a set of rules for matching user messages, like
"do you remember x"

To match patterns we use a technology called *regular expressions*, to use these in python we `import re`.
Regular expressions are a way to define patterns of characters, and then seeing if those patterns occur in a string.

In regular expressions, the dot character is special, and matches *any* character. The asterisk means "match 0 or more occurrences of this pattern", so "dot star" is basically a catch-all, it says match any string of characters.

We can check whether a message matches a pattern by calling re dot search brackets pattern comma message. This returns a match object.
If the string doesn't match the pattern, the match object will be `None`, so we can check if the string matches using a simple if statement.

Adding parentheses in the pattern string defines a `group`. A group is just a substring that we can retrieve after matching the string against the pattern.

We use the match object's `group` method to retrieve the parts of the string that matched. The default group, with index 0, is the whole string. The group with index one is the group we defined by including the parentheses in the pattern.

To make responses grammatically coherent, we will want to transform the extracted phrases from
first to second person and vice versa. In English, conjugating verbs is easy, and simply swapping
"I" and "you", "my" and "your" works in most cases.

For example, take the sentence "I walk my dog".
"You walk your dog".

The final step is to combine these logical pieces together.
We start with a pattern and a message. We extract the key phrase by creating a match object using pattern dot search, and then use the group method to extract the string represented by the parentheses.

We then choose a response appropriate to this pattern, and swap the pronouns so that the phrase makes sense when the bot says it.
We then insert the extracted phrase into the response, to partially echo back what the user talked about, giving the illusion that the bot has understood the question and remembers this experience.

Now it's your turn to build your own eliza style chatbot.

#Python #PythonTutorial #DataCamp #Chatbots #Python #TextMunging

Рекомендации по теме

Комментарии

i want to convert: Bowmore 46 year old (distilled 1964), 42.9%
into : Bowmore 46 year old 42.9%
how to go for it i am facing trouble because of () and the srtring in it.
treating (string ) is very confsuing.
please help.

vishumudgal

Python Tutorial: Text Munging with regular expressions

Python Tutorial: Text Munging with regular expressions

Importing and Munging Plain Text Data in Python

Python Tutorial for Beginners 2: Strings - Working with Textual Data

Python Tutorial - Input Manipulation and Text Strings

Python Tutorial: File Objects - Reading and Writing to Files

Python string manipulation for beginners

Python Tutorial for Beginners 5: Dictionaries - Working with Key-Value Pairs

Python - String Manipulation

Python course tutorials live streaming 10 hours session 311

Master String Manipulation in Python | Python for Beginners | #lecture8

Python Curses Tutorial #1 - Make GOOD Looking Terminal Apps!

Image Manipulation

Python Tutorial: re Module - How to Write and Match Regular Expressions (Regex)

#65 Python Tutorial for Beginners | File handling

Python String: Essential Techniques for Effective Text Manipulation

Text Files in Python || Python Tutorial || Learn Python Programming

Python CP Lesson 2: String Manipulation

Python Tutorial #14: Basic Text File Manipulation: Computing Tutor

Quick Python Tips: String Manipulation Explained

Image Processing with OpenCV and Python

Exploratory Data Analysis with Pandas Python

Data Analysis with Python - Full Course for Beginners (Numpy, Pandas, Matplotlib, Seaborn)

Python Data Munging

Python for Coding Interviews - Everything you need to Know