Knuth Morris Pratt (KMP) String Search Algorithm - tutorial with failure function in Java

Показать описание

This tutorial explains how the Knuth-Morris-Pratt (KMP) pattern matching algorithm works. Animated examples are used to quickly visualize the basic concept. Then the source code of an implementation written in Java is discussed, along with its running time complexity.

There are two parts to the algorithm. The first one creates a "failure function", which is nothing more than an array that holds the longest prefixes as they occur through out the pattern string. The main algorithm uses this prefix length array to calculate how much forward to shift the pattern string.

Source code written in Java:

The implementation is based on the pseudo-code taken from Wikipedia:

FAST PATTERN MATCHING IN STRINGS
DONALD E. KNUTH, JAMES H. MORRIS, JR. AND VAUGHAN R. PRATT

Written and narrated by Andre Violentyev

-----------------------------------------------------------------------------------------------------------------------------------------------
Errata (credit to @Achyut Sapkota):
My animation at 6:04 should have moved not by 2 but by 3 characters. By the way, at that point we'd also increment the pointer in the text to the next character. The source code (see above) is correct - once we get to index zero of the prefixLen array, which stores -1, we increment both the text and the pattern string pointers:
p = prefixLen[p];
if (p 'less than' 0) {
t++;
p++;
}
In the above example once we shift by 3, we essentially go "out of bounds" for the pattern string. The illustrations show that the first character, 'a', is at index 1 of the prefixLen array (not index 0, which is probably what is causing confusion since most programming languages use 0 based indexes for strings). So the formula works, it's just that, once we detect the "out of bounds" condition, we reset the pattern string pointer to the first character, and increment the text pointer by 1. In other words, we shift the pattern string so that its first character aligns with the next character in the text. This is what insures that each character in the text is examined at most twice.

Рекомендации по теме

Комментарии

This is the best explanation of how the prefix array is constructed. Awesome job and thanks!

maridavies

This channel honestly has the best explanation for all the difficult problems. The nlogn LIS implementation was too complex for me to understand, but then I stumbled upon this channel. The same goes for KMP! Amazing work! Really appreciate how you explain it in a simple manner!

mandeepsinghrr

Even a sloppy guy like me would understand this explanation, well done!

I also like how you comunicate :)

sloppyguy

The best explanation of the MIT version of KMP. Good job!!!
I learned the KMP algorithm through the 《Algorithm》which is public by Princeton University. The MIT version of KMP and Princeton's version is the two main version of the KMP algorithm.

xiaoxiaoxiao

Thank you sir
After lot of confusion in understanding this algorithm you made it easy to understand . Hope you make many videos like this.

prasannakumarnaidu

This is far the best explanation and discussion of the KMP, thank you for saving my semester.

junkaze

your explanation method (with the visualization ) and calm way of explaining really helped me understand the algorithm :)

irfanquader

I watched other videos and it finally clicked with this video, thanks. All your other videos are produced very well and great.

ram-ojij

Wow this is the a way better explanation than the GFG article. Thanks

abdalla

Just 1:44 into the video and got the ans I was long been pondering over, Thank you sir

vinayakf

Your videos are the best for learning these algorithms. Much appreciated.

Japopo

This was incredibly helpful for understanding the concepts behind the algorithm, Thank you!

danielsasse

thanks a lot for going through these classics, also your code looks simple and concise!

abelsimon

i am so glad i stumbled onto this channel

juniperandspice

The most efficient one out of those confusing videos

lukelyu

Hi Stable sort!

Please keep posting !!! You have the best videos I've ever seen!!!

AluSoft

Wow Sir. The explanation is really good!! Glad I've found your channel!!

mohamedmusharaf

Very good and illustrative explanation!

Prashantkumar-pnqq

Really appreciated your way of explanation sir

amitkumarprajapati

Hi, I think there is another error at 5:23 in the animation. The shift should be (8-2)=6 instead of (9-3)=6. Although they happen to have the same results, the last 'c' in the pattern is a mismatch so it should not be counted in the matched length. Consider a modified example, the string is "abdxxxabdxxxabcde" and the pattern is "abdxxxabcde". The fist mismatch is at the 'c' in the pattern, and the lps of 'c' is 0. The shift should be the matched length 8 minus the lps of the second 'b' which is 2, which results in 6, rather than (9 - 0) = 9. Do you feel it make sense?

mfrice

Knuth Morris Pratt (KMP) String Search Algorithm - tutorial with failure function in Java

9.1 Knuth-Morris-Pratt KMP String Matching Algorithm

Knuth–Morris–Pratt(KMP) Pattern Matching(Substring search)

Knuth–Morris–Pratt KMP - Find the Index of the First Occurrence in a String - Leetcode 28 - Python...

Visualization of Knuth-Morris-Pratt Algorithm

Knuth-Morris-Pratt Algorithm Visually Explained

Knuth-Morris-Pratt (KMP) algorithm | String Matching Algorithm | Substring Search

Knuth-Morris-Pratt algorithm (KMP) - Inside code

Knuth–Morris–Pratt (KMP) Pattern Matching Substring Search - First Occurrence Of Substring

KMP Algorithm | Searching for Patterns | GeeksforGeeks

Knuth Morris Pratt (KMP) String Search Algorithm - tutorial with failure function in Java

Knuth Morris Pratt Algorithmus - String Matching

Knuth–Morris–Pratt algorithm

Tutorial: The Knuth-Morris-Pratt (KMP) String Matching Algorithm

Knuth–Morris–Pratt(KMP) Pattern Matching(Substring search) Part2

Knuth-Morris-Pratt Algorithm

Knuth-Morris-Pratt KMP String Matching Algorithm

Knuth Morris Patterson (KMP) Pattern Matching Algorithm

Prefix Table Construction||Example-1||Knuth-Morris-Pratt KMP String Matching Algorithm||

Knuth Morris pratt Algorithm

KMP algorithm | Pattern search algorithm | string search algorithm

Knuth Morris Pratt KMP Algorithm | Pattern Search | String Matching | KMP String Matching Algorithm

The Knuth–Morris–Pratt string searching algorithm (KMP algorithm)

Knuth-Morris-Pratt - Pattern Matching

Knuth-Morris-Pratt KMP String Matching Algorithm | Search Pattern | GFG POTD