Contents
ToggleThompson Construction
Thompson's algorithm or Thompson's construction makes it possible to construct a automaton non-deterministic with epsilon-transition from a regular expression. The automaton is built recursively from basic patterns:
Unit elements like ∅, ε and a are recognized by
We consider the operators of the regular expression R according to the decreasing order of exteriority. The outermost operator cuts R into R1 | R2 , R1.R2 or R1* which are recognized by
Here is an example of construction with the expression (a | b) (a ∗ | ba ∗ | b ∗) ∗
We start the recursion by creating (A)(B) – the content is only indicative, we will see how to fill A and B as we go.
Then by placing the star on (B)* – here we take the opportunity to perform the OR function in (A) = (a|b).
The next step is to do the OR function in (B) = (C | D | E)
The process contained by descending recursively until obtaining the final Thompson automaton.
The latter should then be determined and minimized for better reading.
Some simple examples
- 01∗
- (0 + 1)01
- 00(0 + 1)*