Name	Name	Last commit message	Last commit date
parent directory ..
src	src
test	test
LICENSE	LICENSE
README.md	README.md
SimpleRegex.cabal	SimpleRegex.cabal
package.yaml	package.yaml
stack.yaml	stack.yaml

Name

Last commit message

Last commit date

Kleene Regular Expressions, in Haskell

This is literally the definition of a simple string recognizing regular expression in Haskell. It consists of the Reg datatype encompassing the five standard operations of regular expressions and an accept function that takes the expression and a string and returns a Boolean yes/no on recognition or failure. It is a direct implementation of Kleene's algebra:

L[[ε]] = {ε}
L[[a]] = {a}
L[[r · s]] = {u · v | u ∈ L[[r]] and v ∈ L[[s]]}
L[[r | s]] = L[[r]] ∪ L[[s]]
L[[r∗]] = {ε} ∪ L[[r · r*]]

Those equations are for: recognizing an empty string, recognizing a letter, recognizing two expressions in sequence, recognizing two expression alternatives, and the repetition operation.

The accept function has two helper functions that split the string, and all substrings, into all possible substrings such that every possible combination of string and expression are tested, and if the resulting tree of ands (from Sequencing and Repetition) and ors (from Alternation) has at least one complete collection of True from top to bottom then the function returns true.

This generation and comparison of substrings is grossly inefficient; an string of eight 'a's with a* will take 30 seconds on a modern laptop; increase that to twelve and you'll be waiting about an hour. The cost is 2^(n - 1), where n is the length of the string; this is a consequence of the sequencing operation. Sequences aren't just about letters: they could be about anything, including repetition (which itself creates new sequences) and other sequences, and the cost of examining every possible combination of sequencing creates this exponential cost.

It is quite amazing, though, to actually see a straightforward implementation of Kleene's Regular Expressions in code.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Kleene Regular Expressions, in Haskell

FilesExpand file tree

01_SimpleRegex

Directory actions

More options

Directory actions

More options

Latest commit

History

01_SimpleRegex

Folders and files

parent directory

README.md

Kleene Regular Expressions, in Haskell