This project provides an automatic lexical analyzer generator that processes lists of tokens defined by regular expressions. It is designed to facilitate the parsing and tokenization of strings according to user-defined patterns, making it a versatile tool for compiler construction, data parsing, and other applications requiring lexical analysis.
- Token Definition Parsing: Parse a list of token definitions provided in a specific format, including token names and their corresponding regular expressions.
- Input String Lexical Analysis: Perform lexical analysis on a given input string, breaking it down into a sequence of token-lexeme pairs based on the provided token definitions.
- Error Handling: Identify and report various types of errors, including syntax errors in the input, duplicate token names, and regular expressions that generate the empty string.
The input to the program consists of two parts:
- A list of token definitions, each comprising a token name and a token description (a regular expression), with definitions separated by commas and the list terminated by a hash (#) symbol.
- An input string composed of letters, digits, and space characters.
Example:
token1 reg_exp1, token2 reg_exp2, ... tokenN reg_expN #
The program outputs one of the following based on the input:
- A sequence of tokens and their corresponding lexemes if the input is correctly formatted and matches the token definitions.
- An error message if there are syntax errors, duplicate token names, or if a token's regular expression can generate the empty string.
Compile the program using GCC with C++11 support:
g++ -std=c++11 your_program.cpp -o lexer

Run the program and redirect input from a file:
./lexer < input_file.txt