Skip to content

Dread2/PAEE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

PDF Advanced Extraction Engine (PAEE)

The PDF Advanced Extraction Engine (PAEE) is a rule-based document processing system designed to automatically extract and structure information from regulatory and policy documents.

Currently, it is being tested on Interstate Oil & Gas Compact Commission (IOGCC) policy documents.

Quick Start

  • Report a bug: See Documentation/user-guide/reporting-issues.rst
  • Get the latest build: Coming Soon
  • Build the system: See Documentation/user-guide/quickly-build-PAEE-on-linux.rst

Essential Documentation

All users should review:

  • Building requirements: Documentation/process/changes.rst
  • License information: See COPYING

Maintainers

  • Julian E. Tong — Semiconductor Engineer

IMPORTANT

PAEE is released under the GNU General Public License Version 2 (GPLv2).

The GPLv2 includes a warranty disclaimer stating that the software is provided "AS IS", without warranty of any kind, to the extent permitted by law. Users assume all risks regarding quality, performance, and costs of repair.

Distributor Requirements Under GPLv2

Any distributor must:

  • Retain all original copyright notices.
  • Include a complete copy of the GPLv2 license text.
  • Provide the corresponding source code when distributing binaries, or include a written offer to provide the source code.
  • Clearly document any modifications made to the software.
  • Ensure that any redistributed version remains licensed under GPLv2.

Commercial License Notice

Charged Technology Solutions LLC maintains a separate commercial license for this build, providing additional terms, enterprise support, and extended document-processing capabilities.

The commercial license is independent of the GPLv2 version and does not restrict the rights granted under GPLv2.

Features

  • Live Processing
  • Graph Visualization
  • GPU-Based Highlighting
  • Export Pipeline (.txt, .xlsx, .sql)
  • Split-Screen Layout
  • Performance Dashboard

About

PAEE™ - rule-based document processing system designed to automatically extract and structure information.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors