The PDF Advanced Extraction Engine (PAEE) is a rule-based document processing system designed to automatically extract and structure information from regulatory and policy documents.
Currently, it is being tested on Interstate Oil & Gas Compact Commission (IOGCC) policy documents.
- Report a bug: See Documentation/user-guide/reporting-issues.rst
- Get the latest build: Coming Soon
- Build the system: See Documentation/user-guide/quickly-build-PAEE-on-linux.rst
All users should review:
- Building requirements: Documentation/process/changes.rst
- License information: See COPYING
- Julian E. Tong — Semiconductor Engineer
PAEE is released under the GNU General Public License Version 2 (GPLv2).
The GPLv2 includes a warranty disclaimer stating that the software is provided "AS IS", without warranty of any kind, to the extent permitted by law. Users assume all risks regarding quality, performance, and costs of repair.
Any distributor must:
- Retain all original copyright notices.
- Include a complete copy of the GPLv2 license text.
- Provide the corresponding source code when distributing binaries, or include a written offer to provide the source code.
- Clearly document any modifications made to the software.
- Ensure that any redistributed version remains licensed under GPLv2.
Charged Technology Solutions LLC maintains a separate commercial license for this build, providing additional terms, enterprise support, and extended document-processing capabilities.
The commercial license is independent of the GPLv2 version and does not restrict the rights granted under GPLv2.
- Live Processing
- Graph Visualization
- GPU-Based Highlighting
- Export Pipeline (.txt, .xlsx, .sql)
- Split-Screen Layout
- Performance Dashboard