title	Reading and Assembly
has_children	true
nav_order	2

Overview

World Modelers document management / reading / assembly / HMI workflows

This document describes usage modes that are explicitly supported and documented as workflow templates (W1-5 below). While these templates don’t cover every conceivable combination and usage mode, they are likely to be relevant for typical use cases. Customizations to the ontology and the usage of associated tools cuts across all of these workflows, and can in principle be relevant for any of them, therefore, these are documented separately.

Documentation by Workflow

W1. Reading only

In this usage mode the goal is to perform information extraction from a set of input documents that are manually compiled and brought into a form that one of the reading systems takes as input (e.g., a folder with txt files). This usage mode doesn't involve document management through DART or multi-reader integration/assembly through INDRA, but rather, meaning that users are responsible for their own downstream processing of reader outputs and derived analysis.

Systems used:

Readers: any of Eidos, HUME, or Sofia

W2. Reading + integration/assembly

In this usage mode, the goal is to perform information extraction on a set of input documents and then use INDRA to benefit from some combination of (i) multi-reader integration (ii) knowledge assembly functionalities (iii) standardized representation of reader outputs (iv) standard programmatic APIs for interfacing with readers and their outputs. The user in this mode is again responsible for document gathering and preparing inputs for reading but may use INDRA as a wrapper around calling one or more reading systems. The user is expected to then either use assembled INDRA Statements as objects or in a JSON-serialized form for their own downstream analysis.

Systems used:

Readers: one or more of Eidos, HUME, or Sofia
Integration/assembly: INDRA

W3. Document management + reading + integration/assembly

In this usage mode, the DART system is used for managing documents and using a standardized interface between DART and the reading systems. This avoids having to manually prepare input files for reading. The rest of the workflow follows W2 in that INDRA is used for integration. The user then takes INDRA Statements for further downstream analysis.

Systems used:

Document management: DART
Readers: Eidos, HUME, Sofia
Integration/assembly: INDRA

W4. Document management + reading + integration/assembly + HMI

This usage mode goes beyond W2 or W3 by loading INDRA outputs into Causemos to explore, curate, and derive models from the assembled causal information. However, in this setting, the user is not expecting a service architecture to support incremental reading/assembly during runtime. The usage of DART is technically not required for W4 but it allows linking back to documents and examining their metadata which is advantageous. This workflow can be understood as a one-time run of W2/W3 and then loading results into the HMI as a “static” corpus.

Systems used:

Document management: DART
Readers: Eidos, HUME, Sofia
Integration/assembly: INDRA
HMI: Causemos

W5. Document management + reading + integration/assembly + HMI + BYOD

This workflow builds on W4 and also enables users to add their own documents during runtime through Causemos. This requires DART, one or more readers, and INDRA World to be running as services.

Systems used:

Document management: DART
Readers: Eidos, HUME, Sofia
Integration/assembly: INDRA
HMI: Causemos

Documentation by Component

Readers
- Eidos
  
  Eidos is the machine reading system developed by the CLU lab at University of Arizona.
  
  Workflows: (W1, W2, W3, W4, W5)
- HUME
  
  Hume is BBN's machine reading system that extracts causal relations from text and supports clustering for ontology construction.
  
  Workflows: (W1, W2, W3, W4, W5)
- Sofia
  
  Sofia is a machine reading system developed at CMU that extracts causal relations from text.
  
  Workflows: (W1, W2, W3, W4, W5)
Integration/assembly
- Indra World
  
  INDRA World is a knowledge assembly system that integrates causal relations extracted by multiple reading systems, standardizes their representation, finds ontological relationships between relations, calculates overall confidence, and has a configurable pipeline to process and filter causal knowledge.
  
  Workflows: (W2, W3, W4, W5)
Document management
- DART
  
  The Data Analytics and Reasoning Toolkit (DART) is the data ingestion pipeline for the World Modelers platform.
  
  Workflows: (W3, W4, W5)
HMI (Human-Machine Interface)
- Causemos
  
  Causemos is the main HMI for the World Modelers program, built and maintained by Uncharted Software.
  
  Workflows: (W4, W5)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Overview

World Modelers document management / reading / assembly / HMI workflows

Documentation by Workflow

W1. Reading only

W2. Reading + integration/assembly

W3. Document management + reading + integration/assembly

W4. Document management + reading + integration/assembly + HMI

W5. Document management + reading + integration/assembly + HMI + BYOD

Documentation by Component

FilesExpand file tree

reading-assembly.md

Latest commit

History

reading-assembly.md

File metadata and controls

Overview

World Modelers document management / reading / assembly / HMI workflows

Documentation by Workflow

W1. Reading only

W2. Reading + integration/assembly

W3. Document management + reading + integration/assembly

W4. Document management + reading + integration/assembly + HMI

W5. Document management + reading + integration/assembly + HMI + BYOD

Documentation by Component