Understanding Computational Requirements of
Preservation and Reconstruction
1. Data Integration and Information Gathering about Decision Processes Using Geospatial Electronic Records
Our work addresses the tradeoffs of electronic information preservation in terms of file format, data volumes and
computational requirements. We have evaluated storage and retrieval efficiency of boundary data representations for
LLS, TIGER and DLG data structures.
2. Simulation Environment for Understanding Computational Requirements of Preservation and Reconstruction
We focused on evaluating the information value, data volumes and computational
requirements during decision making processes when preservation is our main objective.
The problem is stated as information gathering about decision processes using geospatial electronic records and
described in more details below.
The government makes a large number of high-confidence decisions using geospatial electronic records.
Decision makers might process maps and photographs called raster data, vector data that represent linear
features like county boundaries or streams, and statistics in tabular form to arrive to a decision that affects
the lives of many citizens. The problem is to document, preserve, and reconstruct the processes later, often years
after the initial decision has been made.
While any government decision process is complicated on its own, tracking the analysis of geospatial electronic records
supporting government decisions adds another layer of complexity. We must understand the cost of information preservation
and the value of preserved information. A team from the National Center for Supercomputing Applications works with the
National Archives and Records Administration to provide software tool (Image Provenance to Learn -
IP2Learn) for:
- simulating complicated high-confidence
decision scenarios,
- preserving the gathered information in temporally sustainable data containers, and
- reconstructing high-assurance decision making processes.