Ingest Procedure: Outline


Ingest Procedure: Outline Version

Metadata from raw to ready-to-search


Overview: Metadata will arrive in ADL's hands in varying formats, styles, types, and degrees of cleanliness. We seek to encourage participants to format metadata before we receive it. Even so, most metadata will need some transformation before it is ready to be inserted into the Catalog. This document attempts to outline the process of ingesting metadata.

1. Metadata, solicited or unsolicited, will come to ADL. We will actively encourage participants to format and structure their data so that it arrives in relatively good working order. When appropriate, ADL will provide forms and explanations to participants. Perform a preliminary crosswalk on the metadata.

2. Whether metadata comes to ADL in an MS Access form or in plain ASCII files, ADL will parse and further organize it. If necessary (that is, if the participant did not or could not enter metadata in the proper format, e.g., dates), run calculations on the data, format it with the correct delimiters, and organize it into the proper fields. Revisit the original crosswalk and update it to reflect the metadata in hand.

3. After formatting the metadata as much as possible, ADL will place it into a raw table structure in the ingest database. Right now, that raw database is called "ingest." At this point, the metadata may need additional manipulations, transformations, or corrections.

4. Data checking will now take place. Q/A the data again, this time inside the tables. When the metadata reaches a "clean" state, place it in the s_schema tables within the "ingest" database. At this point, the metadata should be formatted, correct, free of errors, and consistent with the field definitions. If it is not, return to data checking (Q/A) until the metadata is clean.

5. When the metadata is ready, distribute it from "ingest" into the "prod" database.

6. Document the above steps.

7. Back up databases regularly onto tape.
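
The parsing in step 2 and the data checking in step 4 could be sketched roughly as follows. This is a minimal illustration in Python, not ADL's actual tooling: the field names, length limits, pipe delimiter, and accepted date layouts are all assumptions made up for the example.

```python
from datetime import datetime

# Hypothetical field definitions: name -> (required, max_length).
# These are illustrative only, not the real s_schema definitions.
FIELD_DEFS = {
    "title": (True, 80),
    "date": (True, 10),
    "format": (False, 20),
}

# Date layouts a participant might plausibly send (assumed).
DATE_LAYOUTS = ("%Y-%m-%d", "%m/%d/%Y")


def normalize_date(text):
    """Return the date as YYYY-MM-DD, or None if no layout matches."""
    for layout in DATE_LAYOUTS:
        try:
            return datetime.strptime(text.strip(), layout).strftime("%Y-%m-%d")
        except ValueError:
            continue
    return None


def parse_record(line, delimiter="|"):
    """Split one delimited line into named, whitespace-trimmed fields."""
    values = [v.strip() for v in line.split(delimiter)]
    record = dict(zip(FIELD_DEFS, values))
    if record.get("date"):
        # Keep the original text when it cannot be normalized,
        # so the Q/A pass can report it instead of losing it.
        record["date"] = normalize_date(record["date"]) or record["date"]
    return record


def check_record(record):
    """Return a list of Q/A problems; an empty list means the record is clean."""
    problems = []
    for name, (required, max_length) in FIELD_DEFS.items():
        value = record.get(name, "")
        if required and not value:
            problems.append(f"{name}: missing")
        elif len(value) > max_length:
            problems.append(f"{name}: longer than {max_length}")
    if record.get("date") and normalize_date(record["date"]) is None:
        problems.append("date: unrecognized format")
    return problems
```

In this sketch, a record would move on to the clean tables only when check_record returns an empty list; anything else goes back for another round of correction, mirroring the Q/A loop in step 4.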


Last modified on 1996-10-16 at 19:54 GMT by the Alexandria Web Team