DAS
Packaging Information
Binds Content Information and Preservation Description Information into an AIP (Directory paths, file names, METS, etc.)
de jure
By right or rightful claim/entitlement
Virus scan
Can include a quarantine period
Administration
Central hub for internal and external interactions Oversees other five services Interfaces with producers, consumers, and management
Semantic Information
Clarifies the meaning or appropriate interpretation of the Content Data Object
Content Information
Composed of Content Data Object and Representation Information
Forensic Toolkit
Computer forensics software that can scan a hard drive for things like deleted files, PII, and malware
CCSDS
Consultative Committee for Space Data Systems
Content of AIP
Content Information Preservation Descriptive Information Packaging Information
DANS
Data Archiving and Network Services
DSA
Data Seal of Approval
Structure Information
Describes the format of the digital object
DACS
Describing Archives a Content Standard A content standard for populating finding aids
Format identification and validaiton
Determine types of files
DRAMBORA
Digital Repository Audit Method Based on Risk Assessment Repository audit tool Methodology for self-assessment Encourages organizations to establish a comprehensive self-awareness of their objectives, activities, and assets before identifying, assessing, and managing the risks implicit within their organization Developed by Digital Curation Centre (DCC) and Digital Preservation Europe (DPE)
DIP
Dissemination Information Package What is given to the user May be in different format or have less metadata than AIP
Access Rights Information
Documents any conditions or restrictions pertaining to preservation or access Can include rights enforcement mechanisms (License terms, access permissions, preservation terms and conditions, etc.)
Access tools
EAD finding aids, MARC records
EAD
Encoded Archival Description- metadata structure for archival records An encoding standard for detailed finding aids
Fixity Information
Ensures that the Content Information has not been altered in an undocumented way (Checksums, digital signatures, digital watermarks, etc.)
FOXML
Fedora Object XML Encoding schema for packaging AIPs
BagIt
File packaging specification •Creates a structured directory •Captures data about what's inside the package •Lists entire structure and checksum
ISAD(G)
General International Standard Archival Description Standard to provided general guidance for the preparation of archival descriptions A structural standard for constructing finding aids Should be used in conjunction with existing national standards or as the basis for development of national standards Created by International Council on Archives (ICA) Sub-Commitee on Descriptive Standards (CBPS)
Provenance Information
History of the Content Information (Creation, alterations, chain of custody, preservation actions and outcomes)
de facto
In fact, in effect
Donor agreements
Include information about recovered deleted files
Inter-archive associations
Independent archives, cooperating archives, federated archives, and shared functional areas
Representation Information
Information necessary to render and understand the bit sequences of the CDO Composed of Structure Information, Semantic Information, and Other Representation Information
Content Data Object
Information that is the focus of preservation
Components of OAIS Functional Model
Ingest, Preservation Planning, Data Management, Archival Storage, Administration, and Access
Access
Intermediary between users and Archival Storage and Data Management Implement security/access control External interface with consumers
ISO
International Organization for Standardization
InterPARES
International Research on Permanent Authentic Records in Electronic Systems Documented requirements for authenticity
Representation Networks
Layers of Representation Information (A to understand B, B to understand C, C to understand D, and so on)
Archival Storage
Long-term storage and maintenance of materials Safeguard mechanisms Retrieval by request of consumer No external interface
Data Management
Maintains databases of descriptive metadata Data about internal system operations Queries and reports from data Supports search and retrieval No external interface
Checksum generation
Make sure that changes aren't made
Preservation Planning
Mapping out strategy Monitoring external environment for developments Developing recommendations based on developments No external interface
METS
Metadata Encoding and Transmission Standard XML-based Supports encoding/ packaging of metadata and object Self-descriptive header, descriptive metadata, administrative metadata, list of files, structural map of components, relationships between components, list of 'behaviors' Maintained by LoC
Transfer guidelines
Methods and documentation
MPLP
More Product Less Process
NARA
National Archives and Records Administration
NDIIPP
National Digital Information Infrastructure and Preservation Program One of the creators of BagIt Program of LoC
ISO Standard 14721
OAIS Reference Model (2002)
OCLC
Online Computer Library Center, Inc.
OAIS
Open Archival Information System
Archivists' Toolkit
Open source archival data management system Result of the collaboration of UC San Diego, NYU, and Five Colleges, Inc.
ArchivesSpace
Open source archives information management application
Common Services
Operating system services, network services, and security services Computing and networking backbone
PII
Personal Identification Information- do not include in DIP
PREMIS
Preservation Metadata: Implementation Strategies Data dictionary Implementation and technology neutral
PAIMAS
Producer-Archive Interface Methodology Abstract Standard ISO 20652:2006 Standard that covers the first stages of the ingest process defined by OAIS
PAIS
Producer-Archive Interface Standard
Key point for taking physical control of electronic records
Protect authenticity
Preservation Description Information (PDI)
Reference Information Context Information Provenance Information Fixity Information Access Rights Information
Context Information
Relationship to other Content Information objects (Thematically, as versions in alternative formats, etc.)
RXP
Repository eXchange Package Based on PREMIS and METS Supports extraction and packaging of metadata for transfer to another repository
RLG
Research Libraries Group
Hex editors
Show files as individual byte representations in hexadecimal notation and ASCII Can be used to discover strings of text that do not properly render
Access copies
Smaller files for access by researchers- some info can/must be excluded
Archon
Software that generates EAD and MARC records and publishes archival descriptive information
DSpace
Store, organize, and manage repository content in the cloud
SIP
Submission Information Package What is submitted to the OAIS by the producer The digital object and any metadata created by producer
Descriptive Information
Supports discovery and retrieval via finding aids (Dublin Core/ EAD metadata record, etc.)
ExifTool
Tool for extracting technical metadata
Apache Tika, ExifTool, and MediaInfo
Tools for extracting technical metadata (characterization)
FITS, FIDO, DROID, and Siegfried
Tools for extracting technical metadata (identification)
Quick View Plus and IrfanView
Tools for looking at the content of files without opening them
Treesize Pro and Disk Analyzer Pro
Tools for viewing high level info •Size of files on disk •Proportion of files types
TIPR
Towards Interoperable Preservation Repositories Developed RXP
Offline IMAP
Transfers emails
TDR
Trusted Digital Repository Standard developed by RLG and OCLC to define characteristics of a sustainable digital archive that could serve large-scale, heterogeneous collections
TRAC
Trustworthy Repositories Audit & Certification: Criteria & Checklist Set of criteria to facilitate the certification of digital repositories capable of reliably storing, migrating, and providing access to digital collections Developed by RLG and NARA
Reference Information
Uniquely identifies the Content Information (Accession number, ISBN, etc.)
Forensic disk imaging
Used to copy entire disk or drive
Working Copy
Used to ensure that no inadvertent changes or accidental deletions are made to the original during collection analysis
Archive-It
Web harvesting tool
Fixity
bit-level integrity
Write blockers
physical devices used to prevent inadvertent writing to the device
rsync
server to server transfer
Processing Steps (7)
•Accessioning records •Gathering contextual information about the records •Performing a conservation assessment •Establishing an arrangement scheme •Arranging the records physically, if necessary •Describing the records •Creating access tools
Contextual Information
•Background research on person(s), family(ies), or organization(s) responsible for creating the records •List events or activities reflected in the records •Identify record-keeping practices revealed by the records •Describe the functions and activities that led to the generation of the records
COPTR
•Community Owned Digital Preservation Tool Registry •Wiki to help you find the right tools
Arrangement and description steps (9)
•Create processing plan •Gather contextual information about the materials (iterative process) •Examine the content (find sensitive information) •Arrange the materials intellectually •Arrange the materials physically (if necessary) •Describe the materials (iterative process) •Extract technical metadata •Create access copies (dependent on institution's infrastructure) •Move materials into a preservation environment (dependent on institution's infrastructure)
Technical metadata
•Describes technical processes used to produce, or required to use a digital object •Can be extracted at various points during processing
Digital Package
•Digital object •Metadata describing digital object
Archivematica METS
•Extracts technical metadata •Automatically generates unique ids for materials •Used to package AIPs
Validation
•File formats have technical specifications that need to be met to be considered valid •File can be invalid but render just fine
BitCurator
•Includes software-based write-blocking •Creates authentic copies of content through disk imaging and cryptographic hashing •Mounts forensically packaged disk images to view & export contents •Captures file system metadata •Establishes trustworthy chains of custody through documentation of curatorial actions (log files, PREMIS records) •Generates reports that characterize the contents of disks and directories •Identifies and documents duplicate files •Discovers and exposes associated contextual information •Identifies PII •Exports contents of disks and directories for AIP and DIP •Specifically designed for archivists
Risks in copying files
•Incomplete/incorrect transfers •Missing files •Changes in file system metadata, such as date last accessed
Identification
•Look at structure of bytes in a file and match to patterns in specific file formats •Can't determine file type just from extension
Processing Plan
•Overview •Archival Description •Appraisal •Arrangement •Preservation
Accessioning Steps
•Taking physical and administrative control of the records •Performing a conservation assessment •Using case files to manage information about the accession •Identifying the arrangement and description priority
Conservation assessment
•Virus scan •Checksum generation •Format identification and validation
Ingest
Accepting custody to preparing for archival retention External interface with producer
ASCII
American Standard Code for Information Interchange
MARC21
An encoding standard for collection-level finding aids
AIC
Archival Information Collection Content and metadata for multiple objects (AIUs) grouped into a collection Metadata for each component AIU and for AIC Organizational device in the form of a conceptual layer
AIP
Archival Information Package What is stored and preserved by OAIS The digital object and all metadata submitted by producer and created by archive
AIU
Archival Information Unit Content and metadata for single object (could be composed of multiple physical or digital parts)
