PREMIS - Preservation Metadata: Implementation Strategies

DAITSS: Dark Archive in the
Sunshine State
Priscilla Caplan,
Florida Center for Library Automation
DCC Workshop on Long-term Curation
within Digital Repositories
Cambridge, July 2005
DAITSS: Dark Archive in the Sunshine State
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
State Universities
FCLA
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
Designed as a “dark archive”
Preservation repository functions only
Based on OAIS functional architecture
PREMIS conformance (Partly)
Format migration and normalization
Redundant redundancy
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
Designed as a “dark archive”
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
No real-time online access
No access to external users
Dissemination by request
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
Preservation repository functions only
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
Documentation
Understandability
Authentication
Authenticity
Format strategies
Renderability
Media
management
Secure
storage
Description
Capture
Selection
Viability
Integrity
Identity
Availability
Preservation Pyramid
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
Documentation
Understandability
Authentication
Authenticity
Format strategies
Renderability
Media
management
Secure
storage
Description
Capture
Selection
Viability
Integrity
Identity
Availability
Preservation Pyramid
DCC Workshop on Long-Term Curation, July 2005
DAITSS
DAITSS: Dark Archive in the Sunshine State
Based on OAIS functional architecture
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
OAIS Functional Architecture
DCC Workshop on Long-Term Curation, July 2005
DAITSS Functional Architecture
Reporting
L
L
I
I
Mgmt
DB
B
R
A
R
Y
SIP
Ingest
B
Access
AIP
Storage
management
DIP
R
A
R
Y
DAITSS: Dark Archive in the Sunshine State
PREMIS conformance (partly)
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
PREMIS DATA MODEL
Intellectual
Entities
Rights
OBJECTS:
Representations
Files
Bitstreams
Agents
Events
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
DAITSS DATA MODEL
Intellectual
Entities
OBJECTS:
Representations
Files
Bitstreams
Events
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
Data File
MarkupFile
XML
TIFFFile
TextFile
SGML
PDFFile
DTD
Bitstream
Audio
Image
JPEGImage
Text
TIFFImage
DCC Workshop on Long-Term Curation, July 2005
Video
DAITSS: Dark Archive in the Sunshine State
Format migration and normalization
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
Format treatment
 Treatment based on
background reports and
action plans
 Recognized formats can
get full preservation
treatment
 Files are normalized on
Ingest if possible
 Files are migrated on
Ingest if necessary
 Files can be migrated by
Disseminating and reIngesting









In


AIFF 1.3
AIFF-C 1.0
JFIF 1.02
PDF 1.2 – 1.6
Plain text
TIFF 5.0, 6.0
WAVE
XML 1.0
XML DTD 1.0
process
JPEG2000
AVI
• MPEG, pcm
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
Redundant redundancy
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State





Creating multiple master copies of files
Calculating two message digests
Storing metadata as XML and in RDBMs
Normalizing files when possible
Always retaining original packages as submitted
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
Next steps for DAITSS
 Finish programming
• Dissemination
• Forward migration
 Go into full production in the FCLA Digital Archive
 Implement at a few partner institutions
 Release as Open Source Software
 Continue adding support for more formats
 Make fully PREMIS-compliant
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
Next steps for the FCLA Digital Archive
 Implement DAITSS in full production
• Retrospective
• Prospective
 Investigate repository certification
 Test exchange of packages with other
repositories
 Collect cost data and model cost recovery
DCC Workshop on Long-Term Curation, July 2005
DAITSS: Dark Archive in the Sunshine State
For more information
http://www.fcla.edu/digitalArchive/
[email protected]
DCC Workshop on Long-Term Curation, July 2005