A Study on Audiovisual Metadata - DC-2004

A Study on Audiovisual
Metadata
Duan Minglian, Yao Xingxing, Zhang Jiuzhen
Dept. of Lib. and Inf. Science, Peking Univ.
DC2004
[email protected]
1/36
Introduction
DC2004
[email protected]
2/36
Supported by the National Social
Science Foundation of China, A
project entitled Innovative Study on
Audiovisual Metadata and its Retrieval
has been in progress since March 2002,
at the Department of Information
Management, Peking University.
DC2004
[email protected]
3/36
1 Study Design
DC2004
[email protected]
4/36
Three Preparative Studies
First, the status of audiovisual
information resources and their
characteristics were collected by
a web survey and then analyzed
to substantiate our objects.
DC2004
[email protected]
5/36
Three Preparative Studies
Second, general users’ and
managers’
needs
were
examined carefully through a
questionnaire survey.
DC2004
[email protected]
6/36
Three Preparative Studies
Third, existing metadata
methods and projects on
audiovisual resources were
investigated and explored to get
an overview.
DC2004
[email protected]
7/36
2 Characteristics
of Audiovisual
resources
DC2004
[email protected]
8/36
Characteristics of Audiovisual
resources
Large Amounts,
Rapid Increase.
Various Types, Different Formats.
Widely Distributed Collecting
Organizations, Large Holdings.
Special Characteristics, Serious
Information Organization Problems.
DC2004
[email protected]
9/36
Oriental
Space
Time
Live Broadcast
Stories of Grassroots
Another unique title
for every program
Oriental People
A Program Called “Oriental Space Time”
from China Central TV station
DC2004
[email protected]
10/36
Various information: creator (such as
actor, singer, player, speaker, director,
etc.) playing time, start point, end point,
system, aspect ratio, sound
characteristics, color, projection speed,
playing speed and so on.
As for date, there are copyright
validity dates, first playing dates, last
playing dates, license time and
manufacture dates.
DC2004
[email protected]
11/36
3 Users’ Needs
DC2004
[email protected]
12/36
33
individuals
organizations
117
Figure 2:Responses of users’ needs
DC2004
[email protected]
13/36
20
13
common users
60
professional
users
libraries and
information
centers
TV stations
57
Figure 3:Responses of users’ needs
DC2004
[email protected]
14/36
Users’ needs are summarized as the following:
a. Most users are young to middle-aged
people, with higher educational background.
b. Users’ needs vary distinctively. The
objective resources and users’ works or
hobbies are closely related.
c. To satisfy special demands of different
users, different organizations provide entirely
different services.
DC2004
[email protected]
15/36
4 Our Study on
Audiovisual
Metadata Set
DC2004
[email protected]
16/36
4.1 Principles of Design
a. User’s Needs Principle.
b. Simplicity and Operability
Principle.
c. Breadth and Depth Principle.
d. Open and Extensibility
Principle.
e. Interoperability Principle
DC2004
[email protected]
17/36
4.2 Description layers of
audiovisual resources
DC2004
[email protected]
18/36
Resources
Collective layer
Is part of
Has parts
Individual layer
Is part of
Has parts
Analytic layer
DC2004
Static pictures: a group of picture
Audio information: a series of …
Videos: series TV play, or a column
Static pictures: a picture of the group
Audio information: a tape of the series
Videos: a program of the play or column
Static pictures: an object of the picture
Audio information: a song of the tape
Videos: a segment of the program
[email protected]
19/36
4.3 Audiovisual Metadata Set
DC2004
[email protected]
20/36
Referred to DC elements
and qualifiers, our
Audiovisual Metadata Set
presents 15 elements and 84
qualifiers altogether.
DC2004
[email protected]
21/36
Elements:
Title
Creator
Subject and Keywords
Description
Publisher
Contributor
Date
Resource Type
DC2004
[email protected]
22/36
Elements:
Format
Resource Identifier
Source
Language
Relation
Rights Management
Physical Description
DC2004
[email protected]
23/36
Qualifiers for the element
“Description” : notes, abstracts,
audience, awards,
tableOfContents, version,
colorMode, shootingPlace,
cameraMotion, sceneRange,
cameraAngle, placeofCollection,
and holdingInstitution.
DC2004
[email protected]
24/36
Qualifiers for the element
“Date” : created, issued,
modified, valid, published,
manufactured, copyright,
firstBroadcasted, broadcasted,
shot.
DC2004
[email protected]
25/36
Qualifiers for the element
“Language” : track, subtitle
and ISO639-2 (Code System
Qualifier)
DC2004
[email protected]
26/36
Qualifiers for the element
“Rights Management” :
secretLevel, owner, kind,
statement, user,
authorizedScope, deadline,
usage, and times.
DC2004
[email protected]
27/36
Qualifiers for the element
“Physical Description” :
extentOfItem, playingTime,
startPoint, endPoint, system,
aspectRatio,
specialProjectionCharacteristics,
soundCharacteristics,
trackConfiguration, color,
projectionSpeed, and size.
DC2004
[email protected]
28/36
Example 1: An Analytic Layer Record of the Video recording
Title: 泰山日出
Creator [personal]: 刘慧, 摄像
Creator [personal]: 董平, 导演
Creator [personal]: 张政, 主持
Type: 动态图像
Physical Description [startPoint]: 00: 36: 52: 00
Physical Description [endPoint]: 00: 40: 39: 00
Description [abstracts]: 泰山日出、太阳及观日石
Description [shootingPlace]: 山东泰山
Date [shot]: 1997-10-01
Relation [isPartOf]: 山东泰安素材
Description [placeOfCollection]: 北京
Description [holdingInstitution]: 中央电视台
Identifier [callNumber]: S000007
DC2004
[email protected]
29/36
Example 2: An Individual Layer Record of a Video recording
Title: 山东泰安素材
Creator [personal]: 刘慧, 摄像
Creator [personal]: 董平, 导演
Creator [personal]: 张政, 主持
Type: 动态图像
Publisher [placeOfManufacture]: 北京
Publisher [manufacturerName]: 中央电视台
Date [manufactured]: 1997-10-01
Physical Description [extentOfItem]: 1 录像带
Physical Description [playingTime]: 94min.
Physical Description [color]: 彩色
Physical Description [startPoint]: 00: 01: 00: 00
Physical Description [endPoint]: 00: 94: 00: 00
Physical Description [system]: PAL
DC2004
[email protected]
30/36
Language [track]: chi
Description [notes]: 1997年10月由山东泰安电视台负责赴泰安拍摄
。
Description [abstracts]: 拍摄了山东泰山的正阳门、岱庙、南天门
、岱庙大殿里的房顶、碧霞祠、大佛像等建筑;泰山风景,如,
泰山山景、岱庙古树、树结、石碑、石刻、岩石松、高山流水、
石头小路、群山等;尤其是泰山的日出、日落和山雾;泰山的挑
山夫、爬山的儿童、登山的人、农民、岱庙执勤民警。此外,还
拍摄了岱庙表演、皮影、泰山脚下招待所全景、房子。
Description [shootingPlace]: 中国山东泰山
Date [shot]: 1997-10-01
Description [placeOfCollection]: 北京
Description [holdingInstitution]: 中央电视台
Identifier [callNumber]: S000007
Rights Management [owner]: 中央电视台
Rights Management [authorizedScope]: 可在国内与海外播放
Rights Management [deadline]: 2002-11-01至0000-00-00
DC2004
[email protected]
31/36
5 Conclusion and
Future works
DC2004
[email protected]
32/36
The
Audiovisual Metadata Set is just a
preliminary achievement of our
project.
The mapping of our set to both DC
and MARC (CNMARC and
USMARC) is listed in our full report
as a form. This lays the foundation of
future interoperation among these
metadata sets.
Work is still in progress!
DC2004
[email protected]
33/36
Some of the future tasks
To
evaluate and refine the Audiovisual
Metadata Set through real application.
To encode the Set both with XML
Document Type Definition and RDF
Schema.
To register the Set with authoritative
metadata registries.
To popularize the Set by publishing.
DC2004
[email protected]
34/36
To
design an audiovisual resource
management system and to develop
guidelines for it.
To accelerate the efficient retrieval
mechanism of audiovisual resources.
To realize the interoperation among
different metadata sets.
To construct a well-suited Digital
video library and then offer
convenient services for users.
DC2004
[email protected]
35/36
Thank you for your attention!
DC2004
[email protected]
36/36