Searching for Alternatives- Understanding the Problem

The MeSH Translation Database:
From Idea to Reality
Stuart J. Nelson, MD
AN IDEA
(Crossing Boundaries of Language)
Acknowledgements
Per Olsson
 Boris Loginov
 Michael Schopen
 Arne Jakobsson

Known Translations of MeSH

In UMLS
–

Complete Translations
–

Dutch, Finnish, French, German, Italian, Portugese,
Russian, Spanish
Arabic, Chinese, Czech, Greek, Japanese, Swedish,
Thai, Turkish
In Progress or Planned or Hoped For
–
Korean, Lithuanian, Norwegian, Polish, Slovakian,
Slovene
Why Bother With Translation?
The Fundamental Problem of IR
 Authors Finding Keywords
 Supporting Additional Indexing
 Interoperability in UMLS

Difficulties for Translators
MeSH was Term-Oriented
 Non-synonymous Entry Vocabulary
 Need to Track Changes
 Can’t See Current Work

Example - Federal Aid
In 2000 - Entry term
(NRW)
Financial Support
 In 2001 - Entry term
(NRW)
Financing, Government

Other Difficulties
No English Equivalent for a Term
 Inclusion in UMLS Problematic

–
–
Concept-Oriented
Need to Identify Meaning of Translated Term
Background
A New Environment for MeSH
Data Structure Similar to UMLS
 Off Mainframe
 Platform Independent Clients

A New Data Structure

Descriptor Class
–

Concepts
–

Composed of Concepts
Composed of Terms
Terms
Attributes at Each Level

Descriptors
–

Concepts
–

Trees
Definitions
Terms
–
Sources
Example: Exercise
Old View
Term-centric: the descriptor
Main Heading = Exercise
ST=T056 ST=T040
Descriptor UI= D015444
Definition = Physical activity done with the
intention of improving or maintaining physical
fitness or health.
Entry Term = Aerobic Exercise
ST=T056 ST=T040
SR=Narrower
Entry Term = Exercise, Aerobic
ST=T056 ST=T040
SR=Narrower
Entry Term = Exercise, Isometric
ST=T056 ST=T040
SR=Narrower
Entry Term = Exercise, Physical
ST=T056 ST=T040
SR=Equivalent
Entry Term = Isometric Exercise
ST=T056 ST=T040
SR=Narrower
Notable
Differences
Definition
Only one definition possible
Each concept can be defined
Semantic Type (ST)
Applied (redundantly) to each term
Applied to each concept
Semantic Relation (SR)
Applied (redundantly) to each term
Applied to subordinate concepts
New View
Concept-centric: the descriptor class
Descriptor Class = Exercise
Descriptor UI = D015444
Preferred concept = Exercise [Concept UI = M0023063]
ST = T040 (Organism Function)
ST = T056 (Daily or Recreational Activity)
Definition = Physical activity done with the intention
of improving or maintaining physical fitness or health.
Term = Exercise
Term = Exercise, Physical
Subordinate concept = Exercise, Aerobic [Concept UI = M0023062]
ST = T040 (Organism Function)
ST = T056 (Daily or Recreational Activity)
SR = Narrower
Definition = Exercise designed to improve an
individual's cardiovascular condition.
Term = Exercise, Aerobic
Term = Aerobic Exercise
Subordinate concept = Exercise, Isometric [Concept UI = M0023064]
ST = T040 (Organism Function)
ST = T056 (Daily or Recreational Activity)
SR = Narrower
Definition = Active exercise performed against stable
Term resistance,
= Exercise, without
Isometric
change in the length of the muscle.
Term = Isometric Exercise
Coordinating Translations
Why?
Correct Linking of Terms
 Easy Insertion into UMLS
 Support non-English Concepts
 Easier to Maintain Currency

Coordinating Translations How?
Single Database - Web Interface
 Add Language as a Term Property
 Translated Terms added to Concept
 Non-English Concepts added to Descriptor

Additional Issues

Character Set
–
–

Oracle 8I
UTF8
Copyright Issues
THE REALITY
(Crossing Boundaries by Cooperation)
Acknowledgements
Natalie Arluk -OCCS
 MeSH Staff

–
–
–
Jacque-Lynne Schulman
Allan Savage
Douglas Jouhnston
Michael Schopen - DIMDI
 ALL THE TRANSLATORS

Administration

Central Administrator
–
–
–

Default Thesaurus ID
Establish Local Administrator
Approve New Concepts
Local Administrator
–
–
Establish Privileges for Translator
Authorize Changes to Concepts
Initializing the Translation

Existing Translation
–
–

Code Sets
Review and Approval
New Translations
Design Considerations (1)
Finding What Needs Work


Tree Browsing
Searching
–
–

By Tree
By Term
Color Codes For Status
–
–
–
–
–
Translated and Approved
Translated, Pending Approval
Untranslatable
Needing Translation
New MeSH in Progress
Design Considerations (2)
What Version of MeSH?

New Mesh
–
–
–

Current MeSH
–

In Progress
Provisional Marked for Review
Authorized
Rare Additions (e.g., AIDS, SARS virus)
Different Fonts
Design Considerations (3)
Processing Requirements

Creating New Terms
–
–
–

Existing Concept
Preferred Term for Language
NLM Review Unnecessary after Approval
Creating New Concepts
–
–
–
Defining Concept in English
Notify NLM for Review after Approval
Requires Authorization
Year End Needs -Cutover

File Formats for Redistribution
–
–
English MeSH
Translated MeSH
Publications
 UMLS Inclusion

Last Steps
Problem Reports
 Translator’s Manual
 Enlistment of Translators
 Feedback on Interface Design
