Agenda - bioCADDIE

Agenda
v 
v 
v 
v 
Presentation regarding terminology services –
Ergin/Cui
Discuss Tech team timelines and deliverables
– All /Anu
Come up a with a high level technology view
of the architecture for guidance of prototype
development (architecture diagram) Claudiu
Brief updates from others
Supported by the NIH grant 1U24 AI117966-01 to the University of California, San Diego
1
Terminology Services (Ergin)
v 
Requirements?
Supported by the NIH grant 1U24 AI117966-01 to the University of California, San Diego
2
ngram
Input
Query Entry
Terminology
Server
ontology
Prepare
Identify
Entities
Organize
results
Expand
(synonyms,
hyponyms)
Run
Execute
query
Presentation
Format and
List
Apply Filter
Categorization
Refine
Terminology in search
Query Entry
ngram
Input
Query Entry
Terminology
Server
ontology
Prepare
Identify
Entities
Organize
results
Expand
(synonyms,
hyponyms)
Run
Execute
query
Presentation
Categorization
Refine
Format and
List
Apply Filter
ngram
Input
Query Entry
Terminology
Server
ontology
Prepare
Identify
Entities
Organize
results
Expand
(synonyms,
hyponyms)
Run
Execute
query
Presentation
Categorization
Refine
Format and
List
Apply Filter
Terminology Server in Search
Search string à Phrases and operators
v  Phrases à Term
v 
w 
Term identification (NER)
Term à Term class (entity type, semantic
type)
v  Term à Related terms (synonyms,
hyponyms)
v  Term à Semantic links (?)
v 
Terminology Server in Indexing
Document à Phrases
v  Phrases à Terms
v 
w 
v 
Term identification (NER)
Term à Related terms
Terminology – other
Organize/customize search results based
on datasource
v  Terminology Management functionality
v 
w 
Import (owl, rdf), Update, Map, List, view
meta information(date create, update, who,
when?, is active, …)
NIF – OntoQuest
v 
v 
v 
v 
v 
v 
v 
v 
v 
v 
v 
List all ontologies
Display information about a particular ontology:
Download an ontology file: Download the file (.owl)
corresponding to the given ontology.
Get defined categories in an ontology
Get Concept
Search Concepts
Get parents/children of a given concept
Get subclasses/superclasses, parts/whole of a given
concept
Get siblings of a given concept
Search terms starting with the query
Search term properties
Terminology
v 
Primary
w 
w 
w 
v 
Disease (Condition)
Drug (Chemical, Substance, Compound)
Omic: Gene – Protein
Auxillary
w 
w 
Body Structure
Procedure
•  Laboratory
w 
v 
Organism
Maybe
w 
Physical object
Terminology Content
v 
MeSH
w 
v 
Backbone
Supported by
w 
Gene ontology/nomenclature
•  GO, HGNC, NCBI
w 
w 
w 
w 
v 
SNOMED CT
NCI Thesaurus
PubChem
UMLS CUIS as the glue
Maybe
w 
Protein db
•  HPRD, NCBI
w 
w 
w 
ICD9, ICD10
RxNorm
LOINC
OpenGALEN
GRAIL language
v  Not free
v 
CTS-R2 implementations
Draft Standard, submitted by Mayo Clinics
v  http://wiki.hl7.org/index.php?
title=Common_Terminology_Services__Release_2_%28Normative%29
v  NCBO Bioportal – REST
v  NCBO Bioportal – RDF
v  eXist – An XML database
v  LexEVS
v 
Graph DB
v 
v 
Search string à Phrases and operators
Phrases à Term
w 
v 
v 
v 
v 
v 
Term à Term class (entity type, semantic type)
Term à Related terms (synonyms, hyponyms)
Term à Semantic links (?)
Organize/customize search results based on datasource
Terminology Management functionality
w 
v 
Term identification (NER)
Import (owl, rdf), Update, Map, List, view meta
information(date create, update, who, when?, is active, …)
Small data: 20-30 million entires
Conclusion
v 
v 
Start with a graph db
Populate
w 
w 
w 
w 
v 
MeSH
SNOMED CT
NCIT
…
Implement REST services for
w 
w 
w 
Normalize a phrase
Find Term
Find related term
•  parent/child
•  Synonyms
w 
Management (add, list, view, …)
Timeline & deliverables
v 
Discuss from excel spreadsheet + Jeff’s
comments
Architecture Diagram
v 
From Claudiu
Updates from Others
Other issues
v 
Any other issues?
v 
Thank You