Kai Liu, Phil Yang, Huayi Wu, Wenwen Li, Zhenlong Li, Tim Miao, Qunying Huang, Center for Intelligent Spatial Computing, George Mason University Douglas Nebert Federal Geographic Data Committee (FGDC) Contents 1. Geospatial Metadata and Clearinghouse 2. GEOSS Clearinghouse 3. GEOSS Clearinghouse Administration and Functions 4. Metadata Transformation in Harvesting 5. Metadata Creating 6. Metadata Search 1. Geospatial Metadata and Clearinghouse Metadata: “Data about Data” Geospatial metadata: used to describe how geospatial & attribute data was collected & processed into its final form Allows agencies to: Organize and maintain an organization's data investment Share data set information through data catalogs and clearinghouses Provide access for download or display and analysis through web services Allows users to: Discover, evaluate, and access geodata resources. Standards Allows communication between peers; Minimizes data integration time … costs Permits data operations under known conditions Numerous Geographic Metadata standards exist: Dublin Core, FGDC CSDGM, ISO19115 (UML) ISO19139 (XML) OGC WxS Capability Document Dublin Core Year 1995: NCSA/OCLC Metadata workshop, Dublin, Ohio 15 elements in Simple Dublin Core Metadata: all optional, all repeatable, Title Contributor Source Subject Date Language Description Type Relation Creator Format Coverage Publisher Identifier Rights Dublin Core FGDC CSDGM CSDGM is the current US Federal Metadata standard. Standard Organized into 10 sections Section 1: Identification Information Section 2: Data Quality Information Section 3: Spatial Data Organization Information Section 4: Spatial Reference Information Section 5: Entity and Attribute Information Section 6: Distribution Information Section 7: Metadata Reference Information Section 8: Citation Information Section 9: Time Period Information Section 10: Contact Information FGDC CSDGM Example ISO 19115 International consensus being built on structured metadata within a broader Geomatics Standard under ISO Technical Committee 211 (TC 211) UML abstract model ISO 19115 reflects FGDC, TC 287, ANZLIC and other inputs Specifies content guidance (field vocabulary and structures), like FGDC ANSI adoption 12/24/2003 ISO 19115 Organization Metadata Contact Data Identification Info Content Information Distribution Info Spatial Representation Metadata Online Source Data Quality Info Reference System Language Character Set Hierarchy Standard Name, Version Date Responsible Party Citation Metadata Maintenance Metadata Constraints Application Schema Info Portrayal Catalog Info Extension Information Required Conditional or Optional Core ISO Metadata Title Abstract Reference Date Format Responsible Party Extent Geographic Representation Type location Language Characterset Topic Category Scale Reference System Lineage On-line Resource ISO 19139 XML Schema implementation of ISO 19115 Metadata Storage and Access Storage: File system Database Access: browse, request CSW SRU HtmlParser GetCapabilities Local Filesystem Clearinghouse 2. GEOSS Clearinghouse The GEOSS Clearinghouse is a common search facility for the Intergovernmental Group on Earth Observation (GEO) based the GeoNetwork Opensource. Through the harvesting mechanism, GEOSS clearinghouse collects all types of metadata of EO data, services and other resources Through distributed search of harvested metadata or registered metadata catalogues, EO data, services, and related resources can be discovered and accessed. The GEOSS Clearinghouse supports a variety of geospatial standards: Dublin Core, FGDC CSDGM, ISO 19115/19139 metadata, CSW, RSS, and SRU for search, WMS related OGC standards for data access and visualization, as linked from the metadata. GEOSS Clearinghouse Main Page Supports Multiple languages Architecture GEOSS clearinghouse uses Web application with service based architecture (Jeeves) which is a Java Easy Engine for Very Effective Systems. Publishing of the simple systems on the Internet by Jeeves is possible without Java language knowledge and with little effort. All the interfaces are expressed through XML and XSL files Jeeves Architecture Business Logic 3. GEOSS Clearinghouse Administration and Functions Administration User Management Search Harvesting Metadata Edit and Management Visualization Administration User Management Harvesting Metadata Edit and Management Editing Metadata Visualization 4. Metadata Transformation in Harvesting Harvesting is the process of collecting remote metadata and storing them locally for a faster access. Currently, about 26 000 metadata were harvested to the GEOSS clearinghouse. Where GEOSS Harvest From? The GEOSS Clearinghouse is harvesting from the following protocols: Z39.50 "GeoProfile" (manually-initiated, under testing) CSW 2.0.2 baseline, AP ISO, ebRIM with no extensions WebDAV, sitemaps, and Web Accessible Folders (WAF) OGC GetCapabilities (WMS, WFS, WCS) endpoints Local file access for batch ingest of packaged, static metadata Also available to the system, are the following protocols: THREDDS Catalog OAI-PMH ISO 23950 "SRU" GeoNetwork "native" Harvest From Example: CSR Harvest From Example: USGS WAF Metadata of the following three standards can be harvested directly. Dublin-core FGDC CSDGM ISO19115, ISO19139 Other formats must be converted to ISO19139 Metadata Formats Transformation As for formats, the following "Full" metadata formats are recognized and parsed into ISO 19139 metadata for ingest: ISO 19115, 19119 XML (no transform, per 19139) ebRIM common information model XML, including the CSR form FGDC CSDGM Version 2 XML (no transform) CSW csw:Record XML (no transform) OGC GetCapabilities XML and other XML Stylesheets will be used for the Metadata Formats Transformation Stylesheet( *.xsl) Original Metadata ISO Metadata Stytlesheet example: <?xml version="1.0" encoding="UTF-8"?> <xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform"> <xsl:template match="metadata"> <uuid><xsl:value-of select="mdFileID"/></uuid> </xsl:template> </xsl:stylesheet> Metadata Transformation in CSR Harvesting CSR: which is short for the GEOSS Component and Service Registry. ebRIMcomponent2ISO19139.xsl will be used for component’s translating <gmd:fileIdentifier> <gco:CharacterString> <xsl:value-of select="csw:GetRecordByIdResponse/rim:Organization/@*[name()='id']"/> </gco:CharacterString> </gmd:fileIdentifier> Also, ebRIMservice2ISO19139.xsl will be used for service’s translating <gmd:fileIdentifier> <gco:CharacterString> <xsl:value-of select="csw:GetRecordByIdResponse/rim: Service /@*[name()='id']"/> </gco:CharacterString> </gmd:fileIdentifier> Example: Original Rim Metadata Example: The Same Record in GEOSS Clearinghouse Demo: Harvsting Demo: Harvsting Demo: Harvsting Demo: Harvsting Demo: Harvsting 5. Metadata Creating In the process of creating metadata, a template can be chosen. Some templates have been developed like below: FGDC template Dublin Core Template 6. Metadata Search Local search Using Lucene to index metadata Remote search CSW Search http://clearinghouse.cisc.gmu.edu/srv/en/csw RSS Search http://clearinghouse.cisc.gmu.edu/srv/en/rss.search SRU Search http://clearinghouse.cisc.gmu.edu/srv/en/portal.sru Advanced Search ISO Query-ables Name Definition Data Type Property Mapping to Information Model Subject The topic of the content of the resource CharacterString MD_Metadata.identificationInfo.AbstractMD_Identification.de scriptiveKeywords.MD_Keywords.keyword plus MD_Metadata.identificationInfo.MD_DataIdentification.topicC ategory Title A name given to the resource CharacterString MD_Metadata.identificationInfo.AbstractMD_Identification.cit ation.CI_Citation.title Abstract A summary of the content of the resource CharacterString MD_Metadata.identificationInfo.AbstractMD_Identification.ab stract AnyText A target for full-text search of character data types in a catagloue CharacterString Whole resource text. Format The physical or digital manifestation of the resource CharacterString MD_Metadata.distributionInfo.MD_Distribution.distributionFor mat.MD_Format.name Identifier A unique reference to the record within the catalogue Identifier MD_Metadata.fileIdentifier ISO Query-ables (contd.) Modified Data on which the record was created or updated within the catalogue Date-8601, example: 2000-08-21 MD_Metadata.dateStamp.Date Type The nature or genre of the content of the resource. Type can include general categories, generes or aggregation levels of content Codelist: dataset, datasetcollection, service, application MD_Metadata.hierarchyLevel.MD_ScopeCode/@codeListValu e. If MD_Metadata .hierarchyLevel is missing, ‘Type’ will be considered as “Dataset” (default). Bounding Box A bounding box for identifying a geographic area of interest BoundingBox, BoundingBox WestBoundLongitude SouthBoundLatitude EastBoundLongitude NorthBoundLatitude CRS Geographic Coordinate Reference System (Authority and ID) for the BoundingBox Identifier CRS Authority: MD_Metadata.referenceSystemInfo.MD_ReferenceSystem.refe renceSystemIdentifier.RS_Identifier.codeSpace ID: MD_Metadata.referenceSystemInfo.MD_ReferenceSystem.refe renceSystemIdentifier.RS_Identifier.code Version: MD_Metadata.referenceSystemInfo.MD_ReferenceSystem.refe renceSystemIdentifier.RS_Identifier.version Example 1: CSW Post Request <?xml version="1.0" encoding="UTF-8"?> <csw:GetRecords xmlns="http://www.opengis.net/cat/csw/2.0.2" xmlns:csw="http://www.opengis.net/cat/csw/2.0.2" xmlns:gmd="http://www.isotc211.org/2005/gmd" xmlns:ogc="http://www.opengis.net/ogc" xmlns:gml="http://www.opengis.net/gml" xmlns:rim="urn:oasis:names:tc:ebxml-regrep:xsd:rim:3.0" service="CSW" version="2.0.2" outputFormat="application/xml" outputSchema="http://www.isotc211.org/2005/gmd" resultType="results" startPosition="1" maxRecords="5"> <csw:Query typeNames="csw:Record"> <csw:ElementSetName>full</csw:ElementSetName> <csw:Constraint version="1.1.0"> <ogc:Filter> <ogc:PropertyIsLike escapeChar="\" singleChar="?" wildCard="*"> <ogc:PropertyName>Type</ogc:PropertyName> <ogc:Literal>dataset</ogc:Literal> </ogc:PropertyIsLike> </ogc:Filter> </csw:Constraint> </csw:Query> </csw:GetRecords> Example 2: RSS Request Example 3: SRU Request GEOSS Clearinghouse Demonstration http://clearinghouse.cisc.gmu.edu/geonetwork/ Acknowledgements & Further Info Website: The clearinghouse is operating at http://clearinghouse.cisc.gmu.edu/geonetwork Contactor: Chaowei Phil Yang: [email protected] Douglas Nebert: [email protected] The project is supported by FGDC grant, ESIP Products & Services and Air Quality Committees, and NASA Geospatial Interoperability grant. Kai Liu, Chaowei Phil Yang, Huayi Wu, Wenwen Li, Zhenlong Li, Tim Miao, Qunying Huang, Douglas Nebert Center for Intelligent Spatial Computing George Mason University & FGDC http://cisc.gmu.edu/ Let’s get Started Multilanguage Metadata Let’s Check the metadata Let’s check the metadata standards Let’s check the metadata standards Let’s check the metadata in XML Categories Recent Changes Advanced Search Create New Metadata Create New Metadata(1) Create New Metadata(2) Create New Metadata(3) Create New Metadata(4) Create New Metadata(5) Create New Metadata(5) Create New Metadata(6)
© Copyright 2026 Paperzz