Skip to content

Commit

Permalink
Merge branch 'klevetate-gh-pages' into gh-pages
Browse files Browse the repository at this point in the history
Resolving merge issues with base. Intro section should have been in the front matter only
  • Loading branch information
Tom Honeyman committed Feb 11, 2020
2 parents 4ad1017 + 0aaa565 commit 3d8e6fe
Show file tree
Hide file tree
Showing 2 changed files with 169 additions and 93 deletions.
101 changes: 56 additions & 45 deletions _things/thing-11.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,12 +3,12 @@ layout: thing
thing: 11
title: "What's my metadata schema?"
description: |
**Metadata** are the lifeblood for finding and reusing research
**Metadata** is the lifeblood for finding and reusing research
data. Data is only as valuable as the metadata which describes and
connects it.
overview:
- Getting started: What is metadata and what sort of metadata is critical for research data?
- Learn more: Metadata schema go formal and become standards
- Learn more: Metadata schemas go formal and become standards
- Challenge me: Hands on with ANZLIC or XML - You choose!
category: Metadata & more
---
Expand All @@ -21,71 +21,52 @@ quality, format, location and contact information. Creating metadata to
describe research data is very similar to the process for descriptive
cataloguing of library resources.

Metadata schema are sets of metadata elements (or fields) for describing
a particular type of information resource. Numerous metadata schema
A metadata schema is a set of metadata elements (or fields) for describing
a particular type of information resource. Numerous metadata schemas
exist for describing research data across different disciplines.
Probably our most familiar metadata schema are those commonly used in
Probably our most familiar metadata schemas are those commonly used in
library catalogues and publication repositories such as MARC and Dublin
Core.

1\. Start by reading the ANDS [Introduction to
Metadata](https://www.ands.org.au/working-with-data/metadata) to
1. Start by reading the ARDC [Metadata](https://ardc.edu.au/resources/working-with-data/metadata/) page to
understand what metadata is and why it is the lifeblood of research data
sharing.\
\
sharing.

2. Now look closely at this good quality metadata record for research
data: [Long-term variation of surface phytoplankton chlorophyll a in the
Southern Ocean during
1965-2002](http://gcmd.nasa.gov/KeywordSearch/Metadata.do?Portal=amd_au&KeywordPath=Parameters%7CCLIMATE+INDICATORS%7CATMOSPHERIC%26%2347%3BOCEAN+INDICATORS%7CTELECONNECTIONS%7CANTARCTIC+OSCILLATION&OrigMetadataNode=AADC&EntryId=chlorophyll_65-02&MetadataView=Full&MetadataType=0&lbnode=mdlb3).Why
data: [Long-term variation of surface phytoplankton chlorophyll a in the Southern Ocean during 1965-2002](http://dx.doi.org/doi:10.4225/15/5a384270f2b61). Why
do you think this record is considered ‘high quality’? **Hint:**
consider both the type and quality of information provided. What
metadata included in this record help discovery and reuse of the data?
Look back at the ANDS Introduction to Metadata for ideas and think about
Look back at the ARDC Metadata page for ideas and think about
previous Things such as licensing and sensitive data.

3\. **If you have time:** Sadly, it’s not hard to find examples of low
3. **If you have time:** Sadly, it’s not hard to find examples of low
quality metadata describing research data. Read this short two page
article [Avoiding Data Dumpsters - Toward Equitable and Useful Data
Sharing](http://www.nejm.org/doi/pdf/10.1056/NEJMp1605148) on the power
of good quality, schema-compliant metadata (N Engl J Med. 374 2016 May
11. DOI: 10.1056/NEJMp1605148).
Sharing](https://doi.org/10.1056/NEJMp1605148)(Merson, L., Gaye, O., & Guerin, P. J. (2016). New England Journal of Medicine, 374(25). doi:10.1056/nejmp1605148) on the power
of good quality, schema-compliant metadata.

**Consider:** Why, if metadata is the lifeblood of data discoverability
and reuse, is it often neglected or not richly done when data is
published?

### Well Done! You are halfway through\
- Now take some time to reflect

Go back over any notes that you may have made as you went through the
Things

Consider:

- Why you are doing 23things - are you achieving this?
- Let us know on the [Data Librarians Google+
page](https://plus.google.com/u/0/communities/105455769899183786145 "Data Librarians Google+ page")
if you are achieving your desired outcomes, have discovered new
things you didn't think you would, found that data is your "thing",
or just reflect on your journey so far

## Learn more
### Metadata schema and standards
### Metadata schemas and standards

A metadata standard is a schema that has been formally approved and
published, with governance procedures in place to maintain and update
the standard. Examples include ANZLIC (Australia and New Zealand Spatial
Information Council) and DDI (Document, Discover and Interoperate).
the standard. Examples include [ANZLIC](https://www.anzlic.gov.au) (Australia and New Zealand Spatial
Information Council) and [DDI](https://ddialliance.org) (Document, Discover and Interoperate).

Numerous metadata standards exist and the standard chosen to describe
Numerous metadata standards exist and the one chosen to describe
resources such as research data should be appropriate to the project or
discipline.

1\. Start by reading this short guide to [Selecting a metadata
standard](https://web.archive.org/web/20120211163323/http://marinemetadata.org/guides/mdatastandards/standardselect "Evaluating and selecting a metadata standard").
1. Start by reading this short guide to [Selecting a metadata
standard](https://marinemetadata.org/guides/mdatastandards/standardselect).
Keep this Guide open.

2\. Choose one disciplinary standard which sparks your interest from the
2. Choose one disciplinary standard which sparks your interest from the
UK Digital Curation Centre’s [Directory of Disciplinary
Metadata](http://www.dcc.ac.uk/resources/metadata-standards "DCC Directory of Disciplinary metadata").
Use the Guide above to critique your chosen standard.
Expand All @@ -104,16 +85,46 @@ choose between two!
- requires you to install open source ANZMet Lite metadata entry tool
- Is a hands-on experience in creating an ANZLIC compliant metadata
record
- [Go to Option
1](https://www.ands.org.au/working-with-data/skills/23-research-data-things/all23/thing-11/thing-11-challenge-me-options#option1)

Various tools exist to support the creation of standards-based metadata. For this activity, we will download the freely available ANZMet Lite metadata entry tool and create an ANZLIC compliant metadata record.

The ANZLIC Metadata Profile is most commonly used to describe geographic (or spatial) resources. It was first published in 2007 and is based on an the International Standard: ISO 19115:2005.

1. **Start by installing** the [ANZMet Lite metadata collection tool](https://www.anzlic.gov.au/resources/metadata#ANZMetLiteTool) (Windows only). Information about the tool, including user guides are also available on this page.

- To install the software click on the link and choose to open or save the file (we chose open)
- Click on setup.exe and ‘run’
- A wizard will guide you through the installation process.

2. Now open your installation of ANZMet Lite and follow the prompts to **create a compliant ANZLIC record**. Start by giving your file a name and browse to the location you wish to save your file to.

3. Create a “New Unlinked Metadata” record - you can either make up the metadata or reuse metadata from a record in Research Data Australia. Stick with the default ‘No’ when asked whether the resource is part of a package and ‘dataset’ as the hierarchy level.

4. **See if you can complete and save a valid ANZLIC record**. The mandatory elements are highlighted and ‘tips’ are available on screen - look for the orange icon.

**Reflect** on your experience using the ANZMet Lite Tool. Are you aware of other free or open source tools for creating for metadata?

**Option 2: Hands on with XML**

- does not require you to install software
- Is a hands-on experience with XML and may help prepare you for Thing
13 (Crosswalks)
- [Go to Option
2](https://www.ands.org.au/working-with-data/skills/23-research-data-things/all23/thing-11/thing-11-challenge-me-options#option2)

XML is the language most commonly used for machine to machine transfer of metadata. An understanding of XML is valuable for anyone working ‘under the hood’ with metadata: for example, harvesting or crosswalking metadata.

For this activity we will use Dublin Core. It is a well-used metadata standard across domains and resource types and it forms the basis of many other metadata standards. Let's get started!

1. **Start by creating a ‘simple’ metadata record** in Dublin Core using the free [Dublin Core Generator online tool](http://nsteffel.github.io/dublin_core_generator/) (use the [Simple Generator](http://nsteffel.github.io/dublin_core_generator/generator_nq.html)). Provide some content for at least the first 5 of the 15 Dublin Core elements - you can either make up the metadata or reuse metadata from a record in Research Data Australia. Once you’ve created the metadata record:

- tick all the output options
- display the output as XML
- click 'Generate Metadata!' and
- copy the XML text

2. Paste the XML text into the left hand pane of the [Online XML Editor](http://www.tutorialspoint.com/online_xml_editor.htm) noting the tree display on the right hand side. Spend a few moments checking out the features of this tool before manually editing your XML to change the author name, add a subject term and add a few other metadata elements. Does the metadata look correct in the tree display?

**Reflect** on XML as a metadata tool.


**Consider:** the metadata creation tools we have explored in Thing 11 -
what needs to happen for these tools to have much wider uptake?
**Consider** the metadata creation tools we have explored in Thing 11 -
what needs to happen for these tools to have much wider uptake?
161 changes: 113 additions & 48 deletions _things/thing-13.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,29 +17,23 @@ category: "Metadata & more"

We learned in Thing 11 that lots of metadata schemas exist to support
different data types and disciplines. There are times when metadata
created in one schema will need to be transformed or crosswalked to
another so that metadata can been shared between systems and is more
discoverable.
created in one schema will need to be mapped, transformed, or "crosswalked" to
another so that metadata can been shared between systems.

1\. Read the Wikipedia [introduction to
crosswalks.](https://en.wikipedia.org/wiki/Schema_crosswalk)
1. Read the Wikipedia [introduction to
crosswalks](https://en.wikipedia.org/wiki/Schema_crosswalk).

2.  Now we’ll look at an example of a metadata record that has been
2.  Now we’ll look at an example of a CSIRO metadata record that has been
crosswalked to comply with different metadata standards so it can be
found in various repositories. In separate windows in your browser, take
a look at how the metadata for the same record has been transformed (or
crosswalked) so it complies with different metadata standards and
systems.
a look at how the metadata for the 'WAMSI Node 1.1 - Marmion Lagoon CTD and Mooring Measurement Program (July 2007 to May 2008)' has been transformed (or crosswalked) so it complies with different metadata standards and
systems:

- First look at this CSIRO
[record](http://www.marine.csiro.au/marq/edd_search.Browse_Citation?txtSession=8603)
in MarLIN. It was created to comply with the ISO19115 standard for
- in [marlin](http://www.marine.csiro.au/marq/edd_search.Browse_Citation?txtSession=8603)
This record was created to comply with the ISO19115 standard for
describing geospatial data
- This is the same record as it appears in the [CSIRO Data Access
Portal](http://doi.org/10.4225/08/50F624A9E6D5C "CSIRO data access portal - record")
- And the same record again as it was crosswalked to the RIF-CS schema
for [Research Data
Australia](https://researchdata.ands.org.au/wamsi-node-11-2007-2008/444960/ "Research data Australia record")
- in the [CSIRO Data Access Portal](http://doi.org/10.4225/08/50F624A9E6D5C "CSIRO data access portal - record")
- in [Research Data Australia](https://researchdata.ands.org.au/wamsi-node-11-2007-2008/444960/ "Research data Australia record") where it has been cross-walked to [RIF-CS](https://documentation.ands.org.au/display/DOC/About+RIF-CS).

**Consider:** What do you notice about these schemas? Do all three
records provide exactly the same information? Or are there differences?
Expand All @@ -49,25 +43,21 @@ about having to crosswalk?
## Learn more
### Try your hand at a crosswalk

Let’s map schema!
Let’s map schemas!

Mapping and crosswalking metadata from one schema to another enables
metadata harvesting and sharing between systems.

1\. Crosswalks are viewed more easily in a table or spreadsheet format.
Take a look at [this
example](http://www.ddialliance.org/resources/ddi-profiles/dc "Dublin core example")
of a Dublin Core<->DDI crosswalk table.

2\. Time to get hands on! Start by choosing any existing metadata record
to work with. Here’s an
[example](https://researchdata.ands.org.au/eeg-perception-microtones-information-stimuli).
This will be your “source” record. Copy the Dublin Core<->DDI
crosswalk
[table](http://www.ddialliance.org/resources/ddi-profiles/dc "DDI crosswalk table")
from \#1 above into Word or Excel. Can you create a high level metadata
mapping from your source record to Dublin Core? (Hint: Click on
“Registry View” in the bottom right-hand corner of the example source
1. Crosswalks are viewed more easily in a table or spreadsheet format.
Take a look at this mapping of [Dublin Core to DDI](http://www.ddialliance.org/resources/ddi-profiles/dc "Dublin core example").

2. Time to get hands on! Start by choosing any existing metadata record
to work with. Here’s an [example](https://researchdata.ands.org.au/eeg-perception-microtones-information-stimuli).
This will be your “source” record.
- Copy the Dublin Core-DDI crosswalk [table](http://www.ddialliance.org/resources/ddi-profiles/dc "DDI crosswalk table")
from \#1 above into a blank document or spreadsheet.
Can you create a high level metadata mapping from your source record to Dublin Core? (**Hint:** Click on
“Registry View” in the bottom right-hand corner of the footer in the source RDA
record). The Dublin Core Metadata Element Set [explained in more
detail](http://dublincore.org/documents/dces/ "Dublin core set explained in more details").
Don’t worry if you don’t have time to complete the entire record - just
Expand All @@ -77,29 +67,104 @@ get a feel for the process.
Crosswalk. How did you overcome it?

## Challenge me
### An introduction to XSL for crosswalks

Hands on with XML!
### An introduction to the technical side of crosswalks

This activity will provide an interactive introduction to XSLT using a
free online tool.

Many well-known metadata standards are expressed as XML schemas,
including MARCXML, Dublin Core, MODS, METS, EAD, RIF-CS and others. The
flexible structure of XML makes it possible to convert data from one
metadata standard to another using an XSLT. XSLT (Extensible Stylesheet
Language Transformations) is a language for transforming XML documents
into other XML documents.

1\. Start by having a look at some of the
[crosswalks](http://www.ands.org.au/online-services/rif-cs-schema/crosswalks-transform-your-metadata "Research Data Australia crosswalks")
developed for ingesting records into Research Data Australia.

2\. Go to the free online tool [XSLT Test
Tool](http://xslttest.appspot.com/ "XSLT test tool") free online tool.
Follow the steps in [this
exercise](https://www.ands.org.au/working-with-data/skills/23-research-data-things/all23/thing-13/technical-crosswalk-exercise "23 things crosswalk exercise")
to create, edit and display XML.
metadata standard to another using XSLT (Extensible Stylesheet
Language Transformations). XSLT is a language for transforming XML documents
into other XML documents (or into HTML), and XSLT files can be considered machine-readable crosswalks.

1. Start by having a look at some of the [machine-readable crosswalk XSLT files](http://www.ands.org.au/online-services/rif-cs-schema/crosswalks-transform-your-metadata "Research Data Australia crosswalks") for ingesting records into Research Data Australia.

2. Go to the free online tool [XSLT Test Tool](http://xslttest.appspot.com/ "XSLT test tool").
Follow these steps to create, edit and display XML:

- Copy the **XML** below and paste it into the page at http://xslttest.appspot.com in the first edit box:

```xml
<?xml version="1.0" encoding="UTF-8"?>
<books>
<book>
<author>Pam Jacobs</author>
<title>The Book About Pam</title>
<publisher>Good Books</publisher>
</book>
<book>
<author>Neil Simon</author>
<title>Neil's Book</title>
<publisher>Books to Go</publisher>
</book>
</books>
```

- Next, copy the **XSL** below and paste it into the second edit box:

```xml
<?xml version="1.0" encoding="UTF-8"?>

<xsl:stylesheet version="1.0"
xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

<xsl:template match="/">
<html>
<body>
<h2>Book Collection</h2>
<table border="1">
<tr bgcolor="#FC99EF">
<th>Title</th>
<th>Author</th>
<th>Publisher</th>
</tr>
<xsl:for-each select="books/book">
<tr>
<td><xsl:value-of select="title"/></td>
<td><xsl:value-of select="author"/></td>
<td><xsl:value-of select="publisher"/></td>
</tr>
</xsl:for-each>
</table>
</body>
</html>
</xsl:template>
</xsl:stylesheet>
```

- Click **Run Transformation** to generate the resultant XML, that looks like:

```xml
<html>
<body>
<h2>Book Collection</h2>
<table border="1">
<tr bgcolor="#FC99EF">
<th>Title</th>
<th>Author</th>
<th>Publisher</th>
</tr>
<tr>
<td>The Book About Pam</td>
<td>Pam Jacobs</td>
<td>Good Books</td>
</tr>
<tr>
<td>Neil's Book</td>
<td>Neil Simon</td>
<td>Books to Go</td>
</tr>
</table>
</body>
</html>
```

**Note** the above XML is in fact also **HTML** that constructs a table.

- Preview the table by clicking **Open Result**
- or save the result to a .html file and open it in a browser to see the formatted table!

**Consider:** do you think experience with XML would be useful in your
workplace?

0 comments on commit 3d8e6fe

Please sign in to comment.