From 29ce7e87dcb0dd50ba5613621c9a69be948c22e7 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Fri, 6 Mar 2020 10:21:57 +0800 Subject: [PATCH 01/31] Minor format change --- _things/thing-19.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/_things/thing-19.md b/_things/thing-19.md index 083f047..d81ad11 100644 --- a/_things/thing-19.md +++ b/_things/thing-19.md @@ -19,7 +19,7 @@ category: 'Hands on with data & tools' Apps and APIs are tools which make data much more discoverable and usable for all of us - not just researchers!  You can choose to explore -both APIs and APPs or just one. +both APIs and Apps or just one. **Option 1: How are APIs used with data?** @@ -173,4 +173,4 @@ National organisations: International organisations: -- [Re3data.org](https://www.re3data.org/search?query=api "Re3data") \ No newline at end of file +- [Re3data.org](https://www.re3data.org/search?query=api "Re3data") From e7487feb97af264d57d027ce9d285fecde41197d Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Fri, 6 Mar 2020 10:36:46 +0800 Subject: [PATCH 02/31] Removed images They were on an ANDS host and fairly sure copyright usage is with Trove. --- _things/thing-19.md | 6 +----- 1 file changed, 1 insertion(+), 5 deletions(-) diff --git a/_things/thing-19.md b/_things/thing-19.md index d81ad11..bf586da 100644 --- a/_things/thing-19.md +++ b/_things/thing-19.md @@ -51,11 +51,7 @@ to see some clever uses of Trove APIs. **** Galley](http://help.nla.gov.au/trove/building-with-trove/application-gallery "Trove") to find out what was making news **today** in history? -Click on *Culture Collage* icon ![Trove-Culture -Collage](https://www.ands.org.au/__data/assets/image/0003/574536/trove-culturecollage.png) ->> then click on the *On this day ![Trove On this -day](https://www.ands.org.au/__data/assets/image/0008/574550/trove-on-this-day.png)* -API icon ! 
+- Click on *Culture Collage* then click on the *On this day* API icon ! **Option 2: Apps for researchers and citizen scientists (is that you?)** From 6b8574a2f4c07dd5b06094f4961e64ecc9751a23 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Fri, 6 Mar 2020 10:37:58 +0800 Subject: [PATCH 03/31] Removed *** --- _things/thing-19.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/_things/thing-19.md b/_things/thing-19.md index bf586da..a741b69 100644 --- a/_things/thing-19.md +++ b/_things/thing-19.md @@ -55,8 +55,8 @@ to find out what was making news **today** in history? **Option 2: Apps for researchers and citizen scientists (is that you?)** -Often several little APIs underpin **** a (software) **App(**lication) -(program).  Data gatherers in the field use apps to make data collection +Often several little APIs underpin a (software) App(lication) +(program). Data gatherers in the field use apps to make data collection easier, more accurate and more rich. eg fill out data forms offline on their smartphones and tablets, capture images, photos, videos and sounds, track locations on maps (probably using the Google Map API), and From d410315d703c58093ededb37e507c81d5ce19f47 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Fri, 6 Mar 2020 10:46:27 +0800 Subject: [PATCH 04/31] Formatting changes Removed duplicate URL and stray / --- _things/thing-19.md | 25 ++++++++++++------------- 1 file changed, 12 insertions(+), 13 deletions(-) diff --git a/_things/thing-19.md b/_things/thing-19.md index a741b69..c749db9 100644 --- a/_things/thing-19.md +++ b/_things/thing-19.md @@ -28,10 +28,10 @@ to cleverly make data more discoverable and re-usable eg the Google Maps API is very widely used to record the exact location of a species, or photograph and lets developers embed Google Maps on webpages. -1\. 
Start by watching this easy, short (3.24 min) video, ‘[What is an +1. Start by watching this easy, short (3.24 min) video, ‘[What is an API?](https://www.youtube.com/watch?v=s7wmiS2mSXY)’ -2\. Metadata records often show at least 2 different APIs in action. +2. Metadata records often show at least 2 different APIs in action. [Interview material from Western Sydney women's oral history project: 'From farms to freeways: Women's memories of Western Sydney'](https://researchdata.ands.org.au/interview-material-western-western-sydney/561264/) @@ -41,7 +41,7 @@ metadata record shows both: where the data was collected) - Views and Accesses API -3\. Trove at the National Library of Australia uses APIs to showcase +3. Trove at the National Library of Australia uses APIs to showcase National Library of Australia collections in amazing ways. Check out the [Trove Applications Galley](http://help.nla.gov.au/trove/building-with-trove/application-gallery "Trove") @@ -86,15 +86,14 @@ millions of Australian resources including newspaper articles, images and manuscripts. The Trove API enables people to create new applications, tools  and  interfaces using the rich store of Trove data. -1\. Start by browsing around the [Trove Application +1. Start by browsing around the [Trove Application Gallery](http://help.nla.gov.au/trove/building-with-trove/application-gallery) for some of the creative ways people have used the Trove API. Be inspired! -2\. To see more of the power of one API, watch this [short -video](https://www.youtube.com/watch?v=GMkGlOYvz8I)[(3:09 -min)](http://help.nla.gov.au/trove/building-with-trove/api). The screen -is a bit fuzzy but you can get the idea of this innovative “use your +2. To see more of the power of one API, watch this [short +video](https://www.youtube.com/watch?v=GMkGlOYvz8I)(3:09 +min). The screen is a bit fuzzy but you can get the idea of this innovative “use your voice to query Trove” API. 
**Share** your thoughts on how APIs could make data more exciting and @@ -102,16 +101,16 @@ accessible for more people. **If you have time to get hands on!** -3\. Browse the +3. Browse the [introduction](http://help.nla.gov.au/trove/building-with-trove) to the [Trove API](http://help.nla.gov.au/trove/building-with-trove/api). -4\. Go the [Trove API Console](http://troveconsole.herokuapp.com/) and +4. Go the [Trove API Console](http://troveconsole.herokuapp.com/) and click on some of the example searches provided. The results are in a machine readable format but look closely and you can see the citation details eg title, date, source, pagination -4. Try constructing your own API query to see what results you +5. Try constructing your own API query to see what results you get.  Note that: If you want to use them in your own code you'll need to get an API key and add '&key=\[Your API key\]' to the url. @@ -125,7 +124,7 @@ APIs and Apps are the transformation agents which make data and services powerful and accessible. These two options give you a chance to see how APIs can be applied to either data or services. -**Option 1**: **Hands-on API tutorials**\ +**Option 1**: **Hands-on API tutorials** Are you new to APIs and want a hands-on introduction to commonly used APIs, including Twitter and YouTube? Choose one API from the [Codecademy APIs](https://www.codecademy.com/apis) and work through the online @@ -134,7 +133,7 @@ tutorial to learn more about the API you chose. **Consider:** how the APIs could be applied to a dataset to make it more reusable? -**Option 2: APIs for ANDS Services**\ +**Option 2: APIs for ANDS Services** If your institution uses ANDS services, you might want to use one or more of these APIs to enhance your services. 
From 45f14c27939009f78b8f2f1dc669818613659d72 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Fri, 6 Mar 2020 13:55:08 +0800 Subject: [PATCH 05/31] Fixed ANDS dev links --- _things/thing-19.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/_things/thing-19.md b/_things/thing-19.md index c749db9..3a8d9cc 100644 --- a/_things/thing-19.md +++ b/_things/thing-19.md @@ -141,11 +141,11 @@ Explore one of these APIs to get a feel for what the API does and whether it may be useful to your organisation: - [OAI-PMH - Provider](http://developers.ands.org.au/services/collections-registry-api/oai/ "OAI"): + Provider](https://developers.ands.org.au/services/collections-registry-api/oai/ "OAI"): use this API if your system is a harvester which is compatible with the OAI-PMH protocol. - [Research Activity Grants - API](http://developers.ands.org.au/services/research-activity-api/ "Grants API"): + API](https://developers.ands.org.au/services/research-activity-api/ "Grants API"): provides machine access to research activity information contributed to Research Data Australia such as grants and projects. 
A light-weight method to query the ANDS Registry for specific From def9b4b6451cfeedcbe1f54ae86e7df4fb657d0e Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Fri, 6 Mar 2020 14:10:57 +0800 Subject: [PATCH 06/31] Fixed ANDS redirect errors --- _things/thing-19.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/_things/thing-19.md b/_things/thing-19.md index 3a8d9cc..fd10bed 100644 --- a/_things/thing-19.md +++ b/_things/thing-19.md @@ -141,11 +141,11 @@ Explore one of these APIs to get a feel for what the API does and whether it may be useful to your organisation: - [OAI-PMH - Provider](https://developers.ands.org.au/services/collections-registry-api/oai/ "OAI"): + Provider](https://documentation.ands.org.au/login.action?os_destination=%2Fservices%2Fcollections-registry-api%2Foai%2Findex.action "OAI"): use this API if your system is a harvester which is compatible with the OAI-PMH protocol. - [Research Activity Grants - API](https://developers.ands.org.au/services/research-activity-api/ "Grants API"): + API](https://documentation.ands.org.au/login.action?os_destination=%2Fservices%2Fresearch-activity-api%2Findex.action "Grants API"): provides machine access to research activity information contributed to Research Data Australia such as grants and projects. A light-weight method to query the ANDS Registry for specific From 9987af73ccc69d60e70ee1a71f0d61c31a9a2202 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Fri, 6 Mar 2020 14:26:17 +0800 Subject: [PATCH 07/31] Fixed the ANDS links again This time for sure! 
--- _things/thing-19.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/_things/thing-19.md b/_things/thing-19.md index fd10bed..847380a 100644 --- a/_things/thing-19.md +++ b/_things/thing-19.md @@ -141,11 +141,11 @@ Explore one of these APIs to get a feel for what the API does and whether it may be useful to your organisation: - [OAI-PMH - Provider](https://documentation.ands.org.au/login.action?os_destination=%2Fservices%2Fcollections-registry-api%2Foai%2Findex.action "OAI"): + Provider](https://documentation.ands.org.au/display/DOC/OAI-PMH+Provider "OAI"): use this API if your system is a harvester which is compatible with the OAI-PMH protocol. - [Research Activity Grants - API](https://documentation.ands.org.au/login.action?os_destination=%2Fservices%2Fresearch-activity-api%2Findex.action "Grants API"): + API](https://documentation.ands.org.au/display/DOC/Research+Activities+API "Grants API"): provides machine access to research activity information contributed to Research Data Australia such as grants and projects. A light-weight method to query the ANDS Registry for specific From e59466208fe013b9b7ea7861c61840ba2d1fba38 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Fri, 6 Mar 2020 14:37:20 +0800 Subject: [PATCH 08/31] Replaced Codecademy link --- _things/thing-19.md | 7 ++----- 1 file changed, 2 insertions(+), 5 deletions(-) diff --git a/_things/thing-19.md b/_things/thing-19.md index 847380a..30f7532 100644 --- a/_things/thing-19.md +++ b/_things/thing-19.md @@ -126,12 +126,9 @@ APIs can be applied to either data or services. **Option 1**: **Hands-on API tutorials** Are you new to APIs and want a hands-on introduction to commonly used -APIs, including Twitter and YouTube? Choose one API from the [Codecademy -APIs](https://www.codecademy.com/apis) and work through the online -tutorial to learn more about the API you chose. +APIs, including Twitter and YouTube? 
Work through some or all of the [tutorial from freeCodeCamp](https://www.youtube.com/watch?v=GZvSYJDk-us) to get hands on learning about working with APIs. -**Consider:** how the APIs could be applied to a dataset to make it more -reusable? +**Consider:** how the APIs could be applied to a dataset to make it more reusable? **Option 2: APIs for ANDS Services** If your institution uses ANDS services, you might want to use one or From 4ddd37ec3bda14a34443fe03d8af4b91bba021b8 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Fri, 6 Mar 2020 15:03:58 +0800 Subject: [PATCH 09/31] Removed stray *** --- _things/thing-20.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/_things/thing-20.md b/_things/thing-20.md index 16cd457..f658bef 100644 --- a/_things/thing-20.md +++ b/_things/thing-20.md @@ -36,7 +36,7 @@ then, for all of us to have an understanding of spatial data. **Just for fun**: enter your address in the Atlas of Living Australia and see what birds and plants have been reported [in your street or suburb](https://biocache.ala.org.au/explore/your-area "ALA: Explore Your Area"). -You may be surprised at how ‘alive’ your street is. **** +You may be surprised at how ‘alive’ your street is. **Consider:** why do you think these geospatial visualisations are so powerful? @@ -134,4 +134,4 @@ customise your map. **Consider**: the data world is hungry for Geospatial tools and metadata and there is growing demand for people with these skills. How can these -skills be encouraged in your institution? \ No newline at end of file +skills be encouraged in your institution? From 2c94394f3b3c7759d94ae1e8dd3622254022d27e Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Fri, 6 Mar 2020 15:16:15 +0800 Subject: [PATCH 10/31] Removed images The text seems to be enough to guide people through the activity. 
--- _things/thing-20.md | 7 ++----- 1 file changed, 2 insertions(+), 5 deletions(-) diff --git a/_things/thing-20.md b/_things/thing-20.md index f658bef..a17b418 100644 --- a/_things/thing-20.md +++ b/_things/thing-20.md @@ -54,14 +54,11 @@ a particular location. Let’s look at a couple of examples**.** 1. Go to [Research Data Australia](https://researchdata.ands.org.au/) and choose **Map Search** from under the search - box.![T20-RDA](https://www.ands.org.au/__data/assets/image/0012/577983/rda.png){width="339" - height="127"} + box. 2. Zoom in on a location in Australia and use the rectangle tool at the top of the map to draw a search box around your chosen location. Hint: choosing somewhere on the coast (such as the Newcastle area) - will give you a wider range of datasets.\ - ![T20-click - image](https://www.ands.org.au/__data/assets/image/0011/577982/click.png) + will give you a wider range of datasets. 3. Then click on the green *Search* button 4. Browse through your results and look at the list of provider organisations and subject headings for the records in your result From a84ae571475dfb5b0380edec557225c4cdc736ff Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Mon, 9 Mar 2020 09:02:25 +0800 Subject: [PATCH 11/31] Remove reference to thing number --- _things/thing-21.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/_things/thing-21.md b/_things/thing-21.md index 4cc3051..0584a90 100644 --- a/_things/thing-21.md +++ b/_things/thing-21.md @@ -18,7 +18,7 @@ Data horror stories: how did it happen? Why is "clean” data important? Public policy, changes to medical protocols and economic decisions all depend on accurate and complete -data. Thing 21 looks at the why and what of “dirty data.” +data. This thing looks at the why and what of “dirty data.” 1. Pregnant men? Yes! 
Read some [real data horror stories](http://www.relevategroup.com/blog/dirty-data-horror-stories-–-when-good-data-goes-bad "When good data goes bad: blog") @@ -138,4 +138,4 @@ either: **Consider:** Carpentry course Trainers and Helpers are often in high demand. Would you consider becoming a trainer to teach tools such as -Open Refine? \ No newline at end of file +Open Refine? From ddf2613bbc5908dcba61272365b4c001dee64e0f Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Mon, 9 Mar 2020 09:09:59 +0800 Subject: [PATCH 12/31] Changed link #1 in getting started The blog no longer exists and it was possibly transphobic anyway. --- _things/thing-21.md | 4 +--- 1 file changed, 1 insertion(+), 3 deletions(-) diff --git a/_things/thing-21.md b/_things/thing-21.md index 0584a90..4459478 100644 --- a/_things/thing-21.md +++ b/_things/thing-21.md @@ -20,9 +20,7 @@ Why is "clean” data important? Public policy, changes to medical protocols and economic decisions all depend on accurate and complete data. This thing looks at the why and what of “dirty data.” -1. Pregnant men? Yes! Read some [real data horror - stories](http://www.relevategroup.com/blog/dirty-data-horror-stories-–-when-good-data-goes-bad "When good data goes bad: blog") - about dirty data. +1. Read some [real data horror stories](https://www.arnnet.com.au/article/197085/perils_dirty_data/? "Perils of dirty data: blog") about dirty data. 2. How does data get dirty in the first place? 
This [short video](http://aspcapro.org/resource/saving-lives-research-data/gis-video-what-makes-data-dirty "ASAPCA video") (5 min) from the ASPCA (American Society for Prevention of Cruelty From 0d3070c8f8b1be45fe448937f8ad21914c70b9b2 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Mon, 9 Mar 2020 09:21:22 +0800 Subject: [PATCH 13/31] Updated ASPCA video link --- _things/thing-21.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/_things/thing-21.md b/_things/thing-21.md index 4459478..70d92d7 100644 --- a/_things/thing-21.md +++ b/_things/thing-21.md @@ -22,7 +22,7 @@ data. This thing looks at the why and what of “dirty data.” 1. Read some [real data horror stories](https://www.arnnet.com.au/article/197085/perils_dirty_data/? "Perils of dirty data: blog") about dirty data. 2. How does data get dirty in the first place? This [short - video](http://aspcapro.org/resource/saving-lives-research-data/gis-video-what-makes-data-dirty "ASAPCA video") + video](https://vimeo.com/95289871 "ASAPCA video") (5 min) from the ASPCA (American Society for Prevention of Cruelty to Animals) shows simply how incomplete, inaccurate data can occur. You can imagine how the resultant problems would multiply From 0c6d2cea953e613d379e6389f8778aae12cb971b Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Mon, 9 Mar 2020 11:29:21 +0800 Subject: [PATCH 14/31] Fixed formatting --- _things/thing-21.md | 9 +++++---- 1 file changed, 5 insertions(+), 4 deletions(-) diff --git a/_things/thing-21.md b/_things/thing-21.md index 70d92d7..9405a86 100644 --- a/_things/thing-21.md +++ b/_things/thing-21.md @@ -68,10 +68,11 @@ data from PDFs, with a focus on introducing Tabula, a free open-source tool build for this specific task. 1. Get ready: go to [Extracting Data From - PDFs](http://schoolofdata.org/extracting-data-from-pdfs/) - 1. 
Download the correct version of Tabula for your operating - system, and java runtime if required - 2. note this tutorial doesn’t work on scanned pdfs + PDFs](http://schoolofdata.org/extracting-data-from-pdfs/) and download the correct version of Tabula for your operating + system, and java runtime if required. + + (Note: this tutorial doesn’t work on scanned pdfs) + 2. Work through as much of the Tabula tutorial as you can and remember this tutorial for the next time you get a PDF with valuable (and hard-to-extract) data. From e3075b940a890578e7cfe2cc04923c33831a440b Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Mon, 9 Mar 2020 11:41:20 +0800 Subject: [PATCH 15/31] Fixed markdown error --- _things/thing-21.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/_things/thing-21.md b/_things/thing-21.md index 9405a86..98ac94c 100644 --- a/_things/thing-21.md +++ b/_things/thing-21.md @@ -103,7 +103,7 @@ creeping into your The School of Data have some really interesting [Data Cleansing](http://schoolofdata.org/courses/#IntroDataCleaning) modules. -**Consider: **strategies for encouraging data to be published in more +**Consider:** strategies for encouraging data to be published in more re-usable formats rather than PDF. ## Challenge me From b1b7778a385e5ae812f801fe4a73ef403f5d17cb Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Mon, 9 Mar 2020 11:56:24 +0800 Subject: [PATCH 16/31] Updated old link for LC Fixed some odd spaces too. --- _things/thing-21.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/_things/thing-21.md b/_things/thing-21.md index 98ac94c..0779d70 100644 --- a/_things/thing-21.md +++ b/_things/thing-21.md @@ -115,8 +115,8 @@ manipulate data; clean up dirty data; and to transform datasets. **Option 1: If you are new to OpenRefine** -1. 
Start by watching[introduction to - OpenRefine](https://youtu.be/B70J_H_zAWM) (6 .48mins) to learn how +1. Start by watching [introduction to + OpenRefine](https://youtu.be/B70J_H_zAWM) (6.48mins) to learn how it can be used to clean up messy data. 2. Now get hands on!  You will need to download OpenRefine and the dataset for this activity. Work your way through as much as [this @@ -131,7 +131,7 @@ Take a look through the lesson materials used to teach OpenRefine in either: 1. OpenRefine in [Library - Carpentry](https://github.com/LibraryCarpentry/week-four-library-carpentry/blob/master/lesson-materials/Basic-OpenRefine-functions-I.md) + Carpentry](https://librarycarpentry.org/lc-open-refine/) 2. OpenRefine in [Data Carpentry for Ecology](http://www.datacarpentry.org/OpenRefine-ecology-lesson/) From e54588988de6e1b1f8c3ff147c5f763a6d416612 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Mon, 9 Mar 2020 11:58:20 +0800 Subject: [PATCH 17/31] Updated front matter --- _things/thing-21.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/_things/thing-21.md b/_things/thing-21.md index 0779d70..1b01c0f 100644 --- a/_things/thing-21.md +++ b/_things/thing-21.md @@ -7,10 +7,10 @@ description: | Dig in to dirty data.  What is it?  Why should we care? Try your hand at using an open source data cleansing tool. overview: - Getting started: Pregnant men? The data says so! Getting down and dirty with data + Getting started: Data horror stories! 
Getting down and dirty with data Learn more: Turn a PDF ‘tabula rasa’ into usable data with Tabula Challenge me: OpenRefine is a powerful tool for cleaning up lots of  dirty data ---- + --- ## Getting started ### Dirty data stories From 1c7482f9518fd648ec16416731bb2e52eec34143 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Mon, 9 Mar 2020 12:00:49 +0800 Subject: [PATCH 18/31] Fixed front matter --- _things/thing-21.md | 9 +++++---- 1 file changed, 5 insertions(+), 4 deletions(-) diff --git a/_things/thing-21.md b/_things/thing-21.md index 1b01c0f..1b65a97 100644 --- a/_things/thing-21.md +++ b/_things/thing-21.md @@ -7,10 +7,11 @@ description: | Dig in to dirty data.  What is it?  Why should we care? Try your hand at using an open source data cleansing tool. overview: - Getting started: Data horror stories! Getting down and dirty with data - Learn more: Turn a PDF ‘tabula rasa’ into usable data with Tabula - Challenge me: OpenRefine is a powerful tool for cleaning up lots of  dirty data - --- + Getting started: Data horror stories! 
Getting down and dirty with data + Learn more: Turn a PDF ‘tabula rasa’ into usable data with Tabula + Challenge me: OpenRefine is a powerful tool for cleaning up lots of  dirty data +--- + ## Getting started ### Dirty data stories From c9da4bf3576d404f266f668e9d67975549421d65 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 08:34:39 +0800 Subject: [PATCH 19/31] Removed reference to the Things --- _things/thing-22.md | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) diff --git a/_things/thing-22.md b/_things/thing-22.md index 465c155..6ee0d96 100644 --- a/_things/thing-22.md +++ b/_things/thing-22.md @@ -28,8 +28,7 @@ research alphabet soup) are: - Universities - 41 universities generate data, graduate and train new researchers (ANU, UWA, UQ, UTas, UNSW etc) -- CSIRO - have visited the CSIRO Data Access Portal several times in - 23(rd) Things +- CSIRO - Australia's national science agency - Funders - ARC and NHMRC - Governments - state and federal departments fund research and produce their own data (e.g. Bureau of Meteorology for weather, @@ -154,4 +153,4 @@ min) to see how it can be used to build national or small maps, particularly for social science and health. **Consider:** how researchers can use NCRIS facilities to help generate, -manage, share and publish their data. \ No newline at end of file +manage, share and publish their data. From 2c864866faf2f9777f7f9c5df4a5dda2f9f85e92 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 09:10:06 +0800 Subject: [PATCH 20/31] Remove reference to old orgs In Getting started - removed references to Nectar, ANDS, and RDS. 
--- _things/thing-22.md | 8 ++------ 1 file changed, 2 insertions(+), 6 deletions(-) diff --git a/_things/thing-22.md b/_things/thing-22.md index 6ee0d96..79bc46f 100644 --- a/_things/thing-22.md +++ b/_things/thing-22.md @@ -48,13 +48,9 @@ Let’s focus on NCRIS - it’s amazing. NCRIS is designed to take a national approach to providing the world’s best research infrastructure for Australia. NCRIS facilities provide storage for data ([RDS](http://www.rds.edu.au/)), research computer -networking across Australia ([AARNet](https://www.aarnet.edu.au/)), -tools and virtual laboratories for researchers -([NeCTAR](https://nectar.org.au/)), very, very big data crunching +networking across Australia ([AARNet](https://www.aarnet.edu.au/)), very, very big data crunching ([NCI](https://nci.org.au/)), as well as lots of specialised research -facilities. ANDS is also one of the 27 NCRIS facilities. In 2018 ANDS, -Nectar and RDS were combined to create the Australian Research Data -Commons ([ARDC](ardc.edu.au "ARDC")). +facilities. This includes the Australian Research Data Commons ([ARDC](ardc.edu.au "ARDC")). - Browse over some [NCRIS case studies](http://www.education.gov.au/ncris-case-studies) to get an From fc467690378f6eb8d7a15003d56e6d9491393508 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 09:21:19 +0800 Subject: [PATCH 21/31] Fixed self link Not sure if linking to the github page is correct though? 
--- _things/thing-23.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/_things/thing-23.md b/_things/thing-23.md index 2a71ce0..ecc47f6 100644 --- a/_things/thing-23.md +++ b/_things/thing-23.md @@ -37,7 +37,7 @@ Join an online group - Research Data Alliance - international [interest and working groups](https://rd-alliance.org/groups) on a wide range of data topics -- ALIA[special interest +- ALIA [special interest groups](https://www.alia.org.au/our-communities/alia-groups) - data is increasingly a normal part of every interest group - [RESEARCH-DATAMAN](https://www.jiscmail.ac.uk/cgi-bin/webadmin?A0=RESEARCH-DATAMAN) @@ -61,12 +61,12 @@ Hacky Hours are often fun - just search ‘hacky hour’ in your browser Get social with Twitter -@ARDC\_au  @openaccess\_oz   @ScientificData   @UKDataArchive   \#ResearchData   @resdatall +@ARDC_au  @openaccess_oz   @ScientificData   @UKDataArchive   #ResearchData   @resdatall ### Keep on learning through “Do It Yourself“ - Bookmark [23 (research data) - Things](https://www.ands.org.au/working-with-data/skills/23-research-data-things "23(research data) Things") + Things](https://github.com/au-research/ARDC-23-things "23 (research data) Things") and make a regular date with yourself to refresh your skills - [Managing Data @Melbourne](https://library.unimelb.edu.au/Digital-Scholarship/training_and_outreach/data "Managing data @Melbourne") @@ -201,4 +201,4 @@ with your supervisor. 
- Download your [participation certificate](https://www.ands.org.au/__data/assets/word_doc/0004/590773/ParticipationCertificate.docx "23(research data) Things participation certificate") -- Update your PD records and ALIA PD points \ No newline at end of file +- Update your PD records and ALIA PD points From db4a2fc4a40ddb0e1122510a0e4afe63244161a7 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 09:25:18 +0800 Subject: [PATCH 22/31] Updated links in Get data tech-savvy --- _things/thing-23.md | 8 ++------ 1 file changed, 2 insertions(+), 6 deletions(-) diff --git a/_things/thing-23.md b/_things/thing-23.md index ecc47f6..5f22b62 100644 --- a/_things/thing-23.md +++ b/_things/thing-23.md @@ -94,15 +94,11 @@ boundaries of data management: - Start by checking out the **Challenge me** activities for 23Things and grow your tech-data literacy skills -- Carpentry courses are offered around Australia by various data - community groups (for an overview see [Challenge me: Thing - 17](https://www.ands.org.au/working-with-data/skills/23-research-data-things/all23/thing-17#t17-challenge "thing 17 - Challenge me") - of 23 (research data) Things +- [The Carpentries](https://carpentries.org/) courses are offered around Australia by various community groups - Get on the email lists for [ResBaz](https://resbazblog.wordpress.com/ "ResBaz blog"), [GovHack](https://www.govhack.org/ "GovHack") -- [](http://us7.campaign-archive2.com/home/?u=b542ef52e49302569068046d9&id=22b849a4ee)[ARDC - News](https://ands-nectar-rds.us7.list-manage.com/track/click?u=b542ef52e49302569068046d9&id=603e0435ba&e=08d3e46578) +- [ARDC News](https://us7.campaign-archive.com/home/?u=b542ef52e49302569068046d9&id=22b849a4ee) (ARDC fortnightly newsletter) has lots of webinars and tech-focused events From b30267763780aea0bd1576475851d9760ffdad77 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 
11 Mar 2020 09:39:10 +0800 Subject: [PATCH 23/31] Updated CSU link --- _things/thing-23.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/_things/thing-23.md b/_things/thing-23.md index 5f22b62..cfbb090 100644 --- a/_things/thing-23.md +++ b/_things/thing-23.md @@ -27,7 +27,7 @@ this list! ### Keep in touch and keep on learning - [Sign up to ARDC - News](https://ands-nectar-rds.us7.list-manage.com/track/click?u=b542ef52e49302569068046d9&id=603e0435ba&e=08d3e46578) + News](https://us7.campaign-archive.com/home/?u=b542ef52e49302569068046d9&id=22b849a4ee) (ARDC fortnightly newsletter) to find out about events near you. Join an online group @@ -92,7 +92,7 @@ Get social with Twitter Opportunities abound for those who want to stretch the technical boundaries of data management: -- Start by checking out the **Challenge me** activities for 23Things +- Start by checking out the **Challenge me** activities for 23 (research data) Things and grow your tech-data literacy skills - [The Carpentries](https://carpentries.org/) courses are offered around Australia by various community groups - Get on the email lists for @@ -104,7 +104,7 @@ boundaries of data management: ### Get serious about your data future -Now you have completed 23 (research data) things think about:  Will you +Now you have completed 23 (research data) Things think about:  Will you help to fill the growing shortage of people with data skills in the one of the [sexiest jobs of the 21st century](http://www.smh.com.au/national/tertiary-education/data-scientists-are-the-rock-stars-of-business-20150819-gj2i0r.html)? @@ -144,7 +144,7 @@ specifically in Data Management are also available e.g. 
**Charles Sturt University**[**Graduate Certificate and Masters Specialisation in Data -Management**](http://www.csu.edu.au/infostudies/ "Graduate Certificate and Masters Specialisation in Data Management") +Management**](https://study.csu.edu.au/courses/library-information-studies/graduate-certificate-data-management "Graduate Certificate and Masters Specialisation in Data Management") From 2017 onwards the School of Information Studies at Charles Sturt University will be offering a Graduate Certificate in Data Management From 0a193a96d7d093206e3f1590dedc0e63292f6da8 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 09:52:01 +0800 Subject: [PATCH 24/31] Changed the CAUL link No longer exists. Replaced with paper by Samantha Searle, et. al. --- _things/thing-23.md | 3 +-- 1 file changed, 1 insertion(+), 2 deletions(-) diff --git a/_things/thing-23.md b/_things/thing-23.md index cfbb090..8174877 100644 --- a/_things/thing-23.md +++ b/_things/thing-23.md @@ -110,8 +110,7 @@ of the [sexiest jobs of the 21st century](http://www.smh.com.au/national/tertiary-education/data-scientists-are-the-rock-stars-of-business-20150819-gj2i0r.html)? And, data jobs aren’t always mathematical or statistical! 
-- Data management and the role of the data - [librarians](http://www.caul.edu.au/caul-programs/research/data-management "Data management and the role of the data librarians") +- [Librarians as partners in research data service](https://doi.org/10.1108/PROG-02-2015-0013 "Librarians as partners in research data service") - Fiona Tweedie is a fascinating example of someone who started her career in Roman History and is now [Data Scientist for the Australian From c4834bdd3eee5a78bc8683e6a527d2edbc3143ec Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 11:05:10 +0800 Subject: [PATCH 25/31] Minor format change --- _things/thing-23.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/_things/thing-23.md b/_things/thing-23.md index 8174877..3024834 100644 --- a/_things/thing-23.md +++ b/_things/thing-23.md @@ -134,7 +134,7 @@ for everyone. Clearinghouse](http://dmtclearinghouse.esipfed.org "ESIP Data Management Training Clearinghouse") - Free online [MOOC on Big Data Visualization](https://www.futurelearn.com/courses/big-data-visualisation "MOOC on big data visualisation") -- Free online MOOC on [Big Data Measuring and predicting human +- Free online [MOOC on Big Data Measuring and predicting human behaviour](https://www.futurelearn.com/courses/big-data) Sometimes there are individual data units offered within a discipline From cd822507fd0b0047e78c03b49bb442d16ac02cee Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 11:08:44 +0800 Subject: [PATCH 26/31] Removed CSU unit names The course numbers had changed and the names slightly. Best to remove it for future proofing. 
--- _things/thing-23.md | 21 ++++----------------- 1 file changed, 4 insertions(+), 17 deletions(-) diff --git a/_things/thing-23.md b/_things/thing-23.md index 3024834..4d909db 100644 --- a/_things/thing-23.md +++ b/_things/thing-23.md @@ -141,32 +141,19 @@ Sometimes there are individual data units offered within a discipline degree, others have a number of electives or majors. Some courses specifically in Data Management are also available -e.g. **Charles Sturt University**[**Graduate Certificate and Masters +e.g. **Charles Sturt University** [**Graduate Certificate and Masters Specialisation in Data Management**](https://study.csu.edu.au/courses/library-information-studies/graduate-certificate-data-management "Graduate Certificate and Masters Specialisation in Data Management") -From 2017 onwards the School of Information Studies at Charles Sturt -University will be offering a Graduate Certificate in Data Management -(GCDM). As practicing information professionals or researchers, formal +The School of Information Studies at Charles Sturt University will be +offering a Graduate Certificate in Data Management(GCDM). +As practicing information professionals or researchers, formal study in Data Management will enable you to demonstrate you can fill an identified professional skill and knowledge gap. Completion of the GCDM qualification provides evidence you have studied the fundamental principles, key concepts, tools, standards, and techniques of data curation and data management. -The GCDM will comprise of four subjects – the first three subjects are -core and are: - -- INF461 Introduction to Data Management: Governance, Standards and - Ethics; -- INF462 Data Curation; and, -- INF463 Introduction to Data Tools and Analytics. 
- -For the 4^th^ subject, students are offered a choice of either: - -- INF449 Research Data Management; or, -- INF522 Project Management in Information Agencies - This Graduate Certificate is offered part time over one year (two subjects per session) and is completely online. This mode provides students with the flexibility to study at a time and place that suits From 0bada49f4683c337a7070ea6ee9bce668de2492b Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 11:12:54 +0800 Subject: [PATCH 27/31] Fixed wording in last section. --- _things/thing-23.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/_things/thing-23.md b/_things/thing-23.md index 4d909db..7f063bb 100644 --- a/_things/thing-23.md +++ b/_things/thing-23.md @@ -165,10 +165,10 @@ Information Studies at CSU, you can elect to do the subjects as above to graduate with a Specialisation in Data Management: a Master of Information Studies (Data Management). -### How was your 23 (rd) Things adventure? +### How was your 23 (research data) Things adventure? -You don't have to have done all the 23 Things - 23T was about your -learning, not completing all 23, whatever number of Things you did is +You don't have to have done all the 23 Things - this is about your +learning, not completing all 23. Whatever number of Things you did is fine. It's time to reflect on what you did, what you learned and what you got From 87da893f3088d4e1d6e3d058794f4c818f375c88 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 11:27:35 +0800 Subject: [PATCH 28/31] Update thing-23.md --- _things/thing-23.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/_things/thing-23.md b/_things/thing-23.md index 7f063bb..055c5f7 100644 --- a/_things/thing-23.md +++ b/_things/thing-23.md @@ -145,8 +145,8 @@ e.g. 
**Charles Sturt University** [**Graduate Certificate and Masters Specialisation in Data Management**](https://study.csu.edu.au/courses/library-information-studies/graduate-certificate-data-management "Graduate Certificate and Masters Specialisation in Data Management") -The School of Information Studies at Charles Sturt University will be -offering a Graduate Certificate in Data Management(GCDM). +The School of Information Studies at Charles Sturt University +offers a Graduate Certificate in Data Management(GCDM). As practicing information professionals or researchers, formal study in Data Management will enable you to demonstrate you can fill an identified professional skill and knowledge gap. Completion of the GCDM From 4711d2bf802d41a6e554c0973c5852eb977a4c40 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 11:37:13 +0800 Subject: [PATCH 29/31] Changed ANDS to ARDC For all the DOI minting --- _things/thing-08.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/_things/thing-08.md b/_things/thing-08.md index b322928..2eeac68 100644 --- a/_things/thing-08.md +++ b/_things/thing-08.md @@ -87,7 +87,7 @@ settings?  Why, or why not? ## Challenge me ### Minting DOIs for research data -The ANDS DOI service (Cite My Data) enables research organisations to +The ARDC DOI service enables research organisations to assign Digital Object Identifiers (DOIs) to research datasets and collections as well as well as associated software, workflows, models and grey literature. The service is intended for use by publicly funded @@ -96,7 +96,7 @@ machine-to-machine or manual service. 1. Start by reading an overview of the [ARDC DOI service](https://ardc.edu.au/services/identifier/doi/ "DOI service") - and perhaps the ANDS DOI service [technical + and perhaps the ARDC DOI service [technical documentation](https://documentation.ands.org.au/display/DOC/DOI+Service). 2. 
Consider also the need for guidelines around when a DOI will be minted, by whom and how DOIs will be managed to ensure long term @@ -108,9 +108,9 @@ machine-to-machine or manual service. and Management Guide](https://www2.griffith.edu.au/__data/assets/pdf_file/0039/198795/Griffith-University-DOI-guide_1.1-Rev.pdf "Griffith University DOI Guide") -### Want to know more about the ANDS DOI minting service for data, and now grey literature? +### Want to know more about the ARDC DOI minting service for data, and now grey literature? -**Webinar: ANDS expanded DOI service: scope, requirements and FAQs**. +**Webinar: ARDC expanded DOI service: scope, requirements and FAQs**. Watch the [recording](https://youtu.be/FZPbhHI-a_c "Recording") or go through the From 5f51f6048633482927a0e7074a9fbee133489080 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 11:51:47 +0800 Subject: [PATCH 30/31] Minor formating changes --- _things/thing-17.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/_things/thing-17.md b/_things/thing-17.md index e7a3539..163b420 100644 --- a/_things/thing-17.md +++ b/_things/thing-17.md @@ -84,18 +84,18 @@ if we have such diverse needs. ### DIY to build your own data literacy Technical data wrangling skills are fast becoming a new form of data -literacy.  The four Carpentry concepts have really taken off over the +literacy. The four Carpentry concepts have really taken off over the past couple of years. -The “hour a week” for 23 (rd) Things doesn’t allow time to do any of the +The “hour a week” for 23 (research data) Things doesn’t allow time to do any of the Carpentry modules (most need a couple of hours to complete) but here’s a chance to find out what all the Carpentry buzz is about. Have a look at [this Environmental scan](https://docs.google.com/spreadsheets/d/1ZASH-NhsvXnyZNJkGzFNFp5_adn6Q-iX2N-tbv1Apbw/edit#gid=0) -of courses offering data skills. 
Then [choose 1 Carpentry to explore in +of courses offering data skills. Then choose 1 Carpentry to explore in depth or browse over them -all:] +all: 1. [Data Carpentry](http://www.datacarpentry.org) 2. [Library Carpentry](http://librarycarpentry.github.io/about/) From bf79add0e16233c8734588277aee1a7b4659d176 Mon Sep 17 00:00:00 2001 From: Andrew Kelly <61399567+edwardshaddow@users.noreply.github.com> Date: Wed, 11 Mar 2020 14:12:58 +0800 Subject: [PATCH 31/31] Removed the youtube playlist No one knows what it was - playlist has gone. --- _things/thing-22.md | 7 +------ 1 file changed, 1 insertion(+), 6 deletions(-) diff --git a/_things/thing-22.md b/_things/thing-22.md index 79bc46f..d18832c 100644 --- a/_things/thing-22.md +++ b/_things/thing-22.md @@ -133,14 +133,9 @@ published. **Option 1: Virtual Laboratories** -Explore one or more of the Nectar Virtual Laboratories - who uses these +Explore one or more of the [Nectar Virtual Laboratories](https://nectar.org.au/labs-and-tools/) - who uses these laboratories and what do they do? -- Explore in [words](https://nectar.org.au/labs-and-tools/) -- Explore in - [videos](https://www.youtube.com/playlist?list=PLkBeePYo-_VCwaNDa9q3Y4PtM71PJ7UB6) -- Pick one or two and explore in depth - **Option 2: Australian Urban Research Infrastructure Network (AURIN)** AURIN is a powerful mapping tool with a wide range of inbuilt datasets.