Learn about the key players in Australia’s research data management ecosystem and how these players combine to make generation, management and publication big data possible.
- Getting started: sample some acronym soup
- Learn more: it doesn’t get much bigger than a telescope which is a square kilometre big!
- Challenge me: explore some virtual laboratories and continental maps
Thing 22: What's in a name?
Data is at the heart of the Australian national innovation agenda. The key players who help enable the innovation agenda float in an acronym soup! Let’s find out who’s in the soup….
Read this article that explains the Australian Government initiatives to foster innovation through publishing and sharing data.
Put very! simply the main players in the Australian research landscape (a.k.a. research alphabet soup…) are:
- Universities - our 41 universities generate data, graduate and train new researchers (at ANU, UWA, UQ, UTas, UNSW etc)
- CSIRO - we have visited the CSIRO DAP (Data Access Portal) several times in 23(rd) Things
- Funders - ARC and NHMRC
- Governments - state and federal departments fund research and produce own data (e.g. Bureau of Meteorology for weather, Geoscience Australia for geoscience, ABS for statistics etc)
- Medical Research Institutes (AAMRI) and hospitals - conduct research and produce data
- Businesses reuse research and government data and generate their own data. BCA (Business Council of Australia) brings together the big names in data production and use
- NCRIS (National Collaborative Research Infrastructure Strategy) is funded by the Australian government to drive research excellence and collaboration between 35,000 researchers, government and industry to deliver practical outcomes
Let’s focus on NCRIS - it’s amazing. NCRIS is designed to take a national approach to providing the world’s best research infrastructure for Australia. NCRIS facilities provide storage for data (RDS), research computer networking across Australia (AARNET), tools and virtual labs for researchers (NeCTAR), very, very big data crunching (NCI), as well as lots of specialised research facilities. ANDS, who is putting together these 23 (rd) Things, is one of the 27 NCRIS facilities.
- Browse over some NCRIS case studies to get an idea of what data and activities are produced by NCRIS facilities.
2. Just for fun:
- Fancy some Snakes and Ladders? Find out how some of the Australian infrastructure players fit together: Print the game and start playing!
- Play the Australian Research Data Acronym quiz and get your acronym literacy score! Created by the ORCID group for a bit of fun.
For NZ: What's happening in NZ?
- NESI (New Zealand eScience Infrastructure) is New Zealand's computing research infrastructure, providing high performance computers and support systems to enable the country's researchers to tackle the world's largest problems
- Declaration on Open and Transparent Government The New Zealand government wants public services to be radically transformed for the benefit of all New Zealanders - and ICT is a key tool that will make this possible.
Thing 22: What's in a name?
We often hear terms such as “big data” and “data deluge”.
And it doesn’t get much bigger than astronomy and satellite data! Let’s look at 2 big data projects that are only possible because of national collaborations - most of whom love acronyms!.
Option 1: ASKAP - reaching for the stars
The Australian Square Kilometre Array Pathfinder (ASKAP) telescope is made of 36 identical 12-metre wide dish antennas. These produce some 2.5 gigabytes of data per second, equivalent to 75 petabytes per year!
Learn about managing this big data project by hearing from the researchers who use ASKAP - watch this stunning 3min video.
Storing, analysing, managing and publishing such big data usually requires a collaborative effort across a number of organisations. In the case of ASKAP data for example:
- the data is captured at the Murchison Radio-astronomy Observatory (MRO), approximately 315 km northeast of Geraldton in WA.
- It is then transmitted 730km to the Pawsey Supercomputing Centre in Perth where it is processed and stored.
- Processed data is then published via the CSIRO DAP (Data Access Porta)l with metadata records harvested to RDA (Research Data Australia) to enhance discovery.
It’s hard to imagine just how many organisations work together to make this apparently simple workflow possible. Consider for example:
- funding bodies (eg NCRIS, federal and state governments)
- research organisations (eg CSIRO, Swinburne University, University of WA),
- infrastructure providers (AARNET, Pawsey Supercomputing Centre, ANDS)
Option 2: the Data Cube- satellites mapping the earth
The Australian Geoscience Data Cube is another example where collaboration between a number of organisations has resulted in new ways of organising, analysing and managing the large amounts of data collected from Earth Observation Satellites (EOS).
This wonderful 4.35min video explains the concept behind Geoscience Australia's Data Cube and the partners who made it possible. By collaborating, Australian organisations including GA (Geoscience Australia), CSIRO and the NCI (National Computational Infrastructure) have created an outcome that any one of these organisations in isolation could never have achieved.
Consider: why big data is often publicly available, yet the so called 'long tail of research data' (smaller data sets) are often not published.
Thing 22: What's in a name?
Option 1: Virtual Laboratories
Explore one or more of the NeCTAR Virtual Laboratories - who uses these labs and what do they do?
Option 2: AURIN
AURIN is a powerful mapping tool with a wide range of inbuilt datasets. Watch this 3.44min “how to use AURIN” video to see how it can be used to build national or small maps, particularly for social science and health.
Consider: how researchers can use NCRIS facilities to help gereate, manage, share and publish their data.