-
Notifications
You must be signed in to change notification settings - Fork 36
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #329 from ptsefton/use-cases-2024-07
Use cases 2024 07
- Loading branch information
Showing
11 changed files
with
856 additions
and
47 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,20 +1,20 @@ | ||
--- | ||
page_id: arkisto | ||
redirect_from: | ||
- /in-use/arkisto | ||
title: Arkisto | ||
description: Arkisto uses RO-Crate for packaging data objects in the 3 uses cases described below, Modern PARADISEC, UTS Research Data Repository and UTS Cultural Datasets. As part of these use-cases they have been developing or enhancing their tooling to facilitate their use of RO-Crate | ||
description: Arkisto was a project website that aimed to outline a principled approach to research data management. | ||
url: https://arkisto-platform.github.io/ | ||
image: arkisto.png | ||
domains: [] | ||
tasks: [data_handling] | ||
roles: [data_steward, information_architect, repository_manager] | ||
--- | ||
|
||
## Arkisto | ||
|
||
The basis of Arkisto is that the long-term preservability of well-described data is always the first consideration. | ||
Data on an Arkisto deployment is alway available on disc (or object storage) with a complete description independently of any services such as websites or APIs. Once the data is safe and well described, Arkisto has a flexible model for how data can be accessed using a variety of services. | ||
Arkisto is built on top of Research Object Crate (RO-Crate) and the Oxford Common File System Layout (OCFL). | ||
With Arkisto there is no messy data migration. | ||
Arkisto was a project website that aimed to outline a principled approach to research data management. The site is no longer being maintained. | ||
|
||
The work done on Arkisto has been developed into a set of protocols: Protocols for Implementing Long-term Archival Repositories Services (PILARS) <http://w3id.org/ldac/pilars>. See the [RRKive](rrkive) and [PILARS](pilars) pages for more information. | ||
|
||
Arkisto was built on top of Research Object Crate (RO-Crate) and the Oxford Common File System Layout (OCFL). | ||
|
||
<img src="assets/img/arkisto.png" alt="Arkisto logo" width="300px"> |
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -2,14 +2,257 @@ | |
page_id: paradisec | ||
redirect_from: | ||
- /in-use/paradisec | ||
title: PARADISEC | ||
description: Modern PARADISEC demonstrates the use of RO-Crate to describe the collections and items | ||
url: https://arkisto-platform.github.io/case-studies/paradisec/ | ||
title: "PARADISEC" | ||
description: Pacific and Regional Archive for Digital Sources in Endangered Cultures (PARADISEC) holds 16,100 hours of audio recordings and 2,800 hours of video recordings that might otherwise have been lost. These recordings are of performance, narrative, singing, and other oral tradition. This amounts to over 220 terabytes, and represents 1,370 languages, mainly from the Pacific region. | ||
url: https://www.paradisec.org.au/ | ||
domains: [language_text] | ||
tasks: [data_handling] | ||
roles: [information_architect, repository_manager] | ||
--- | ||
|
||
# Modern PARADISEC | ||
# PARADISEC: Pacific and Regional Archive for Digital Sources in Endangered Cultures | ||
|
||
[Modern PARADISEC](https://arkisto-platform.github.io/case-studies/paradisec/) demonstrates the use of RO-Crate to describe the collections and items. The demonstrator includes an elastic search service and a webserver but the key feature is that it keeps working with only the filesystem and a webserver. | ||
|
||
|
||
## PARADISEC and the Language Data Commons of Australia | ||
|
||
From 2022 - 2028 PARADISEC is working with the national project the [Language Data Commons of Australia (LDACA)] to build accessible language material from Australia and the Pacific region. This work allows us to plan more digitisation projects and to add textual transcripts and summaries to audio items in the collection. | ||
|
||
PARADISEC has an architecture which ensures that all data is stored in commodity storage with metadata adjacent to the files. This way of working has proved to be a low-cost and robust way to maintain an archive and this has been an inspiration for the [PILARS: Protocols for Implementing Long-term Archival Repository Services]. PARADISEC is currently undergoing an update where metadata is being migrated to RO-Crate which will be stored alongside materials. | ||
|
||
PARADISEC uses the same API as the [Language Data Commons of Australia (LDACA)] which can serve RO-Crate. | ||
|
||
[Language Data Commons of Australia (LDACA)]: ldaca | ||
[PILARS: Protocols for Implementing Long-term Archival Repository Services]: pilars | ||
|
||
|
||
## Example Crate | ||
|
||
``` | ||
{ | ||
"@context": [ | ||
"https://w3id.org/ro/crate/1.2-DRAFT/context", | ||
{ | ||
"@vocab": "http://schema.org/" | ||
}, | ||
"http://purl.archive.org/language-data-commons/context.json", | ||
{ | ||
"Geometry": "http://www.opengis.net/ont/geosparql#Geometry", | ||
"asWKT": "http://www.opengis.net/ont/geosparql#asWKT" | ||
}, | ||
"https://w3id.org/ldac/context" | ||
], | ||
"@graph": [ | ||
{ | ||
"@id": "https://catalog.paradisec.org.au/repository/RL1/011", | ||
"@type": [ | ||
"Data", | ||
"Object", | ||
"RepositoryObject" | ||
], | ||
"conformsTo": { | ||
"@id": "https://w3id.org/ldac/profile#Object" | ||
}, | ||
"additionalType": "item", | ||
"identifier": [ | ||
{ | ||
"@id": "#identifier_domain" | ||
}, | ||
{ | ||
"@id": "#identifier_id" | ||
}, | ||
{ | ||
"@id": "#identifier_itemIdentifier" | ||
}, | ||
{ | ||
"@id": "#identifier_doi" | ||
}, | ||
{ | ||
"@id": "#identifier_collectionIdentifier" | ||
} | ||
], | ||
"name": "Singing and Enchanting, Conversations on Traditional tribal battles", | ||
"description": "Singing and enchanting by women and general conversations with group of men mainly on traditional feuds and tribal battles by warring groups.\r\n(Steven Gagau, May 2018)", | ||
"contentLocation": [ | ||
{ | ||
"@id": "#place-Kiriwina" | ||
} | ||
], | ||
"dateCreated": "2018-05-09", | ||
"dateModified": "2019-11-20", | ||
"datePublished": "2019-11-20", | ||
"publisher": { | ||
"@id": "http://nla.gov.au/nla.party-811611" | ||
}, | ||
"private": false, | ||
"inLanguage": [ | ||
{ | ||
"@id": "#language-kij" | ||
} | ||
], | ||
"subjectLanguages": [ | ||
{ | ||
"@id": "#language-kij" | ||
} | ||
], | ||
"countries": [ | ||
{ | ||
"@id": "#country-Papua New Guinea" | ||
} | ||
], | ||
"license": { | ||
"@id": "#license-3" | ||
}, | ||
"memberOf": { | ||
"@id": "https://catalog.paradisec.org.au/repository/RL1" | ||
}, | ||
"bornDigital": false, | ||
"digitisedOn": "2018-04-27", | ||
"external": false, | ||
"languageAsGiven": "Kiriwina", | ||
"metadataExportable": true, | ||
"tapesReturned": false, | ||
"hasPart": [ | ||
{ | ||
"@id": "https://catalog.paradisec.org.au/repository/RL1/011/RL1-011-A.mp3" | ||
}, | ||
{ | ||
"@id": "https://catalog.paradisec.org.au/repository/RL1/011/RL1-011-A.wav" | ||
} | ||
], | ||
"translator": [ | ||
{ | ||
"@id": "#person-4871" | ||
} | ||
] | ||
}, | ||
{ | ||
"@id": "ro-crate-metadata.json", | ||
"@type": "CreativeWork", | ||
"conformsTo": { | ||
"@id": "https://w3id.org/ro/crate/1.2-DRAFT" | ||
}, | ||
"about": { | ||
"@id": "https://catalog.paradisec.org.au/repository/RL1/011" | ||
} | ||
}, | ||
{ | ||
"@id": "http://nla.gov.au/nla.party-595489", | ||
"@type": "Organisation", | ||
"name": "Australian National University" | ||
}, | ||
{ | ||
"@id": "#identifier_domain", | ||
"@type": "PropertyValue", | ||
"name": "domain", | ||
"value": "paradisec.org.au" | ||
}, | ||
{ | ||
"@id": "#identifier_id", | ||
"@type": "PropertyValue", | ||
"name": "id", | ||
"value": "RL1-011" | ||
}, | ||
{ | ||
"@id": "#identifier_itemIdentifier", | ||
"@type": "PropertyValue", | ||
"name": "itemIdentifier", | ||
"value": "011" | ||
}, | ||
{ | ||
"@id": "#identifier_doi", | ||
"@type": "PropertyValue", | ||
"name": "doi", | ||
"value": "10.4225/72/5afda71574a1c" | ||
}, | ||
{ | ||
"@id": "#identifier_collectionIdentifier", | ||
"@type": "PropertyValue", | ||
"name": "collectionIdentifier", | ||
"value": "RL1" | ||
}, | ||
{ | ||
"@id": "#country-Papua New Guinea", | ||
"@type": "Country", | ||
"code": "PG", | ||
"name": "Papua New Guinea" | ||
}, | ||
{ | ||
"@id": "#license-3", | ||
"@type": "CreativeWork", | ||
"name": "Closed (subject to the access condition details)" | ||
}, | ||
{ | ||
"@id": "http://nla.gov.au/nla.party-811611", | ||
"@type": "Person", | ||
"familyName": "Lawton", | ||
"givenName": "Ralph", | ||
"name": "Ralph Lawton", | ||
"email": "[email protected]" | ||
}, | ||
{ | ||
"@id": "#person-4871", | ||
"@type": "Person", | ||
"familyName": "Hull", | ||
"givenName": "Grace", | ||
"name": "Grace Hull" | ||
}, | ||
{ | ||
"@id": "https://catalog.paradisec.org.au/repository/RL1/011/RL1-011-A.mp3", | ||
"@type": "File", | ||
"contentSize": 17315968, | ||
"dateCreated": "2018-05-11", | ||
"dateModified": "2018-05-18", | ||
"encodingFormat": "audio/mpeg", | ||
"name": "RL1-011-A.mp3", | ||
"doi": "10.4225/72/5afda751b05d1", | ||
"bitrate": 127718, | ||
"duration": 1084.63, | ||
"channels": 2, | ||
"sampleRate": 44100 | ||
}, | ||
{ | ||
"@id": "https://catalog.paradisec.org.au/repository/RL1/011/RL1-011-A.wav", | ||
"@type": "File", | ||
"contentSize": 625002484, | ||
"dateCreated": "2018-05-11", | ||
"dateModified": "2018-05-18", | ||
"encodingFormat": "audio/vnd.wav", | ||
"name": "RL1-011-A.wav", | ||
"doi": "10.4225/72/5afda74dd11d2", | ||
"bitrate": 4609917, | ||
"duration": 1084.62, | ||
"channels": 2, | ||
"sampleRate": 96000 | ||
}, | ||
{ | ||
"@id": "#geo-151.0,-8.6-151.2,-8.4", | ||
"@type": "Geometry", | ||
"asWKT": "POLYGON((151.0 -8.4, 151.2 -8.4, 151.2 -8.6, 151.0 -8.6, 151.0 -8.4))" | ||
}, | ||
{ | ||
"@id": "#geo-150.294,-8.879-151.2,-8.32", | ||
"@type": "Geometry", | ||
"asWKT": "POLYGON((150.294 -8.32, 151.2 -8.32, 151.2 -8.879, 150.294 -8.879, 150.294 -8.32))" | ||
}, | ||
{ | ||
"@id": "#place-Kiriwina", | ||
"@type": "Place", | ||
"geo": { | ||
"@id": "#geo-151.0,-8.6-151.2,-8.4" | ||
}, | ||
"name": "Kiriwina" | ||
}, | ||
{ | ||
"@id": "#language-kij", | ||
"@type": "Language", | ||
"code": "kij", | ||
"location": { | ||
"@id": "#geo-150.294,-8.879-151.2,-8.32" | ||
}, | ||
"name": "Kilivila" | ||
} | ||
] | ||
} | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,26 @@ | ||
--- | ||
page_id: pilars | ||
redirect_from: | ||
- /in-use/arkisto | ||
title: "PILARS" | ||
description: PILARS is a set of Protocols for Implementing Long-term Archival Repository Services | ||
url: https://w3id.org/ldac/pilars/ | ||
image: | ||
domains: [] | ||
tasks: [data_handling] | ||
roles: [data_steward, information_architect, repository_manager] | ||
--- | ||
|
||
# PILARS | ||
|
||
The Protocols for Implementing Long-term Archival Repository Services (PILARS, <http://w3id.org/ldac/pilars>) are inspired by the continuing success of the technical approach taken over two decades by [PARADISEC](paradisec). PARADISEC houses cultural heritage material from more than 1360 languages with standard metadata, with data stored in commodity services (initially files on disk, now objects in a cloud storage service), and metadata adjacent to the data. PILARS was developed through work with the Language Data Commons of Australia to generalise the PARADISEC approach to other disciplines. | ||
|
||
PILARS is aimed at IT practitioners, archivists, librarians, researchers and infrastructure managers involved in long-term data management. The protocols are intended to be complementary to the existing practices and principles of those disciplines. | ||
|
||
## Rationale | ||
|
||
In a research context it is important to be able to support the FAIR principles, ensuring that data is well described by metadata, is identified with persistent identifiers, and that shared services with good governance are in place to store interoperable data, to make it findable and provide appropriate access controls. | ||
|
||
These protocols could form the basis for design, evaluation or procurement of archival repository-services, but also allow for data custodians to begin organising data in a format ready for archiving and digital preservation as long as they have access to some kind of commodity storage, by using a range of tools. | ||
|
||
The PILARS Protocols call for the implementation of a linked-data metadata standard for describing data; RO-Crate represents one option for this. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
--- | ||
page_id: rrkive | ||
redirect_from: | ||
- /in-use/arkisto | ||
title: RRkive | ||
description: RRKive is a website aimed to outline a principled approach to research data management with guidance on data storage and metadata | ||
url: https://rrkive.org/ | ||
image: rrkive.png | ||
domains: [] | ||
tasks: [data_handling] | ||
roles: [data_steward, information_architect, repository_manager] | ||
--- | ||
|
||
# RRKive | ||
|
||
|
||
The RRKive site, managed by the Language Data Commons of Australia collects resources related to implementing Archival Repository ([RRKive](https://rkive.org/)) solutions based on a set of Protocols know as the PILARS: Principles for Implementing Long-term Archival Repository Services. https://w3id.org/ldac/pilars. RRkive is built on RO-Crate as a way to describe data objects for long term archiving. | ||
|
||
RRKive is a replacement for [Arkisto](arkisto). | ||
|
||
|
||
|
||
![RRKive logo is the word RRKive with a parrot sitting on it](assets/img/rrkive.png) |
Oops, something went wrong.