generated from linkml/linkml-project-template
Merge pull request #191 from include-dcc/doc_update
Doc and readme updates
Showing 5 changed files with 76 additions and 30 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,31 +1,47 @@ | ||
# INCLUDE v2 LinkML Data Model | ||
[![INCLUDE DCC Logo](src/docs/images/include-dcc_logo.png)](https://includedcc.org/) | ||
|
||
This repository includes the LinkML model and tooling for the v2 Data Model | ||
# INCLUDE LinkML Data Model | ||
|
||
# Where to find the data model: | ||
## Overview | ||
|
||
* The YAML files are in `src > linkml` | ||
* The JSON-LD schema is in `src > data > schematic` | ||
This repository hosts the INCLUDE LinkML Model, a data model for collaboration and data integration in Down syndrome research. Built with [LinkML](https://linkml.io/), it provides a common framework for representing data, describing metadata, and defining relationships between entities, which supports data exchange and interoperability across research initiatives. | ||
|
||
## Getting Started | ||
|
||
To work with the INCLUDE Model, follow these steps: | ||
|
||
# Poetry Environment | ||
This repo requires poetry as a system dependency | ||
1. **Clone the Repository**: Clone this repository to your local machine. | ||
|
||
https://python-poetry.org/docs/#installation | ||
2. **Install Poetry**: If not already installed, install Poetry as a system dependency. Refer to [Poetry Documentation](https://python-poetry.org/docs/#installation) for installation instructions. | ||
|
||
When poetry is installed run: | ||
`poetry install` | ||
3. **Install Dependencies**: Run the following command to install project dependencies using Poetry: | ||
|
||
```bash | ||
poetry install | ||
``` | ||
|
||
After running `poetry install`, you can proceed with further setup or usage instructions specific to your project. | ||
|
||
## Repository Structure | ||
|
||
# Workflow to update the model from: | ||
## [INCLUDE Schemasheets Google Sheets Project](https://docs.google.com/spreadsheets/d/1w6zDfz3_yrCjjrqfpXBGNmd0LZL4B03gr1KfzJtk5Cs/edit?usp=sharing) | ||
1. Run the Schematic JSON-LD transformer to generate the JSON-LD that Schematic will use to generate Manifests | ||
The repository structure is organized as follows: | ||
|
||
` poetry run python src/main.py schematic_transform ` | ||
- `src/linkml`: Contains the YAML files defining the INCLUDE Model. | ||
|
||
4. Run Schematic configured to point at the generated JSON-LD at `src/schematic/include_schematic_linkml.jsonld` | ||
## Updates Since Previous Release (v2.2.0) | ||
|
||
### Model Enhancements: | ||
|
||
- Updated Assay component to include latest Virtual Biorepository model. | ||
- Updated Study component and added new Dataset and DatasetManifest components to support new Study/Dataset pages in Data Hub. | ||
- Inclusion of over 25 new slots. | ||
- 5 new enumerations: EnumClinicalDataSourceType, EnumDataCategory, EnumGuidType, EnumParticipantLifespanStage, EnumResearchDomain. | ||
|
||
### CLI Enhancements: | ||
|
||
- **Validation**: Streamlines data cleaning and validation through a command-line interface (CLI) that lets users specify the data type and file path. The CLI reads, cleans, and validates the data against the LinkML-defined models. For more details, use: | ||
|
||
```bash | ||
validate-data --help | ||
``` | ||
|
||
`schematic manifest --config */schematic/config.yml get -s` |
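
The validation flow described under "CLI Enhancements" (read, clean, validate) can be sketched in plain Python. This is an illustration only, not the project's `validate-data` implementation: the field names (`participant_id`, `lifespan_stage`), the permissible values, and the helper names are all hypothetical, and the real CLI validates against the LinkML-defined models rather than a hard-coded set.

```python
import csv
import io

# Hypothetical permissible values; the real ones are defined in the LinkML schema.
LIFESPAN_STAGES = {"Fetal", "Pediatric", "Adult"}
# Hypothetical required columns, chosen only for this example.
REQUIRED_FIELDS = ("participant_id", "lifespan_stage")


def clean_row(row):
    """Strip surrounding whitespace from every column name and value."""
    return {
        key.strip(): (value or "").strip()
        for key, value in row.items()
        if key is not None  # csv.DictReader stores surplus cells under None
    }


def validate_row(row):
    """Return a list of human-readable problems found in one cleaned row."""
    problems = []
    for field in REQUIRED_FIELDS:
        if not row.get(field):
            problems.append(f"missing required field: {field}")
    stage = row.get("lifespan_stage")
    if stage and stage not in LIFESPAN_STAGES:
        problems.append(f"invalid lifespan_stage: {stage!r}")
    return problems


def validate_csv(text):
    """Read CSV text, clean each row, and map data-row numbers to problems."""
    report = {}
    for number, raw in enumerate(csv.DictReader(io.StringIO(text)), start=1):
        problems = validate_row(clean_row(raw))
        if problems:
            report[number] = problems
    return report
```

Cleaning runs before validation so that stray whitespace in a spreadsheet export is not reported as an invalid value; only rows with at least one problem appear in the report.
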
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,43 @@ | ||
# About my-project | ||
## INCLUDE LinkML Model | ||
|
||
... | ||
The INCLUDE LinkML Model is designed to streamline collaboration and data integration for Down syndrome research and healthcare advancement. It provides a standardized framework for defining metadata related to Down syndrome research, promoting seamless data sharing and collaboration. | ||
|
||
### About INCLUDE | ||
The INCLUDE (INvestigation of Co-occurring conditions across the Lifespan to Understand Down syndromE) Project, launched in June 2018, supports critical health and quality-of-life needs for individuals with Down syndrome by investigating co-occurring conditions such as Alzheimer’s disease/dementia, autism, cataracts, celiac disease, congenital heart disease, and diabetes. | ||
|
||
### About LinkML | ||
LinkML is a flexible modeling language that allows you to author schemas (“models”) in YAML, describing the structure of your data. Designed for both simple and complex use cases, LinkML facilitates the creation of interoperable and standard-compliant data models. | ||
|
||
### Key Features | ||
- **Standardized Metadata:** Ensures clarity, consistency, and interoperability across diverse datasets and research endeavors. | ||
|
||
- **Facilitated Collaboration:** Promotes collaboration among researchers, clinicians, and stakeholders, accelerating research progress. | ||
|
||
- **Accessible Data Hub:** Provides easy access to valuable resources, datasets, and insights through the Data Hub. | ||
|
||
- **Targeted Basic Science Studies:** Focuses on high-risk, high-reward research on chromosome 21 to better understand Down syndrome biology and develop new treatments. | ||
|
||
- **Large Cohort Assembly:** Connects existing cohorts and enrolls new participants to study the full range of conditions experienced by individuals with Down syndrome across their lifespan. | ||
|
||
- **Inclusive Clinical Trials:** Identifies and supports clinical trials addressing conditions common in the Down syndrome population, establishing measures specific to Down syndrome for evaluating clinical success. | ||
|
||
### How LinkML Enhances the Model | ||
- **Flexible Data Modeling:** LinkML supports a range of data structures, from simple spreadsheets to complex interlinked schemas, enabling comprehensive and adaptable metadata descriptions. | ||
|
||
- **Interoperability:** Designed to work with semantic frameworks like RDF and familiar developer frameworks like JSON, ensuring that data models are widely accessible and usable across different platforms and applications. | ||
|
||
- **Human-Readable Syntax:** Utilizing YAML for schema definitions, LinkML offers an approachable syntax for both technical and non-technical users, promoting broader participation in schema development. | ||
|
||
- **Integration Capabilities:** LinkML's compatibility with various data standards and technologies facilitates seamless integration with existing systems and data sources, enhancing data utility and research outcomes. | ||
|
||
### How to Get Involved | ||
- **Explore the Model:** Gain a deeper understanding of essential metadata elements for Down syndrome research. | ||
|
||
- **Collaborate:** Join forces to leverage the INCLUDE LinkML Model and accelerate progress. | ||
|
||
- **Contribute:** Share expertise, insights, and data to enrich the model and shape the future of Down syndrome research. | ||
|
||
### Start Exploring | ||
Visit [INCLUDE Data Coordinating Center (DCC)](https://includedcc.org/) to access the INCLUDE LinkML Model and join the collaborative effort toward transforming Down syndrome research and healthcare. | ||
|
||
For more information about LinkML, visit [LinkML](https://linkml.io/). |
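
To make the "schemas in YAML" description under "About LinkML" concrete, the sketch below mirrors the general shape of a LinkML schema (top-level `classes`, `slots`, and `enums`, with `permissible_values` on each enum) as a Python dictionary, roughly what a YAML schema file looks like once loaded. The slot names and permissible values are hypothetical and are not taken from the INCLUDE model; only the names `Dataset` and `EnumDataCategory` appear in the release notes, and their contents here are invented for illustration. The helper `describe_class` is likewise hypothetical and not part of the LinkML tooling.

```python
# Hypothetical schema fragment, shaped like a LinkML YAML file after loading.
SCHEMA = {
    "name": "example_schema",
    "classes": {
        "Dataset": {"slots": ["dataset_id", "data_category"]},
    },
    "slots": {
        "dataset_id": {"range": "string", "required": True},
        "data_category": {"range": "EnumDataCategory"},
    },
    "enums": {
        "EnumDataCategory": {
            # Invented values; the real enum's values are in the model itself.
            "permissible_values": {"Genomics": {}, "Proteomics": {}},
        },
    },
}


def describe_class(schema, class_name):
    """Return {slot_name: description} for every slot listed on a class."""
    described = {}
    for slot_name in schema["classes"][class_name]["slots"]:
        slot = schema["slots"][slot_name]
        # Assumption for this sketch: slots without a range are strings.
        slot_range = slot.get("range", "string")
        if slot_range in schema["enums"]:
            values = sorted(schema["enums"][slot_range]["permissible_values"])
            described[slot_name] = f"enum {slot_range}: {', '.join(values)}"
        else:
            described[slot_name] = slot_range
    return described
```

Calling `describe_class(SCHEMA, "Dataset")` lists each slot with either its primitive range or the enum it is restricted to, which is the relationship between classes, slots, and enumerations that the YAML files in `src/linkml` express.
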