docs/source/configuration.rst
Lines changed: 11 additions & 9 deletions
@@ -1,7 +1,6 @@
1
-
.. _configuration:
2
-
1
+
###################
3
2
Configure Schematic
4
-
===================
3
+
###################
5
4
6
5
This is an example config for Schematic. All listed values are the defaults that apply when no config is used. Remove any fields in the config that you don't want to change.
7
6
If you remove all fields from a section, remove the entire section as well, including its header.
@@ -48,35 +47,38 @@ Change the values of any fields you do want to change. Please view the installa
48
47
49
48
This document describes in detail what each of these configurations means.
50
49
50
+
***********
51
51
Asset Store
52
-
-----------
52
+
***********
53
53
54
54
Synapse
55
-
~~~~~~~
55
+
========
56
56
This describes where assets such as manifests are stored; the configuration of the asset store is described
57
57
under the asset store section.
58
58
59
59
* master_fileview_id: Synapse ID of the file view listing all project data assets.
60
60
* config: Path to the Synapse config file, either absolute or relative to this file. Note: if you use the `synapse config` command, you will have to provide the full path to the configuration file.
61
61
* manifest_basename: Base name under which manifest files will be saved on Synapse. The component name is appended to it, for example: `synapse_storage_manifest_biospecimen.csv`
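The naming rule for manifest_basename can be sketched in a few lines of Python. This is a minimal illustration, not Schematic's actual implementation: the helper name `manifest_file_name` is hypothetical, and lowercasing the component and joining with an underscore are assumptions inferred from the example file name above.

```python
def manifest_file_name(basename: str, component: str) -> str:
    """Hypothetical helper: build a manifest file name from a base name and a component."""
    # Assumption: the component is lowercased and joined with "_" before ".csv",
    # matching the example `synapse_storage_manifest_biospecimen.csv`.
    return f"{basename}_{component.lower()}.csv"


print(manifest_file_name("synapse_storage_manifest", "Biospecimen"))
```

Under these assumptions, a different component such as "Patient" would produce a file name with the same base name and a different suffix.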
62
62
63
+
**********
63
64
Manifest
64
-
--------
65
+
**********
65
66
This describes information about manifests as it relates to generation and validation. Note: some of these configurations can be overwritten by the CLI commands.
66
67
67
68
* manifest_folder: Location where manifests will be saved. This can be a relative or absolute path on your local machine.
68
69
* title: Title or title prefix given to generated manifest(s). This is used to name the manifest file saved locally.
69
70
* data_type: Data types of manifests to be generated, or the data type (singular) to validate a manifest against. If you want all the available manifests, you can input "all manifests".
70
71
71
-
72
+
******
72
73
Model
73
-
-----
74
+
******
74
75
This describes the location of your schema.
75
76
76
77
* location: The location of your schema JSON-LD file. It must be either an absolute path or a path relative to this file. URLs are currently NOT supported, so you will have to download the JSON-LD data model first. Here is an example: https://raw.githubusercontent.com/ncihtan/data-models/v24.9.1/HTAN.model.jsonld
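The path rule for location could be sketched as follows. This is only an illustration under stated assumptions: `resolve_model_location` is a hypothetical helper, not part of Schematic's API, and it assumes "relative to this file" means relative to the directory that contains the config file.

```python
from pathlib import Path
from urllib.parse import urlparse


def resolve_model_location(location: str, config_path: str) -> Path:
    """Hypothetical helper: resolve a schema location against the config file."""
    # URLs are not supported, so reject anything with an http(s) scheme.
    if urlparse(location).scheme in ("http", "https"):
        raise ValueError("URLs are not supported; download the JSON-LD file first")
    path = Path(location)
    # Absolute paths are used as-is.
    if path.is_absolute():
        return path
    # Assumption: relative paths are anchored at the config file's directory.
    return Path(config_path).parent / path
```

For example, with a config file at `/home/user/config.yml`, a location of `models/HTAN.model.jsonld` would resolve to `/home/user/models/HTAN.model.jsonld`, while passing the raw GitHub URL would raise an error.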
77
78
79
+
*************
78
80
Google Sheets
79
-
-------------
81
+
*************
80
82
Schematic leverages the Google API to generate manifests. This section is for using Google Sheets with Schematic.
81
83
82
84
* service_acct_creds: Path to the Google service account credentials file that you download from Google Cloud Platform, either absolute or relative to this file.
docs/source/index.rst
Lines changed: 20 additions & 18 deletions
@@ -5,8 +5,9 @@
5
5
6
6
.. _index:
7
7
8
+
######################################
8
9
Welcome to Schematic's documentation!
9
-
=====================================
10
+
######################################
10
11
11
12
.. warning::
12
13
This documentation site is a work in progress, and the sublinks may change. Apologies for the inconvenience.
@@ -28,15 +29,16 @@ Schematic tackles these goals:
28
29
:depth: 2
29
30
:local:
30
31
32
+
*******************
31
33
Important Concepts
32
-
------------------
34
+
*******************
33
35
34
36
.. important::
35
37
36
38
Before reading more about Schematic, review this section, which covers essential concepts for using the Schematic tool effectively.
37
39
38
40
Synapse FileViews
39
-
~~~~~~~~~~~~~~~~~
41
+
=================
40
42
Users are responsible for setting up a **FileView** that integrates with Schematic. Note that FileViews appear under the "Tables" tab in Synapse and can be named according to the project's needs. For instance, a FileView for **Project A** could have a different name than a FileView for **Project B**.
41
43
42
44
For more information on Synapse projects, visit:
@@ -45,17 +47,17 @@ For more information on Synapse projects, visit:
Folders in Synapse allow users to organize data within projects. More details on uploading and organizing data can be found at `Synapse folders <https://help.synapse.org/docs/Uploading-and-Organizing-Data-Into-Projects,-Files,-and-Folders.2048327716.html>`_
51
53
52
54
Synapse Datasets
53
-
~~~~~~~~~~~~~~~~
55
+
================
54
56
55
57
A Dataset is an object in Synapse that appears under the "Dataset" tab and represents a user-defined collection of Synapse files and versions. For details, see https://help.synapse.org/docs/Datasets.2611281979.html
56
58
57
59
JSON-LD
58
-
~~~~~~~
60
+
=======
59
61
JSON-LD is a lightweight Linked Data format. The usage of JSON-LD to capture our data models
60
62
extends beyond the creation, validation, and submission of annotations/manifests into Synapse.
61
63
It can create relationships between different data models and, in the future, drive
@@ -64,22 +66,22 @@ and their relationships is also possible which allows the community to see the d
64
66
connections between all the data uploaded into Synapse.
65
67
66
68
Manifest
67
-
~~~~~~~~
69
+
========
68
70
69
71
A manifest is a structured file that contains metadata about files under a "top level folder".
70
72
The metadata includes information about the files, such as data type.
71
73
The manifest can also be used to annotate the data on Synapse and create a file view
72
74
that enables the FAIR principles on each of the files in the "top level folder".
73
75
74
76
Component/Data type
75
-
~~~~~~~~~~~~~~~~~~~
77
+
===================
76
78
"component" and "data type" are used interchangeably. The component/data type is determined from the specified JSON-LD data model.
77
79
If the string "component" exists in the depends on column, the "Attribute" value in that row is a data type.
78
80
Examples of data types are "Biospecimen" and "Patient": https://github.com/Sage-Bionetworks/schematic/blob/develop/tests/data/example.model.csv#L3.
79
81
Each data type/component should have its own manifest template with its own set of columns.
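The rule for spotting data types in a CSV data model can be sketched as below. This is a rough illustration, not how Schematic itself parses models: the function name `find_data_types` is hypothetical, the column header `DependsOn` is an assumption about how the "depends on" column is spelled, and the sample rows are made up for demonstration rather than copied from the example model.

```python
import csv
import io

# Illustrative rows only; the real example model lives in the Schematic repository.
SAMPLE_MODEL_CSV = """Attribute,DependsOn
Patient,"Patient ID, Sex, Component"
Biospecimen,"Sample ID, Patient ID, Component"
Sex,
"""


def find_data_types(model_csv: str, depends_on_column: str = "DependsOn") -> list[str]:
    """Return the attributes whose depends-on cell mentions "component"."""
    reader = csv.DictReader(io.StringIO(model_csv))
    return [
        row["Attribute"]
        for row in reader
        # Assumption: the match is case-insensitive.
        if "component" in (row.get(depends_on_column) or "").lower()
    ]


print(find_data_types(SAMPLE_MODEL_CSV))
```

With the sample rows above, only "Patient" and "Biospecimen" are reported, because the "Sex" row has an empty depends-on cell.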
80
82
81
83
Project Data Layout
82
-
~~~~~~~~~~~~~~~~~~~
84
+
===================
83
85
84
86
Regardless of data layout, the data in your Synapse Project(s) are uploaded into Synapse Folders to be curated and annotated by schematic.
85
87
In both layouts listed below, the project administrators along with the data contributors may have preferences on how the
@@ -93,19 +95,19 @@ different things under these two layouts.
93
95
94
96
In both of these layouts, these are really just groupings of resources.
95
97
96
-
98
+
*******************
97
99
Schematic services
98
-
------------------
100
+
*******************
99
101
100
102
The following are the four main endpoints that assist with the high-level goals outlined above, with additional goals to come.
101
103
102
104
Manifest Generation
103
-
~~~~~~~~~~~~~~~~~~~
105
+
===================
104
106
105
107
Provides a manifest template for users for a particular project or data type. If a project with annotations already exists, a semi-filled-out template can be provided to the user. This ensures they do not start from scratch. If there are no existing annotations and manifests, an empty manifest template is provided.
106
108
107
-
Manifest Validation
108
-
~~~~~~~~~~~~~~~~~~~
109
+
Validating a Manifest
110
+
=====================
109
111
110
112
Given a filled-out manifest:
111
113
@@ -116,7 +118,7 @@ Given a filled-out manifest:
116
118
- Validation results are provided before the manifest file is uploaded into Synapse.
117
119
118
120
Manifest Submission
119
-
~~~~~~~~~~~~~~~~~~~
121
+
===================
120
122
121
123
Given a filled-out manifest, this will allow you to submit the manifest to the "top level folder".
122
124
This validates the manifest and...
@@ -130,13 +132,13 @@ This is validates the manifest and...
130
132
More validation documentation can be found here: https://sagebionetworks.jira.com/wiki/spaces/SCHEM/pages/3302785036/Schematic+Validation
131
133
132
134
Data Model Visualization
133
-
~~~~~~~~~~~~~~~~~~~~~~~~
135
+
========================
134
136
135
137
These endpoints allow you to visualize your data models and their relationships with each other.
136
138
137
-
139
+
**************
138
140
API reference
139
-
-------------
141
+
**************
140
142
141
143
For the entire Python API reference documentation, you can visit the docs here: https://sage-bionetworks.github.io/schematic/