You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CONTRIBUTING.md
+10-15Lines changed: 10 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,16 +9,14 @@ This package uses `flake8` - it's settings are described in [setup.cfg](setup.cf
9
9
10
10
## The Development Life Cycle
11
11
12
-
### Fork and clone this repository
12
+
### Clone this repository
13
13
14
-
1. See the [Github docs](https://help.github.com/articles/fork-a-repo/) for how to make a copy (a fork) of a repository to your own Github account.
15
-
1. Then, [clone the repository](https://help.github.com/articles/cloning-a-repository/) to your local machine so you can begin making changes.
16
-
1. Add this repository as an [upstream remote](https://help.github.com/en/articles/configuring-a-remote-for-a-fork) on your local git repository so that you are able to fetch the latest commits.
14
+
1.[Clone the repository](https://help.github.com/articles/cloning-a-repository/) to your local machine so you can begin making changes.
17
15
1. On your local machine make sure you have the latest version of the `develop` branch:
`pipenv` is a Python package manager. Learn more about [pipenv](https://pipenv.pypa.io/en/latest/) and how to install it.
38
36
39
-
```
40
-
# Coming soon
41
-
```
42
37
43
38
### Developing
44
39
45
40
The GENIE project follows the standard [git flow](https://www.atlassian.com/git/tutorials/comparing-workflows/gitflow-workflow) development strategy.
46
41
> To ensure the most fluid development, try not to push to your `develop` or `main` branch.
47
42
48
-
1. (Assuming you have followed all 4 steps above in the "fork and clone this repository" section). Navigate to your cloned repository on your computer/server.
43
+
1. Navigate to your cloned repository on your computer/server.
49
44
1. Make sure your `develop` branch is up to date with the `Sage-Bionetworks/Genie` `develop` branch.
50
45
51
46
```
52
-
cd {your-github-username}/Genie
47
+
cd Genie
53
48
git checkout develop
54
-
git pull upstream develop
49
+
git pull
55
50
```
56
51
57
-
1. Create a feature branch which off the `develop` branch. If there is a GitHub/JIRA issue that you are addressing, name the branch after the issue with some more detail (like `{GH|JIRA}-123-add-some-new-feature`).
52
+
1. Create a feature branch which off the `develop` branch. If there is a GitHub/JIRA issue that you are addressing, name the branch after the issue with some more detail (like `{GH|GEN}-123-add-some-new-feature`).
58
53
59
54
```
60
55
git checkout develop
61
-
git checkout -b JIRA-123-new-feature
56
+
git checkout -b GEN-123-new-feature
62
57
```
63
58
64
59
1. At this point, you have only created the branch locally, you need to push this to your fork on GitHub.
You should now be able to see the branch on GitHub. Make commits as you deem necessary. It helps to provide useful commit messages - a commit message saying 'Update' is a lot less helpful than saying 'Remove X parameter because it was unused'.
@@ -86,7 +81,7 @@ The GENIE project follows the standard [git flow](https://www.atlassian.com/git/
86
81
black ./
87
82
```
88
83
89
-
1. Once you have completed all the steps above, in Github, create a pull request from the feature branch of your fork to the `develop` branch of Sage-Bionetworks/Genie.
84
+
1. Once you have completed all the steps above, in Github, create a pull request from the feature branch to the `develop` branch of Sage-Bionetworks/Genie.
90
85
91
86
> *A code maintainer must review and accept your pull request.* A code review ideally happens with both the contributor and the reviewer present, but is not strictly required for contributing. This can be performed remotely (e.g., Zoom, Hangout, or other video or phone conference).
Copy file name to clipboardExpand all lines: README.md
+8-6Lines changed: 8 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -97,6 +97,8 @@ These are instructions on how you would develop and test the pipeline locally.
97
97
1. Make sure you have read through the [GENIE Onboarding Docs](https://sagebionetworks.jira.com/wiki/spaces/APGD/pages/2163344270/Onboarding) and have access to all of the required repositories, resources and synapse projects for Main GENIE.
98
98
1. Be sure you are invited to the Synapse GENIE Admin team.
99
99
1. Make sure you are a Synapse certified user: [Certified User - Synapse User Account Types](https://help.synapse.org/docs/Synapse-User-Account-Types.2007072795.html#SynapseUserAccountTypes-CertifiedUser)
100
+
1. Be sure to clone the cbioportal repo: https://github.com/cBioPortal/cbioportal and `git checkout` the version of the repo pinned to the [Dockerfile](https://github.com/Sage-Bionetworks/Genie/blob/main/Dockerfile)
101
+
1. Be sure to clone the annotation-tools repo: https://github.com/Sage-Bionetworks/annotation-tools and `git checkout` the version of the repo pinned to the [Dockerfile](https://github.com/Sage-Bionetworks/Genie/blob/main/Dockerfile).
100
102
1. Clone this repo and install the package locally.
101
103
102
104
```
@@ -131,7 +133,7 @@ These are instructions on how you would develop and test the pipeline locally.
131
133
synapse login
132
134
```
133
135
134
-
1. Run the different pipelines on the test project. The `--project_id syn7208886` points to the test project.
136
+
1. Run the different steps of the pipeline on the test project. The `--project_id syn7208886` points to the test project. You should always be using the test project when developing, testing and running locally.
135
137
136
138
1. Validate all the files **excluding vcf files**:
137
139
@@ -151,7 +153,7 @@ These are instructions on how you would develop and test the pipeline locally.
151
153
python bin/input_to_database.py main --project_id syn7208886 --deleteOld
152
154
```
153
155
154
-
1. Process the mutation data. Be sure to clone this repo: https://github.com/Sage-Bionetworks/annotation-tools and `git checkout` the version of the repo pinned to the [Dockerfile](https://github.com/Sage-Bionetworks/Genie/blob/main/Dockerfile). This repo houses the code that re-annotates the mutation data with genome nexus. The `--createNewMafDatabase` will create a new mutation tables in the test project. This flag is necessary for production data for two main reasons:
156
+
1. Process the mutation data. This command uses the `annotation-tools` repo that you cloned previously which houses the code that standardizes/merges the mutation (both maf and vcf) files and re-annotates the mutation data with genome nexus. The `--createNewMafDatabase` will create a new mutation tables in the test project. This flag is necessary for production data for two main reasons:
155
157
* During processing of mutation data, the data is appended to the data, so without creating an empty table, there will be duplicated data uploaded.
156
158
* By design, Synapse Tables were meant to be appended to. When a Synapse Tables is updated, it takes time to index the table and return results. This can cause problems for the pipeline when trying to query the mutation table. It is actually faster to create an entire new table than updating or deleting all rows and appending new rows when dealing with millions of rows.
157
159
* If you run this more than once on the same day, you'll run into an issue with overwriting the narrow maf table as it already exists. Be sure to rename the current narrow maf database under `Tables` in the test synapse project and try again.
@@ -160,16 +162,16 @@ These are instructions on how you would develop and test the pipeline locally.
1. Create a consortium release. Be sure to add the `--test` parameter. Be sure to clone the cbioportal repo: https://github.com/cBioPortal/cbioportal and `git checkout` the version of the repo pinned to the [Dockerfile](https://github.com/Sage-Bionetworks/Genie/blob/main/Dockerfile). For consistency, the processingDate specified here should match the one used for TEST pipeline in [nf-genie.](https://github.com/Sage-Bionetworks-Workflows/nf-genie/blob/main/main.nf)
165
+
1. Create a consortium release. Be sure to add the `--test` parameter. For consistency, the `processingDate` specified here should match the one used in the `consortium_map` for the `TEST` key [nf-genie.](https://github.com/Sage-Bionetworks-Workflows/nf-genie/blob/main/main.nf)
164
166
165
167
```
166
-
python bin/database_to_staging.py Jul-2022 ../cbioportal TEST --test
168
+
python bin/database_to_staging.py <processingDate> ../cbioportal TEST --test
167
169
```
168
170
169
-
1. Create a public release. Be sure to add the `--test` parameter. Be sure to clone the cbioportal repo: https://github.com/cBioPortal/cbioportal and `git checkout` the version of the repo pinned to the [Dockerfile](https://github.com/Sage-Bionetworks/Genie/blob/main/Dockerfile). For consistency, the processingDate specified here should match the one used for TEST pipeline in [nf-genie.](https://github.com/Sage-Bionetworks-Workflows/nf-genie/blob/main/main.nf)
171
+
1. Create a public release. Be sure to add the `--test` parameter. For consistency, the `processingDate` specified here should match the one used in the `public_map` for the `TEST` key [nf-genie.](https://github.com/Sage-Bionetworks-Workflows/nf-genie/blob/main/main.nf)
170
172
171
173
```
172
-
python bin/consortium_to_public.py Jul-2022 ../cbioportal TEST --test
174
+
python bin/consortium_to_public.py <processingDate> ../cbioportal TEST --test
0 commit comments