Skip to content

Commit 2f1e0c5

Browse files
committed
update to match current workflows
1 parent 2331fcf commit 2f1e0c5

File tree

2 files changed

+18
-21
lines changed

2 files changed

+18
-21
lines changed

CONTRIBUTING.md

Lines changed: 10 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -9,16 +9,14 @@ This package uses `flake8` - it's settings are described in [setup.cfg](setup.cf
99

1010
## The Development Life Cycle
1111

12-
### Fork and clone this repository
12+
### Clone this repository
1313

14-
1. See the [Github docs](https://help.github.com/articles/fork-a-repo/) for how to make a copy (a fork) of a repository to your own Github account.
15-
1. Then, [clone the repository](https://help.github.com/articles/cloning-a-repository/) to your local machine so you can begin making changes.
16-
1. Add this repository as an [upstream remote](https://help.github.com/en/articles/configuring-a-remote-for-a-fork) on your local git repository so that you are able to fetch the latest commits.
14+
1. [Clone the repository](https://help.github.com/articles/cloning-a-repository/) to your local machine so you can begin making changes.
1715
1. On your local machine make sure you have the latest version of the `develop` branch:
1816

1917
```
2018
git checkout develop
21-
git pull upstream develop
19+
git pull
2220
```
2321
2422
### Install development dependencies
@@ -36,35 +34,32 @@ pip install -r requirements.txt
3634
#### pipenv
3735
`pipenv` is a Python package manager. Learn more about [pipenv](https://pipenv.pypa.io/en/latest/) and how to install it.
3836
39-
```
40-
# Coming soon
41-
```
4237
4338
### Developing
4439
4540
The GENIE project follows the standard [git flow](https://www.atlassian.com/git/tutorials/comparing-workflows/gitflow-workflow) development strategy.
4641
> To ensure the most fluid development, try not to push to your `develop` or `main` branch.
4742
48-
1. (Assuming you have followed all 4 steps above in the "fork and clone this repository" section). Navigate to your cloned repository on your computer/server.
43+
1. Navigate to your cloned repository on your computer/server.
4944
1. Make sure your `develop` branch is up to date with the `Sage-Bionetworks/Genie` `develop` branch.
5045
5146
```
52-
cd {your-github-username}/Genie
47+
cd Genie
5348
git checkout develop
54-
git pull upstream develop
49+
git pull
5550
```
5651
57-
1. Create a feature branch which off the `develop` branch. If there is a GitHub/JIRA issue that you are addressing, name the branch after the issue with some more detail (like `{GH|JIRA}-123-add-some-new-feature`).
52+
1. Create a feature branch which off the `develop` branch. If there is a GitHub/JIRA issue that you are addressing, name the branch after the issue with some more detail (like `{GH|GEN}-123-add-some-new-feature`).
5853
5954
```
6055
git checkout develop
61-
git checkout -b JIRA-123-new-feature
56+
git checkout -b GEN-123-new-feature
6257
```
6358
6459
1. At this point, you have only created the branch locally, you need to push this to your fork on GitHub.
6560
6661
```
67-
git push --set-upstream origin JIRA-123-new-feature
62+
git push --set-upstream origin GEN-123-new-feature
6863
```
6964
7065
You should now be able to see the branch on GitHub. Make commits as you deem necessary. It helps to provide useful commit messages - a commit message saying 'Update' is a lot less helpful than saying 'Remove X parameter because it was unused'.
@@ -86,7 +81,7 @@ The GENIE project follows the standard [git flow](https://www.atlassian.com/git/
8681
black ./
8782
```
8883
89-
1. Once you have completed all the steps above, in Github, create a pull request from the feature branch of your fork to the `develop` branch of Sage-Bionetworks/Genie.
84+
1. Once you have completed all the steps above, in Github, create a pull request from the feature branch to the `develop` branch of Sage-Bionetworks/Genie.
9085
9186
> *A code maintainer must review and accept your pull request.* A code review ideally happens with both the contributor and the reviewer present, but is not strictly required for contributing. This can be performed remotely (e.g., Zoom, Hangout, or other video or phone conference).
9287

README.md

Lines changed: 8 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -97,6 +97,8 @@ These are instructions on how you would develop and test the pipeline locally.
9797
1. Make sure you have read through the [GENIE Onboarding Docs](https://sagebionetworks.jira.com/wiki/spaces/APGD/pages/2163344270/Onboarding) and have access to all of the required repositories, resources and synapse projects for Main GENIE.
9898
1. Be sure you are invited to the Synapse GENIE Admin team.
9999
1. Make sure you are a Synapse certified user: [Certified User - Synapse User Account Types](https://help.synapse.org/docs/Synapse-User-Account-Types.2007072795.html#SynapseUserAccountTypes-CertifiedUser)
100+
1. Be sure to clone the cbioportal repo: https://github.com/cBioPortal/cbioportal and `git checkout` the version of the repo pinned to the [Dockerfile](https://github.com/Sage-Bionetworks/Genie/blob/main/Dockerfile)
101+
1. Be sure to clone the annotation-tools repo: https://github.com/Sage-Bionetworks/annotation-tools and `git checkout` the version of the repo pinned to the [Dockerfile](https://github.com/Sage-Bionetworks/Genie/blob/main/Dockerfile).
100102
1. Clone this repo and install the package locally.
101103

102104
```
@@ -131,7 +133,7 @@ These are instructions on how you would develop and test the pipeline locally.
131133
synapse login
132134
```
133135
134-
1. Run the different pipelines on the test project. The `--project_id syn7208886` points to the test project.
136+
1. Run the different steps of the pipeline on the test project. The `--project_id syn7208886` points to the test project. You should always be using the test project when developing, testing and running locally.
135137
136138
1. Validate all the files **excluding vcf files**:
137139
@@ -151,7 +153,7 @@ These are instructions on how you would develop and test the pipeline locally.
151153
python bin/input_to_database.py main --project_id syn7208886 --deleteOld
152154
```
153155
154-
1. Process the mutation data. Be sure to clone this repo: https://github.com/Sage-Bionetworks/annotation-tools and `git checkout` the version of the repo pinned to the [Dockerfile](https://github.com/Sage-Bionetworks/Genie/blob/main/Dockerfile). This repo houses the code that re-annotates the mutation data with genome nexus. The `--createNewMafDatabase` will create a new mutation tables in the test project. This flag is necessary for production data for two main reasons:
156+
1. Process the mutation data. This command uses the `annotation-tools` repo that you cloned previously which houses the code that standardizes/merges the mutation (both maf and vcf) files and re-annotates the mutation data with genome nexus. The `--createNewMafDatabase` will create a new mutation tables in the test project. This flag is necessary for production data for two main reasons:
155157
* During processing of mutation data, the data is appended to the data, so without creating an empty table, there will be duplicated data uploaded.
156158
* By design, Synapse Tables were meant to be appended to. When a Synapse Tables is updated, it takes time to index the table and return results. This can cause problems for the pipeline when trying to query the mutation table. It is actually faster to create an entire new table than updating or deleting all rows and appending new rows when dealing with millions of rows.
157159
* If you run this more than once on the same day, you'll run into an issue with overwriting the narrow maf table as it already exists. Be sure to rename the current narrow maf database under `Tables` in the test synapse project and try again.
@@ -160,16 +162,16 @@ These are instructions on how you would develop and test the pipeline locally.
160162
python bin/input_to_database.py mutation --project_id syn7208886 --deleteOld --genie_annotation_pkg ../annotation-tools --createNewMafDatabase
161163
```
162164
163-
1. Create a consortium release. Be sure to add the `--test` parameter. Be sure to clone the cbioportal repo: https://github.com/cBioPortal/cbioportal and `git checkout` the version of the repo pinned to the [Dockerfile](https://github.com/Sage-Bionetworks/Genie/blob/main/Dockerfile). For consistency, the processingDate specified here should match the one used for TEST pipeline in [nf-genie.](https://github.com/Sage-Bionetworks-Workflows/nf-genie/blob/main/main.nf)
165+
1. Create a consortium release. Be sure to add the `--test` parameter. For consistency, the `processingDate` specified here should match the one used in the `consortium_map` for the `TEST` key [nf-genie.](https://github.com/Sage-Bionetworks-Workflows/nf-genie/blob/main/main.nf)
164166
165167
```
166-
python bin/database_to_staging.py Jul-2022 ../cbioportal TEST --test
168+
python bin/database_to_staging.py <processingDate> ../cbioportal TEST --test
167169
```
168170
169-
1. Create a public release. Be sure to add the `--test` parameter. Be sure to clone the cbioportal repo: https://github.com/cBioPortal/cbioportal and `git checkout` the version of the repo pinned to the [Dockerfile](https://github.com/Sage-Bionetworks/Genie/blob/main/Dockerfile). For consistency, the processingDate specified here should match the one used for TEST pipeline in [nf-genie.](https://github.com/Sage-Bionetworks-Workflows/nf-genie/blob/main/main.nf)
171+
1. Create a public release. Be sure to add the `--test` parameter. For consistency, the `processingDate` specified here should match the one used in the `public_map` for the `TEST` key [nf-genie.](https://github.com/Sage-Bionetworks-Workflows/nf-genie/blob/main/main.nf)
170172
171173
```
172-
python bin/consortium_to_public.py Jul-2022 ../cbioportal TEST --test
174+
python bin/consortium_to_public.py <processingDate> ../cbioportal TEST --test
173175
```
174176
175177
## Production

0 commit comments

Comments
 (0)