Skip to content

Commit 2ae125b

Browse files
Update note.md
1 parent 1297836 commit 2ae125b

File tree

1 file changed

+9
-7
lines changed

1 file changed

+9
-7
lines changed

dev/docker/note.md

Lines changed: 9 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -30,23 +30,27 @@ try https://variantspark.readthedocs.io/en/latest/getting_started.html
3030
...
3131
```
3232
- variant-spark --spark --master 'local[*]' -- importance -if gitHub/VariantSpark/data/chr22_1000.vcf -ff gitHub/VariantSpark/data/chr22-labels.csv -fc 22_16050408 -v -rn 500 -rbs 20 -ro -sr 13
33-
- pip3 install --no-cache-dir -r /app/VariantSpark/requirements.txt # install python dependency
3433

34+
- `which variant-spark` # to find variant-spark bash script
35+
- seek bash script: https://github.com/aehrc/VariantSpark/tree/master/bin/variant-spark
36+
- it requires to set up spark
37+
38+
- pip3 install --no-cache-dir -r /app/VariantSpark/requirements.txt # install python dependency
3539
## pip install variant-spark
3640
- /usr/local/lib/python3.8/dist-packages/varspark is installed, includes
3741
- /usr/local/lib/python3.8/dist-packages/varspark/jars/variant-spark_2.12-0.5.5-all.jar
3842
- but this jar is not fat jar which didn't includes au.csiro.aehrc.third.hail-is
3943
- /usr/local/lib/python3.8/dist-packages/varspark: from variant-spark-0.5.5.tar.gz/varspark
4044
- /usr/local/share/variant-spark/data/chr22*.vcf: from variant-spark-0.5.5.tar.gz/target/data
4145
- /usr/local/bin/jvariant-spark and variant-spark etc: from variant-spark-0.5.5.tar.gz/target/bin
42-
-
43-
## p
46+
47+
## pip install hail==0.2.74
4448
- hail-all-spark.jar : installed by pip3 install hail==0.2.74 inside the requirement.txt
4549
- is used by the Python hail package at runtime.
4650
- /usr/local/lib/python3.8/dist-packages/hail/backend/hail-all-spark.jar
4751
- jar tf hail-all-spark.jar | grep hail | grep SparkBackend
4852

49-
- mvn install with hail
53+
## mvn install
5054
- Maven will try to download a JAR matching hail_2.12_3.1:0.2.74 from repo: au.csiro.aehrc.third.hail-is based on pom.xml
5155
- the JAR is stored in your local Maven repository (~/.m2/repository/au/csiro/aehrc/third/hail-is/hail_2.12_3.1/0.2.74/).
5256
@@ -57,9 +61,7 @@ try https://variantspark.readthedocs.io/en/latest/getting_started.html
5761
- python: vshl.random_forest_model(...) calls scala RFModel.scala based on park classpath
5862
- summary: python calls scala depend on hail-all-spark.jar but not mvn installed hails
5963

60-
- which variant-spark # to find variant-spark bash script
61-
- orginal from https://github.com/aehrc/VariantSpark/tree/master/bin/variant-spark
62-
- it requires to set up spark
64+
6365

6466

6567

0 commit comments

Comments
 (0)