@@ -30,23 +30,27 @@ try https://variantspark.readthedocs.io/en/latest/getting_started.html
3030 ...
3131 ```
3232 - variant-spark --spark --master 'local[*]' -- importance -if gitHub/VariantSpark/data/chr22_1000.vcf -ff gitHub/VariantSpark/data/chr22-labels.csv -fc 22_16050408 -v -rn 500 -rbs 20 -ro -sr 13
33- - pip3 install --no-cache-dir -r /app/VariantSpark/requirements.txt # install python dependency
3433
34+ - `which variant-spark` # to find the variant-spark bash script
35+ - bash script source: https://github.com/aehrc/VariantSpark/tree/master/bin/variant-spark
36+ - it requires Spark to be set up
37+
38+ - pip3 install --no-cache-dir -r /app/VariantSpark/requirements.txt # install Python dependencies
3539## pip install variant-spark
3640- /usr/local/lib/python3.8/dist-packages/varspark is installed; it includes:
3741 - /usr/local/lib/python3.8/dist-packages/varspark/jars/variant-spark_2.12-0.5.5-all.jar
3842 - but this jar is not a fat jar: it does not include au.csiro.aehrc.third.hail-is
3943 - /usr/local/lib/python3.8/dist-packages/varspark: from variant-spark-0.5.5.tar.gz/varspark
4044- /usr/local/share/variant-spark/data/chr22* .vcf: from variant-spark-0.5.5.tar.gz/target/data
4145- /usr/local/bin/jvariant-spark and variant-spark etc: from variant-spark-0.5.5.tar.gz/target/bin
42- -
43- ## p
46+
47+ ## pip install hail==0.2.74
4448- hail-all-spark.jar: installed by `pip3 install hail==0.2.74`, which is listed in requirements.txt
4549 - is used by the Python hail package at runtime.
4650 - /usr/local/lib/python3.8/dist-packages/hail/backend/hail-all-spark.jar
4751 - jar tf hail-all-spark.jar | grep hail | grep SparkBackend
4852
49- - mvn install with hail
53+ ## mvn install
5054 - Maven will try to download the JAR hail_2.12_3.1, version 0.2.74, under group ID au.csiro.aehrc.third.hail-is, as declared in pom.xml
5155 - the JAR is stored in your local Maven repository (~/.m2/repository/au/csiro/aehrc/third/hail-is/hail_2.12_3.1/0.2.74/).
5256
@@ -57,9 +61,7 @@ try https://variantspark.readthedocs.io/en/latest/getting_started.html
5761 - python: vshl.random_forest_model(...) calls Scala RFModel.scala based on the Spark classpath
5862 - summary: Python-to-Scala calls depend on hail-all-spark.jar, not on the Maven-installed Hail JAR
5963
60- - which variant-spark # to find variant-spark bash script
61- - original from https://github.com/aehrc/VariantSpark/tree/master/bin/variant-spark
62- - it requires to set up spark
64+
6365
6466
6567
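The `jar tf hail-all-spark.jar | grep hail | grep SparkBackend` check in the notes above can also be scripted. A minimal sketch in Python, relying only on the fact that a JAR is a ZIP archive. The helper name `jar_entries_matching` is hypothetical (it is not part of VariantSpark or Hail), and the path is the one recorded in these notes, so it may differ on another system:

```python
import os
import zipfile
from typing import List


def jar_entries_matching(jar_path: str, *needles: str) -> List[str]:
    """Return the JAR entry names that contain every given substring.

    Hypothetical helper: mirrors `jar tf <jar> | grep a | grep b`
    by reading the JAR as an ordinary ZIP archive.
    """
    with zipfile.ZipFile(jar_path) as jar:
        return [name for name in jar.namelist()
                if all(needle in name for needle in needles)]


if __name__ == "__main__":
    # Path taken from the notes above; adjust for your Python installation.
    jar = "/usr/local/lib/python3.8/dist-packages/hail/backend/hail-all-spark.jar"
    if os.path.exists(jar):
        for entry in jar_entries_matching(jar, "hail", "SparkBackend"):
            print(entry)
    else:
        print(f"{jar} not found; adjust the path")
```

Running the same helper against the variant-spark `-all.jar` is one way to confirm whether the Hail classes are bundled there.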