Skip to content

Commit 7898bb7

Browse files
authored
Merge pull request #56 from poseidon-framework/poseidon27
Poseidon v2.7.0
2 parents 6580561 + 69d6db0 commit 7898bb7

File tree

3 files changed

+26
-0
lines changed

3 files changed

+26
-0
lines changed

POSEIDON_yml_fields.tsv

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,8 @@ indFileChkSum 1 genotypeData md5 checksum of the indFile String FALSE
1919
snpSet 1 genotypeData Can be either 1240K, HumanOrigins or Other depending on the list of SNPs used String (1240K|HumanOrigins|Other) FALSE
2020
jannoFile 0 relative path to jannoFile String Path FALSE
2121
jannoFileChkSum 0 md5 checksum of the jannoFile String FALSE
22+
sequencingSourceFile 0 relative path to sequencingSourceFile String Path FALSE
23+
sequencingSourceFileChkSum 0 md5 checksum of the sequencingSourceFile String FALSE
2224
bibFile 0 relative path to bibFile String Path FALSE
2325
bibFileChkSum 0 md5 checksum of the bibFile String FALSE
2426
readmeFile 0 relative path to readmeFile String Path FALSE

README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -148,4 +148,6 @@ V 1.1.0: The authors of @Gassenhauer_2021 made some previously restricted sample
148148
V 1.0.0: Creation of the package
149149
```
150150

151+
## The Sequencing Source file
151152

153+
Poseidon 2.7.0 added an option to specify sequencing source data. This is a tab-separated table, much like the Janno file, but following a different schema, specified in the file `sequencingSourceFile_columns.tsv`. Note that the primary entities in this table are Sequencing entities (typically corresponding to DNA libraries or even multiple runs/lanes of the same library). The link to the Individuals listed in the Janno-file are made through a foreign-key relationship into `Poseidon_ID`.

sequencingSourceFile_columns.tsv

Lines changed: 22 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,22 @@
1+
sequencingSourceFile_column_name description data_type multi choice range choice_options range_lower range_upper mandatory unique
2+
3+
Poseidon_ID The Poseidon_ID field that this sequencing entity corresponds to, from the Janno-file. String FALSE FALSE FALSE TRUE FALSE
4+
sample_accession The sample accession code as used in INSDC databases, including ENA and SRA (Example: SAMEA7050454) String FALSE FALSE FALSE TRUE TRUE
5+
study_accession The study accession code as used in INSDC databases, including ENA and SRA (Example: PRJEB39316) String FALSE FALSE FALSE FALSE FALSE
6+
run_accession The run accession code as used in INSDC databases, including ENA and SRA (Example: ERR4331996) String FALSE FALSE FALSE FALSE FALSE
7+
sample_alias The sample alias defined by the submitter String FALSE FALSE FALSE FALSE FALSE
8+
secondary_sample_accession A secondary sample accession, as used at the ENA for historical reasons (Example: ERS4811084) String FALSE FALSE FALSE FALSE TRUE
9+
first_public The date (YYYY-MM-DD) this sample was first made public Date FALSE FALSE FALSE FALSE FALSE
10+
last_updated The date (YYYY-MM-DD) this sample was last updated Date FALSE FALSE FALSE FALSE FALSE
11+
instrument_model The name of the instrument used (Example: Illumina HiSeq 2500) String FALSE FALSE FALSE FALSE FALSE
12+
library_layout The library layout of the sequencing entity (Example: SINGLE) String FALSE FALSE FALSE FALSE FALSE
13+
library_source The source of the DNA library (Example: GENOMIC) String FALSE FALSE FALSE FALSE FALSE
14+
instrument_platform The platform brand or type of the sequencer (Example: ILLUMINA) String FALSE FALSE FALSE FALSE FALSE
15+
library_name This is the library name the submitter has entered. Can sometimes be useful to figure out which Poseidon_ID this entity belongs to String FALSE FALSE FALSE FALSE FALSE
16+
library_strategy The strategy used to create the library (Example: WGS) String FALSE FALSE FALSE FALSE FALSE
17+
fastq_ftp The FTP-link(s) (URL) to the FASTQ file(s) (Example: ftp.sra.ebi.ac.uk/vol1/fastq/ERR433/009/ERR4332639/ERR4332639.fastq.gz) URL TRUE FALSE FALSE FALSE FALSE
18+
fastq_aspera The Aspera-link (URL) to the FASTQ-file(s). (Example: fasp.sra.ebi.ac.uk:/vol1/fastq/ERR433/009/ERR4332639/ERR4332639.fastq.gz) URL TRUE FALSE FALSE FALSE FALSE
19+
fastq_bytes The number of bytes of the FASTQ-file(s) in bytes Integer TRUE FALSE TRUE 0 Inf FALSE FALSE
20+
fastq_md5 The MD5 hash(es) of the FASTQ-file(s) String TRUE FALSE FALSE FALSE FALSE
21+
read_count The number of reads Integer FALSE FALSE TRUE 0 Inf FALSE FALSE
22+
submitted_ftp The URL(s) to the originally submitted file(s) before it got converted to FASTQ. This can sometimes be helpful for processing String TRUE FALSE FALSE FALSE FALSE

0 commit comments

Comments
 (0)