You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+2Lines changed: 2 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -148,4 +148,6 @@ V 1.1.0: The authors of @Gassenhauer_2021 made some previously restricted sample
148
148
V 1.0.0: Creation of the package
149
149
```
150
150
151
+
## The Sequencing Source file
151
152
153
+
Poseidon 2.7.0 added an option to specify sequencing source data. This is a tab-separated table, much like the Janno file, but following a different schema, specified in the file `sequencingSourceFile_columns.tsv`. Note that the primary entities in this table are Sequencing entities (typically corresponding to DNA libraries or even multiple runs/lanes of the same library). The link to the Individuals listed in the Janno-file are made through a foreign-key relationship into `Poseidon_ID`.
Poseidon_IDThe Poseidon_ID field that this sequencing entity corresponds to, from the Janno-file.StringFALSEFALSEFALSETRUEFALSE
4
+
sample_accessionThe sample accession code as used in INSDC databases, including ENA and SRA (Example: SAMEA7050454)StringFALSEFALSEFALSETRUETRUE
5
+
study_accessionThe study accession code as used in INSDC databases, including ENA and SRA (Example: PRJEB39316)StringFALSEFALSEFALSEFALSEFALSE
6
+
run_accessionThe run accession code as used in INSDC databases, including ENA and SRA (Example: ERR4331996)StringFALSEFALSEFALSEFALSEFALSE
7
+
sample_aliasThe sample alias defined by the submitterStringFALSEFALSEFALSEFALSEFALSE
8
+
secondary_sample_accessionA secondary sample accession, as used at the ENA for historical reasons (Example: ERS4811084)StringFALSEFALSEFALSEFALSETRUE
9
+
first_publicThe date (YYYY-MM-DD) this sample was first made publicDateFALSEFALSEFALSEFALSEFALSE
10
+
last_updatedThe date (YYYY-MM-DD) this sample was last updatedDateFALSEFALSEFALSEFALSEFALSE
11
+
instrument_modelThe name of the instrument used (Example: Illumina HiSeq 2500)StringFALSEFALSEFALSEFALSEFALSE
12
+
library_layoutThe library layout of the sequencing entity (Example: SINGLE)StringFALSEFALSEFALSEFALSEFALSE
13
+
library_sourceThe source of the DNA library (Example: GENOMIC)StringFALSEFALSEFALSEFALSEFALSE
14
+
instrument_platformThe platform brand or type of the sequencer (Example: ILLUMINA)StringFALSEFALSEFALSEFALSEFALSE
15
+
library_nameThis is the library name the submitter has entered. Can sometimes be useful to figure out which Poseidon_ID this entity belongs toStringFALSEFALSEFALSEFALSEFALSE
16
+
library_strategyThe strategy used to create the library (Example: WGS)StringFALSEFALSEFALSEFALSEFALSE
17
+
fastq_ftpThe FTP-link(s) (URL) to the FASTQ file(s) (Example: ftp.sra.ebi.ac.uk/vol1/fastq/ERR433/009/ERR4332639/ERR4332639.fastq.gz)URLTRUEFALSEFALSEFALSEFALSE
18
+
fastq_asperaThe Aspera-link (URL) to the FASTQ-file(s). (Example: fasp.sra.ebi.ac.uk:/vol1/fastq/ERR433/009/ERR4332639/ERR4332639.fastq.gz)URLTRUEFALSEFALSEFALSEFALSE
19
+
fastq_bytesThe number of bytes of the FASTQ-file(s) in bytesIntegerTRUEFALSETRUE0InfFALSEFALSE
20
+
fastq_md5The MD5 hash(es) of the FASTQ-file(s)StringTRUEFALSEFALSEFALSEFALSE
21
+
read_countThe number of readsIntegerFALSEFALSETRUE0InfFALSEFALSE
22
+
submitted_ftpThe URL(s) to the originally submitted file(s) before it got converted to FASTQ. This can sometimes be helpful for processingStringTRUEFALSEFALSEFALSEFALSE
0 commit comments