Skip to content

Commit ca1b3d4

Browse files
committed
Updated docs
1 parent b3000fc commit ca1b3d4

23 files changed

+12134
-161
lines changed

docs/_sources/changes.rst.txt

Lines changed: 578 additions & 0 deletions
Large diffs are not rendered by default.

docs/_sources/contact.rst.txt

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,4 @@
1+
Contact
2+
-------
3+
4+
Author: Jose Fernandez Navarro <[email protected]>

docs/_sources/example.rst.txt

Lines changed: 39 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,39 @@
1+
Examples
2+
--------
3+
4+
The following is an example of an BASH file to run the ST pipeline.
5+
This example is for version 1.6.0 of the ST pipeline.
6+
7+
.. code-block:: bash
8+
9+
#!/bin/bash
10+
11+
# FASTQ reads
12+
FW=/YOUR_RUN/R1.fastq.gz
13+
RV=/YOUR_RUN/R2.fastq.gz
14+
15+
# References for mapping, annotation and nonRNA-filtering
16+
MAP=/mouse/GRCm38_86v2/StarIndex
17+
ANN=/mouse/GRCm38_86v2/annotation/annotation.gtf
18+
CONT=/mouse/GRCm38_86v2/ncRNA/StarIndex
19+
20+
# Barcodes settings
21+
ID=/stpipeline/ids/YOUR_IDs.txt
22+
23+
# Output folder and experiment name
24+
# Do not use / or \ in the experiment name
25+
OUTPUT=/your_experiment_folder
26+
EXP=YOUR_EXP_NAME
27+
28+
# Running the pipeline
29+
st_pipeline_run.py \
30+
--output-folder $OUTPUT \
31+
--ids $ID \
32+
--ref-map $MAP \
33+
--ref-annotation $ANN \
34+
--expName $EXP \
35+
--htseq-no-ambiguous \
36+
--verbose \
37+
--log-file $OUTPUT/${EXP}_log.txt \
38+
--contaminant-index $CONT \
39+
$FW $RV

docs/_sources/index.rst.txt

Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,26 @@
1+
.. ST Pipeline documentation.
2+
3+
Welcome to ST Pipeline's documentation!
4+
=======================================
5+
6+
Contents:
7+
8+
.. toctree::
9+
:maxdepth: 2
10+
11+
intro
12+
installation
13+
manual
14+
example
15+
changes
16+
license
17+
contact
18+
19+
20+
21+
Indices and tables
22+
==================
23+
24+
* :ref:`genindex`
25+
* :ref:`modindex`
26+
* :ref:`search`

docs/_sources/installation.rst.txt

Lines changed: 57 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,57 @@
1+
Installing the Spatial Transcriptomics pipeline
2+
-----------------------------------------------
3+
4+
These are the general instructions for installing the st_pipeline from scratch
5+
on you compute environment. All the commands can be performed as a user with no
6+
elevated permissions.
7+
8+
We recommend to download and install Anaconda (https://www.anaconda.com/products/individual)
9+
10+
We then create a virtual environment from which we will run the pipeline in.
11+
Type the following command:
12+
13+
``conda create -n pipeline python=3.6 anaconda``
14+
15+
The name for the virtual environment that we have just created is specified by
16+
the -n flag. Here is is called pipeline, but this can be anything that you want
17+
to name it. To run the pipeline, this virtual environment must be activated. To
18+
activate the virtual environment, enter the following command:
19+
20+
``source activate pipeline``
21+
22+
Where pipeline is the name of your virtual environment (here the virtual
23+
environment is called pipeline). To deactivate the virtual environment, type the
24+
following command:
25+
26+
``source deactivate``
27+
28+
You need to obtain the pipeline from github to use it. The following steps will
29+
tell you how to perform this.
30+
31+
Change to your home directory
32+
33+
``cd``
34+
35+
Clone the repository from github
36+
37+
``git clone git://github.com/SpatialTranscriptomicsResearch/st_pipeline.git``
38+
39+
Change into the st_pipeline directory
40+
41+
``cd st_pipeline``
42+
43+
Activate the virtual environment (if not already active)
44+
45+
``source activate pipeline``
46+
47+
Install the pipeline
48+
49+
``python setup.py build``
50+
51+
``python setup.py install``
52+
53+
Alternatively, you can simply install the pipeline using PyPy:
54+
55+
``pip install stpipeline``
56+
57+
Now the pipeline is installed and ready to be run.

docs/_sources/intro.rst.txt

Lines changed: 61 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,61 @@
1+
Introduction
2+
------------
3+
4+
The ST Pipeline contains the tools and scripts needed to process
5+
and analyze the raw files generated with the Spatial Transcriptomics
6+
or Visium in FASTQ format to generate datasets for down-stream analysis.
7+
The ST pipeline can also be used to process single cell RNA-seq data as
8+
long as a file with barcodes identifying each cell is provided.
9+
The ST Pipeline can also process RNA-Seq datasets generated with
10+
or without UMIs.
11+
12+
The ST Pipeline has been optimized for speed, robustness and
13+
it is very easy to use with many parameters to adjust all the settings.
14+
The ST Pipeline is fully parallel and has constant memory use.
15+
The ST Pipeline allows to skip any of the steps and to use the
16+
genome or the transcriptome as reference.
17+
18+
The following files/parameters are required:
19+
20+
- FASTQ files (Read 1 containing the spatial information and the UMI
21+
and read 2 containing the genomic sequence)
22+
- A genome index generated with STAR
23+
- An annotation file in GTF or GFF format (optional)
24+
- The file containing the barcodes and array coordinates
25+
(look at the folder "ids" and chose the correct one).
26+
Basically this file contains 3 columns (BARCODE, X and Y),
27+
so if you provide this file with barcodes identinfying cells (for example),
28+
the ST pipeline can be used for single cell data.
29+
This file is optional too.
30+
- A name for the dataset
31+
32+
The ST pipeline has multiple parameters mostly related to trimming,
33+
mapping and annotation but generally the default values are good enough.
34+
You can see a full description of the parameters
35+
typing "st_pipeline_run.py --help" after you have installed the ST pipeline.
36+
37+
The input FASTQ files can be given in gzip/bzip format as well.
38+
39+
Basically what the ST pipeline does is:
40+
41+
- Quality trimming (read 1 and read 2):
42+
- Remove low quality bases
43+
- Sanity check (reads same length, reads order, etc..)
44+
- Check quality UMI (if provided)
45+
- Remove artifacts (PolyT, PolyA, PolyG, PolyN and PolyC) of user defined length
46+
- Check for AT and GC content
47+
- Discard reads with a minimum number of bases of that failed any of the checks above
48+
- Contamimant filter e.x. rRNA genome (Optional)
49+
- Mapping with STAR (only read 2)
50+
- Demultiplexing with [Taggd](https://github.com/SpatialTranscriptomicsResearch/taggd) (only read 1)
51+
- Keep reads (read 2) that contain a valid barcode and are correctly mapped
52+
- Annotate the reads with htseq-count (optional)
53+
- Group annotated reads by barcode(spot position) and gene to get a read count
54+
- In the grouping/counting only unique molecules (UMIs) are kept.
55+
56+
You can see a graphical more detailed description of the workflow in the documents workflow.pdf and workflow_extended.pdf
57+
58+
The output will be a matrix of counts (genes as columns, spots as rows),
59+
a BED file containing the transcripts (Read name, coordinate, gene, etc..), and a JSON
60+
file with useful stats.
61+
The ST pipeline will also output a log file with useful information.

docs/_sources/license.rst.txt

Lines changed: 25 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,25 @@
1+
License
2+
-------
3+
4+
The MIT License (MIT)
5+
Copyright (c) 2016 Jose Fernandez Navarro.
6+
All rights reserved.
7+
8+
* Jose Fernandez Navarro <[email protected]>
9+
10+
Permission is hereby granted, free of charge, to any person obtaining a copy of
11+
this software and associated documentation files (the "Software"), to deal in
12+
the Software without restriction, including without limitation the rights to
13+
use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of
14+
the Software, and to permit persons to whom the Software is furnished to do so,
15+
subject to the following conditions:
16+
17+
The above copyright notice and this permission notice shall be included in all
18+
copies or substantial portions of the Software.
19+
20+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
21+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS
22+
FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR
23+
COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER
24+
IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN
25+
CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

0 commit comments

Comments
 (0)