STAR is an aligner for RNA sequencing. It rapidly aligns RNA-seq reads against a genome index. For more information, see the tool's GitHub repo and manual.
Function Call
tc.STAR(
read_one,
read_two=None,
output_path=None,
tool_args="",
database_name="GRCh38",
database_version="1",
is_async=False,
)
Function Arguments
See the Notes section below for more details.
Argument | Use in place of: | Description |
---|---|---|
read_one |
--readFilesIn |
Paths to single-end read input file, or R1 of paired-end read input files. The files can be a local or remote, see Using Files. |
read_two |
--readFilesIn |
(optional) Path to read 2 of paired-end read input files. This can be a local filepath or an AWS S3 URI. |
output_path |
--outFileNamePrefix |
(optional) Path (directory) to where the output files will be downloaded. If omitted, skips download. The files can be a local or remote, see Using Files. |
tool_args |
all other arguments | (optional) Additional arguments to be passed to STAR. This should be a string of arguments like the command line. See Supported Additional Arguments for more details. |
database_name |
--genomeDir * |
(optional) Name of database to use for STAR alignment. Defaults to "GRCh38" (human genome). |
database_version |
--genomeDir * |
(optional) Version of database to use for STAR alignment. Defaults to "1" . |
is_async |
Whether to run a job asynchronously. See Async Runs for more. |
See the Databases section for more details.
Notes
Single-end and paired-end inputs
Paired-end read inputs should be specified with both read_one
and read_two
.
For single-end read inputs, specify the input as read_one
argument and omit read_two
.
Tool Versions
Toolchest currently supports version 2.7.9a of STAR. Every request to run STAR with Toolchest will default to this version.
Databases
Toolchest currently supports the following databases for STAR:
database_name |
database_version |
Description |
---|---|---|
GRCh38 |
1 |
GRCh38 (human) genome, built from patch GRCh38.p13 using STAR 2.7.4a |
Supported Additional Arguments
--alignEndsProtrude
--alignEndsType
--alignInsertionFlush
--alignIntronMax
--alignIntronMin
--alignMatesGapMax
--alignSJDBoverhangMin
--alignSJoverhangMin
--alignSJstitchMismatchNmax
--alignSoftClipAtReferenceEnds
--alignSplicedMateMapLmin
--alignSplicedMateMapLminOverLmate
--alignTranscriptsPerReadNmax
--alignTranscriptsPerWindowNmax
--alignWindowsPerReadNmax
--outFilterMatchNmin
--outFilterMismatchNmax
--outFilterMismatchNoverReadLmax
--outFilterMultimapNmax
--outFilterType
--outReadsUnmapped
--outSAMstrandField
--outSAMtype
--quantMode
--quantTranscriptomeBAMcompression
--quantTranscriptomeBan
--readFilesCommand
--readFilesType
--readMapNumber
--readMatesLengthsIn
--readStrand
--runRNGseed
--scoreDelBase
--scoreDelOpen
--scoreGap
--scoreGapATAC
--scoreGapGCAG
--scoreGapNoncan
--scoreGenomicLengthLog2scale
--scoreInsBase
--scoreInsOpen
--scoreStitchSJshift
--seedMultimapNmax
--seedNoneLociPerWindow
--seedPerReadNmax
--seedPerWindowNmax
--seedSearchLmax
--seedSearchStartLmax
--seedSearchStartLmaxOverLread
--seedSplitMin
--sjdbInsertSave
--twopassMode
--winAnchorDistNbins
--winAnchorMultimapNmax
--winBinNbits
--winFlankNbins
--winReadCoverageBasesMin
--winReadCoverageRelativeMin
Additional arguments can be specified under the tool_args
argument.