Space Ranger1.2, printed on 12/18/2024
spaceranger mkfastq is available for converting BCLs to Space Ranger-compatible FASTQs. You can also use Illumina bcl2fastq directly to generate FASTQs. Choose this method if bcl2fastq is tightly integrated into your sequencing workflow or if you want more control over demultiplexing parameters.
Demultiplexing Visium data with Illumina bcl2fastq requires the correct specification of the sample sheet and command-line options. This guide will walk you through what you'll need to do to generate Space Ranger-compatible FASTQs.
Visium Spatial Gene Expression libraries include 'unique dual-indexing' sample indexes. This means that there is a unique sample index barcode in the both the i7
and i5
index reads (also known as I1
and I2
). When demultiplexing flowcells where both index reads have been sequenced, bcl2fastq requires that both index sequences match the expected sequence for a read to be assigned to that sample. This solves the 'index hopping' issue present on Illumina patterned flowcell sequencers.
Visium libraries can be successfully demultiplexed using only the i7 (also known as I1) index read, but index-hopped reads will not be filtered from your data. Because each slide area uses the same spot barcodes, index hopping may lead to visible artifacts in your results. Therefore we strongly recommend sequencing both index reads. |
You can download the Sample Index Reference for the Dual Index Plate TT, Set A here: CSV | JSON
The index
sequence in the sample index reference file should be entered into the index
column of the bcl2fastq
sample sheet.
Either the index2_workflow_a
or index2_workflow_b
sequence should be entered into the index2
column of the bcl2fastq
sample sheet, depending
on the sequencing instrument in use.
index2_workflow_a
: NovaSeq™ 6000 v1, MiSeq™, HiSeq™ 2500, and HiSeq™ 2000.index2_workflow_b
: NovaSeq™ 6000 v1.5, iSeq™ 100, MiniSeq™, NextSeq™, HiSeq™ X, and HiSeq™ 3000/4000.More information about dual-indexing is available in the Illumina Indexed Sequencing Overview Guide
The Illumina Experiment Manager can also be used to create sample sheets for use with bcl2fastq.
When you plan an experiment, you should know the name of the sample index set used for each sample, which comes from the reagent kit (such as "SI-TT-A2"). For each sample, enter its lane, sample name, and sample index set into the Illumina bcl2fastq sample sheet. Here is a bcl2fastq sample sheet for a HiSeq 2500:
[Header] EMFileVersion,4 [Reads] 28 120 [Data] Lane,Sample_ID,Sample_Name,index,index2,Sample_Project,Original_Sample_ID 1,test_sample,test_sample,TGGTCCCAAG,CCTCTGGCGT,H5T2YBCX3,test_sample
In the examples below, ${FLOWCELL_DIR}
is the directory that contains a flowcell's Data folder, ${OUTPUT_DIR}
is the directory that you want to output FASTQs to, and
${SAMPLE_SHEET_PATH}
is the path to the sample sheet CSV you created.
bcl2fastq Version 2.17 or higher
$ bcl2fastq \ --minimum-trimmed-read-length=8 \ --mask-short-adapter-reads=8 \ --ignore-missing-positions \ --ignore-missing-controls \ --ignore-missing-filter \ --ignore-missing-bcls \ -r 6 -w 6 \ -R ${FLOWCELL_DIR} \ --output-dir=${OUTPUT_DIR} \ --interop-dir=${INTEROP_DIR} \ --sample-sheet=${SAMPLE_SHEET_PATH}
After generating FASTQs, follow the Single Sample Analysis
documentation. You will pass the value in the Sample_Name
column of
the sample sheet to --sample
argument to
spaceranger count
. The value of the
--fastqs
argument should be
${OUTPUT_DIR}/${PROJECT_NAME}
where
${OUTPUT_DIR}
is as defined above and ${PROJECT_NAME}
is the value in the Sample_Project
column in the sample sheet.