The 10x Chromium Single Cell 3′ Gene Expression kit captures polyadenylated mRNA at the 3′ end inside nano-litre droplets (GEMs). Each GEM contains a single cell and one gel bead. The gel bead carries tens of thousands of barcoded oligo-dT primers, one unique 16-bp cell barcode (CBC) and many unique UMI tags.
Multiple chemistry versions exist:
| Version | Cell Barcode | UMI | Status |
|---|---|---|---|
| V2 | 16 bp | 10 bp | Discontinued |
| V3 / V3.1 | 16 bp | 12 bp | Current |
| V4 (GEM-X) | 16 bp | 12 bp | Current |
Library structure is identical between V3, V3.1, and V4; V2 differs only in UMI length. Unless noted, V2 steps are drawn below.
Adapter and Primer Sequences
Bead oligo-dT (V2):
|--5'- CTACACGACGCTCTTCCGATCT[16-bp CBC][10-bp UMI](T)30VN -3'
Bead oligo-dT (V3 / V3.1 / V4):
|--5'- CTACACGACGCTCTTCCGATCT[16-bp CBC][12-bp UMI](T)30VN -3'
Template Switching Oligo (TSO):
5'- AAGCAGTGGTATCAACGCAGAGTACATGGGRGG -3'
(rG = riboguanosine)
cDNA Forward primer: 5'- CTACACGACGCTCTTCCGATCT -3'
cDNA Reverse primer:
V2: 5'- AAGCAGTGGTATCAACGCAGAGTACAT -3'
V3/V3.1/V4: 5'- AAGCAGTGGTATCAACGCAGAG -3'
TruSeq Read 1 primer: 5'- ACACTCTTTCCCTACACGACGCTCTTCCGATCT -3'
TruSeq Read 2 primer: 5'- GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT -3'
TruSeq adapter (dsDNA, T overhang):
5'- GATCGGAAGAGCACACGTCTGAACTCCAGTCAC -3'
3'- TCTAGCCTTCTCG -5'
Library PCR primer 1: 5'- AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTC -3'
Library PCR primer 2: 5'- CAAGCAGAAGACGGCATACGAGAT[8-bp sample index]GTGACTGGAGTTCAGACGTGT -3'
Sample index sequencing primer: 5'- GATCGGAAGAGCACACGTCTGAACTCCAGTCAC -3'
Illumina P5: 5'- AATGATACGGCGACCACCGAGATCTACAC -3'
Illumina P7: 5'- CAAGCAGAAGACGGCATACGAGAT -3'
Step-by-Step Library Generation
Step 1 — mRNA capture and reverse transcription inside GEMs
Poly-A mRNA is captured by the oligo-dT bead primer inside the droplet. MMLV reverse transcriptase extends using the mRNA as template.
|--5'-CTACACGACGCTCTTCCGATCT [16-bp CBC] [10-bp UMI] (T)30VN--------> (A)30B-mRNA-5'
Step 2 — Terminal transferase adds extra Cs (non-templated)
MMLV’s terminal transferase activity appends 2–3 extra cytosines to the 3′ end of the cDNA.
|--5'-CTACACGACGCTCTTCCGATCT [16-bp CBC] [10-bp UMI] (dT)VN-cDNA-CCC -3' (pA)B-mRNA-5'
Step 3 — TSO template switching for second strand synthesis
The TSO’s 3′ riboguanosines anneal to the non-templated Cs, and MMLV switches template to extend through the TSO.
|--5'-CTACACGACGCTCTTCCGATCT [16-bp CBC] [10-bp UMI] (dT)VN-cDNA-CCC----------> <---------GGGrGrGrGATGTACTCTGCGTTGATACCACTGCTT -5' (TSO)
Full-length first-strand cDNA:
|--5'-CTACACGACGCTCTTCCGATCT [CBC] [UMI] (dT)VN-cDNA-CCATGTACTCTGCGTTGATACCACTGCTT -3' 3'-GATGTGCTGCGAGAAGGCTAGA [CBC] [UMI] (pA)B-cDNA-GGTACATGAGACGCAACTATGGTGACGAA -5'
Step 4 — cDNA amplification with Forward and Reverse primers
PCR amplifies full-length cDNA using the cDNA Forward primer (anneals to TruSeq Read 1 portion) and cDNA Reverse primer (anneals to TSO portion).
5'-CTACACGACGCTCTTCCGATCT --------> |--5'-CTACACGACGCTCTTCCGATCT [CBC] [UMI] (dT)VN-cDNA-CCATGTACTCTGCGTTGATACCACTGCTT -3' 3'-GATGTGCTGCGAGAAGGCTAGA [CBC] [UMI] (pA)B-cDNA-GGTACATGAGACGCAACTATGGTGACGAA -5' <--------TACATGAGACGCAACTATGGTGACGAA -5'
Step 5 — Fragmentase fragmentation and A-tailing
The amplified cDNA is enzymatically fragmented and A-tailed, producing three fragment classes:
Product 1 (TSO-containing 5′ fragment — not amplifiable in next step):
5'-AAGCAGTGGTATCAACGCAGAGTACATGGG -cDNA-*A -3' 3'- A*TTCGTCACCATAGTTGCGTCTCATGTACCC -cDNA -5'
Product 2 (internal fragment — semi-suppressive PCR, not amplifiable):
5'- cDNA ...*A -3' 3'- A* cDNA -5'
Product 3 (Read 1 + CBC + UMI + 3′ cDNA — the only amplifiable fragment):
5'-CTACACGACGCTCTTCCGATCT [CBC] [UMI] (dT)VN-cDNA*A -3' 3'- A*GATGTGCTGCGAGAAGGCTAGA [CBC] [UMI] (pA)B-cDNA -5'
Step 6 — TruSeq adapter ligation
Double-stranded TruSeq adapter (T overhang) is ligated to the A-tailed Product 3:
5'-CTACACGACGCTCTTCCGATCT [CBC] [UMI] (dT)VN-cDNA-AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC -3' 3'- A*GATGTGCTGCGAGAAGGCTAGA [CBC] [UMI] (pA)B-cDNA-TCTAGCCTTCTCG -5'
Step 7 — Library PCR amplification
Library PCR Primer 1 (extends from P5 through TruSeq Read 1) and Library PCR Primer 2 (carries sample index and P7) amplify the final library.
5'-AATGATACGGCGACCACCGAGATCTACAC TCTTTCCCTACACGACGCTC --------> 5'-CTACACGACGCTCTTCCGATCT [CBC] [UMI] (dT)VN-cDNA-AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC -3' 3'- A*GATGTGCTGCGAGAAGGCTAGA [CBC] [UMI] (pA)B-cDNA-TCTAGCCTTCTCG -5' <---------TGTGCAGACTTGAGGTCAGTG [8-bp idx] TAGAGCATACGGCAGAAGACGAAC -5'
Final Library Structure
V2 (16 bp CBC + 10 bp UMI)
5'-AATGATACGGCGACCACCGAGATCTACAC TCTTTCCCTACACGACGCTCTTCCGATCT NNNNNNNNNNNNNNNN NNNNNNNNNN (dT)VN-cDNA-AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC NNNNNNNN ATCTCGTATGCCGTCTTCTGCTTG -3' 3'-TTACTATGCCGCTGGTGGCTCTAGATGTG AGAAAGGGATGTGCTGCGAGAAGGCTAGA NNNNNNNNNNNNNNNN NNNNNNNNNN (pA)B-cDNA-TCTAGCCTTCTCGTGTGCAGACTTGAGGTCAGTG NNNNNNNN TAGAGCATACGGCAGAAGACGAAC -5'Illumina P5 TruSeq Read 1 16 bp CBC 10 bp UMI cDNATruSeq Read 2 8 bp idx Illumina P7
V3 / V3.1 / V4 (16 bp CBC + 12 bp UMI)
5'-AATGATACGGCGACCACCGAGATCTACAC TCTTTCCCTACACGACGCTCTTCCGATCT NNNNNNNNNNNNNNNN NNNNNNNNNNNN (dT)VN-cDNA-AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC NNNNNNNN ATCTCGTATGCCGTCTTCTGCTTG -3' 3'-TTACTATGCCGCTGGTGGCTCTAGATGTG AGAAAGGGATGTGCTGCGAGAAGGCTAGA NNNNNNNNNNNNNNNN NNNNNNNNNNNN (pA)B-cDNA-TCTAGCCTTCTCGTGTGCAGACTTGAGGTCAGTG NNNNNNNN TAGAGCATACGGCAGAAGACGAAC -5'Illumina P5 TruSeq Read 1 16 bp CBC 12 bp UMI cDNATruSeq Read 2 8 bp idx Illumina P7
Library Sequencing
Step 1 — Read 1: cell barcode + UMI (bottom strand as template)
5'-ACACTCTTTCCCTACACGACGCTCTTCCGATCT -------------------------> 3'-TTACTATGCCGCTGGTGGCTCTAGATGTG AGAAAGGGATGTGCTGCGAGAAGGCTAGA NNNNNNNNNNNNNNNN NNNNNNNNNN (pA)B-cDNA-TCTAGCCTTCTCGTGTGCAGACTTGAGGTCAGTG NNNNNNNN TAGAGCATACGGCAGAAGACGAAC -5'
Sequence 26 cycles (V2: 16 bp CBC + 10 bp UMI) or 28 cycles (V3+: 16 bp CBC + 12 bp UMI).
Step 2 — Index 1 (i7): sample index (bottom strand as template)
5'-GATCGGAAGAGCACACGTCTGAACTCCAGTCAC -------> 3'-TTACTATGCCGCTGGTGGCTCTAGATGTG AGAAAGGGATGTGCTGCGAGAAGGCTAGA NNNNNNNNNNNNNNNN NNNNNNNNNN (pA)B-cDNA-TCTAGCCTTCTCGTGTGCAGACTTGAGGTCAGTG NNNNNNNN TAGAGCATACGGCAGAAGACGAAC -5'
8 cycles.
Step 3 — Read 2: cDNA (top strand as template, cluster regeneration)
5'-AATGATACGGCGACCACCGAGATCTACAC TCTTTCCCTACACGACGCTCTTCCGATCT NNNNNNNNNNNNNNNN NNNNNNNNNN (dT)VN-cDNA-AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC NNNNNNNN ATCTCGTATGCCGTCTTCTGCTTG -3' <------TCTAGCCTTCTCGTGTGCAGACTTGAGGTCAGTG -5'
98 cycles (cDNA).
Key Points
- Read 1 carries the cell barcode and UMI — it does not contain cDNA sequence.
- Read 2 carries the cDNA fragment — used for gene alignment.
- Only one sample index (i7) is used; i5 is not used in this assay.
- Cell Ranger uses the whitelist for CBC correction during demultiplexing.