Illumina adapter and primer sequences

Source: http://bioinformatics.cvr.ac.uk/blog/illumina-adapter-and-primer-sequences/

Illumina Adapter and Primer Sequences

Illumina libraries are normally constructed by ligating adapters to short fragments (100 – 1000bp) of DNA. The exception to this is if Nextera is used (see end of this post) or where PCR amplicons have been constructed that already incorporate the P5/P7 ends that bind to the flowcell.

Illumina Paired End Adapters (cannot be used for multiplexing)

Top adapter
5′ ACACTCTTTCCCTACACGACGCTCTTCCGATC*T 3’

Bottom adapter
5′ P-GATCGGAAGAGCGGTTCAGCAGGAATGCCGAG 3’

Note that the last 12nt of the adapters are complementary (when the bottom adapter is viewed 3’-5’ as below) hence the name ‘forked adapters’. The adapters are annealed together then ligated to both ends of the library DNA. The bottom adapter is 5’-phosphorylated in order to promote ligation. The top adapter has a phosphorothioate bond (*) before the terminal T, ensuring that exonucleases cannot digest the T overhang that pairs to the A-tail added to library fragments.

Structure of Illumina forked PE adapter

5’ ACACTCTTTCCCTACACGACGCTCTTCCGATC*T 3’
|||||||||||| <-Ligated to library fragment
3’ GAGCCGTAAGGACGACTTGGCGAGAAGGCTAG-P 5’
1
2
3

5’ ACACTCTTTCCCTACACGACGCTCTTCCGATC*T 3’
|||||||||||| <-Ligated to library fragment
3’ GAGCCGTAAGGACGACTTGGCGAGAAGGCTAG-P 5’

PCR with partially complementary primers then extends the ends and resolves the forks, adding unique termini that bind to the oligos on the surface of the flow cell (P5 blue/P7 red, also see diagram at foot of page).

PE PCR Primer 1.0 (P5) (same as universal adapter)
5' AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATC*T 3’

PE PCR Primer 2.0 (P7)
5' CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATC*T 3’

Structure of Illumina TruSeq™ indexed forked adapters

5’ AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATC*T
|||||||||||| 3’GTTCGTCTTCTGCCGTATGCTCTA-index-CACTGACCTCAAGTCTGCACACGAGAAGGCTAG-P here
1
2
3

5’ AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATC*T
|||||||||||| 3’GTTCGTCTTCTGCCGTATGCTCTA-index-CACTGACCTCAAGTCTGCACACGAGAAGGCTAG-P here

The last 12nt of the adapters are complementary, allowing them to anneal and form the forked structure. The adapter is ligated to both ends of the A-tailed DNA library, generating larger floppy overhangs than with the paired-end adapters on the first page. Note that while the top adapter is identical to the Illumina Universal oligo, the bottom adapter is different to the PE adapter in the purple highlighted section. The adapter already has the index and complete P7/P5 ends.

PCR with the following primers resolves the forked ends to generate products with no floppy overhangs. The sequences that bind to the flow cell (P5 blue/P7 red) finish up at opposite ends of the library fragments.

PCR Primer 1.0 (P5)
5’ AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGA 3’

PCR Primer 2.0 (P7)
5’ CAAGCAGAAGACGGCATACGAGAT 3’

The following oligos (provided in the MiSeq reagent cartridge) are used to prime the sequence reads. Note that the index read primer is complementary to the Read 2 sequencing primer (see diagram below). This is used to sequence the hexamer index tag in the forward direction after read 1 is complete, before the reverse strand is synthesised by bridge amplification.

Multiplexing Read 1 Sequencing Primer
5' ACACTCTTTCCCTACACGACGCTCTTCCGATCT

Multiplexing Index Read Sequencing Primer
5' GATCGGAAGAGCACACGTCTGAACTCCAGTCAC

Multiplexing Read 2 Sequencing Primer
5' GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT

When ordering primers for use in Illumina libraries, make certain to include the modifications (e.g. 5’-phosphorylation and phosphorothioate bonds on the 3' terminal nucleotide) and ensure the oligos are PAGE purified. Even small amounts of n-1 primers will lead to messy out-of-phase sequencing and cause clusters to fail filtering. Costs per oligo for 0.2µmole synthesis scale and PAGE are in region of £40.

TruSeq™ DNA Sample Prep Kit v2

There are currently two versions of the kit, each with 12 different adapters that incorporate unique index tags – allowing samples to be multiplexed on the same sequencing run.

Kit A contains indexes: 2, 4, 5, 6, 7, 12, 20, 21, 22, 23, 25, 27.
Kit B contains indexes: 1, 3, 8, 9, 10, 11, 13, 14, 15, 16, 18, 19.

NOTE that all the indexed adapters should be 5’-Phosphorylated. For unknown reasons adapters 13-27 have an extra 2 bases (these are not used for the indexing). Illumina also reserve certain numbers e.g. 17, 24 and 26. The 6-base index tag sequences are in italics below.

TruSeq Universal Adapter
5’ AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATC*T

TruSeq Adapter, Index 1
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACATCACGATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 2
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGATGTATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 3
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACTTAGGCATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 4
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACTGACCAATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 5
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 6
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACGCCAATATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 7
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACCAGATCATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 8
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACACTTGAATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 9
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 10
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACTAGCTTATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 11
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACGGCTACATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 12
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTTGTAATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 13
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTCAACAATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 14
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTTCCGTATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 15
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACATGTCAGAATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 16
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCCGATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 18
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 19
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGAAACGATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 20
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGGCCTTATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 21
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTTTCGGAATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 22
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGTACGTAATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 23
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACGAGTGGATATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 25
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACACTGATATATCTCGTATGCCGTCTTCTGCTTG
TruSeq Adapter, Index 27
5’ GATCGGAAGAGCACACGTCTGAACTCCAGTCACATTCCTTTATCTCGTATGCCGTCTTCTGCTTG

NEBNext® DNA Library Prep

The NEB kit uses a short adapter which is supplied as a single self-complimentary oligo forming a stem-loop. It has a Uracil base that is later cleaved and removed by Uracil Glycosylase and base excision repair enzyme mix (USER).

NEBNext adaptor for Illumina

5’ P-GATCGGAAGAGCACACGTCTGAACTCCAGTC-U-ACACTCTTTCCCTACACGACGCTCTTCCGATC*T 3’

Oligo is designed to self-anneal forming a stem-loop structure as below. This may help to prevent formation of adapter dimers during ligation.

/ACACTCTTTCCCTACACGACGCTCTTCCGATC*T 3’
U |||||||||||| <-Ligated to library fragments
-CTGACCTCAAGTCTGCACACGAGAAGGCTAG-P 5’
1
2
3

/ACACTCTTTCCCTACACGACGCTCTTCCGATC*T 3’
U |||||||||||| <-Ligated to library fragments
-CTGACCTCAAGTCTGCACACGAGAAGGCTAG-P 5’

                        5’  ACACTCTTTCCCTACACGACGCTCTTCCGATC*T------

Uracil cleaved and removed ||||||||||||
to form forked structure 3’ CTGACCTCAAGTCTGCACACGAGAAGGCTAG--------
1
2
3

                        5’  ACACTCTTTCCCTACACGACGCTCTTCCGATC*T------

Uracil cleaved and removed ||||||||||||
to form forked structure 3’ CTGACCTCAAGTCTGCACACGAGAAGGCTAG--------

The Index tags and the P5/P7 ends are added by PCR using universal and tagged primers. The end result is exactly the same as TruSeq.

NEBnext Universal primer
AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATC*T

NEBnext Indexed primers 1 – 12 (6-mer indexes)

Index 1 CAAGCAGAAGACGGCATACGAGATCGTGATGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT
Index 2 CAAGCAGAAGACGGCATACGAGATACATCGGTGACTGGAGTTCAGACGTGTGCTCTTCCGATC
T
Index 3 CAAGCAGAAGACGGCATACGAGATGCCTAAGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT
Index 4 CAAGCAGAAGACGGCATACGAGATTGGTCAGTGACTGGAGTTCAGACGTGTGCTCTTCCGATC
T
Index 5 CAAGCAGAAGACGGCATACGAGATCACTGTGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT
Index 6 CAAGCAGAAGACGGCATACGAGATATTGGCGTGACTGGAGTTCAGACGTGTGCTCTTCCGATC
T
Index 7 CAAGCAGAAGACGGCATACGAGATGATCTGGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT
Index 8 CAAGCAGAAGACGGCATACGAGATTCAAGTGTGACTGGAGTTCAGACGTGTGCTCTTCCGATC
T
Index 9 CAAGCAGAAGACGGCATACGAGATCTGATCGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT
Index 10 CAAGCAGAAGACGGCATACGAGATAAGCTAGTGACTGGAGTTCAGACGTGTGCTCTTCCGATC
T
Index 11 CAAGCAGAAGACGGCATACGAGATGTAGCCGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT
Index 12 CAAGCAGAAGACGGCATACGAGATTACAAGGTGACTGGAGTTCAGACGTGTGCTCTTCCGATC
T

SURESELECT (POST-CAPTURE INDEXING)

This begins with a shorter bottom adapter that is extended to add the P5 end in the pre-capture PCR. The post-capture PCR step adds the index and P7 end. Note The NEB adapter is more efficient than the InPE adapter in my comparative tests.

InPE adapter (indexing paired end adapter)

5' ACACTCTTTCCCTACACGACGCTCTTCCGATC*T 3’
|||||||||||| <-Ligated to library fragments
3’ TCTGCACACGAGAAGGCTAG-P 5’
1
2
3

5' ACACTCTTTCCCTACACGACGCTCTTCCGATC*T 3’
|||||||||||| <-Ligated to library fragments
3’ TCTGCACACGAGAAGGCTAG-P 5’

PRE-CAPTURE PCR

PCR Primer 1.0 [Tm 70deg]
5’ AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACG*A

Multiplex PCR Primer 2.0 [Tm 67deg]
5' GTGACTGGAGTTCAGACGTGTGCTCTTCCGAT*C

POST-CAPTURE PCR Indexing Primers
2nd PCR reaction (post-capture amplification) adds indexes and P7 sequence.

Universal Primer [Tm 75deg]
5' AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATC*T

PCR Primer, Index 1 5’ CAAGCAGAAGACGGCATACGAGATCGTGATGTGACTGGAGTTC
PCR Primer, Index 2 5’ CAAGCAGAAGACGGCATACGAGATACATCGGTGACTGGAGTT
C
PCR Primer, Index 3 5’ CAAGCAGAAGACGGCATACGAGATGCCTAAGTGACTGGAGTTC
PCR Primer, Index 4 5’ CAAGCAGAAGACGGCATACGAGATTGGTCAGTGACTGGAGTT
C
PCR Primer, Index 5 5’ CAAGCAGAAGACGGCATACGAGATCACTGTGTGACTGGAGTTC
PCR Primer, Index 6 5’ CAAGCAGAAGACGGCATACGAGATATTGGCGTGACTGGAGTT
C
PCR Primer, Index 7 5’ CAAGCAGAAGACGGCATACGAGATGATCTGGTGACTGGAGTTC
PCR Primer, Index 8 5’ CAAGCAGAAGACGGCATACGAGATTCAAGTGTGACTGGAGTT
C
PCR Primer, Index 9 5’ CAAGCAGAAGACGGCATACGAGATCTGATCGTGACTGGAGTTC
PCR Primer, Index 10 5’ CAAGCAGAAGACGGCATACGAGATAAGCTAGTGACTGGAGTT
C
PCR Primer, Index 11 5’ CAAGCAGAAGACGGCATACGAGATGTAGCCGTGACTGGAGTTC
PCR Primer, Index 12 5’ CAAGCAGAAGACGGCATACGAGATTACAAGGTGACTGGAGTT
C

Guidelines for Low-Level Pooling
Some sequencing experiments require the use of fewer than 12 index sequences in a lane with a high cluster density. In such cases, a careful selection of indexes is required to ensure optimum cluster discrimination by having different bases at each cycle of the index read. Illumina recommends the following sets of indexes for low-level pooling experiments

Pool of 2 samples:
Index 6 GCCAAT
Index 12 CTTGTA

Pool of 3 samples:
Index 4 TGACCA
Index 6 GCCAAT
Index 12 CTTGTA

Pool of 6 samples:
Index 2 CGATGT
Index 4 TGACCA
Index 5 ACAGTG
Index 6 GCCAAT
Index 7 CAGATC
Index 12 CTTGTA

Nextera Sample Preparation

The sequences in Nextera libraries are different to all the other workflows.

Nextera® transposase sequences (FC-121-1031, FC-121-1030)

The Tn5 transposase cuts the sample DNA and adds the following sequence at either end of each fragment, with the highlighted sequence next to the library insert.

5’ TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG
Read 1 >

5’ GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG
Read 2 >

Nextera® Index Kit – PCR primers (FC-121-1012, FC-121-1011)

PCR with the following primers adds the P5 and P7 termimi that bind to the flowcell and also the dual 8bp index tags (denoted by the i5 and i7 below).

5’ AATGATACGGCGACCACCGAGATCTACAC[i5]TCGTCGGCAGCGTC

5’ CAAGCAGAAGACGGCATACGAGAT[i7]GTCTCGTGGGCTCGG

If trimming adapters from Nextera runs should cut the reads at CTGTCTCTTATACACATCT instead of the usual AGATCGGAAGAGC. Use of cutadapt, trim_galore or similar program is recommended with custom adapter specified.

你可能感兴趣的:(Illumina adapter and primer sequences)