Professional Documents
Culture Documents
……TCTATCTGTCTGTCTGTCTATCTATCTA……
……TCTATCTGTCTGTCTGTCTATCTATCTA……
A B ……TCTATCTGTCTGTCTGTCTATCTATCTA……
……TCTATCTGTCTGTCTATCTATCTATCTA……
……TCTATCTGTCTGTCTGTCTATCTATCTA……
……TCTATCTGTCTGTCTATCTATCTATCTA……
……TCTATCTGTCTGTCTGTCTATCTATCTA……
……TCTATCTGTCTGTCTATCTATCTATCTA……
……TCTATCTGTCTGTCTGTCTATCTATCTA……
……TCTATCTGTCTGTCTATCTATCTATCTA……
……TCTATCTGTCTGTCTATCTATCTATCTA……
……TCTATCTGTCTGTCTATCTATCTATCTA……
……
A ……TCTATCTGTCTGTCTGTCTATCTATCTA…… 16 allele
Criteria
- Target marker
- Basically 20 extended CODIS STR and Amelogenin + additional STRs
- 23 PowerPlex Y-STRs and Y-M175
- Microhaplptypes suggested by Kidd et al.
- Small sized amplicons is adapted as possible
- while primer is not overlapping with core region
- finally ranged in 70bp ~ 270bp
- Avoid SNP with 1% variation reported in primer binding area
Resource
- STRBase (http://www.cstl.nist.gov/div831/strbase/)
- UCSC genome browser (http://genome.ucsc.edu/)
- Primer 3 v.0.4.0 (http://frodo.wi.mit.edu/primer3/)
Current workflow for MPS on a MiSeq system
• Template DNA • Fluorometer • TruSeq Nano DNA LT • Library Quantification • Cluster generation and
; 1 ng of Korean DNAs ; Quant-iT™ PicoGreen Sample preparation Kit ; KAPA library quantification kit sequencing on MiSeq
dsDNA assays (invitrogen) ; 2 x 250 bp (Paired-end)
• Amplicon purification
: Enzymatic purification • Agilent BioAnalyzer • Agilent BioAnalyzer
using EXO-SAP IT
② End repairs
①
③ Beads purifications
④ Adenylate 3’ ends
⑤ ②
⑥ Beads purifications
Autosomal STR
Y-STR
Microhaplotype
Simple Workflow of MPS on MiSeq system
• Template DNA • Nextera XT Index kit • Library pooling with • Cluster generation and
; 1 ng DNA samples (Illumina) equal amount (10ng/ul) sequencing on a MiSeq
; 2 x 300 bp (Paired-end)
• Amplicon 1/10 dilution • Agilent BioAnalyzer • Beads purification
; X1.1 ~ 1.2X beads ratio to
remove non specific
amplicons
• Library Quantification
; KAPA library quantification
kit for Illumina platforms
Allelic size range of 23 autosomal STRs and 2 sex markers
KplexSeq-Auto25
Average depth of coverage (DoC) for the 25 markers
PowerPlex
Locus MPS analysis GlobalFiler Comments
Fusion
D1S1656 14.4±3.3% 12.2% 14.2% Tetranucleotide, compound
D22S1045 13.2±5.9% 16.3% 16.4% Trinucleotide, simple
D10S1248 12.9±2.6% 11.5% 12.4% Tetranucleotide, simple
D3S1358 12.5±2.7% 11.0% 11.9% Tetranucleotide, compound
D18S51 12.0±3.4% 12.4% 14.6% Tetranucleotide, compound
D2S1338 11.6±2.7% 11.7% 13.9% Tetranucleotide, compound
vWA 11.6±2.7% 10.7% 11.2% Tetranucleotide, compound
D19S433 11.6±2.3% 10.0% 11.0% Tetranucleotide, compound
D12S391 11.5±2.9% 13.7% 15.8% Tetranucleotide, compound
FGA 10.8±2.7% 11.6% 12.1% Tetranucleotide, compound
D6S1043 10.6±2.2% - - Tetranucleotide, compound
D8S1179 9.9±2.2% 9.6% 10.9% Tetranucleotide, compound
D16S539 9.6±3.9% 9.5% 10.2% Tetranucleotide, simple
CSF1PO 9.5±2.9% 8.8% 9.5% Tetranucleotide, simple
D21S11 9.5±2.0% 10.5% 11.6% Tetranucleotide, complex
D5S818 8.5±3.0% 9.2% 9.5% Tetranucleotide, simple
D7S820 8.4±3.3% 8.3% 11.0% Tetranucleotide, simple
D2S441 6.7±2.6% 8.1% 9.2% Tetranucleotide, compound
D13S317 6.5±3.3% 9.2% 9.8% Tetranucleotide, simple
Penta E 6.1±2.4% - 7.6% Pentanucleotide, simple
TPOX 4.5±1.8% 5.6% 5.5% Tetranucleotide, simple
TH01 4.4±1.8% 4.5% 4.6% Tetranucleotide, simple
Penta D 2.1±1.0% - 6.8% Pentanucleotide, simple
Repeat region
Insertion: A
Kplex-23 system * Actually, 9 allele with ‘A’ insertion in 5' flanking region
• MPS run
– For 1ng of 13 Koreans male samples
– In-house panel by Kim et al. (FSIG, 2017)
• Added SE33, D4S2048 and Y-M175
• Reduced 1st round PCR cycles (29 ⇒ 26)
– Precision ID GlobalFiler NGS STR Panel
• Ion PGM Hi-Q Chef Kit
• Ion Chef and PGM
MPS analysis of Autosomal STRs using In-house
and Commercial STR panels
• Results
– All concordant genotypes with CE result
when using modified in-house panel
– Extra read observed on Precision ID panel
Analyzing sequence variants of SE33 for 315 American and Korean samples
Analysis of MPS data
KplexSeq-Y24
Average depth of coverage (DoC) for the 23 Y-STRs
Stutter
Noise
N −2 a N −1 b N+1c
Locus
Average SD Average SD Average SD Average SD
DYS19 0.0073 0.0039 0.1064 0.0198 0.0139 0.0268 0.0006 0.0008
DYS385 0.0116 0.0083 0.1211 0.0424 0.0142 0.0110 0.0008 0.0023
DYS389I 0.0067 0.0051 0.0902 0.0241 0.0108 0.0133 0.0007 0.0012
DYS389II 0.0232 0.0061 0.1982 0.0256 0.0160 0.0068 0.0011 0.0015
DYS390 0.0087 0.0085 0.1086 0.0174 0.0064 0.0028 0.0004 0.0009
DYS391 0.0063 0.0066 0.0915 0.0193 0.0111 0.0155 0.0004 0.0024
DYS392 0.0178 0.0068 0.1484 0.0245 0.0682 0.0167 0.0048 0.0049
DYS393 0.0080 0.0052 0.1074 0.0183 0.0224 0.0081 0.0009 0.0032
DYS437 0.0025 0.0016 0.0598 0.0195 0.0107 0.0085 0.0007 0.0032
DYS438 0.0020 0.0046 0.0291 0.0106 0.0051 0.0057 0.0016 0.0043
DYS439 0.0047 0.0057 0.0778 0.0115 0.0170 0.0080 0.0006 0.0023
DYS448 0.0023 0.0044 0.0222 0.0121 0.0090 0.0102 0.0021 0.0035
DYS456 0.0090 0.0028 0.1196 0.0147 0.0269 0.0059 0.0005 0.0003
DYS458 0.0133 0.0046 0.1363 0.0173 0.0165 0.0048 0.0009 0.0025
DYS481 0.0537 0.0122 0.2761 0.0309 0.0636 0.0114 0.0012 0.0019
DYS533 0.0059 0.0120 0.0912 0.0151 0.0188 0.0113 0.0004 0.0008
DYS549 0.0059 0.0065 0.0928 0.0199 0.0176 0.0060 0.0004 0.0008
DYS570 0.0113 0.0035 0.1270 0.0187 0.0244 0.0097 0.0008 0.0024
DYS576 0.0117 0.0042 0.1283 0.0181 0.0241 0.0125 0.0011 0.0038
DYS635 0.0061 0.0028 0.0946 0.0182 0.0121 0.0052 0.0005 0.0014
DYS643 0.0019 0.0030 0.0299 0.0082 0.0047 0.0044 0.0009 0.0024
YGATAH4 0.0021 0.0011 0.0950 0.0235 0.0143 0.0138 0.0008 0.0038
http://medicine.yale.edu/lab/kidd/publications/publications.aspx
Microhaplotype MPS panel
DNA samples
1ng of 2800M Standard DNA and 122 Korean samples
Results
Observed reads per sample
Read depth 13753 10502 16976 11374 15277 9113 11331 9065 9547 13051 10447 13875 5318
상대적 비율(%)
2.41 1.84 2.97 1.99 2.67 1.59 1.98 1.59 1.67 2.28 1.83 2.43 0.93
Marker D13S169 D18S1122 D21S1263 D22S1159 D5S1970 DLEU1 DRD3 EDAR FAM99A FAT1 FRMD4A GATA4 GFI1B
Read depth 13642 12341 15300 12394 10665 5702 9363 10958 10063 5305 12382 10136 13795
상대적 비율(%)
2.39 2.16 2.68 2.17 1.87 1.00 1.64 1.92 1.76 0.93 2.17 1.77 2.41
Results
Example) D13S169
rs9536429
rs1927847 T/C rs7492234 rs9536430
T/C T/C C/T
Haplotype
2800M T T T T 1 - TTTT
C C C T 2 - CCCT
T T T T 1 - TTTT
Sample
T T C T 2 - TTCT
Reference seq.
Results
Genotypes of 143 SNPs in 2800M standard DNA
Marker SNPs_Included Haplotype Marker SNPs_Included Haplotype Marker SNPs_Included Haplotype
D13S169 rs1927847/rs9536429/rs7492234/rs9536430 1 TTTT FRMD4A rs10796164/rs10796165/rs17154765/rs10796166 1 GCTG TAS2R1 rs41461/rs41462 1 CC
2 CCCT 2 ACTG 2 TC
D5S1970 rs74865590/rs438055/rs370672/rs6555108 1 CAAG GFI1B rs606141/rs8193001/rs56256724/rs633153 1 GTCC DLEU1 rs806301/rs2066700 1 CC
2 CAGA 2 GCCT 2 TC
D21S1263 rs8126597/rs6517970/rs8131148/rs6517971 1 GACT CPNE4 rs1225051/rs1225050/rs1225049/rs1225048 1 AATA TTC12 rs2288159/rs10891537 1 GT
2 GCTC 2 AACA 2 TT
LINC00111 rs2838081/rs2838082/rs78902658/rs2838083 1 AACG SUDS3 rs1503767/rs11068953 1 TA CCR2 rs4513489/rs6441961 1 TC
2 AACG 2 TA 2 TC
FAM99A rs12802112/rs28631755/rs7112918/rs4752777 1 ACTG ADH7 rs4699748/rs2584461/rs1442492 1 ACA GATA4 rs1390950/rs2898295 1 TA
2 GCCG 2 GCA 2 TA
LRRC63 rs9562648/rs9562649/rs2765614 1 GCG C14ORF43 rs12717560/rs12878166 1 GT NPEPPS rs3760370/rs3760371 1 CT
2 GCG 2 GT 2 TC
LRRN2 rs17413714/rs2772234/rs1610401/rs1610400 1 CACC COG2 rs2296796/rs2296797/rs2296798 1 GGG LRRC2 rs6808142/rs17030627 1 CT
2 CACC 2 GGG 2 TG
COL4A1 rs1192204/rs1192205/rs3825483/rs3825481 1 CGTT ARHGAP27 rs1059504/rs8327 1 AA ACN9 rs17168174/rs10246622 1 CG
2 CGCT 2 AA 2 CG
IGSF21 rs11810587/rs1336130/rs1533623/rs1533622 1 TTAA RXRA rs3118582/rs10776839 1 TG TYRP1 rs1408800/rs1408801 1 AA
2 TCAG 2 TG 2 AA
LINC01233 rs4932999/rs4932769/rs2361019/rs2860462 1 TGAA OR52S1P1 rs10500616/rs2499936 1 CG CDH4 rs10854214/rs10854215 1 AA
2 TGAA 2 CG 2 GG
LOC642852 rs6518223/rs2838868/rs7279250/rs8133697 1 TCAG GNGT2 rs2233362/rs634370 1 AA PAPD7 rs870348/rs870347 1 TA
2 TCTG 2 AA 2 AA
COL4A3 rs6714835/rs6756898/rs12617010 1 GTC CEP104 rs4648344/rs6663840 1 CA DRD3 rs3732783/rs6280 1 TT
2 TCC 2 CA 2 TT
SGCG rs8181845/rs679482/rs9510616 1 TCA FAT1 rs1280100/rs1280099 1 AG ATXN1 rs4565296/rs4431439/rs179939 1 GCA
2 TCA 2 GA 2 GTG
PLCG2 rs16956011/rs3934955/rs3934956/rs4073828 1 ACCG PLIN3 rs1055919/rs2271057 1 TA EDAR rs260694/rs11123719/rs11691107 1 TTC
2 GACA 2 CA 2 TTC
ITGB6 rs3101043/rs3111398/rs72623112 1 TTG COMT rs4818/rs4680 1 CA USH2A rs4528199/ rs6604596 1 GA
2 CTG 2 GG 2 AA
KIF16B rs6044080/rs17674942/rs6044081/rs16997830 1 CTGA LYPD6B rs2170607/rs10497052 1 AG
2 CTGA 2 GG
D18S1122 rs621320/rs621340/rs678179/rs621766 1 AGAA PAH rs2133298/rs3817446 1 TT
2 AGAA 2 TC
D22S1159 rs763040/rs5764924/rs763041 1 GAA NCAM1 rs2303377/rs2303378 1 TG
2 GAA 2 CG
Results
Genotypes of 143 SNPs in 2800M standard DNA
Marker SNPs_Included Haplotype Marker SNPs_Included Haplotype Marker SNPs_Included Haplotype
D13S169 rs1927847/rs9536429/rs7492234/rs9536430 1 TTTT FRMD4A rs10796164/rs10796165/rs17154765/rs10796166 1 GCTG TAS2R1 rs41461/rs41462 1 CC
2 CCCT 2 ACTG 2 TC
D5S1970 rs74865590/rs438055/rs370672/rs6555108 1 CAAG GFI1B rs606141/rs8193001/rs56256724/rs633153 1 GTCC DLEU1 rs806301/rs2066700 1 CC
2 CAGA 2 GCCT 2 TC
D21S1263 rs8126597/rs6517970/rs8131148/rs6517971 1 GACT CPNE4 rs1225051/rs1225050/rs1225049/rs1225048 1 AATA TTC12 rs2288159/rs10891537 1 GT
2 GCTC 2 AACA 2 TT
LINC00111 rs2838081/rs2838082/rs78902658/rs2838083 1 AACG SUDS3 rs1503767/rs11068953 1 TA CCR2 rs4513489/rs6441961 1 TC
2 AACG 2 TA 2 TC
FAM99A rs12802112/rs28631755/rs7112918/rs4752777 1 ACTG ADH7 rs4699748/rs2584461/rs1442492 1 ACA GATA4 rs1390950/rs2898295 1 TA
2 GCCG 2 GCA 2 TA
LRRC63 rs9562648/rs9562649/rs2765614 1 GCG C14ORF43 rs12717560/rs12878166 1 GT NPEPPS rs3760370/rs3760371 1 CT
2 GCG 2 GT 2 TC
LRRN2 rs17413714/rs2772234/rs1610401/rs1610400 1 CACC COG2 rs2296796/rs2296797/rs2296798 1 GGG LRRC2 rs6808142/rs17030627 1 CT
2 CACC 2 GGG 2 TG
COL4A1 rs1192204/rs1192205/rs3825483/rs3825481 1 CGTT ARHGAP27 rs1059504/rs8327 1 AA ACN9 rs17168174/rs10246622 1 CG
2 CGCT 2 AA 2 CG
IGSF21 rs11810587/rs1336130/rs1533623/rs1533622 1 TTAA RXRA rs3118582/rs10776839 1 TG TYRP1 rs1408800/rs1408801 1 AA
2 TCAG 2 TG 2 AA
LINC01233 rs4932999/rs4932769/rs2361019/rs2860462 1 TGAA OR52S1P1 rs10500616/rs2499936 1 CG CDH4 rs10854214/rs10854215 1 AA
2 TGAA 2 CG 2 GG
LOC642852 rs6518223/rs2838868/rs7279250/rs8133697 1 TCAG GNGT2 rs2233362/rs634370 1 AA PAPD7 rs870348/rs870347 1 TA
2 TCTG 2 AA 2 AA
COL4A3 rs6714835/rs6756898/rs12617010 1 GTC CEP104 rs4648344/rs6663840 1 CA DRD3 rs3732783/rs6280 1 TT
2 TCC 2 CA 2 TT
SGCG rs8181845/rs679482/rs9510616 1 TCA FAT1 rs1280100/rs1280099 1 AG ATXN1 rs4565296/rs4431439/rs179939 1 GCA
2 TCA 2 GA 2 GTG
PLCG2 rs16956011/rs3934955/rs3934956/rs4073828 1 ACCG PLIN3 rs1055919/rs2271057 1 TA EDAR rs260694/rs11123719/rs11691107 1 TTC
2 GACA 2 CA 2 TTC
ITGB6 rs3101043/rs3111398/rs72623112 1 TTG COMT rs4818/rs4680 1 CA USH2A rs4528199/ rs6604596 1 GA
2 CTG 2 GG 2 AA
KIF16B rs6044080/rs17674942/rs6044081/rs16997830 1 CTGA LYPD6B rs2170607/rs10497052 1 AG
2 CTGA 2 GG
D18S1122 rs621320/rs621340/rs678179/rs621766 1 AGAA PAH rs2133298/rs3817446 1 TT
2 AGAA 2 TC
D22S1159 rs763040/rs5764924/rs763041 1 GAA NCAM1 rs2303377/rs2303378 1 TG
2 GAA 2 CG
FRMD4A 0.639 0.611 0.169 0.831 0.590 0.341 1.39
Results
D22S1159
GFI1B
0.631
0.656
0.654
0.655
0.176
0.180
0.824
0.820
0.591 0.330
Preparing
0.595 manuscript…
0.363 1.45
1.36
= 3.82 × 1037
SUDS3
•
0.623 0.741 0.112 0.888 0.690 0.319 1.33
PLIN3
• 0.459 0.514 0.336 0.664 0.393 0.154 0.92
LINC01233 0.762 0.741 0.114 0.886 0.695 0.531 2.10
NCAM1 0.508 0.495 0.336 0.664 0.422 0.195 1.02
SGCG 0.803 0.753 0.116 0.884 0.709 0.605 2.54
CDH4 0.525 0.479 0.341 0.659 0.417 0.210 1.05
LOC642852 0.689 0.717 0.125 0.875 0.669 0.411 1.61
CEP104 0.549 0.525 0.345 0.655 0.416 0.234 1.11
D18S1122 0.664 0.710 0.126 0.874 0.666 0.375 1.49
LRRC2 0.426 0.475 0.370 0.630 0.365 0.131 0.87
CPNE4 0.705 0.704 0.146 0.854 0.644 0.436 1.69
PAH 0.402 0.369 0.427 0.573 0.334 0.115 0.84
COG2 0.680 0.681 0.162 0.838 0.619 0.398 1.56
DRD3 0.385 0.380 0.440 0.560 0.326 0.105 0.81
FRMD4A 0.639 0.611 0.169 0.831 0.590 0.341 1.39
ADH7 0.344 0.343 0.456 0.544 0.316 0.083 0.76
D22S1159 0.631 0.654 0.176 0.824 0.591 0.330 1.36
TYRP1 0.344 0.332 0.492 0.508 0.283 0.083 0.76
GFI1B 0.656 0.655 0.180 0.820 0.595 0.363 1.45
EDAR 0.246 0.245 0.592 0.408 0.233 0.044 0.66
FAT1 0.623 0.657 0.188 0.812 0.580 0.319 1.33
ATXN1 0.074 0.103 0.832 0.168 0.100 0.005 0.54
COMT 0.664 0.667 0.189 0.811 0.590 0.375 1.49
Future work
• Validation study
– Sensitivity and specificity
source software