(SQSTM1) sequestosome 1 [Homo sapiens]

Gene
Accession | 3299
Official symbol | SQSTM1
Official name | sequestosome 1
Gene type | gene with protein product
Organism | Homo sapiens
Location | Chromosome 5 (NC_000005.9) : 179233387...179265077 (+)
Map | 5q35
Length | 31691 nt

Transcript(s)
NM_003900.4 | SQSTM1, mRNA isoform 1
NM_001142298.1 | SQSTM1, mRNA isoform 2
NM_001142299.1 | SQSTM1, mRNA isoform 3

Protein(s)
Accession | Name | Organism | Length
Q13501 | Sequestosome-1 | Homo sapiens | 440 aa
Synonyms | ZIP3; PDB3; p62B; p62; p60; OSIL; A170
Alternative name(s) | phosphotyrosine independent ligand for the Lck SH2 domain p62; EBI3-associated protein p60; ubiquitin-binding protein p62; Paget disease of bone 3; oxidative stress induced like
Summary |
This gene encodes a multifunctional protein that binds ubiquitin and regulates activation of the nuclear factor kappa-B (NF-kB) signaling pathway. The protein functions as a scaffolding/adaptor protein in concert with TNF receptor-associated factor 6 to mediate activation of NF-kB in response to upstream signals. Alternatively spliced transcript variants encoding either the same or different isoforms have been identified for this gene. Mutations in this gene result in sporadic and familial Paget disease of bone. [provided by RefSeq].
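As a quick consistency check on the gene record, the Length value can be recomputed from the Location coordinates. The sketch below assumes the coordinates are 1-based and inclusive at both ends, which is not stated on this page; `span_length` is a hypothetical helper name.

```python
# Coordinates copied from the Location field of the gene record.
# Assumption: both ends are 1-based and inclusive.
START = 179_233_387
END = 179_265_077


def span_length(start: int, end: int) -> int:
    """Return the number of nucleotides in the closed interval [start, end]."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


gene_length = span_length(START, END)
print(gene_length)  # 31691, matching the Length field (31691 nt)
```

Under that assumption the two fields agree, so the listed length covers the whole genomic span on the plus strand rather than the spliced transcript.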
|
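The genomic sequence listed next is laid out in rows: a 1-based position label, then up to eight 10-nt blocks, with every cell ending in `|`. The following is a minimal sketch, assuming that layout, for joining the blocks back into one contiguous string while checking each position label. `parse_listing` and `SAMPLE` are illustrative names, and the sample is the first 100 nt copied from the listing.

```python
import re

# First row and the start of the second row, copied from the listing.
SAMPLE = """\
1 |
TGCGTCGGCT |
TCCGGCCGCC |
TTCCGCGGCC |
ACCGCCGGGC |
CCGCTCCCGC |
CGCCGACGCC |
CAGGTGCGCC |
AGGTGCGGGC |
|
81 |
CGGGCGGGGG |
TCGCGCTCAC |
"""


def parse_listing(text: str) -> str:
    """Join the nucleotide blocks of a position-labelled listing into one string."""
    blocks = []
    length = 0
    for raw in text.splitlines():
        cell = raw.strip().rstrip("|").strip()
        if not cell:
            continue  # separator line that holds only a pipe
        if cell.isdigit():
            # A position label must equal the 1-based index of the next base.
            if int(cell) != length + 1:
                raise ValueError(f"label {cell} does not match offset {length + 1}")
        elif re.fullmatch(r"[ACGTN]+", cell):
            blocks.append(cell)
            length += len(cell)
        else:
            raise ValueError(f"unexpected cell: {cell!r}")
    return "".join(blocks)


sequence = parse_listing(SAMPLE)
print(len(sequence))  # 100
```

Checking the labels while parsing makes a dropped or duplicated block show up as an error instead of a silently shifted sequence.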
1 |
TGCGTCGGCT |
TCCGGCCGCC |
TTCCGCGGCC |
ACCGCCGGGC |
CCGCTCCCGC |
CGCCGACGCC |
CAGGTGCGCC |
AGGTGCGGGC |
|
81 |
CGGGCGGGGG |
TCGCGCTCAC |
CTTTCTGGCC |
GCTGAGTGCC |
GCGTACCAGG |
ACAGCGAGAG |
GAAGGCGCAC |
AGGCAGAAGA |
|
161 |
GCAGCAGCGT |
CAGGAAGGTG |
CCATTGCGGA |
GCCTCATCTC |
CTCGGGTGCG |
CGGCGGGCGC |
CCGCGGGGCC |
GAGGCTGCAT |
|
241 |
GGCCCGGGGG |
ACCGGGGCCG |
GGGCGCAGGG |
GTCGGAAGGC |
GGCGGCGGCG |
GCGGCAGGGG |
CCCCGGCCCC |
GGGTCGGGGA |
|
321 |
GGGGCGGGGG |
GCCCGGGGCC |
GGGCGGGGAC |
CGGGCCAGGG |
AGCGCGCCGG |
CCGCCCCTCA |
GGGCGCAAGC |
TTTGTGCCCT |
|
401 |
GTACTCAGGG |
AAGAGGAACA |
GGCTCAGAAG |
GGCAGAGGCA |
GGTATCAGGC |
TCACTGCAGA |
TATCAGGGGC |
GCGGGACACT |
|
481 |
GGCGGCCTCG |
CCTCCGCGGC |
AGGGCCGGGC |
CGGGCCGGGC |
TGGGCTGGGC |
TGGGCGGCGA |
GAGCCGCGGC |
CCGGCCTGGA |
|
561 |
TCTGGGGCCT |
GGATCTGGGG |
CCGCCGCGGA |
GTCGACGGCG |
CAGGGCGGGG |
CGGCCCGGAT |
TTAAAGGGGC |
CGCAGCACCG |
|
641 |
CCGTCGCCGG |
CGCCGCGAGG |
GGGTGGGGTG |
GGGGCCGGCG |
GCCGGGATCC |
CGATCGGCTC |
CCGCAGCCCC |
GCGTGGGCTC |
|
721 |
GTGCGAGTCG |
GCCTCAGGTA |
AGGCTGGAGT |
GGGAGTGCAG |
GTCGACCGCA |
GCCGGGGCGG |
GGGGCGGGCG |
GCGGGGGCGG |
|
801 |
CGCTCCGGGA |
CCCCGGTACC |
CTTCTCAGCA |
ACATCTCCTC |
CGCGCGCGGA |
CCCCGACCCC |
ATTCGCACGT |
TCTCGGGGCT |
|
881 |
CTTTCCCGGG |
CGTGAGGGGC |
TCTGGGTGGC |
GCGGGGGACT |
GGGCGGTTGA |
AGCCGGGAGC |
GACCGCTGCC |
TTCGCTGCCG |
|
961 |
CCCAGGGCGC |
TTCCCCGCTC |
AGGAGCTTCC |
TCTGGGCCTC |
TGGACGGAGG |
CGCGCAGGGG |
CCCGGGAGGC |
GGGAACGATG |
|
1041 |
GGCCTTTCTA |
GGCGGTATCA |
GGGCCGATGC |
TCGTGGATGC |
AGAGCAGTGA |
CCAGCCCAGA |
ACCGTACCGG |
CTTCCCGGGG |
|
1121 |
CAGGGGCGGC |
CCGAGAGCGC |
CGCTAACGGG |
AGCAACGGGC |
CCTGCCCCCG |
GTGGTGTAGG |
GGCCACCTCG |
CCCCGCCCAG |
|
1201 |
CCCAGCCCGA |
TATTGATGGG |
GGCCCACGCC |
TTCATTGTTT |
TTTTGTTGTT |
GTTGTTTTTT |
GTTGTTTTTT |
GTTTGTTTGT |
|
1281 |
TTTTGAGACG |
GAGTCTCGCT |
CTGTTGCCCA |
GGCTGCAGTG |
CAGTGGCGCA |
ATCTCGGCTC |
ACTGCACCCT |
CCGCCTCCCG |
|
1361 |
GGTTCAAGCG |
ATTCCCCTGC |
CCCAGCCTCC |
CGAGTAGCTG |
GGACTACAGG |
CGGCCGCCAC |
CACACCTGGC |
TAATTTTTGT |
|
1441 |
ATTTTTAGTA |
GACACGAGGT |
TTCACCGTAT |
TGGCCAGGCT |
GGTCTCGAAC |
TCCTGACCTT |
GTGATCCGCC |
CGCCTCGGCC |
|
1521 |
TCACAAAGTG |
CTGGGATTAC |
AGGCGTGAGC |
CACCACGCAC |
GGCCCATTTG |
CGTTCTAATT |
TCAGGCGTTC |
CTCAATCCTC |
|
1601 |
TGTGGAACCA |
GGAAGGGCGC |
AAATTATAAA |
GGGACTGGCC |
CAGCGCCGCC |
TCTGGGCAGG |
CTTTGGGCAG |
CGCCTGGCCC |
|
1681 |
GCTGCCGGCT |
TGGACCTCCC |
AGACCTAGGG |
GCCCGGTTCC |
TGGTGGAGGC |
TGCAGGGACC |
TCTGCCCCAC |
CCGCCCGGGG |
|
1761 |
GAGGCCCGAG |
GGGCTGGACT |
CAGACTGAGA |
TTGAATGCGG |
CTTTGTCTTC |
CTAGTTCAGC |
CCCGGCCCAC |
ACCTGGGGCT |
|
1841 |
GAGTGGAATC |
GGGAGCTTCG |
AGGGGTCTGG |
ACAGAGAGAT |
TGATGCCAAG |
AAGGGGGTGG |
CCGAGCCAGA |
GGTTGAAGTG |
|
1921 |
GGCTGGATCC |
TGAGGCCCCC |
TGTTAAAGGA |
GAGGGCTCCC |
CACTCAGTGC |
TCCTGGAACT |
TTCCGAACTA |
GAGACTGGGA |
|
2001 |
CTTATAGGAG |
CCTTCTAGAG |
GAAACTGTCG |
TGTTTTCACA |
GGTGTCTTCT |
ATTTGTGGCA |
AAGGATAATG |
GCTTTTCACT |
|
2081 |
TAGGTTGTGA |
CATAAAGGGC |
CTTAGAAATT |
GTTAATGAGT |
TACTTAATGT |
TAAAACTTAG |
TCACCCAGAG |
GCCGGGCGCG |
|
2161 |
GTGGCTCATG |
CCTGTAATCC |
CAGCACTTTG |
TGAGGCCGAG |
GCAGGCGAAT |
CACGAGGTCA |
GGAGATCGAG |
ACCATCCTGG |
|
2241 |
CTAACACGGT |
GAAACCCCGT |
CTCTACTAAA |
AAATACAAAA |
AATTAGCCGG |
GTGTGGTGGC |
TCATGCCTGT |
AATCCCAGCA |
|
2321 |
CTCTGAGAGG |
CTAAGGCAGG |
AGGATCATTT |
GAGCTCATCA |
GTTCAAGACC |
AGCCTGGGCA |
ACATAGTGAC |
ACCTCATCTC |
|
2401 |
ATTAAAAATT |
TTAAAACAAA |
TTTTTTTGGA |
TTTTTTTGTA |
ATTTTATTTA |
TTTATTTATT |
TATTTTTAGT |
TTATCTTATT |
|
2481 |
TTTTTTTTTG |
AGACGGAGTC |
TTGCTCTGTC |
GCCCAGGCTG |
GAGTGCAGTG |
GCACTATCTG |
GGCTCACTGC |
AAGCTCCGCC |
|
2561 |
TCCCACCTTC |
ACCCCATTCT |
CCTGCCTCAG |
CCTCCCGAGT |
AGCTGGGACT |
ACAGGCACCC |
ACCACCACGC |
TCGGCTGATT |
|
2641 |
TTTTGTATTT |
TTATTAGAGA |
CGGGGTTTCA |
CCGTGTTAGC |
CAGGATGGTC |
TCGATCTCCT |
GACCTCGTGA |
TCTACCTGCC |
|
2721 |
TCGGCCTCCC |
AAAATGCTGG |
GATTACAGGC |
GTGAGTCACC |
ACCCCGGCCT |
TTTTTTTTTT |
TTTTTTTTTT |
AAGACGGAGT |
|
2801 |
CTCGGTCTGT |
CGCCCAGGTT |
GGAGTGCAGT |
GGCACCATCT |
CGGCTCACTG |
CAACCTCAGC |
CTCCCAGGTT |
CAAGCGATTC |
|
2881 |
TCCTGCCTCA |
GCCTCCCGAG |
TAGCTGGGAT |
TATAGGCGCC |
CGCCACCACG |
CCTGGCTAAT |
TTTTTTTTTT |
TTTTTTTTTT |
|
2961 |
TTTTTTTTGA |
GACAGAGTCT |
CGCTATGTCG |
CCCAGGCTGG |
AGTGCGATGG |
CAGAATCTCG |
GCTTACTGCA |
ACTTCCACCT |
|
3041 |
CCTGGATACA |
AGCAATTCTG |
CTGCCTCATC |
CTCCTGAGTA |
GCTGGGATTA |
CAGGTGCACG |
GCACCAAGCC |
CGGCTAATTT |
|
3121 |
TTTTGTATTT |
TTAGTAGAGA |
TAGGGTGTCA |
CCATGTTGGT |
CAGGCTGGTC |
TCAAACTCCT |
GACCTCGTGA |
TCCACCTGCT |
|
3201 |
TCAGCCTCCC |
AAAGTGCTGG |
GATTACAGGC |
ATGAGCCACT |
GCGCCTGGCC |
CTTTTTTTTT |
TTTTTTTTTT |
TTTTTGAAAC |
|
3281 |
GGAGTCTTGC |
TCCCTGGCCC |
AGGCTGGAGT |
GCAGTTGAGT |
GATCTGGGCT |
CACTGCAACC |
TCCGCTTCCC |
GGGTTCAAGC |
|
3361 |
GATTCTCCTG |
CCTCAGCCAC |
CTGAGTAGCT |
GAGATTACAG |
GCGTGTGCTA |
CCACACCCGG |
CTAATTTTTA |
TATTTTTAGT |
|
3441 |
AGAGATGGGG |
TTTCACCATG |
TTGGTCAGGC |
TGGTTTCGAA |
CTCCTGACCT |
CAGGTGATCC |
ACCTGCCTCA |
GCCTCCCAAA |
|
3521 |
GTGCTGGGAT |
TACAGGTGTG |
AGCCACCGGC |
GGCGCCCAGC |
CTAATTTTTG |
TATTTTTAGT |
AGAGATGGGG |
TTTCACCATG |
|
3601 |
TTGTCCAGGC |
TGGTCTCCAA |
CTCCTGACCT |
CAGGTGATCT |
GCTCACTTTG |
GCCCCTCAGA |
GTGCTGGGAT |
TACAGGCGTG |
|
3681 |
AGCCAACGCA |
CCGGGCCAAC |
AATTTTTTTT |
TAATTTAAAT |
TTAAATTTTT |
ATTTTTTAAT |
TTTTTATTTT |
ACTTTAAGTT |
|
3761 |
CTAGGGTACA |
TGTGCACAAT |
TTGCAGGTTT |
GTTACATATG |
TATACATGTG |
CCCGGTTGGT |
ATGCTGCACC |
CATTAACTCA |
|
3841 |
TCATTTACAT |
TAGATATATC |
TCCTAATGCT |
ATCCCTCCCC |
TCTTCCCCCA |
CCCCACGACA |
GGCCCCAGTG |
TGTGACGTTC |
|
3921 |
CCCACCCTGT |
GTCCAAGTGT |
TCTCATTGTT |
CAATTCCCAC |
CTATGAGTGA |
GAACATGCGG |
TGTTTGGTTT |
TCTGTCCTTG |
|
4001 |
TGATAGTTTG |
CTCAGAATGG |
TTTCCAGCTT |
CATCCGTGTC |
CCTACAAAGG |
ACATGAACTC |
ATTCTTTTTT |
ATGGCTGCAT |
|
4081 |
AGTATTCCAT |
GGTGTATATG |
TGCCACATTT |
TCTTAATCCA |
GTCTATCATT |
GATGGACATT |
TGGGTTGGTT |
CCAAGTCTTT |
|
4161 |
GCTATTGTGA |
ATAGTGCCGC |
AATAAACATA |
CGTGTGCATG |
TGTCTTTATA |
GCAGCATGAT |
TTATAATCCT |
TTGGGTATAT |
|
4241 |
ACCCAGTAAT |
GGGATTGCTG |
GGTCAAATGG |
CATTTCTAGT |
TCTAGATCCC |
TGAGGAATCG |
CTACACTGAC |
TTCCACAATG |
|
4321 |
GTTGAACTAG |
TTTACAGTCC |
CACCAACAGT |
GTAAAAGCAT |
TCCTATTTCT |
CCACATCCTC |
TCCAGCACCT |
GTTGTTTCCT |
|
4401 |
GGGTTTTTAA |
TGATTGGCAT |
TCTAACTGGT |
GTGAGATGCT |
ATCTCAATGT |
GGTTTTGATT |
TGCATTTCTC |
TGATGGCCAG |
|
4481 |
TGATGCTGAG |
CATTTTTTCA |
TGTGTCTGCG |
GGCCAACAAT |
TTTTTTAAGG |
AAAAAAAAAA |
AGTGACTCAG |
TTGGGCACTG |
|
4561 |
TGGCCTACGC |
CTGTAATCCA |
AGTGAGACAG |
CCAAGTCTAA |
AGGGGTCCCG |
CTTTGGGAGG |
CCGAGACGGG |
CGGATCACGA |
|
4641 |
GGTCAGGAGA |
CCGAGACCAT |
CTTAGCTAAC |
ACGGTGAAAC |
CCCGTCTCTA |
CTAAAAATTA |
GCCGGGAGTG |
GTGGCGGGCA |
|
4721 |
CCTGTAGTCC |
CAGCTACTCG |
GGAGGCTGAG |
GCAGGAGAAT |
AGCTTCAACC |
TGTGAGGCGG |
AGCTTTCAGT |
GAGCCGAGAT |
|
4801 |
CGCACCACTG |
CACTCCAGCC |
TGGGCGACAG |
AGCGAGACTC |
CGTCTCAAAA |
AAAAAAAGAG |
GTCCAGGAGA |
AACTCCCACA |
|
4881 |
CCTGCCTAAG |
CACTGGAAGA |
ACTGGGTAGA |
GCCACAGAAG |
CTCTGCAGGG |
GGGAGGAGCT |
TTGCAGGGGG |
AGGAGCTATG |
|
4961 |
CAGGGGGAGG |
AGCTATGCAG |
GGGGAGGAGC |
ATGCAGGGGG |
AGGAGCTATG |
CAGGGGGAGG |
AGCTATGCAG |
GGGGGAGGAG |
|
5041 |
CTTGGTCTCA |
TGTTCGGGGT |
GGAACTTGGG |
ATTCTATCTG |
GGAGGCGAGA |
AACCAGCTAG |
CGGGACTCTC |
TCTCGCTTTG |
|
5121 |
CTGAGAGTCC |
CTGTTTCCCT |
TTTTTTCCTT |
CTTGCCCAAT |
AAATTCCATT |
TTTCTCACTC |
TTCAAAGTGT |
CTGCGAGATT |
|
5201 |
AATCTCTCAT |
GGCCGCTGCA |
CAAGAACCTG |
GCTTTTAGCT |
GAACTAAGGA |
GAAAGTCCTA |
CAACAGTTTG |
GCGTGCAACA |
|
5281 |
TGGGGCTTGA |
GAAAGGGTGA |
GTGAGATGCA |
AACCAAGAAA |
TTTTTTTCCT |
CTCTTTCTAA |
GCCTATTTAT |
CTTCGGACTT |
|
5361 |
CTGAGGGGGA |
GGGGAGGGGA |
AACCGTGACC |
CCACCCCCTT |
GGTCTCCGTG |
GCCTTTTCCT |
TACTTCTGGA |
CGGATGGGCG |
|
5441 |
AACGGCGGTT |
CTCTGTCGCC |
CAGGCTGGAG |
TGCAGTGGCG |
CCATCTCGGC |
TCACTGCAAG |
CTCCACCTCC |
CGGGTTCACA |
|
5521 |
CCATTCTCCT |
GCCTCAGCCT |
CCTGAGTAGC |
TGGGACTACA |
GGCGCCCGCC |
ACCACGCCCG |
GCTGATTTTT |
TGTATTTTTA |
|
5601 |
GTAGAGACGG |
GGTTTCACCG |
TGTTAGCCAG |
GATGGTCTCG |
ATCTCCTGAC |
TTCATGATCC |
GCCCGCCTCG |
GCCTCCCAAA |
|
5681 |
GTGCTGGGAT |
TACAGGCGTG |
AGCCACCGCG |
CCCACCCCTC |
CATGAAGAAA |
GTTTTTCTAA |
ATGAAAAATG |
TTTAGGACGC |
|
5761 |
TCAGGAGAGA |
AAGAACAGAT |
TAAGGAATGA |
TCTCCACCGC |
ACAGACCTCA |
AGGCTGTTAT |
GCATGCAGGG |
CACAGTTCCA |
|
5841 |
GTGCAAATGT |
CCGCAGGCAC |
GGCATGAGGG |
CCCCCACTGG |
GTGCCTCGGG |
CTCCTTCCAG |
GGCAGCAGTT |
GAAGCGGGTC |
|
5921 |
GTACTGCAGG |
CCTGCAGAAA |
GGCTGGGGTT |
CCTCTCCTTT |
TCGGGTTCCT |
TCCTGATGGA |
ATCCTAGGTT |
TTCAGCGGTT |
|
6001 |
CCTTCCTGAT |
GGAATCCTAG |
GTTTTCAGCA |
GCTTCCGGAG |
GCCGCCTGGA |
CTGCAGAAGG |
CGCCCTGGGC |
GTGTGTACTT |
|
6081 |
CCACCTCCGC |
AGGCGCTGGC |
ATCTCCAGGC |
TGCTTCAGGC |
CGCTTTGGAC |
CTCACCATCG |
GTAGCATCTG |
CGTCTTCTTG |
|
6161 |
AGGTGACTGG |
AGCCCGTCTA |
TGGTTTCAGC |
CATTGACTTC |
TGGACATGGA |
AGTGTCGAGG |
CCATTTTTGG |
CCCCTCTCAT |
|
6241 |
ATCCGGGAAG |
GGCTCCGTGC |
CCAGTAGAGG |
TGCAGTGCAG |
CAGCCAGCTC |
GCAGCAGTGC |
TCCCGTAGAG |
CTGATGGCAT |
|
6321 |
TGGCAGGAGT |
GGCGGCTCAG |
CACAGGGAAG |
TTCTCCAGTG |
CCCAGGCCCA |
GGCTTGCTCT |
CGGTTTAGGT |
CCCATGGCAC |
|
6401 |
GTGGGGCTCT |
GGACCTGCCT |
GCCTCCAAGA |
CAGCTCTTCC |
AAGGTTGAGG |
GTTGGAACTC |
CACAGCCTCA |
GCCTCCCAAA |
|
6481 |
GCTTGACCAG |
GCCCAGCCAT |
AGCTAGTCTA |
GGAAAAGAGC |
AAATGGCCGT |
GTTTTTATAT |
TTTGACTTTT |
GCATTTCAAT |
|
6561 |
ATTAATTTTT |
TTGGTAATGT |
AATGCATTCA |
AAACATGGTC |
CCCCCCCCCC |
CAAAAAAAAA |
AGAAAAGAAA |
AAGAAAAAAG |
|
6641 |
CATGTTTCTA |
ACCAGGGGTC |
CTTCATCTTC |
TGCAGATGCC |
AAGGGTGTCC |
CTGGCACAGT |
AAAAGTTTGG |
AAGCCCTGCC |
|
6721 |
CTAAAGCCTC |
ACCTCACCTG |
GCAGCTGGGT |
GTTTCATTCA |
TAGCAAATGT |
TCTGTATATT |
CACCATGTAC |
CAAACACACA |
|
6801 |
TGTCACTAAT |
GATACAAACA |
CATGTGCACT |
GACATTGACC |
TCCTCACTGT |
CTTACTATAG |
TCCCATCCAC |
AGCCACCTTA |
|
6881 |
TCAGAGTCCA |
TGGGACTCTG |
ACAACTTGTA |
ATGGGAAAAT |
TAACCGTGCC |
TGTAGGGGCC |
GCCTGGGTAG |
CGGGAGGTGG |
|
6961 |
TTCATGAGTA |
CCAGGAAAGC |
TGGCCCTGGG |
GCAGACCTGG |
GCAGGGCAGA |
GCACAGCTTG |
CAGGACCTAG |
TACACAGTGT |
|
7041 |
TGGGAATTGA |
ATTTGGGTTT |
TGACCTTTGA |
AGCTGTGCAT |
AAAGTCTAAA |
ACAAGAAGAG |
GAGTGGAGGC |
TGAGGTTGTA |
|
7121 |
TTTGTATTTT |
AAAATGAGCA |
TTTGCCACAC |
AAAGCCAGGA |
GCTTCAGACC |
AGCGTGGTTA |
ACACAGCGAG |
ACCCTCCTCT |
|
7201 |
CTGTGAGAAA |
TTAAAAAAAA |
AACAATAATA |
AAAGCCAGGC |
ATGTTGGCAT |
GTACCTGTAG |
TCCTAGCTGC |
TCAGGGGGTG |
|
7281 |
AGGCGGGAGG |
ATCACTTGAC |
TCCTGGGTTA |
CAGTGAGCTC |
TAATTGCCAC |
TGCACTTCAG |
CCTGGGCCAC |
AGAGTGAGAC |
|
7361 |
CTTTTCTCAA |
AATAAAATAA |
AATATAACCT |
TTTGGGCAGG |
CATGGAAAGG |
GGCTGGAGAG |
AATAAGAGGA |
AACAGAGATA |
|
7441 |
AGCCCCCTCC |
CTGGGAGCAC |
AAGGGACTCA |
TCCCCAGACA |
TGGCACCACA |
GGCAAACAGA |
CACTGTCACA |
AAACTAGCAT |
|
7521 |
ATTATCTTTC |
CTAGAAGTCC |
TTTGTTTTCC |
CCAAGTGCCC |
TTTCCCCCAA |
CCTTTTGTTG |
GTTTACGAGC |
TCTCAATTCT |
|
7601 |
AACCTCCTAG |
TACACGAATA |
AAAATGTCTA |
TCGGGGTGGT |
GGCGTGAGCC |
TGTAGTCCCA |
GGTACTTGAG |
AGGCAGAGGT |
|
7681 |
GGGAGGATCC |
TCTGACCATA |
AGAGTTCCTG |
ACCAGCCTGT |
GCAATACTGA |
CTCCATCTCA |
AACAAAAAAC |
AGAAAGCTAT |
|
7761 |
ACAACTCCGT |
TTCTCTTGCT |
AATTTGTCTT |
TTGTCAGTTT |
AATTTGCAGG |
CCCCAGATGC |
TGAATCTAAG |
AGGGCAGAGG |
|
7841 |
AAAAGCTTTT |
CCTCCCTGAC |
ATAACCATCC |
AGGAGAAAGT |
CTGGAATGGA |
GGGTCAGAGG |
GGTAGGGAAT |
GGAGTTAGGA |
|
7921 |
GAAGTAGATA |
GATTCAGGAC |
AGTCTTTGGA |
GAATGTACAT |
CACTGAGGTC |
AAGATGTCCT |
AGGGAATGGC |
CCTTCTGGGT |
|
8001 |
TCACAATTCT |
GCTTCCGTCG |
GTTAACAGCA |
ACTTAGTCCC |
TTCTATGCCT |
GTGGGGATAA |
TCAGTCAGGA |
TAGGCTGGAT |
|
8081 |
TATGCTGCTC |
TAACAACCCC |
AAATTTCTCT |
GATCTAATAA |
AAGTATACTC |
AAAGCTGTAT |
ACCCTTCAAG |
GGTCCCCTGG |
|
8161 |
GGTCTCTGCT |
CATTGTAGTT |
CCTGCCCTGG |
CTACAGGCAT |
AACCATGTGG |
AAATCAAAGG |
CTTTCTTGGC |
AAAGAGAAAG |
|
8241 |
AGAGCTCTAG |
AAGCTTTAGC |
ACTGGCAAAT |
AAATAAATGC |
TTGGCGTATA |
GGTGATATAT |
GGCTCTTCTG |
GCTCAGGTCC |
|
8321 |
TTGGCCAGAA |
TTAATCACAT |
GGTCTCACCT |
CAAGGGAACA |
GGTGAGTTAA |
ATGCAGAAAA |
ATATTCAAGC |
TGGGTGAGGT |
|
8401 |
GGCTCACGCC |
TCTAATCCCA |
GCACTTTAGG |
AGGCCGAGGC |
GGGAGGATCG |
CTTGAGCCCA |
AGAAGATCAT |
GGTTGCCGTG |
|
8481 |
ATCCATGATC |
ATACCACGGC |
ACTCCAGACT |
GGGGACCAAA |
GTGAGACCTT |
GTCTCTAAAA |
ATTAAAATAT |
TTAGAATGTT |
|
8561 |
TCCTAGCATA |
AAAGATGTGC |
TACGTGTGTA |
TTATTACTGT |
TGCTATTGCC |
AGAAAGTACA |
ATCAACAGGA |
ACTGCTGGTG |
|
8641 |
GCTTGAGGGG |
GAGCATATGG |
TCATAGCTGA |
CTCCCAGGTT |
TCTGGCTTGT |
ATTGTGACTG |
TTTCTGGCCT |
GTATTGTGAC |
|
8721 |
CATCACTGCC |
ATCAGGAACC |
CTAGAGCAGG |
TCGGGTGTGG |
TGGCTCACGC |
CTGTAATCCC |
AGTACTTTGG |
GAGGCTGAGG |
|
8801 |
TGGGTGGATC |
ACTTGAGGCC |
AAGAGTTCAA |
GACCAGCCTG |
GCCAACATGG |
TGAAACCCCG |
TCTCTACTAA |
AAATACAAAA |
|
8881 |
ATTAGCCAGA |
TATGGTGGTG |
GGCGCCTGTA |
ATCCCAGCTA |
CTTGGGAGGC |
TGAGGCACAA |
GAATTGCTTG |
AATCCGAGAG |
|
8961 |
GTGGAGGTTG |
CAGTGAGCAG |
AGATCAAGCC |
ACTGTACCCC |
AGCCTGGGCA |
ACAGAGCAAG |
ACTCTGTCTC |
GAAAAAAAAA |
|
9041 |
AAGAAAAGAA |
AACCTGGAGC |
AGGCATGGGG |
TATACGTCAT |
GGGTCACTAC |
TCTTGGATAC |
AATTGACGAA |
AACCACAATC |
|
9121 |
AACTGGCTTA |
AGCCGTTTTA |
TAACCCAGCA |
TTAGGCCAGC |
CTGGATCCAG |
GGGTCAGATA |
CTGAGATCAA |
GCTTTGATCT |
|
9201 |
GCTCAGGATT |
CCCCCCTCCC |
CAAGGCTCCT |
GCTTCCAGAT |
GACCCCAGTG |
TGCATTTCGT |
CCGCCAGCGT |
GGGTCACATG |
|
9281 |
TCCATCCCTG |
ACCAGGGTGC |
TACAGTGCTC |
TGGTTGGCCA |
AGCTTTGGCC |
ATGGGGTTTA |
CCTCACCCTA |
CTTTTACGCA |
|
9361 |
AGAGTATGTG |
GTGGCCTAAC |
ACAGGTGGGG |
GACCCTGCTT |
CAATACCAGC |
CTTTATCCAC |
AGATGACCCC |
ACGCCTTCTG |
|
9441 |
CTCCAGAGCA |
TCAGGGTTAT |
AACACATGGG |
ACACACGGGG |
CACACATACA |
CTCCCACGCA |
ACATCCACAC |
ACCCCACAAA |
|
9521 |
ACTACCTGAG |
CCCACATAGT |
CAGGGCAGGG |
ACAGGCACAG |
AACAAAGTGG |
GTGCACAGGG |
ACTCGCAGGG |
AGGTCGAAGA |
|
9601 |
GCTGAGTCTC |
GTCGGGAGGC |
TGAGGGGGAA |
GAGCGGGCAC |
TGGGCTGCGA |
GTGTGCAGAG |
CTGGAGGTGG |
GGGACTGCGG |
|
9681 |
TAAGTACCCG |
TCCTGGGGTG |
CGGGCCCTGC |
TGGGAAGCAT |
TGGGAGCAAG |
AAGGGTTAGA |
TCCTCAAAGG |
AGGATAATTC |
|
9761 |
TTCCAGAAGG |
TTTTCTTTGT |
TTTGTTTTGT |
TTTGTTGAGG |
CACCAAGATT |
GGAGTGCAGT |
GGCGCCATGT |
CAGCTCACTG |
|
9841 |
CAACCTCCGC |
CTCCGAGGCT |
CAAGCAATTC |
TCTTGCCTCA |
GCCTCCCGAC |
TAGCTGGGAC |
TACAGGCACC |
TGCCGCCACG |
|
9921 |
CCCAACTAAT |
TTTTTTAATT |
TTTAGTACAG |
ATGGGGTTTC |
ACTCTGTTGG |
CCAGGCTGGT |
CTCGAACTCC |
TGACCTCGTG |
|
10001 |
ATCCGCCCAC |
CTCGGCCTCC |
CAGTGCTGGG |
ACTACAGGTG |
TGAGCCACCG |
CGCCTGGCCT |
CTTCCAGAAG |
GTTTTCGATG |
|
10081 |
GCAGCAGGTG |
GAGGGGAGCG |
AGAGGGCCAT |
CGCTCCTGCT |
GCAAGGGCAG |
CCCCTCCACG |
GGATCCCACA |
CTCCTGACCC |
|
10161 |
GTTCAGTATA |
GACGGGGTAC |
TCCCGGTGCG |
GCCTGTCACC |
GCTCTGTCTG |
GCGCTGGCGC |
TCCTAGAGAA |
GGCCATGCCG |
|
10241 |
ACCTCTCCCC |
CACCTCGCTC |
CAGATTCCCC |
TTCTTGAGCT |
GCTTTTCCCA |
CAGTCCCCCC |
AGGCGCCACT |
GAGAGCCTCC |
|
10321 |
CGTCTTCGAT |
GGGCTTGGGG |
CGCCTGGACC |
CTTCCGGGGT |
CACTTTGCCG |
TCTTCCTCCA |
GCCCCCCTCC |
CTACTCCTTG |
|
10401 |
TCGCCGCCCC |
CCCGTCTTTT |
CAGGACCTTG |
CGCACTCAGG |
CTGGCCCTAT |
GCTTTGGGGT |
ACCCCAGCAG |
TGCCCCCGGG |
|
10481 |
TCTCCGGCCA |
CCCGGTGACG |
GAGGGAGGGA |
GAGGCTGGCG |
AGGGGGCGGC |
GCCTTAGGTC |
CCAGCGCGAC |
CCGCCTCTCA |
|
10561 |
GCCCCTCCCG |
CCGGCTACAG |
CCGCGCGGGA |
CGGGGCAGGG |
ACCTCGAGAA |
CACCGGGGAC |
TTGCGGGCCG |
GTGGCCCCGG |
|
10641 |
GCAGGCCACT |
TCTGACGCGG |
GTGCGAGGCC |
TTCCGCGGGC |
GTGCACGGTT |
GGGCCGGAGC |
CGCGCCGGGG |
GCCGGGACAC |
|
10721 |
GCCTGGGGTC |
AGCGCTGGCG |
TCGCCGCTCG |
GGTGGGGTGG |
GCCGGGTTCG |
GGTCCAGTCT |
CCTCCCCACG |
GGCGGCCCGC |
|
10801 |
GCCCCGTCTT |
CCAGTGACCC |
CCACATCCCT |
TGCGCCCCGA |
GCTGGAACTT |
CGGCAGGTTC |
TGCCCCGGGC |
CGCGCGCGCT |
|
10881 |
TTCTGGGGAC |
CAGGGCTGTG |
GTTCCTGCTC |
CGCGTCGTGC |
CTTTAGACAC |
CGCCGGGCGT |
TCGTCCTCCA |
TCCCTCCCCC |
|
10961 |
CTTTTCCCCA |
CTCCGCGGTT |
GACGAGGGCC |
GCCCCCAAGT |
GGCCATGTCC |
CCCGGCATGG |
ACGACCCCTC |
AGTGCCCCTG |
|
11041 |
AGGGCCCAGG |
GTCTCGCCCC |
CGTCCCGAGG |
CAGGCCATGT |
TTGAATTTAT |
TTGGAGAAAT |
TCGCCCAGCC |
CAGGGGACCT |
|
11121 |
CTTTCTGCAG |
CTCTGGCCTT |
CCAGTAGGAG |
AGGGTCCCTG |
GGGCTCTCAG |
GCCTGACAAC |
TTGGCATCAC |
TGAATTCTCT |
|
11201 |
AACCATGTCT |
GTCCATGACT |
TGAACTCCTT |
TTCTTTTAGA |
AAGATCTTTG |
CAAACCCCTT |
GTGGGCGTGC |
CCCTCCCATG |
|
11281 |
GCCCCCGTGC |
CTCTTGCCTG |
GGCTTTGGGC |
TTGTCTGAAA |
GGGCTAAAAG |
GGGGTGATGC |
CTGGCCGGGC |
GGGTGGCTCA |
|
11361 |
CGCCTGCAGT |
CCCAGCACTT |
TGGGAGGCCG |
AGGTGGGCGG |
ATCACCTGAA |
GTCAGGAGTT |
CAAGACCATC |
CTGGCCAACA |
|
11441 |
TAGTGAAACC |
CATCTCTACT |
AAAAATACAA |
AAATTAGCAA |
GGCTTGGTGG |
CAGGCACCTG |
TAGTCCCAGC |
TACTTGGGAG |
|
11521 |
GCTGAGGCAG |
GAGCAGGAGA |
ATTGCTTGAA |
CCTGGGAGGC |
AGAGGTTGCA |
GTGAGCTGAG |
ATCGTGCCAC |
TGCACTTCAG |
|
11601 |
CCTGGGTGAC |
AGAGAGAGAC |
TGTCTCAAAA |
AAAAAAAAAA |
AAAAAAAAAA |
AAAAGTGATG |
TCCAGTGCCT |
GGGGATTTTT |
|
11681 |
CCTAGGGAGT |
GGTCAAGGAA |
GGCCCCGCTG |
AGGAGGAGAC |
ATTTAGGGTG |
GGAAGTTGCC |
ATGTAGGGCA |
GGAGTGGAAA |
|
11761 |
GGGTTTTGGA |
CCAGAGGGAG |
AGCCAGGGAA |
AAGTCACCAG |
GTAGAATGAG |
CTGGCATGAT |
TAGGAAGATG |
GCAAGGGAGG |
|
11841 |
GCTGTGGCCC |
TCCCATGTGA |
GTGGAAGGGA |
GTGGCTGCAG |
CGCAGAAGGC |
CTGGAGGCCC |
CCGCACAGAC |
CAGGACCTCT |
|
11921 |
GTAGAGGAAC |
TCATCCCCTG |
TGTCCCCTGT |
GATTGTCAAT |
CTCCCTAAAG |
ATGGCCCAGA |
GCAGTGCGGC |
CTGAATCCCT |
|
12001 |
CAGTGGCCTT |
GTCTCTGGCT |
GCCCCAGCAC |
CCACGCGGAC |
ACTTGACTCC |
ACGCCCCAAA |
GAAAAGAAGC |
GAACCTGGGA |
|
12081 |
ACACCACTGC |
CAAGGCATAG |
GATTATTTGG |
GAGGGGGGAA |
GGGGGAACTG |
AGGAGGTGGT |
TACAACATGC |
TGGGCAGCAA |
|
12161 |
CAGACAGAAC |
CAGACCACCC |
CTGTGGCTGC |
CCCGGCTGGT |
CTCCTGGGAC |
CACTGGGCGC |
TTGGCTCAGG |
CCTCTCCTGC |
|
12241 |
CCTCCCCACT |
CTTCCACCTC |
AGCCCCCCTG |
CAGTCTCTGT |
CCTGCCCTGA |
CCCCCCAGCT |
TACAGCCCCA |
AAAGCAGCTA |
|
12321 |
AACAACTCGC |
ACACCCACGG |
GGCATCGCCT |
GGGAGGGCAG |
CCCAAATCCT |
TCCACTTCAG |
CCCCATTTGA |
AAGATGAGGA |
|
12401 |
AATGAGAGGC |
TGGTCCATAG |
TGGGTCTCGC |
GGCTCAGGTG |
CCGAGCCTTC |
CCTGGGCATG |
CACGCCGGGC |
CCTGGAGGGT |
|
12481 |
GGGAAGGGGC |
CAGTGGACGC |
GGGGAGCCTG |
CGGGGTGGGA |
CTGCATCGGG |
AAAGGGGAAG |
GAGTCAGAGG |
CGAGAAGGGG |
|
12561 |
GAGAGTGTCT |
GTCTGGCTCC |
AGCCGCTGCA |
CGCTCTTCCT |
GCTCAGGGGA |
CTCACGGTGA |
CCCCGGAGCC |
ACTCCCCAGC |
|
12641 |
CCAGCCTCCA |
GGTAAGAGGT |
CACTGAGATG |
GGTGGCAGCA |
GGGGCCGGGG |
ATCCCCCTAT |
TACGACAGCG |
GTCATGGGAC |
|
12721 |
GCTGACTCAC |
TGCCGGCCAG |
ACCACCTGAC |
CTCCGCGGCG |
GGAGGAGAGG |
GCCCTGCCAG |
GGGGTTCCCG |
CCGCGCCTTG |
|
12801 |
TTTACCTCCG |
GGAGGCCTCG |
GCCTCGCGTG |
GGGGCAGGGC |
GGCCGCTGGG |
GCCGCAAGGC |
GTGCGGGGAA |
GGGCCAGAGC |
|
12881 |
CGGGTGTCCA |
CCCCAGCTTC |
CCAAAGACTC |
CCTCTTCTGT |
GCTTCCTTCT |
CCCCTCCCCG |
CCCCCCCCCC |
AGTCTCTTCA |
|
12961 |
CATGCCCCTA |
GCCCCCGCGG |
AAACTTCCCG |
CGATCCCAAA |
CGGGCCAAAT |
GGCGAGAAAG |
CAAAGGAGCT |
CCTTCTTGGG |
|
13041 |
GGTGAGTGGG |
GCGCCTTGAG |
CGCTTCCTCA |
AAGCTATGTT |
CCCAGAGCCA |
CAGGCCTTCC |
TTGTGTCCCT |
CACCCTGCTC |
|
13121 |
AGACCGGGCC |
ATAGCCGGGG |
GCTGGGGCAG |
GAAAGCCGGC |
CCTCGGCGGG |
GGCCACGTGG |
CTCTCAGGCG |
CCTGGGCTGC |
|
13201 |
TGAGTCACGC |
TTGGCCAGCA |
CCTGTCTGTA |
GGCCACAGCC |
TCTGCCAGCA |
CGCCCCTCTG |
TGTCCCCTGC |
CCCTGTCTGC |
|
13281 |
AAGGCAGTGG |
CTCCAGCAGG |
CCCTGGGGCA |
TTTTCCACTC |
TCCACCGCCG |
GATGCAGGGA |
GAGGCCTGAA |
CCCTCTCCAC |
|
13361 |
AGGGCTGCTC |
TGGGCAGGGT |
GGAAGCCTTG |
CCCACTTCGG |
AGCCCTCCGG |
GAAGGATCAT |
TCACACCTGT |
GGACCAGCCC |
|
13441 |
CTGCTGTGCG |
CACACCCACA |
CATCACCTTC |
GCACCTGACT |
GGCCCCATCC |
AGCCACTCTT |
GCCTCCCTCT |
GGGTTTCCTC |
|
13521 |
CCCTGGGAGG |
TTTCTCCAGC |
TCCTGCAAGC |
CCTGGGCTGA |
AATGGCATGA |
GTTGGACCCA |
GCAGGTTCTG |
ACCTCCTACT |
|
13601 |
CACAGGACCT |
TGCCTGGGAG |
GCTCCAGAGG |
GTGACCACTC |
GTCCTGCCCC |
TCTCCTTGCC |
CCAGTTCTGG |
CGGACAGGTT |
|
13681 |
ACTCTGGTGG |
CATAAAGCAG |
TGTTTCTTCC |
TTCCTAGCTG |
AGGAGGCTGT |
TGGCTGACCC |
CCTTGGCTGC |
CCACAAGGCC |
|
13761 |
AACGGGCCTG |
AGCCCCCACA |
GGGCCATGGG |
CATTACCTGC |
TGAATTGAGG |
AGCCCATAAG |
GAGTCACTTG |
GACCACAGTG |
|
13841 |
AACACTTGGC |
GACCACTGAC |
ACTCAGGAGA |
CCTTAGCTGG |
TCCTCCAGCA |
CCTCTCAACT |
CCACTCCTAC |
TAAACTGGGA |
|
13921 |
ACTTCTCTGG |
TGCTCAGGCC |
AGAGTCGGGG |
TCCGTCACCG |
AGTATGCTAT |
GCGCTGCCCA |
TCACCGAGGA |
TGCCATGCGC |
|
14001 |
TGTAAGAGGG |
CTGCCACCGC |
GGCAGGCTGA |
CCATGGCAGG |
GTCGGAACAG |
CAACCTGAGA |
GCCAGCTTGT |
TCTGGCCAGC |
|
14081 |
AGTGCCCACT |
GGGCGACCTA |
GCAGCCTCCT |
GATATGGGGG |
CTGTGTCCCC |
CTCTCCCTGC |
ACTGGGTACC |
CCCAACTGAG |
|
14161 |
GATATTGCTG |
AGTCATGGCC |
AGGCCCAAGC |
CTGGGAGGGG |
CGAGGGGCTG |
GACCCCCGCC |
AGTACCCTGA |
TCCCAGGTGC |
|
14241 |
AGAGGCTGGA |
GCCCAGGCCT |
GTATGAGTGC |
CAGGGCCGGT |
TTCCTGGGGT |
CCTTGGTGCA |
CCGGGGCAAT |
GAAGAGAGGG |
|
14321 |
GTCAGCAATG |
AGGGGGCCGG |
GAGACCTGGA |
GCGAGGGGTA |
GCGGGGAAGG |
GGAGAGTAGT |
GAAGGGGCCT |
CTGCAGGGCG |
|
14401 |
GCTCTCGCGC |
CGCGACGACG |
GTGGCGGGGG |
CGGGGAGGGC |
GCGAGAGACT |
CCGCCCCTCT |
CGAGGCGGGG |
CGGGGCCTCC |
|
14481 |
GCGTTCGCTA |
CAAAAGCCGC |
GCGGCGGCTG |
CGACCGGGAC |
GGCCCGTTTT |
CCGCCAGCTC |
GCCGCTCGCT |
ATGGCGTCGC |
|
14561 |
TCACCGTGAA |
GGCCTACCTT |
CTGGGCAAGG |
AGGACGCGGC |
GCGCGAGATT |
CGCCGCTTCA |
GCTTCTGCTG |
CAGCCCCGAG |
|
14641 |
CCTGAGGCGG |
AAGCCGAGGC |
TGCGGCGGGT |
CCGGGACCCT |
GCGAGCGGCT |
GCTGAGCCGG |
GTGGCCGCCC |
TGTTCCCCGC |
|
14721 |
GCTGCGGCCT |
GGCGGCTTCC |
AGGCGCACTA |
CCGCGGTGAG |
CGGGCCGGGG |
AGCGGCGGGG |
GCGGTGACGC |
AGGCCGGACA |
|
14801 |
CGGCCTCCTG |
CCGCGGGGTG |
GCTGCCCCCT |
CCCTTCTCGG |
CGACGCCTGG |
CGGGCCGTGA |
GGGGGTCTGC |
GCTGGCTGCT |
|
14881 |
CCCTGGATGG |
CGGTGGCCTG |
CATGGGTCCC |
CAGTTCGGCC |
ATGGGAGCCG |
GCCTGGTGAC |
TGGAGTGGTG |
ACCAAGGCCG |
|
14961 |
GGACCCGCTG |
CTCAGCGTCG |
GCCCCCTGGG |
GCGGTGGAGC |
CCTGCCGGCC |
GGGGGCTCGA |
GCCTGGGGGC |
GTCAGACGCC |
|
15041 |
CCGCTCCACC |
CCCCGCGCTG |
TTGGGGATTT |
TGGCAAGGAC |
GCGCCGGGGC |
GAACGCTCTG |
GCTCTCCGCG |
GGCACTGGGT |
|
15121 |
GGTCAGGCGG |
GCACTCGGGT |
TACACTGACA |
CCTTGCTGCG |
CCAGGTGGTG |
GGTTCAGATA |
ATGCCCTGGA |
GGAGCCGGGC |
|
15201 |
GAGCGCCGGC |
GAGGGGAGGG |
AGTGACGCGG |
GTAAACAAGC |
GCGGGGGTGC |
GGGGGACTCG |
CGAGCGCCGC |
GACAGCGCCT |
|
15281 |
GGGAGAAGGG |
CACGGATCGC |
CGGCGGAACG |
CTCCGAGCCA |
GGTCGAGTAC |
AGATGTTTTC |
CCATTGGCAA |
GTGGACGAGA |
|
15361 |
ACGTTCTTCA |
GAAGTGTTGG |
TGTTGGCACA |
GAAGCCCTGT |
TTCCTGCTGC |
GCTGGTGTGA |
CCAGTGGCTG |
CTGGGGGTGG |
|
15441 |
GGTTAGGGAG |
GTGGTTGTGG |
CCAGGAGGGC |
GGGAGGTGGC |
CAAGGCCGGC |
CCCTGGGAGG |
GTGCAGTGCT |
AGGACCTCCC |
|
15521 |
TCTGGAGCGC |
TGCCAGCATA |
CCAGGCCCTC |
TCCTATTCTT |
AAAAAAAAAA |
AAATTGTGAT |
GTTATTGAGC |
TGTAACTGAA |
|
15601 |
ATAAGGGTTC |
ACCCATTTAG |
TGTACAAGTC |
AGTGGTTTTC |
ACTATTTTCA |
TAGGTTTGTG |
GACCATATTC |
AGTGTGAGAG |
|
15681 |
CTTTTCATCA |
CCTTATAAAC |
GCCATACATA |
CCTTTTATCA |
CCCTCTTATC |
CACATGCCCC |
CAGCAACCTC |
CTTTCTGTAT |
|
15761 |
CTATTCATCT |
TGCAATCCTG |
GACATTTCAT |
GTAAATAGAA |
TCAGACAATG |
TAGTCTTGCA |
AGTGGTTTCT |
TTCACTTAGT |
|
15841 |
GTAATGTGTT |
CAGTGTTGTG |
GCACATATCA |
GAACTAGTTT |
TTTTTGGAGG |
AATAATATTC |
TGTTGTGTGG |
ACAGGCCATG |
|
15921 |
TTTTGTTTAC |
CTGTTCATTA |
GTTGACAGAC |
ATTTGGGTTG |
TTTGCATCTT |
TGGGTGCCTC |
ACCTGTGCCC |
TCGGTGGATG |
|
16001 |
GCGTGGTTTG |
GTCTGCACAG |
CTGTGCTTTA |
AAGCCATTTC |
AGCTCATATG |
TATCTGTCAC |
AAATGGAGAC |
TACATACAAA |
|
16081 |
TACGTGTCTT |
TCAGCTTTGT |
TAGGGAAGGA |
AGCAGGCAGA |
TGCCCTTGTC |
TTCTCCTTTA |
CTCCCAAGCC |
ACTAGAAGTC |
|
16161 |
TAAGGCCTTT |
GTGGGGCTGG |
GCCCCTCAGG |
AGCAGGTCAC |
AGGCAGGGAT |
CTCCCACAGT |
TGAAGACGGA |
CATGGGCCTG |
|
16241 |
GTCGGATGGG |
GAATGAGGTG |
GCACGTTTCT |
GAAAAGGCAG |
CTGGCCCAAG |
GCTAAATAAG |
TGCAGCAGCC |
AAAGCTGCTG |
|
16321 |
CCCCTGTGCA |
CAGGCCCTCC |
GCCCCCTGCA |
TGTGGCTGTG |
GTCCAGGGCC |
CAGGTCGGAG |
CACTCACCTT |
CCAGGAGGTG |
|
16401 |
CCAGAGCAAG |
GGGGTAGTCT |
TGCCTCTCAC |
TCCTGCCCTC |
TGTGGCTCAA |
GTAGGTGTGT |
TTGTTTATAG |
CCCTGTGAGT |
|
16481 |
GTCCCTTTCA |
TACTTGCCTC |
AGCCCATTCC |
AGCAGCTTAT |
GTCCAGCTGA |
GAACCCCTGG |
GTGCTCACGT |
GCTGTCTTTT |
|
16561 |
AAACAATCTA |
GATGAGGACG |
GGGACTTGGT |
TGCCTTTTCC |
AGTGACGAGG |
AATTGACAAT |
GGCCATGTCC |
TACGTGAAGG |
|
16641 |
ATGACATCTT |
CCGAATCTAC |
ATTAAAGGTA |
AGGGGCTGCT |
CTGGGGGCTG |
CCTGAAGCCA |
GCTCAGCTTG |
TACTCAGTTC |
|
16721 |
CCTGCTGAGT |
AAAAAACAGG |
GCTCGATGTT |
CCACCAATGA |
AGGGGTCAGC |
AATTTGAGGG |
CTGTTTAAGA |
CAGAGACATA |
|
16801 |
GGCCAGGTGT |
GGCTCACGCC |
TGTAATTCCA |
GCACTTTGGG |
AGGCCGAGGC |
GGGCAGGTCA |
CCTGAGGTCA |
GGAGTTTAAG |
|
16881 |
AACAGCCTGG |
CCAACATGGT |
GAAACCTTGT |
CGCTACTAAA |
AATACAAAAA |
TTAGCCGGGT |
GTGGTGGTAC |
ATGCTTCTAG |
|
16961 |
TCCCAGCTAC |
TCAGGAGGCT |
GAGACGAGAA |
TCACTTAAAC |
CTGGAGAGCG |
GAGGTTGCCA |
TGAGCCGAAA |
TCACATGACT |
|
17041 |
GTACTCCAGC |
CTAGGCGACA |
AAAAAAAAAA |
AAAAAAAAAA |
AAAAAAAAAA |
AAAAGACATT |
TTAAGTGCTG |
TGCCTGACCT |
|
17121 |
GAGAAAGAGG |
AGTCCATGTT |
CACTCTAGGG |
ATGGGGTCTG |
GTGCCGTGGT |
GCTTGGGTTA |
GGGATGGGGT |
CTGGTGCCGT |
|
17201 |
GGCGCTTGGG |
TGAAGGGCAA |
GGCCAAAGCT |
GTGCAGACAG |
GGCTCCTTGC |
TGCTGCTCTG |
CTGCCTGGGC |
CAAACAGACA |
|
17281 |
CAGGGACTGG |
GAACCTCCTA |
GCAGGTTCTT |
GGTGGCTCTG |
CTGCCCTCAC |
CTAAGTGGCT |
GAATTTTGTG |
TGGATTCCAT |
|
17361 |
GCTGGAGAGC |
AGGGCCGGGG |
GCCTTGCTGG |
CAGTGACAGC |
CCCACAGTGA |
CGACAGAGGG |
GGAGGACTTT |
AGGGGGTCCC |
|
17441 |
ACCCTAGCGG |
CTCTCTTTAC |
CCTTCCTGTA |
GAGAAAAAAG |
AGTGCCGGCG |
GGACCACCGC |
CCACCGTGTG |
CTCAGGAGGC |
|
17521 |
GCCCCGCAAC |
ATGGTGCACC |
CCAATGTGAT |
CTGCGATGGC |
TGCAATGGGC |
CTGTGGTAGG |
AACCCGCTAC |
AAGTGCAGCG |
|
17601 |
TCTGCCCAGA |
CTACGACTTG |
TGTAGCGTCT |
GCGAGGGAAA |
GGGCTTGCAC |
CGGGGGCACA |
CCAAGCTCGC |
ATTCCCCAGC |
|
17681 |
CCCTTCGGGC |
ACCTGTCTGA |
GGTGAGCAGG |
CCCTCTGTGC |
AGGCCTGGGG |
TGGGCTCAGG |
GTGGCAGGAA |
CCTTGACCCG |
|
17761 |
CTCACTGCCT |
GCCGCTCTGC |
TAATTCCTCC |
CCCAGGGCTT |
CTCGCACAGC |
CGCTGGCTCC |
GGAAGGTGAA |
ACACGGACAC |
|
17841 |
TTCGGGTGGC |
CAGGATGGGA |
AATGGGTCCA |
CCAGGAAACT |
GGAGCCCACG |
TCCTCCTCGT |
GCAGGGGAGG |
CCCGCCCTGG |
|
17921 |
CCCCACGGCA |
GAATCAGGTG |
AGGCTTGTGT |
TGGAACCTGC |
TTCTGATTGG |
TGACAGTAGT |
CAGGCAGCCT |
GTGTGCAGGG |
|
18001 |
CCCTTGTGCA |
AAGCGTGTGT |
GCAAGGCAAG |
AATTCAGGAT |
ACCCCCCACC |
TTCCTGGTGC |
CCTACAATCA |
CACAAGAACC |
|
18081 |
CTGCAAAGTG |
GGGTGTATTC |
TCTCCATTTC |
CCAAATGGGG |
AAACTGAGGT |
GCCTAAGTGC |
CTGAGGCCAC |
AAATTTACCT |
|
18161 |
GCACAGCCCT |
TCCTACCTCA |
GGAGGCTGCC |
CTCTCAAGGT |
ACCCTGAGGT |
GCAGGCAGGG |
AGGCCCTTCC |
AGCCCAGGGG |
|
18241 |
TCTTTGATGC |
ACTTTGTTCT |
CTTTTGTGAT |
GGTTGTCAGG |
AAGATCAGAG |
CCAAAGTTGC |
TGAAGTCCTT |
TGAAACATAG |
|
18321 |
TTATAAGTGA |
AAGACTTACC |
GTTAGCTTTG |
TAGTCTAGAT |
TTTTGGATTC |
TGATTTTATT |
AATGTTACTG |
TGTCCTGTGA |
|
18401 |
ATCACCTCAC |
CCTTTGGGAC |
AAAGATGGGG |
ATGTTTGCTT |
GACTTTGAGT |
AAATAACATA |
TTTACTCAAG |
GAGGTGATCA |
|
18481 |
ATATTCACAG |
TGTACTGAGC |
CTGAGCCTCT |
GTGGGGTTGC |
TGAGCACCAG |
GGTCACAGAT |
GAGGGGGAGA |
TGGCACGGGA |
|
18561 |
GAGTGGAGAT |
GCTCTCTGTG |
CAGGGCCAGG |
GGGTGCAGAG |
TGGGAGGAAG |
GAGAGGGGGA |
TGCTGAGTGG |
GTCACTGGAC |
|
18641 |
AAGATGTCCG |
GGTTAAAGGT |
CACCCGGGAA |
CACAGGGACC |
TTGGCAAGAA |
GGTGACAGGA |
CTGTGACAGG |
TATCCAAGGC |
|
18721 |
ATTAAAGATA |
TCTTTATCTT |
ATCTTTGTAA |
AAATCAAAGC |
TTCTGGTCCA |
TCGGAGGATC |
CGAGTGTGAA |
TTTCCTGAAG |
|
18801 |
AACGTTGGGG |
AGAGTGTGGC |
AGCTGCCCTT |
AGCCCTCTGG |
GTGAGTGCAC |
CTCCTTGCCC |
AGTGCTTCCC |
TAACTCAGCC |
|
18881 |
TGCACTTTAT |
GTAACTTTCA |
CCTGGAATAC |
TGCAAAGGAA |
TGGGTAATTG |
ACATGCCCTT |
GACACTGGTG |
AGGATTTGTT |
|
18961 |
GCCTCAAATC |
AACCTTTAGT |
AGTGCTGGAC |
CACGGGCAAC |
TCAAGGTTGA |
ATTCCCTGAC |
AAAATTCTCG |
AGCTTTCTAC |
|
19041 |
ATGGAGTGAA |
GTCGAATCAG |
CTGGCATTTG |
GTTGGGATAA |
TCCCAGTGAG |
GGAGGGTGAC |
CAGACTGTGG |
TTGCAGGTCT |
|
19121 |
TGCCTGTCAC |
CTCTGACTGC |
CCTCCTTTGG |
ATGAACAATA |
GTCTGCAATT |
TCCCCTAAGC |
CAGGCCTTGG |
GATTTGGACT |
|
19201 |
TTTGAGGACC |
TGGGTAGTAG |
TTAGTTGACA |
TGCTATTTAT |
CTTTTCCTTC |
TTTTAGTTTA |
CAACCCCCAA |
ACCTTTCTGG |
|
19281 |
TGCTTCTGAG |
TGCTGGGCCT |
GCCTCAGAGC |
GGGGAGAAGG |
TGGAGGCAGG |
TGCTCTGGGT |
GACACCCGGT |
TCTGGTGCCT |
|
19361 |
CAGGTTGCAC |
AGTGGGTCTG |
TGGGGTGGGT |
GAAGGTGCCA |
CCTGACCCCA |
GGCATGAGCT |
CAGGCTCTGA |
GACCCCTCCC |
|
19441 |
TGCTGGGCTT |
TAGAAAGTGC |
CTGTTGCGTC |
TGGCTAAGGG |
TCTCGGGTGG |
GTGCTGCAGG |
GTTAGGCATG |
CGGCAGTGAG |
|
19521 |
AATCAGTGAA |
TGAAGCCAAC |
TTCTAGTTCA |
AGATGACGGT |
GGATCTGTGA |
TGGCAGATGG |
ATCGCTGGGA |
CACCTGCATC |
|
19601 |
TCTCGCTAGA |
CGAGTGAGCT |
TGTTTTTGTG |
ACTGCAGTGG |
CAGGCTGTGT |
CCACCCAGCC |
TGTCCAAAGG |
TTGTGTTCAG |
|
19681 |
TGTCTGAAAG |
AATGCTGCTA |
GGAGGGGCGT |
CCGAACCATG |
CCTGCTGCCT |
CCTGTGGCTA |
CAGGGCGTGA |
CTGCCATCTG |
|
19761 |
CTCCAACTTT |
GCTCATTCCG |
ATTTTTTTTT |
TTTTTTGAGA |
TGGAGTCTTG |
CTCTGTCACG |
GAGGCTGGAG |
TGCAGTGGTG |
|
19841 |
CCATCTCTGC |
TCACTGCAAC |
CTCCCCCTCC |
TGGGTTCAAG |
TGATTCTCCT |
GCTTCAGCCT |
CCTGGGTAAG |
TGGGATTACA |
|
19921 |
GGTACCCGCC |
ACCACGCCCA |
GCTAAGTTTT |
GTATTTTTTT |
AGTAGAGACG |
GGGTTTCGCC |
ACGTTGGCCA |
GGCTGGTCTT |
|
20001 |
GAACTTCTGA |
CCTCAGGTGA |
TCCACCCGTC |
TTGGCCTCCC |
AAAGTGCTGG |
GATTACAGGT |
GTGAACCACT |
GCGCCTGGCC |
|
20081 |
TTGCTCATTC |
CACTTTGAGG |
CGGCAGTGAC |
ATCTTGGCAT |
TGCTTCTGAA |
CTAAACAGCC |
CAAACACGCA |
TGGCTTCTGT |
|
20161 |
ACTAGAGTTT |
AGAGGTGAAG |
TCAGAGAATA |
GGAAAGAAGA |
ATCGCTAGTC |
TGTTTTTTTT |
TTTTTTTCTT |
TGAGACAGAC |
|
20241 |
TTTCACCCTT |
GTCGCCTAGG |
CTGGAGTGCA |
GTGGTGTGAT |
CTCGGCTCAC |
TGCAACCTCT |
GCCTTCCAGG |
TTCAAGCGAT |
|
20321 |
TCTCCTGCCT |
CAGCCTCCCA |
AGTAGCTGGG |
ATTATAGGTG |
CCCACCACAA |
CGCCTAGCTA |
TTTTTTGTAT |
TTTTAGTAGA |
|
20401 |
GACGGGGTTT |
CACTGTGTTG |
GCCAGGCTGA |
TCTCGACACC |
TGCCTCGGCC |
TCCCAAAGTG |
CTGGGATTAC |
AGGTGTGAGC |
|
20481 |
CACCGAGCCC |
GGCCAAGAAT |
CAGTATTCTT |
AATGCTTTCC |
AAGGAGGGTA |
AACAGTGGAA |
CACCGAGGTC |
ATGATTGAGA |
|
20561 |
GATCGGCTGT |
GCCTAAATCC |
TGCCACTCAC |
TCTGGACGTA |
GTAAAGGTCT |
GAGAGATTGC |
CGGTGAATTT |
GGAGATATGA |
|
20641 |
ACAAGAGTCC |
TTCACTCAAA |
GAACAAAGGA |
GACATCCCGG |
GAAGTCACAA |
AACCAAGTCT |
AGGTTCAGGT |
AGAGACTTTT |
|
20721 |
AAAACAGTGC |
AGATCCCCAG |
ACCTTACCTT |
TTTTGATCCT |
GGTGAGTGAG |
TGTTGGTGGG |
GCCTGGGATT |
TGTCTGGGAG |
|
20801 |
GCTGCCCAGG |
TGCTTGGGGT |
CACACTGTCT |
CAAGACCCAC |
CTGTGCTGGT |
GCAGCATCTC |
GGAACTGAAC |
TGGACGCCCC |
|
20881 |
ACTTGGCACA |
GACAGGAATA |
ATCTTGCAGA |
TGCTCAGATG |
TCTTTTTTTT |
TCTGAGACGG |
AGTCTCGCTC |
TGTTGCCCAG |
|
20961 |
GCTGGAGTGC |
AGTGGCACGA |
TCTTGGCTCC |
TGGGTTCACG |
CCATTCTCCT |
GTCTCAGCCT |
CCCGAGTAGC |
TGGGACCACA |
|
21041 |
GGCGCCCACC |
ACCACGCCCG |
GCTAGTTTTT |
TGTATTTTTA |
GTAGAGACGG |
GGTTTCACCT |
TGTTGGCCAG |
GATGGTCTCG |
|
21121 |
ATATCCTGAC |
CTCGTGATCC |
ACCCGCCTCA |
GCCTCCCCAA |
GTGCTGGGAT |
TACAGGCGTG |
AGCCACTGCA |
CCCGGCCCTC |
|
21201 |
AGATGTCTTT |
GAGCACTAAA |
AGCTAAAGGC |
TTAAAGTGGA |
TAGAATTGCT |
TTCTCCGAAC |
TAGGGAGGAG |
CCTGAAGCTC |
|
21281 |
TACAAACAAG |
GAATATTTCA |
GTAAAATAGA |
CTGAGTGGTT |
TCAGCTGAGG |
AGAGCCAGTG |
GAGGAGCGGC |
TTAGACTGTT |
|
21361 |
GTGGAAGTTA |
GGGCTAGTTT |
CTTTCTTTAT |
GAGGAACTAT |
AAGCTGGATG |
ACTCACAGGT |
TAACTGCCTA |
GAAGGGACTG |
|
21441 |
TTCCCAGAGC |
AGGGCCCAAA |
GGGGAACGAA |
CGAGTCAACA |
GCTGCCTCCC |
GGTGGCAGTG |
TGTGTAGGGT |
AAGCGGCAGC |
|
21521 |
TTTCATGTGA |
AGTGCCAAGG |
CCTTTTGGTT |
GGGGGGCTGA |
GAACCTGCTG |
AGGGAGCCAC |
AACTGAGACC |
CAGGCTGCTC |
|
21601 |
CTGGCTGGGA |
GGCTACAGGC |
CCCGAAGCCA |
CAGGGCCCCT |
CCTGGTCGTG |
GGATCTTTGA |
TAAACCGACA |
CGCAGAGGCT |
|
21681 |
TTGTGAGGCA |
GCAGGGCACG |
GAGCATCGTC |
TAGTCTTTTT |
TTTAAGTGAA |
AGCACATTTC |
TTAAGAAAGT |
AAAGGAATAC |
|
21761 |
AAGAATGGCT |
ACTCCGTAGA |
CAAAGTAGCC |
TGTCCAGTCT |
TACGACAAGG |
CCTGAAAATC |
TCACGCTTTC |
ATCACCAAGT |
|
21841 |
TGGAGAGATC |
CTAATTCTAT |
AATTTCTCTG |
ATCTGCTTAC |
AGGCGAGTCT |
CATTACCCAT |
TTACAAAGCA |
TATGGGTATT |
|
21921 |
GATTGTACTA |
TTCTTTCAAC |
TTTTCAACTT |
TTCTGTTTGT |
TTCAACCTTT |
TTTTTTTCTT |
ATCTTTTTTT |
TTTTTTTTTT |
|
22001 |
TTTTGAGACA |
GACTTTTGCT |
CTTGTTGTCC |
AGGCTGGAGT |
GCAATGGTGT |
GATCTTGGCT |
CACTACAACC |
TCTGCCTCCC |
|
22081 |
GGGTTCAAGC |
GATTCTCCTG |
CCTCAGCCTT |
TCCCAGTAGC |
TGGGATTACA |
GGCATGCACC |
ATCACGCCTG |
GCTAATTTTG |
|
22161 |
TATTTTTAGT |
AGAGACGAGG |
TTTCTCTGTG |
TTGGTCAGGC |
TGGTCTCGAA |
CTCCCGACCT |
CAGGTGATCT |
GCCCGCCTTG |
|
22241 |
GCCTCCCAAA |
ATGCTGGGAT |
TACAGGCGTG |
AGCCACTGTA |
CCCGGCAGCT |
TCAACCTTTT |
CATAATCACA |
AGTTGGGAAG |
|
22321 |
AATAAAGGAG |
AAAAATAATT |
TAAATATTAC |
AGATGGAGGT |
ACTTGTAGTT |
AAGCCTGCTT |
TTACCTCTGC |
TCCCCCCTTA |
|
22401 |
AACTCTATTG |
AAATAACAGG |
AAAGGGCTTA |
CACAAAAAAG |
AAAACACCAT |
AGACAAAGAA |
AGTAATAGAG |
GAAAAGACAA |
|
22481 |
AGATGTGATC |
AGATTTTGGA |
AGATGGACAT |
GGAGGGAGGG |
TGGCAACCTG |
CTGAGGGAAG |
CTTCAGAAAT |
CTCTGTGCCA |
|
22561 |
GAACAGGGCA |
TCCTGGTGAG |
AAGGAAGACC |
ATCTGTTCAG |
GTACCAGGAA |
AGGCAAGAGG |
TGGGAGTGAT |
CGGGGGACAG |
|
22641 |
GAACAAGGGG |
GGAGGGTTGA |
AAACCTCTAG |
GGAGTGTCAG |
CTAGAGACTT |
CAGGGCCCAC |
TTGGCCAGAT |
GATGGCTCTT |
|
22721 |
GCCAACAATT |
GAAGGAGACC |
TCATGTTTAA |
TTCCAGGAGC |
ACCTGACCCA |
GGGAAGCTTC |
AGCATGGGAT |
ACTAGAGACA |
|
22801 |
GGCCAGGGTG |
AGGCACCAGC |
CTGAAACTGG |
GCATCCCGAA |
TGAGCAATGG |
GCCCAGCAGT |
CAGTTCCCTG |
GCCAGGCAGG |
|
22881 |
AGGTGGGGAG |
TCCCTGCTGA |
GGGAGGTTGA |
ATGCCCCCAG |
AGAAGTGTCC |
TACAGCTCCT |
GCCTGCCAGA |
CCCCCAGGCA |
|
22961 |
CCCTGCAGAG |
GGAAAGTTAG |
TGATGGCGCT |
GGCCCCACAG |
AAGGCTTCCT |
GTCAACCTTA |
GTGCTTCAGT |
GGGATCAGAC |
|
23041 |
CCCTGGGTCA |
CCAGCCATCT |
GAGGAAAGCC |
CCCACACATG |
GGAGAAAAGG |
TCTTAAAGGG |
AAGAGAATGT |
TAATGTCCTC |
|
23121 |
AGTGAGATAC |
AAAAGAGCAT |
TGTCTGCATG |
AAAAAGGAAC |
AGGATACTAG |
AAAAAGGATA |
ACAAAGGGAA |
CAGAATAAGA |
|
23201 |
AAGTTCTTAA |
AATTAAAAAA |
TAGAAAAAAA |
ACTTCAAAAG |
AAGGCTTAGA |
TTCTACTATT |
GTCTCAAGGA |
GATATTCCAA |
|
23281 |
AAGCAAAATA |
GAGATAGTTT |
TTTTATCTTT |
CCTCTCCTCC |
TAATGTAGGA |
ACAAGTCTAG |
AAGCCCAATA |
TAGGATGAGT |
|
23361 |
AGGAATTCCA |
GAAAGAATAG |
AAAACTGAAA |
GGAGGAAATG |
AACATAGAAT |
GCAAGATAGT |
TTCTTGGCAC |
CAGGCATGGT |
|
23441 |
GGCTCATGCC |
TGTAATCCCA |
ACACTTTGGG |
AGACCAAGGC |
AGGCAGATGA |
CTTGAGCTCA |
CGAGTTTGAG |
ACCAGCCTTG |
|
23521 |
GCAACATAGG |
GAGACACCAT |
CCCCCACTTC |
CCGTGTCTAC |
AAACACTATG |
AAAATTAGCC |
AGGCATGGAG |
TTGTGTACTT |
|
23601 |
ATGGCCCCAG |
CTACGTGGGA |
GGCTGAGGTT |
GGGGATGGCT |
TGAGCCTGGA |
AGGCAGAGGG |
TGCAACGAGC |
CTGGATTGTA |
|
23681 |
CCACTGCACT |
CCATCCTGGG |
AGACAGAAAC |
AGACCTTGTC |
TCAAACAAAA |
CAAAACAAAA |
CAAAACAAAA |
AACAAAAATG |
|
23761 |
CAAGATAGTT |
TCTTGGCAAT |
GAGGCCATTA |
TCTTCTAGAT |
TGAAAGCGGC |
CCTGACAATG |
ACTGAGAAAG |
ACCCCACAGC |
|
23841 |
AAAGCTTATC |
ATTGTCAAGT |
GTCAGAACCC |
TGAGGATAAA |
GAAAGAGGGG |
CTTCTAAAAC |
TCTAGGTTAT |
TGTAAATACT |
|
23921 |
TGGGAATACA |
AAGAGCAAAC |
AAACATGAAC |
AGCTAGGAGA |
CAATGGAACA |
ACTGATGACT |
CCAGAAATCA |
GTTAAAAAGG |
|
24001 |
CTTTTCGGCC |
TAGAATTCTT |
TTGTTTTTGA |
AACCGGGTCT |
CACTGTCATC |
TGGGCTGGAG |
AGCAGTGGTA |
CAGTTGTCTT |
|
24081 |
CATCTCGTGG |
GCTCAAGTGA |
TCCTTCTGCT |
TCAGCCTCCC |
CAGTAGCTGG |
GACTACAGGC |
ATGTGCCACT |
GTGCCTAGAT |
|
24161 |
ATTTTTTTTT |
TTCTTTTTCT |
TTCCCCACCC |
CCGAGTCAGA |
GTCTCGCTGT |
CGCCCAGGCT |
GGAGTGCAGC |
GGCGTGATCT |
|
24241 |
CAGCTCACTG |
CAGTTTCCAC |
CTCCTGGGTT |
CAAGCAATTT |
TCTGCCTCAG |
CCTCCCGAGT |
AGCTGGGATT |
ACAGGCACCC |
|
24321 |
GCCACCATGC |
CTGGCTAATT |
TTTATATTTT |
TAGTAGAGAT |
GGGGTTTCAC |
CATCTTGGCC |
AGGCTGGTCT |
TGAACTCCTG |
|
24401 |
ACCTCATGAT |
CTACCTGCCT |
TCGTCTCCCA |
AAGTGCTGGG |
ATTACAGGCA |
TGAGCCACTG |
TGCCCAGCCC |
CGTGCCCAGC |
|
24481 |
TATTTTAAAA |
TTTTTTTGTA |
GAGATGGGGT |
CTTGCCATGC |
TGCCCAGGCC |
GATTTTGAAC |
TCCTGGGCTC |
AAGAGATTTC |
|
24561 |
TCATGTTGTT |
CTCCCAAAGT |
GTTAGGATTA |
CATGCATGAG |
TCACTGTGCC |
TGGTCCAGCC |
TAGAATTCTA |
TACCCAGCTA |
|
24641 |
AAACGATCAA |
GTATCAGGGC |
AGAACAAAGT |
TACTTTCTGG |
TATACAGTCT |
TAATTATTTT |
AACCTTGGTT |
TGAACCACAT |
|
24721 |
GAAGAGAGAA |
ACCCAAAATG |
AAGCACAGAG |
GTATTAGGAA |
ACGGGGGACC |
TAACAGGACA |
GAGGGTGAAC |
GTAAAGGGGG |
|
24801 |
TCACAGCATC |
ACAGCGCAGC |
CTCTGCCCAG |
AGAGCTGCCA |
GTCCTGGCTG |
GAACAGGAGG |
CTGCACAGCA |
AGCGGCACTC |
|
24881 |
TCTATGGAAA |
ACACCAGAAC |
CAGTGGGGTC |
TCTGATCCAC |
TTCACCCTTG |
TAGAAAACTA |
TGGAGATGCT |
GTGGGAGAAC |
|
24961 |
ATAGGGAAAG |
TTAGCAAATG |
CAAAGAGAAG |
GCAGGCTGGG |
TGCAGTGGCT |
CATGCCTGTA |
ATCCAAGCAC |
TTTGGGAGGC |
|
25041 |
CAAGGCAGGC |
AGATCACCTG |
AGGTCAGGAG |
TTCAAGACCA |
GCCTGGCCAA |
CATGGTGAAA |
CCCCGTCTCT |
ACTAAAAATA |
|
25121 |
TAAAAATTAG |
CTGGGCGTGG |
TGGCACGTGC |
CTATAGTCCC |
AGCTACTCGG |
GAGGCTGAGG |
CAGGAGAATC |
ACTTGAACCT |
|
25201 |
GGTGGGCAGA |
GGCTGCAGTG |
AGCCGAGAGA |
TTGCGGCACT |
GCGCTCCAGC |
CTGGGCAACA |
GAGCGAGACT |
CTGTCTACAA |
|
25281 |
AACAAAGGAA |
AAGAGAAGCC |
ATTATCAGTG |
GTAGGAAAAC |
AAAAAGCGAT |
ACAAAGAAAT |
AGTCTGCTAC |
TTGACCCAGC |
|
25361 |
AAAGAACATT |
TAGATAGAGG |
TCCTAGCTGA |
AACATTGAAT |
GTTCTTTTTT |
TTTTTTGAGA |
TGGAGTCTTG |
CTCTGTCACC |
|
25441 |
AGGCTGGAGT |
GCAGTGGCGC |
CAACTTGGCT |
CACTGCAACC |
TCCGCCTCCC |
AGGTTCAAGC |
GATTCTTCTG |
CCTCGGCCTC |
|
25521 |
CCGAGTAACT |
GAGACTACAG |
GCGATTGCCA |
CCACGCCCAG |
CTGATTTTTT |
GTATTTTTAG |
TAGAGACGGG |
GTTTCACCAT |
|
25601 |
ATTAGCCAGG |
GTAGTCTCGA |
TCTCCTGATC |
TCGTGATCCG |
CCCGCCTCGG |
CCTCCCAAAG |
TGCTGGGATT |
ACAGGCGTGA |
|
25681 |
GCCACTGCGC |
TCGGCCTACT |
GAATGTTCTT |
TTAACAAAAA |
ATTGCGTCTT |
GGAAAGGGTA |
GTGGAAGGAG |
AGGCAGGTAC |
|
25761 |
TGGTCCAGAG |
TAGGAAGTGC |
GCAGATGTGA |
ACTCCAGCAG |
CAGGCAAGAG |
CAGCGTGAGC |
TGAAAGGGCA |
CAGGGAGAGT |
|
25841 |
TTGGGGAGTG |
GGATGGGGGC |
AGGGAGTGTG |
CACCACAGCC |
TTTGGCAACT |
GGTGACTCTA |
AGCTGTGGGT |
GTGGAGACTT |
|
25921 |
TTGACACATT |
TAATTTCAAA |
AATAGTGCAG |
GTGATTCTCC |
CCTCTGTCCC |
TGGTGAGTCA |
CGTGAGCCTG |
GAGGTGTGCA |
|
26001 |
GGTTCCCTGT |
GGGGCCAGCG |
TTGGGCCCTA |
CGCAGCGCCT |
GCACGGCCCA |
TCCACCTCCC |
AGAGCAGAGC |
TCCGGCGTTG |
|
26081 |
AGGTGTGCGT |
GTGTGTCGTC |
GCACAAAGCC |
TCTGTGTGCA |
GGTGTCAGGA |
GGCACATGGC |
CTTCCATCAC |
TGTGGCTGGA |
|
26161 |
AGCGCCTGCC |
ACATGTGGTA |
TTGGCTCTCC |
CTGTACTTAG |
GGCCTGGCCC |
CACCTGCCGT |
GTGGCCCGTG |
TCCTGCATGG |
|
26241 |
TTAGGAAAAA |
GGTCTCTTTC |
GACTTTCTGG |
CTCAGAAGCT |
GACTTTAGAT |
GCTAGCTGAG |
GTTAATGTTT |
TACTGAATTG |
|
26321 |
GAGAAAGAGA |
AAGGTCCAGT |
ATCATGGGCC |
CACCAAGAAT |
GTTTCCAGAA |
GCCACAGAGA |
TTTGTAGCTG |
GGATATGGGG |
|
26401 |
AGAAGCCTCT |
GACTCATGGG |
TTTGTATCGT |
CTGGTCCCAT |
ACTGGCTGTG |
TGATTGCGGG |
GTCGAGCTGG |
GTAGAACCTA |
|
26481 |
GCACCTGCAT |
CCACACGCTG |
AGTGCCACCA |
TCCAGACACT |
TAGCTGCTTG |
TGGGGACTGA |
ACGTTGAGAT |
CTTCGTGAGT |
|
26561 |
CTGTAGTCTC |
CACAGGCCAA |
GCTCCTGCTT |
GCAGGTGCAT |
CCTTGGGGGA |
ACTTCACGGC |
TTGCTCTTTC |
CTCCTCCGCC |
|
26641 |
TCTAGGCATT |
GAAGTTGATA |
TCGATGTGGA |
GCACGGAGGG |
AAAAGAAGCC |
GCCTGACCCC |
CGTCTCTCCA |
GAGAGTTCCA |
|
26721 |
GCACAGAGGA |
GAAGAGCAGC |
TCACAGCCAA |
GCAGCTGCTG |
CTCTGACCCC |
AGCAAGCCGG |
GTGGGAATGT |
TGAGGGCGCC |
|
26801 |
ACGCAGTCTC |
TGGCGGAGCA |
GATGAGGAAG |
ATCGCCTTGG |
AGTCCGAGGG |
GCGCCCTGAG |
GCAAGCCTGT |
GCCCCTCCCG |
|
26881 |
CCACCTGGGA |
CCACGGCCAG |
CCTAGTGATC |
TGTGGCCTGC |
ACCTCCGCCT |
CATCCTCAGC |
ACCTCTGCAG |
CCCCACTTAC |
|
26961 |
AAACCCGAGG |
GAGCTGCTGC |
TGCTGCAGTG |
ATGTCTGTGC |
CATTAAAGTC |
ACGCTGGGAA |
CCTGCTAGAA |
CTTTGTAGTT |
|
27041 |
ACTTGGTCTT |
TGTGAGTGGC |
CACTGTTCCC |
CCTAGACCCC |
TGCAGCCTTA |
ACTGCACGTG |
TGCATGCGTG |
CTCCCCGACT |
|
27121 |
GTCTGCCAGG |
AGCCAGGGCC |
ATGGTCAGGC |
TTGGCCTGTT |
GCGCGTGTCT |
CCTGTGTGCT |
CATGGTGAGT |
TTTGTTCCAG |
|
27201 |
GAACAGATGG |
AGTCGGATAA |
CTGTTCAGGA |
GGAGATGATG |
ACTGGACCCA |
TCTGTCTTCA |
AAAGAAGTGG |
ACCCGTCTAC |
|
27281 |
AGGTGAACTC |
CAGTCCCTAC |
AGATGCCAGA |
ATCCGAAGGG |
CCAAGCTCTC |
TGGACCCCTC |
CCAGGAGGGA |
CCCACAGGGC |
|
27361 |
TGAAGGAAGC |
TGCCTTGTAC |
CCACATCTCC |
CGCCAGGCAA |
GTGAACCAAG |
AGGTTTTGTA |
CATATTCCTA |
CCTTTCCCTT |
|
27441 |
TAGAGCATCC |
TGCCCTCCTC |
TGATTTCAGC |
GACACAAACA |
GAAGGATGAG |
ATGTTCTCCA |
CTGCAGGGCT |
GTCTGTAGGT |
|
27521 |
GTGGGAGGTT |
AGGAGTTGGT |
TTTGTCCTAT |
TATGTGTACC |
CTGAGCAATA |
GCGAGTAAGC |
TCTGCTAATG |
CAGTTCTGAA |
|
27601 |
AATGTTTTTC |
TTTAGGCAAA |
CTCCAGAGCC |
AGGAATATTA |
ATTGTAGGAG |
TTTCTAAAAC |
TTAACATGCA |
GACCAGGAAT |
|
27681 |
TAAGGCAGGT |
GTGACACAGA |
GAGGGGGCAG |
CACTGGGTGT |
GTCCCTACTA |
CTCACACTCA |
AGTGCCGCCT |
CAGGGAGTGC |
|
27761 |
TGGTCTGAGA |
GGGGGGGGGG |
TCATAGCCAA |
GATCCCTGTA |
GGAGCCAGGC |
AGAAGCCACC |
GTTGAAGTGG |
ATGGTGTGGC |
|
27841 |
AGGCAGCAGT |
GGGGGGGTGG |
GGGGTGGGGA |
CAGCTGACAG |
AAACTCCTCA |
TTGTCACATG |
GCAGCCGTGT |
GTAAACCAAG |
|
27921 |
GTCCGTGTGC |
ACAACCTGCT |
GTGGGCCGCC |
GCTGTGTGTC |
TTCTCTGCCT |
TAGGGCAGGG |
GATGGCGAGC |
TATGGCCTAT |
|
28001 |
GGGCCAAGAG |
TAGTTCTTAT |
TTTTGTTTAT |
TTATTTATTT |
ATTTTTATTG |
ATCATTCTTG |
GGTGTTTCTC |
GCAGAGGGGG |
|
28081 |
ATTTGGCAGG |
GTCATAGGAC |
AATAGTGGAG |
GGAAGGTCAG |
CAGATAAACA |
AGTGAACAAA |
GGTCTCTGGT |
TTTCCTAGGC |
|
28161 |
AGAGGACCCT |
GCGGCCTTCC |
GCAGCGTTTG |
TGTCCCTGGG |
TACTTGAGAT |
TAGGGAGTGG |
TGATGACTCT |
TAAGGAGCAT |
|
28241 |
GCTGCCTTCA |
AGCATCTGTT |
TAACAAAGCA |
CATCTTGCAC |
CGCCCTTAAT |
CCATTTAACC |
CTGAGTGGAC |
ACAGCACATG |
|
28321 |
TTTCAGAGAG |
CACGGGGTTG |
GGGGTAAGGT |
CACAGATCAA |
CAGGATAAGA |
ATTTTTCTTA |
GTACAGAACA |
AAATGAAAAG |
|
28401 |
TCTCCCATGT |
CTACTTCTTT |
CCACACAGAC |
ACGGCAACCA |
TCCGATTTCT |
CAATCTTTTC |
CCCACCTTTC |
CCCGCTTTCT |
|
28481 |
ATTCCACAAA |
GCCGCCATTG |
TCATCATGGC |
CCGTTCTCAA |
TGAGCTGTTG |
GGTACACCTC |
CCAGACGGGG |
TGGTGGCCGG |
|
28561 |
GCAGAGGGGC |
TCCTCACTTC |
CCAGTAGGGG |
CGGCCGGGCA |
GAGGCGCCCC |
TCACATCCCG |
GACGGGGTGG |
CTGCCGGGCG |
|
28641 |
GAGGGTCTCC |
TCACTTCTCA |
GACGGGGCGG |
CCGGGCAGAG |
ACGCTCCTCA |
CCTCCCGGAT |
GGGGTCGCGG |
CCAGGCAGAG |
|
28721 |
GCGCTCCTCA |
CATCCCAGAC |
AGGGCGGCGG |
GGCAGAGGCG |
CTCCCCACAT |
CTCAGACGAT |
GGGTGGCCGG |
GCAGAGACGC |
|
28801 |
TCCTCACTTT |
CCAGACTGGG |
CAGCCAGGCA |
GAGGGGCTCC |
TCACATTCCA |
GACGATGGGC |
GGCCAGGCAG |
AGACGCTCCT |
|
28881 |
CACTTCCCAG |
ACGGGGTGGC |
GGCCGGGCAG |
AGGCTGCAAT |
CTCGGCACTT |
TGGGAGGCTA |
AGGCAGGCAG |
CTGGGAGGTG |
|
28961 |
GAGGTTGTAG |
CGAGCCGAGA |
TCACGCCACT |
GCACTCCAGC |
CTGGGCACCA |
TTGAGCACTG |
AGTGAACCAG |
ACTCCGTCTG |
|
29041 |
CAATCCCGGC |
ACCTCGGGAG |
GCCGAGGCTG |
GCGGATCACT |
CGTGGTTAGG |
AGCTGGAGAC |
CAGCCCGGCC |
AACACAGCGA |
|
29121 |
AACCCCGTCT |
CCACCAAAAA |
AACACGAAAA |
CCAGTCAGTC |
GTGGCGGCGC |
GCACCTGCAA |
TCGCAGGCAC |
TCGGCAGGCT |
|
29201 |
GAGGCAAGAG |
AATCAGGCAG |
GGAGGTTGCA |
GTGAGCCGAG |
ATGGCAGCAG |
TACAGTCCAG |
CTTTGGCTCG |
GCATCAGAGG |
|
29281 |
GAGACCGTGG |
AGAAAGAGGG |
AGAGGGAAAG |
CTATATCATT |
CTTATGTCTT |
TGCATCATCA |
TAGCTTAGCA |
TCTACTCATA |
|
29361 |
AGTGAGAACA |
TATGATATTT |
GGTATCCATT |
CCTGAATTAC |
TTTACTTAGG |
ATAATGGCCT |
CCAGCTCCGT |
CCAAGTTGCT |
|
29441 |
GCAAAAGGTA |
TTATTTCGTT |
CCTTTTTGTG |
GCTGAGTAGT |
ATTCCATGGT |
GTATATATAC |
CACATTTTCT |
TTATCCACTC |
|
29521 |
ATTGCTTGAT |
GGGCAGTTAG |
GTTGGTTCCA |
CATCTTTGCA |
ATTGTGAGTT |
GTGCTGCTCC |
AGATATCATC |
TTTAACTCCT |
|
29601 |
TTGCCTTCTC |
CACATACATT |
TCCAAGTCCT |
GTTCATTCTA |
CCTCCAAAAT |
GTATCTTGTA |
TCCATTCATC |
TCTCTCCATC |
|
29681 |
TTCAATCTAT |
TTCAATGCCC |
CATCATCTCT |
TGCATGGAGG |
AGTGTAATAA |
TTGGCTAACT |
GGCCTGTTCT |
TACATTTTAA |
|
29761 |
AATCAAAAGA |
TGTGACAGGT |
GAAATGCCTA |
TTTCAGTGTC |
CATTGATGGT |
TCTGCTTACA |
CACCACCTGG |
CTGCCTGGTG |
|
29841 |
TCGCAGTGGC |
AGAGTTGAGC |
AGTGTGAAAA |
AGACTGCTTG |
GCCCTTTACA |
GGGAAAGCAG |
GTCCACTGTG |
GCCTGTGAGG |
|
29921 |
ACGAGAGCTC |
TGGGCAGGCT |
CGGACACTGG |
CAGACCCTGG |
TCCTGGCTGG |
CCAAGGCAGC |
AGGGTATGTG |
TTTCGGGTCA |
|
30001 |
CTCACAGGGC |
TCAGCACCAC |
TCCTCATGGC |
TTCCTTACTG |
TTTCGGCAGA |
GGCTGACCCG |
CGGCTGATTG |
AGTCCCTCTC |
|
30081 |
CCAGATGCTG |
TCCATGGGCT |
TCTCTGATGA |
AGGCGGCTGG |
CTCACCAGGC |
TCCTGCAGAC |
CAAGAACTAT |
GACATCGGAG |
|
30161 |
CGGCTCTGGA |
CACCATCCAG |
TATTCAAAGC |
ATCCCCCGCC |
GTTGTGACCA |
CTTTTGCCCA |
CCTCTTCTGC |
GTGCCCCTCT |
|
30241 |
TCTGTCTCAT |
AGTTGTGTTA |
AGCTTGCGTA |
GAATTGCAGG |
TCTCTGTACG |
GGCCAGTTTC |
TCTGCCTTCT |
TCCAGGATCA |
|
30321 |
GGGGTTAGGG |
TGCAAGAAGC |
CATTTAGGGC |
AGCAAAACAA |
GTGACATGAA |
GGGAGGGTCC |
CTGTGTGTGT |
GTGTGCTGAT |
|
30401 |
GTTTCCTGGG |
TGCCCTGGCT |
CCTTGCAGCA |
GGGCTGGGCC |
TGCGAGACCC |
AAGGCTCACT |
GCAGCGCGCT |
CCTGACCCCT |
|
30481 |
CCCTGCAGGG |
GCTACGTTAG |
CAGCCCAGCA |
CATAGCTTGC |
CTAATGGCTT |
TCACTTTCTC |
TTTTGTTTTA |
AATGACTCAT |
|
30561 |
AGGTCCCTGA |
CATTTAGTTG |
ATTATTTTCT |
GCTACAGACC |
TGGTACACTC |
TGATTTTAGA |
TAAAGTAAGC |
CTAGGTGTTG |
|
30641 |
TCAGCAGGCA |
GGCTGGGGAG |
GCCAGTGTTG |
TGGGCTTCCT |
GCTGGGACTG |
AGAAGGCTCA |
CGAAGGGCAT |
CCGCAATGTT |
|
30721 |
GGTTTCACTG |
AGAGCTGCCT |
CCTGGTCTCT |
TCACCACTGT |
AGTTCTCTCA |
TTTCCAAACC |
ATCAGCTGCT |
TTTAAAATAA |
|
30801 |
GATCTCTTTG |
TAGCCATCCT |
GTTAAATTTG |
TAAACAATCT |
AATTAAATGG |
CATCAGCACT |
TTAACCAATG |
ACGTTTGCAT |
|
30881 |
AGAGAGAAAT |
GATTGACAGT |
AAGTTTATTG |
TTAATGGTTC |
TTACAGAGTA |
TCTTTAAAAG |
TGCCTTAGGG |
GAACCCTGTC |
|
30961 |
CCTCCTAACA |
AGTGTATCTC |
GATTAATAAC |
CTGCCAGTCC |
CAGATCACAC |
ATCATCATCG |
AAGTCTTCCC |
CAGTTATAAA |
|
31041 |
GAGGTCACAT |
AGTCGTGTGG |
GTCGAGGATT |
CTGTGCCTCC |
AGGACCAGGG |
GCCCACCCTC |
TGCCCAGGGA |
GTCCTTGCGT |
|
31121 |
CCCATGAGGT |
CTTCCCGCAA |
GGCCTCTCAG |
ACCCAGATGT |
GACGGGGTGT |
GTGGCCCGAG |
GAAGCTGGAC |
AGCGGCAGTG |
|
31201 |
GGCCTGCTGA |
GGCCTTCTCT |
TGAGGCCTGT |
GCTCTGGGGG |
TCCCTTGCTT |
AGCCTGTGCT |
GGACCAGCTG |
GCCTGGGGTC |
|
31281 |
CCTCTGAAGA |
GACCTTGGCT |
GCTCACTGTC |
CACATGTGAA |
CTTTTTCTAG |
GTGGCAGGAC |
AAATTGCGCC |
CATTTAGAGG |
|
31361 |
ATGTGGCTGT |
AACCTGCTGG |
ATGGGACTCC |
ATAGCTCCTT |
CCCAGGACCC |
CTCAGCTCCC |
CGGCACTGCA |
GTCTGCAGAG |
|
31441 |
TTCTCCTGGA |
GGCAGGGGCT |
GCTGCCTTGT |
TTCACCTTCC |
ATGTCAGGCC |
AGCCTGTCCC |
TGAAAGAGAA |
GATGGCCATG |
|
31521 |
CCCTCCATGT |
GTAAGAACAA |
TGCCAGGGCC |
CAGGAGGACC |
GCCTGCCCTG |
CCTGGGCCTT |
GGCTGGGCCT |
CTGGTTCTGA |
|
31601 |
CACTTTCTGC |
TGGAAGCTGT |
CAGGCTGGGA |
CAGGCTTTGA |
TTTTGAGGGT |
TAGCAAGACA |
AAGCAAATAA |
ATGCCTTCCA |
|
31681 |
CCTCACCGCA |
A |
|
>ref|Gene_ID:8878|SQSTM1|NC_000005.9:179233387...179265077 (+)
TGCGTCGGCTTCCGGCCGCCTTCCGCGGCCACCGCCGGGCCCGCTCCCGCCGCCGACGCCCAGGTGCGCCAGGTGCGGGC
CGGGCGGGGGTCGCGCTCACCTTTCTGGCCGCTGAGTGCCGCGTACCAGGACAGCGAGAGGAAGGCGCACAGGCAGAAGA
GCAGCAGCGTCAGGAAGGTGCCATTGCGGAGCCTCATCTCCTCGGGTGCGCGGCGGGCGCCCGCGGGGCCGAGGCTGCAT
GGCCCGGGGGACCGGGGCCGGGGCGCAGGGGTCGGAAGGCGGCGGCGGCGGCGGCAGGGGCCCCGGCCCCGGGTCGGGGA
GGGGCGGGGGGCCCGGGGCCGGGCGGGGACCGGGCCAGGGAGCGCGCCGGCCGCCCCTCAGGGCGCAAGCTTTGTGCCCT
GTACTCAGGGAAGAGGAACAGGCTCAGAAGGGCAGAGGCAGGTATCAGGCTCACTGCAGATATCAGGGGCGCGGGACACT
GGCGGCCTCGCCTCCGCGGCAGGGCCGGGCCGGGCCGGGCTGGGCTGGGCTGGGCGGCGAGAGCCGCGGCCCGGCCTGGA
TCTGGGGCCTGGATCTGGGGCCGCCGCGGAGTCGACGGCGCAGGGCGGGGCGGCCCGGATTTAAAGGGGCCGCAGCACCG
CCGTCGCCGGCGCCGCGAGGGGGTGGGGTGGGGGCCGGCGGCCGGGATCCCGATCGGCTCCCGCAGCCCCGCGTGGGCTC
GTGCGAGTCGGCCTCAGGTAAGGCTGGAGTGGGAGTGCAGGTCGACCGCAGCCGGGGCGGGGGGCGGGCGGCGGGGGCGG
CGCTCCGGGACCCCGGTACCCTTCTCAGCAACATCTCCTCCGCGCGCGGACCCCGACCCCATTCGCACGTTCTCGGGGCT
CTTTCCCGGGCGTGAGGGGCTCTGGGTGGCGCGGGGGACTGGGCGGTTGAAGCCGGGAGCGACCGCTGCCTTCGCTGCCG
CCCAGGGCGCTTCCCCGCTCAGGAGCTTCCTCTGGGCCTCTGGACGGAGGCGCGCAGGGGCCCGGGAGGCGGGAACGATG
GGCCTTTCTAGGCGGTATCAGGGCCGATGCTCGTGGATGCAGAGCAGTGACCAGCCCAGAACCGTACCGGCTTCCCGGGG
CAGGGGCGGCCCGAGAGCGCCGCTAACGGGAGCAACGGGCCCTGCCCCCGGTGGTGTAGGGGCCACCTCGCCCCGCCCAG
CCCAGCCCGATATTGATGGGGGCCCACGCCTTCATTGTTTTTTTGTTGTTGTTGTTTTTTGTTGTTTTTTGTTTGTTTGT
TTTTGAGACGGAGTCTCGCTCTGTTGCCCAGGCTGCAGTGCAGTGGCGCAATCTCGGCTCACTGCACCCTCCGCCTCCCG
GGTTCAAGCGATTCCCCTGCCCCAGCCTCCCGAGTAGCTGGGACTACAGGCGGCCGCCACCACACCTGGCTAATTTTTGT
ATTTTTAGTAGACACGAGGTTTCACCGTATTGGCCAGGCTGGTCTCGAACTCCTGACCTTGTGATCCGCCCGCCTCGGCC
TCACAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCACGGCCCATTTGCGTTCTAATTTCAGGCGTTCCTCAATCCTC
TGTGGAACCAGGAAGGGCGCAAATTATAAAGGGACTGGCCCAGCGCCGCCTCTGGGCAGGCTTTGGGCAGCGCCTGGCCC
GCTGCCGGCTTGGACCTCCCAGACCTAGGGGCCCGGTTCCTGGTGGAGGCTGCAGGGACCTCTGCCCCACCCGCCCGGGG
GAGGCCCGAGGGGCTGGACTCAGACTGAGATTGAATGCGGCTTTGTCTTCCTAGTTCAGCCCCGGCCCACACCTGGGGCT
GAGTGGAATCGGGAGCTTCGAGGGGTCTGGACAGAGAGATTGATGCCAAGAAGGGGGTGGCCGAGCCAGAGGTTGAAGTG
GGCTGGATCCTGAGGCCCCCTGTTAAAGGAGAGGGCTCCCCACTCAGTGCTCCTGGAACTTTCCGAACTAGAGACTGGGA
CTTATAGGAGCCTTCTAGAGGAAACTGTCGTGTTTTCACAGGTGTCTTCTATTTGTGGCAAAGGATAATGGCTTTTCACT
TAGGTTGTGACATAAAGGGCCTTAGAAATTGTTAATGAGTTACTTAATGTTAAAACTTAGTCACCCAGAGGCCGGGCGCG
GTGGCTCATGCCTGTAATCCCAGCACTTTGTGAGGCCGAGGCAGGCGAATCACGAGGTCAGGAGATCGAGACCATCCTGG
CTAACACGGTGAAACCCCGTCTCTACTAAAAAATACAAAAAATTAGCCGGGTGTGGTGGCTCATGCCTGTAATCCCAGCA
CTCTGAGAGGCTAAGGCAGGAGGATCATTTGAGCTCATCAGTTCAAGACCAGCCTGGGCAACATAGTGACACCTCATCTC
ATTAAAAATTTTAAAACAAATTTTTTTGGATTTTTTTGTAATTTTATTTATTTATTTATTTATTTTTAGTTTATCTTATT
TTTTTTTTTGAGACGGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCACTATCTGGGCTCACTGCAAGCTCCGCC
TCCCACCTTCACCCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCACCCACCACCACGCTCGGCTGATT
TTTTGTATTTTTATTAGAGACGGGGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCTACCTGCC
TCGGCCTCCCAAAATGCTGGGATTACAGGCGTGAGTCACCACCCCGGCCTTTTTTTTTTTTTTTTTTTTTAAGACGGAGT
CTCGGTCTGTCGCCCAGGTTGGAGTGCAGTGGCACCATCTCGGCTCACTGCAACCTCAGCCTCCCAGGTTCAAGCGATTC
TCCTGCCTCAGCCTCCCGAGTAGCTGGGATTATAGGCGCCCGCCACCACGCCTGGCTAATTTTTTTTTTTTTTTTTTTTT
TTTTTTTTGAGACAGAGTCTCGCTATGTCGCCCAGGCTGGAGTGCGATGGCAGAATCTCGGCTTACTGCAACTTCCACCT
CCTGGATACAAGCAATTCTGCTGCCTCATCCTCCTGAGTAGCTGGGATTACAGGTGCACGGCACCAAGCCCGGCTAATTT
TTTTGTATTTTTAGTAGAGATAGGGTGTCACCATGTTGGTCAGGCTGGTCTCAAACTCCTGACCTCGTGATCCACCTGCT
TCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGCGCCTGGCCCTTTTTTTTTTTTTTTTTTTTTTTTGAAAC
GGAGTCTTGCTCCCTGGCCCAGGCTGGAGTGCAGTTGAGTGATCTGGGCTCACTGCAACCTCCGCTTCCCGGGTTCAAGC
GATTCTCCTGCCTCAGCCACCTGAGTAGCTGAGATTACAGGCGTGTGCTACCACACCCGGCTAATTTTTATATTTTTAGT
AGAGATGGGGTTTCACCATGTTGGTCAGGCTGGTTTCGAACTCCTGACCTCAGGTGATCCACCTGCCTCAGCCTCCCAAA
GTGCTGGGATTACAGGTGTGAGCCACCGGCGGCGCCCAGCCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATG
TTGTCCAGGCTGGTCTCCAACTCCTGACCTCAGGTGATCTGCTCACTTTGGCCCCTCAGAGTGCTGGGATTACAGGCGTG
AGCCAACGCACCGGGCCAACAATTTTTTTTTAATTTAAATTTAAATTTTTATTTTTTAATTTTTTATTTTACTTTAAGTT
CTAGGGTACATGTGCACAATTTGCAGGTTTGTTACATATGTATACATGTGCCCGGTTGGTATGCTGCACCCATTAACTCA
TCATTTACATTAGATATATCTCCTAATGCTATCCCTCCCCTCTTCCCCCACCCCACGACAGGCCCCAGTGTGTGACGTTC
CCCACCCTGTGTCCAAGTGTTCTCATTGTTCAATTCCCACCTATGAGTGAGAACATGCGGTGTTTGGTTTTCTGTCCTTG
TGATAGTTTGCTCAGAATGGTTTCCAGCTTCATCCGTGTCCCTACAAAGGACATGAACTCATTCTTTTTTATGGCTGCAT
AGTATTCCATGGTGTATATGTGCCACATTTTCTTAATCCAGTCTATCATTGATGGACATTTGGGTTGGTTCCAAGTCTTT
GCTATTGTGAATAGTGCCGCAATAAACATACGTGTGCATGTGTCTTTATAGCAGCATGATTTATAATCCTTTGGGTATAT
ACCCAGTAATGGGATTGCTGGGTCAAATGGCATTTCTAGTTCTAGATCCCTGAGGAATCGCTACACTGACTTCCACAATG
GTTGAACTAGTTTACAGTCCCACCAACAGTGTAAAAGCATTCCTATTTCTCCACATCCTCTCCAGCACCTGTTGTTTCCT
GGGTTTTTAATGATTGGCATTCTAACTGGTGTGAGATGCTATCTCAATGTGGTTTTGATTTGCATTTCTCTGATGGCCAG
TGATGCTGAGCATTTTTTCATGTGTCTGCGGGCCAACAATTTTTTTAAGGAAAAAAAAAAAGTGACTCAGTTGGGCACTG
TGGCCTACGCCTGTAATCCAAGTGAGACAGCCAAGTCTAAAGGGGTCCCGCTTTGGGAGGCCGAGACGGGCGGATCACGA
GGTCAGGAGACCGAGACCATCTTAGCTAACACGGTGAAACCCCGTCTCTACTAAAAATTAGCCGGGAGTGGTGGCGGGCA
CCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATAGCTTCAACCTGTGAGGCGGAGCTTTCAGTGAGCCGAGAT
CGCACCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTCCGTCTCAAAAAAAAAAAGAGGTCCAGGAGAAACTCCCACA
CCTGCCTAAGCACTGGAAGAACTGGGTAGAGCCACAGAAGCTCTGCAGGGGGGAGGAGCTTTGCAGGGGGAGGAGCTATG
CAGGGGGAGGAGCTATGCAGGGGGAGGAGCATGCAGGGGGAGGAGCTATGCAGGGGGAGGAGCTATGCAGGGGGGAGGAG
CTTGGTCTCATGTTCGGGGTGGAACTTGGGATTCTATCTGGGAGGCGAGAAACCAGCTAGCGGGACTCTCTCTCGCTTTG
CTGAGAGTCCCTGTTTCCCTTTTTTTCCTTCTTGCCCAATAAATTCCATTTTTCTCACTCTTCAAAGTGTCTGCGAGATT
AATCTCTCATGGCCGCTGCACAAGAACCTGGCTTTTAGCTGAACTAAGGAGAAAGTCCTACAACAGTTTGGCGTGCAACA
TGGGGCTTGAGAAAGGGTGAGTGAGATGCAAACCAAGAAATTTTTTTCCTCTCTTTCTAAGCCTATTTATCTTCGGACTT
CTGAGGGGGAGGGGAGGGGAAACCGTGACCCCACCCCCTTGGTCTCCGTGGCCTTTTCCTTACTTCTGGACGGATGGGCG
AACGGCGGTTCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGCCATCTCGGCTCACTGCAAGCTCCACCTCCCGGGTTCACA
CCATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGGCGCCCGCCACCACGCCCGGCTGATTTTTTGTATTTTTA
GTAGAGACGGGGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCCTGACTTCATGATCCGCCCGCCTCGGCCTCCCAAA
GTGCTGGGATTACAGGCGTGAGCCACCGCGCCCACCCCTCCATGAAGAAAGTTTTTCTAAATGAAAAATGTTTAGGACGC
TCAGGAGAGAAAGAACAGATTAAGGAATGATCTCCACCGCACAGACCTCAAGGCTGTTATGCATGCAGGGCACAGTTCCA
GTGCAAATGTCCGCAGGCACGGCATGAGGGCCCCCACTGGGTGCCTCGGGCTCCTTCCAGGGCAGCAGTTGAAGCGGGTC
GTACTGCAGGCCTGCAGAAAGGCTGGGGTTCCTCTCCTTTTCGGGTTCCTTCCTGATGGAATCCTAGGTTTTCAGCGGTT
CCTTCCTGATGGAATCCTAGGTTTTCAGCAGCTTCCGGAGGCCGCCTGGACTGCAGAAGGCGCCCTGGGCGTGTGTACTT
CCACCTCCGCAGGCGCTGGCATCTCCAGGCTGCTTCAGGCCGCTTTGGACCTCACCATCGGTAGCATCTGCGTCTTCTTG
AGGTGACTGGAGCCCGTCTATGGTTTCAGCCATTGACTTCTGGACATGGAAGTGTCGAGGCCATTTTTGGCCCCTCTCAT
ATCCGGGAAGGGCTCCGTGCCCAGTAGAGGTGCAGTGCAGCAGCCAGCTCGCAGCAGTGCTCCCGTAGAGCTGATGGCAT
TGGCAGGAGTGGCGGCTCAGCACAGGGAAGTTCTCCAGTGCCCAGGCCCAGGCTTGCTCTCGGTTTAGGTCCCATGGCAC
GTGGGGCTCTGGACCTGCCTGCCTCCAAGACAGCTCTTCCAAGGTTGAGGGTTGGAACTCCACAGCCTCAGCCTCCCAAA
GCTTGACCAGGCCCAGCCATAGCTAGTCTAGGAAAAGAGCAAATGGCCGTGTTTTTATATTTTGACTTTTGCATTTCAAT
ATTAATTTTTTTGGTAATGTAATGCATTCAAAACATGGTCCCCCCCCCCCCAAAAAAAAAAGAAAAGAAAAAGAAAAAAG
CATGTTTCTAACCAGGGGTCCTTCATCTTCTGCAGATGCCAAGGGTGTCCCTGGCACAGTAAAAGTTTGGAAGCCCTGCC
CTAAAGCCTCACCTCACCTGGCAGCTGGGTGTTTCATTCATAGCAAATGTTCTGTATATTCACCATGTACCAAACACACA
TGTCACTAATGATACAAACACATGTGCACTGACATTGACCTCCTCACTGTCTTACTATAGTCCCATCCACAGCCACCTTA
TCAGAGTCCATGGGACTCTGACAACTTGTAATGGGAAAATTAACCGTGCCTGTAGGGGCCGCCTGGGTAGCGGGAGGTGG
TTCATGAGTACCAGGAAAGCTGGCCCTGGGGCAGACCTGGGCAGGGCAGAGCACAGCTTGCAGGACCTAGTACACAGTGT
TGGGAATTGAATTTGGGTTTTGACCTTTGAAGCTGTGCATAAAGTCTAAAACAAGAAGAGGAGTGGAGGCTGAGGTTGTA
TTTGTATTTTAAAATGAGCATTTGCCACACAAAGCCAGGAGCTTCAGACCAGCGTGGTTAACACAGCGAGACCCTCCTCT
CTGTGAGAAATTAAAAAAAAAACAATAATAAAAGCCAGGCATGTTGGCATGTACCTGTAGTCCTAGCTGCTCAGGGGGTG
AGGCGGGAGGATCACTTGACTCCTGGGTTACAGTGAGCTCTAATTGCCACTGCACTTCAGCCTGGGCCACAGAGTGAGAC
CTTTTCTCAAAATAAAATAAAATATAACCTTTTGGGCAGGCATGGAAAGGGGCTGGAGAGAATAAGAGGAAACAGAGATA
AGCCCCCTCCCTGGGAGCACAAGGGACTCATCCCCAGACATGGCACCACAGGCAAACAGACACTGTCACAAAACTAGCAT
ATTATCTTTCCTAGAAGTCCTTTGTTTTCCCCAAGTGCCCTTTCCCCCAACCTTTTGTTGGTTTACGAGCTCTCAATTCT
AACCTCCTAGTACACGAATAAAAATGTCTATCGGGGTGGTGGCGTGAGCCTGTAGTCCCAGGTACTTGAGAGGCAGAGGT
GGGAGGATCCTCTGACCATAAGAGTTCCTGACCAGCCTGTGCAATACTGACTCCATCTCAAACAAAAAACAGAAAGCTAT
ACAACTCCGTTTCTCTTGCTAATTTGTCTTTTGTCAGTTTAATTTGCAGGCCCCAGATGCTGAATCTAAGAGGGCAGAGG
AAAAGCTTTTCCTCCCTGACATAACCATCCAGGAGAAAGTCTGGAATGGAGGGTCAGAGGGGTAGGGAATGGAGTTAGGA
GAAGTAGATAGATTCAGGACAGTCTTTGGAGAATGTACATCACTGAGGTCAAGATGTCCTAGGGAATGGCCCTTCTGGGT
TCACAATTCTGCTTCCGTCGGTTAACAGCAACTTAGTCCCTTCTATGCCTGTGGGGATAATCAGTCAGGATAGGCTGGAT
TATGCTGCTCTAACAACCCCAAATTTCTCTGATCTAATAAAAGTATACTCAAAGCTGTATACCCTTCAAGGGTCCCCTGG
GGTCTCTGCTCATTGTAGTTCCTGCCCTGGCTACAGGCATAACCATGTGGAAATCAAAGGCTTTCTTGGCAAAGAGAAAG
AGAGCTCTAGAAGCTTTAGCACTGGCAAATAAATAAATGCTTGGCGTATAGGTGATATATGGCTCTTCTGGCTCAGGTCC
TTGGCCAGAATTAATCACATGGTCTCACCTCAAGGGAACAGGTGAGTTAAATGCAGAAAAATATTCAAGCTGGGTGAGGT
GGCTCACGCCTCTAATCCCAGCACTTTAGGAGGCCGAGGCGGGAGGATCGCTTGAGCCCAAGAAGATCATGGTTGCCGTG
ATCCATGATCATACCACGGCACTCCAGACTGGGGACCAAAGTGAGACCTTGTCTCTAAAAATTAAAATATTTAGAATGTT
TCCTAGCATAAAAGATGTGCTACGTGTGTATTATTACTGTTGCTATTGCCAGAAAGTACAATCAACAGGAACTGCTGGTG
GCTTGAGGGGGAGCATATGGTCATAGCTGACTCCCAGGTTTCTGGCTTGTATTGTGACTGTTTCTGGCCTGTATTGTGAC
CATCACTGCCATCAGGAACCCTAGAGCAGGTCGGGTGTGGTGGCTCACGCCTGTAATCCCAGTACTTTGGGAGGCTGAGG
TGGGTGGATCACTTGAGGCCAAGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAA
ATTAGCCAGATATGGTGGTGGGCGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCACAAGAATTGCTTGAATCCGAGAG
GTGGAGGTTGCAGTGAGCAGAGATCAAGCCACTGTACCCCAGCCTGGGCAACAGAGCAAGACTCTGTCTCGAAAAAAAAA
AAGAAAAGAAAACCTGGAGCAGGCATGGGGTATACGTCATGGGTCACTACTCTTGGATACAATTGACGAAAACCACAATC
AACTGGCTTAAGCCGTTTTATAACCCAGCATTAGGCCAGCCTGGATCCAGGGGTCAGATACTGAGATCAAGCTTTGATCT
GCTCAGGATTCCCCCCTCCCCAAGGCTCCTGCTTCCAGATGACCCCAGTGTGCATTTCGTCCGCCAGCGTGGGTCACATG
TCCATCCCTGACCAGGGTGCTACAGTGCTCTGGTTGGCCAAGCTTTGGCCATGGGGTTTACCTCACCCTACTTTTACGCA
AGAGTATGTGGTGGCCTAACACAGGTGGGGGACCCTGCTTCAATACCAGCCTTTATCCACAGATGACCCCACGCCTTCTG
CTCCAGAGCATCAGGGTTATAACACATGGGACACACGGGGCACACATACACTCCCACGCAACATCCACACACCCCACAAA
ACTACCTGAGCCCACATAGTCAGGGCAGGGACAGGCACAGAACAAAGTGGGTGCACAGGGACTCGCAGGGAGGTCGAAGA
GCTGAGTCTCGTCGGGAGGCTGAGGGGGAAGAGCGGGCACTGGGCTGCGAGTGTGCAGAGCTGGAGGTGGGGGACTGCGG
TAAGTACCCGTCCTGGGGTGCGGGCCCTGCTGGGAAGCATTGGGAGCAAGAAGGGTTAGATCCTCAAAGGAGGATAATTC
TTCCAGAAGGTTTTCTTTGTTTTGTTTTGTTTTGTTGAGGCACCAAGATTGGAGTGCAGTGGCGCCATGTCAGCTCACTG
CAACCTCCGCCTCCGAGGCTCAAGCAATTCTCTTGCCTCAGCCTCCCGACTAGCTGGGACTACAGGCACCTGCCGCCACG
CCCAACTAATTTTTTTAATTTTTAGTACAGATGGGGTTTCACTCTGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCGTG
ATCCGCCCACCTCGGCCTCCCAGTGCTGGGACTACAGGTGTGAGCCACCGCGCCTGGCCTCTTCCAGAAGGTTTTCGATG
GCAGCAGGTGGAGGGGAGCGAGAGGGCCATCGCTCCTGCTGCAAGGGCAGCCCCTCCACGGGATCCCACACTCCTGACCC
GTTCAGTATAGACGGGGTACTCCCGGTGCGGCCTGTCACCGCTCTGTCTGGCGCTGGCGCTCCTAGAGAAGGCCATGCCG
ACCTCTCCCCCACCTCGCTCCAGATTCCCCTTCTTGAGCTGCTTTTCCCACAGTCCCCCCAGGCGCCACTGAGAGCCTCC
CGTCTTCGATGGGCTTGGGGCGCCTGGACCCTTCCGGGGTCACTTTGCCGTCTTCCTCCAGCCCCCCTCCCTACTCCTTG
TCGCCGCCCCCCCGTCTTTTCAGGACCTTGCGCACTCAGGCTGGCCCTATGCTTTGGGGTACCCCAGCAGTGCCCCCGGG
TCTCCGGCCACCCGGTGACGGAGGGAGGGAGAGGCTGGCGAGGGGGCGGCGCCTTAGGTCCCAGCGCGACCCGCCTCTCA
GCCCCTCCCGCCGGCTACAGCCGCGCGGGACGGGGCAGGGACCTCGAGAACACCGGGGACTTGCGGGCCGGTGGCCCCGG
GCAGGCCACTTCTGACGCGGGTGCGAGGCCTTCCGCGGGCGTGCACGGTTGGGCCGGAGCCGCGCCGGGGGCCGGGACAC
GCCTGGGGTCAGCGCTGGCGTCGCCGCTCGGGTGGGGTGGGCCGGGTTCGGGTCCAGTCTCCTCCCCACGGGCGGCCCGC
GCCCCGTCTTCCAGTGACCCCCACATCCCTTGCGCCCCGAGCTGGAACTTCGGCAGGTTCTGCCCCGGGCCGCGCGCGCT
TTCTGGGGACCAGGGCTGTGGTTCCTGCTCCGCGTCGTGCCTTTAGACACCGCCGGGCGTTCGTCCTCCATCCCTCCCCC
CTTTTCCCCACTCCGCGGTTGACGAGGGCCGCCCCCAAGTGGCCATGTCCCCCGGCATGGACGACCCCTCAGTGCCCCTG
AGGGCCCAGGGTCTCGCCCCCGTCCCGAGGCAGGCCATGTTTGAATTTATTTGGAGAAATTCGCCCAGCCCAGGGGACCT
CTTTCTGCAGCTCTGGCCTTCCAGTAGGAGAGGGTCCCTGGGGCTCTCAGGCCTGACAACTTGGCATCACTGAATTCTCT
AACCATGTCTGTCCATGACTTGAACTCCTTTTCTTTTAGAAAGATCTTTGCAAACCCCTTGTGGGCGTGCCCCTCCCATG
GCCCCCGTGCCTCTTGCCTGGGCTTTGGGCTTGTCTGAAAGGGCTAAAAGGGGGTGATGCCTGGCCGGGCGGGTGGCTCA
CGCCTGCAGTCCCAGCACTTTGGGAGGCCGAGGTGGGCGGATCACCTGAAGTCAGGAGTTCAAGACCATCCTGGCCAACA
TAGTGAAACCCATCTCTACTAAAAATACAAAAATTAGCAAGGCTTGGTGGCAGGCACCTGTAGTCCCAGCTACTTGGGAG
GCTGAGGCAGGAGCAGGAGAATTGCTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCTGAGATCGTGCCACTGCACTTCAG
CCTGGGTGACAGAGAGAGACTGTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTGATGTCCAGTGCCTGGGGATTTTT
CCTAGGGAGTGGTCAAGGAAGGCCCCGCTGAGGAGGAGACATTTAGGGTGGGAAGTTGCCATGTAGGGCAGGAGTGGAAA
GGGTTTTGGACCAGAGGGAGAGCCAGGGAAAAGTCACCAGGTAGAATGAGCTGGCATGATTAGGAAGATGGCAAGGGAGG
GCTGTGGCCCTCCCATGTGAGTGGAAGGGAGTGGCTGCAGCGCAGAAGGCCTGGAGGCCCCCGCACAGACCAGGACCTCT
GTAGAGGAACTCATCCCCTGTGTCCCCTGTGATTGTCAATCTCCCTAAAGATGGCCCAGAGCAGTGCGGCCTGAATCCCT
CAGTGGCCTTGTCTCTGGCTGCCCCAGCACCCACGCGGACACTTGACTCCACGCCCCAAAGAAAAGAAGCGAACCTGGGA
ACACCACTGCCAAGGCATAGGATTATTTGGGAGGGGGGAAGGGGGAACTGAGGAGGTGGTTACAACATGCTGGGCAGCAA
CAGACAGAACCAGACCACCCCTGTGGCTGCCCCGGCTGGTCTCCTGGGACCACTGGGCGCTTGGCTCAGGCCTCTCCTGC
CCTCCCCACTCTTCCACCTCAGCCCCCCTGCAGTCTCTGTCCTGCCCTGACCCCCCAGCTTACAGCCCCAAAAGCAGCTA
AACAACTCGCACACCCACGGGGCATCGCCTGGGAGGGCAGCCCAAATCCTTCCACTTCAGCCCCATTTGAAAGATGAGGA
AATGAGAGGCTGGTCCATAGTGGGTCTCGCGGCTCAGGTGCCGAGCCTTCCCTGGGCATGCACGCCGGGCCCTGGAGGGT
GGGAAGGGGCCAGTGGACGCGGGGAGCCTGCGGGGTGGGACTGCATCGGGAAAGGGGAAGGAGTCAGAGGCGAGAAGGGG
GAGAGTGTCTGTCTGGCTCCAGCCGCTGCACGCTCTTCCTGCTCAGGGGACTCACGGTGACCCCGGAGCCACTCCCCAGC
CCAGCCTCCAGGTAAGAGGTCACTGAGATGGGTGGCAGCAGGGGCCGGGGATCCCCCTATTACGACAGCGGTCATGGGAC
GCTGACTCACTGCCGGCCAGACCACCTGACCTCCGCGGCGGGAGGAGAGGGCCCTGCCAGGGGGTTCCCGCCGCGCCTTG
TTTACCTCCGGGAGGCCTCGGCCTCGCGTGGGGGCAGGGCGGCCGCTGGGGCCGCAAGGCGTGCGGGGAAGGGCCAGAGC
CGGGTGTCCACCCCAGCTTCCCAAAGACTCCCTCTTCTGTGCTTCCTTCTCCCCTCCCCGCCCCCCCCCCAGTCTCTTCA
CATGCCCCTAGCCCCCGCGGAAACTTCCCGCGATCCCAAACGGGCCAAATGGCGAGAAAGCAAAGGAGCTCCTTCTTGGG
GGTGAGTGGGGCGCCTTGAGCGCTTCCTCAAAGCTATGTTCCCAGAGCCACAGGCCTTCCTTGTGTCCCTCACCCTGCTC
AGACCGGGCCATAGCCGGGGGCTGGGGCAGGAAAGCCGGCCCTCGGCGGGGGCCACGTGGCTCTCAGGCGCCTGGGCTGC
TGAGTCACGCTTGGCCAGCACCTGTCTGTAGGCCACAGCCTCTGCCAGCACGCCCCTCTGTGTCCCCTGCCCCTGTCTGC
AAGGCAGTGGCTCCAGCAGGCCCTGGGGCATTTTCCACTCTCCACCGCCGGATGCAGGGAGAGGCCTGAACCCTCTCCAC
AGGGCTGCTCTGGGCAGGGTGGAAGCCTTGCCCACTTCGGAGCCCTCCGGGAAGGATCATTCACACCTGTGGACCAGCCC
CTGCTGTGCGCACACCCACACATCACCTTCGCACCTGACTGGCCCCATCCAGCCACTCTTGCCTCCCTCTGGGTTTCCTC
CCCTGGGAGGTTTCTCCAGCTCCTGCAAGCCCTGGGCTGAAATGGCATGAGTTGGACCCAGCAGGTTCTGACCTCCTACT
CACAGGACCTTGCCTGGGAGGCTCCAGAGGGTGACCACTCGTCCTGCCCCTCTCCTTGCCCCAGTTCTGGCGGACAGGTT
ACTCTGGTGGCATAAAGCAGTGTTTCTTCCTTCCTAGCTGAGGAGGCTGTTGGCTGACCCCCTTGGCTGCCCACAAGGCC
AACGGGCCTGAGCCCCCACAGGGCCATGGGCATTACCTGCTGAATTGAGGAGCCCATAAGGAGTCACTTGGACCACAGTG
AACACTTGGCGACCACTGACACTCAGGAGACCTTAGCTGGTCCTCCAGCACCTCTCAACTCCACTCCTACTAAACTGGGA
ACTTCTCTGGTGCTCAGGCCAGAGTCGGGGTCCGTCACCGAGTATGCTATGCGCTGCCCATCACCGAGGATGCCATGCGC
TGTAAGAGGGCTGCCACCGCGGCAGGCTGACCATGGCAGGGTCGGAACAGCAACCTGAGAGCCAGCTTGTTCTGGCCAGC
AGTGCCCACTGGGCGACCTAGCAGCCTCCTGATATGGGGGCTGTGTCCCCCTCTCCCTGCACTGGGTACCCCCAACTGAG
GATATTGCTGAGTCATGGCCAGGCCCAAGCCTGGGAGGGGCGAGGGGCTGGACCCCCGCCAGTACCCTGATCCCAGGTGC
AGAGGCTGGAGCCCAGGCCTGTATGAGTGCCAGGGCCGGTTTCCTGGGGTCCTTGGTGCACCGGGGCAATGAAGAGAGGG
GTCAGCAATGAGGGGGCCGGGAGACCTGGAGCGAGGGGTAGCGGGGAAGGGGAGAGTAGTGAAGGGGCCTCTGCAGGGCG
GCTCTCGCGCCGCGACGACGGTGGCGGGGGCGGGGAGGGCGCGAGAGACTCCGCCCCTCTCGAGGCGGGGCGGGGCCTCC
GCGTTCGCTACAAAAGCCGCGCGGCGGCTGCGACCGGGACGGCCCGTTTTCCGCCAGCTCGCCGCTCGCTATGGCGTCGC
TCACCGTGAAGGCCTACCTTCTGGGCAAGGAGGACGCGGCGCGCGAGATTCGCCGCTTCAGCTTCTGCTGCAGCCCCGAG
CCTGAGGCGGAAGCCGAGGCTGCGGCGGGTCCGGGACCCTGCGAGCGGCTGCTGAGCCGGGTGGCCGCCCTGTTCCCCGC
GCTGCGGCCTGGCGGCTTCCAGGCGCACTACCGCGGTGAGCGGGCCGGGGAGCGGCGGGGGCGGTGACGCAGGCCGGACA
CGGCCTCCTGCCGCGGGGTGGCTGCCCCCTCCCTTCTCGGCGACGCCTGGCGGGCCGTGAGGGGGTCTGCGCTGGCTGCT
CCCTGGATGGCGGTGGCCTGCATGGGTCCCCAGTTCGGCCATGGGAGCCGGCCTGGTGACTGGAGTGGTGACCAAGGCCG
GGACCCGCTGCTCAGCGTCGGCCCCCTGGGGCGGTGGAGCCCTGCCGGCCGGGGGCTCGAGCCTGGGGGCGTCAGACGCC
CCGCTCCACCCCCCGCGCTGTTGGGGATTTTGGCAAGGACGCGCCGGGGCGAACGCTCTGGCTCTCCGCGGGCACTGGGT
GGTCAGGCGGGCACTCGGGTTACACTGACACCTTGCTGCGCCAGGTGGTGGGTTCAGATAATGCCCTGGAGGAGCCGGGC
GAGCGCCGGCGAGGGGAGGGAGTGACGCGGGTAAACAAGCGCGGGGGTGCGGGGGACTCGCGAGCGCCGCGACAGCGCCT
GGGAGAAGGGCACGGATCGCCGGCGGAACGCTCCGAGCCAGGTCGAGTACAGATGTTTTCCCATTGGCAAGTGGACGAGA
ACGTTCTTCAGAAGTGTTGGTGTTGGCACAGAAGCCCTGTTTCCTGCTGCGCTGGTGTGACCAGTGGCTGCTGGGGGTGG
GGTTAGGGAGGTGGTTGTGGCCAGGAGGGCGGGAGGTGGCCAAGGCCGGCCCCTGGGAGGGTGCAGTGCTAGGACCTCCC
TCTGGAGCGCTGCCAGCATACCAGGCCCTCTCCTATTCTTAAAAAAAAAAAAATTGTGATGTTATTGAGCTGTAACTGAA
ATAAGGGTTCACCCATTTAGTGTACAAGTCAGTGGTTTTCACTATTTTCATAGGTTTGTGGACCATATTCAGTGTGAGAG
CTTTTCATCACCTTATAAACGCCATACATACCTTTTATCACCCTCTTATCCACATGCCCCCAGCAACCTCCTTTCTGTAT
CTATTCATCTTGCAATCCTGGACATTTCATGTAAATAGAATCAGACAATGTAGTCTTGCAAGTGGTTTCTTTCACTTAGT
GTAATGTGTTCAGTGTTGTGGCACATATCAGAACTAGTTTTTTTTGGAGGAATAATATTCTGTTGTGTGGACAGGCCATG
TTTTGTTTACCTGTTCATTAGTTGACAGACATTTGGGTTGTTTGCATCTTTGGGTGCCTCACCTGTGCCCTCGGTGGATG
GCGTGGTTTGGTCTGCACAGCTGTGCTTTAAAGCCATTTCAGCTCATATGTATCTGTCACAAATGGAGACTACATACAAA
TACGTGTCTTTCAGCTTTGTTAGGGAAGGAAGCAGGCAGATGCCCTTGTCTTCTCCTTTACTCCCAAGCCACTAGAAGTC
TAAGGCCTTTGTGGGGCTGGGCCCCTCAGGAGCAGGTCACAGGCAGGGATCTCCCACAGTTGAAGACGGACATGGGCCTG
GTCGGATGGGGAATGAGGTGGCACGTTTCTGAAAAGGCAGCTGGCCCAAGGCTAAATAAGTGCAGCAGCCAAAGCTGCTG
CCCCTGTGCACAGGCCCTCCGCCCCCTGCATGTGGCTGTGGTCCAGGGCCCAGGTCGGAGCACTCACCTTCCAGGAGGTG
CCAGAGCAAGGGGGTAGTCTTGCCTCTCACTCCTGCCCTCTGTGGCTCAAGTAGGTGTGTTTGTTTATAGCCCTGTGAGT
GTCCCTTTCATACTTGCCTCAGCCCATTCCAGCAGCTTATGTCCAGCTGAGAACCCCTGGGTGCTCACGTGCTGTCTTTT
AAACAATCTAGATGAGGACGGGGACTTGGTTGCCTTTTCCAGTGACGAGGAATTGACAATGGCCATGTCCTACGTGAAGG
ATGACATCTTCCGAATCTACATTAAAGGTAAGGGGCTGCTCTGGGGGCTGCCTGAAGCCAGCTCAGCTTGTACTCAGTTC
CCTGCTGAGTAAAAAACAGGGCTCGATGTTCCACCAATGAAGGGGTCAGCAATTTGAGGGCTGTTTAAGACAGAGACATA
GGCCAGGTGTGGCTCACGCCTGTAATTCCAGCACTTTGGGAGGCCGAGGCGGGCAGGTCACCTGAGGTCAGGAGTTTAAG
AACAGCCTGGCCAACATGGTGAAACCTTGTCGCTACTAAAAATACAAAAATTAGCCGGGTGTGGTGGTACATGCTTCTAG
TCCCAGCTACTCAGGAGGCTGAGACGAGAATCACTTAAACCTGGAGAGCGGAGGTTGCCATGAGCCGAAATCACATGACT
GTACTCCAGCCTAGGCGACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGACATTTTAAGTGCTGTGCCTGACCT
GAGAAAGAGGAGTCCATGTTCACTCTAGGGATGGGGTCTGGTGCCGTGGTGCTTGGGTTAGGGATGGGGTCTGGTGCCGT
GGCGCTTGGGTGAAGGGCAAGGCCAAAGCTGTGCAGACAGGGCTCCTTGCTGCTGCTCTGCTGCCTGGGCCAAACAGACA
CAGGGACTGGGAACCTCCTAGCAGGTTCTTGGTGGCTCTGCTGCCCTCACCTAAGTGGCTGAATTTTGTGTGGATTCCAT
GCTGGAGAGCAGGGCCGGGGGCCTTGCTGGCAGTGACAGCCCCACAGTGACGACAGAGGGGGAGGACTTTAGGGGGTCCC
ACCCTAGCGGCTCTCTTTACCCTTCCTGTAGAGAAAAAAGAGTGCCGGCGGGACCACCGCCCACCGTGTGCTCAGGAGGC
GCCCCGCAACATGGTGCACCCCAATGTGATCTGCGATGGCTGCAATGGGCCTGTGGTAGGAACCCGCTACAAGTGCAGCG
TCTGCCCAGACTACGACTTGTGTAGCGTCTGCGAGGGAAAGGGCTTGCACCGGGGGCACACCAAGCTCGCATTCCCCAGC
CCCTTCGGGCACCTGTCTGAGGTGAGCAGGCCCTCTGTGCAGGCCTGGGGTGGGCTCAGGGTGGCAGGAACCTTGACCCG
CTCACTGCCTGCCGCTCTGCTAATTCCTCCCCCAGGGCTTCTCGCACAGCCGCTGGCTCCGGAAGGTGAAACACGGACAC
TTCGGGTGGCCAGGATGGGAAATGGGTCCACCAGGAAACTGGAGCCCACGTCCTCCTCGTGCAGGGGAGGCCCGCCCTGG
CCCCACGGCAGAATCAGGTGAGGCTTGTGTTGGAACCTGCTTCTGATTGGTGACAGTAGTCAGGCAGCCTGTGTGCAGGG
CCCTTGTGCAAAGCGTGTGTGCAAGGCAAGAATTCAGGATACCCCCCACCTTCCTGGTGCCCTACAATCACACAAGAACC
CTGCAAAGTGGGGTGTATTCTCTCCATTTCCCAAATGGGGAAACTGAGGTGCCTAAGTGCCTGAGGCCACAAATTTACCT
GCACAGCCCTTCCTACCTCAGGAGGCTGCCCTCTCAAGGTACCCTGAGGTGCAGGCAGGGAGGCCCTTCCAGCCCAGGGG
TCTTTGATGCACTTTGTTCTCTTTTGTGATGGTTGTCAGGAAGATCAGAGCCAAAGTTGCTGAAGTCCTTTGAAACATAG
TTATAAGTGAAAGACTTACCGTTAGCTTTGTAGTCTAGATTTTTGGATTCTGATTTTATTAATGTTACTGTGTCCTGTGA
ATCACCTCACCCTTTGGGACAAAGATGGGGATGTTTGCTTGACTTTGAGTAAATAACATATTTACTCAAGGAGGTGATCA
ATATTCACAGTGTACTGAGCCTGAGCCTCTGTGGGGTTGCTGAGCACCAGGGTCACAGATGAGGGGGAGATGGCACGGGA
GAGTGGAGATGCTCTCTGTGCAGGGCCAGGGGGTGCAGAGTGGGAGGAAGGAGAGGGGGATGCTGAGTGGGTCACTGGAC
AAGATGTCCGGGTTAAAGGTCACCCGGGAACACAGGGACCTTGGCAAGAAGGTGACAGGACTGTGACAGGTATCCAAGGC
ATTAAAGATATCTTTATCTTATCTTTGTAAAAATCAAAGCTTCTGGTCCATCGGAGGATCCGAGTGTGAATTTCCTGAAG
AACGTTGGGGAGAGTGTGGCAGCTGCCCTTAGCCCTCTGGGTGAGTGCACCTCCTTGCCCAGTGCTTCCCTAACTCAGCC
TGCACTTTATGTAACTTTCACCTGGAATACTGCAAAGGAATGGGTAATTGACATGCCCTTGACACTGGTGAGGATTTGTT
GCCTCAAATCAACCTTTAGTAGTGCTGGACCACGGGCAACTCAAGGTTGAATTCCCTGACAAAATTCTCGAGCTTTCTAC
ATGGAGTGAAGTCGAATCAGCTGGCATTTGGTTGGGATAATCCCAGTGAGGGAGGGTGACCAGACTGTGGTTGCAGGTCT
TGCCTGTCACCTCTGACTGCCCTCCTTTGGATGAACAATAGTCTGCAATTTCCCCTAAGCCAGGCCTTGGGATTTGGACT
TTTGAGGACCTGGGTAGTAGTTAGTTGACATGCTATTTATCTTTTCCTTCTTTTAGTTTACAACCCCCAAACCTTTCTGG
TGCTTCTGAGTGCTGGGCCTGCCTCAGAGCGGGGAGAAGGTGGAGGCAGGTGCTCTGGGTGACACCCGGTTCTGGTGCCT
CAGGTTGCACAGTGGGTCTGTGGGGTGGGTGAAGGTGCCACCTGACCCCAGGCATGAGCTCAGGCTCTGAGACCCCTCCC
TGCTGGGCTTTAGAAAGTGCCTGTTGCGTCTGGCTAAGGGTCTCGGGTGGGTGCTGCAGGGTTAGGCATGCGGCAGTGAG
AATCAGTGAATGAAGCCAACTTCTAGTTCAAGATGACGGTGGATCTGTGATGGCAGATGGATCGCTGGGACACCTGCATC
TCTCGCTAGACGAGTGAGCTTGTTTTTGTGACTGCAGTGGCAGGCTGTGTCCACCCAGCCTGTCCAAAGGTTGTGTTCAG
TGTCTGAAAGAATGCTGCTAGGAGGGGCGTCCGAACCATGCCTGCTGCCTCCTGTGGCTACAGGGCGTGACTGCCATCTG
CTCCAACTTTGCTCATTCCGATTTTTTTTTTTTTTTGAGATGGAGTCTTGCTCTGTCACGGAGGCTGGAGTGCAGTGGTG
CCATCTCTGCTCACTGCAACCTCCCCCTCCTGGGTTCAAGTGATTCTCCTGCTTCAGCCTCCTGGGTAAGTGGGATTACA
GGTACCCGCCACCACGCCCAGCTAAGTTTTGTATTTTTTTAGTAGAGACGGGGTTTCGCCACGTTGGCCAGGCTGGTCTT
GAACTTCTGACCTCAGGTGATCCACCCGTCTTGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAACCACTGCGCCTGGCC
TTGCTCATTCCACTTTGAGGCGGCAGTGACATCTTGGCATTGCTTCTGAACTAAACAGCCCAAACACGCATGGCTTCTGT
ACTAGAGTTTAGAGGTGAAGTCAGAGAATAGGAAAGAAGAATCGCTAGTCTGTTTTTTTTTTTTTTTCTTTGAGACAGAC
TTTCACCCTTGTCGCCTAGGCTGGAGTGCAGTGGTGTGATCTCGGCTCACTGCAACCTCTGCCTTCCAGGTTCAAGCGAT
TCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTATAGGTGCCCACCACAACGCCTAGCTATTTTTTGTATTTTTAGTAGA
GACGGGGTTTCACTGTGTTGGCCAGGCTGATCTCGACACCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGC
CACCGAGCCCGGCCAAGAATCAGTATTCTTAATGCTTTCCAAGGAGGGTAAACAGTGGAACACCGAGGTCATGATTGAGA
GATCGGCTGTGCCTAAATCCTGCCACTCACTCTGGACGTAGTAAAGGTCTGAGAGATTGCCGGTGAATTTGGAGATATGA
ACAAGAGTCCTTCACTCAAAGAACAAAGGAGACATCCCGGGAAGTCACAAAACCAAGTCTAGGTTCAGGTAGAGACTTTT
AAAACAGTGCAGATCCCCAGACCTTACCTTTTTTGATCCTGGTGAGTGAGTGTTGGTGGGGCCTGGGATTTGTCTGGGAG
GCTGCCCAGGTGCTTGGGGTCACACTGTCTCAAGACCCACCTGTGCTGGTGCAGCATCTCGGAACTGAACTGGACGCCCC
ACTTGGCACAGACAGGAATAATCTTGCAGATGCTCAGATGTCTTTTTTTTTCTGAGACGGAGTCTCGCTCTGTTGCCCAG
GCTGGAGTGCAGTGGCACGATCTTGGCTCCTGGGTTCACGCCATTCTCCTGTCTCAGCCTCCCGAGTAGCTGGGACCACA
GGCGCCCACCACCACGCCCGGCTAGTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCTTGTTGGCCAGGATGGTCTCG
ATATCCTGACCTCGTGATCCACCCGCCTCAGCCTCCCCAAGTGCTGGGATTACAGGCGTGAGCCACTGCACCCGGCCCTC
AGATGTCTTTGAGCACTAAAAGCTAAAGGCTTAAAGTGGATAGAATTGCTTTCTCCGAACTAGGGAGGAGCCTGAAGCTC
TACAAACAAGGAATATTTCAGTAAAATAGACTGAGTGGTTTCAGCTGAGGAGAGCCAGTGGAGGAGCGGCTTAGACTGTT
GTGGAAGTTAGGGCTAGTTTCTTTCTTTATGAGGAACTATAAGCTGGATGACTCACAGGTTAACTGCCTAGAAGGGACTG
TTCCCAGAGCAGGGCCCAAAGGGGAACGAACGAGTCAACAGCTGCCTCCCGGTGGCAGTGTGTGTAGGGTAAGCGGCAGC
TTTCATGTGAAGTGCCAAGGCCTTTTGGTTGGGGGGCTGAGAACCTGCTGAGGGAGCCACAACTGAGACCCAGGCTGCTC
CTGGCTGGGAGGCTACAGGCCCCGAAGCCACAGGGCCCCTCCTGGTCGTGGGATCTTTGATAAACCGACACGCAGAGGCT
TTGTGAGGCAGCAGGGCACGGAGCATCGTCTAGTCTTTTTTTTAAGTGAAAGCACATTTCTTAAGAAAGTAAAGGAATAC
AAGAATGGCTACTCCGTAGACAAAGTAGCCTGTCCAGTCTTACGACAAGGCCTGAAAATCTCACGCTTTCATCACCAAGT
TGGAGAGATCCTAATTCTATAATTTCTCTGATCTGCTTACAGGCGAGTCTCATTACCCATTTACAAAGCATATGGGTATT
GATTGTACTATTCTTTCAACTTTTCAACTTTTCTGTTTGTTTCAACCTTTTTTTTTTCTTATCTTTTTTTTTTTTTTTTT
TTTTGAGACAGACTTTTGCTCTTGTTGTCCAGGCTGGAGTGCAATGGTGTGATCTTGGCTCACTACAACCTCTGCCTCCC
GGGTTCAAGCGATTCTCCTGCCTCAGCCTTTCCCAGTAGCTGGGATTACAGGCATGCACCATCACGCCTGGCTAATTTTG
TATTTTTAGTAGAGACGAGGTTTCTCTGTGTTGGTCAGGCTGGTCTCGAACTCCCGACCTCAGGTGATCTGCCCGCCTTG
GCCTCCCAAAATGCTGGGATTACAGGCGTGAGCCACTGTACCCGGCAGCTTCAACCTTTTCATAATCACAAGTTGGGAAG
AATAAAGGAGAAAAATAATTTAAATATTACAGATGGAGGTACTTGTAGTTAAGCCTGCTTTTACCTCTGCTCCCCCCTTA
AACTCTATTGAAATAACAGGAAAGGGCTTACACAAAAAAGAAAACACCATAGACAAAGAAAGTAATAGAGGAAAAGACAA
AGATGTGATCAGATTTTGGAAGATGGACATGGAGGGAGGGTGGCAACCTGCTGAGGGAAGCTTCAGAAATCTCTGTGCCA
GAACAGGGCATCCTGGTGAGAAGGAAGACCATCTGTTCAGGTACCAGGAAAGGCAAGAGGTGGGAGTGATCGGGGGACAG
GAACAAGGGGGGAGGGTTGAAAACCTCTAGGGAGTGTCAGCTAGAGACTTCAGGGCCCACTTGGCCAGATGATGGCTCTT
GCCAACAATTGAAGGAGACCTCATGTTTAATTCCAGGAGCACCTGACCCAGGGAAGCTTCAGCATGGGATACTAGAGACA
GGCCAGGGTGAGGCACCAGCCTGAAACTGGGCATCCCGAATGAGCAATGGGCCCAGCAGTCAGTTCCCTGGCCAGGCAGG
AGGTGGGGAGTCCCTGCTGAGGGAGGTTGAATGCCCCCAGAGAAGTGTCCTACAGCTCCTGCCTGCCAGACCCCCAGGCA
CCCTGCAGAGGGAAAGTTAGTGATGGCGCTGGCCCCACAGAAGGCTTCCTGTCAACCTTAGTGCTTCAGTGGGATCAGAC
CCCTGGGTCACCAGCCATCTGAGGAAAGCCCCCACACATGGGAGAAAAGGTCTTAAAGGGAAGAGAATGTTAATGTCCTC
AGTGAGATACAAAAGAGCATTGTCTGCATGAAAAAGGAACAGGATACTAGAAAAAGGATAACAAAGGGAACAGAATAAGA
AAGTTCTTAAAATTAAAAAATAGAAAAAAAACTTCAAAAGAAGGCTTAGATTCTACTATTGTCTCAAGGAGATATTCCAA
AAGCAAAATAGAGATAGTTTTTTTATCTTTCCTCTCCTCCTAATGTAGGAACAAGTCTAGAAGCCCAATATAGGATGAGT
AGGAATTCCAGAAAGAATAGAAAACTGAAAGGAGGAAATGAACATAGAATGCAAGATAGTTTCTTGGCACCAGGCATGGT
GGCTCATGCCTGTAATCCCAACACTTTGGGAGACCAAGGCAGGCAGATGACTTGAGCTCACGAGTTTGAGACCAGCCTTG
GCAACATAGGGAGACACCATCCCCCACTTCCCGTGTCTACAAACACTATGAAAATTAGCCAGGCATGGAGTTGTGTACTT
ATGGCCCCAGCTACGTGGGAGGCTGAGGTTGGGGATGGCTTGAGCCTGGAAGGCAGAGGGTGCAACGAGCCTGGATTGTA
CCACTGCACTCCATCCTGGGAGACAGAAACAGACCTTGTCTCAAACAAAACAAAACAAAACAAAACAAAAAACAAAAATG
CAAGATAGTTTCTTGGCAATGAGGCCATTATCTTCTAGATTGAAAGCGGCCCTGACAATGACTGAGAAAGACCCCACAGC
AAAGCTTATCATTGTCAAGTGTCAGAACCCTGAGGATAAAGAAAGAGGGGCTTCTAAAACTCTAGGTTATTGTAAATACT
TGGGAATACAAAGAGCAAACAAACATGAACAGCTAGGAGACAATGGAACAACTGATGACTCCAGAAATCAGTTAAAAAGG
CTTTTCGGCCTAGAATTCTTTTGTTTTTGAAACCGGGTCTCACTGTCATCTGGGCTGGAGAGCAGTGGTACAGTTGTCTT
CATCTCGTGGGCTCAAGTGATCCTTCTGCTTCAGCCTCCCCAGTAGCTGGGACTACAGGCATGTGCCACTGTGCCTAGAT
ATTTTTTTTTTTCTTTTTCTTTCCCCACCCCCGAGTCAGAGTCTCGCTGTCGCCCAGGCTGGAGTGCAGCGGCGTGATCT
CAGCTCACTGCAGTTTCCACCTCCTGGGTTCAAGCAATTTTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACCC
GCCACCATGCCTGGCTAATTTTTATATTTTTAGTAGAGATGGGGTTTCACCATCTTGGCCAGGCTGGTCTTGAACTCCTG
ACCTCATGATCTACCTGCCTTCGTCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGTGCCCAGCCCCGTGCCCAGC
TATTTTAAAATTTTTTTGTAGAGATGGGGTCTTGCCATGCTGCCCAGGCCGATTTTGAACTCCTGGGCTCAAGAGATTTC
TCATGTTGTTCTCCCAAAGTGTTAGGATTACATGCATGAGTCACTGTGCCTGGTCCAGCCTAGAATTCTATACCCAGCTA
AAACGATCAAGTATCAGGGCAGAACAAAGTTACTTTCTGGTATACAGTCTTAATTATTTTAACCTTGGTTTGAACCACAT
GAAGAGAGAAACCCAAAATGAAGCACAGAGGTATTAGGAAACGGGGGACCTAACAGGACAGAGGGTGAACGTAAAGGGGG
TCACAGCATCACAGCGCAGCCTCTGCCCAGAGAGCTGCCAGTCCTGGCTGGAACAGGAGGCTGCACAGCAAGCGGCACTC
TCTATGGAAAACACCAGAACCAGTGGGGTCTCTGATCCACTTCACCCTTGTAGAAAACTATGGAGATGCTGTGGGAGAAC
ATAGGGAAAGTTAGCAAATGCAAAGAGAAGGCAGGCTGGGTGCAGTGGCTCATGCCTGTAATCCAAGCACTTTGGGAGGC
CAAGGCAGGCAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATA
TAAAAATTAGCTGGGCGTGGTGGCACGTGCCTATAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCT
GGTGGGCAGAGGCTGCAGTGAGCCGAGAGATTGCGGCACTGCGCTCCAGCCTGGGCAACAGAGCGAGACTCTGTCTACAA
AACAAAGGAAAAGAGAAGCCATTATCAGTGGTAGGAAAACAAAAAGCGATACAAAGAAATAGTCTGCTACTTGACCCAGC
AAAGAACATTTAGATAGAGGTCCTAGCTGAAACATTGAATGTTCTTTTTTTTTTTTGAGATGGAGTCTTGCTCTGTCACC
AGGCTGGAGTGCAGTGGCGCCAACTTGGCTCACTGCAACCTCCGCCTCCCAGGTTCAAGCGATTCTTCTGCCTCGGCCTC
CCGAGTAACTGAGACTACAGGCGATTGCCACCACGCCCAGCTGATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCAT
ATTAGCCAGGGTAGTCTCGATCTCCTGATCTCGTGATCCGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGA
GCCACTGCGCTCGGCCTACTGAATGTTCTTTTAACAAAAAATTGCGTCTTGGAAAGGGTAGTGGAAGGAGAGGCAGGTAC
TGGTCCAGAGTAGGAAGTGCGCAGATGTGAACTCCAGCAGCAGGCAAGAGCAGCGTGAGCTGAAAGGGCACAGGGAGAGT
TTGGGGAGTGGGATGGGGGCAGGGAGTGTGCACCACAGCCTTTGGCAACTGGTGACTCTAAGCTGTGGGTGTGGAGACTT
TTGACACATTTAATTTCAAAAATAGTGCAGGTGATTCTCCCCTCTGTCCCTGGTGAGTCACGTGAGCCTGGAGGTGTGCA
GGTTCCCTGTGGGGCCAGCGTTGGGCCCTACGCAGCGCCTGCACGGCCCATCCACCTCCCAGAGCAGAGCTCCGGCGTTG
AGGTGTGCGTGTGTGTCGTCGCACAAAGCCTCTGTGTGCAGGTGTCAGGAGGCACATGGCCTTCCATCACTGTGGCTGGA
AGCGCCTGCCACATGTGGTATTGGCTCTCCCTGTACTTAGGGCCTGGCCCCACCTGCCGTGTGGCCCGTGTCCTGCATGG
TTAGGAAAAAGGTCTCTTTCGACTTTCTGGCTCAGAAGCTGACTTTAGATGCTAGCTGAGGTTAATGTTTTACTGAATTG
GAGAAAGAGAAAGGTCCAGTATCATGGGCCCACCAAGAATGTTTCCAGAAGCCACAGAGATTTGTAGCTGGGATATGGGG
AGAAGCCTCTGACTCATGGGTTTGTATCGTCTGGTCCCATACTGGCTGTGTGATTGCGGGGTCGAGCTGGGTAGAACCTA
GCACCTGCATCCACACGCTGAGTGCCACCATCCAGACACTTAGCTGCTTGTGGGGACTGAACGTTGAGATCTTCGTGAGT
CTGTAGTCTCCACAGGCCAAGCTCCTGCTTGCAGGTGCATCCTTGGGGGAACTTCACGGCTTGCTCTTTCCTCCTCCGCC
TCTAGGCATTGAAGTTGATATCGATGTGGAGCACGGAGGGAAAAGAAGCCGCCTGACCCCCGTCTCTCCAGAGAGTTCCA
GCACAGAGGAGAAGAGCAGCTCACAGCCAAGCAGCTGCTGCTCTGACCCCAGCAAGCCGGGTGGGAATGTTGAGGGCGCC
ACGCAGTCTCTGGCGGAGCAGATGAGGAAGATCGCCTTGGAGTCCGAGGGGCGCCCTGAGGCAAGCCTGTGCCCCTCCCG
CCACCTGGGACCACGGCCAGCCTAGTGATCTGTGGCCTGCACCTCCGCCTCATCCTCAGCACCTCTGCAGCCCCACTTAC
AAACCCGAGGGAGCTGCTGCTGCTGCAGTGATGTCTGTGCCATTAAAGTCACGCTGGGAACCTGCTAGAACTTTGTAGTT
ACTTGGTCTTTGTGAGTGGCCACTGTTCCCCCTAGACCCCTGCAGCCTTAACTGCACGTGTGCATGCGTGCTCCCCGACT
GTCTGCCAGGAGCCAGGGCCATGGTCAGGCTTGGCCTGTTGCGCGTGTCTCCTGTGTGCTCATGGTGAGTTTTGTTCCAG
GAACAGATGGAGTCGGATAACTGTTCAGGAGGAGATGATGACTGGACCCATCTGTCTTCAAAAGAAGTGGACCCGTCTAC
AGGTGAACTCCAGTCCCTACAGATGCCAGAATCCGAAGGGCCAAGCTCTCTGGACCCCTCCCAGGAGGGACCCACAGGGC
TGAAGGAAGCTGCCTTGTACCCACATCTCCCGCCAGGCAAGTGAACCAAGAGGTTTTGTACATATTCCTACCTTTCCCTT
TAGAGCATCCTGCCCTCCTCTGATTTCAGCGACACAAACAGAAGGATGAGATGTTCTCCACTGCAGGGCTGTCTGTAGGT
GTGGGAGGTTAGGAGTTGGTTTTGTCCTATTATGTGTACCCTGAGCAATAGCGAGTAAGCTCTGCTAATGCAGTTCTGAA
AATGTTTTTCTTTAGGCAAACTCCAGAGCCAGGAATATTAATTGTAGGAGTTTCTAAAACTTAACATGCAGACCAGGAAT
TAAGGCAGGTGTGACACAGAGAGGGGGCAGCACTGGGTGTGTCCCTACTACTCACACTCAAGTGCCGCCTCAGGGAGTGC
TGGTCTGAGAGGGGGGGGGGTCATAGCCAAGATCCCTGTAGGAGCCAGGCAGAAGCCACCGTTGAAGTGGATGGTGTGGC
AGGCAGCAGTGGGGGGGTGGGGGGTGGGGACAGCTGACAGAAACTCCTCATTGTCACATGGCAGCCGTGTGTAAACCAAG
GTCCGTGTGCACAACCTGCTGTGGGCCGCCGCTGTGTGTCTTCTCTGCCTTAGGGCAGGGGATGGCGAGCTATGGCCTAT
GGGCCAAGAGTAGTTCTTATTTTTGTTTATTTATTTATTTATTTTTATTGATCATTCTTGGGTGTTTCTCGCAGAGGGGG
ATTTGGCAGGGTCATAGGACAATAGTGGAGGGAAGGTCAGCAGATAAACAAGTGAACAAAGGTCTCTGGTTTTCCTAGGC
AGAGGACCCTGCGGCCTTCCGCAGCGTTTGTGTCCCTGGGTACTTGAGATTAGGGAGTGGTGATGACTCTTAAGGAGCAT
GCTGCCTTCAAGCATCTGTTTAACAAAGCACATCTTGCACCGCCCTTAATCCATTTAACCCTGAGTGGACACAGCACATG
TTTCAGAGAGCACGGGGTTGGGGGTAAGGTCACAGATCAACAGGATAAGAATTTTTCTTAGTACAGAACAAAATGAAAAG
TCTCCCATGTCTACTTCTTTCCACACAGACACGGCAACCATCCGATTTCTCAATCTTTTCCCCACCTTTCCCCGCTTTCT
ATTCCACAAAGCCGCCATTGTCATCATGGCCCGTTCTCAATGAGCTGTTGGGTACACCTCCCAGACGGGGTGGTGGCCGG
GCAGAGGGGCTCCTCACTTCCCAGTAGGGGCGGCCGGGCAGAGGCGCCCCTCACATCCCGGACGGGGTGGCTGCCGGGCG
GAGGGTCTCCTCACTTCTCAGACGGGGCGGCCGGGCAGAGACGCTCCTCACCTCCCGGATGGGGTCGCGGCCAGGCAGAG
GCGCTCCTCACATCCCAGACAGGGCGGCGGGGCAGAGGCGCTCCCCACATCTCAGACGATGGGTGGCCGGGCAGAGACGC
TCCTCACTTTCCAGACTGGGCAGCCAGGCAGAGGGGCTCCTCACATTCCAGACGATGGGCGGCCAGGCAGAGACGCTCCT
CACTTCCCAGACGGGGTGGCGGCCGGGCAGAGGCTGCAATCTCGGCACTTTGGGAGGCTAAGGCAGGCAGCTGGGAGGTG
GAGGTTGTAGCGAGCCGAGATCACGCCACTGCACTCCAGCCTGGGCACCATTGAGCACTGAGTGAACCAGACTCCGTCTG
CAATCCCGGCACCTCGGGAGGCCGAGGCTGGCGGATCACTCGTGGTTAGGAGCTGGAGACCAGCCCGGCCAACACAGCGA
AACCCCGTCTCCACCAAAAAAACACGAAAACCAGTCAGTCGTGGCGGCGCGCACCTGCAATCGCAGGCACTCGGCAGGCT
GAGGCAAGAGAATCAGGCAGGGAGGTTGCAGTGAGCCGAGATGGCAGCAGTACAGTCCAGCTTTGGCTCGGCATCAGAGG
GAGACCGTGGAGAAAGAGGGAGAGGGAAAGCTATATCATTCTTATGTCTTTGCATCATCATAGCTTAGCATCTACTCATA
AGTGAGAACATATGATATTTGGTATCCATTCCTGAATTACTTTACTTAGGATAATGGCCTCCAGCTCCGTCCAAGTTGCT
GCAAAAGGTATTATTTCGTTCCTTTTTGTGGCTGAGTAGTATTCCATGGTGTATATATACCACATTTTCTTTATCCACTC
ATTGCTTGATGGGCAGTTAGGTTGGTTCCACATCTTTGCAATTGTGAGTTGTGCTGCTCCAGATATCATCTTTAACTCCT
TTGCCTTCTCCACATACATTTCCAAGTCCTGTTCATTCTACCTCCAAAATGTATCTTGTATCCATTCATCTCTCTCCATC
TTCAATCTATTTCAATGCCCCATCATCTCTTGCATGGAGGAGTGTAATAATTGGCTAACTGGCCTGTTCTTACATTTTAA
AATCAAAAGATGTGACAGGTGAAATGCCTATTTCAGTGTCCATTGATGGTTCTGCTTACACACCACCTGGCTGCCTGGTG
TCGCAGTGGCAGAGTTGAGCAGTGTGAAAAAGACTGCTTGGCCCTTTACAGGGAAAGCAGGTCCACTGTGGCCTGTGAGG
ACGAGAGCTCTGGGCAGGCTCGGACACTGGCAGACCCTGGTCCTGGCTGGCCAAGGCAGCAGGGTATGTGTTTCGGGTCA
CTCACAGGGCTCAGCACCACTCCTCATGGCTTCCTTACTGTTTCGGCAGAGGCTGACCCGCGGCTGATTGAGTCCCTCTC
CCAGATGCTGTCCATGGGCTTCTCTGATGAAGGCGGCTGGCTCACCAGGCTCCTGCAGACCAAGAACTATGACATCGGAG
CGGCTCTGGACACCATCCAGTATTCAAAGCATCCCCCGCCGTTGTGACCACTTTTGCCCACCTCTTCTGCGTGCCCCTCT
TCTGTCTCATAGTTGTGTTAAGCTTGCGTAGAATTGCAGGTCTCTGTACGGGCCAGTTTCTCTGCCTTCTTCCAGGATCA
GGGGTTAGGGTGCAAGAAGCCATTTAGGGCAGCAAAACAAGTGACATGAAGGGAGGGTCCCTGTGTGTGTGTGTGCTGAT
GTTTCCTGGGTGCCCTGGCTCCTTGCAGCAGGGCTGGGCCTGCGAGACCCAAGGCTCACTGCAGCGCGCTCCTGACCCCT
CCCTGCAGGGGCTACGTTAGCAGCCCAGCACATAGCTTGCCTAATGGCTTTCACTTTCTCTTTTGTTTTAAATGACTCAT
AGGTCCCTGACATTTAGTTGATTATTTTCTGCTACAGACCTGGTACACTCTGATTTTAGATAAAGTAAGCCTAGGTGTTG
TCAGCAGGCAGGCTGGGGAGGCCAGTGTTGTGGGCTTCCTGCTGGGACTGAGAAGGCTCACGAAGGGCATCCGCAATGTT
GGTTTCACTGAGAGCTGCCTCCTGGTCTCTTCACCACTGTAGTTCTCTCATTTCCAAACCATCAGCTGCTTTTAAAATAA
GATCTCTTTGTAGCCATCCTGTTAAATTTGTAAACAATCTAATTAAATGGCATCAGCACTTTAACCAATGACGTTTGCAT
AGAGAGAAATGATTGACAGTAAGTTTATTGTTAATGGTTCTTACAGAGTATCTTTAAAAGTGCCTTAGGGGAACCCTGTC
CCTCCTAACAAGTGTATCTCGATTAATAACCTGCCAGTCCCAGATCACACATCATCATCGAAGTCTTCCCCAGTTATAAA
GAGGTCACATAGTCGTGTGGGTCGAGGATTCTGTGCCTCCAGGACCAGGGGCCCACCCTCTGCCCAGGGAGTCCTTGCGT
CCCATGAGGTCTTCCCGCAAGGCCTCTCAGACCCAGATGTGACGGGGTGTGTGGCCCGAGGAAGCTGGACAGCGGCAGTG
GGCCTGCTGAGGCCTTCTCTTGAGGCCTGTGCTCTGGGGGTCCCTTGCTTAGCCTGTGCTGGACCAGCTGGCCTGGGGTC
CCTCTGAAGAGACCTTGGCTGCTCACTGTCCACATGTGAACTTTTTCTAGGTGGCAGGACAAATTGCGCCCATTTAGAGG
ATGTGGCTGTAACCTGCTGGATGGGACTCCATAGCTCCTTCCCAGGACCCCTCAGCTCCCCGGCACTGCAGTCTGCAGAG
TTCTCCTGGAGGCAGGGGCTGCTGCCTTGTTTCACCTTCCATGTCAGGCCAGCCTGTCCCTGAAAGAGAAGATGGCCATG
CCCTCCATGTGTAAGAACAATGCCAGGGCCCAGGAGGACCGCCTGCCCTGCCTGGGCCTTGGCTGGGCCTCTGGTTCTGA
CACTTTCTGCTGGAAGCTGTCAGGCTGGGACAGGCTTTGATTTTGAGGGTTAGCAAGACAAAGCAAATAAATGCCTTCCA
CCTCACCGCAA
|
Primary source | RefSeq : NM_003900.4 (GI:188497651)
Name | Sequestosome 1 (SQSTM1), transcript variant 1
Type of transcript | mRNA
Organism | Homo sapiens
Length | 2923 nt
Map | 5q35
Location | Chromosome 5 (NC_000005.9) strand : +
179247841...179248140 | 179249957...179250052 | 179250857...179251086 | 179251181...179251322 |
179252145...179252225 | 179260031...179260245 | 179260586...179260781 | 179263435...179265077
|
Exon(s) | Start | End | Length | is coding
Exon 1 | 1 | 300 | 300 | 1
Exon 2 | 301 | 396 | 96 | 1
Exon 3 | 397 | 626 | 230 | 1
Exon 4 | 627 | 768 | 142 | 1
Exon 5 | 769 | 849 | 81 | 1
Exon 6 | 850 | 1064 | 215 | 1
Exon 7 | 1065 | 1260 | 196 | 1
Exon 8 | 1261 | 2903 | 1643 | 1
|
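As a cross-check on the exon table for NM_003900.4, the Length, Start and End columns can be recomputed from the genomic intervals in the Location field. The following is a minimal sketch, not part of the source record: the helper name `exon_table` is hypothetical, and it assumes a plus-strand transcript with 1-based inclusive coordinates, as the record lists them.

```python
# Genomic exon intervals for NM_003900.4 (1-based, inclusive, + strand),
# copied from the Location field of the transcript record.
EXONS_V1 = [
    (179247841, 179248140),
    (179249957, 179250052),
    (179250857, 179251086),
    (179251181, 179251322),
    (179252145, 179252225),
    (179260031, 179260245),
    (179260586, 179260781),
    (179263435, 179265077),
]


def exon_table(exons):
    """Return (start, end, length) in transcript coordinates for each exon.

    Hypothetical helper: assumes exons are sorted and lie on the + strand.
    """
    rows = []
    offset = 0  # transcript bases consumed by the preceding exons
    for g_start, g_end in exons:
        length = g_end - g_start + 1  # inclusive interval
        rows.append((offset + 1, offset + length, length))
        offset += length
    return rows


if __name__ == "__main__":
    for number, (start, end, length) in enumerate(exon_table(EXONS_V1), 1):
        print(f"Exon {number} | {start} | {end} | {length}")
```

The spliced exons total 2903 nt, which matches the End value of Exon 8. The 2923 nt transcript length in the record appears to include the trailing run of A residues at the end of the listed mRNA sequence as well.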
Primary source | NCBI : CCDS34317
Nucleotide | SQSTM1, mRNA isoform 1[NM_003900.4] : 96...1418
Length | 1323
Location | Chromosome 5 (NC_000005.9) strand : +
179247936...179248140 | 179249957...179250052 | 179250857...179251086 | 179251181...179251322 |
179252145...179252225 | 179260031...179260245 | 179260586...179260781 | 179263435...179263592
Start codon | 1
Translation | NP_003891.1
|
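The CDS record gives both transcript coordinates (96...1418) and genomic coordinates (179247936 to 179263592). The sketch below shows how one set can be derived from the other by walking the exon list. The function name `transcript_to_genomic` is an assumption made for illustration, and the code only handles plus-strand transcripts with 1-based inclusive coordinates.

```python
# Genomic exon intervals for NM_003900.4 (1-based, inclusive, + strand).
EXONS_V1 = [
    (179247841, 179248140),
    (179249957, 179250052),
    (179250857, 179251086),
    (179251181, 179251322),
    (179252145, 179252225),
    (179260031, 179260245),
    (179260586, 179260781),
    (179263435, 179265077),
]


def transcript_to_genomic(pos, exons):
    """Map a 1-based transcript position to its genomic coordinate.

    Hypothetical helper: assumes sorted exons on the + strand.
    """
    if pos < 1:
        raise ValueError("transcript positions are 1-based")
    remaining = pos
    for g_start, g_end in exons:
        length = g_end - g_start + 1
        if remaining <= length:
            return g_start + remaining - 1
        remaining -= length  # skip this exon and continue in the next one
    raise ValueError("position lies beyond the last exon")


# CDS bounds in transcript coordinates, as listed in the record.
CDS_START, CDS_END = 96, 1418
cds_length = CDS_END - CDS_START + 1  # 1323 nt
codons = cds_length // 3              # 441, counting the stop codon

if __name__ == "__main__":
    print(transcript_to_genomic(CDS_START, EXONS_V1))
    print(transcript_to_genomic(CDS_END, EXONS_V1))
    print(cds_length, codons - 1)
```

Under these assumptions the mapped CDS bounds agree with the genomic Location listed for the CDS, and 441 codons less the stop codon gives 440 residues, consistent with the 440 aa length listed for Sequestosome-1 in the protein table of this record.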
1 |
CCTCTCGAGG |
CGGGGCGGGG |
CCTCCGCGTT |
CGCTACAAAA |
GCCGCGCGGC |
GGCTGCGACC |
GGGACGGCCC |
GTTTTCCGCC |
|
81 |
AGCTCGCCGC |
TCGCTATGGC |
GTCGCTCACC |
GTGAAGGCCT |
ACCTTCTGGG |
CAAGGAGGAC |
GCGGCGCGCG |
AGATTCGCCG |
|
161 |
CTTCAGCTTC |
TGCTGCAGCC |
CCGAGCCTGA |
GGCGGAAGCC |
GAGGCTGCGG |
CGGGTCCGGG |
ACCCTGCGAG |
CGGCTGCTGA |
|
241 |
GCCGGGTGGC |
CGCCCTGTTC |
CCCGCGCTGC |
GGCCTGGCGG |
CTTCCAGGCG |
CACTACCGCG |
ATGAGGACGG |
GGACTTGGTT |
|
321 |
GCCTTTTCCA |
GTGACGAGGA |
ATTGACAATG |
GCCATGTCCT |
ACGTGAAGGA |
TGACATCTTC |
CGAATCTACA |
TTAAAGAGAA |
|
401 |
AAAAGAGTGC |
CGGCGGGACC |
ACCGCCCACC |
GTGTGCTCAG |
GAGGCGCCCC |
GCAACATGGT |
GCACCCCAAT |
GTGATCTGCG |
|
481 |
ATGGCTGCAA |
TGGGCCTGTG |
GTAGGAACCC |
GCTACAAGTG |
CAGCGTCTGC |
CCAGACTACG |
ACTTGTGTAG |
CGTCTGCGAG |
|
561 |
GGAAAGGGCT |
TGCACCGGGG |
GCACACCAAG |
CTCGCATTCC |
CCAGCCCCTT |
CGGGCACCTG |
TCTGAGGGCT |
TCTCGCACAG |
|
641 |
CCGCTGGCTC |
CGGAAGGTGA |
AACACGGACA |
CTTCGGGTGG |
CCAGGATGGG |
AAATGGGTCC |
ACCAGGAAAC |
TGGAGCCCAC |
|
721 |
GTCCTCCTCG |
TGCAGGGGAG |
GCCCGCCCTG |
GCCCCACGGC |
AGAATCAGCT |
TCTGGTCCAT |
CGGAGGATCC |
GAGTGTGAAT |
|
801 |
TTCCTGAAGA |
ACGTTGGGGA |
GAGTGTGGCA |
GCTGCCCTTA |
GCCCTCTGGG |
CATTGAAGTT |
GATATCGATG |
TGGAGCACGG |
|
881 |
AGGGAAAAGA |
AGCCGCCTGA |
CCCCCGTCTC |
TCCAGAGAGT |
TCCAGCACAG |
AGGAGAAGAG |
CAGCTCACAG |
CCAAGCAGCT |
|
961 |
GCTGCTCTGA |
CCCCAGCAAG |
CCGGGTGGGA |
ATGTTGAGGG |
CGCCACGCAG |
TCTCTGGCGG |
AGCAGATGAG |
GAAGATCGCC |
|
1041 |
TTGGAGTCCG |
AGGGGCGCCC |
TGAGGAACAG |
ATGGAGTCGG |
ATAACTGTTC |
AGGAGGAGAT |
GATGACTGGA |
CCCATCTGTC |
|
1121 |
TTCAAAAGAA |
GTGGACCCGT |
CTACAGGTGA |
ACTCCAGTCC |
CTACAGATGC |
CAGAATCCGA |
AGGGCCAAGC |
TCTCTGGACC |
|
1201 |
CCTCCCAGGA |
GGGACCCACA |
GGGCTGAAGG |
AAGCTGCCTT |
GTACCCACAT |
CTCCCGCCAG |
AGGCTGACCC |
GCGGCTGATT |
|
1281 |
GAGTCCCTCT |
CCCAGATGCT |
GTCCATGGGC |
TTCTCTGATG |
AAGGCGGCTG |
GCTCACCAGG |
CTCCTGCAGA |
CCAAGAACTA |
|
1361 |
TGACATCGGA |
GCGGCTCTGG |
ACACCATCCA |
GTATTCAAAG |
CATCCCCCGC |
CGTTGTGACC |
ACTTTTGCCC |
ACCTCTTCTG |
|
1441 |
CGTGCCCCTC |
TTCTGTCTCA |
TAGTTGTGTT |
AAGCTTGCGT |
AGAATTGCAG |
GTCTCTGTAC |
GGGCCAGTTT |
CTCTGCCTTC |
|
1521 |
TTCCAGGATC |
AGGGGTTAGG |
GTGCAAGAAG |
CCATTTAGGG |
CAGCAAAACA |
AGTGACATGA |
AGGGAGGGTC |
CCTGTGTGTG |
|
1601 |
TGTGTGCTGA |
TGTTTCCTGG |
GTGCCCTGGC |
TCCTTGCAGC |
AGGGCTGGGC |
CTGCGAGACC |
CAAGGCTCAC |
TGCAGCGCGC |
|
1681 |
TCCTGACCCC |
TCCCTGCAGG |
GGCTACGTTA |
GCAGCCCAGC |
ACATAGCTTG |
CCTAATGGCT |
TTCACTTTCT |
CTTTTGTTTT |
|
1761 |
AAATGACTCA |
TAGGTCCCTG |
ACATTTAGTT |
GATTATTTTC |
TGCTACAGAC |
CTGGTACACT |
CTGATTTTAG |
ATAAAGTAAG |
|
1841 |
CCTAGGTGTT |
GTCAGCAGGC |
AGGCTGGGGA |
GGCCAGTGTT |
GTGGGCTTCC |
TGCTGGGACT |
GAGAAGGCTC |
ACGAAGGGCA |
|
1921 |
TCCGCAATGT |
TGGTTTCACT |
GAGAGCTGCC |
TCCTGGTCTC |
TTCACCACTG |
TAGTTCTCTC |
ATTTCCAAAC |
CATCAGCTGC |
|
2001 |
TTTTAAAATA |
AGATCTCTTT |
GTAGCCATCC |
TGTTAAATTT |
GTAAACAATC |
TAATTAAATG |
GCATCAGCAC |
TTTAACCAAT |
|
2081 |
GACGTTTGCA |
TAGAGAGAAA |
TGATTGACAG |
TAAGTTTATT |
GTTAATGGTT |
CTTACAGAGT |
ATCTTTAAAA |
GTGCCTTAGG |
|
2161 |
GGAACCCTGT |
CCCTCCTAAC |
AAGTGTATCT |
CGATTAATAA |
CCTGCCAGTC |
CCAGATCACA |
CATCATCATC |
GAAGTCTTCC |
|
2241 |
CCAGTTATAA |
AGAGGTCACA |
TAGTCGTGTG |
GGTCGAGGAT |
TCTGTGCCTC |
CAGGACCAGG |
GGCCCACCCT |
CTGCCCAGGG |
|
2321 |
AGTCCTTGCG |
TCCCATGAGG |
TCTTCCCGCA |
AGGCCTCTCA |
GACCCAGATG |
TGACGGGGTG |
TGTGGCCCGA |
GGAAGCTGGA |
|
2401 |
CAGCGGCAGT |
GGGCCTGCTG |
AGGCCTTCTC |
TTGAGGCCTG |
TGCTCTGGGG |
GTCCCTTGCT |
TAGCCTGTGC |
TGGACCAGCT |
|
2481 |
GGCCTGGGGT |
CCCTCTGAAG |
AGACCTTGGC |
TGCTCACTGT |
CCACATGTGA |
ACTTTTTCTA |
GGTGGCAGGA |
CAAATTGCGC |
|
2561 |
CCATTTAGAG |
GATGTGGCTG |
TAACCTGCTG |
GATGGGACTC |
CATAGCTCCT |
TCCCAGGACC |
CCTCAGCTCC |
CCGGCACTGC |
|
2641 |
AGTCTGCAGA |
GTTCTCCTGG |
AGGCAGGGGC |
TGCTGCCTTG |
TTTCACCTTC |
CATGTCAGGC |
CAGCCTGTCC |
CTGAAAGAGA |
|
2721 |
AGATGGCCAT |
GCCCTCCATG |
TGTAAGAACA |
ATGCCAGGGC |
CCAGGAGGAC |
CGCCTGCCCT |
GCCTGGGCCT |
TGGCTGGGCC |
|
2801 |
TCTGGTTCTG |
ACACTTTCTG |
CTGGAAGCTG |
TCAGGCTGGG |
ACAGGCTTTG |
ATTTTGAGGG |
TTAGCAAGAC |
AAAGCAAATA |
|
2881 |
AATGCCTTCC |
ACCTCACCGC |
AAAAAAAAAA |
AAAAAAAAAA |
AAA |
|
>gi|188497651|ref|NM_003900.4|Sequestosome 1 (SQSTM1), transcript variant 1
CCTCTCGAGGCGGGGCGGGGCCTCCGCGTTCGCTACAAAAGCCGCGCGGCGGCTGCGACCGGGACGGCCCGTTTTCCGCC
AGCTCGCCGCTCGCTATGGCGTCGCTCACCGTGAAGGCCTACCTTCTGGGCAAGGAGGACGCGGCGCGCGAGATTCGCCG
CTTCAGCTTCTGCTGCAGCCCCGAGCCTGAGGCGGAAGCCGAGGCTGCGGCGGGTCCGGGACCCTGCGAGCGGCTGCTGA
GCCGGGTGGCCGCCCTGTTCCCCGCGCTGCGGCCTGGCGGCTTCCAGGCGCACTACCGCGATGAGGACGGGGACTTGGTT
GCCTTTTCCAGTGACGAGGAATTGACAATGGCCATGTCCTACGTGAAGGATGACATCTTCCGAATCTACATTAAAGAGAA
AAAAGAGTGCCGGCGGGACCACCGCCCACCGTGTGCTCAGGAGGCGCCCCGCAACATGGTGCACCCCAATGTGATCTGCG
ATGGCTGCAATGGGCCTGTGGTAGGAACCCGCTACAAGTGCAGCGTCTGCCCAGACTACGACTTGTGTAGCGTCTGCGAG
GGAAAGGGCTTGCACCGGGGGCACACCAAGCTCGCATTCCCCAGCCCCTTCGGGCACCTGTCTGAGGGCTTCTCGCACAG
CCGCTGGCTCCGGAAGGTGAAACACGGACACTTCGGGTGGCCAGGATGGGAAATGGGTCCACCAGGAAACTGGAGCCCAC
GTCCTCCTCGTGCAGGGGAGGCCCGCCCTGGCCCCACGGCAGAATCAGCTTCTGGTCCATCGGAGGATCCGAGTGTGAAT
TTCCTGAAGAACGTTGGGGAGAGTGTGGCAGCTGCCCTTAGCCCTCTGGGCATTGAAGTTGATATCGATGTGGAGCACGG
AGGGAAAAGAAGCCGCCTGACCCCCGTCTCTCCAGAGAGTTCCAGCACAGAGGAGAAGAGCAGCTCACAGCCAAGCAGCT
GCTGCTCTGACCCCAGCAAGCCGGGTGGGAATGTTGAGGGCGCCACGCAGTCTCTGGCGGAGCAGATGAGGAAGATCGCC
TTGGAGTCCGAGGGGCGCCCTGAGGAACAGATGGAGTCGGATAACTGTTCAGGAGGAGATGATGACTGGACCCATCTGTC
TTCAAAAGAAGTGGACCCGTCTACAGGTGAACTCCAGTCCCTACAGATGCCAGAATCCGAAGGGCCAAGCTCTCTGGACC
CCTCCCAGGAGGGACCCACAGGGCTGAAGGAAGCTGCCTTGTACCCACATCTCCCGCCAGAGGCTGACCCGCGGCTGATT
GAGTCCCTCTCCCAGATGCTGTCCATGGGCTTCTCTGATGAAGGCGGCTGGCTCACCAGGCTCCTGCAGACCAAGAACTA
TGACATCGGAGCGGCTCTGGACACCATCCAGTATTCAAAGCATCCCCCGCCGTTGTGACCACTTTTGCCCACCTCTTCTG
CGTGCCCCTCTTCTGTCTCATAGTTGTGTTAAGCTTGCGTAGAATTGCAGGTCTCTGTACGGGCCAGTTTCTCTGCCTTC
TTCCAGGATCAGGGGTTAGGGTGCAAGAAGCCATTTAGGGCAGCAAAACAAGTGACATGAAGGGAGGGTCCCTGTGTGTG
TGTGTGCTGATGTTTCCTGGGTGCCCTGGCTCCTTGCAGCAGGGCTGGGCCTGCGAGACCCAAGGCTCACTGCAGCGCGC
TCCTGACCCCTCCCTGCAGGGGCTACGTTAGCAGCCCAGCACATAGCTTGCCTAATGGCTTTCACTTTCTCTTTTGTTTT
AAATGACTCATAGGTCCCTGACATTTAGTTGATTATTTTCTGCTACAGACCTGGTACACTCTGATTTTAGATAAAGTAAG
CCTAGGTGTTGTCAGCAGGCAGGCTGGGGAGGCCAGTGTTGTGGGCTTCCTGCTGGGACTGAGAAGGCTCACGAAGGGCA
TCCGCAATGTTGGTTTCACTGAGAGCTGCCTCCTGGTCTCTTCACCACTGTAGTTCTCTCATTTCCAAACCATCAGCTGC
TTTTAAAATAAGATCTCTTTGTAGCCATCCTGTTAAATTTGTAAACAATCTAATTAAATGGCATCAGCACTTTAACCAAT
GACGTTTGCATAGAGAGAAATGATTGACAGTAAGTTTATTGTTAATGGTTCTTACAGAGTATCTTTAAAAGTGCCTTAGG
GGAACCCTGTCCCTCCTAACAAGTGTATCTCGATTAATAACCTGCCAGTCCCAGATCACACATCATCATCGAAGTCTTCC
CCAGTTATAAAGAGGTCACATAGTCGTGTGGGTCGAGGATTCTGTGCCTCCAGGACCAGGGGCCCACCCTCTGCCCAGGG
AGTCCTTGCGTCCCATGAGGTCTTCCCGCAAGGCCTCTCAGACCCAGATGTGACGGGGTGTGTGGCCCGAGGAAGCTGGA
CAGCGGCAGTGGGCCTGCTGAGGCCTTCTCTTGAGGCCTGTGCTCTGGGGGTCCCTTGCTTAGCCTGTGCTGGACCAGCT
GGCCTGGGGTCCCTCTGAAGAGACCTTGGCTGCTCACTGTCCACATGTGAACTTTTTCTAGGTGGCAGGACAAATTGCGC
CCATTTAGAGGATGTGGCTGTAACCTGCTGGATGGGACTCCATAGCTCCTTCCCAGGACCCCTCAGCTCCCCGGCACTGC
AGTCTGCAGAGTTCTCCTGGAGGCAGGGGCTGCTGCCTTGTTTCACCTTCCATGTCAGGCCAGCCTGTCCCTGAAAGAGA
AGATGGCCATGCCCTCCATGTGTAAGAACAATGCCAGGGCCCAGGAGGACCGCCTGCCCTGCCTGGGCCTTGGCTGGGCC
TCTGGTTCTGACACTTTCTGCTGGAAGCTGTCAGGCTGGGACAGGCTTTGATTTTGAGGGTTAGCAAGACAAAGCAAATA
AATGCCTTCCACCTCACCGCAAAAAAAAAAAAAAAAAAAAAAA
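The FASTA record above can be checked against the CDS bounds listed for this transcript (96...1418). The sketch below splits the definition line into its fields and looks up the codons at the two CDS ends. It is illustrative only: `parse_header` and `codon_at` are hypothetical helpers, only the two 80-nt rows needed for the lookup are copied in, and `codon_at` assumes the requested codon does not straddle a row boundary.

```python
HEADER = ">gi|188497651|ref|NM_003900.4|Sequestosome 1 (SQSTM1), transcript variant 1"

# Two 80-nt rows copied from the FASTA record, keyed by the 1-based
# transcript position of their first base.
ROWS = {
    81: "AGCTCGCCGCTCGCTATGGCGTCGCTCACCGTGAAGGCCTACCTTCTGGGCAAGGAGGACGCGGCGCGCGAGATTCGCCG",
    1361: "TGACATCGGAGCGGCTCTGGACACCATCCAGTATTCAAAGCATCCCCCGCCGTTGTGACCACTTTTGCCCACCTCTTCTG",
}


def parse_header(header):
    """Split a 'gi|...|ref|...|description' definition line into fields."""
    _db, gi, _ref_db, accession, description = header.lstrip(">").split("|", 4)
    return {"gi": gi, "accession": accession, "description": description}


def codon_at(pos, rows, width=80):
    """Return the 3 bases starting at 1-based position `pos`.

    Assumes the codon lies entirely within one row of `width` bases.
    """
    row_start = ((pos - 1) // width) * width + 1
    offset = pos - row_start
    return rows[row_start][offset:offset + 3]


if __name__ == "__main__":
    print(parse_header(HEADER)["accession"])
    print(codon_at(96, ROWS))    # first codon of the CDS
    print(codon_at(1416, ROWS))  # last codon of the CDS
```

With the rows as copied, position 96 starts an ATG and positions 1416 to 1418 read TGA, which is what one would expect at the start and end of the annotated coding region.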
|
Primary source | RefSeq : NM_001142298.1 (GI:214830437)
Name | Sequestosome 1 (SQSTM1), transcript variant 2
Type of transcript | mRNA
Organism | Homo sapiens
Length | 2931 nt
Map | 5q35
Location | Chromosome 5 (NC_000005.9) strand : +
179233387...179233590 | 179238573...179238681 | 179249957...179250052 | 179250857...179251086 |
179251181...179251322 | 179252145...179252225 | 179260031...179260245 | 179260586...179260781 |
179263435...179265077
|
Exon(s) | Start | End | Length | is coding
Exon 9 | 1 | 204 | 204 | 1
Exon 11 | 205 | 313 | 109 | 1
Exon 2 | 314 | 409 | 96 | 1
Exon 3 | 410 | 639 | 230 | 1
Exon 4 | 640 | 781 | 142 | 1
Exon 5 | 782 | 862 | 81 | 1
Exon 6 | 863 | 1077 | 215 | 1
Exon 7 | 1078 | 1273 | 196 | 1
Exon 8 | 1274 | 2916 | 1643 | 1
|
ID | Class | Location | Mutation | Length | is synonymous | Source
|
Primary source | NCBI : CCDS47355
Nucleotide | SQSTM1, mRNA isoform 2[NM_001142298.1] : 361...1431
Length | 1071
Location | Chromosome 5 (NC_000005.9) strand : +
179250004...179250052 | 179250857...179251086 | 179251181...179251322 | 179252145...179252225 |
179260031...179260245 | 179260586...179260781 | 179263435...179263592
Start codon | 1
Translation | NP_001135770.1
|
1 |
GCGTCGGCTT |
CCGGCCGCCT |
TCCGCGGCCA |
CCGCCGGGCC |
CGCTCCCGCC |
GCCGACGCCC |
AGGTGCGCCA |
GGTGCGGGCC |
|
81 |
GGGCGGGGGT |
CGCGCTCACC |
TTTCTGGCCG |
CTGAGTGCCG |
CGTACCAGGA |
CAGCGAGAGG |
AAGGCGCACA |
GGCAGAAGAG |
|
161 |
CAGCAGCGTC |
AGGAAGGTGC |
CATTGCGGAG |
CCTCATCTCC |
TCGGTGTCTG |
CGAGATTAAT |
CTCTCATGGC |
CGCTGCACAA |
|
241 |
GAACCTGGCT |
TTTAGCTGAA |
CTAAGGAGAA |
AGTCCTACAA |
CAGTTTGGCG |
TGCAACATGG |
GGCTTGAGAA |
AGGATGAGGA |
|
321 |
CGGGGACTTG |
GTTGCCTTTT |
CCAGTGACGA |
GGAATTGACA |
ATGGCCATGT |
CCTACGTGAA |
GGATGACATC |
TTCCGAATCT |
|
401 |
ACATTAAAGA |
GAAAAAAGAG |
TGCCGGCGGG |
ACCACCGCCC |
ACCGTGTGCT |
CAGGAGGCGC |
CCCGCAACAT |
GGTGCACCCC |
|
481 |
AATGTGATCT |
GCGATGGCTG |
CAATGGGCCT |
GTGGTAGGAA |
CCCGCTACAA |
GTGCAGCGTC |
TGCCCAGACT |
ACGACTTGTG |
|
561 |
TAGCGTCTGC |
GAGGGAAAGG |
GCTTGCACCG |
GGGGCACACC |
AAGCTCGCAT |
TCCCCAGCCC |
CTTCGGGCAC |
CTGTCTGAGG |
|
641 |
GCTTCTCGCA |
CAGCCGCTGG |
CTCCGGAAGG |
TGAAACACGG |
ACACTTCGGG |
TGGCCAGGAT |
GGGAAATGGG |
TCCACCAGGA |
|
721 |
AACTGGAGCC |
CACGTCCTCC |
TCGTGCAGGG |
GAGGCCCGCC |
CTGGCCCCAC |
GGCAGAATCA |
GCTTCTGGTC |
CATCGGAGGA |
|
801 |
TCCGAGTGTG |
AATTTCCTGA |
AGAACGTTGG |
GGAGAGTGTG |
GCAGCTGCCC |
TTAGCCCTCT |
GGGCATTGAA |
GTTGATATCG |
|
881 |
ATGTGGAGCA |
CGGAGGGAAA |
AGAAGCCGCC |
TGACCCCCGT |
CTCTCCAGAG |
AGTTCCAGCA |
CAGAGGAGAA |
GAGCAGCTCA |
|
961 |
CAGCCAAGCA |
GCTGCTGCTC |
TGACCCCAGC |
AAGCCGGGTG |
GGAATGTTGA |
GGGCGCCACG |
CAGTCTCTGG |
CGGAGCAGAT |
|
1041 |
GAGGAAGATC |
GCCTTGGAGT |
CCGAGGGGCG |
CCCTGAGGAA |
CAGATGGAGT |
CGGATAACTG |
TTCAGGAGGA |
GATGATGACT |
|
1121 |
GGACCCATCT |
GTCTTCAAAA |
GAAGTGGACC |
CGTCTACAGG |
TGAACTCCAG |
TCCCTACAGA |
TGCCAGAATC |
CGAAGGGCCA |
|
1201 |
AGCTCTCTGG |
ACCCCTCCCA |
GGAGGGACCC |
ACAGGGCTGA |
AGGAAGCTGC |
CTTGTACCCA |
CATCTCCCGC |
CAGAGGCTGA |
|
1281 |
CCCGCGGCTG |
ATTGAGTCCC |
TCTCCCAGAT |
GCTGTCCATG |
GGCTTCTCTG |
ATGAAGGCGG |
CTGGCTCACC |
AGGCTCCTGC |
|
1361 |
AGACCAAGAA |
CTATGACATC |
GGAGCGGCTC |
TGGACACCAT |
CCAGTATTCA |
AAGCATCCCC |
CGCCGTTGTG |
ACCACTTTTG |
|
1441 |
CCCACCTCTT |
CTGCGTGCCC |
CTCTTCTGTC |
TCATAGTTGT |
GTTAAGCTTG |
CGTAGAATTG |
CAGGTCTCTG |
TACGGGCCAG |
|
1521 |
TTTCTCTGCC |
TTCTTCCAGG |
ATCAGGGGTT |
AGGGTGCAAG |
AAGCCATTTA |
GGGCAGCAAA |
ACAAGTGACA |
TGAAGGGAGG |
|
1601 |
GTCCCTGTGT |
GTGTGTGTGC |
TGATGTTTCC |
TGGGTGCCCT |
GGCTCCTTGC |
AGCAGGGCTG |
GGCCTGCGAG |
ACCCAAGGCT |
|
1681 |
CACTGCAGCG |
CGCTCCTGAC |
CCCTCCCTGC |
AGGGGCTACG |
TTAGCAGCCC |
AGCACATAGC |
TTGCCTAATG |
GCTTTCACTT |
|
1761 |
TCTCTTTTGT |
TTTAAATGAC |
TCATAGGTCC |
CTGACATTTA |
GTTGATTATT |
TTCTGCTACA |
GACCTGGTAC |
ACTCTGATTT |
|
1841 |
TAGATAAAGT |
AAGCCTAGGT |
GTTGTCAGCA |
GGCAGGCTGG |
GGAGGCCAGT |
GTTGTGGGCT |
TCCTGCTGGG |
ACTGAGAAGG |
|
1921 |
CTCACGAAGG |
GCATCCGCAA |
TGTTGGTTTC |
ACTGAGAGCT |
GCCTCCTGGT |
CTCTTCACCA |
CTGTAGTTCT |
CTCATTTCCA |
|
2001 |
AACCATCAGC |
TGCTTTTAAA |
ATAAGATCTC |
TTTGTAGCCA |
TCCTGTTAAA |
TTTGTAAACA |
ATCTAATTAA |
ATGGCATCAG |
|
2081 |
CACTTTAACC |
AATGACGTTT |
GCATAGAGAG |
AAATGATTGA |
CAGTAAGTTT |
ATTGTTAATG |
GTTCTTACAG |
AGTATCTTTA |
|
2161 |
AAAGTGCCTT |
AGGGGAACCC |
TGTCCCTCCT |
AACAAGTGTA |
TCTCGATTAA |
TAACCTGCCA |
GTCCCAGATC |
ACACATCATC |
|
2241 |
ATCGAAGTCT |
TCCCCAGTTA |
TAAAGAGGTC |
ACATAGTCGT |
GTGGGTCGAG |
GATTCTGTGC |
CTCCAGGACC |
AGGGGCCCAC |
|
2321 |
CCTCTGCCCA |
GGGAGTCCTT |
GCGTCCCATG |
AGGTCTTCCC |
GCAAGGCCTC |
TCAGACCCAG |
ATGTGACGGG |
GTGTGTGGCC |
|
2401 |
CGAGGAAGCT |
GGACAGCGGC |
AGTGGGCCTG |
CTGAGGCCTT |
CTCTTGAGGC |
CTGTGCTCTG |
GGGGTCCCTT |
GCTTAGCCTG |
|
2481 |
TGCTGGACCA |
GCTGGCCTGG |
GGTCCCTCTG |
AAGAGACCTT |
GGCTGCTCAC |
TGTCCACATG |
TGAACTTTTT |
CTAGGTGGCA |
|
2561 |
GGACAAATTG |
CGCCCATTTA |
GAGGATGTGG |
CTGTAACCTG |
CTGGATGGGA |
CTCCATAGCT |
CCTTCCCAGG |
ACCCCTCAGC |
|
2641 |
TCCCCGGCAC |
TGCAGTCTGC |
AGAGTTCTCC |
TGGAGGCAGG |
GGCTGCTGCC |
TTGTTTCACC |
TTCCATGTCA |
GGCCAGCCTG |
|
2721 |
TCCCTGAAAG |
AGAAGATGGC |
CATGCCCTCC |
ATGTGTAAGA |
ACAATGCCAG |
GGCCCAGGAG |
GACCGCCTGC |
CCTGCCTGGG |
|
2801 |
CCTTGGCTGG |
GCCTCTGGTT |
CTGACACTTT |
CTGCTGGAAG |
CTGTCAGGCT |
GGGACAGGCT |
TTGATTTTGA |
GGGTTAGCAA |
|
2881 |
GACAAAGCAA |
ATAAATGCCT |
TCCACCTCAC |
CGCAAAAAAA |
AAAAAAAAAA |
A |
|
>gi|214830437|ref|NM_001142298.1|Sequestosome 1 (SQSTM1), transcript variant 2
GCGTCGGCTTCCGGCCGCCTTCCGCGGCCACCGCCGGGCCCGCTCCCGCCGCCGACGCCCAGGTGCGCCAGGTGCGGGCC
GGGCGGGGGTCGCGCTCACCTTTCTGGCCGCTGAGTGCCGCGTACCAGGACAGCGAGAGGAAGGCGCACAGGCAGAAGAG
CAGCAGCGTCAGGAAGGTGCCATTGCGGAGCCTCATCTCCTCGGTGTCTGCGAGATTAATCTCTCATGGCCGCTGCACAA
GAACCTGGCTTTTAGCTGAACTAAGGAGAAAGTCCTACAACAGTTTGGCGTGCAACATGGGGCTTGAGAAAGGATGAGGA
CGGGGACTTGGTTGCCTTTTCCAGTGACGAGGAATTGACAATGGCCATGTCCTACGTGAAGGATGACATCTTCCGAATCT
ACATTAAAGAGAAAAAAGAGTGCCGGCGGGACCACCGCCCACCGTGTGCTCAGGAGGCGCCCCGCAACATGGTGCACCCC
AATGTGATCTGCGATGGCTGCAATGGGCCTGTGGTAGGAACCCGCTACAAGTGCAGCGTCTGCCCAGACTACGACTTGTG
TAGCGTCTGCGAGGGAAAGGGCTTGCACCGGGGGCACACCAAGCTCGCATTCCCCAGCCCCTTCGGGCACCTGTCTGAGG
GCTTCTCGCACAGCCGCTGGCTCCGGAAGGTGAAACACGGACACTTCGGGTGGCCAGGATGGGAAATGGGTCCACCAGGA
AACTGGAGCCCACGTCCTCCTCGTGCAGGGGAGGCCCGCCCTGGCCCCACGGCAGAATCAGCTTCTGGTCCATCGGAGGA
TCCGAGTGTGAATTTCCTGAAGAACGTTGGGGAGAGTGTGGCAGCTGCCCTTAGCCCTCTGGGCATTGAAGTTGATATCG
ATGTGGAGCACGGAGGGAAAAGAAGCCGCCTGACCCCCGTCTCTCCAGAGAGTTCCAGCACAGAGGAGAAGAGCAGCTCA
CAGCCAAGCAGCTGCTGCTCTGACCCCAGCAAGCCGGGTGGGAATGTTGAGGGCGCCACGCAGTCTCTGGCGGAGCAGAT
GAGGAAGATCGCCTTGGAGTCCGAGGGGCGCCCTGAGGAACAGATGGAGTCGGATAACTGTTCAGGAGGAGATGATGACT
GGACCCATCTGTCTTCAAAAGAAGTGGACCCGTCTACAGGTGAACTCCAGTCCCTACAGATGCCAGAATCCGAAGGGCCA
AGCTCTCTGGACCCCTCCCAGGAGGGACCCACAGGGCTGAAGGAAGCTGCCTTGTACCCACATCTCCCGCCAGAGGCTGA
CCCGCGGCTGATTGAGTCCCTCTCCCAGATGCTGTCCATGGGCTTCTCTGATGAAGGCGGCTGGCTCACCAGGCTCCTGC
AGACCAAGAACTATGACATCGGAGCGGCTCTGGACACCATCCAGTATTCAAAGCATCCCCCGCCGTTGTGACCACTTTTG
CCCACCTCTTCTGCGTGCCCCTCTTCTGTCTCATAGTTGTGTTAAGCTTGCGTAGAATTGCAGGTCTCTGTACGGGCCAG
TTTCTCTGCCTTCTTCCAGGATCAGGGGTTAGGGTGCAAGAAGCCATTTAGGGCAGCAAAACAAGTGACATGAAGGGAGG
GTCCCTGTGTGTGTGTGTGCTGATGTTTCCTGGGTGCCCTGGCTCCTTGCAGCAGGGCTGGGCCTGCGAGACCCAAGGCT
CACTGCAGCGCGCTCCTGACCCCTCCCTGCAGGGGCTACGTTAGCAGCCCAGCACATAGCTTGCCTAATGGCTTTCACTT
TCTCTTTTGTTTTAAATGACTCATAGGTCCCTGACATTTAGTTGATTATTTTCTGCTACAGACCTGGTACACTCTGATTT
TAGATAAAGTAAGCCTAGGTGTTGTCAGCAGGCAGGCTGGGGAGGCCAGTGTTGTGGGCTTCCTGCTGGGACTGAGAAGG
CTCACGAAGGGCATCCGCAATGTTGGTTTCACTGAGAGCTGCCTCCTGGTCTCTTCACCACTGTAGTTCTCTCATTTCCA
AACCATCAGCTGCTTTTAAAATAAGATCTCTTTGTAGCCATCCTGTTAAATTTGTAAACAATCTAATTAAATGGCATCAG
CACTTTAACCAATGACGTTTGCATAGAGAGAAATGATTGACAGTAAGTTTATTGTTAATGGTTCTTACAGAGTATCTTTA
AAAGTGCCTTAGGGGAACCCTGTCCCTCCTAACAAGTGTATCTCGATTAATAACCTGCCAGTCCCAGATCACACATCATC
ATCGAAGTCTTCCCCAGTTATAAAGAGGTCACATAGTCGTGTGGGTCGAGGATTCTGTGCCTCCAGGACCAGGGGCCCAC
CCTCTGCCCAGGGAGTCCTTGCGTCCCATGAGGTCTTCCCGCAAGGCCTCTCAGACCCAGATGTGACGGGGTGTGTGGCC
CGAGGAAGCTGGACAGCGGCAGTGGGCCTGCTGAGGCCTTCTCTTGAGGCCTGTGCTCTGGGGGTCCCTTGCTTAGCCTG
TGCTGGACCAGCTGGCCTGGGGTCCCTCTGAAGAGACCTTGGCTGCTCACTGTCCACATGTGAACTTTTTCTAGGTGGCA
GGACAAATTGCGCCCATTTAGAGGATGTGGCTGTAACCTGCTGGATGGGACTCCATAGCTCCTTCCCAGGACCCCTCAGC
TCCCCGGCACTGCAGTCTGCAGAGTTCTCCTGGAGGCAGGGGCTGCTGCCTTGTTTCACCTTCCATGTCAGGCCAGCCTG
TCCCTGAAAGAGAAGATGGCCATGCCCTCCATGTGTAAGAACAATGCCAGGGCCCAGGAGGACCGCCTGCCCTGCCTGGG
CCTTGGCTGGGCCTCTGGTTCTGACACTTTCTGCTGGAAGCTGTCAGGCTGGGACAGGCTTTGATTTTGAGGGTTAGCAA
GACAAAGCAAATAAATGCCTTCCACCTCACCGCAAAAAAAAAAAAAAAAAA
|
Primary source | RefSeq : NM_001142299.1 (GI:214830450)
Name | Sequestosome 1 (SQSTM1), transcript variant 3
Type of transcript | mRNA
Organism | Homo sapiens
Length | 2848 nt
Map | 5q35
Location | Chromosome 5 (NC_000005.9) strand : +
179234002...179234122 | 179238573...179238681 | 179249957...179250052 | 179250857...179251086 |
179251181...179251322 | 179252145...179252225 | 179260031...179260245 | 179260586...179260781 |
179263435...179265077 |
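As a quick consistency check on the Location coordinates of this transcript, the following Python sketch derives each exon's length and its start and end within the spliced transcript. The helper names (`parse_ranges`, `exon_lengths`, `transcript_offsets`) are illustrative and not part of any database API. It assumes the coordinates are 1-based and inclusive, and that on the + strand transcript order follows genomic order.

```python
import re

# Genomic exon ranges copied from the Location field of NM_001142299.1
# (assumed 1-based, inclusive, + strand).
LOCATION = """
179234002...179234122 | 179238573...179238681 | 179249957...179250052 | 179250857...179251086 |
179251181...179251322 | 179252145...179252225 | 179260031...179260245 | 179260586...179260781 |
179263435...179265077 |
"""


def parse_ranges(text):
    """Return a list of (start, end) integer pairs found in 'start...end' tokens."""
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.\.(\d+)", text)]


def exon_lengths(ranges):
    """Length of each inclusive range."""
    return [end - start + 1 for start, end in ranges]


def transcript_offsets(lengths):
    """Convert consecutive exon lengths into 1-based (start, end) transcript positions."""
    offsets = []
    start = 1
    for n in lengths:
        offsets.append((start, start + n - 1))
        start += n
    return offsets


ranges = parse_ranges(LOCATION)
lengths = exon_lengths(ranges)
offsets = transcript_offsets(lengths)

for (g_start, g_end), n, (t_start, t_end) in zip(ranges, lengths, offsets):
    print(f"{g_start}...{g_end}\t{n} nt\ttranscript {t_start}-{t_end}")
print("total exonic length:", sum(lengths))
```

Under these assumptions the nine exons sum to 2833 nt. The listed transcript length of 2848 nt is 15 nt longer, which appears to correspond to the additional adenines at the 3' end of the record's sequence.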
|
Exon(s) | Start | End | Length | is coding
Exon 10 | 1 | 121 | 121 | 1
Exon 11 | 122 | 230 | 109 | 1
Exon 2 | 231 | 326 | 96 | 1
Exon 3 | 327 | 556 | 230 | 1
Exon 4 | 557 | 698 | 142 | 1
Exon 5 | 699 | 779 | 81 | 1
Exon 6 | 780 | 994 | 215 | 1
Exon 7 | 995 | 1190 | 196 | 1
Exon 8 | 1191 | 2833 | 1643 | 1
|
ID | Class | Location | Mutation | Length | is synonymous | Source
|
Primary source | NCBI : CCDS47355
Nucleotide | SQSTM1, mRNA isoform 3 [NM_001142299.1] : 278...1348
Length | 1071 nt
Location | Chromosome 5 (NC_000005.9) strand : +
179250004...179250052 | 179250857...179251086 | 179251181...179251322 | 179252145...179252225 |
179260031...179260245 | 179260586...179260781 | 179263435...179263592 |
Start codon | 1
Translation | NP_001135771.1
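The relationship between the genomic CDS segments, the transcript interval 278...1348 and the length of the translation can be checked with a short Python sketch. The function name `genomic_to_transcript` is hypothetical. The sketch assumes 1-based inclusive coordinates on the + strand and that the CDS interval includes the stop codon.

```python
# Exon ranges of NM_001142299.1 and CDS segments of CCDS47355, copied from
# this record (assumed 1-based, inclusive, + strand).
EXONS = [
    (179234002, 179234122), (179238573, 179238681), (179249957, 179250052),
    (179250857, 179251086), (179251181, 179251322), (179252145, 179252225),
    (179260031, 179260245), (179260586, 179260781), (179263435, 179265077),
]
CDS = [
    (179250004, 179250052), (179250857, 179251086), (179251181, 179251322),
    (179252145, 179252225), (179260031, 179260245), (179260586, 179260781),
    (179263435, 179263592),
]


def genomic_to_transcript(pos, exons):
    """Map a genomic position inside an exon to its 1-based transcript position."""
    offset = 0  # number of transcript bases contributed by earlier exons
    for start, end in exons:
        if start <= pos <= end:
            return offset + (pos - start) + 1
        offset += end - start + 1
    raise ValueError(f"position {pos} is not inside any exon")


cds_length = sum(end - start + 1 for start, end in CDS)
cds_start = genomic_to_transcript(CDS[0][0], EXONS)
cds_end = genomic_to_transcript(CDS[-1][1], EXONS)

# Assumption: the last codon of the CDS is the stop codon, so it is not translated.
protein_length = cds_length // 3 - 1

print(cds_length, cds_start, cds_end, protein_length)
```

With these inputs the CDS is 1071 nt long, spans transcript positions 278 to 1348, and would encode 356 residues. That is consistent with the 440 aa canonical protein lacking residues 1-84.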
|
1 |
GGATTTAAAG |
GGGCCGCAGC |
ACCGCCGTCG |
CCGGCGCCGC |
GAGGGGGTGG |
GGTGGGGGCC |
GGCGGCCGGG |
ATCCCGATCG |
|
81 |
GCTCCCGCAG |
CCCCGCGTGG |
GCTCGTGCGA |
GTCGGCCTCA |
GTGTCTGCGA |
GATTAATCTC |
TCATGGCCGC |
TGCACAAGAA |
|
161 |
CCTGGCTTTT |
AGCTGAACTA |
AGGAGAAAGT |
CCTACAACAG |
TTTGGCGTGC |
AACATGGGGC |
TTGAGAAAGG |
ATGAGGACGG |
|
241 |
GGACTTGGTT |
GCCTTTTCCA |
GTGACGAGGA |
ATTGACAATG |
GCCATGTCCT |
ACGTGAAGGA |
TGACATCTTC |
CGAATCTACA |
|
321 |
TTAAAGAGAA |
AAAAGAGTGC |
CGGCGGGACC |
ACCGCCCACC |
GTGTGCTCAG |
GAGGCGCCCC |
GCAACATGGT |
GCACCCCAAT |
|
401 |
GTGATCTGCG |
ATGGCTGCAA |
TGGGCCTGTG |
GTAGGAACCC |
GCTACAAGTG |
CAGCGTCTGC |
CCAGACTACG |
ACTTGTGTAG |
|
481 |
CGTCTGCGAG |
GGAAAGGGCT |
TGCACCGGGG |
GCACACCAAG |
CTCGCATTCC |
CCAGCCCCTT |
CGGGCACCTG |
TCTGAGGGCT |
|
561 |
TCTCGCACAG |
CCGCTGGCTC |
CGGAAGGTGA |
AACACGGACA |
CTTCGGGTGG |
CCAGGATGGG |
AAATGGGTCC |
ACCAGGAAAC |
|
641 |
TGGAGCCCAC |
GTCCTCCTCG |
TGCAGGGGAG |
GCCCGCCCTG |
GCCCCACGGC |
AGAATCAGCT |
TCTGGTCCAT |
CGGAGGATCC |
|
721 |
GAGTGTGAAT |
TTCCTGAAGA |
ACGTTGGGGA |
GAGTGTGGCA |
GCTGCCCTTA |
GCCCTCTGGG |
CATTGAAGTT |
GATATCGATG |
|
801 |
TGGAGCACGG |
AGGGAAAAGA |
AGCCGCCTGA |
CCCCCGTCTC |
TCCAGAGAGT |
TCCAGCACAG |
AGGAGAAGAG |
CAGCTCACAG |
|
881 |
CCAAGCAGCT |
GCTGCTCTGA |
CCCCAGCAAG |
CCGGGTGGGA |
ATGTTGAGGG |
CGCCACGCAG |
TCTCTGGCGG |
AGCAGATGAG |
|
961 |
GAAGATCGCC |
TTGGAGTCCG |
AGGGGCGCCC |
TGAGGAACAG |
ATGGAGTCGG |
ATAACTGTTC |
AGGAGGAGAT |
GATGACTGGA |
|
1041 |
CCCATCTGTC |
TTCAAAAGAA |
GTGGACCCGT |
CTACAGGTGA |
ACTCCAGTCC |
CTACAGATGC |
CAGAATCCGA |
AGGGCCAAGC |
|
1121 |
TCTCTGGACC |
CCTCCCAGGA |
GGGACCCACA |
GGGCTGAAGG |
AAGCTGCCTT |
GTACCCACAT |
CTCCCGCCAG |
AGGCTGACCC |
|
1201 |
GCGGCTGATT |
GAGTCCCTCT |
CCCAGATGCT |
GTCCATGGGC |
TTCTCTGATG |
AAGGCGGCTG |
GCTCACCAGG |
CTCCTGCAGA |
|
1281 |
CCAAGAACTA |
TGACATCGGA |
GCGGCTCTGG |
ACACCATCCA |
GTATTCAAAG |
CATCCCCCGC |
CGTTGTGACC |
ACTTTTGCCC |
|
1361 |
ACCTCTTCTG |
CGTGCCCCTC |
TTCTGTCTCA |
TAGTTGTGTT |
AAGCTTGCGT |
AGAATTGCAG |
GTCTCTGTAC |
GGGCCAGTTT |
|
1441 |
CTCTGCCTTC |
TTCCAGGATC |
AGGGGTTAGG |
GTGCAAGAAG |
CCATTTAGGG |
CAGCAAAACA |
AGTGACATGA |
AGGGAGGGTC |
|
1521 |
CCTGTGTGTG |
TGTGTGCTGA |
TGTTTCCTGG |
GTGCCCTGGC |
TCCTTGCAGC |
AGGGCTGGGC |
CTGCGAGACC |
CAAGGCTCAC |
|
1601 |
TGCAGCGCGC |
TCCTGACCCC |
TCCCTGCAGG |
GGCTACGTTA |
GCAGCCCAGC |
ACATAGCTTG |
CCTAATGGCT |
TTCACTTTCT |
|
1681 |
CTTTTGTTTT |
AAATGACTCA |
TAGGTCCCTG |
ACATTTAGTT |
GATTATTTTC |
TGCTACAGAC |
CTGGTACACT |
CTGATTTTAG |
|
1761 |
ATAAAGTAAG |
CCTAGGTGTT |
GTCAGCAGGC |
AGGCTGGGGA |
GGCCAGTGTT |
GTGGGCTTCC |
TGCTGGGACT |
GAGAAGGCTC |
|
1841 |
ACGAAGGGCA |
TCCGCAATGT |
TGGTTTCACT |
GAGAGCTGCC |
TCCTGGTCTC |
TTCACCACTG |
TAGTTCTCTC |
ATTTCCAAAC |
|
1921 |
CATCAGCTGC |
TTTTAAAATA |
AGATCTCTTT |
GTAGCCATCC |
TGTTAAATTT |
GTAAACAATC |
TAATTAAATG |
GCATCAGCAC |
|
2001 |
TTTAACCAAT |
GACGTTTGCA |
TAGAGAGAAA |
TGATTGACAG |
TAAGTTTATT |
GTTAATGGTT |
CTTACAGAGT |
ATCTTTAAAA |
|
2081 |
GTGCCTTAGG |
GGAACCCTGT |
CCCTCCTAAC |
AAGTGTATCT |
CGATTAATAA |
CCTGCCAGTC |
CCAGATCACA |
CATCATCATC |
|
2161 |
GAAGTCTTCC |
CCAGTTATAA |
AGAGGTCACA |
TAGTCGTGTG |
GGTCGAGGAT |
TCTGTGCCTC |
CAGGACCAGG |
GGCCCACCCT |
|
2241 |
CTGCCCAGGG |
AGTCCTTGCG |
TCCCATGAGG |
TCTTCCCGCA |
AGGCCTCTCA |
GACCCAGATG |
TGACGGGGTG |
TGTGGCCCGA |
|
2321 |
GGAAGCTGGA |
CAGCGGCAGT |
GGGCCTGCTG |
AGGCCTTCTC |
TTGAGGCCTG |
TGCTCTGGGG |
GTCCCTTGCT |
TAGCCTGTGC |
|
2401 |
TGGACCAGCT |
GGCCTGGGGT |
CCCTCTGAAG |
AGACCTTGGC |
TGCTCACTGT |
CCACATGTGA |
ACTTTTTCTA |
GGTGGCAGGA |
|
2481 |
CAAATTGCGC |
CCATTTAGAG |
GATGTGGCTG |
TAACCTGCTG |
GATGGGACTC |
CATAGCTCCT |
TCCCAGGACC |
CCTCAGCTCC |
|
2561 |
CCGGCACTGC |
AGTCTGCAGA |
GTTCTCCTGG |
AGGCAGGGGC |
TGCTGCCTTG |
TTTCACCTTC |
CATGTCAGGC |
CAGCCTGTCC |
|
2641 |
CTGAAAGAGA |
AGATGGCCAT |
GCCCTCCATG |
TGTAAGAACA |
ATGCCAGGGC |
CCAGGAGGAC |
CGCCTGCCCT |
GCCTGGGCCT |
|
2721 |
TGGCTGGGCC |
TCTGGTTCTG |
ACACTTTCTG |
CTGGAAGCTG |
TCAGGCTGGG |
ACAGGCTTTG |
ATTTTGAGGG |
TTAGCAAGAC |
|
2801 |
AAAGCAAATA |
AATGCCTTCC |
ACCTCACCGC |
AAAAAAAAAA |
AAAAAAAA |
|
>gi|214830450|ref|NM_001142299.1|Sequestosome 1 (SQSTM1), transcript variant 3
GGATTTAAAGGGGCCGCAGCACCGCCGTCGCCGGCGCCGCGAGGGGGTGGGGTGGGGGCCGGCGGCCGGGATCCCGATCG
GCTCCCGCAGCCCCGCGTGGGCTCGTGCGAGTCGGCCTCAGTGTCTGCGAGATTAATCTCTCATGGCCGCTGCACAAGAA
CCTGGCTTTTAGCTGAACTAAGGAGAAAGTCCTACAACAGTTTGGCGTGCAACATGGGGCTTGAGAAAGGATGAGGACGG
GGACTTGGTTGCCTTTTCCAGTGACGAGGAATTGACAATGGCCATGTCCTACGTGAAGGATGACATCTTCCGAATCTACA
TTAAAGAGAAAAAAGAGTGCCGGCGGGACCACCGCCCACCGTGTGCTCAGGAGGCGCCCCGCAACATGGTGCACCCCAAT
GTGATCTGCGATGGCTGCAATGGGCCTGTGGTAGGAACCCGCTACAAGTGCAGCGTCTGCCCAGACTACGACTTGTGTAG
CGTCTGCGAGGGAAAGGGCTTGCACCGGGGGCACACCAAGCTCGCATTCCCCAGCCCCTTCGGGCACCTGTCTGAGGGCT
TCTCGCACAGCCGCTGGCTCCGGAAGGTGAAACACGGACACTTCGGGTGGCCAGGATGGGAAATGGGTCCACCAGGAAAC
TGGAGCCCACGTCCTCCTCGTGCAGGGGAGGCCCGCCCTGGCCCCACGGCAGAATCAGCTTCTGGTCCATCGGAGGATCC
GAGTGTGAATTTCCTGAAGAACGTTGGGGAGAGTGTGGCAGCTGCCCTTAGCCCTCTGGGCATTGAAGTTGATATCGATG
TGGAGCACGGAGGGAAAAGAAGCCGCCTGACCCCCGTCTCTCCAGAGAGTTCCAGCACAGAGGAGAAGAGCAGCTCACAG
CCAAGCAGCTGCTGCTCTGACCCCAGCAAGCCGGGTGGGAATGTTGAGGGCGCCACGCAGTCTCTGGCGGAGCAGATGAG
GAAGATCGCCTTGGAGTCCGAGGGGCGCCCTGAGGAACAGATGGAGTCGGATAACTGTTCAGGAGGAGATGATGACTGGA
CCCATCTGTCTTCAAAAGAAGTGGACCCGTCTACAGGTGAACTCCAGTCCCTACAGATGCCAGAATCCGAAGGGCCAAGC
TCTCTGGACCCCTCCCAGGAGGGACCCACAGGGCTGAAGGAAGCTGCCTTGTACCCACATCTCCCGCCAGAGGCTGACCC
GCGGCTGATTGAGTCCCTCTCCCAGATGCTGTCCATGGGCTTCTCTGATGAAGGCGGCTGGCTCACCAGGCTCCTGCAGA
CCAAGAACTATGACATCGGAGCGGCTCTGGACACCATCCAGTATTCAAAGCATCCCCCGCCGTTGTGACCACTTTTGCCC
ACCTCTTCTGCGTGCCCCTCTTCTGTCTCATAGTTGTGTTAAGCTTGCGTAGAATTGCAGGTCTCTGTACGGGCCAGTTT
CTCTGCCTTCTTCCAGGATCAGGGGTTAGGGTGCAAGAAGCCATTTAGGGCAGCAAAACAAGTGACATGAAGGGAGGGTC
CCTGTGTGTGTGTGTGCTGATGTTTCCTGGGTGCCCTGGCTCCTTGCAGCAGGGCTGGGCCTGCGAGACCCAAGGCTCAC
TGCAGCGCGCTCCTGACCCCTCCCTGCAGGGGCTACGTTAGCAGCCCAGCACATAGCTTGCCTAATGGCTTTCACTTTCT
CTTTTGTTTTAAATGACTCATAGGTCCCTGACATTTAGTTGATTATTTTCTGCTACAGACCTGGTACACTCTGATTTTAG
ATAAAGTAAGCCTAGGTGTTGTCAGCAGGCAGGCTGGGGAGGCCAGTGTTGTGGGCTTCCTGCTGGGACTGAGAAGGCTC
ACGAAGGGCATCCGCAATGTTGGTTTCACTGAGAGCTGCCTCCTGGTCTCTTCACCACTGTAGTTCTCTCATTTCCAAAC
CATCAGCTGCTTTTAAAATAAGATCTCTTTGTAGCCATCCTGTTAAATTTGTAAACAATCTAATTAAATGGCATCAGCAC
TTTAACCAATGACGTTTGCATAGAGAGAAATGATTGACAGTAAGTTTATTGTTAATGGTTCTTACAGAGTATCTTTAAAA
GTGCCTTAGGGGAACCCTGTCCCTCCTAACAAGTGTATCTCGATTAATAACCTGCCAGTCCCAGATCACACATCATCATC
GAAGTCTTCCCCAGTTATAAAGAGGTCACATAGTCGTGTGGGTCGAGGATTCTGTGCCTCCAGGACCAGGGGCCCACCCT
CTGCCCAGGGAGTCCTTGCGTCCCATGAGGTCTTCCCGCAAGGCCTCTCAGACCCAGATGTGACGGGGTGTGTGGCCCGA
GGAAGCTGGACAGCGGCAGTGGGCCTGCTGAGGCCTTCTCTTGAGGCCTGTGCTCTGGGGGTCCCTTGCTTAGCCTGTGC
TGGACCAGCTGGCCTGGGGTCCCTCTGAAGAGACCTTGGCTGCTCACTGTCCACATGTGAACTTTTTCTAGGTGGCAGGA
CAAATTGCGCCCATTTAGAGGATGTGGCTGTAACCTGCTGGATGGGACTCCATAGCTCCTTCCCAGGACCCCTCAGCTCC
CCGGCACTGCAGTCTGCAGAGTTCTCCTGGAGGCAGGGGCTGCTGCCTTGTTTCACCTTCCATGTCAGGCCAGCCTGTCC
CTGAAAGAGAAGATGGCCATGCCCTCCATGTGTAAGAACAATGCCAGGGCCCAGGAGGACCGCCTGCCCTGCCTGGGCCT
TGGCTGGGCCTCTGGTTCTGACACTTTCTGCTGGAAGCTGTCAGGCTGGGACAGGCTTTGATTTTGAGGGTTAGCAAGAC
AAAGCAAATAAATGCCTTCCACCTCACCGCAAAAAAAAAAAAAAAAAA
|
Exon number | 9
Length | 204 nt
Location | Chromosome 5 (NC_000005.9) : 179233387...179233590 (+)
Is part of | SQSTM1, mRNA isoform 2 (NM_001142298.1)
Sequence |
GCGTCGGCTTCCGGCCGCCTTCCGCGGCCACCGCCGGGCCCGCTCCCGCCGCCGACGCCCAGGTGCGCCAGGTGCGGGCC
GGGCGGGGGTCGCGCTCACCTTTCTGGCCGCTGAGTGCCGCGTACCAGGACAGCGAGAGGAAGGCGCACAGGCAGAAGAG
CAGCAGCGTCAGGAAGGTGCCATTGCGGAGCCTCATCTCCTCGG
|
Exon number | 10
Length | 121 nt
Location | Chromosome 5 (NC_000005.9) : 179234002...179234122 (+)
Is part of | SQSTM1, mRNA isoform 3 (NM_001142299.1)
Sequence |
GGATTTAAAGGGGCCGCAGCACCGCCGTCGCCGGCGCCGCGAGGGGGTGGGGTGGGGGCCGGCGGCCGGGATCCCGATCG
GCTCCCGCAGCCCCGCGTGGGCTCGTGCGAGTCGGCCTCAG
|
Exon number | 1
Length | 300 nt
Location | Chromosome 5 (NC_000005.9) : 179247841...179248140 (+)
Is part of | SQSTM1, mRNA isoform 1 (NM_003900.4)
Sequence |
CCTCTCGAGGCGGGGCGGGGCCTCCGCGTTCGCTACAAAAGCCGCGCGGCGGCTGCGACCGGGACGGCCCGTTTTCCGCC
AGCTCGCCGCTCGCTATGGCGTCGCTCACCGTGAAGGCCTACCTTCTGGGCAAGGAGGACGCGGCGCGCGAGATTCGCCG
CTTCAGCTTCTGCTGCAGCCCCGAGCCTGAGGCGGAAGCCGAGGCTGCGGCGGGTCCGGGACCCTGCGAGCGGCTGCTGA
GCCGGGTGGCCGCCCTGTTCCCCGCGCTGCGGCCTGGCGGCTTCCAGGCGCACTACCGCG
|
Exon number | 8
Length | 1643 nt
Location | Chromosome 5 (NC_000005.9) : 179263435...179265077 (+)
Is part of | SQSTM1, mRNA isoform 2 (NM_001142298.1); SQSTM1, mRNA isoform 3 (NM_001142299.1); SQSTM1, mRNA isoform 1 (NM_003900.4)
Sequence |
AGGCTGACCCGCGGCTGATTGAGTCCCTCTCCCAGATGCTGTCCATGGGCTTCTCTGATGAAGGCGGCTGGCTCACCAGG
CTCCTGCAGACCAAGAACTATGACATCGGAGCGGCTCTGGACACCATCCAGTATTCAAAGCATCCCCCGCCGTTGTGACC
ACTTTTGCCCACCTCTTCTGCGTGCCCCTCTTCTGTCTCATAGTTGTGTTAAGCTTGCGTAGAATTGCAGGTCTCTGTAC
GGGCCAGTTTCTCTGCCTTCTTCCAGGATCAGGGGTTAGGGTGCAAGAAGCCATTTAGGGCAGCAAAACAAGTGACATGA
AGGGAGGGTCCCTGTGTGTGTGTGTGCTGATGTTTCCTGGGTGCCCTGGCTCCTTGCAGCAGGGCTGGGCCTGCGAGACC
CAAGGCTCACTGCAGCGCGCTCCTGACCCCTCCCTGCAGGGGCTACGTTAGCAGCCCAGCACATAGCTTGCCTAATGGCT
TTCACTTTCTCTTTTGTTTTAAATGACTCATAGGTCCCTGACATTTAGTTGATTATTTTCTGCTACAGACCTGGTACACT
CTGATTTTAGATAAAGTAAGCCTAGGTGTTGTCAGCAGGCAGGCTGGGGAGGCCAGTGTTGTGGGCTTCCTGCTGGGACT
GAGAAGGCTCACGAAGGGCATCCGCAATGTTGGTTTCACTGAGAGCTGCCTCCTGGTCTCTTCACCACTGTAGTTCTCTC
ATTTCCAAACCATCAGCTGCTTTTAAAATAAGATCTCTTTGTAGCCATCCTGTTAAATTTGTAAACAATCTAATTAAATG
GCATCAGCACTTTAACCAATGACGTTTGCATAGAGAGAAATGATTGACAGTAAGTTTATTGTTAATGGTTCTTACAGAGT
ATCTTTAAAAGTGCCTTAGGGGAACCCTGTCCCTCCTAACAAGTGTATCTCGATTAATAACCTGCCAGTCCCAGATCACA
CATCATCATCGAAGTCTTCCCCAGTTATAAAGAGGTCACATAGTCGTGTGGGTCGAGGATTCTGTGCCTCCAGGACCAGG
GGCCCACCCTCTGCCCAGGGAGTCCTTGCGTCCCATGAGGTCTTCCCGCAAGGCCTCTCAGACCCAGATGTGACGGGGTG
TGTGGCCCGAGGAAGCTGGACAGCGGCAGTGGGCCTGCTGAGGCCTTCTCTTGAGGCCTGTGCTCTGGGGGTCCCTTGCT
TAGCCTGTGCTGGACCAGCTGGCCTGGGGTCCCTCTGAAGAGACCTTGGCTGCTCACTGTCCACATGTGAACTTTTTCTA
GGTGGCAGGACAAATTGCGCCCATTTAGAGGATGTGGCTGTAACCTGCTGGATGGGACTCCATAGCTCCTTCCCAGGACC
CCTCAGCTCCCCGGCACTGCAGTCTGCAGAGTTCTCCTGGAGGCAGGGGCTGCTGCCTTGTTTCACCTTCCATGTCAGGC
CAGCCTGTCCCTGAAAGAGAAGATGGCCATGCCCTCCATGTGTAAGAACAATGCCAGGGCCCAGGAGGACCGCCTGCCCT
GCCTGGGCCTTGGCTGGGCCTCTGGTTCTGACACTTTCTGCTGGAAGCTGTCAGGCTGGGACAGGCTTTGATTTTGAGGG
TTAGCAAGACAAAGCAAATAAATGCCTTCCACCTCACCGCAAA
|
Primary source | Uniprot : Q13501
Name | Sequestosome-1
Alternative name(s) | EBI3-associated protein of 60 kDa; Phosphotyrosine-independent ligand for the Lck SH2 domain of 62 kDa; Ubiquitin-binding protein p62
Synonym(s) | EBIAP; p60
Organism | Homo sapiens
Length | 440 aa
Protein existence | ---
|
General annotation (Comments)
Disease
|
Defects in SQSTM1 are a cause of Paget disease of bone (PDB) [MIM:602080]. PDB is a metabolic bone disease affecting the axial skeleton and characterized by focal areas of increased and disorganized bone turnover due to activated osteoclasts. Manifestations of the disease include bone pain, deformity, pathological fractures, deafness, neurological complications and increased risk of osteosarcoma. PDB is a chronic disease affecting 2 to 3% of the population above the age of 40 years.
|
Domain
|
The UBA domain specifically binds 'Lys-63'-linked polyubiquitin chains of polyubiquitinated substrates and mediates the interaction with TRIM55.
The OPR domain mediates homooligomerization and interactions with PRKCZ, PRKCI, MAP2K5 and NBR1.
The ZZ-type zinc finger mediates the interaction with RIPK1.
|
Function
|
Adapter protein which binds ubiquitin and may regulate the activation of NFKB1 by TNF-alpha, nerve growth factor (NGF) and interleukin-1. May play a role in titin/TTN downstream signaling in muscle cells. May regulate signaling cascades through ubiquitination. May be involved in cell differentiation, apoptosis, immune response and regulation of K(+) channels.
|
Induction
|
By proteasomal inhibitor PSI and prostaglandin J2 (PGJ2) (at protein level). By phorbol 12-myristate 13-acetate (PMA).
|
PTM
|
Phosphorylated. May be phosphorylated by PRKCZ (By similarity). Phosphorylated in vitro by TTN.
|
Similarity
|
Contains 1 OPR domain.
Contains 1 UBA domain.
Contains 1 ZZ-type zinc finger.
|
Subcellular location
|
Cytoplasm. Late endosome. Nucleus. Sarcomere (By similarity). In cardiac muscle, localizes to the sarcomeric band (By similarity). Localizes to late endosomes and may also localize to the nucleus. Accumulates in neurofibrillary tangles and in Lewy bodies of neurons from individuals with Alzheimer disease and Parkinson disease, respectively. Enriched in Rosenthal fibers of pilocytic astrocytoma. In liver cells, accumulates in Mallory bodies associated with alcoholic hepatitis, Wilson disease and Indian childhood cirrhosis, and in hyaline bodies associated with hepatocellular carcinoma.
|
Subunit
|
Homooligomer or heterooligomer; may form homotypic arrays. Interacts directly with PRKCI and PRKCZ (Probable). Forms ternary complexes with PRKCZ and KCNAB2 or PRKCZ and GABBR3. Also interacts with KCNAB1, GABRR1, GABRR2 and GABRR3. Forms an NGF-induced complex with IKBKB, PRKCI and TRAF6 (By similarity). Interacts with EBI3, LCK, RASA1, PRKCZ, PRKCI, NR2F2, NTRK1, NTRK2, NTRK3, NBR1, MAP2K5, TRIM55 and MAPKAPK5. Interacts with the proteasome subunits PSMD4 and PSMC2. Interacts with K63-polyubiquitinated MAPT/TAU. Interacts with IKBKB through PRKCZ and PRKCI. Interacts with NGFR through TRAF6 and bridges that complex to NTRK1. Forms a complex with MAP2K5 and PRKCZ or PRKCI. Component of a ternary complex with PAWR and PRKCZ. Upon TNF-alpha stimulation, interacts with RIPK1, probably bridging IKBKB to the TNF-R1 complex composed of TNF-R1/TNFRSF1A, TRADD and RIPK1. Forms a complex with JUB/Ajuba, PRKCZ and TRAF6.
|
Tissue specificity
|
Ubiquitously expressed.
|
Biological process
|
ubiquitin-dependent protein catabolic process [GO:0006511]
apoptosis [GO:0006915]
anti-apoptosis [GO:0006916]
response to stress [GO:0006950]
protein localization [GO:0008104]
induction of apoptosis by extracellular signals [GO:0008624]
endosome transport [GO:0016197]
chaperone-mediated autophagy [GO:0016238]
intracellular signaling pathway [GO:0023034]
cell differentiation [GO:0030154]
regulation of I-kappaB kinase/NF-kappaB cascade [GO:0043122]
positive regulation of transcription from RNA polymerase II promoter [GO:0045944]
|
Cellular component
|
nucleoplasm [GO:0005654]
late endosome [GO:0005770]
cytosol [GO:0005829]
|
Molecular function
|
protein kinase C binding [GO:0005080]
zinc ion binding [GO:0008270]
receptor tyrosine kinase binding [GO:0030971]
SH2 domain binding [GO:0042169]
ubiquitin binding [GO:0043130]
|
With | Uniprot accession | IntAct
NBR1 | Q14596 | EBI-307104,EBI-742698
CASP8 | Q14790 | EBI-307104,EBI-78060
MLP3A | Q9H492 | EBI-307104,EBI-720768
MLP3B | Q9GZQ8 | EBI-307104,EBI-373144
RAD23A | P54725 | EBI-307104,
GBRAP | O95166 | EBI-307104,EBI-712001
PIK3CA | P42336 | EBI-307104,EBI-2116585
SQSTM | Q13501 | EBI-307104,EBI-307104
MLP3C | Q9BXW4 | EBI-307104,EBI-2603996
GBRL2 | P60520 | EBI-307104,EBI-720116
ATG8 (xeno) | P38182 | EBI-307104,EBI-2684
GBRL1 | Q9H0R8 | EBI-307104,EBI-746969
|
Alternative product(s)
Isoform | Uniprot Identifier | Difference(s) with the 'canonical' sequence | Notes
Isoform 1 | Q13501-1 | --- | 'Canonical' sequence
Isoform 2 | Q13501-2 | 1-84 : Missing | ---
|
Feature key | Position | Length | Description | Feature identifier
Region |
Compositional bias | 272 - 294 | 23 | Ser-rich | Q13501-COMPBIAS-272
Domain | 20 - 102 | 83 | OPR | Q13501-DOMAIN-20
Domain | 389 - 434 | 46 | UBA | Q13501-DOMAIN-389
Motif | 228 - 233 | 6 | TRAF6-binding | Q13501-MOTIF-228
Region | 1 - 50 | 50 | Interaction with LCK | Q13501-REGION-1
Region | 43 - 107 | 65 | Interaction with PRKCZ and dimerization (By similarity) | Q13501-REGION-43
Region | 50 - 80 | 31 | Interaction with PAWR | Q13501-REGION-50
Region | 122 - 224 | 103 | Interaction with GABRR3 (By similarity) | Q13501-REGION-122
Region | 170 - 220 | 51 | LIM protein-binding (LB) | Q13501-REGION-170
Region | 269 - 440 | 172 | Interaction with NTRK1 (By similarity) | Q13501-REGION-269
Zinc finger | 122 - 167 | 46 | ZZ-type | Q13501-ZN_FING-122
Natural variations |
Natural variant site | 117 - 117 | 1 | A -> V | VAR_023590
Natural variant site | 274 - 274 | 1 | E -> Q | VAR_023591
Natural variant site | 274 - 274 | 1 | E -> D (in dbSNP:rs55793208) | VAR_061707
Natural variant site | 387 - 387 | 1 | P -> L (in PDB) | VAR_023592
Natural variant site | 392 - 392 | 1 | P -> L (in PDB; no effect on polyubiquitin-binding) | VAR_023593
Natural variant site | 399 - 399 | 1 | S -> P (in PDB) | VAR_023594
Natural variant site | 404 - 404 | 1 | M -> V (in PDB; loss of polyubiquitin-binding) | VAR_023596
Natural variant site | 404 - 404 | 1 | M -> T (in PDB) | VAR_023595
Natural variant site | 411 - 411 | 1 | G -> S (in PDB; no effect on polyubiquitin-binding) | VAR_023597
Natural variant site | 425 - 425 | 1 | G -> R (in PDB; loss of polyubiquitin-binding) | VAR_023598
Alternative sequence | 1 - 84 | 84 | Missing | VSP_015841
Amino acid modifications |
Modified residue | 24 - 24 | 1 | Phosphoserine (By similarity) | Q13501-MOD_RES-24
Modified residue | 148 - 148 | 1 | Phosphotyrosine | Q13501-MOD_RES-148
Modified residue | 170 - 170 | 1 | Phosphoserine | Q13501-MOD_RES-170
Modified residue | 176 - 176 | 1 | Phosphoserine | Q13501-MOD_RES-176
Modified residue | 207 - 207 | 1 | Phosphoserine | Q13501-MOD_RES-207
Modified residue | 266 - 266 | 1 | Phosphoserine | Q13501-MOD_RES-266
Modified residue | 269 - 269 | 1 | Phosphothreonine | Q13501-MOD_RES-269
Modified residue | 272 - 272 | 1 | Phosphoserine | Q13501-MOD_RES-272
Modified residue | 276 - 276 | 1 | Phosphoserine | Q13501-MOD_RES-276
Modified residue | 277 - 277 | 1 | Phosphoserine | Q13501-MOD_RES-277
Modified residue | 328 - 328 | 1 | Phosphoserine | Q13501-MOD_RES-328
Modified residue | 332 - 332 | 1 | Phosphoserine | Q13501-MOD_RES-332
Modified residue | 355 - 355 | 1 | Phosphoserine | Q13501-MOD_RES-355
Modified residue | 361 - 361 | 1 | Phosphoserine | Q13501-MOD_RES-361
Modified residue | 365 - 365 | 1 | Phosphoserine | Q13501-MOD_RES-365
Modified residue | 366 - 366 | 1 | Phosphoserine | Q13501-MOD_RES-366
Modified residue | 370 - 370 | 1 | Phosphoserine | Q13501-MOD_RES-370
Experimental info |
Mutagenesis | 7 - 7 | 1 | K->A: Loss of interactions with PRKCZ, PRKCI and NBR1. Loss of dimerization; when associated with A-69 | Q13501-MUTAGEN-7
Mutagenesis | 9 - 9 | 1 | Y->F: No effect on interaction with LCK | Q13501-MUTAGEN-9
Mutagenesis | 13 - 13 | 1 | K->A: No effect on interaction with PRKCI | Q13501-MUTAGEN-13
Mutagenesis | 21 - 22 | 2 | RR->AA: Loss of interaction with PRKCI. Alters dimerization | Q13501-MUTAGEN-21
Mutagenesis | 67 - 67 | 1 | Y->A: No effect on interaction with PRKCZ | Q13501-MUTAGEN-67
Mutagenesis | 69 - 69 | 1 | D->A: No effect on interactions with PRKCZ, PRKCI and NBR1. Loss of dimerization; when associated with A-7 | Q13501-MUTAGEN-69
Mutagenesis | 71 - 71 | 1 | D->A: No effect on interaction with PRKCI | Q13501-MUTAGEN-71
Mutagenesis | 73 - 73 | 1 | D->A: No effect on interactions with PRKCZ and PRKCI | Q13501-MUTAGEN-73
Mutagenesis | 80 - 80 | 1 | D->A: No effect on interaction with PRKCI | Q13501-MUTAGEN-80
Mutagenesis | 82 - 82 | 1 | E->A: No effect on interaction with PRKCI | Q13501-MUTAGEN-82
Mutagenesis | 398 - 398 | 1 | L->V: No effect on polyubiquitin-binding | Q13501-MUTAGEN-398
Mutagenesis | 406 - 406 | 1 | F->V: Loss of polyubiquitin-binding | Q13501-MUTAGEN-406
Mutagenesis | 413 - 413 | 1 | L->V: No effect on polyubiquitin-binding | Q13501-MUTAGEN-413
Mutagenesis | 417 - 417 | 1 | L->V: Loss of polyubiquitin-binding | Q13501-MUTAGEN-417
Mutagenesis | 431 - 431 | 1 | I->V: Partial loss of polyubiquitin-binding | Q13501-MUTAGEN-431
|
1 |
MASLTVKAYL |
LGKEDAAREI |
RRFSFCCSPE |
PEAEAEAAAG |
PGPCERLLSR |
VAALFPALRP |
GGFQAHYRDE |
DGDLVAFSSD |
|
81 |
EELTMAMSYV |
KDDIFRIYIK |
EKKECRRDHR |
PPCAQEAPRN |
MVHPNVICDG |
CNGPVVGTRY |
KCSVCPDYDL |
CSVCEGKGLH |
|
161 |
RGHTKLAFPS |
PFGHLSEGFS |
HSRWLRKVKH |
GHFGWPGWEM |
GPPGNWSPRP |
PRAGEARPGP |
TAESASGPSE |
DPSVNFLKNV |
|
241 |
GESVAAALSP |
LGIEVDIDVE |
HGGKRSRLTP |
VSPESSSTEE |
KSSSQPSSCC |
SDPSKPGGNV |
EGATQSLAEQ |
MRKIALESEG |
|
321 |
RPEEQMESDN |
CSGGDDDWTH |
LSSKEVDPST |
GELQSLQMPE |
SEGPSSLDPS |
QEGPTGLKEA |
ALYPHLPPEA |
DPRLIESLSQ |
|
401 |
MLSMGFSDEG |
GWLTRLLQTK |
NYDIGAALDT |
IQYSKHPPPL |
|
>sp|Q13501|SQSTM_human Sequestosome-1
MASLTVKAYLLGKEDAAREIRRFSFCCSPEPEAEAEAAAGPGPCERLLSRVAALFPALRPGGFQAHYRDEDGDLVAFSSD
EELTMAMSYVKDDIFRIYIKEKKECRRDHRPPCAQEAPRNMVHPNVICDGCNGPVVGTRYKCSVCPDYDLCSVCEGKGLH
RGHTKLAFPSPFGHLSEGFSHSRWLRKVKHGHFGWPGWEMGPPGNWSPRPPRAGEARPGPTAESASGPSEDPSVNFLKNV
GESVAAALSPLGIEVDIDVEHGGKRSRLTPVSPESSSTEEKSSSQPSSCCSDPSKPGGNVEGATQSLAEQMRKIALESEG
RPEEQMESDNCSGGDDDWTHLSSKEVDPSTGELQSLQMPESEGPSSLDPSQEGPTGLKEAALYPHLPPEADPRLIESLSQ
MLSMGFSDEGGWLTRLLQTKNYDIGAALDTIQYSKHPPPL
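The feature positions listed for Q13501 can be cross-checked against this sequence. The Python sketch below confirms that the reference residue of each natural variant matches the canonical sequence, applies one substitution, and derives isoform 2 by dropping residues 1-84. The helper names (`apply_variant`, `isoform2`) are hypothetical, and positions are assumed to be 1-based as in the feature table.

```python
# Canonical Q13501 sequence, copied from the FASTA record above.
SEQ = (
    "MASLTVKAYLLGKEDAAREIRRFSFCCSPEPEAEAEAAAGPGPCERLLSRVAALFPALRPGGFQAHYRDEDGDLVAFSSD"
    "EELTMAMSYVKDDIFRIYIKEKKECRRDHRPPCAQEAPRNMVHPNVICDGCNGPVVGTRYKCSVCPDYDLCSVCEGKGLH"
    "RGHTKLAFPSPFGHLSEGFSHSRWLRKVKHGHFGWPGWEMGPPGNWSPRPPRAGEARPGPTAESASGPSEDPSVNFLKNV"
    "GESVAAALSPLGIEVDIDVEHGGKRSRLTPVSPESSSTEEKSSSQPSSCCSDPSKPGGNVEGATQSLAEQMRKIALESEG"
    "RPEEQMESDNCSGGDDDWTHLSSKEVDPSTGELQSLQMPESEGPSSLDPSQEGPTGLKEAALYPHLPPEADPRLIESLSQ"
    "MLSMGFSDEGGWLTRLLQTKNYDIGAALDTIQYSKHPPPL"
)

# (position, reference residue, variant residue) from the natural variant rows.
VARIANTS = [
    (117, "A", "V"), (274, "E", "Q"), (274, "E", "D"), (387, "P", "L"),
    (392, "P", "L"), (399, "S", "P"), (404, "M", "V"), (404, "M", "T"),
    (411, "G", "S"), (425, "G", "R"),
]


def apply_variant(seq, pos, ref, alt):
    """Return seq with the residue at 1-based position pos changed from ref to alt."""
    if seq[pos - 1] != ref:
        raise ValueError(f"expected {ref} at {pos}, found {seq[pos - 1]}")
    return seq[:pos - 1] + alt + seq[pos:]


def isoform2(seq):
    """Isoform 2 as described in the record: residues 1-84 missing."""
    return seq[84:]


# Variants whose reference residue does not match the canonical sequence.
mismatches = [v for v in VARIANTS if SEQ[v[0] - 1] != v[1]]

p392l = apply_variant(SEQ, 392, "P", "L")
iso2 = isoform2(SEQ)

print(len(SEQ), len(iso2), mismatches)
```

Raising an error on a reference mismatch is a deliberate choice in this sketch: it catches off-by-one mistakes when positions are copied from a table into code.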
|