(CTSB) cathepsin B [Homo sapiens]

Gene
Accession | 4779
Official symbol | CTSB
Official name | cathepsin B
Gene type | gene with protein product
Organism | Homo sapiens
Location | Chromosome 8 (NC_000008.10) : 11700032...11725645 (-)
Map | 8p23.1
Length | 25614 nt

Transcript(s)
NM_001908.3 | CTSB, mRNA isoform 1
NM_147780.2 | CTSB, mRNA isoform 2
NM_147781.2 | CTSB, mRNA isoform 3
NM_147782.2 | CTSB, mRNA isoform 4
NM_147783.2 | CTSB, mRNA isoform 5

Protein(s)
Accession | Name | Organism | Length
P07858 | Cathepsin B | Homo sapiens | 339 aa

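The Length value in the gene record is consistent with the Location coordinates if both endpoints are counted. The sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; the function name `span_length` is hypothetical and only for illustration.

```python
import re


def span_length(location: str) -> int:
    """Return the inclusive length of a 'start...end' coordinate span."""
    match = re.search(r"(\d+)\.\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"no 'start...end' span found in {location!r}")
    start, end = (int(group) for group in match.groups())
    # Assumption: both endpoints belong to the gene, hence the +1.
    return abs(end - start) + 1


location = "Chromosome 8 (NC_000008.10) : 11700032...11725645 (-)"
print(span_length(location))  # 25614, matching the Length field
```

The strand marker `(-)` does not affect the length; it only indicates that the gene is read from the reverse strand of NC_000008.10.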
Synonyms | CPSB; APPS
Alternative name(s) | cysteine protease; amyloid precursor protein secretase; cathepsin B1; preprocathepsin B; APP secretase

Summary
The protein encoded by this gene is a lysosomal cysteine proteinase composed of a dimer of disulfide-linked heavy and light chains, both produced from a single protein precursor. It is also known as amyloid precursor protein secretase and is involved in the proteolytic processing of amyloid precursor protein (APP). Incomplete proteolytic processing of APP has been suggested to be a causative factor in Alzheimer disease, the most common cause of dementia. Overexpression of the encoded protein, which is a member of the peptidase C1 family, has been associated with esophageal adenocarcinoma and other tumors. At least five transcript variants encoding the same protein have been found for this gene. [provided by RefSeq].
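The nucleotide listing that follows is laid out in rows: a line with the 1-based position of the row's first base, then blocks of 10 bases, each followed by a `|` separator. As a minimal sketch, assuming that layout holds throughout, the rows can be joined into one contiguous string while checking each position index against the number of bases read so far. The function name `parse_listing` is hypothetical.

```python
def parse_listing(lines):
    """Join a position-indexed nucleotide listing into one sequence string.

    Assumed layout: a line with the 1-based position of the next base,
    followed by lines of bases; '|' separators are ignored.
    """
    blocks = []
    length = 0
    for raw in lines:
        token = raw.replace("|", "").strip()
        if not token:
            continue  # separator-only line
        if token.isdigit():
            # A row index must equal the number of bases read so far plus one.
            if int(token) != length + 1:
                raise ValueError(f"row index {token} but {length} bases read")
            continue
        if set(token) - set("ACGTN"):
            raise ValueError(f"unexpected characters in block {token!r}")
        blocks.append(token)
        length += len(token)
    return "".join(blocks)


# First row of the listing plus the first block of the second row.
rows = [
    "1 |", "GGGCGGGGCC |", "GGGAGGGTAC |", "TTAGGGCCGG |", "GGCTGGCCCA |",
    "GGCTACGGCG |", "GCTGCAGGGC |", "TCCGGCAACC |", "GCTCCGGCAA |", "|",
    "81 |", "CGCCAACCGC |",
]
sequence = parse_listing(rows)
print(len(sequence))  # 90
```

Checking the index on every row catches dropped or duplicated blocks early, which is useful when a listing has been copied out of a web page.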
1 |
GGGCGGGGCC |
GGGAGGGTAC |
TTAGGGCCGG |
GGCTGGCCCA |
GGCTACGGCG |
GCTGCAGGGC |
TCCGGCAACC |
GCTCCGGCAA |
|
81 |
CGCCAACCGC |
TCCGCTGCGC |
GCAGGCTGGG |
CTGCAGGCTC |
TCGGCTGCAG |
CGCTGGGTGA |
GTGCTGGGGA |
CCCGGGGCCA |
|
161 |
CCGCAGCGTA |
AGTGACCTTG |
GCGGGGACGG |
TGCTACCCGG |
CCGCCGAGAC |
GGGTTCCTCT |
GCGCCCTCAG |
TCGGGCCCAG |
|
241 |
GCGCGGCCCC |
GCGGCGTCCC |
TGGGGGCCGG |
CGGGGAGCCG |
GGACCCTCGG |
GACTGTCCCT |
GACGGGCGGG |
CTGGGGTGGG |
|
321 |
AGTCCGCGCG |
CTCCGAAGCG |
TCGGCGAGAA |
AAGCAGAAAA |
CAGCTCCGCC |
CGCCAGCCCT |
CTGCCCTCCG |
CTCCCCTCCC |
|
401 |
CGGGCTGTGC |
GCCGGACCCC |
GGCCCTCGGA |
GCGGGGACGC |
GGCCAGGACC |
GCCGAGGGAG |
GCGCCTGCGA |
GGAAGAGCTC |
|
481 |
GGCCGGGTCC |
GGAGACTGCT |
GCCTGGGACC |
GCGCTCCCAG |
CGCCTGGGCC |
TCGGTGTCTC |
CGGGCCAAAC |
TGCCGACATA |
|
561 |
ATCGCATCTG |
CCGGCATCTA |
TTTTCGGTTT |
ATTTCCCCCT |
CATTGCGAAG |
GATTTGCCTG |
GCCAACTTTC |
TGCGCAAGAT |
|
641 |
CCCACGCAAT |
TCCTGGGACC |
CCAGAAGACA |
GGTCCTGTTG |
AAGAACAGGA |
ATCTGGCACT |
GGGTGGGCTG |
GGGAGGAAGC |
|
721 |
CGCACGGTGT |
TAAATCCATA |
AACAGGAAGA |
GAAACCAGAC |
AGCGAAACCA |
AGAGGCGAAT |
GGGCGATTGG |
ATGCCGGTGG |
|
801 |
GGAGAAGGCC |
GGGGGCGCAC |
CCTGCTCCTG |
GACTCCAGTA |
AAGGGAGGCC |
GGGCAGAGTC |
CCTGGGGCGC |
CACCTCCCCC |
|
881 |
TCGGTTAGTA |
GCCCTGGAGG |
CCGGGGGGAG |
TTGGCCTCTG |
GGGAGCAGTG |
GGTGCTGGGT |
GTGGGGCGTT |
GCAGGCAGGC |
|
961 |
TGGGGTGGGC |
GACCCAGGTG |
GAAGTGAATT |
GCACTTGGCT |
TCCTGGTGGG |
CCTCTGTCAC |
CCCCTTCCCA |
GGCGCTGAGA |
|
1041 |
AAGCCAGCAG |
GCTGGCAAAG |
AAAAGGACCC |
TAGCGCAGGC |
CCCACACTCC |
TCCTCCTAAC |
GGACGAGAGA |
CCCCCCAAAC |
|
1121 |
CCACTGGAGA |
AGTGACGCTG |
TGGGGTTCAA |
ATGCAGACCT |
GGCACCTTTT |
TGTAGCCTGG |
AAAAACATTC |
CCACTGCCTG |
|
1201 |
CTGCCGGAGG |
AGAGGATAGC |
TGAGATGCAC |
TCTCTTTGAA |
TCCAAACGTT |
CAGGAACGTA |
AGGCGAAGAG |
GCCTAAGAGG |
|
1281 |
GCGTTGGCTG |
GCTCTGTCTC |
TCAGGCTGGA |
GCACAGTGGC |
GCGATCTCGG |
CTCACTACAA |
CTTCCGCCTC |
CCAAATTCAA |
|
1361 |
GCTATTCTCC |
TGCCTCAGCC |
TCCCGAGTAG |
CTGGGATTAC |
AGGTGCCCGC |
CACCACGCCC |
AGCTAATTTT |
TGTATTTTTA |
|
1441 |
GTAGAGATGG |
GGTTTCACCA |
TGTTGACCAG |
GCAGATCTTG |
ACCTCCTGAC |
CTCAGGTGAT |
CCGCCTGTCT |
CGGCCTCCCG |
|
1521 |
GTGAGTCACG |
GTGCCTGGCC |
AAGAACTGTT |
TCTTGTTGGC |
TCTGGTGCTG |
GTGACTTAGA |
ACCCGCCAGC |
TCCTGGAGAA |
|
1601 |
AGGGGCTGGG |
CCGCCCACCC |
TGTGTAGCTT |
TCCCAAAGAC |
AGAGTCAAAC |
GTCTCCTGGA |
GAACAGAGGC |
TTCCCTTCGT |
|
1681 |
CTTTGGTCAT |
TTGTCCTCTA |
GCTGGGGGTA |
CCCCCTGGTG |
GAAAGGCACA |
GGTCCCTTGC |
TCCCCAGGTG |
GCAACGCAGG |
|
1761 |
CCAGACACGG |
CCCTGGCACA |
GCTCTCCTGG |
GTGTTGGCTC |
AGGACAGCCC |
TGTTTCCAAC |
TGGTTAGGCG |
GTGAGGGGTG |
|
1841 |
GTGGCCCTTT |
GGTTCCAGGT |
TGAAACTGCC |
CATGTGGTGC |
TGATTTAGCA |
GACTGGGGAG |
GCTCTTTTTG |
TAGGCAGGTT |
|
1921 |
CTTTTCTTTC |
CCCAGCTGCT |
GGACCTGGGA |
GTTGGAAGAG |
AAGTTGCACC |
CATTTTAGGG |
GTAACAGATA |
TTTTCTGTTG |
|
2001 |
CTCTTGGTTG |
GATTGGGAAG |
TGAATTGAAG |
GGAGGTCACG |
TTTCAGGGGT |
GCCTTGGGAT |
GTCTGTCAGT |
GATTTTCTTT |
|
2081 |
TCTTTCTTAA |
TTTCTTTTTC |
TTTCTTTTTT |
TTTTTTTTTT |
GAGACACACT |
CCCTCTATCG |
CTCAGGCTGG |
AGTGCAGTGG |
|
2161 |
TGCGATCTCG |
GTTCACTGCC |
ACCTCCGCCT |
CTCATGTTGA |
AGCAATTCTC |
CTGTCTCAGC |
CTCCCTCCCA |
AGTAGCTGGG |
|
2241 |
ATTGCCAGTG |
CCCATCACCA |
CACCTGGCTT |
TTTTTTTTTT |
TTTTGTATTT |
TTAGTAGAGA |
CGGGCTTTCA |
CCATGTTAGC |
|
2321 |
CAGGCTGGTT |
TTCGAACTCC |
TGATCTCAAG |
TGATCCGCCT |
CAGCCTCCCA |
AAGTGGTAGG |
ATTACAGGCA |
TGAGCCACCG |
|
2401 |
CGCGGTGGAG |
GGGTAATTTT |
CTTAAATCTG |
GTAATGAGTT |
GTGGTTGTGT |
AGAGTAACAT |
ACCGTCCTTT |
CGAGATATGG |
|
2481 |
ACTGAAACAT |
TGAGAGGGAG |
GAGTTACAGG |
TATGTCGATT |
CTTCTTTTCT |
CTCTCTCCTT |
TTTTTTTTTG |
AGGTGGAGTC |
|
2561 |
TGACTCTCTC |
ACCCAGGCTG |
GAGTGCAGTG |
GCAAGATCTC |
AGCTCACTGC |
AACTCCGCTT |
CCTGTGTTCA |
AGCCATTCTC |
|
2641 |
CTGCCTCAGT |
CTCTCAACTA |
GCTGCGATTA |
CAGGCATGTG |
CCTCCACACT |
CAGCTAATTT |
TTTTATTTTT |
AGTAGAGATT |
|
2721 |
TTTTGTCTCT |
CCTAAAAAAA |
TCCAATGTAA |
AAAAATCCCA |
ATGTGGGGGT |
TTTGCCACGT |
TGGCCAGGCT |
GGTCTCGAAC |
|
2801 |
TCCTGACCTC |
GTGATCCGCC |
CACCTCGGCC |
TCCCAAAGCG |
CTGGGATTAC |
AGGTATGAGC |
CACTGCGCCC |
AGTCTGCTGC |
|
2881 |
CTCTTTTCAA |
TGGTCTGGCC |
TAAGGAAATT |
ATTGGAAACA |
TGTGCGGTTG |
AGTGATATTT |
ACTGGGCACT |
TCCACATGGT |
|
2961 |
CCATGTAAAG |
GGAGATGGTT |
GGGGTGACAG |
GCAGTTGAGT |
CTAGGGGAGG |
CATGTACAGA |
TGTGCTGTGC |
CTCTGGGATA |
|
3041 |
TCAGGGTGGC |
AGGCAGCAGT |
CTCTACGTCT |
GGTCCCAGGC |
TGCCTGAGAA |
AGAGCATGTG |
GGAGGCAAAC |
CTTGCGCCCT |
|
3121 |
GGCATGGTTG |
TTAATGTTTA |
TATTTACCCT |
AGCTTGTGTG |
GGGTAGGAGG |
TTTAGGGATC |
AAATTCCACT |
CTGTGTTTAG |
|
3201 |
ACATTTTTTT |
CTTTCTTTTT |
TTTTTTGAGA |
CAGTTTCACT |
CTGTCACCCA |
GGCTGGAATG |
CCGTGGCACA |
ATCTCAGCTC |
|
3281 |
ACTGCAACCT |
CACCTCCCAG |
GTTCAAGCAA |
TGTTCCTGCC |
TCAGCCTCCG |
GAGTAGCTGG |
GATTAGAGGC |
ATGCACCACC |
|
3361 |
ATGCCTGGCT |
AATTTTTGTA |
TTTTTAGTAG |
AGACCTCAAA |
TGATCTGCCC |
GCCTCGCCTC |
CCAAAGTGTT |
GGGATTACAG |
|
3441 |
GTGTGAGTCA |
CTGTGCCCAG |
CCATGTGTTT |
AGACTTTTAA |
CTAATCTCTT |
TTTTAGTTTC |
AAGCCTTATC |
GTCCGCCTCT |
|
3521 |
GTAGACCACT |
CCTGTGCCTG |
TTTCCTGATC |
CTTCCAAGGG |
CCATTGTATT |
CCCTGTCTGC |
TGCCCCTCTT |
TTGGATTCTT |
|
3601 |
CTGCACATTT |
TTTTGTTCAT |
GCATTCATTC |
ATTTATTGTT |
TGATTAATGA |
CAGGGTCTTG |
CTTTGTCTCC |
CAGGCTGGTG |
|
3681 |
TGCAGTGGTG |
CGACCACGGC |
TCACGGCAGC |
CTCAGCCACC |
CAGATGTAAG |
CGATCTGGTT |
CCCACCTCAG |
CCTCCCGAGT |
|
3761 |
AGTAAGTAGC |
TGGGACCACA |
GGCGGTGCCC |
AGGTTTTTTT |
TTTTTTTTTT |
TTTTTTTTCG |
TTGGTAGAGA |
CAGGGTCTCA |
|
3841 |
CTGTTGCCAG |
GACTGCTCTG |
AAACTCCTGG |
TTTCAAGTGA |
TCCCCTTGCC |
TCCTCCCACT |
TAGGCCTTCC |
AAATGCTGGG |
|
3921 |
ATTACAAGGC |
ATGAGTCACC |
TCCAGGCCTT |
TTTGTACTTT |
TAAAACTCTG |
CATCAGTGTA |
TAAACAATGT |
TATTAAAGTT |
|
4001 |
TATATGACTT |
CAGTTACACT |
ACATGGATCC |
TTTTTCACTC |
ACGGTTGTGA |
GATTTATTTC |
TGTTGCTACA |
TCCAGTTCTA |
|
4081 |
GTCCATTTGT |
GTTAAGTGCC |
CAGTGTGTTT |
ATCTACTGAG |
GGACAGTTAT |
GTTATTTCGT |
GTTCACTATT |
ATCCCATGCT |
|
4161 |
ACAATAAATA |
TCCTGTGTCT |
CCCAGATACT |
TAGAAGAGTT |
TCTGCAGGGC |
ACATGTGGGA |
GAGTTTGTTT |
CTGGGTCATG |
|
4241 |
AGGTGTGTTC |
ATCTTCCATC |
TTGCTAGATG |
CTGGCAAAGG |
GTTCTCCAGT |
GTGGTTGCAT |
CAATTTACTC |
GCCCAGCAGT |
|
4321 |
GTGCAGAGTT |
CCTGTTCCTC |
ACATTTACCA |
ACACTAGATA |
GTACCAGACT |
TTGATTTTTG |
CCAATCTGAT |
GGGTTTGAAG |
|
4401 |
TGGTACCCCT |
GTTTTAATTT |
CATGACCAGA |
AATTCAAATT |
TAATCTTTGC |
TGTAGGACAA |
CAGTAACCCA |
CTTATGCCTA |
|
4481 |
GTGTTCCATT |
ATTAGAACGC |
TAAGCATGTG |
GGAGTTTTTA |
CATCATACTG |
CTCAAGGTCA |
TCGCCAAGGT |
CTGATGTTTT |
|
4561 |
TACTCGTGCA |
AAAATTTAAA |
AAATTGCAAC |
CTCTGGCATA |
AATGGGTTGA |
GTGACACTTT |
TCCTGTTTTT |
ATTGTTGGTC |
|
4641 |
AGTGATGGCA |
TATTTGCTGG |
GTTTTTTTGT |
TTTTTTTTTG |
AAACGGAGTC |
TCACTGTGTC |
GCCAGGCTGG |
AGTGCAGTGG |
|
4721 |
TGTAATCTCG |
GCTCACTGTA |
ACCTCCGCCT |
CCCGGGTTCA |
AGTGATTCTC |
CTGCCTCAGC |
CCCCTGAGTA |
GCTGGGATTA |
|
4801 |
CAGGCGTGTG |
CCACCACGCC |
CAGCTAATTT |
TTGTATTTTT |
AGTAGAGACG |
GGATGTCACT |
ATGTTTGTCG |
GGCTGGTGTT |
|
4881 |
GAACTCCTGA |
GCTCATGATC |
TGCCTGTCTT |
GGCTTCCCTA |
AGTGCTGAGA |
TTACAGGCCT |
GAGCCACCGC |
TAGCCTATTT |
|
4961 |
ATTTTTTATT |
TTAAATTTTA |
ATTTTCTATA |
TAGAGACGAG |
GTCACTATCC |
TGCCCAGGCT |
GGTCTTAATC |
TCCTGGGTTC |
|
5041 |
AGTCAATCTT |
CCCACCTTGG |
CCTCCCGAAG |
TGTCAGAATT |
ATAGATGTGA |
GCCACTGTGC |
TCAGCCCAGA |
ACTGATGTTT |
|
5121 |
TCTAAATGCT |
GGGTGCTGAG |
AAGGATGTGT |
GGCTGGCAGT |
CTTGACTGTG |
TTATCTGTCT |
TTACCAGGCC |
AGTAACTTCT |
|
5201 |
TTGGTCTGGT |
CATCAAGATA |
ATCTAGCATC |
ACCAGCAAGC |
ATGCATGGAG |
AAGGATGGGC |
CCAATGTGGC |
CAAGATGGTA |
|
5281 |
ACGGGACCAG |
TAGAGAGCCC |
TGTAGAAGAC |
ATCTAGATAT |
TCTGCCCTAA |
GAGCCCGGAG |
GGCCGGGCTG |
TCTCATGACC |
|
5361 |
CTCTGACGTG |
CTGACCTGGA |
CTCTGGCAGA |
ATGTGCACAC |
ACACAGTCAC |
ACAGCTTCCT |
GGCTTGCGCA |
AGTCCCAGGA |
|
5441 |
GGGCGGTGCC |
AGCCACAGGC |
TTTTCCCATT |
CGAGGGTTGG |
AAGCGTATCA |
TCAAACCACA |
TCAGAGTGCT |
GGGGGCCACC |
|
5521 |
TGCCACCCAT |
TCCCAACCCA |
CTCAGCCTTC |
CTGGTGTTTG |
GGACATGCTT |
TGCTTTGGCA |
GTCAAGACAG |
CAGAACAAAT |
|
5601 |
CAACTTTTAA |
GGCCTTGTCA |
CTGATAGTAC |
AATTTCCATT |
ATTTTTCATC |
CAAATTAGGA |
TACTTCTGAA |
AATAGAAATG |
|
5681 |
ATGACTCTGG |
GATGCAAACG |
TTGGCTGTCC |
TATGTATAAG |
GAGATGGCTT |
TTCACGCTCC |
CAGTGACTGA |
GGAAGTTTCT |
|
5761 |
CCCAGATGGC |
GCTGCTCTGA |
GCCTGGTGCA |
GGGTAGGCAC |
TTTCAAAAGA |
GTGTCTCCTT |
GTATCTTCCA |
TCAGCCTTGC |
|
5841 |
GAGATGGGTA |
TCTGTTCCCA |
GGGCCCCAAG |
GGAGGAAAAC |
AGGACCTAGC |
TGGATCCAAG |
AGCTAGGCCT |
TTCTTTTTTT |
|
5921 |
TTTTTTTTTT |
GAGATGGAGT |
CTGACTCTGT |
CGCCCAGGCT |
GGAGTGCAGT |
GGCGTGATCT |
CGGCTCACTG |
CAAACTCCGC |
|
6001 |
CTCCCGGGTT |
CACGCCATTC |
TCCTGCCTCA |
GCCTCCCGAG |
TAGCTGGGAC |
TACAGGCGCC |
TGCCACCATG |
CCTGGCTATT |
|
6081 |
TTTTTGTATT |
TTTAGTAGAA |
ACGGGGTTTC |
ACCGTGTTAT |
CCAGGATGGT |
CTCGATCTCC |
TGACCTCGTG |
ATTCGCCCAC |
|
6161 |
CTCAGCCTCC |
CAAAGTACTG |
GGATTACAGG |
CGTGAGCCAG |
CATGCCCGGC |
CCAGAGCTAG |
GCCTTTCTGT |
GGCTGGCCTT |
|
6241 |
CGCGTCAGCC |
TCAACTACCC |
TGGTGTAATC |
TCGCCTGCGG |
TTGAATTAGG |
GAACCGCCGT |
GTTCTGCAAG |
CTGGAGAGGC |
|
6321 |
AGAACACTAA |
TGAGCAGAAC |
ACTAATCTCA |
TTGCAATCTC |
AAAGGATCTC |
TAAAAGCTTT |
TATAAAGCAG |
GCCCAAGGTC |
|
6401 |
CTTTGGTATC |
CGATGCAGAC |
GTGGTGAATG |
CATTGGCTCT |
GTCAGCATCT |
GAGCAAGTCA |
GTAACAGAAA |
TGGGGAGTAA |
|
6481 |
AAGCTTTCAG |
AACTTTCCAG |
AATATTGACT |
AAATTGTCTT |
GTTTACAACC |
AACAACGACA |
ACAAAAAATA |
ACTGCTGAGG |
|
6561 |
GCCTTCGTAG |
TGTCTGCTGT |
TTCAAGTGTA |
CAGTAGTCAT |
TTTGTCTGCA |
GGATGTGGGG |
TTGCTGTGGC |
TGACCTTGTA |
|
6641 |
CAATATTCCA |
CTCATAGGTG |
TCTTCAGGCC |
TATGGAGAGC |
AGCTTGCGTG |
GGCTGGGCCT |
GCAGTACCTG |
GTTTGCATAG |
|
6721 |
ATGATTGGCA |
GGTGGGCAGC |
ACGGGGAAGG |
ACCTGTGAGT |
GGCCAACCTG |
GTTCAGGTGA |
GGGAGGTGGA |
GTGGGGCTTC |
|
6801 |
TCTGCTTCCC |
CTGGTTCCCT |
GGAAGCCTCC |
AAGGCTGGTG |
AGCATCACTG |
CTGCCTCTGC |
ACACCTGTGT |
GCTGGGTGGG |
|
6881 |
TTTTCTGACA |
GGTTTTCAGT |
TGCTTCGGGG |
CTACAGCTGC |
AGGGAGCCTG |
CTCCATGGGA |
CAGATGGGCC |
TCTGGTGCCC |
|
6961 |
GTTCATCAGG |
GGACTGATGA |
GACCGAGGCC |
TGAGAGCCCT |
TTGGATTTTG |
TTTTTGTCCT |
TAATTTAATC |
ATAAGCCAAG |
|
7041 |
AATCTACTAA |
ACACAGTTCC |
ATTAGGGGCA |
AAGACGTAAC |
ACATCAGAGG |
CCACAGCAAG |
GCTGTGATTC |
ATACTCAAAA |
|
7121 |
AGGAAAGGTC |
TCTGGGTCAC |
AACAGAGCAT |
AGTTGAGGTC |
AGCACACTCC |
CACCCAGTGC |
AGGGCTGCTC |
CAGCATTGAG |
|
7201 |
GTGTGTCTGG |
CAGGTTGAAG |
TAGGGGAAGA |
TGAAACTCGC |
CGAAGTCTTG |
TTTTGTGGTT |
GCACTTAAGT |
GGTCAAAACT |
|
7281 |
TCAGGAGCAA |
CTGCCGTTAT |
TAGCGGTGAG |
TGCCAAGACT |
AGTTTTTATA |
GAAGAGAAAG |
AAACAAAGTA |
CTCTGGGAAG |
|
7361 |
GTCTTACTGA |
GCCTTCACAG |
TCTCCCCACC |
TTTCCACTGT |
TCCCGTGCTC |
TTAGCCGCTC |
TGCTGGCCTA |
TAAGGCACAG |
|
7441 |
TCTTCATTTG |
TGGCTTCTGG |
CAAAATGTAA |
GCACTTGACT |
TTTGTTTTTG |
TTTTGTTTTG |
TTTTGTTTTT |
TTGAGACGGA |
|
7521 |
GTTTCCCTCT |
TGTTGCCCAA |
GGTGGAGTGC |
AATGGTGCGA |
TCTCAGCCCA |
CTGCAGCCTC |
CACCTCCTGG |
GTTCAAGCAA |
|
7601 |
TTGTCCTGCT |
TCAGCCTCCC |
GAGTAGTTGG |
GATTATAGGT |
GCACAACCAC |
CACGCCTGGC |
TAATTTTTTG |
TATTTTTAGT |
|
7681 |
AGAAATGGGA |
TTTCACCATG |
TTAGCCAGGC |
TGGTCTCGAA |
CTCCTGACCT |
CAGGTGATCC |
ACCTCCTTGG |
CCTCCCAAAG |
|
7761 |
TGCTGGGATT |
ACAGGTGTGT |
GCCACTGTGC |
CCGGCCAACT |
TTCAATTCTT |
TAGAGCTGAC |
TATGAGAGGA |
GCCAGCAGTA |
|
7841 |
TAGCCACAGC |
ACCAACGAAT |
GAGGAAGAGC |
AAAATACTGC |
ATGACAGCTT |
TGCTAAGAAT |
TCTTTCACTT |
TTTTTGTCTA |
|
7921 |
TCAGCCAGGA |
GCTAGCAACT |
TGGCTTATTT |
GGAAATTTTA |
AGTGTACATA |
TCCTGTCTCC |
TTAAATCCTT |
TACAGATTTA |
|
8001 |
AAGTGCAGTC |
TACCTGAGGG |
CTCTGTGACC |
ATGTAAGAAA |
GCTTTTTCTT |
TCTTTTTTTT |
TCTCTGAGAC |
AGAGTGTTGC |
|
8081 |
TCTGTCGCCC |
AGGCTGGAGT |
GCAGTGGTGT |
GATCTTGGCT |
CACTGCAACC |
TCTGCCTCCT |
GGGTTCAAGC |
AATTTTCCTG |
|
8161 |
CCTCAGCTTC |
CTGAGTAGCT |
GGGACTACAG |
GCAGCACCAC |
CATGCCCGGC |
TGAGTTTTGT |
ATTTTTAGTA |
GAGACAGGGT |
|
8241 |
TTCACCATGT |
TGGCCAGGCT |
GGTCTTGAAC |
TCCTGACCTC |
GTGATCCGCC |
TGCCTCAGCC |
TCCCAAAGTG |
CTGGGATTAC |
|
8321 |
ATGCGTGAGC |
CATTGTGTCC |
GGCCTTTTTT |
TTTTTTTTTT |
TTTTTTTTTG |
AGACAGAGTC |
TCGCTCTGTT |
GCCCAGGCTG |
|
8401 |
GAGTGCAGTG |
GTGTGACCTC |
AGCTTACTGC |
AACCTCCGCT |
TCCTGGATTC |
AAGTGATTCT |
CCTGCCTCAG |
CCTCCCAAGT |
|
8481 |
AGGTGGGATT |
ACAGGCACCC |
ACCACCGTGC |
CTGGCTAATT |
TTTGTATTTT |
TAGTAGAGAC |
AGGAGTTTCA |
CCTTGTTTAG |
|
8561 |
TAGAGACAGG |
CTGGTCTCGA |
ACTCCTGACC |
TCAGGTGATC |
CGCCTGCCTT |
GACCTCCCAA |
AGCGCTGGGA |
TTACAGGCAT |
|
8641 |
GAGCCACTGT |
GCCTGGCCAG |
AAAGCCTTCT |
TTATTGAGCT |
TGGTGGCAGC |
CCAAAACTGA |
TTCTTTAAGG |
GTGTCAGGAC |
|
8721 |
TTAACACCTC |
CTGTGACTTA |
GCCGCACCTC |
CTCTCCTTTG |
ACTTTCATTC |
CACCTCCTTC |
CAGGATCGCA |
AGGTCCCTAT |
|
8801 |
TTGTCCTGGA |
AACGGCTTCA |
AGGTAGTCTA |
GGGTGCCGTT |
TGCCGGGGGA |
GGAAGGTGCT |
CTGGTTGATA |
GAGTCGCCTG |
|
8881 |
GCCGCACACT |
CTTTTTGGCA |
CATAACAACG |
TTCTACAGAG |
CCGGGGTGGA |
GCGTGCTTTC |
TCATAAGTGC |
TCTGCAGGTT |
|
8961 |
TGGAGAGAGA |
GGATATGAGG |
AGCACCCTTT |
TCTGTTTTTT |
TTAACCCAAA |
GATTAGCTTG |
GAAAAGGGGC |
AGAGGGGTGC |
|
9041 |
ACTGGAACTC |
AGGTCTGCCT |
AAGCAGCACA |
GCAGACCAAG |
GTCTAGAGAT |
GACATCTGCT |
CGCAGCTGTT |
CTTCCACCAG |
|
9121 |
CCCGCATCCT |
GGAAAGGGGT |
CTTGTGGCAC |
ACAAGAGTTC |
ACATCCTTCC |
CTCGTGAAAT |
AAGGACTTTG |
TGTTCATCAT |
|
9201 |
CTCTTGTAAG |
AAGCAGAGCA |
GAAAGCACAG |
AATTAAGAAA |
TAAAAGGGAA |
GTGGGTGCCT |
ATATAAAGGG |
AAGTGAAAAT |
|
9281 |
GGGTTGCTGT |
CCCATGCAAA |
GACCCTGGAA |
AGCTGTTAAC |
AGCTCAGCTT |
GTCACTTTCA |
CCATCTGCAT |
TTGTCCAGAG |
|
9361 |
TGATTGAGAT |
TTGCGTTGTT |
GTGGAGAGAA |
AGGCGCCTGT |
TGCACAATGG |
AGTGAGATTG |
CCACTGCTGT |
CAGGACCTCT |
|
9441 |
GTGTTTGGCT |
TGACACTTTT |
TGAGTTCTCA |
GCAGTCTCGG |
GACCCTCAAG |
AGTGGAAGCA |
TTTTTGGATG |
TTAAATGCTG |
|
9521 |
GGGTTAATTG |
AAGTTAAGAG |
CTTGTTTTAC |
TGGGCATGGT |
GGCTCACACC |
TATAATCCCG |
ACACTTTGGG |
AGGCCAAGGC |
|
9601 |
GGGCAGATCA |
CTTGAGTCCA |
GGAGTTTGAG |
ACCAGCCTGG |
GCAACATAGC |
AAAACCCCAT |
CTCTACAAAA |
AATACAAAGC |
|
9681 |
TGGGCGTAGT |
GGTGTATGCC |
TGTAGTCCCA |
GCTCCTTAGG |
AGGCTGAGGG |
GAGCAGATCT |
CTTGAGCCCA |
GGAGGCAGAG |
|
9761 |
TTTGCAGTGA |
GCCATGATCG |
TGCCACTGCA |
CTCCAGCCTG |
GGCAACAGAG |
TGAGATCCTG |
TCTTAAAACA |
AACAAAAAAA |
|
9841 |
ACAAACTTGT |
TTTCATTTAG |
ACTCTTCCTG |
GCGTTGGGGA |
CCTATTGGAA |
TAGGTTTAGT |
GTGAACTGAG |
AGCTAGAAGT |
|
9921 |
GTTAGAGGAG |
AGAGGGAGGG |
AACAGAGCCC |
GCTGGAGCGA |
GTGCCCTTCC |
TACCTTATCA |
CTGCATGCCA |
GGCATGTGCC |
|
10001 |
GGCGCTTTGG |
TCCTCCTCAT |
TTCATTCTTG |
ACTGCCACCT |
GAGACACGAT |
GGTTACTAGC |
TCCATTTTAT |
AGGTGGTGAA |
|
10081 |
ACTGAGGCTT |
GGGGAAGGTC |
AGACCCCAAG |
GGTGCCATTT |
AGTCAGTGGC |
AGAGCCAGAT |
CCAAATGCAG |
GTCTCCTGAC |
|
10161 |
TCCAAGTGCA |
GGGCTCATTT |
TATCGTCCGG |
TTGCAGCACG |
CTGGCGGCCC |
CTTGAGCCCC |
AACCTGGATA |
CCATAGGGGA |
|
10241 |
GGAGCAGAGA |
AGCCAGGAAC |
ACCACAGCCC |
TGGGCCAAGG |
TGCGGGGCTG |
AAAGAACTTC |
CCAGCGCTCA |
GCCTGGGACT |
|
10321 |
AGTGGAATGG |
GCTGGGCCCT |
GGGGCTGGCA |
GCGGTGGCCC |
CGGGGAGCCT |
GGGAATGAGT |
AGGGAGCACA |
GGGAGGTGTG |
|
10401 |
GGAGGGCCTG |
GGAACCATGA |
AAAGGAGGGC |
GGGTGCAGGG |
AAGTCGCCTG |
CTAGTGAAGT |
GGCGAGGGGG |
CCCTGTGGGA |
|
10481 |
CTCCAGGGAA |
TGGCCACGGC |
AGGTTGTCCT |
CCAGGAGTTG |
AGAGCCACTG |
GACATGGCAG |
CTGCCTGTGT |
TCTCAGCCAC |
|
10561 |
CACAGTAACC |
AAAGAAATCT |
TGGTTTTAAA |
ATTCAAGTTG |
CCATGGAAAC |
GCTCCCATCC |
TCGACTTGGC |
TTATTATTTA |
|
10641 |
AAATAACATC |
TCTACAGCAC |
AAAGCCCCCG |
GGTACATCCA |
AGGACACTGC |
TGTCTGCCCA |
CGAGACATGC |
TAACCTCACA |
|
10721 |
GTGTGGAGGC |
TGTGTGGGTC |
ACTGACATGC |
ATGGCCACGT |
GAGACGCTGC |
CTACCCACGA |
GTCACGGAAA |
AGGGGAAGAT |
|
10801 |
TATTAAGAAA |
GTCACTAGGG |
GCCAGGTCCA |
GTGGCTCATG |
CCTGTAATCC |
TAGCACTTTG |
GGAAGCTGAG |
GCAGTTAGAT |
|
10881 |
CCCTTGAAGC |
CAGGAGTTCA |
AGACCAGCCT |
GGCCAACATA |
ACGAAACACG |
GACTATACTA |
CAAAAATTAG |
CCAAACGTGG |
|
10961 |
TAGCACAGGC |
CTGTAATCCC |
AGCTACTCAG |
GAGGCTGAGG |
CACAAGAATC |
ACTTGAACCT |
GGGAGGTAGA |
GGTTTCAGTG |
|
11041 |
AGCCAAGATT |
GCGCCACTGT |
ACTCCAGCAT |
GTGCCACAGA |
GCGAGACTCC |
CATCTCAAAG |
TCACTAGGGA |
GGAAGCCTCA |
|
11121 |
TTGGTGGGAA |
GGAAGACCAA |
ATTGGAAATG |
CTCTGAGGAA |
TCATTAAAAC |
AAATGTCCTT |
TTATCAGTTT |
GGTGGCTCAG |
|
11201 |
GGCCTTTAGT |
AATACTGCCA |
ACTATTTTTC |
CTAGAAGAAA |
CAAAACTGAA |
ATAATAGGAA |
CATACTCACT |
TTTTTTTTTT |
|
11281 |
TCTTAAAAGT |
AAGGGTATGT |
TGTGAAAAAA |
AGTCTCCCCA |
CCGTAGTGAC |
CGACTGCCGT |
GCATCTTCCT |
TGGCATTTTG |
|
11361 |
CATGTAGTGG |
CAGGAGTGTT |
CCTACATGTG |
TAGATTGCTG |
AGAGGGTCAG |
ATGCTTATGG |
TCCTCAGTCA |
CCCACAGCTT |
|
11441 |
GCTTTTTCCC |
CACTTAACAT |
TGGGACTTGG |
GGCATTTTTA |
CTCTGTTAAT |
ACAATAGAAT |
TCACTTCAAC |
TAGTTGGTTT |
|
11521 |
TTTACTTTTA |
TTTTATTATT |
ATTATTTTTA |
GATGAAGTCT |
CAATCTGTCG |
TCCAGGCTGG |
AGTGCAGCCT |
CTGCCTCCTG |
|
11601 |
GGTTCAAGTG |
ATTCTACTGC |
CTCAGCCTCC |
CAAGTAGCTG |
GGATTACAGG |
CATGCGCCAC |
CACGCCTGGC |
TAATTTTTTG |
|
11681 |
TATTTAGTAG |
ATACAGGGTT |
TCACCACCTT |
AGTCAGGCTG |
GTCTCTAACT |
CCTGACCTCA |
GGTGATCCAA |
CCGCCTCGGC |
|
11761 |
TACAGGCATG |
CGCCACCGTG |
CCCCACCAAC |
TAATTGCTTT |
TTTAATGGTT |
GCTTCATATT |
CTATTTAACC |
ACTTACCGTA |
|
11841 |
ATTTAACAGT |
TACTCTGTTA |
TTGGATACTT |
GATTCATTTC |
CAGGACATGC |
AGATTTAGAA |
ACTGGTGGGC |
GTTGCTGGTT |
|
11921 |
GATGTCTCGG |
TGGTTGTGCC |
CACCCACCCC |
AGGCACTCAT |
TATCATGCCT |
TCCGTTTCCC |
CACTTCCTAA |
GCCCTGCAAG |
|
12001 |
AAGTGCCAAA |
CTGTCCAGCC |
CTTGCCAATC |
TGATACATGC |
TGAATACCTC |
CTTGATTTTA |
TCTGCACCTC |
CCTGAGGTTG |
|
12081 |
AACTTATTTT |
ACTTTATTTT |
ATTTTATTTT |
TGAGGCAGAG |
TCTCACTGTC |
ACCCAGGCAG |
GAATGCAGTG |
GTGAAATCTT |
|
12161 |
GGCTCACTGC |
AATCTCCACC |
TCCTGGGTTC |
AAGTGAGTTG |
AAGCAATTCT |
CCTGTCTCAG |
CCTCCCAAGT |
AGCTGCGATT |
|
12241 |
ACAGGCACCT |
GCCACCACAC |
CTGGGTAATT |
TTTGTATTTT |
TAGTGGAGAC |
GGGGTTTCAC |
CATGTTGGCC |
AGGCTGGTCT |
|
12321 |
CAAACTCCTG |
ACCTCAGGCG |
ATCCACCTGC |
CTCAGCCTCC |
CAAAGTGCTG |
AGATTACAAG |
CTTGAGCCAC |
CATGCCGGTT |
|
12401 |
GAACTTATTT |
TTATCTGTGT |
TGGCCATTTG |
TAGTTTTTCT |
ATTATGCTGG |
TTCCATTTTT |
CTGACTTTGA |
AGAGCCTTTT |
|
12481 |
GTGGAAACGA |
AGCCTAAAGT |
ATATGTGAGT |
ACTGCTTTGT |
TTCGTCAGTT |
TTTGTTTTAA |
AGAGACTTTT |
CAGTGTAATG |
|
12561 |
CAAGCATTTT |
CCCTTGAAGG |
CTGTGTTTCC |
TGTCAATCCT |
AAGCTTTCCT |
CCAGCATTAC |
CTTTTTAAAA |
AACTTTATTT |
|
12641 |
TCTATATTGA |
TGGGGTCTCA |
CAATGTTGTC |
CAGGCTGGTG |
TTGAACACCT |
GGCCTCAAGT |
GATCCACTTG |
CCTTGGCCTC |
|
12721 |
CCAAAGTACT |
GGGATTATAG |
GCATCAGCCA |
CCGCACCCAG |
CCTGTTTTTC |
AAAGGGCATT |
GATTTTTTTC |
ATAAAACTTT |
|
12801 |
TTAAATTAAG |
ATCTGTGGGC |
CTGGTGCGGT |
GCCTCACGCC |
TGTCATCGCA |
GCACTTTGGG |
AGGCTGAGTC |
AGGTGGATCA |
|
12881 |
CGAGGTCAGG |
CGTTCGAGAC |
CAGCCTGGCC |
AAAATGGTGA |
AACCCTGTCT |
TTACTAAAAA |
TATGAAAATT |
AGCTGGGCAT |
|
12961 |
GATGGCACAT |
GCCTGTAATC |
CCGCTGCTCG |
GGAAGCTGAG |
GTAGGAGAAT |
AGCTTGAACC |
CAGGAGGCAG |
AGGTTGCAGT |
|
13041 |
GAGCCGAGAT |
CATGCCATTG |
CACTCCAGCC |
TGGGGGACAG |
AGTAAGACTC |
CGTCTCAAAA |
AAACAAAACA |
ATTCTGTGTT |
|
13121 |
GCATGTGGTA |
CTTTTTGTGT |
GTGAGGAGTC |
CAGTGTGAAG |
ATTCAGACTT |
CAGGCAGCCA |
CTTGTACAAG |
CACTGTCCTG |
|
13201 |
TTTCCTCTCT |
GGCCTCACCT |
AGGTAACGCT |
GATTCCTCCA |
CGGAGGATGT |
GCTTCTGAGT |
GGTCCGTTGG |
GTGCTGTGCT |
|
13281 |
GATGAGCATC |
ACCCAGCATT |
TTACGACACA |
TGTGCTGCCC |
CAGAGGGCTG |
GGCTCCCGTC |
AGAGCTCTTT |
TCCACTGGCT |
|
13361 |
GGGTGCGGTG |
GCTCACACCT |
GTAATCCCAG |
CACTTTGGGA |
GGCTGAGGCC |
AGTGGATCAC |
CTGAGGTCAG |
GAGTTCGAGA |
|
13441 |
CCAGCCTGGC |
TAACATGGTA |
AAACCCCATC |
TCTACTAAAA |
ATACAGAAAT |
TAGGTGCGTG |
TGGTGGCGCA |
CACCTGTAAT |
|
13521 |
CCCAGCTACT |
TGGGAGGCTA |
AGAACCTGGG |
AGGTGGAGGC |
TGCAGTGAGC |
CAAGATTGTG |
CCACTGTACC |
CCAGACTGGG |
|
13601 |
TGACATGGCC |
CACAGACCAA |
GACCCTGTCT |
CAAAAAAAAA |
AAAAAAGCTT |
TTTTCCAGAG |
CCATCTTGCT |
GTCCTCATAT |
|
13681 |
ATTTATTCTC |
CTAGATGAAT |
TGTTGCTTAA |
GCAACAATGA |
AGTTTAAATT |
GTTCTTCAGA |
TAGAACCCCT |
AGGTGCCAGG |
|
13761 |
CAGTTTGTTA |
AGCACTGACA |
CCGCAGTCCT |
GTGACTTTCA |
TGACGACCTG |
TGACCTGTGG |
GAGGTAGAAC |
ACCTCACCCA |
|
13841 |
CCCGTGGGAG |
CCCCTGGAGT |
GACTGACAGG |
AACCCCTGCC |
TCACCCAGCT |
CCCCGCGCCG |
CGGCTCTCCC |
CAGCCCTGGA |
|
13921 |
TGCAGGCGCC |
CAGGAGACAT |
GTATTGCTTT |
TGTTGAGCTG |
CTCACTTAGG |
AGGGTGACTC |
AGAGTTCAAG |
CTGGAAAGCA |
|
14001 |
CTGGCTTGTG |
ACTCATGAGC |
CAGTGCAAGA |
TCAAACGGCT |
GGGGCATCGA |
GGTGAGAGTT |
TTCCTTCTCA |
GAAGCCTCAT |
|
14081 |
CTCCGCAGCC |
GGAAGCAGAG |
CCCTTGGCTG |
ACTGGAAAAA |
CCAGAGAGGC |
CCCGGGAGTG |
GGTGGATGGC |
CAGCCCAGCC |
|
14161 |
CCACCTCTCA |
GCCTCAACCT |
CCACCAGCCC |
ACACCGAGCT |
TGTTTGTCTG |
TGTCCTGTCG |
AAACTAGGAC |
TCCTGGATTG |
|
14241 |
TAACTTTTCT |
TACATTTCCC |
TGTCCCCTGG |
GTCCTCCACT |
TAGGGTGTAA |
TCACACAGAC |
AGGCTCTTAA |
CTGTTTCATA |
|
14321 |
TCACTCACTT |
GGGAAAGTGT |
CTCAAGCTGT |
TCTACAAATC |
CATGCAAAGG |
CCGTTTAAAA |
ATAGCAGCGA |
AGGCCCTGGA |
|
14401 |
CTCGGTCTCG |
TCCAGCACAG |
CCCCTTGGCT |
CTCTCTCTGG |
GCTCTGGCCG |
CCTGGCCCCC |
GGGGACCCAC |
ACGAGGTCAT |
|
14481 |
GGCGTGCTTC |
GGGCAGGGGG |
GCGGGGATCC |
CATAGACACC |
TCAGCTCCTT |
AAGAGTTCTC |
CGCCTGGGCC |
AGGACGAGCA |
|
14561 |
TGGGGGTCCC |
CACTGATGCC |
CGAGACGGTG |
CCCCTGTGTG |
TGTGAGCCCT |
CGACCCACAT |
AACAGAGAGG |
TGTCCTGATG |
|
14641 |
CCCTCTGTCC |
TCTCCAGGTG |
GATCTAGGAT |
CCGGCTTCCA |
ACATGTGGCA |
GCTCTGGGCC |
TCCCTCTGCT |
GCCTGCTGGT |
|
14721 |
GTTGGCCAAT |
GCCCGGAGCA |
GGCCCTCTTT |
CCATCCCCTG |
TCGGATGAGC |
TGGTCAACTA |
TGTCAACAAA |
CGGAATACCA |
|
14801 |
CGTGGCAGGT |
AGGCTGTGGG |
GCTGCGTCCT |
ATGTATGTCC |
CCTCCCAGGT |
CGGTCTGTGC |
ACACTGACTC |
CAGGAATGGG |
|
14881 |
AAGCCAGCCC |
TCACTGTGGC |
CCACAGAACA |
TTGTCCTGGA |
CTGTTGAAAA |
ATGGCCCTGA |
CCCTGAGTCA |
TGTCGGCTCT |
|
14961 |
GGACCACGCC |
TCCCTCCTTT |
GTCCCATACC |
CCACGTGTGT |
GCATGAGTGT |
GTGTGCATGT |
GTATGAATAT |
ATGGTTGTGT |
|
15041 |
GCACGTGGAG |
GTGTGTGGGG |
GAGTAAGTGA |
GCTTCGGAAG |
TGTTCAAAAC |
CAGCTAGCCC |
CTGTGACGCC |
CGCCGTGGCA |
|
15121 |
CAGTGGCAGT |
TTATCAGTCC |
CGCCCCTGAT |
TCCCTTTGTC |
TCTGCCCAGC |
CCTGACCCTC |
GGACTGAGAA |
CCAAGAATGA |
|
15201 |
GGAGTGAGCC |
ATGTTGAGGG |
CTGGAGAGGG |
TCTCTGATTG |
TCAGCAAATG |
GGAGCAGATC |
AGGGGAGACA |
CCCACTCTGC |
|
15281 |
CCGTGTGTCA |
CTCTGGGCTG |
TTCCCCAGTG |
CCCCCCAACC |
CTTGGGCCCT |
TGAATTGGGT |
AGGGTCTCTG |
GTTTTCCCTG |
|
15361 |
GGTTGGGCTT |
CGTGTGTGGG |
CAGCGTGCCC |
ATCCACCGCC |
ATCCCGGGAG |
ACCCTTCAGT |
CAGTGGGTGA |
CTCTCTTCCA |
|
15441 |
GGCCGGGCAC |
AACTTCTACA |
ACGTGGACAT |
GAGCTACTTG |
AAGAGGCTAT |
GTGGTACCTT |
CCTGGGTGGG |
CCCAAGCCAC |
|
15521 |
CCCAGAGGTG |
AGTGCCTGCT |
CCTCTGCACC |
GCTGTAATGT |
GAGTGGCAGG |
CGTTGGTTTG |
GGGCAGTGGG |
AAGTGGGAGA |
|
15601 |
GTGAAGGCCT |
CTCTGGCGTC |
CGCAGGGCTC |
ATGCTGCCCA |
GGGCTGCCGA |
CACCGCTTGA |
GGTACAGTAC |
TTGTTTCTTT |
|
15681 |
TCATTTTTAT |
ATTACTTTCT |
CTTTTTATTT |
TATTTTTTCC |
TTCTCAAGTT |
ATTAGTAGAG |
AAGATGCTGG |
TGCTTTTTTT |
|
15761 |
TGGTTTTTGA |
GAGAGTCTCA |
CTCTGTCACC |
CAGGCTGGAG |
TGCAGTGGCA |
CGATCTTGGC |
TCACTGCAAA |
CTCTGTCTCC |
|
15841 |
CGGGTTCAAG |
CGATTCTCAT |
GCCACAGCCT |
CCTGAGTAGC |
TGGGGCCACA |
GGCTTGCACC |
ACCGTGCACG |
GCTAATATTT |
|
15921 |
GTATTTTTAG |
TAGAGACTGG |
GTTTCGCCAT |
ACTGGCCAGG |
CTCGTCTCGA |
ACTCTTGGCC |
TCAGCAAGTC |
ATCTGCCCAA |
|
16001 |
CTCGGCATCC |
CAGAGTGCTG |
GGATTACCGG |
CATGAGCCAC |
CCCCAGTGTT |
TTCTATTGTG |
CTTCTCCAAG |
GTGTCTCAGC |
|
16081 |
CGCAGATGTG |
ACAGACAAGG |
TCTGGGTCCC |
CACTATGTGG |
GCAGACTCAA |
TCTTGAGTAA |
TTTAAGGGAA |
ATCTAAGAAA |
|
16161 |
AGCCAGATGA |
GGCCAAGTAT |
GGTGGCTCAC |
GCCTGTAATC |
CCAGCACTTT |
GGGATGCCAA |
GGCAAGCGGA |
TCACCTGAGG |
|
16241 |
TCAGGAGTTC |
GAGACCAGTC |
TGACCAACAT |
GATGAAACCC |
CGTCTCTACT |
AAAAATAAAA |
AACTTTGCCA |
GGCGTGGTGG |
|
16321 |
CGGGCGCCTG |
TAATCCCAGC |
TACTCGGGAG |
GCTGAGGCTA |
GAGAATCGCA |
TGAACCCGGG |
AGGCAGAGGT |
TGCGGTGAGC |
|
16401 |
TGAGATGATG |
CCACTTCCCA |
CGTGAGGATG |
GGGTAGTTAA |
GTTGCACGAC |
ACTGCCTTCT |
GCACAATGGG |
GATGATATGA |
|
16481 |
GAAATGATGA |
GAGGATCCTG |
GGCAGAGGTT |
GAGGCAGGAG |
AGCAGGGCAA |
GAAACACTAT |
CTGGACTGAA |
TAAACTTATT |
|
16561 |
TAAAAATGAG |
TTTATAATTT |
GTGGATAAAT |
TCATCTTAAA |
AAATCCATTA |
CATAAGCCTA |
GAAAGAAGCG |
TACAATATCG |
|
16641 |
CAAGTCAGGC |
CAGGTGTGGT |
GGATCATGCC |
TGTAATCCCA |
GCACTTTGGG |
AGGCCAAGGC |
AGGCAGATCA |
TTTGAAGCCA |
|
16721 |
GGAGTTTAAG |
ACCAGCCTGG |
CCAACATGGT |
GAAACCCTGT |
CTTTACTAAA |
AATACAAAAA |
TTAGCCGGGT |
GTGGTGGTGC |
|
16801 |
ATGCCTGTAA |
TCCCAGCTAC |
TCAGGAGGCT |
GAGGCACGAG |
AATCGATTGA |
ACTCAGGAGG |
CAGAGGTTGC |
AGTGGGCAGA |
|
16881 |
GATCTCGCCA |
CTGCACTCCA |
GCCTGGATGA |
CACAGTGAAA |
TTCTGTCCAC |
TCCCACCCTC |
CCGTCCCACC |
CAAAAAAAAA |
|
16961 |
ATCAAAAGGT |
GCATTGAGCA |
ACAGAATTTC |
CTTTTGTTTT |
ATGAAATTAC |
CAAGGACACC |
TGTCTGCAGC |
CTTCTTCGGC |
|
17041 |
ATGTCTTGTG |
CTCATTTCCT |
TGTTAGGGGA |
GTGTGGCCCA |
GCCAAAGTGG |
CCTTTGCAGG |
GATTGGGGAG |
GGAGTTGTGG |
|
17121 |
GATCAGAGCT |
TGTAATGAAG |
ACCTTCCTTT |
ATCCAGAGTT |
ATGTTTACCG |
AGGACCTGAA |
GCTGCCTGCA |
AGCTTCGATG |
|
17201 |
CACGGGAACA |
ATGGCCACAG |
TGTCCCACCA |
TCAAAGAGAT |
CAGAGACCAG |
GGCTCCTGTG |
GCTCCTGCTG |
GGTAAGGCCC |
|
17281 |
TGCTGGCTGG |
CGGGGAAGCG |
CTGGAGAGAA |
AGTGGGAGCA |
ACACTGGAGA |
GTCTTGGGGG |
ATTCGGGGTG |
GGGACAACTC |
|
17361 |
TGACAAGGCA |
AGTTATAGAA |
ACTTTCTGAG |
TCCCAGTTTC |
CATCAGTACA |
AAAATCACAA |
TCCCTCTGGC |
CATGAATGAT |
|
17441 |
GGCGAGGATT |
AGGTGGAGTG |
GCGGGCAGAG |
CATCCAGCAG |
ATTGCAAGTC |
CACGTGTACA |
GGTGGCGAAG |
CAGCTCCCTT |
|
17521 |
TCCCTGACAT |
GCTGGCCCGT |
CCGCAAATAC |
CAGGAGCTCT |
CACTGCTACT |
CTGCTTCAAG |
AAAGCATCCC |
TTTAGTGTCA |
|
17601 |
GTGAGCTGTC |
TTAATTTTGT |
CATTTAATTG |
TGGTAAAATA |
CACGTAACAG |
AAATGTAATA |
ATCTTAGCAA |
TCTTCTTTTG |
|
17681 |
TTTTCTTTTT |
CTTTCTTTTT |
TTTTTTTTTT |
TTTTTTTGGA |
GATGGAGTCT |
TGCTCTGTCA |
CCCAGGCTGG |
AGTGCAGTGG |
|
17761 |
TGGGATATTG |
GCTCACTGCA |
ACCTCTGCCT |
CCTGGGTTCA |
AGCAATTCTC |
CTGCCTCAGT |
CTCCCCAGTA |
GCTAAGACTA |
|
17841 |
CAGGCATGTG |
CCACCACGCC |
CAGCTAATTT |
TTGTATTTTT |
AGTAGAGATG |
GGGTTTTGCC |
ATGTTGGACA |
GCCTGGTCTC |
|
17921 |
AAACGCCTCA |
CCTGTTGATC |
TGCCTTCCTC |
GGCCTCGCAA |
AATGCTGGGA |
TTACAGGCGT |
GAGCCACCGT |
GCCCAGCAAC |
|
18001 |
TATTTTCAAG |
TGTACAGTTC |
TGTAGCATTA |
AGTACATTCA |
CAGTGTTGCT |
CAGCCACCAC |
CACCATCTGT |
CCCCCGAACT |
|
18081 |
CTTTTTCAGC |
TCGCAAGACA |
GAAACTCTGT |
CCCCATTAAC |
ACCAATATTG |
TAGCCCCTGG |
TAAGCCCCAC |
TCTACTTTCT |
|
18161 |
GTCTCTATGA |
ATTTGACTCC |
TAGGGACCTC |
ATACAAGTGG |
ATCACAACAG |
TATTTATTTT |
CTGGGTGAGC |
TGTGTTTTTT |
|
18241 |
GTTAAGAAAA |
AAACACAGCC |
AGGTACGGTA |
GCTCACGCCT |
GTAATCTCAA |
CACTTTGGGA |
GGCTGAGGCA |
GGCGGATCAC |
|
18321 |
CTGAGGTCAG |
GAGTTTGAGA |
CCAACCTGGC |
CAACATGGTG |
AAACCCTGTC |
TCTACTAAAA |
AGACAAAACT |
TAGCTGGGCC |
|
18401 |
TGGTGGCAGG |
CACCTATAAT |
CCCAGCTACT |
ATAGAGGCTG |
AGGCAGGAGA |
ATTGCTTGAA |
CCCAGGAGGC |
GTAGGTTGCA |
|
18481 |
GTGAGCTGAG |
ATTGCGCCAC |
TGCATTCCAG |
CCTGGGCAAC |
AAAAGTGAAA |
CTCCGCCTCC |
AGGAAAAAAA |
AAAAAAACCA |
|
18561 |
CACACACACA |
TACAAACTAA |
TACTACACAT |
TTTGCAGATT |
TCAGAAATGA |
ACCCAGTTTC |
CAGGCAGAGG |
TTCACGGGGG |
|
18641 |
TGCTTCTCTT |
GCTTTAAAAG |
CTGAGTTGGG |
CAAATCGTTG |
AGGCAGGAGA |
GTGGGGCAAG |
TGGCTCATCA |
CGCCTCTAAT |
|
18721 |
CCCAGCACTT |
TGGGAGGCCA |
AGGTGGGCAA |
ATCACTTGAA |
CCCGGGAGTT |
CAAGACCAGT |
CTGGCCAACG |
TAGCAAGACC |
|
18801 |
CCATTTCTAC |
CAAAGAAAAT |
AAAAGCTGAG |
TTTGAGCCCC |
AGGAGCGTCC |
CCTGGTGTTG |
AGAGATCAGT |
TGCCTACAAG |
|
18881 |
GTCTGAGGTG |
CCCCTGTGGC |
CCTTTGAGGG |
GACTGCTGCA |
GAGGGCCCAG |
CCCGGACATG |
GCAGCCTCAC |
CCGGTGGGGC |
|
18961 |
TCGTTCCTGC |
AGGCCTTCGG |
GGCTGTGGAA |
GCCATCTCTG |
ACCGGATCTG |
CATCCACACC |
AATGCGCACG |
TCAGCGTGGA |
|
19041 |
GGTGTCGGCG |
GAGGACCTGC |
TCACATGCTG |
TGGCAGCATG |
TGTGGGGACG |
GGTGAGTCAG |
GCTGTGCTTC |
CACAGCGGGT |
|
19121 |
TTAGTGCTGA |
GAGACCCTGG |
GCCCCAGCTT |
CTCAGTGGAG |
GGGACTTTGA |
GGACTTCCTG |
GGACTGCTGC |
GAGTCAGAAG |
|
19201 |
TGTTTCGGGG |
AGACTCCGAG |
AGTCTGGCAG |
GCAGGGCCTG |
GCAAGGCTGC |
TGCTTCCTGG |
GAGGTGCCTG |
AACCAAGGCT |
|
19281 |
GGCCCGGCAG |
AGCTGTCTGA |
GGGGTGGCAT |
CCCAGCAAAA |
CAGTCAGTTT |
CAGAAAATGT |
GGTGGAATGT |
CTGGCCCCTG |
|
19361 |
GTCTGGTTTG |
TGTCATCGCA |
TGTTCCTTTT |
TCCTGCTTGG |
GGAACCAGTT |
TGGGGCATTT |
CCTTATGTGT |
AGTTAGCGGG |
|
19441 |
GATGGCTCCA |
CCTTAACAAC |
AAGGTGGTGG |
CACAGAACTT |
TCTGCTCCAG |
CTGCCTGCAG |
CCTCGCCTTG |
GTTCCGCAGT |
|
19521 |
AGCGGTGTTC |
ACTGGCGGCT |
GCAGATCCGA |
AATGCCTGAG |
GGCTTAAAAA |
AATGCATGGG |
CTTCTCCCCT |
AACTAGTTAA |
|
19601 |
ATGGGAATCA |
CTGGCGATTC |
TCATTTCAGA |
GAACTGGTGG |
TGTTTTTAAG |
TACCTTAGGG |
AAGTTTAGCG |
TCTGCTCAGT |
|
19681 |
TGAGAATCAG |
TGTTCTACCC |
GGAAGGGCAC |
TCGTAGCCTG |
GTCTGGTATG |
GACATGAACA |
GGAGAGCCTC |
CTGTCTTCTC |
|
19761 |
CCGGATCTTT |
GTGGGCAGGG |
GTGGGGCTGG |
CGGTTATTCC |
CTGCAAGCTG |
TGCTTATCTA |
GGGAGCGTCC |
CTTGGAGGGT |
|
19841 |
TTGGGGTCTG |
GGAGGTCTGC |
TCGCACCACT |
TGCTGCAGCT |
GGGGGTGGGT |
CCACGAGTGG |
CCTCGGGCAC |
TTGGGTAGCA |
|
19921 |
CACAGTGGTC |
TGGAGAGCTG |
GTGGTGCTTC |
TCTCAGAGGT |
TTTCATTAGA |
GGCTGTCTTT |
TCAGCTGTAA |
TGGTGGCTAT |
|
20001 |
CCTGCTGAAG |
CTTGGAACTT |
CTGGACAAGA |
AAAGGCCTGG |
TTTCTGGTGG |
CCTCTATGAA |
TCCCATGTAG |
GTAAGTGTGT |
|
20081 |
CCCCTTGGCC |
ACTTTCTGGC |
CAGATGGATT |
GTTTGAGCAA |
TTAACCATCA |
TGGCTTTATT |
TGCCTTTATA |
AACTGGGGGT |
|
20161 |
TGAGACAGAG |
GGGCTGCTGA |
GAGGTGCTAG |
CCAGGTGCAC |
AGGCCTCTGG |
CAGGACCTGC |
CTGGCGTCCA |
TGCTGCAGGC |
|
20241 |
ACGAGGCTTG |
CCTTGCCCCA |
GGTCTTCTCC |
GTGGGGGCTG |
TAGGTTGACT |
CCGCTTTCTC |
CCGCGTCCCA |
TCAGGGTGCA |
|
20321 |
GACCGTACTC |
CATCCCTCCC |
TGTGAGCACC |
ACGTCAACGG |
CTCCCGGCCC |
CCATGCACGG |
GGGAGGGAGA |
TACCCCCAAG |
|
20401 |
TGTAGCAAGA |
TCTGTGAGCC |
TGGCTACAGC |
CCGACCTACA |
AACAGGACAA |
GCACTACGGT |
AAGGGGCCTG |
GGGCCTGGCC |
|
20481 |
ACGGCGCACG |
TGGAGGCTGG |
GGAGCTGCTG |
CATCCCTCCT |
CACGCTGCAG |
CGAAGAGGTC |
AGGGCTGAGG |
AGCCTTGGGG |
|
20561 |
TCCCGGAACT |
CTAGGATAGA |
GGAGGGGGAG |
TGATGCCCTC |
TTGCCAGGAG |
AAGCAGCACA |
CTCTCCACTT |
TCTGCCTGTT |
|
20641 |
CCACCCTGAA |
CTCAGCCTCA |
GCCCTTCCAA |
ACTGGAAGGG |
ACCAAAGCCC |
TCCTTTTACA |
AGGGAGTGGC |
GTGTCCTGGA |
|
20721 |
GTCTTGAGGT |
ATCCTGGCCT |
CTTCCGGGGC |
CTCGTGCTCA |
TTGCTCCGTT |
CTCCCATTTC |
TGAGCTTCCA |
GCCCCAGCCC |
|
20801 |
TGTCCTTAGT |
TCTTCAGGGG |
AGCCTTCCTT |
GAGCTCCCCT |
GGCAGGGCAA |
GTCCCCTTAG |
ATGCAGTCTT |
CCCCTGGAGG |
|
20881 |
CCACCAGAAA |
TCAGTGACTG |
GCTGACCGTG |
GCGTGCCACG |
AGGCACGGTC |
ACTGCCCTCC |
CCAACCCCGA |
GCTCGGTTCA |
|
20961 |
TTTTCCAGGA |
TACAATTCCT |
ACAGCGTCTC |
CAATAGCGAG |
AAGGACATCA |
TGGCCGAGAT |
CTACAAAAAC |
GGCCCCGTGG |
|
21041 |
AGGGAGCTTT |
CTCTGTGTAT |
TCGGACTTCC |
TGCTCTACAA |
GTCAGGTGCG |
TGCTGATGGC |
TGATGGCAAT |
AGAGGGTGGG |
|
21121 |
GGTCGGGAGG |
GGAGCCTGGG |
TGCCAGGATG |
GTTCATGTTG |
ACCAATAGGG |
CTGGATTGGG |
CAGGCAGGTG |
AGGGGCTGGG |
|
21201 |
GAAGAGGGCT |
GTGTGGGGCC |
TTCCATATAG |
GCTCACTCCT |
CTGTGGACCA |
GCCTGGCACC |
TGACTGTCTT |
TGTCCAGGTG |
|
21281 |
TTGTGCCAAC |
ACACACAATT |
CCAGAGGCCT |
CTCCCAGAGC |
CTCTGGGCTA |
GTTCCTTCCC |
ATCCATTAGC |
GTCCAGCTCA |
|
21361 |
TACATGTTCT |
CTGTGGGAAC |
GCCCCCCTCC |
CTGGTGGCTG |
GCCACGCCTG |
CCCTCGCCTT |
CCTGGCTTTG |
CATTGCTTTA |
|
21441 |
CCAGCATGTT |
TCTGTTATAA |
TGGCAGGCCG |
TTCGTGTTTG |
CACTGGCTGA |
CACATTTTCT |
TCATCTTTCT |
CTTTGCAAAG |
|
21521 |
GCCTAGATCG |
TGCCTCACAC |
CAGTGGATGA |
ACACACAGGG |
AGGACCTTGC |
CCTGGGTAGT |
GCCATAGGGA |
CTCCAACCAA |
|
21601 |
GGAATCTGGA |
CAGTCCCCAT |
CCCCAAGTCC |
TGTCTTAGAA |
ATCCCTTGCT |
CCAGATGAAC |
AGTCTTTGCT |
GACTGCATCT |
|
21681 |
ATCCCATTAA |
TATTTTGCTA |
GATTGGAAAC |
TTGTAAAACA |
ACAGTTTGAC |
AAAGGAAGAT |
GATTCTGGCA |
AGAAAAAGTT |
|
21761 |
TCTGTGAAGA |
TGGCACTTCT |
TAGCATAGTC |
CTCGCCCTTA |
TTATTAAAAG |
ATGCATTTCC |
ACTTTGACAG |
TGATTTCCCC |
|
21841 |
ACCCGAGAGC |
CTGGTTGGGC |
AGCCCTGAGC |
CCCTTAGCCA |
GTATCTTTCC |
CATCAGCTAG |
CATTAGCTGC |
CACCAGTCCT |
|
21921 |
CCCTTCTCCT |
CCCTCACCTC |
TTCAGGGTGT |
ACCTGGAGCA |
GCTATTAAAA |
CTCTTTAAGC |
CGTGTAACGC |
TTTTGACAAA |
|
22001 |
CCCTTACTTA |
CATGGAATCC |
CAGTATAAAG |
AACAGATGAA |
TCAGTCATCT |
AGCATTAAAT |
GTTTTATCAA |
TTAGAATGTA |
|
22081 |
TTAGAAATTC |
TATTTTTAAA |
GCCAACAAGC |
ACATTTTGTA |
GGAATCTATT |
TCAATATTTA |
CTCAACAACT |
TTGAGTGTGT |
|
22161 |
TGGGCCTTCA |
AAAATATATT |
TAATTCCCCA |
GTATTGTGGG |
ACCCCCCCTG |
GAGCCTCCTG |
GTGGAGCAGC |
AGGTTCCCTC |
|
22241 |
TGGAAGCTGT |
TTCTCCTTCC |
CGGAGCATGA |
CCAGCTGAGT |
GTGGGGGGTG |
CTGTGGGGCG |
TGGGACAGTG |
GCCCGGTCTC |
|
22321 |
GGTCTCAGCC |
AGTTCTTCCC |
TTTTCAGGAG |
TGTACCAACA |
CGTCACCGGA |
GAGATGATGG |
GTGGCCATGC |
CATCCGCATC |
|
22401 |
CTGGGCTGGG |
GAGTGGAGAA |
TGGCACACCC |
TACTGGCTGG |
TTGCCAACTC |
CTGGAACACT |
GACTGGGGTG |
ACAATGGTGA |
|
22481 |
GTGGCTGCCC |
CCTTCCTGCC |
AAGAACAGTG |
AATTGTGAGC |
CACACCCCGT |
GGCCATCTCG |
GCTTTCTCTG |
TTCTACCCAC |
|
22561 |
CGCACAGCCT |
TAAACCCTGG |
ACCTACGGCC |
AGGCTGTGAG |
CTCCTCCTAA |
GTGCCAGGCA |
GTCAGGAGTT |
CCCTTTTGCT |
|
22641 |
GTGAGGGCAG |
ACTCTGAGCA |
GCTTCAGAGC |
CAACGCCTGC |
AACAGTTCCC |
ACAGCATCGC |
GCCAGATCCT |
GTGATGGGAA |
|
22721 |
GGGTGACCGG |
GCAGGGGGCT |
TGCCCGTGGA |
GGTGTGCCCC |
ACGGCTCCAG |
AAGCCTTGTG |
GTGTTGAGGA |
CTGTGCTAGT |
|
22801 |
GGGCCAACAG |
CAGGAGTGGC |
CAGGGATGAG |
TGACTTAAGG |
TCTTTTAAGG |
ATGAGTCTGA |
CTATATTGGT |
TGACCCTTGT |
|
22881 |
CACACTTTAA |
AAGCACCTTA |
CTTTTTATTC |
CCAGGCTTCT |
TTAAAATACT |
CAGAGGACAG |
GATCACTGTG |
GAATCGAATC |
|
22961 |
AGAAGTGGTG |
GCTGGAATTC |
CACGCACCGA |
TCAGTACTGG |
GAAAAGATCT |
AATCTGCCGT |
GGGCCTGTCG |
TGCCAGTCCT |
|
23041 |
GGGGGCGAGA |
TCGGGGTAGA |
AATGCATTTT |
ATTCTTTAAG |
TTCACGTAAG |
ATACAAGTTT |
CAGACAGGGT |
CTGAAGGACT |
|
23121 |
GGATTGGCCA |
AACATCAGAC |
CTGTCTTCCA |
AGGAGACCAA |
GTCCTGGCTA |
CATCCCAGCC |
TGTGGTTACA |
GTGCAGACAG |
|
23201 |
GCCATGTGAG |
CCACCGCTGC |
CAGCACAGAG |
CGTCCTTCCC |
CCTGTAGACT |
AGTGCCGTAG |
GGAGTACCTG |
CTGCCCCAGC |
|
23281 |
TGACTGTGGC |
CCCCTCCGTG |
ATCCATCCAT |
CTCCAGGGAG |
CAAGACAGAG |
ACGCAGGAAT |
GGAAAGCGGA |
GTTCCTAACA |
|
23361 |
GGATGAAAGT |
TCCCCCATCA |
GTTCCCCCAG |
TACCTCCAAG |
CAAGTAGCTT |
TCCACATTTG |
TCACAGAAAT |
CAGAGGAGAG |
|
23441 |
ACGGTGTTGG |
GAGCCCTTTG |
GAGAACGCCA |
GTCTCCCAGG |
CCCCCTGCAT |
CTATCGAGTT |
TGCAATGTCA |
CAACCTCTCT |
|
23521 |
GATCTTGTGC |
TCAGCATGAT |
TCTTTAATAG |
AAGTTTTATT |
TTTTCGTGCA |
CTCTGCTAAT |
CATGTGGGTG |
AGCCAGTGGA |
|
23601 |
ACAGCGGGAG |
ACCTGTGCTA |
GTTTTACAGA |
TTGCCTCCTT |
ATGACGCGGC |
TCAAAAGGAA |
ACCAAGTGGT |
CAGGAGTTGT |
|
23681 |
TTCTGACCCA |
CTGATCTCTA |
CTACCACAAG |
GAAAATAGTT |
TAGGAGAAAC |
CAGCTTTTAC |
TGTTTTTGAA |
AAATTACAGC |
|
23761 |
TTCACCCTGT |
CAAGTTAACA |
AGGAATGCCT |
GTGCCAATAA |
AAGTTTTCTC |
CAACTTGAAG |
TCTACTCTGA |
TGGGATCTCA |
|
23841 |
GATCCTTTGT |
CACTGCCTAT |
AGACTTGTAG |
CTGCTGTCTC |
TCTTTGTCCC |
TGCAGAGAAT |
CACGTCCTGG |
AACTGCATGT |
|
23921 |
TCTTGCGACT |
CTTGGGACTT |
CATCTTAACT |
TCTCGCTGCC |
CCAGCCATGT |
TTTCAACCAT |
GGCATCCCTC |
CCCCAATTAG |
|
24001 |
TTCCCTGTCA |
TCCTCGTCAA |
CCTTCTCTGT |
AAGTGCCTGG |
TAAGCTTGCC |
CTTGCTTAAG |
AACTCAAAAC |
ATAGCTGTGC |
|
24081 |
TCTATTTTTT |
TGTTGTTGTT |
GTGACTGACA |
GAGTGAGATT |
CCGTCTCCCA |
GGCTGGAGTG |
CAGTGGCGCC |
TTCTCAGCTC |
|
24161 |
ACTGCAACCT |
GCAGCCTCCT |
AGATTCAAGC |
GATTCTCCTG |
CTTCAGCCTT |
CCGAGTAGCT |
GGGATGACAG |
GCACTCACCA |
|
24241 |
ATATGCCTGG |
GTAATTTTTG |
TATTTTTAAG |
TACATACAGG |
ATTTCACCAT |
GTTGGCCAGG |
CTAGTTTCAA |
ACTCCCGGCC |
|
24321 |
TCAGGTGGTC |
TGCCTGCCTC |
AGCCTCCCAA |
AGTGTTGGGA |
TTACAGGCGT |
GAGCCACTGG |
GCCCTGCCTG |
TATTTTTTAT |
|
24401 |
CAGCCACAAA |
TCCAGCAACA |
AGCTGAGGAT |
TCAGCTCATA |
AAACAGGCTT |
GGTGTCTTGG |
TGATCTCACA |
TAACCAAGAT |
|
24481 |
GCTACCCCGT |
GGGGAACCAC |
ATCCCCCTGG |
ATGCCCTCCA |
GCCTTGGTTT |
GGGCTGGAGT |
CAGGGCCTGT |
ATACAGTATT |
|
24561 |
TTGAATTTGT |
ATGCCACTGG |
TTTGCATTGC |
TGGTCAGGAA |
CTCTAGTGCT |
TTGCATAGCC |
CTGGTTTAGA |
AACATGTTAT |
|
24641 |
AGCAGTTCTT |
GGTATAGAGC |
AAACTAGAAG |
AACCAGCAAT |
CATTCCACTG |
TCCTGCCAAG |
GTACACCTCA |
GTACTCCCCT |
|
24721 |
TCCCAACTGA |
AGTGGTATGA |
GGCTAGCTCT |
TTCCAAAAGC |
ATTCAAGTTT |
GGCTTCTGAT |
GTGACTCAGA |
ATTTAGGAAC |
|
24801 |
CAGATGCTAG |
ATCAAATAAG |
CTCTGAAAAT |
CTGAGGAACA |
TTGTAGGAAA |
GGTTTGTTAA |
GCATCTCTTA |
AGTGCCATGA |
|
24881 |
TGAGCATAAC |
AGCCGGCCGT |
CGTGGCTCAC |
GCCTGTAATC |
CCAGCACTTT |
GGGAGGCCAA |
GGTGGGAGGA |
TGACAAGGTC |
|
24961 |
AGGAGTTCAA |
GACCAGCCTG |
GCCAACATGC |
TGAAACCTCA |
CCTCTACTAA |
AAATACAAAA |
ATTAGCTGGG |
CATGGTGGCA |
|
25041 |
CATGCCTGTA |
ATCCCAGCTA |
CTTGGGAGGC |
TGAGGCAGGA |
GAATCGCTTG |
AACCCGGGAG |
GCGGAGGTTG |
CAGTGAGCCA |
|
25121 |
AGACAGTGCC |
AGTGCACTCC |
AGCCTCGGTG |
ACAGCGCAAG |
GCTCCGTCTC |
AATAATTAAA |
AAAAAAAAAA |
AAAAAAAAAA |
|
25201 |
GGCCGGGCGC |
AGTGGCTCAA |
GCCTGTAATC |
CCAGCACTTT |
GGGAGGCTGA |
GGCGGGCAGA |
TCACCTGAGG |
TCAGGAGTTT |
|
25281 |
TGAGATCAGC |
CTTGGCAACA |
CGGTGAAACC |
CCATCTCTAC |
TAAAAATACA |
AAATTAGCCA |
AGCATGCTGG |
CACATGCCTG |
|
25361 |
TAATCCCAGC |
TACTCGGGAG |
GCTGAGGTAC |
GAGAATCGCT |
TGAACCTGGG |
AGGCAGAGGA |
TGCAGTGAGC |
CGAGATCACG |
|
25441 |
CCATTGCACT |
CCAGCCTGGG |
GGACAAGAGT |
GAATCTGTGT |
CTCACCAAAA |
AAAAAAAGAA |
AAAGAAAGAT |
GCTTAACAAA |
|
25521 |
GGTTACCATA |
AGCCACAAAT |
TCATAACCAC |
TTATCCTTCC |
AGTTTCAAGT |
AGAATATATT |
CATAACCTCA |
ATAAAGTTCT |
|
25601 |
CCCTGCTCCC |
AAAC |
|
>ref|Gene_ID:1508|CTSB|NC_000008.10:11700032...11725645 (-)
GGGCGGGGCCGGGAGGGTACTTAGGGCCGGGGCTGGCCCAGGCTACGGCGGCTGCAGGGCTCCGGCAACCGCTCCGGCAA
CGCCAACCGCTCCGCTGCGCGCAGGCTGGGCTGCAGGCTCTCGGCTGCAGCGCTGGGTGAGTGCTGGGGACCCGGGGCCA
CCGCAGCGTAAGTGACCTTGGCGGGGACGGTGCTACCCGGCCGCCGAGACGGGTTCCTCTGCGCCCTCAGTCGGGCCCAG
GCGCGGCCCCGCGGCGTCCCTGGGGGCCGGCGGGGAGCCGGGACCCTCGGGACTGTCCCTGACGGGCGGGCTGGGGTGGG
AGTCCGCGCGCTCCGAAGCGTCGGCGAGAAAAGCAGAAAACAGCTCCGCCCGCCAGCCCTCTGCCCTCCGCTCCCCTCCC
CGGGCTGTGCGCCGGACCCCGGCCCTCGGAGCGGGGACGCGGCCAGGACCGCCGAGGGAGGCGCCTGCGAGGAAGAGCTC
GGCCGGGTCCGGAGACTGCTGCCTGGGACCGCGCTCCCAGCGCCTGGGCCTCGGTGTCTCCGGGCCAAACTGCCGACATA
ATCGCATCTGCCGGCATCTATTTTCGGTTTATTTCCCCCTCATTGCGAAGGATTTGCCTGGCCAACTTTCTGCGCAAGAT
CCCACGCAATTCCTGGGACCCCAGAAGACAGGTCCTGTTGAAGAACAGGAATCTGGCACTGGGTGGGCTGGGGAGGAAGC
CGCACGGTGTTAAATCCATAAACAGGAAGAGAAACCAGACAGCGAAACCAAGAGGCGAATGGGCGATTGGATGCCGGTGG
GGAGAAGGCCGGGGGCGCACCCTGCTCCTGGACTCCAGTAAAGGGAGGCCGGGCAGAGTCCCTGGGGCGCCACCTCCCCC
TCGGTTAGTAGCCCTGGAGGCCGGGGGGAGTTGGCCTCTGGGGAGCAGTGGGTGCTGGGTGTGGGGCGTTGCAGGCAGGC
TGGGGTGGGCGACCCAGGTGGAAGTGAATTGCACTTGGCTTCCTGGTGGGCCTCTGTCACCCCCTTCCCAGGCGCTGAGA
AAGCCAGCAGGCTGGCAAAGAAAAGGACCCTAGCGCAGGCCCCACACTCCTCCTCCTAACGGACGAGAGACCCCCCAAAC
CCACTGGAGAAGTGACGCTGTGGGGTTCAAATGCAGACCTGGCACCTTTTTGTAGCCTGGAAAAACATTCCCACTGCCTG
CTGCCGGAGGAGAGGATAGCTGAGATGCACTCTCTTTGAATCCAAACGTTCAGGAACGTAAGGCGAAGAGGCCTAAGAGG
GCGTTGGCTGGCTCTGTCTCTCAGGCTGGAGCACAGTGGCGCGATCTCGGCTCACTACAACTTCCGCCTCCCAAATTCAA
GCTATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGTGCCCGCCACCACGCCCAGCTAATTTTTGTATTTTTA
GTAGAGATGGGGTTTCACCATGTTGACCAGGCAGATCTTGACCTCCTGACCTCAGGTGATCCGCCTGTCTCGGCCTCCCG
GTGAGTCACGGTGCCTGGCCAAGAACTGTTTCTTGTTGGCTCTGGTGCTGGTGACTTAGAACCCGCCAGCTCCTGGAGAA
AGGGGCTGGGCCGCCCACCCTGTGTAGCTTTCCCAAAGACAGAGTCAAACGTCTCCTGGAGAACAGAGGCTTCCCTTCGT
CTTTGGTCATTTGTCCTCTAGCTGGGGGTACCCCCTGGTGGAAAGGCACAGGTCCCTTGCTCCCCAGGTGGCAACGCAGG
CCAGACACGGCCCTGGCACAGCTCTCCTGGGTGTTGGCTCAGGACAGCCCTGTTTCCAACTGGTTAGGCGGTGAGGGGTG
GTGGCCCTTTGGTTCCAGGTTGAAACTGCCCATGTGGTGCTGATTTAGCAGACTGGGGAGGCTCTTTTTGTAGGCAGGTT
CTTTTCTTTCCCCAGCTGCTGGACCTGGGAGTTGGAAGAGAAGTTGCACCCATTTTAGGGGTAACAGATATTTTCTGTTG
CTCTTGGTTGGATTGGGAAGTGAATTGAAGGGAGGTCACGTTTCAGGGGTGCCTTGGGATGTCTGTCAGTGATTTTCTTT
TCTTTCTTAATTTCTTTTTCTTTCTTTTTTTTTTTTTTTTGAGACACACTCCCTCTATCGCTCAGGCTGGAGTGCAGTGG
TGCGATCTCGGTTCACTGCCACCTCCGCCTCTCATGTTGAAGCAATTCTCCTGTCTCAGCCTCCCTCCCAAGTAGCTGGG
ATTGCCAGTGCCCATCACCACACCTGGCTTTTTTTTTTTTTTTTGTATTTTTAGTAGAGACGGGCTTTCACCATGTTAGC
CAGGCTGGTTTTCGAACTCCTGATCTCAAGTGATCCGCCTCAGCCTCCCAAAGTGGTAGGATTACAGGCATGAGCCACCG
CGCGGTGGAGGGGTAATTTTCTTAAATCTGGTAATGAGTTGTGGTTGTGTAGAGTAACATACCGTCCTTTCGAGATATGG
ACTGAAACATTGAGAGGGAGGAGTTACAGGTATGTCGATTCTTCTTTTCTCTCTCTCCTTTTTTTTTTTGAGGTGGAGTC
TGACTCTCTCACCCAGGCTGGAGTGCAGTGGCAAGATCTCAGCTCACTGCAACTCCGCTTCCTGTGTTCAAGCCATTCTC
CTGCCTCAGTCTCTCAACTAGCTGCGATTACAGGCATGTGCCTCCACACTCAGCTAATTTTTTTATTTTTAGTAGAGATT
TTTTGTCTCTCCTAAAAAAATCCAATGTAAAAAAATCCCAATGTGGGGGTTTTGCCACGTTGGCCAGGCTGGTCTCGAAC
TCCTGACCTCGTGATCCGCCCACCTCGGCCTCCCAAAGCGCTGGGATTACAGGTATGAGCCACTGCGCCCAGTCTGCTGC
CTCTTTTCAATGGTCTGGCCTAAGGAAATTATTGGAAACATGTGCGGTTGAGTGATATTTACTGGGCACTTCCACATGGT
CCATGTAAAGGGAGATGGTTGGGGTGACAGGCAGTTGAGTCTAGGGGAGGCATGTACAGATGTGCTGTGCCTCTGGGATA
TCAGGGTGGCAGGCAGCAGTCTCTACGTCTGGTCCCAGGCTGCCTGAGAAAGAGCATGTGGGAGGCAAACCTTGCGCCCT
GGCATGGTTGTTAATGTTTATATTTACCCTAGCTTGTGTGGGGTAGGAGGTTTAGGGATCAAATTCCACTCTGTGTTTAG
ACATTTTTTTCTTTCTTTTTTTTTTTGAGACAGTTTCACTCTGTCACCCAGGCTGGAATGCCGTGGCACAATCTCAGCTC
ACTGCAACCTCACCTCCCAGGTTCAAGCAATGTTCCTGCCTCAGCCTCCGGAGTAGCTGGGATTAGAGGCATGCACCACC
ATGCCTGGCTAATTTTTGTATTTTTAGTAGAGACCTCAAATGATCTGCCCGCCTCGCCTCCCAAAGTGTTGGGATTACAG
GTGTGAGTCACTGTGCCCAGCCATGTGTTTAGACTTTTAACTAATCTCTTTTTTAGTTTCAAGCCTTATCGTCCGCCTCT
GTAGACCACTCCTGTGCCTGTTTCCTGATCCTTCCAAGGGCCATTGTATTCCCTGTCTGCTGCCCCTCTTTTGGATTCTT
CTGCACATTTTTTTGTTCATGCATTCATTCATTTATTGTTTGATTAATGACAGGGTCTTGCTTTGTCTCCCAGGCTGGTG
TGCAGTGGTGCGACCACGGCTCACGGCAGCCTCAGCCACCCAGATGTAAGCGATCTGGTTCCCACCTCAGCCTCCCGAGT
AGTAAGTAGCTGGGACCACAGGCGGTGCCCAGGTTTTTTTTTTTTTTTTTTTTTTTTTCGTTGGTAGAGACAGGGTCTCA
CTGTTGCCAGGACTGCTCTGAAACTCCTGGTTTCAAGTGATCCCCTTGCCTCCTCCCACTTAGGCCTTCCAAATGCTGGG
ATTACAAGGCATGAGTCACCTCCAGGCCTTTTTGTACTTTTAAAACTCTGCATCAGTGTATAAACAATGTTATTAAAGTT
TATATGACTTCAGTTACACTACATGGATCCTTTTTCACTCACGGTTGTGAGATTTATTTCTGTTGCTACATCCAGTTCTA
GTCCATTTGTGTTAAGTGCCCAGTGTGTTTATCTACTGAGGGACAGTTATGTTATTTCGTGTTCACTATTATCCCATGCT
ACAATAAATATCCTGTGTCTCCCAGATACTTAGAAGAGTTTCTGCAGGGCACATGTGGGAGAGTTTGTTTCTGGGTCATG
AGGTGTGTTCATCTTCCATCTTGCTAGATGCTGGCAAAGGGTTCTCCAGTGTGGTTGCATCAATTTACTCGCCCAGCAGT
GTGCAGAGTTCCTGTTCCTCACATTTACCAACACTAGATAGTACCAGACTTTGATTTTTGCCAATCTGATGGGTTTGAAG
TGGTACCCCTGTTTTAATTTCATGACCAGAAATTCAAATTTAATCTTTGCTGTAGGACAACAGTAACCCACTTATGCCTA
GTGTTCCATTATTAGAACGCTAAGCATGTGGGAGTTTTTACATCATACTGCTCAAGGTCATCGCCAAGGTCTGATGTTTT
TACTCGTGCAAAAATTTAAAAAATTGCAACCTCTGGCATAAATGGGTTGAGTGACACTTTTCCTGTTTTTATTGTTGGTC
AGTGATGGCATATTTGCTGGGTTTTTTTGTTTTTTTTTTGAAACGGAGTCTCACTGTGTCGCCAGGCTGGAGTGCAGTGG
TGTAATCTCGGCTCACTGTAACCTCCGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCAGCCCCCTGAGTAGCTGGGATTA
CAGGCGTGTGCCACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGACGGGATGTCACTATGTTTGTCGGGCTGGTGTT
GAACTCCTGAGCTCATGATCTGCCTGTCTTGGCTTCCCTAAGTGCTGAGATTACAGGCCTGAGCCACCGCTAGCCTATTT
ATTTTTTATTTTAAATTTTAATTTTCTATATAGAGACGAGGTCACTATCCTGCCCAGGCTGGTCTTAATCTCCTGGGTTC
AGTCAATCTTCCCACCTTGGCCTCCCGAAGTGTCAGAATTATAGATGTGAGCCACTGTGCTCAGCCCAGAACTGATGTTT
TCTAAATGCTGGGTGCTGAGAAGGATGTGTGGCTGGCAGTCTTGACTGTGTTATCTGTCTTTACCAGGCCAGTAACTTCT
TTGGTCTGGTCATCAAGATAATCTAGCATCACCAGCAAGCATGCATGGAGAAGGATGGGCCCAATGTGGCCAAGATGGTA
ACGGGACCAGTAGAGAGCCCTGTAGAAGACATCTAGATATTCTGCCCTAAGAGCCCGGAGGGCCGGGCTGTCTCATGACC
CTCTGACGTGCTGACCTGGACTCTGGCAGAATGTGCACACACACAGTCACACAGCTTCCTGGCTTGCGCAAGTCCCAGGA
GGGCGGTGCCAGCCACAGGCTTTTCCCATTCGAGGGTTGGAAGCGTATCATCAAACCACATCAGAGTGCTGGGGGCCACC
TGCCACCCATTCCCAACCCACTCAGCCTTCCTGGTGTTTGGGACATGCTTTGCTTTGGCAGTCAAGACAGCAGAACAAAT
CAACTTTTAAGGCCTTGTCACTGATAGTACAATTTCCATTATTTTTCATCCAAATTAGGATACTTCTGAAAATAGAAATG
ATGACTCTGGGATGCAAACGTTGGCTGTCCTATGTATAAGGAGATGGCTTTTCACGCTCCCAGTGACTGAGGAAGTTTCT
CCCAGATGGCGCTGCTCTGAGCCTGGTGCAGGGTAGGCACTTTCAAAAGAGTGTCTCCTTGTATCTTCCATCAGCCTTGC
GAGATGGGTATCTGTTCCCAGGGCCCCAAGGGAGGAAAACAGGACCTAGCTGGATCCAAGAGCTAGGCCTTTCTTTTTTT
TTTTTTTTTTGAGATGGAGTCTGACTCTGTCGCCCAGGCTGGAGTGCAGTGGCGTGATCTCGGCTCACTGCAAACTCCGC
CTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCGCCTGCCACCATGCCTGGCTATT
TTTTTGTATTTTTAGTAGAAACGGGGTTTCACCGTGTTATCCAGGATGGTCTCGATCTCCTGACCTCGTGATTCGCCCAC
CTCAGCCTCCCAAAGTACTGGGATTACAGGCGTGAGCCAGCATGCCCGGCCCAGAGCTAGGCCTTTCTGTGGCTGGCCTT
CGCGTCAGCCTCAACTACCCTGGTGTAATCTCGCCTGCGGTTGAATTAGGGAACCGCCGTGTTCTGCAAGCTGGAGAGGC
AGAACACTAATGAGCAGAACACTAATCTCATTGCAATCTCAAAGGATCTCTAAAAGCTTTTATAAAGCAGGCCCAAGGTC
CTTTGGTATCCGATGCAGACGTGGTGAATGCATTGGCTCTGTCAGCATCTGAGCAAGTCAGTAACAGAAATGGGGAGTAA
AAGCTTTCAGAACTTTCCAGAATATTGACTAAATTGTCTTGTTTACAACCAACAACGACAACAAAAAATAACTGCTGAGG
GCCTTCGTAGTGTCTGCTGTTTCAAGTGTACAGTAGTCATTTTGTCTGCAGGATGTGGGGTTGCTGTGGCTGACCTTGTA
CAATATTCCACTCATAGGTGTCTTCAGGCCTATGGAGAGCAGCTTGCGTGGGCTGGGCCTGCAGTACCTGGTTTGCATAG
ATGATTGGCAGGTGGGCAGCACGGGGAAGGACCTGTGAGTGGCCAACCTGGTTCAGGTGAGGGAGGTGGAGTGGGGCTTC
TCTGCTTCCCCTGGTTCCCTGGAAGCCTCCAAGGCTGGTGAGCATCACTGCTGCCTCTGCACACCTGTGTGCTGGGTGGG
TTTTCTGACAGGTTTTCAGTTGCTTCGGGGCTACAGCTGCAGGGAGCCTGCTCCATGGGACAGATGGGCCTCTGGTGCCC
GTTCATCAGGGGACTGATGAGACCGAGGCCTGAGAGCCCTTTGGATTTTGTTTTTGTCCTTAATTTAATCATAAGCCAAG
AATCTACTAAACACAGTTCCATTAGGGGCAAAGACGTAACACATCAGAGGCCACAGCAAGGCTGTGATTCATACTCAAAA
AGGAAAGGTCTCTGGGTCACAACAGAGCATAGTTGAGGTCAGCACACTCCCACCCAGTGCAGGGCTGCTCCAGCATTGAG
GTGTGTCTGGCAGGTTGAAGTAGGGGAAGATGAAACTCGCCGAAGTCTTGTTTTGTGGTTGCACTTAAGTGGTCAAAACT
TCAGGAGCAACTGCCGTTATTAGCGGTGAGTGCCAAGACTAGTTTTTATAGAAGAGAAAGAAACAAAGTACTCTGGGAAG
GTCTTACTGAGCCTTCACAGTCTCCCCACCTTTCCACTGTTCCCGTGCTCTTAGCCGCTCTGCTGGCCTATAAGGCACAG
TCTTCATTTGTGGCTTCTGGCAAAATGTAAGCACTTGACTTTTGTTTTTGTTTTGTTTTGTTTTGTTTTTTTGAGACGGA
GTTTCCCTCTTGTTGCCCAAGGTGGAGTGCAATGGTGCGATCTCAGCCCACTGCAGCCTCCACCTCCTGGGTTCAAGCAA
TTGTCCTGCTTCAGCCTCCCGAGTAGTTGGGATTATAGGTGCACAACCACCACGCCTGGCTAATTTTTTGTATTTTTAGT
AGAAATGGGATTTCACCATGTTAGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCACCTCCTTGGCCTCCCAAAG
TGCTGGGATTACAGGTGTGTGCCACTGTGCCCGGCCAACTTTCAATTCTTTAGAGCTGACTATGAGAGGAGCCAGCAGTA
TAGCCACAGCACCAACGAATGAGGAAGAGCAAAATACTGCATGACAGCTTTGCTAAGAATTCTTTCACTTTTTTTGTCTA
TCAGCCAGGAGCTAGCAACTTGGCTTATTTGGAAATTTTAAGTGTACATATCCTGTCTCCTTAAATCCTTTACAGATTTA
AAGTGCAGTCTACCTGAGGGCTCTGTGACCATGTAAGAAAGCTTTTTCTTTCTTTTTTTTTCTCTGAGACAGAGTGTTGC
TCTGTCGCCCAGGCTGGAGTGCAGTGGTGTGATCTTGGCTCACTGCAACCTCTGCCTCCTGGGTTCAAGCAATTTTCCTG
CCTCAGCTTCCTGAGTAGCTGGGACTACAGGCAGCACCACCATGCCCGGCTGAGTTTTGTATTTTTAGTAGAGACAGGGT
TTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCGTGATCCGCCTGCCTCAGCCTCCCAAAGTGCTGGGATTAC
ATGCGTGAGCCATTGTGTCCGGCCTTTTTTTTTTTTTTTTTTTTTTTTTGAGACAGAGTCTCGCTCTGTTGCCCAGGCTG
GAGTGCAGTGGTGTGACCTCAGCTTACTGCAACCTCCGCTTCCTGGATTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGT
AGGTGGGATTACAGGCACCCACCACCGTGCCTGGCTAATTTTTGTATTTTTAGTAGAGACAGGAGTTTCACCTTGTTTAG
TAGAGACAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCGCCTGCCTTGACCTCCCAAAGCGCTGGGATTACAGGCAT
GAGCCACTGTGCCTGGCCAGAAAGCCTTCTTTATTGAGCTTGGTGGCAGCCCAAAACTGATTCTTTAAGGGTGTCAGGAC
TTAACACCTCCTGTGACTTAGCCGCACCTCCTCTCCTTTGACTTTCATTCCACCTCCTTCCAGGATCGCAAGGTCCCTAT
TTGTCCTGGAAACGGCTTCAAGGTAGTCTAGGGTGCCGTTTGCCGGGGGAGGAAGGTGCTCTGGTTGATAGAGTCGCCTG
GCCGCACACTCTTTTTGGCACATAACAACGTTCTACAGAGCCGGGGTGGAGCGTGCTTTCTCATAAGTGCTCTGCAGGTT
TGGAGAGAGAGGATATGAGGAGCACCCTTTTCTGTTTTTTTTAACCCAAAGATTAGCTTGGAAAAGGGGCAGAGGGGTGC
ACTGGAACTCAGGTCTGCCTAAGCAGCACAGCAGACCAAGGTCTAGAGATGACATCTGCTCGCAGCTGTTCTTCCACCAG
CCCGCATCCTGGAAAGGGGTCTTGTGGCACACAAGAGTTCACATCCTTCCCTCGTGAAATAAGGACTTTGTGTTCATCAT
CTCTTGTAAGAAGCAGAGCAGAAAGCACAGAATTAAGAAATAAAAGGGAAGTGGGTGCCTATATAAAGGGAAGTGAAAAT
GGGTTGCTGTCCCATGCAAAGACCCTGGAAAGCTGTTAACAGCTCAGCTTGTCACTTTCACCATCTGCATTTGTCCAGAG
TGATTGAGATTTGCGTTGTTGTGGAGAGAAAGGCGCCTGTTGCACAATGGAGTGAGATTGCCACTGCTGTCAGGACCTCT
GTGTTTGGCTTGACACTTTTTGAGTTCTCAGCAGTCTCGGGACCCTCAAGAGTGGAAGCATTTTTGGATGTTAAATGCTG
GGGTTAATTGAAGTTAAGAGCTTGTTTTACTGGGCATGGTGGCTCACACCTATAATCCCGACACTTTGGGAGGCCAAGGC
GGGCAGATCACTTGAGTCCAGGAGTTTGAGACCAGCCTGGGCAACATAGCAAAACCCCATCTCTACAAAAAATACAAAGC
TGGGCGTAGTGGTGTATGCCTGTAGTCCCAGCTCCTTAGGAGGCTGAGGGGAGCAGATCTCTTGAGCCCAGGAGGCAGAG
TTTGCAGTGAGCCATGATCGTGCCACTGCACTCCAGCCTGGGCAACAGAGTGAGATCCTGTCTTAAAACAAACAAAAAAA
ACAAACTTGTTTTCATTTAGACTCTTCCTGGCGTTGGGGACCTATTGGAATAGGTTTAGTGTGAACTGAGAGCTAGAAGT
GTTAGAGGAGAGAGGGAGGGAACAGAGCCCGCTGGAGCGAGTGCCCTTCCTACCTTATCACTGCATGCCAGGCATGTGCC
GGCGCTTTGGTCCTCCTCATTTCATTCTTGACTGCCACCTGAGACACGATGGTTACTAGCTCCATTTTATAGGTGGTGAA
ACTGAGGCTTGGGGAAGGTCAGACCCCAAGGGTGCCATTTAGTCAGTGGCAGAGCCAGATCCAAATGCAGGTCTCCTGAC
TCCAAGTGCAGGGCTCATTTTATCGTCCGGTTGCAGCACGCTGGCGGCCCCTTGAGCCCCAACCTGGATACCATAGGGGA
GGAGCAGAGAAGCCAGGAACACCACAGCCCTGGGCCAAGGTGCGGGGCTGAAAGAACTTCCCAGCGCTCAGCCTGGGACT
AGTGGAATGGGCTGGGCCCTGGGGCTGGCAGCGGTGGCCCCGGGGAGCCTGGGAATGAGTAGGGAGCACAGGGAGGTGTG
GGAGGGCCTGGGAACCATGAAAAGGAGGGCGGGTGCAGGGAAGTCGCCTGCTAGTGAAGTGGCGAGGGGGCCCTGTGGGA
CTCCAGGGAATGGCCACGGCAGGTTGTCCTCCAGGAGTTGAGAGCCACTGGACATGGCAGCTGCCTGTGTTCTCAGCCAC
CACAGTAACCAAAGAAATCTTGGTTTTAAAATTCAAGTTGCCATGGAAACGCTCCCATCCTCGACTTGGCTTATTATTTA
AAATAACATCTCTACAGCACAAAGCCCCCGGGTACATCCAAGGACACTGCTGTCTGCCCACGAGACATGCTAACCTCACA
GTGTGGAGGCTGTGTGGGTCACTGACATGCATGGCCACGTGAGACGCTGCCTACCCACGAGTCACGGAAAAGGGGAAGAT
TATTAAGAAAGTCACTAGGGGCCAGGTCCAGTGGCTCATGCCTGTAATCCTAGCACTTTGGGAAGCTGAGGCAGTTAGAT
CCCTTGAAGCCAGGAGTTCAAGACCAGCCTGGCCAACATAACGAAACACGGACTATACTACAAAAATTAGCCAAACGTGG
TAGCACAGGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCACAAGAATCACTTGAACCTGGGAGGTAGAGGTTTCAGTG
AGCCAAGATTGCGCCACTGTACTCCAGCATGTGCCACAGAGCGAGACTCCCATCTCAAAGTCACTAGGGAGGAAGCCTCA
TTGGTGGGAAGGAAGACCAAATTGGAAATGCTCTGAGGAATCATTAAAACAAATGTCCTTTTATCAGTTTGGTGGCTCAG
GGCCTTTAGTAATACTGCCAACTATTTTTCCTAGAAGAAACAAAACTGAAATAATAGGAACATACTCACTTTTTTTTTTT
TCTTAAAAGTAAGGGTATGTTGTGAAAAAAAGTCTCCCCACCGTAGTGACCGACTGCCGTGCATCTTCCTTGGCATTTTG
CATGTAGTGGCAGGAGTGTTCCTACATGTGTAGATTGCTGAGAGGGTCAGATGCTTATGGTCCTCAGTCACCCACAGCTT
GCTTTTTCCCCACTTAACATTGGGACTTGGGGCATTTTTACTCTGTTAATACAATAGAATTCACTTCAACTAGTTGGTTT
TTTACTTTTATTTTATTATTATTATTTTTAGATGAAGTCTCAATCTGTCGTCCAGGCTGGAGTGCAGCCTCTGCCTCCTG
GGTTCAAGTGATTCTACTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCATGCGCCACCACGCCTGGCTAATTTTTTG
TATTTAGTAGATACAGGGTTTCACCACCTTAGTCAGGCTGGTCTCTAACTCCTGACCTCAGGTGATCCAACCGCCTCGGC
TACAGGCATGCGCCACCGTGCCCCACCAACTAATTGCTTTTTTAATGGTTGCTTCATATTCTATTTAACCACTTACCGTA
ATTTAACAGTTACTCTGTTATTGGATACTTGATTCATTTCCAGGACATGCAGATTTAGAAACTGGTGGGCGTTGCTGGTT
GATGTCTCGGTGGTTGTGCCCACCCACCCCAGGCACTCATTATCATGCCTTCCGTTTCCCCACTTCCTAAGCCCTGCAAG
AAGTGCCAAACTGTCCAGCCCTTGCCAATCTGATACATGCTGAATACCTCCTTGATTTTATCTGCACCTCCCTGAGGTTG
AACTTATTTTACTTTATTTTATTTTATTTTTGAGGCAGAGTCTCACTGTCACCCAGGCAGGAATGCAGTGGTGAAATCTT
GGCTCACTGCAATCTCCACCTCCTGGGTTCAAGTGAGTTGAAGCAATTCTCCTGTCTCAGCCTCCCAAGTAGCTGCGATT
ACAGGCACCTGCCACCACACCTGGGTAATTTTTGTATTTTTAGTGGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCT
CAAACTCCTGACCTCAGGCGATCCACCTGCCTCAGCCTCCCAAAGTGCTGAGATTACAAGCTTGAGCCACCATGCCGGTT
GAACTTATTTTTATCTGTGTTGGCCATTTGTAGTTTTTCTATTATGCTGGTTCCATTTTTCTGACTTTGAAGAGCCTTTT
GTGGAAACGAAGCCTAAAGTATATGTGAGTACTGCTTTGTTTCGTCAGTTTTTGTTTTAAAGAGACTTTTCAGTGTAATG
CAAGCATTTTCCCTTGAAGGCTGTGTTTCCTGTCAATCCTAAGCTTTCCTCCAGCATTACCTTTTTAAAAAACTTTATTT
TCTATATTGATGGGGTCTCACAATGTTGTCCAGGCTGGTGTTGAACACCTGGCCTCAAGTGATCCACTTGCCTTGGCCTC
CCAAAGTACTGGGATTATAGGCATCAGCCACCGCACCCAGCCTGTTTTTCAAAGGGCATTGATTTTTTTCATAAAACTTT
TTAAATTAAGATCTGTGGGCCTGGTGCGGTGCCTCACGCCTGTCATCGCAGCACTTTGGGAGGCTGAGTCAGGTGGATCA
CGAGGTCAGGCGTTCGAGACCAGCCTGGCCAAAATGGTGAAACCCTGTCTTTACTAAAAATATGAAAATTAGCTGGGCAT
GATGGCACATGCCTGTAATCCCGCTGCTCGGGAAGCTGAGGTAGGAGAATAGCTTGAACCCAGGAGGCAGAGGTTGCAGT
GAGCCGAGATCATGCCATTGCACTCCAGCCTGGGGGACAGAGTAAGACTCCGTCTCAAAAAAACAAAACAATTCTGTGTT
GCATGTGGTACTTTTTGTGTGTGAGGAGTCCAGTGTGAAGATTCAGACTTCAGGCAGCCACTTGTACAAGCACTGTCCTG
TTTCCTCTCTGGCCTCACCTAGGTAACGCTGATTCCTCCACGGAGGATGTGCTTCTGAGTGGTCCGTTGGGTGCTGTGCT
GATGAGCATCACCCAGCATTTTACGACACATGTGCTGCCCCAGAGGGCTGGGCTCCCGTCAGAGCTCTTTTCCACTGGCT
GGGTGCGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCCAGTGGATCACCTGAGGTCAGGAGTTCGAGA
CCAGCCTGGCTAACATGGTAAAACCCCATCTCTACTAAAAATACAGAAATTAGGTGCGTGTGGTGGCGCACACCTGTAAT
CCCAGCTACTTGGGAGGCTAAGAACCTGGGAGGTGGAGGCTGCAGTGAGCCAAGATTGTGCCACTGTACCCCAGACTGGG
TGACATGGCCCACAGACCAAGACCCTGTCTCAAAAAAAAAAAAAAAGCTTTTTTCCAGAGCCATCTTGCTGTCCTCATAT
ATTTATTCTCCTAGATGAATTGTTGCTTAAGCAACAATGAAGTTTAAATTGTTCTTCAGATAGAACCCCTAGGTGCCAGG
CAGTTTGTTAAGCACTGACACCGCAGTCCTGTGACTTTCATGACGACCTGTGACCTGTGGGAGGTAGAACACCTCACCCA
CCCGTGGGAGCCCCTGGAGTGACTGACAGGAACCCCTGCCTCACCCAGCTCCCCGCGCCGCGGCTCTCCCCAGCCCTGGA
TGCAGGCGCCCAGGAGACATGTATTGCTTTTGTTGAGCTGCTCACTTAGGAGGGTGACTCAGAGTTCAAGCTGGAAAGCA
CTGGCTTGTGACTCATGAGCCAGTGCAAGATCAAACGGCTGGGGCATCGAGGTGAGAGTTTTCCTTCTCAGAAGCCTCAT
CTCCGCAGCCGGAAGCAGAGCCCTTGGCTGACTGGAAAAACCAGAGAGGCCCCGGGAGTGGGTGGATGGCCAGCCCAGCC
CCACCTCTCAGCCTCAACCTCCACCAGCCCACACCGAGCTTGTTTGTCTGTGTCCTGTCGAAACTAGGACTCCTGGATTG
TAACTTTTCTTACATTTCCCTGTCCCCTGGGTCCTCCACTTAGGGTGTAATCACACAGACAGGCTCTTAACTGTTTCATA
TCACTCACTTGGGAAAGTGTCTCAAGCTGTTCTACAAATCCATGCAAAGGCCGTTTAAAAATAGCAGCGAAGGCCCTGGA
CTCGGTCTCGTCCAGCACAGCCCCTTGGCTCTCTCTCTGGGCTCTGGCCGCCTGGCCCCCGGGGACCCACACGAGGTCAT
GGCGTGCTTCGGGCAGGGGGGCGGGGATCCCATAGACACCTCAGCTCCTTAAGAGTTCTCCGCCTGGGCCAGGACGAGCA
TGGGGGTCCCCACTGATGCCCGAGACGGTGCCCCTGTGTGTGTGAGCCCTCGACCCACATAACAGAGAGGTGTCCTGATG
CCCTCTGTCCTCTCCAGGTGGATCTAGGATCCGGCTTCCAACATGTGGCAGCTCTGGGCCTCCCTCTGCTGCCTGCTGGT
GTTGGCCAATGCCCGGAGCAGGCCCTCTTTCCATCCCCTGTCGGATGAGCTGGTCAACTATGTCAACAAACGGAATACCA
CGTGGCAGGTAGGCTGTGGGGCTGCGTCCTATGTATGTCCCCTCCCAGGTCGGTCTGTGCACACTGACTCCAGGAATGGG
AAGCCAGCCCTCACTGTGGCCCACAGAACATTGTCCTGGACTGTTGAAAAATGGCCCTGACCCTGAGTCATGTCGGCTCT
GGACCACGCCTCCCTCCTTTGTCCCATACCCCACGTGTGTGCATGAGTGTGTGTGCATGTGTATGAATATATGGTTGTGT
GCACGTGGAGGTGTGTGGGGGAGTAAGTGAGCTTCGGAAGTGTTCAAAACCAGCTAGCCCCTGTGACGCCCGCCGTGGCA
CAGTGGCAGTTTATCAGTCCCGCCCCTGATTCCCTTTGTCTCTGCCCAGCCCTGACCCTCGGACTGAGAACCAAGAATGA
GGAGTGAGCCATGTTGAGGGCTGGAGAGGGTCTCTGATTGTCAGCAAATGGGAGCAGATCAGGGGAGACACCCACTCTGC
CCGTGTGTCACTCTGGGCTGTTCCCCAGTGCCCCCCAACCCTTGGGCCCTTGAATTGGGTAGGGTCTCTGGTTTTCCCTG
GGTTGGGCTTCGTGTGTGGGCAGCGTGCCCATCCACCGCCATCCCGGGAGACCCTTCAGTCAGTGGGTGACTCTCTTCCA
GGCCGGGCACAACTTCTACAACGTGGACATGAGCTACTTGAAGAGGCTATGTGGTACCTTCCTGGGTGGGCCCAAGCCAC
CCCAGAGGTGAGTGCCTGCTCCTCTGCACCGCTGTAATGTGAGTGGCAGGCGTTGGTTTGGGGCAGTGGGAAGTGGGAGA
GTGAAGGCCTCTCTGGCGTCCGCAGGGCTCATGCTGCCCAGGGCTGCCGACACCGCTTGAGGTACAGTACTTGTTTCTTT
TCATTTTTATATTACTTTCTCTTTTTATTTTATTTTTTCCTTCTCAAGTTATTAGTAGAGAAGATGCTGGTGCTTTTTTT
TGGTTTTTGAGAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGCACGATCTTGGCTCACTGCAAACTCTGTCTCC
CGGGTTCAAGCGATTCTCATGCCACAGCCTCCTGAGTAGCTGGGGCCACAGGCTTGCACCACCGTGCACGGCTAATATTT
GTATTTTTAGTAGAGACTGGGTTTCGCCATACTGGCCAGGCTCGTCTCGAACTCTTGGCCTCAGCAAGTCATCTGCCCAA
CTCGGCATCCCAGAGTGCTGGGATTACCGGCATGAGCCACCCCCAGTGTTTTCTATTGTGCTTCTCCAAGGTGTCTCAGC
CGCAGATGTGACAGACAAGGTCTGGGTCCCCACTATGTGGGCAGACTCAATCTTGAGTAATTTAAGGGAAATCTAAGAAA
AGCCAGATGAGGCCAAGTATGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGATGCCAAGGCAAGCGGATCACCTGAGG
TCAGGAGTTCGAGACCAGTCTGACCAACATGATGAAACCCCGTCTCTACTAAAAATAAAAAACTTTGCCAGGCGTGGTGG
CGGGCGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCTAGAGAATCGCATGAACCCGGGAGGCAGAGGTTGCGGTGAGC
TGAGATGATGCCACTTCCCACGTGAGGATGGGGTAGTTAAGTTGCACGACACTGCCTTCTGCACAATGGGGATGATATGA
GAAATGATGAGAGGATCCTGGGCAGAGGTTGAGGCAGGAGAGCAGGGCAAGAAACACTATCTGGACTGAATAAACTTATT
TAAAAATGAGTTTATAATTTGTGGATAAATTCATCTTAAAAAATCCATTACATAAGCCTAGAAAGAAGCGTACAATATCG
CAAGTCAGGCCAGGTGTGGTGGATCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCAGGCAGATCATTTGAAGCCA
GGAGTTTAAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTTTACTAAAAATACAAAAATTAGCCGGGTGTGGTGGTGC
ATGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCACGAGAATCGATTGAACTCAGGAGGCAGAGGTTGCAGTGGGCAGA
GATCTCGCCACTGCACTCCAGCCTGGATGACACAGTGAAATTCTGTCCACTCCCACCCTCCCGTCCCACCCAAAAAAAAA
ATCAAAAGGTGCATTGAGCAACAGAATTTCCTTTTGTTTTATGAAATTACCAAGGACACCTGTCTGCAGCCTTCTTCGGC
ATGTCTTGTGCTCATTTCCTTGTTAGGGGAGTGTGGCCCAGCCAAAGTGGCCTTTGCAGGGATTGGGGAGGGAGTTGTGG
GATCAGAGCTTGTAATGAAGACCTTCCTTTATCCAGAGTTATGTTTACCGAGGACCTGAAGCTGCCTGCAAGCTTCGATG
CACGGGAACAATGGCCACAGTGTCCCACCATCAAAGAGATCAGAGACCAGGGCTCCTGTGGCTCCTGCTGGGTAAGGCCC
TGCTGGCTGGCGGGGAAGCGCTGGAGAGAAAGTGGGAGCAACACTGGAGAGTCTTGGGGGATTCGGGGTGGGGACAACTC
TGACAAGGCAAGTTATAGAAACTTTCTGAGTCCCAGTTTCCATCAGTACAAAAATCACAATCCCTCTGGCCATGAATGAT
GGCGAGGATTAGGTGGAGTGGCGGGCAGAGCATCCAGCAGATTGCAAGTCCACGTGTACAGGTGGCGAAGCAGCTCCCTT
TCCCTGACATGCTGGCCCGTCCGCAAATACCAGGAGCTCTCACTGCTACTCTGCTTCAAGAAAGCATCCCTTTAGTGTCA
GTGAGCTGTCTTAATTTTGTCATTTAATTGTGGTAAAATACACGTAACAGAAATGTAATAATCTTAGCAATCTTCTTTTG
TTTTCTTTTTCTTTCTTTTTTTTTTTTTTTTTTTTTTGGAGATGGAGTCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGG
TGGGATATTGGCTCACTGCAACCTCTGCCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGTCTCCCCAGTAGCTAAGACTA
CAGGCATGTGCCACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTTGCCATGTTGGACAGCCTGGTCTC
AAACGCCTCACCTGTTGATCTGCCTTCCTCGGCCTCGCAAAATGCTGGGATTACAGGCGTGAGCCACCGTGCCCAGCAAC
TATTTTCAAGTGTACAGTTCTGTAGCATTAAGTACATTCACAGTGTTGCTCAGCCACCACCACCATCTGTCCCCCGAACT
CTTTTTCAGCTCGCAAGACAGAAACTCTGTCCCCATTAACACCAATATTGTAGCCCCTGGTAAGCCCCACTCTACTTTCT
GTCTCTATGAATTTGACTCCTAGGGACCTCATACAAGTGGATCACAACAGTATTTATTTTCTGGGTGAGCTGTGTTTTTT
GTTAAGAAAAAAACACAGCCAGGTACGGTAGCTCACGCCTGTAATCTCAACACTTTGGGAGGCTGAGGCAGGCGGATCAC
CTGAGGTCAGGAGTTTGAGACCAACCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAAGACAAAACTTAGCTGGGCC
TGGTGGCAGGCACCTATAATCCCAGCTACTATAGAGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAGGCGTAGGTTGCA
GTGAGCTGAGATTGCGCCACTGCATTCCAGCCTGGGCAACAAAAGTGAAACTCCGCCTCCAGGAAAAAAAAAAAAAACCA
CACACACACATACAAACTAATACTACACATTTTGCAGATTTCAGAAATGAACCCAGTTTCCAGGCAGAGGTTCACGGGGG
TGCTTCTCTTGCTTTAAAAGCTGAGTTGGGCAAATCGTTGAGGCAGGAGAGTGGGGCAAGTGGCTCATCACGCCTCTAAT
CCCAGCACTTTGGGAGGCCAAGGTGGGCAAATCACTTGAACCCGGGAGTTCAAGACCAGTCTGGCCAACGTAGCAAGACC
CCATTTCTACCAAAGAAAATAAAAGCTGAGTTTGAGCCCCAGGAGCGTCCCCTGGTGTTGAGAGATCAGTTGCCTACAAG
GTCTGAGGTGCCCCTGTGGCCCTTTGAGGGGACTGCTGCAGAGGGCCCAGCCCGGACATGGCAGCCTCACCCGGTGGGGC
TCGTTCCTGCAGGCCTTCGGGGCTGTGGAAGCCATCTCTGACCGGATCTGCATCCACACCAATGCGCACGTCAGCGTGGA
GGTGTCGGCGGAGGACCTGCTCACATGCTGTGGCAGCATGTGTGGGGACGGGTGAGTCAGGCTGTGCTTCCACAGCGGGT
TTAGTGCTGAGAGACCCTGGGCCCCAGCTTCTCAGTGGAGGGGACTTTGAGGACTTCCTGGGACTGCTGCGAGTCAGAAG
TGTTTCGGGGAGACTCCGAGAGTCTGGCAGGCAGGGCCTGGCAAGGCTGCTGCTTCCTGGGAGGTGCCTGAACCAAGGCT
GGCCCGGCAGAGCTGTCTGAGGGGTGGCATCCCAGCAAAACAGTCAGTTTCAGAAAATGTGGTGGAATGTCTGGCCCCTG
GTCTGGTTTGTGTCATCGCATGTTCCTTTTTCCTGCTTGGGGAACCAGTTTGGGGCATTTCCTTATGTGTAGTTAGCGGG
GATGGCTCCACCTTAACAACAAGGTGGTGGCACAGAACTTTCTGCTCCAGCTGCCTGCAGCCTCGCCTTGGTTCCGCAGT
AGCGGTGTTCACTGGCGGCTGCAGATCCGAAATGCCTGAGGGCTTAAAAAAATGCATGGGCTTCTCCCCTAACTAGTTAA
ATGGGAATCACTGGCGATTCTCATTTCAGAGAACTGGTGGTGTTTTTAAGTACCTTAGGGAAGTTTAGCGTCTGCTCAGT
TGAGAATCAGTGTTCTACCCGGAAGGGCACTCGTAGCCTGGTCTGGTATGGACATGAACAGGAGAGCCTCCTGTCTTCTC
CCGGATCTTTGTGGGCAGGGGTGGGGCTGGCGGTTATTCCCTGCAAGCTGTGCTTATCTAGGGAGCGTCCCTTGGAGGGT
TTGGGGTCTGGGAGGTCTGCTCGCACCACTTGCTGCAGCTGGGGGTGGGTCCACGAGTGGCCTCGGGCACTTGGGTAGCA
CACAGTGGTCTGGAGAGCTGGTGGTGCTTCTCTCAGAGGTTTTCATTAGAGGCTGTCTTTTCAGCTGTAATGGTGGCTAT
CCTGCTGAAGCTTGGAACTTCTGGACAAGAAAAGGCCTGGTTTCTGGTGGCCTCTATGAATCCCATGTAGGTAAGTGTGT
CCCCTTGGCCACTTTCTGGCCAGATGGATTGTTTGAGCAATTAACCATCATGGCTTTATTTGCCTTTATAAACTGGGGGT
TGAGACAGAGGGGCTGCTGAGAGGTGCTAGCCAGGTGCACAGGCCTCTGGCAGGACCTGCCTGGCGTCCATGCTGCAGGC
ACGAGGCTTGCCTTGCCCCAGGTCTTCTCCGTGGGGGCTGTAGGTTGACTCCGCTTTCTCCCGCGTCCCATCAGGGTGCA
GACCGTACTCCATCCCTCCCTGTGAGCACCACGTCAACGGCTCCCGGCCCCCATGCACGGGGGAGGGAGATACCCCCAAG
TGTAGCAAGATCTGTGAGCCTGGCTACAGCCCGACCTACAAACAGGACAAGCACTACGGTAAGGGGCCTGGGGCCTGGCC
ACGGCGCACGTGGAGGCTGGGGAGCTGCTGCATCCCTCCTCACGCTGCAGCGAAGAGGTCAGGGCTGAGGAGCCTTGGGG
TCCCGGAACTCTAGGATAGAGGAGGGGGAGTGATGCCCTCTTGCCAGGAGAAGCAGCACACTCTCCACTTTCTGCCTGTT
CCACCCTGAACTCAGCCTCAGCCCTTCCAAACTGGAAGGGACCAAAGCCCTCCTTTTACAAGGGAGTGGCGTGTCCTGGA
GTCTTGAGGTATCCTGGCCTCTTCCGGGGCCTCGTGCTCATTGCTCCGTTCTCCCATTTCTGAGCTTCCAGCCCCAGCCC
TGTCCTTAGTTCTTCAGGGGAGCCTTCCTTGAGCTCCCCTGGCAGGGCAAGTCCCCTTAGATGCAGTCTTCCCCTGGAGG
CCACCAGAAATCAGTGACTGGCTGACCGTGGCGTGCCACGAGGCACGGTCACTGCCCTCCCCAACCCCGAGCTCGGTTCA
TTTTCCAGGATACAATTCCTACAGCGTCTCCAATAGCGAGAAGGACATCATGGCCGAGATCTACAAAAACGGCCCCGTGG
AGGGAGCTTTCTCTGTGTATTCGGACTTCCTGCTCTACAAGTCAGGTGCGTGCTGATGGCTGATGGCAATAGAGGGTGGG
GGTCGGGAGGGGAGCCTGGGTGCCAGGATGGTTCATGTTGACCAATAGGGCTGGATTGGGCAGGCAGGTGAGGGGCTGGG
GAAGAGGGCTGTGTGGGGCCTTCCATATAGGCTCACTCCTCTGTGGACCAGCCTGGCACCTGACTGTCTTTGTCCAGGTG
TTGTGCCAACACACACAATTCCAGAGGCCTCTCCCAGAGCCTCTGGGCTAGTTCCTTCCCATCCATTAGCGTCCAGCTCA
TACATGTTCTCTGTGGGAACGCCCCCCTCCCTGGTGGCTGGCCACGCCTGCCCTCGCCTTCCTGGCTTTGCATTGCTTTA
CCAGCATGTTTCTGTTATAATGGCAGGCCGTTCGTGTTTGCACTGGCTGACACATTTTCTTCATCTTTCTCTTTGCAAAG
GCCTAGATCGTGCCTCACACCAGTGGATGAACACACAGGGAGGACCTTGCCCTGGGTAGTGCCATAGGGACTCCAACCAA
GGAATCTGGACAGTCCCCATCCCCAAGTCCTGTCTTAGAAATCCCTTGCTCCAGATGAACAGTCTTTGCTGACTGCATCT
ATCCCATTAATATTTTGCTAGATTGGAAACTTGTAAAACAACAGTTTGACAAAGGAAGATGATTCTGGCAAGAAAAAGTT
TCTGTGAAGATGGCACTTCTTAGCATAGTCCTCGCCCTTATTATTAAAAGATGCATTTCCACTTTGACAGTGATTTCCCC
ACCCGAGAGCCTGGTTGGGCAGCCCTGAGCCCCTTAGCCAGTATCTTTCCCATCAGCTAGCATTAGCTGCCACCAGTCCT
CCCTTCTCCTCCCTCACCTCTTCAGGGTGTACCTGGAGCAGCTATTAAAACTCTTTAAGCCGTGTAACGCTTTTGACAAA
CCCTTACTTACATGGAATCCCAGTATAAAGAACAGATGAATCAGTCATCTAGCATTAAATGTTTTATCAATTAGAATGTA
TTAGAAATTCTATTTTTAAAGCCAACAAGCACATTTTGTAGGAATCTATTTCAATATTTACTCAACAACTTTGAGTGTGT
TGGGCCTTCAAAAATATATTTAATTCCCCAGTATTGTGGGACCCCCCCTGGAGCCTCCTGGTGGAGCAGCAGGTTCCCTC
TGGAAGCTGTTTCTCCTTCCCGGAGCATGACCAGCTGAGTGTGGGGGGTGCTGTGGGGCGTGGGACAGTGGCCCGGTCTC
GGTCTCAGCCAGTTCTTCCCTTTTCAGGAGTGTACCAACACGTCACCGGAGAGATGATGGGTGGCCATGCCATCCGCATC
CTGGGCTGGGGAGTGGAGAATGGCACACCCTACTGGCTGGTTGCCAACTCCTGGAACACTGACTGGGGTGACAATGGTGA
GTGGCTGCCCCCTTCCTGCCAAGAACAGTGAATTGTGAGCCACACCCCGTGGCCATCTCGGCTTTCTCTGTTCTACCCAC
CGCACAGCCTTAAACCCTGGACCTACGGCCAGGCTGTGAGCTCCTCCTAAGTGCCAGGCAGTCAGGAGTTCCCTTTTGCT
GTGAGGGCAGACTCTGAGCAGCTTCAGAGCCAACGCCTGCAACAGTTCCCACAGCATCGCGCCAGATCCTGTGATGGGAA
GGGTGACCGGGCAGGGGGCTTGCCCGTGGAGGTGTGCCCCACGGCTCCAGAAGCCTTGTGGTGTTGAGGACTGTGCTAGT
GGGCCAACAGCAGGAGTGGCCAGGGATGAGTGACTTAAGGTCTTTTAAGGATGAGTCTGACTATATTGGTTGACCCTTGT
CACACTTTAAAAGCACCTTACTTTTTATTCCCAGGCTTCTTTAAAATACTCAGAGGACAGGATCACTGTGGAATCGAATC
AGAAGTGGTGGCTGGAATTCCACGCACCGATCAGTACTGGGAAAAGATCTAATCTGCCGTGGGCCTGTCGTGCCAGTCCT
GGGGGCGAGATCGGGGTAGAAATGCATTTTATTCTTTAAGTTCACGTAAGATACAAGTTTCAGACAGGGTCTGAAGGACT
GGATTGGCCAAACATCAGACCTGTCTTCCAAGGAGACCAAGTCCTGGCTACATCCCAGCCTGTGGTTACAGTGCAGACAG
GCCATGTGAGCCACCGCTGCCAGCACAGAGCGTCCTTCCCCCTGTAGACTAGTGCCGTAGGGAGTACCTGCTGCCCCAGC
TGACTGTGGCCCCCTCCGTGATCCATCCATCTCCAGGGAGCAAGACAGAGACGCAGGAATGGAAAGCGGAGTTCCTAACA
GGATGAAAGTTCCCCCATCAGTTCCCCCAGTACCTCCAAGCAAGTAGCTTTCCACATTTGTCACAGAAATCAGAGGAGAG
ACGGTGTTGGGAGCCCTTTGGAGAACGCCAGTCTCCCAGGCCCCCTGCATCTATCGAGTTTGCAATGTCACAACCTCTCT
GATCTTGTGCTCAGCATGATTCTTTAATAGAAGTTTTATTTTTTCGTGCACTCTGCTAATCATGTGGGTGAGCCAGTGGA
ACAGCGGGAGACCTGTGCTAGTTTTACAGATTGCCTCCTTATGACGCGGCTCAAAAGGAAACCAAGTGGTCAGGAGTTGT
TTCTGACCCACTGATCTCTACTACCACAAGGAAAATAGTTTAGGAGAAACCAGCTTTTACTGTTTTTGAAAAATTACAGC
TTCACCCTGTCAAGTTAACAAGGAATGCCTGTGCCAATAAAAGTTTTCTCCAACTTGAAGTCTACTCTGATGGGATCTCA
GATCCTTTGTCACTGCCTATAGACTTGTAGCTGCTGTCTCTCTTTGTCCCTGCAGAGAATCACGTCCTGGAACTGCATGT
TCTTGCGACTCTTGGGACTTCATCTTAACTTCTCGCTGCCCCAGCCATGTTTTCAACCATGGCATCCCTCCCCCAATTAG
TTCCCTGTCATCCTCGTCAACCTTCTCTGTAAGTGCCTGGTAAGCTTGCCCTTGCTTAAGAACTCAAAACATAGCTGTGC
TCTATTTTTTTGTTGTTGTTGTGACTGACAGAGTGAGATTCCGTCTCCCAGGCTGGAGTGCAGTGGCGCCTTCTCAGCTC
ACTGCAACCTGCAGCCTCCTAGATTCAAGCGATTCTCCTGCTTCAGCCTTCCGAGTAGCTGGGATGACAGGCACTCACCA
ATATGCCTGGGTAATTTTTGTATTTTTAAGTACATACAGGATTTCACCATGTTGGCCAGGCTAGTTTCAAACTCCCGGCC
TCAGGTGGTCTGCCTGCCTCAGCCTCCCAAAGTGTTGGGATTACAGGCGTGAGCCACTGGGCCCTGCCTGTATTTTTTAT
CAGCCACAAATCCAGCAACAAGCTGAGGATTCAGCTCATAAAACAGGCTTGGTGTCTTGGTGATCTCACATAACCAAGAT
GCTACCCCGTGGGGAACCACATCCCCCTGGATGCCCTCCAGCCTTGGTTTGGGCTGGAGTCAGGGCCTGTATACAGTATT
TTGAATTTGTATGCCACTGGTTTGCATTGCTGGTCAGGAACTCTAGTGCTTTGCATAGCCCTGGTTTAGAAACATGTTAT
AGCAGTTCTTGGTATAGAGCAAACTAGAAGAACCAGCAATCATTCCACTGTCCTGCCAAGGTACACCTCAGTACTCCCCT
TCCCAACTGAAGTGGTATGAGGCTAGCTCTTTCCAAAAGCATTCAAGTTTGGCTTCTGATGTGACTCAGAATTTAGGAAC
CAGATGCTAGATCAAATAAGCTCTGAAAATCTGAGGAACATTGTAGGAAAGGTTTGTTAAGCATCTCTTAAGTGCCATGA
TGAGCATAACAGCCGGCCGTCGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGAGGATGACAAGGTC
AGGAGTTCAAGACCAGCCTGGCCAACATGCTGAAACCTCACCTCTACTAAAAATACAAAAATTAGCTGGGCATGGTGGCA
CATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCA
AGACAGTGCCAGTGCACTCCAGCCTCGGTGACAGCGCAAGGCTCCGTCTCAATAATTAAAAAAAAAAAAAAAAAAAAAAA
GGCCGGGCGCAGTGGCTCAAGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACCTGAGGTCAGGAGTTT
TGAGATCAGCCTTGGCAACACGGTGAAACCCCATCTCTACTAAAAATACAAAATTAGCCAAGCATGCTGGCACATGCCTG
TAATCCCAGCTACTCGGGAGGCTGAGGTACGAGAATCGCTTGAACCTGGGAGGCAGAGGATGCAGTGAGCCGAGATCACG
CCATTGCACTCCAGCCTGGGGGACAAGAGTGAATCTGTGTCTCACCAAAAAAAAAAAGAAAAAGAAAGATGCTTAACAAA
GGTTACCATAAGCCACAAATTCATAACCACTTATCCTTCCAGTTTCAAGTAGAATATATTCATAACCTCAATAAAGTTCT
CCCTGCTCCCAAAC
|
Primary source | RefSeq : NM_001908.3 (GI:66346646)
Name | Cathepsin B (CTSB), transcript variant 1
Type of transcript | mRNA
Organism | Homo sapiens
Length | 3783 nt
Map | 8p22
Location | Chromosome 8 (NC_000008.10) strand : -
11700032...11702730 | 11703169...11703297 | 11704560...11704676 | 11705187...11705330
11705575...11705660 | 11706554...11706672 | 11708374...11708488 | 11710118...11710203
11710837...11710987 | 11725509...11725645
|
Exon(s) | Start | End | Length | is coding
Exon 1 | 1 | 137 | 137 | 1
Exon 4 | 138 | 288 | 151 | 1
Exon 5 | 289 | 374 | 86 | 1
Exon 6 | 375 | 489 | 115 | 1
Exon 7 | 490 | 608 | 119 | 1
Exon 8 | 609 | 694 | 86 | 1
Exon 9 | 695 | 838 | 144 | 1
Exon 10 | 839 | 955 | 117 | 1
Exon 11 | 956 | 1084 | 129 | 1
Exon 12 | 1085 | 3783 | 2699 | 1
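The exon Start/End/Length values for transcript variant 1 appear to follow directly from the genomic intervals listed under Location: on the minus strand, the interval with the highest coordinates is the 5'-most exon. The following is a minimal sketch of that conversion, assuming the intervals are 1-based and inclusive; the function name `transcript_coords` is hypothetical and not part of the source database.

```python
# Genomic exon intervals for NM_001908.3 as listed in this record
# (assumed 1-based, inclusive; minus strand of chromosome 8).
EXONS_GENOMIC = [
    (11700032, 11702730), (11703169, 11703297), (11704560, 11704676),
    (11705187, 11705330), (11705575, 11705660), (11706554, 11706672),
    (11708374, 11708488), (11710118, 11710203), (11710837, 11710987),
    (11725509, 11725645),
]


def transcript_coords(exons, strand="-"):
    """Return (start, end, length) per exon in transcript coordinates, 5' to 3'."""
    # On the minus strand the highest genomic interval is transcribed first.
    ordered = sorted(exons, reverse=(strand == "-"))
    rows = []
    pos = 1
    for g_start, g_end in ordered:
        length = g_end - g_start + 1
        rows.append((pos, pos + length - 1, length))
        pos += length
    return rows


rows = transcript_coords(EXONS_GENOMIC)
for start, end, length in rows:
    print(start, end, length)
```

The first row comes out as 1, 137, 137 and the last as 1085, 3783, 2699, which agrees with the exon table and the 3783 nt transcript length given for this variant.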
|
ID | Class | Location | Mutation | Length | is synonymous | Source
rs1137063 | SNP | 152 | c/t | 1 | ? | dbSNP
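One possible reading of the "?" under "is synonymous" is that the SNP lies outside the coding region, where the synonymous/non-synonymous distinction does not apply. The sketch below assumes the SNP Location (152) is given in transcript coordinates and uses the CDS range 163...1182 listed for NM_001908.3 in this record; `classify_position` is a hypothetical helper, not a function of any database API.

```python
def classify_position(pos, cds_start, cds_end, transcript_len):
    """Classify a 1-based transcript position as 5' UTR, CDS, or 3' UTR."""
    if not 1 <= pos <= transcript_len:
        raise ValueError("position outside transcript")
    if pos < cds_start:
        return "5' UTR"
    if pos > cds_end:
        return "3' UTR"
    return "CDS"


# rs1137063 at position 152; CDS 163...1182; transcript length 3783 nt.
region = classify_position(152, cds_start=163, cds_end=1182, transcript_len=3783)
print(region)
```

Under these assumptions position 152 falls upstream of the start codon.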
|
Primary source | NCBI : CCDS5986
Nucleotide | CTSB, mRNA isoform 1[NM_001908.3] : 163...1182
Length | 1020
Location | Chromosome 8 (NC_000008.10) strand : -
11702633...11702730 | 11703169...11703297 | 11704560...11704676 | 11705187...11705330
11705575...11705660 | 11706554...11706672 | 11708374...11708488 | 11710118...11710203
11710837...11710962
Start codon | 1
Translation | NP_001899.1
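The CDS figures above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and simply confirms that the mRNA range 163...1182 and the summed genomic CDS segments both give 1020 nt, a whole number of codons; `cds_summary` is a hypothetical name used for illustration only.

```python
# Genomic CDS segments for CCDS5986 as listed in this record
# (assumed 1-based, inclusive).
CDS_GENOMIC = [
    (11702633, 11702730), (11703169, 11703297), (11704560, 11704676),
    (11705187, 11705330), (11705575, 11705660), (11706554, 11706672),
    (11708374, 11708488), (11710118, 11710203), (11710837, 11710962),
]


def cds_summary(cds_start, cds_end, segments):
    """Compare CDS length from mRNA coordinates with the summed genomic segments."""
    mrna_len = cds_end - cds_start + 1
    genomic_len = sum(end - start + 1 for start, end in segments)
    return {
        "mrna_len": mrna_len,
        "genomic_len": genomic_len,
        "in_frame": mrna_len % 3 == 0,
        "codons": mrna_len // 3,  # includes the stop codon
    }


summary = cds_summary(163, 1182, CDS_GENOMIC)
print(summary)
```

The 340 codons include the stop codon, which is consistent with the 339 aa length listed for Cathepsin B (P07858) in this record.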
|
>gi|66346646|ref|NM_001908.3|Cathepsin B (CTSB), transcript variant 1
GGGGCGGGGCCGGGAGGGTACTTAGGGCCGGGGCTGGCCCAGGCTACGGCGGCTGCAGGGCTCCGGCAACCGCTCCGGCA
ACGCCAACCGCTCCGCTGCGCGCAGGCTGGGCTGCAGGCTCTCGGCTGCAGCGCTGGGTGGATCTAGGATCCGGCTTCCA
ACATGTGGCAGCTCTGGGCCTCCCTCTGCTGCCTGCTGGTGTTGGCCAATGCCCGGAGCAGGCCCTCTTTCCATCCCCTG
TCGGATGAGCTGGTCAACTATGTCAACAAACGGAATACCACGTGGCAGGCCGGGCACAACTTCTACAACGTGGACATGAG
CTACTTGAAGAGGCTATGTGGTACCTTCCTGGGTGGGCCCAAGCCACCCCAGAGAGTTATGTTTACCGAGGACCTGAAGC
TGCCTGCAAGCTTCGATGCACGGGAACAATGGCCACAGTGTCCCACCATCAAAGAGATCAGAGACCAGGGCTCCTGTGGC
TCCTGCTGGGCCTTCGGGGCTGTGGAAGCCATCTCTGACCGGATCTGCATCCACACCAATGCGCACGTCAGCGTGGAGGT
GTCGGCGGAGGACCTGCTCACATGCTGTGGCAGCATGTGTGGGGACGGCTGTAATGGTGGCTATCCTGCTGAAGCTTGGA
ACTTCTGGACAAGAAAAGGCCTGGTTTCTGGTGGCCTCTATGAATCCCATGTAGGGTGCAGACCGTACTCCATCCCTCCC
TGTGAGCACCACGTCAACGGCTCCCGGCCCCCATGCACGGGGGAGGGAGATACCCCCAAGTGTAGCAAGATCTGTGAGCC
TGGCTACAGCCCGACCTACAAACAGGACAAGCACTACGGATACAATTCCTACAGCGTCTCCAATAGCGAGAAGGACATCA
TGGCCGAGATCTACAAAAACGGCCCCGTGGAGGGAGCTTTCTCTGTGTATTCGGACTTCCTGCTCTACAAGTCAGGAGTG
TACCAACACGTCACCGGAGAGATGATGGGTGGCCATGCCATCCGCATCCTGGGCTGGGGAGTGGAGAATGGCACACCCTA
CTGGCTGGTTGCCAACTCCTGGAACACTGACTGGGGTGACAATGGCTTCTTTAAAATACTCAGAGGACAGGATCACTGTG
GAATCGAATCAGAAGTGGTGGCTGGAATTCCACGCACCGATCAGTACTGGGAAAAGATCTAATCTGCCGTGGGCCTGTCG
TGCCAGTCCTGGGGGCGAGATCGGGGTAGAAATGCATTTTATTCTTTAAGTTCACGTAAGATACAAGTTTCAGACAGGGT
CTGAAGGACTGGATTGGCCAAACATCAGACCTGTCTTCCAAGGAGACCAAGTCCTGGCTACATCCCAGCCTGTGGTTACA
GTGCAGACAGGCCATGTGAGCCACCGCTGCCAGCACAGAGCGTCCTTCCCCCTGTAGACTAGTGCCGTAGGGAGTACCTG
CTGCCCCAGCTGACTGTGGCCCCCTCCGTGATCCATCCATCTCCAGGGAGCAAGACAGAGACGCAGGAATGGAAAGCGGA
GTTCCTAACAGGATGAAAGTTCCCCCATCAGTTCCCCCAGTACCTCCAAGCAAGTAGCTTTCCACATTTGTCACAGAAAT
CAGAGGAGAGACGGTGTTGGGAGCCCTTTGGAGAACGCCAGTCTCCCAGGCCCCCTGCATCTATCGAGTTTGCAATGTCA
CAACCTCTCTGATCTTGTGCTCAGCATGATTCTTTAATAGAAGTTTTATTTTTTCGTGCACTCTGCTAATCATGTGGGTG
AGCCAGTGGAACAGCGGGAGACCTGTGCTAGTTTTACAGATTGCCTCCTTATGACGCGGCTCAAAAGGAAACCAAGTGGT
CAGGAGTTGTTTCTGACCCACTGATCTCTACTACCACAAGGAAAATAGTTTAGGAGAAACCAGCTTTTACTGTTTTTGAA
AAATTACAGCTTCACCCTGTCAAGTTAACAAGGAATGCCTGTGCCAATAAAAGTTTTCTCCAACTTGAAGTCTACTCTGA
TGGGATCTCAGATCCTTTGTCACTGCCTATAGACTTGTAGCTGCTGTCTCTCTTTGTCCCTGCAGAGAATCACGTCCTGG
AACTGCATGTTCTTGCGACTCTTGGGACTTCATCTTAACTTCTCGCTGCCCCAGCCATGTTTTCAACCATGGCATCCCTC
CCCCAATTAGTTCCCTGTCATCCTCGTCAACCTTCTCTGTAAGTGCCTGGTAAGCTTGCCCTTGCTTAAGAACTCAAAAC
ATAGCTGTGCTCTATTTTTTTGTTGTTGTTGTGACTGACAGAGTGAGATTCCGTCTCCCAGGCTGGAGTGCAGTGGCGCC
TTCTCAGCTCACTGCAACCTGCAGCCTCCTAGATTCAAGCGATTCTCCTGCTTCAGCCTTCCGAGTAGCTGGGATGACAG
GCACTCACCAATATGCCTGGGTAATTTTTGTATTTTTAAGTACATACAGGATTTCACCATGTTGGCCAGGCTAGTTTCAA
ACTCCCGGCCTCAGGTGGTCTGCCTGCCTCAGCCTCCCAAAGTGTTGGGATTACAGGCGTGAGCCACTGGGCCCTGCCTG
TATTTTTTATCAGCCACAAATCCAGCAACAAGCTGAGGATTCAGCTCATAAAACAGGCTTGGTGTCTTGGTGATCTCACA
TAACCAAGATGCTACCCCGTGGGGAACCACATCCCCCTGGATGCCCTCCAGCCTTGGTTTGGGCTGGAGTCAGGGCCTGT
ATACAGTATTTTGAATTTGTATGCCACTGGTTTGCATTGCTGGTCAGGAACTCTAGTGCTTTGCATAGCCCTGGTTTAGA
AACATGTTATAGCAGTTCTTGGTATAGAGCAAACTAGAAGAACCAGCAATCATTCCACTGTCCTGCCAAGGTACACCTCA
GTACTCCCCTTCCCAACTGAAGTGGTATGAGGCTAGCTCTTTCCAAAAGCATTCAAGTTTGGCTTCTGATGTGACTCAGA
ATTTAGGAACCAGATGCTAGATCAAATAAGCTCTGAAAATCTGAGGAACATTGTAGGAAAGGTTTGTTAAGCATCTCTTA
AGTGCCATGATGAGCATAACAGCCGGCCGTCGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGAGGA
TGACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGCTGAAACCTCACCTCTACTAAAAATACAAAAATTAGCTGGG
CATGGTGGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCGGAGGTTG
CAGTGAGCCAAGACAGTGCCAGTGCACTCCAGCCTCGGTGACAGCGCAAGGCTCCGTCTCAATAATTAAAAAAAAAAAAA
AAAAAAAAAAGGCCGGGCGCAGTGGCTCAAGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACCTGAGG
TCAGGAGTTTTGAGATCAGCCTTGGCAACACGGTGAAACCCCATCTCTACTAAAAATACAAAATTAGCCAAGCATGCTGG
CACATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTACGAGAATCGCTTGAACCTGGGAGGCAGAGGATGCAGTGAGC
CGAGATCACGCCATTGCACTCCAGCCTGGGGGACAAGAGTGAATCTGTGTCTCACCAAAAAAAAAAAGAAAAAGAAAGAT
GCTTAACAAAGGTTACCATAAGCCACAAATTCATAACCACTTATCCTTCCAGTTTCAAGTAGAATATATTCATAACCTCA
ATAAAGTTCTCCCTGCTCCCAAA
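The FASTA records in this page use a pipe-delimited header of the form `>gi|<GI number>|ref|<accession.version>|<description>`. A minimal sketch of splitting such a header into fields follows; `parse_ncbi_header` and the dictionary keys are hypothetical names chosen for this example, and the layout is inferred only from the headers shown in this record.

```python
def parse_ncbi_header(line):
    """Split a '>gi|...|ref|...|description' FASTA header into named fields."""
    if not line.startswith(">"):
        raise ValueError("not a FASTA header line")
    # Limit the split so any '|' inside the description is left untouched.
    parts = line[1:].rstrip("\n").split("|", 4)
    if len(parts) != 5 or parts[0] != "gi":
        raise ValueError("unexpected header layout")
    _, gi, database, accession_version, description = parts
    accession, _, version = accession_version.partition(".")
    return {
        "gi": gi,
        "database": database,
        "accession": accession,
        "version": version,
        "description": description.strip(),
    }


header = ">gi|66346646|ref|NM_001908.3|Cathepsin B (CTSB), transcript variant 1"
fields = parse_ncbi_header(header)
print(fields)
```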
|
Primary source | RefSeq : NM_147780.2 (GI:66346647)
Name | Cathepsin B (CTSB), transcript variant 2
Type of transcript | mRNA
Organism | Homo sapiens
Length | 3945 nt
Map | 8p22
Location | Chromosome 8 (NC_000008.10) strand : -
11700032...11702730 | 11703169...11703297 | 11704560...11704676 | 11705187...11705330
11705575...11705660 | 11706554...11706672 | 11708374...11708488 | 11710118...11710203
11710837...11710987 | 11718914...11718987 | 11721884...11721971 | 11725509...11725645
|
Exon(s) | Start | End | Length | is coding
Exon 1 | 1 | 137 | 137 | 1
Exon 2 | 138 | 225 | 88 | 1
Exon 3a | 226 | 299 | 74 | 1
Exon 4 | 300 | 450 | 151 | 1
Exon 5 | 451 | 536 | 86 | 1
Exon 6 | 537 | 651 | 115 | 1
Exon 7 | 652 | 770 | 119 | 1
Exon 8 | 771 | 856 | 86 | 1
Exon 9 | 857 | 1000 | 144 | 1
Exon 10 | 1001 | 1117 | 117 | 1
Exon 11 | 1118 | 1246 | 129 | 1
Exon 12 | 1247 | 3945 | 2699 | 1
|
Primary source | NCBI : CCDS5986
Nucleotide | CTSB, mRNA isoform 2[NM_147780.2] : 325...1344
Length | 1020
Location | Chromosome 8 (NC_000008.10) strand : -
11702633...11702730 | 11703169...11703297 | 11704560...11704676 | 11705187...11705330
11705575...11705660 | 11706554...11706672 | 11708374...11708488 | 11710118...11710203
11710837...11710962
Start codon | 1
Translation | NP_680090.1
|
>gi|66346647|ref|NM_147780.2|Cathepsin B (CTSB), transcript variant 2
GGGGCGGGGCCGGGAGGGTACTTAGGGCCGGGGCTGGCCCAGGCTACGGCGGCTGCAGGGCTCCGGCAACCGCTCCGGCA
ACGCCAACCGCTCCGCTGCGCGCAGGCTGGGCTGCAGGCTCTCGGCTGCAGCGCTGGGCTGGTGTGCAGTGGTGCGACCA
CGGCTCACGGCAGCCTCAGCCACCCAGATGTAAGCGATCTGGTTCCCACCTCAGCCTCCCGAGTAGTGTCTTCAGGCCTA
TGGAGAGCAGCTTGCGTGGGCTGGGCCTGCAGTACCTGGTTTGCATAGATGATTGGCAGGTGGATCTAGGATCCGGCTTC
CAACATGTGGCAGCTCTGGGCCTCCCTCTGCTGCCTGCTGGTGTTGGCCAATGCCCGGAGCAGGCCCTCTTTCCATCCCC
TGTCGGATGAGCTGGTCAACTATGTCAACAAACGGAATACCACGTGGCAGGCCGGGCACAACTTCTACAACGTGGACATG
AGCTACTTGAAGAGGCTATGTGGTACCTTCCTGGGTGGGCCCAAGCCACCCCAGAGAGTTATGTTTACCGAGGACCTGAA
GCTGCCTGCAAGCTTCGATGCACGGGAACAATGGCCACAGTGTCCCACCATCAAAGAGATCAGAGACCAGGGCTCCTGTG
GCTCCTGCTGGGCCTTCGGGGCTGTGGAAGCCATCTCTGACCGGATCTGCATCCACACCAATGCGCACGTCAGCGTGGAG
GTGTCGGCGGAGGACCTGCTCACATGCTGTGGCAGCATGTGTGGGGACGGCTGTAATGGTGGCTATCCTGCTGAAGCTTG
GAACTTCTGGACAAGAAAAGGCCTGGTTTCTGGTGGCCTCTATGAATCCCATGTAGGGTGCAGACCGTACTCCATCCCTC
CCTGTGAGCACCACGTCAACGGCTCCCGGCCCCCATGCACGGGGGAGGGAGATACCCCCAAGTGTAGCAAGATCTGTGAG
CCTGGCTACAGCCCGACCTACAAACAGGACAAGCACTACGGATACAATTCCTACAGCGTCTCCAATAGCGAGAAGGACAT
CATGGCCGAGATCTACAAAAACGGCCCCGTGGAGGGAGCTTTCTCTGTGTATTCGGACTTCCTGCTCTACAAGTCAGGAG
TGTACCAACACGTCACCGGAGAGATGATGGGTGGCCATGCCATCCGCATCCTGGGCTGGGGAGTGGAGAATGGCACACCC
TACTGGCTGGTTGCCAACTCCTGGAACACTGACTGGGGTGACAATGGCTTCTTTAAAATACTCAGAGGACAGGATCACTG
TGGAATCGAATCAGAAGTGGTGGCTGGAATTCCACGCACCGATCAGTACTGGGAAAAGATCTAATCTGCCGTGGGCCTGT
CGTGCCAGTCCTGGGGGCGAGATCGGGGTAGAAATGCATTTTATTCTTTAAGTTCACGTAAGATACAAGTTTCAGACAGG
GTCTGAAGGACTGGATTGGCCAAACATCAGACCTGTCTTCCAAGGAGACCAAGTCCTGGCTACATCCCAGCCTGTGGTTA
CAGTGCAGACAGGCCATGTGAGCCACCGCTGCCAGCACAGAGCGTCCTTCCCCCTGTAGACTAGTGCCGTAGGGAGTACC
TGCTGCCCCAGCTGACTGTGGCCCCCTCCGTGATCCATCCATCTCCAGGGAGCAAGACAGAGACGCAGGAATGGAAAGCG
GAGTTCCTAACAGGATGAAAGTTCCCCCATCAGTTCCCCCAGTACCTCCAAGCAAGTAGCTTTCCACATTTGTCACAGAA
ATCAGAGGAGAGACGGTGTTGGGAGCCCTTTGGAGAACGCCAGTCTCCCAGGCCCCCTGCATCTATCGAGTTTGCAATGT
CACAACCTCTCTGATCTTGTGCTCAGCATGATTCTTTAATAGAAGTTTTATTTTTTCGTGCACTCTGCTAATCATGTGGG
TGAGCCAGTGGAACAGCGGGAGACCTGTGCTAGTTTTACAGATTGCCTCCTTATGACGCGGCTCAAAAGGAAACCAAGTG
GTCAGGAGTTGTTTCTGACCCACTGATCTCTACTACCACAAGGAAAATAGTTTAGGAGAAACCAGCTTTTACTGTTTTTG
AAAAATTACAGCTTCACCCTGTCAAGTTAACAAGGAATGCCTGTGCCAATAAAAGTTTTCTCCAACTTGAAGTCTACTCT
GATGGGATCTCAGATCCTTTGTCACTGCCTATAGACTTGTAGCTGCTGTCTCTCTTTGTCCCTGCAGAGAATCACGTCCT
GGAACTGCATGTTCTTGCGACTCTTGGGACTTCATCTTAACTTCTCGCTGCCCCAGCCATGTTTTCAACCATGGCATCCC
TCCCCCAATTAGTTCCCTGTCATCCTCGTCAACCTTCTCTGTAAGTGCCTGGTAAGCTTGCCCTTGCTTAAGAACTCAAA
ACATAGCTGTGCTCTATTTTTTTGTTGTTGTTGTGACTGACAGAGTGAGATTCCGTCTCCCAGGCTGGAGTGCAGTGGCG
CCTTCTCAGCTCACTGCAACCTGCAGCCTCCTAGATTCAAGCGATTCTCCTGCTTCAGCCTTCCGAGTAGCTGGGATGAC
AGGCACTCACCAATATGCCTGGGTAATTTTTGTATTTTTAAGTACATACAGGATTTCACCATGTTGGCCAGGCTAGTTTC
AAACTCCCGGCCTCAGGTGGTCTGCCTGCCTCAGCCTCCCAAAGTGTTGGGATTACAGGCGTGAGCCACTGGGCCCTGCC
TGTATTTTTTATCAGCCACAAATCCAGCAACAAGCTGAGGATTCAGCTCATAAAACAGGCTTGGTGTCTTGGTGATCTCA
CATAACCAAGATGCTACCCCGTGGGGAACCACATCCCCCTGGATGCCCTCCAGCCTTGGTTTGGGCTGGAGTCAGGGCCT
GTATACAGTATTTTGAATTTGTATGCCACTGGTTTGCATTGCTGGTCAGGAACTCTAGTGCTTTGCATAGCCCTGGTTTA
GAAACATGTTATAGCAGTTCTTGGTATAGAGCAAACTAGAAGAACCAGCAATCATTCCACTGTCCTGCCAAGGTACACCT
CAGTACTCCCCTTCCCAACTGAAGTGGTATGAGGCTAGCTCTTTCCAAAAGCATTCAAGTTTGGCTTCTGATGTGACTCA
GAATTTAGGAACCAGATGCTAGATCAAATAAGCTCTGAAAATCTGAGGAACATTGTAGGAAAGGTTTGTTAAGCATCTCT
TAAGTGCCATGATGAGCATAACAGCCGGCCGTCGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGAG
GATGACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGCTGAAACCTCACCTCTACTAAAAATACAAAAATTAGCTG
GGCATGGTGGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCGGAGGT
TGCAGTGAGCCAAGACAGTGCCAGTGCACTCCAGCCTCGGTGACAGCGCAAGGCTCCGTCTCAATAATTAAAAAAAAAAA
AAAAAAAAAAAAGGCCGGGCGCAGTGGCTCAAGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACCTGA
GGTCAGGAGTTTTGAGATCAGCCTTGGCAACACGGTGAAACCCCATCTCTACTAAAAATACAAAATTAGCCAAGCATGCT
GGCACATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTACGAGAATCGCTTGAACCTGGGAGGCAGAGGATGCAGTGA
GCCGAGATCACGCCATTGCACTCCAGCCTGGGGGACAAGAGTGAATCTGTGTCTCACCAAAAAAAAAAAGAAAAAGAAAG
ATGCTTAACAAAGGTTACCATAAGCCACAAATTCATAACCACTTATCCTTCCAGTTTCAAGTAGAATATATTCATAACCT
CAATAAAGTTCTCCCTGCTCCCAAA
|
Primary source | RefSeq : NM_147781.2 (GI:66346648)
Name | Cathepsin B (CTSB), transcript variant 3
Type of transcript | mRNA
Organism | Homo sapiens
Length | 3902 nt
Map | 8p22
Location | Chromosome 8 (NC_000008.10) strand : -
11700032...11702730 | 11703169...11703297 | 11704560...11704676 | 11705187...11705330
11705575...11705660 | 11706554...11706672 | 11708374...11708488 | 11710118...11710203
11710837...11710987 | 11718869...11718987 | 11725509...11725645
|
Exon(s) | Start | End | Length | is coding
Exon 1 | 1 | 137 | 137 | 1
Exon 3b | 138 | 256 | 119 | 1
Exon 4 | 257 | 407 | 151 | 1
Exon 5 | 408 | 493 | 86 | 1
Exon 6 | 494 | 608 | 115 | 1
Exon 7 | 609 | 727 | 119 | 1
Exon 8 | 728 | 813 | 86 | 1
Exon 9 | 814 | 957 | 144 | 1
Exon 10 | 958 | 1074 | 117 | 1
Exon 11 | 1075 | 1203 | 129 | 1
Exon 12 | 1204 | 3902 | 2699 | 1
|
ID | Class | Location | Mutation | Length | is synonymous | Source
|
Primary source | NCBI : CCDS5986
Nucleotide | CTSB, mRNA isoform 3[NM_147781.2] : 282...1301
Length | 1020
Location | Chromosome 8 (NC_000008.10) strand : -
11702633...11702730 | 11703169...11703297 | 11704560...11704676 | 11705187...11705330
11705575...11705660 | 11706554...11706672 | 11708374...11708488 | 11710118...11710203
11710837...11710962
Start codon | 1
Translation | NP_680091.1
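Transcript variants 1 to 3 share the same CDS (CCDS5986, 1020 nt) but report different CDS start positions and transcript lengths. From the exon tables, the differences appear to be explained entirely by the additional exons (Exon 2, Exon 3a, Exon 3b) spliced in upstream of the start codon. A minimal sketch of that arithmetic follows, using only numbers listed in this record; the dictionary layout and the name `offset_check` are hypothetical.

```python
# Values copied from the variant 1 record (used here as the reference).
REFERENCE = {"accession": "NM_001908.3", "length": 3783, "cds_start": 163}

# Extra exons relative to variant 1, with lengths from the exon tables.
VARIANTS = {
    "NM_147780.2": {
        "extra_exons": {"Exon 2": 88, "Exon 3a": 74},
        "length": 3945,
        "cds_start": 325,
    },
    "NM_147781.2": {
        "extra_exons": {"Exon 3b": 119},
        "length": 3902,
        "cds_start": 282,
    },
}


def offset_check(reference, variant):
    """Return (length_matches, cds_start_matches) after adding the extra exons."""
    shift = sum(variant["extra_exons"].values())
    length_ok = reference["length"] + shift == variant["length"]
    cds_ok = reference["cds_start"] + shift == variant["cds_start"]
    return length_ok, cds_ok


results = {acc: offset_check(REFERENCE, v) for acc, v in VARIANTS.items()}
for accession, (length_ok, cds_ok) in results.items():
    print(accession, length_ok, cds_ok)
```

Both checks hold for each variant: the extra exons add 162 nt (variant 2) and 119 nt (variant 3) to the transcript and shift the CDS start by the same amount.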
|
>gi|66346648|ref|NM_147781.2|Cathepsin B (CTSB), transcript variant 3
GGGGCGGGGCCGGGAGGGTACTTAGGGCCGGGGCTGGCCCAGGCTACGGCGGCTGCAGGGCTCCGGCAACCGCTCCGGCA
ACGCCAACCGCTCCGCTGCGCGCAGGCTGGGCTGCAGGCTCTCGGCTGCAGCGCTGGGTGTCTTCAGGCCTATGGAGAGC
AGCTTGCGTGGGCTGGGCCTGCAGTACCTGGTTTGCATAGATGATTGGCAGGTGGGCAGCACGGGGAAGGACCTGTGAGT
GGCCAACCTGGTTCAGGTGGATCTAGGATCCGGCTTCCAACATGTGGCAGCTCTGGGCCTCCCTCTGCTGCCTGCTGGTG
TTGGCCAATGCCCGGAGCAGGCCCTCTTTCCATCCCCTGTCGGATGAGCTGGTCAACTATGTCAACAAACGGAATACCAC
GTGGCAGGCCGGGCACAACTTCTACAACGTGGACATGAGCTACTTGAAGAGGCTATGTGGTACCTTCCTGGGTGGGCCCA
AGCCACCCCAGAGAGTTATGTTTACCGAGGACCTGAAGCTGCCTGCAAGCTTCGATGCACGGGAACAATGGCCACAGTGT
CCCACCATCAAAGAGATCAGAGACCAGGGCTCCTGTGGCTCCTGCTGGGCCTTCGGGGCTGTGGAAGCCATCTCTGACCG
GATCTGCATCCACACCAATGCGCACGTCAGCGTGGAGGTGTCGGCGGAGGACCTGCTCACATGCTGTGGCAGCATGTGTG
GGGACGGCTGTAATGGTGGCTATCCTGCTGAAGCTTGGAACTTCTGGACAAGAAAAGGCCTGGTTTCTGGTGGCCTCTAT
GAATCCCATGTAGGGTGCAGACCGTACTCCATCCCTCCCTGTGAGCACCACGTCAACGGCTCCCGGCCCCCATGCACGGG
GGAGGGAGATACCCCCAAGTGTAGCAAGATCTGTGAGCCTGGCTACAGCCCGACCTACAAACAGGACAAGCACTACGGAT
ACAATTCCTACAGCGTCTCCAATAGCGAGAAGGACATCATGGCCGAGATCTACAAAAACGGCCCCGTGGAGGGAGCTTTC
TCTGTGTATTCGGACTTCCTGCTCTACAAGTCAGGAGTGTACCAACACGTCACCGGAGAGATGATGGGTGGCCATGCCAT
CCGCATCCTGGGCTGGGGAGTGGAGAATGGCACACCCTACTGGCTGGTTGCCAACTCCTGGAACACTGACTGGGGTGACA
ATGGCTTCTTTAAAATACTCAGAGGACAGGATCACTGTGGAATCGAATCAGAAGTGGTGGCTGGAATTCCACGCACCGAT
CAGTACTGGGAAAAGATCTAATCTGCCGTGGGCCTGTCGTGCCAGTCCTGGGGGCGAGATCGGGGTAGAAATGCATTTTA
TTCTTTAAGTTCACGTAAGATACAAGTTTCAGACAGGGTCTGAAGGACTGGATTGGCCAAACATCAGACCTGTCTTCCAA
GGAGACCAAGTCCTGGCTACATCCCAGCCTGTGGTTACAGTGCAGACAGGCCATGTGAGCCACCGCTGCCAGCACAGAGC
GTCCTTCCCCCTGTAGACTAGTGCCGTAGGGAGTACCTGCTGCCCCAGCTGACTGTGGCCCCCTCCGTGATCCATCCATC
TCCAGGGAGCAAGACAGAGACGCAGGAATGGAAAGCGGAGTTCCTAACAGGATGAAAGTTCCCCCATCAGTTCCCCCAGT
ACCTCCAAGCAAGTAGCTTTCCACATTTGTCACAGAAATCAGAGGAGAGACGGTGTTGGGAGCCCTTTGGAGAACGCCAG
TCTCCCAGGCCCCCTGCATCTATCGAGTTTGCAATGTCACAACCTCTCTGATCTTGTGCTCAGCATGATTCTTTAATAGA
AGTTTTATTTTTTCGTGCACTCTGCTAATCATGTGGGTGAGCCAGTGGAACAGCGGGAGACCTGTGCTAGTTTTACAGAT
TGCCTCCTTATGACGCGGCTCAAAAGGAAACCAAGTGGTCAGGAGTTGTTTCTGACCCACTGATCTCTACTACCACAAGG
AAAATAGTTTAGGAGAAACCAGCTTTTACTGTTTTTGAAAAATTACAGCTTCACCCTGTCAAGTTAACAAGGAATGCCTG
TGCCAATAAAAGTTTTCTCCAACTTGAAGTCTACTCTGATGGGATCTCAGATCCTTTGTCACTGCCTATAGACTTGTAGC
TGCTGTCTCTCTTTGTCCCTGCAGAGAATCACGTCCTGGAACTGCATGTTCTTGCGACTCTTGGGACTTCATCTTAACTT
CTCGCTGCCCCAGCCATGTTTTCAACCATGGCATCCCTCCCCCAATTAGTTCCCTGTCATCCTCGTCAACCTTCTCTGTA
AGTGCCTGGTAAGCTTGCCCTTGCTTAAGAACTCAAAACATAGCTGTGCTCTATTTTTTTGTTGTTGTTGTGACTGACAG
AGTGAGATTCCGTCTCCCAGGCTGGAGTGCAGTGGCGCCTTCTCAGCTCACTGCAACCTGCAGCCTCCTAGATTCAAGCG
ATTCTCCTGCTTCAGCCTTCCGAGTAGCTGGGATGACAGGCACTCACCAATATGCCTGGGTAATTTTTGTATTTTTAAGT
ACATACAGGATTTCACCATGTTGGCCAGGCTAGTTTCAAACTCCCGGCCTCAGGTGGTCTGCCTGCCTCAGCCTCCCAAA
GTGTTGGGATTACAGGCGTGAGCCACTGGGCCCTGCCTGTATTTTTTATCAGCCACAAATCCAGCAACAAGCTGAGGATT
CAGCTCATAAAACAGGCTTGGTGTCTTGGTGATCTCACATAACCAAGATGCTACCCCGTGGGGAACCACATCCCCCTGGA
TGCCCTCCAGCCTTGGTTTGGGCTGGAGTCAGGGCCTGTATACAGTATTTTGAATTTGTATGCCACTGGTTTGCATTGCT
GGTCAGGAACTCTAGTGCTTTGCATAGCCCTGGTTTAGAAACATGTTATAGCAGTTCTTGGTATAGAGCAAACTAGAAGA
ACCAGCAATCATTCCACTGTCCTGCCAAGGTACACCTCAGTACTCCCCTTCCCAACTGAAGTGGTATGAGGCTAGCTCTT
TCCAAAAGCATTCAAGTTTGGCTTCTGATGTGACTCAGAATTTAGGAACCAGATGCTAGATCAAATAAGCTCTGAAAATC
TGAGGAACATTGTAGGAAAGGTTTGTTAAGCATCTCTTAAGTGCCATGATGAGCATAACAGCCGGCCGTCGTGGCTCACG
CCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGAGGATGACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGCT
GAAACCTCACCTCTACTAAAAATACAAAAATTAGCTGGGCATGGTGGCACATGCCTGTAATCCCAGCTACTTGGGAGGCT
GAGGCAGGAGAATCGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGACAGTGCCAGTGCACTCCAGCCTCGGTGA
CAGCGCAAGGCTCCGTCTCAATAATTAAAAAAAAAAAAAAAAAAAAAAAGGCCGGGCGCAGTGGCTCAAGCCTGTAATCC
CAGCACTTTGGGAGGCTGAGGCGGGCAGATCACCTGAGGTCAGGAGTTTTGAGATCAGCCTTGGCAACACGGTGAAACCC
CATCTCTACTAAAAATACAAAATTAGCCAAGCATGCTGGCACATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTACG
AGAATCGCTTGAACCTGGGAGGCAGAGGATGCAGTGAGCCGAGATCACGCCATTGCACTCCAGCCTGGGGGACAAGAGTG
AATCTGTGTCTCACCAAAAAAAAAAAGAAAAAGAAAGATGCTTAACAAAGGTTACCATAAGCCACAAATTCATAACCACT
TATCCTTCCAGTTTCAAGTAGAATATATTCATAACCTCAATAAAGTTCTCCCTGCTCCCAAA
|
Primary source | RefSeq : NM_147782.2 (GI:66346649) |
Name | Cathepsin B (CTSB), transcript variant 4 |
Type of transcript | mRNA |
Organism | Homo sapiens |
Length | 3871 nt |
Map | 8p22 |
Location | Chromosome 8 (NC_000008.10) strand : - |
11700032...11702730 | 11703169...11703297 | 11704560...11704676 | 11705187...11705330 |
11705575...11705660 | 11706554...11706672 | 11708374...11708488 | 11710118...11710203 |
11710837...11710987 | 11721884...11721971 | 11725509...11725645 |
|
Exon(s) | Start | End | Length | is coding |
Exon 1 | 1 | 137 | 137 | 1 |
Exon 2 | 138 | 225 | 88 | 1 |
Exon 4 | 226 | 376 | 151 | 1 |
Exon 5 | 377 | 462 | 86 | 1 |
Exon 6 | 463 | 577 | 115 | 1 |
Exon 7 | 578 | 696 | 119 | 1 |
Exon 8 | 697 | 782 | 86 | 1 |
Exon 9 | 783 | 926 | 144 | 1 |
Exon 10 | 927 | 1043 | 117 | 1 |
Exon 11 | 1044 | 1172 | 129 | 1 |
Exon 12 | 1173 | 3871 | 2699 | 1 |
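The transcript coordinates in the exon table can be cross-checked against the genomic intervals listed under Location. The following Python sketch is illustrative only: the function name `transcript_exons` is an assumption and not part of the source database. It takes the intervals as listed (ascending genomic order), reverses them because the transcript lies on the minus strand, and accumulates 1-based transcript coordinates.

```python
# Genomic exon intervals for NM_147782.2 as listed under "Location"
# (1-based, inclusive, ascending order on NC_000008.10).
GENOMIC_EXONS = [
    (11700032, 11702730), (11703169, 11703297), (11704560, 11704676),
    (11705187, 11705330), (11705575, 11705660), (11706554, 11706672),
    (11708374, 11708488), (11710118, 11710203), (11710837, 11710987),
    (11721884, 11721971), (11725509, 11725645),
]


def transcript_exons(intervals, strand="-"):
    """Return (start, end, length) per exon in 1-based transcript coordinates."""
    # On the minus strand the first exon of the transcript is the one with
    # the highest genomic coordinates, so the intervals are walked in reverse.
    ordered = sorted(intervals, reverse=(strand == "-"))
    exons, position = [], 1
    for g_start, g_end in ordered:
        length = g_end - g_start + 1
        exons.append((position, position + length - 1, length))
        position += length
    return exons


exons = transcript_exons(GENOMIC_EXONS)
total_length = sum(length for _, _, length in exons)
for start, end, length in exons:
    print(start, end, length)
print("transcript length:", total_length)
```

Under these assumptions the summed exon lengths should agree with the transcript length stated in the record (3871 nt), and each computed row should agree with the Start, End and Length columns of the table.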
|
ID | Class | Location | Mutation | Length | is synonymous | Source |
|
Primary source | NCBI : CCDS5986 |
Nucleotide | CTSB, mRNA isoform 4 [NM_147782.2] : 251...1270 |
Length | 1020 |
Location | Chromosome 8 (NC_000008.10) strand : - |
11702633...11702730 | 11703169...11703297 | 11704560...11704676 | 11705187...11705330 |
11705575...11705660 | 11706554...11706672 | 11708374...11708488 | 11710118...11710203 |
11710837...11710962 |
Start codon | 1 |
Translation | NP_680092.1 |
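The coding-sequence bounds given above (251...1270 on NM_147782.2, length 1020) imply a 339-residue product plus a stop codon. A minimal Python sketch of that arithmetic follows, together with a translation of the first 30 coding nucleotides. The helper `translate` and the constant names are assumptions made for illustration; the codon table is the standard genetic code, and the 30-nt fragment is copied from transcript positions 251-280 of this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    usable = len(nucleotides) - len(nucleotides) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[nucleotides[i:i + 3]] for i in range(0, usable, 3))


# CDS bounds on the transcript, as stated in the record (1-based, inclusive).
CDS_START, CDS_END = 251, 1270
cds_length = CDS_END - CDS_START + 1
protein_length = cds_length // 3 - 1  # the final codon is the stop codon

# Transcript positions 251-280 of NM_147782.2.
first_30_nt = "ATGTGGCAGCTCTGGGCCTCCCTCTGCTGC"
n_terminus = translate(first_30_nt)
print(cds_length, protein_length, n_terminus)
```

The translated fragment can be compared with the first ten residues of the protein sequence listed for P07858 later in the record.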
|
1 |
GGGGCGGGGC |
CGGGAGGGTA |
CTTAGGGCCG |
GGGCTGGCCC |
AGGCTACGGC |
GGCTGCAGGG |
CTCCGGCAAC |
CGCTCCGGCA |
|
81 |
ACGCCAACCG |
CTCCGCTGCG |
CGCAGGCTGG |
GCTGCAGGCT |
CTCGGCTGCA |
GCGCTGGGCT |
GGTGTGCAGT |
GGTGCGACCA |
|
161 |
CGGCTCACGG |
CAGCCTCAGC |
CACCCAGATG |
TAAGCGATCT |
GGTTCCCACC |
TCAGCCTCCC |
GAGTAGTGGA |
TCTAGGATCC |
|
241 |
GGCTTCCAAC |
ATGTGGCAGC |
TCTGGGCCTC |
CCTCTGCTGC |
CTGCTGGTGT |
TGGCCAATGC |
CCGGAGCAGG |
CCCTCTTTCC |
|
321 |
ATCCCCTGTC |
GGATGAGCTG |
GTCAACTATG |
TCAACAAACG |
GAATACCACG |
TGGCAGGCCG |
GGCACAACTT |
CTACAACGTG |
|
401 |
GACATGAGCT |
ACTTGAAGAG |
GCTATGTGGT |
ACCTTCCTGG |
GTGGGCCCAA |
GCCACCCCAG |
AGAGTTATGT |
TTACCGAGGA |
|
481 |
CCTGAAGCTG |
CCTGCAAGCT |
TCGATGCACG |
GGAACAATGG |
CCACAGTGTC |
CCACCATCAA |
AGAGATCAGA |
GACCAGGGCT |
|
561 |
CCTGTGGCTC |
CTGCTGGGCC |
TTCGGGGCTG |
TGGAAGCCAT |
CTCTGACCGG |
ATCTGCATCC |
ACACCAATGC |
GCACGTCAGC |
|
641 |
GTGGAGGTGT |
CGGCGGAGGA |
CCTGCTCACA |
TGCTGTGGCA |
GCATGTGTGG |
GGACGGCTGT |
AATGGTGGCT |
ATCCTGCTGA |
|
721 |
AGCTTGGAAC |
TTCTGGACAA |
GAAAAGGCCT |
GGTTTCTGGT |
GGCCTCTATG |
AATCCCATGT |
AGGGTGCAGA |
CCGTACTCCA |
|
801 |
TCCCTCCCTG |
TGAGCACCAC |
GTCAACGGCT |
CCCGGCCCCC |
ATGCACGGGG |
GAGGGAGATA |
CCCCCAAGTG |
TAGCAAGATC |
|
881 |
TGTGAGCCTG |
GCTACAGCCC |
GACCTACAAA |
CAGGACAAGC |
ACTACGGATA |
CAATTCCTAC |
AGCGTCTCCA |
ATAGCGAGAA |
|
961 |
GGACATCATG |
GCCGAGATCT |
ACAAAAACGG |
CCCCGTGGAG |
GGAGCTTTCT |
CTGTGTATTC |
GGACTTCCTG |
CTCTACAAGT |
|
1041 |
CAGGAGTGTA |
CCAACACGTC |
ACCGGAGAGA |
TGATGGGTGG |
CCATGCCATC |
CGCATCCTGG |
GCTGGGGAGT |
GGAGAATGGC |
|
1121 |
ACACCCTACT |
GGCTGGTTGC |
CAACTCCTGG |
AACACTGACT |
GGGGTGACAA |
TGGCTTCTTT |
AAAATACTCA |
GAGGACAGGA |
|
1201 |
TCACTGTGGA |
ATCGAATCAG |
AAGTGGTGGC |
TGGAATTCCA |
CGCACCGATC |
AGTACTGGGA |
AAAGATCTAA |
TCTGCCGTGG |
|
1281 |
GCCTGTCGTG |
CCAGTCCTGG |
GGGCGAGATC |
GGGGTAGAAA |
TGCATTTTAT |
TCTTTAAGTT |
CACGTAAGAT |
ACAAGTTTCA |
|
1361 |
GACAGGGTCT |
GAAGGACTGG |
ATTGGCCAAA |
CATCAGACCT |
GTCTTCCAAG |
GAGACCAAGT |
CCTGGCTACA |
TCCCAGCCTG |
|
1441 |
TGGTTACAGT |
GCAGACAGGC |
CATGTGAGCC |
ACCGCTGCCA |
GCACAGAGCG |
TCCTTCCCCC |
TGTAGACTAG |
TGCCGTAGGG |
|
1521 |
AGTACCTGCT |
GCCCCAGCTG |
ACTGTGGCCC |
CCTCCGTGAT |
CCATCCATCT |
CCAGGGAGCA |
AGACAGAGAC |
GCAGGAATGG |
|
1601 |
AAAGCGGAGT |
TCCTAACAGG |
ATGAAAGTTC |
CCCCATCAGT |
TCCCCCAGTA |
CCTCCAAGCA |
AGTAGCTTTC |
CACATTTGTC |
|
1681 |
ACAGAAATCA |
GAGGAGAGAC |
GGTGTTGGGA |
GCCCTTTGGA |
GAACGCCAGT |
CTCCCAGGCC |
CCCTGCATCT |
ATCGAGTTTG |
|
1761 |
CAATGTCACA |
ACCTCTCTGA |
TCTTGTGCTC |
AGCATGATTC |
TTTAATAGAA |
GTTTTATTTT |
TTCGTGCACT |
CTGCTAATCA |
|
1841 |
TGTGGGTGAG |
CCAGTGGAAC |
AGCGGGAGAC |
CTGTGCTAGT |
TTTACAGATT |
GCCTCCTTAT |
GACGCGGCTC |
AAAAGGAAAC |
|
1921 |
CAAGTGGTCA |
GGAGTTGTTT |
CTGACCCACT |
GATCTCTACT |
ACCACAAGGA |
AAATAGTTTA |
GGAGAAACCA |
GCTTTTACTG |
|
2001 |
TTTTTGAAAA |
ATTACAGCTT |
CACCCTGTCA |
AGTTAACAAG |
GAATGCCTGT |
GCCAATAAAA |
GTTTTCTCCA |
ACTTGAAGTC |
|
2081 |
TACTCTGATG |
GGATCTCAGA |
TCCTTTGTCA |
CTGCCTATAG |
ACTTGTAGCT |
GCTGTCTCTC |
TTTGTCCCTG |
CAGAGAATCA |
|
2161 |
CGTCCTGGAA |
CTGCATGTTC |
TTGCGACTCT |
TGGGACTTCA |
TCTTAACTTC |
TCGCTGCCCC |
AGCCATGTTT |
TCAACCATGG |
|
2241 |
CATCCCTCCC |
CCAATTAGTT |
CCCTGTCATC |
CTCGTCAACC |
TTCTCTGTAA |
GTGCCTGGTA |
AGCTTGCCCT |
TGCTTAAGAA |
|
2321 |
CTCAAAACAT |
AGCTGTGCTC |
TATTTTTTTG |
TTGTTGTTGT |
GACTGACAGA |
GTGAGATTCC |
GTCTCCCAGG |
CTGGAGTGCA |
|
2401 |
GTGGCGCCTT |
CTCAGCTCAC |
TGCAACCTGC |
AGCCTCCTAG |
ATTCAAGCGA |
TTCTCCTGCT |
TCAGCCTTCC |
GAGTAGCTGG |
|
2481 |
GATGACAGGC |
ACTCACCAAT |
ATGCCTGGGT |
AATTTTTGTA |
TTTTTAAGTA |
CATACAGGAT |
TTCACCATGT |
TGGCCAGGCT |
|
2561 |
AGTTTCAAAC |
TCCCGGCCTC |
AGGTGGTCTG |
CCTGCCTCAG |
CCTCCCAAAG |
TGTTGGGATT |
ACAGGCGTGA |
GCCACTGGGC |
|
2641 |
CCTGCCTGTA |
TTTTTTATCA |
GCCACAAATC |
CAGCAACAAG |
CTGAGGATTC |
AGCTCATAAA |
ACAGGCTTGG |
TGTCTTGGTG |
|
2721 |
ATCTCACATA |
ACCAAGATGC |
TACCCCGTGG |
GGAACCACAT |
CCCCCTGGAT |
GCCCTCCAGC |
CTTGGTTTGG |
GCTGGAGTCA |
|
2801 |
GGGCCTGTAT |
ACAGTATTTT |
GAATTTGTAT |
GCCACTGGTT |
TGCATTGCTG |
GTCAGGAACT |
CTAGTGCTTT |
GCATAGCCCT |
|
2881 |
GGTTTAGAAA |
CATGTTATAG |
CAGTTCTTGG |
TATAGAGCAA |
ACTAGAAGAA |
CCAGCAATCA |
TTCCACTGTC |
CTGCCAAGGT |
|
2961 |
ACACCTCAGT |
ACTCCCCTTC |
CCAACTGAAG |
TGGTATGAGG |
CTAGCTCTTT |
CCAAAAGCAT |
TCAAGTTTGG |
CTTCTGATGT |
|
3041 |
GACTCAGAAT |
TTAGGAACCA |
GATGCTAGAT |
CAAATAAGCT |
CTGAAAATCT |
GAGGAACATT |
GTAGGAAAGG |
TTTGTTAAGC |
|
3121 |
ATCTCTTAAG |
TGCCATGATG |
AGCATAACAG |
CCGGCCGTCG |
TGGCTCACGC |
CTGTAATCCC |
AGCACTTTGG |
GAGGCCAAGG |
|
3201 |
TGGGAGGATG |
ACAAGGTCAG |
GAGTTCAAGA |
CCAGCCTGGC |
CAACATGCTG |
AAACCTCACC |
TCTACTAAAA |
ATACAAAAAT |
|
3281 |
TAGCTGGGCA |
TGGTGGCACA |
TGCCTGTAAT |
CCCAGCTACT |
TGGGAGGCTG |
AGGCAGGAGA |
ATCGCTTGAA |
CCCGGGAGGC |
|
3361 |
GGAGGTTGCA |
GTGAGCCAAG |
ACAGTGCCAG |
TGCACTCCAG |
CCTCGGTGAC |
AGCGCAAGGC |
TCCGTCTCAA |
TAATTAAAAA |
|
3441 |
AAAAAAAAAA |
AAAAAAAAGG |
CCGGGCGCAG |
TGGCTCAAGC |
CTGTAATCCC |
AGCACTTTGG |
GAGGCTGAGG |
CGGGCAGATC |
|
3521 |
ACCTGAGGTC |
AGGAGTTTTG |
AGATCAGCCT |
TGGCAACACG |
GTGAAACCCC |
ATCTCTACTA |
AAAATACAAA |
ATTAGCCAAG |
|
3601 |
CATGCTGGCA |
CATGCCTGTA |
ATCCCAGCTA |
CTCGGGAGGC |
TGAGGTACGA |
GAATCGCTTG |
AACCTGGGAG |
GCAGAGGATG |
|
3681 |
CAGTGAGCCG |
AGATCACGCC |
ATTGCACTCC |
AGCCTGGGGG |
ACAAGAGTGA |
ATCTGTGTCT |
CACCAAAAAA |
AAAAAGAAAA |
|
3761 |
AGAAAGATGC |
TTAACAAAGG |
TTACCATAAG |
CCACAAATTC |
ATAACCACTT |
ATCCTTCCAG |
TTTCAAGTAG |
AATATATTCA |
|
3841 |
TAACCTCAAT |
AAAGTTCTCC |
CTGCTCCCAA |
A |
|
>gi|66346649|ref|NM_147782.2|Cathepsin B (CTSB), transcript variant 4
GGGGCGGGGCCGGGAGGGTACTTAGGGCCGGGGCTGGCCCAGGCTACGGCGGCTGCAGGGCTCCGGCAACCGCTCCGGCA
ACGCCAACCGCTCCGCTGCGCGCAGGCTGGGCTGCAGGCTCTCGGCTGCAGCGCTGGGCTGGTGTGCAGTGGTGCGACCA
CGGCTCACGGCAGCCTCAGCCACCCAGATGTAAGCGATCTGGTTCCCACCTCAGCCTCCCGAGTAGTGGATCTAGGATCC
GGCTTCCAACATGTGGCAGCTCTGGGCCTCCCTCTGCTGCCTGCTGGTGTTGGCCAATGCCCGGAGCAGGCCCTCTTTCC
ATCCCCTGTCGGATGAGCTGGTCAACTATGTCAACAAACGGAATACCACGTGGCAGGCCGGGCACAACTTCTACAACGTG
GACATGAGCTACTTGAAGAGGCTATGTGGTACCTTCCTGGGTGGGCCCAAGCCACCCCAGAGAGTTATGTTTACCGAGGA
CCTGAAGCTGCCTGCAAGCTTCGATGCACGGGAACAATGGCCACAGTGTCCCACCATCAAAGAGATCAGAGACCAGGGCT
CCTGTGGCTCCTGCTGGGCCTTCGGGGCTGTGGAAGCCATCTCTGACCGGATCTGCATCCACACCAATGCGCACGTCAGC
GTGGAGGTGTCGGCGGAGGACCTGCTCACATGCTGTGGCAGCATGTGTGGGGACGGCTGTAATGGTGGCTATCCTGCTGA
AGCTTGGAACTTCTGGACAAGAAAAGGCCTGGTTTCTGGTGGCCTCTATGAATCCCATGTAGGGTGCAGACCGTACTCCA
TCCCTCCCTGTGAGCACCACGTCAACGGCTCCCGGCCCCCATGCACGGGGGAGGGAGATACCCCCAAGTGTAGCAAGATC
TGTGAGCCTGGCTACAGCCCGACCTACAAACAGGACAAGCACTACGGATACAATTCCTACAGCGTCTCCAATAGCGAGAA
GGACATCATGGCCGAGATCTACAAAAACGGCCCCGTGGAGGGAGCTTTCTCTGTGTATTCGGACTTCCTGCTCTACAAGT
CAGGAGTGTACCAACACGTCACCGGAGAGATGATGGGTGGCCATGCCATCCGCATCCTGGGCTGGGGAGTGGAGAATGGC
ACACCCTACTGGCTGGTTGCCAACTCCTGGAACACTGACTGGGGTGACAATGGCTTCTTTAAAATACTCAGAGGACAGGA
TCACTGTGGAATCGAATCAGAAGTGGTGGCTGGAATTCCACGCACCGATCAGTACTGGGAAAAGATCTAATCTGCCGTGG
GCCTGTCGTGCCAGTCCTGGGGGCGAGATCGGGGTAGAAATGCATTTTATTCTTTAAGTTCACGTAAGATACAAGTTTCA
GACAGGGTCTGAAGGACTGGATTGGCCAAACATCAGACCTGTCTTCCAAGGAGACCAAGTCCTGGCTACATCCCAGCCTG
TGGTTACAGTGCAGACAGGCCATGTGAGCCACCGCTGCCAGCACAGAGCGTCCTTCCCCCTGTAGACTAGTGCCGTAGGG
AGTACCTGCTGCCCCAGCTGACTGTGGCCCCCTCCGTGATCCATCCATCTCCAGGGAGCAAGACAGAGACGCAGGAATGG
AAAGCGGAGTTCCTAACAGGATGAAAGTTCCCCCATCAGTTCCCCCAGTACCTCCAAGCAAGTAGCTTTCCACATTTGTC
ACAGAAATCAGAGGAGAGACGGTGTTGGGAGCCCTTTGGAGAACGCCAGTCTCCCAGGCCCCCTGCATCTATCGAGTTTG
CAATGTCACAACCTCTCTGATCTTGTGCTCAGCATGATTCTTTAATAGAAGTTTTATTTTTTCGTGCACTCTGCTAATCA
TGTGGGTGAGCCAGTGGAACAGCGGGAGACCTGTGCTAGTTTTACAGATTGCCTCCTTATGACGCGGCTCAAAAGGAAAC
CAAGTGGTCAGGAGTTGTTTCTGACCCACTGATCTCTACTACCACAAGGAAAATAGTTTAGGAGAAACCAGCTTTTACTG
TTTTTGAAAAATTACAGCTTCACCCTGTCAAGTTAACAAGGAATGCCTGTGCCAATAAAAGTTTTCTCCAACTTGAAGTC
TACTCTGATGGGATCTCAGATCCTTTGTCACTGCCTATAGACTTGTAGCTGCTGTCTCTCTTTGTCCCTGCAGAGAATCA
CGTCCTGGAACTGCATGTTCTTGCGACTCTTGGGACTTCATCTTAACTTCTCGCTGCCCCAGCCATGTTTTCAACCATGG
CATCCCTCCCCCAATTAGTTCCCTGTCATCCTCGTCAACCTTCTCTGTAAGTGCCTGGTAAGCTTGCCCTTGCTTAAGAA
CTCAAAACATAGCTGTGCTCTATTTTTTTGTTGTTGTTGTGACTGACAGAGTGAGATTCCGTCTCCCAGGCTGGAGTGCA
GTGGCGCCTTCTCAGCTCACTGCAACCTGCAGCCTCCTAGATTCAAGCGATTCTCCTGCTTCAGCCTTCCGAGTAGCTGG
GATGACAGGCACTCACCAATATGCCTGGGTAATTTTTGTATTTTTAAGTACATACAGGATTTCACCATGTTGGCCAGGCT
AGTTTCAAACTCCCGGCCTCAGGTGGTCTGCCTGCCTCAGCCTCCCAAAGTGTTGGGATTACAGGCGTGAGCCACTGGGC
CCTGCCTGTATTTTTTATCAGCCACAAATCCAGCAACAAGCTGAGGATTCAGCTCATAAAACAGGCTTGGTGTCTTGGTG
ATCTCACATAACCAAGATGCTACCCCGTGGGGAACCACATCCCCCTGGATGCCCTCCAGCCTTGGTTTGGGCTGGAGTCA
GGGCCTGTATACAGTATTTTGAATTTGTATGCCACTGGTTTGCATTGCTGGTCAGGAACTCTAGTGCTTTGCATAGCCCT
GGTTTAGAAACATGTTATAGCAGTTCTTGGTATAGAGCAAACTAGAAGAACCAGCAATCATTCCACTGTCCTGCCAAGGT
ACACCTCAGTACTCCCCTTCCCAACTGAAGTGGTATGAGGCTAGCTCTTTCCAAAAGCATTCAAGTTTGGCTTCTGATGT
GACTCAGAATTTAGGAACCAGATGCTAGATCAAATAAGCTCTGAAAATCTGAGGAACATTGTAGGAAAGGTTTGTTAAGC
ATCTCTTAAGTGCCATGATGAGCATAACAGCCGGCCGTCGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGG
TGGGAGGATGACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGCTGAAACCTCACCTCTACTAAAAATACAAAAAT
TAGCTGGGCATGGTGGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGC
GGAGGTTGCAGTGAGCCAAGACAGTGCCAGTGCACTCCAGCCTCGGTGACAGCGCAAGGCTCCGTCTCAATAATTAAAAA
AAAAAAAAAAAAAAAAAAGGCCGGGCGCAGTGGCTCAAGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATC
ACCTGAGGTCAGGAGTTTTGAGATCAGCCTTGGCAACACGGTGAAACCCCATCTCTACTAAAAATACAAAATTAGCCAAG
CATGCTGGCACATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTACGAGAATCGCTTGAACCTGGGAGGCAGAGGATG
CAGTGAGCCGAGATCACGCCATTGCACTCCAGCCTGGGGGACAAGAGTGAATCTGTGTCTCACCAAAAAAAAAAAGAAAA
AGAAAGATGCTTAACAAAGGTTACCATAAGCCACAAATTCATAACCACTTATCCTTCCAGTTTCAAGTAGAATATATTCA
TAACCTCAATAAAGTTCTCCCTGCTCCCAAA
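The FASTA record above uses a gi-style header of the form `>gi|<GI>|<db>|<accession>|<description>`. The sketch below shows one way such a record could be split into its fields in Python. The function names `parse_ncbi_header` and `read_fasta` are hypothetical helpers, and only the first two sequence lines of the record are reproduced to keep the example short.

```python
def parse_ncbi_header(header):
    """Split a '>gi|<gi>|<db>|<accession>|<description>' header into fields."""
    if not header.startswith(">"):
        raise ValueError("FASTA header must start with '>'")
    parts = header[1:].split("|", 4)
    if len(parts) != 5 or parts[0] != "gi":
        raise ValueError("not a gi-style NCBI header")
    _, gi, database, accession, description = parts
    return {"gi": gi, "db": database, "accession": accession,
            "description": description.strip()}


def read_fasta(text):
    """Return (header, sequence) for a single-record FASTA string."""
    lines = [line.strip() for line in text.strip().splitlines()]
    return lines[0], "".join(lines[1:])


# Header and first two sequence lines of the record above (truncated on purpose).
record = """>gi|66346649|ref|NM_147782.2|Cathepsin B (CTSB), transcript variant 4
GGGGCGGGGCCGGGAGGGTACTTAGGGCCGGGGCTGGCCCAGGCTACGGCGGCTGCAGGGCTCCGGCAACCGCTCCGGCA
ACGCCAACCGCTCCGCTGCGCGCAGGCTGGGCTGCAGGCTCTCGGCTGCAGCGCTGGGCTGGTGTGCAGTGGTGCGACCA
"""
header, sequence = read_fasta(record)
fields = parse_ncbi_header(header)
print(fields["accession"], fields["gi"], len(sequence))
```

Joining the wrapped lines before any coordinate lookup matters here, because the positions quoted elsewhere in the record (for example the CDS bounds) refer to the unwrapped sequence.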
|
Primary source | RefSeq : NM_147783.2 (GI:66346650) |
Name | Cathepsin B (CTSB), transcript variant 5 |
Type of transcript | mRNA |
Organism | Homo sapiens |
Length | 3857 nt |
Map | 8p22 |
Location | Chromosome 8 (NC_000008.10) strand : - |
11700032...11702730 | 11703169...11703297 | 11704560...11704676 | 11705187...11705330 |
11705575...11705660 | 11706554...11706672 | 11708374...11708488 | 11710118...11710203 |
11710837...11710987 | 11718914...11718987 | 11725509...11725645 |
|
Exon(s) | Start | End | Length | is coding |
Exon 1 | 1 | 137 | 137 | 1 |
Exon 3a | 138 | 211 | 74 | 1 |
Exon 4 | 212 | 362 | 151 | 1 |
Exon 5 | 363 | 448 | 86 | 1 |
Exon 6 | 449 | 563 | 115 | 1 |
Exon 7 | 564 | 682 | 119 | 1 |
Exon 8 | 683 | 768 | 86 | 1 |
Exon 9 | 769 | 912 | 144 | 1 |
Exon 10 | 913 | 1029 | 117 | 1 |
Exon 11 | 1030 | 1158 | 129 | 1 |
Exon 12 | 1159 | 3857 | 2699 | 1 |
|
ID | Class | Location | Mutation | Length | is synonymous | Source |
|
Primary source | NCBI : CCDS5986 |
Nucleotide | CTSB, mRNA isoform 5 [NM_147783.2] : 237...1256 |
Length | 1020 |
Location | Chromosome 8 (NC_000008.10) strand : - |
11702633...11702730 | 11703169...11703297 | 11704560...11704676 | 11705187...11705330 |
11705575...11705660 | 11706554...11706672 | 11708374...11708488 | 11710118...11710203 |
11710837...11710962 |
Start codon | 1 |
Translation | NP_680093.1 |
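Transcript variants 4 and 5 share the same coding sequence (CCDS5986) but list different CDS start positions (251 and 237). The Python sketch below, using exon lengths copied from the two exon tables in this record, illustrates how the single differing 5' exon (Exon 2 versus Exon 3a) could account for both the difference in transcript length and the shift of the CDS start. The variable names are assumptions made for the example.

```python
# Exon lengths in transcript order, copied from the exon tables of each variant.
VARIANT_4 = {"Exon 1": 137, "Exon 2": 88, "Exon 4": 151, "Exon 5": 86,
             "Exon 6": 115, "Exon 7": 119, "Exon 8": 86, "Exon 9": 144,
             "Exon 10": 117, "Exon 11": 129, "Exon 12": 2699}
VARIANT_5 = {"Exon 1": 137, "Exon 3a": 74, "Exon 4": 151, "Exon 5": 86,
             "Exon 6": 115, "Exon 7": 119, "Exon 8": 86, "Exon 9": 144,
             "Exon 10": 117, "Exon 11": 129, "Exon 12": 2699}

# CDS start positions on each transcript, as stated in the record.
CDS_START = {"NM_147782.2": 251, "NM_147783.2": 237}

# Exons present in only one of the two variants.
only_in_4 = {name: n for name, n in VARIANT_4.items() if name not in VARIANT_5}
only_in_5 = {name: n for name, n in VARIANT_5.items() if name not in VARIANT_4}

length_difference = sum(VARIANT_4.values()) - sum(VARIANT_5.values())
exon_difference = sum(only_in_4.values()) - sum(only_in_5.values())
cds_shift = CDS_START["NM_147782.2"] - CDS_START["NM_147783.2"]

print(only_in_4, only_in_5)
print(length_difference, exon_difference, cds_shift)
```

Because the differing exon lies upstream of the start codon in both transcripts, a change in its length moves the CDS start by the same number of nucleotides without altering the encoded protein.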
|
1 |
GGGGCGGGGC |
CGGGAGGGTA |
CTTAGGGCCG |
GGGCTGGCCC |
AGGCTACGGC |
GGCTGCAGGG |
CTCCGGCAAC |
CGCTCCGGCA |
|
81 |
ACGCCAACCG |
CTCCGCTGCG |
CGCAGGCTGG |
GCTGCAGGCT |
CTCGGCTGCA |
GCGCTGGGTG |
TCTTCAGGCC |
TATGGAGAGC |
|
161 |
AGCTTGCGTG |
GGCTGGGCCT |
GCAGTACCTG |
GTTTGCATAG |
ATGATTGGCA |
GGTGGATCTA |
GGATCCGGCT |
TCCAACATGT |
|
241 |
GGCAGCTCTG |
GGCCTCCCTC |
TGCTGCCTGC |
TGGTGTTGGC |
CAATGCCCGG |
AGCAGGCCCT |
CTTTCCATCC |
CCTGTCGGAT |
|
321 |
GAGCTGGTCA |
ACTATGTCAA |
CAAACGGAAT |
ACCACGTGGC |
AGGCCGGGCA |
CAACTTCTAC |
AACGTGGACA |
TGAGCTACTT |
|
401 |
GAAGAGGCTA |
TGTGGTACCT |
TCCTGGGTGG |
GCCCAAGCCA |
CCCCAGAGAG |
TTATGTTTAC |
CGAGGACCTG |
AAGCTGCCTG |
|
481 |
CAAGCTTCGA |
TGCACGGGAA |
CAATGGCCAC |
AGTGTCCCAC |
CATCAAAGAG |
ATCAGAGACC |
AGGGCTCCTG |
TGGCTCCTGC |
|
561 |
TGGGCCTTCG |
GGGCTGTGGA |
AGCCATCTCT |
GACCGGATCT |
GCATCCACAC |
CAATGCGCAC |
GTCAGCGTGG |
AGGTGTCGGC |
|
641 |
GGAGGACCTG |
CTCACATGCT |
GTGGCAGCAT |
GTGTGGGGAC |
GGCTGTAATG |
GTGGCTATCC |
TGCTGAAGCT |
TGGAACTTCT |
|
721 |
GGACAAGAAA |
AGGCCTGGTT |
TCTGGTGGCC |
TCTATGAATC |
CCATGTAGGG |
TGCAGACCGT |
ACTCCATCCC |
TCCCTGTGAG |
|
801 |
CACCACGTCA |
ACGGCTCCCG |
GCCCCCATGC |
ACGGGGGAGG |
GAGATACCCC |
CAAGTGTAGC |
AAGATCTGTG |
AGCCTGGCTA |
|
881 |
CAGCCCGACC |
TACAAACAGG |
ACAAGCACTA |
CGGATACAAT |
TCCTACAGCG |
TCTCCAATAG |
CGAGAAGGAC |
ATCATGGCCG |
|
961 |
AGATCTACAA |
AAACGGCCCC |
GTGGAGGGAG |
CTTTCTCTGT |
GTATTCGGAC |
TTCCTGCTCT |
ACAAGTCAGG |
AGTGTACCAA |
|
1041 |
CACGTCACCG |
GAGAGATGAT |
GGGTGGCCAT |
GCCATCCGCA |
TCCTGGGCTG |
GGGAGTGGAG |
AATGGCACAC |
CCTACTGGCT |
|
1121 |
GGTTGCCAAC |
TCCTGGAACA |
CTGACTGGGG |
TGACAATGGC |
TTCTTTAAAA |
TACTCAGAGG |
ACAGGATCAC |
TGTGGAATCG |
|
1201 |
AATCAGAAGT |
GGTGGCTGGA |
ATTCCACGCA |
CCGATCAGTA |
CTGGGAAAAG |
ATCTAATCTG |
CCGTGGGCCT |
GTCGTGCCAG |
|
1281 |
TCCTGGGGGC |
GAGATCGGGG |
TAGAAATGCA |
TTTTATTCTT |
TAAGTTCACG |
TAAGATACAA |
GTTTCAGACA |
GGGTCTGAAG |
|
1361 |
GACTGGATTG |
GCCAAACATC |
AGACCTGTCT |
TCCAAGGAGA |
CCAAGTCCTG |
GCTACATCCC |
AGCCTGTGGT |
TACAGTGCAG |
|
1441 |
ACAGGCCATG |
TGAGCCACCG |
CTGCCAGCAC |
AGAGCGTCCT |
TCCCCCTGTA |
GACTAGTGCC |
GTAGGGAGTA |
CCTGCTGCCC |
|
1521 |
CAGCTGACTG |
TGGCCCCCTC |
CGTGATCCAT |
CCATCTCCAG |
GGAGCAAGAC |
AGAGACGCAG |
GAATGGAAAG |
CGGAGTTCCT |
|
1601 |
AACAGGATGA |
AAGTTCCCCC |
ATCAGTTCCC |
CCAGTACCTC |
CAAGCAAGTA |
GCTTTCCACA |
TTTGTCACAG |
AAATCAGAGG |
|
1681 |
AGAGACGGTG |
TTGGGAGCCC |
TTTGGAGAAC |
GCCAGTCTCC |
CAGGCCCCCT |
GCATCTATCG |
AGTTTGCAAT |
GTCACAACCT |
|
1761 |
CTCTGATCTT |
GTGCTCAGCA |
TGATTCTTTA |
ATAGAAGTTT |
TATTTTTTCG |
TGCACTCTGC |
TAATCATGTG |
GGTGAGCCAG |
|
1841 |
TGGAACAGCG |
GGAGACCTGT |
GCTAGTTTTA |
CAGATTGCCT |
CCTTATGACG |
CGGCTCAAAA |
GGAAACCAAG |
TGGTCAGGAG |
|
1921 |
TTGTTTCTGA |
CCCACTGATC |
TCTACTACCA |
CAAGGAAAAT |
AGTTTAGGAG |
AAACCAGCTT |
TTACTGTTTT |
TGAAAAATTA |
|
2001 |
CAGCTTCACC |
CTGTCAAGTT |
AACAAGGAAT |
GCCTGTGCCA |
ATAAAAGTTT |
TCTCCAACTT |
GAAGTCTACT |
CTGATGGGAT |
|
2081 |
CTCAGATCCT |
TTGTCACTGC |
CTATAGACTT |
GTAGCTGCTG |
TCTCTCTTTG |
TCCCTGCAGA |
GAATCACGTC |
CTGGAACTGC |
|
2161 |
ATGTTCTTGC |
GACTCTTGGG |
ACTTCATCTT |
AACTTCTCGC |
TGCCCCAGCC |
ATGTTTTCAA |
CCATGGCATC |
CCTCCCCCAA |
|
2241 |
TTAGTTCCCT |
GTCATCCTCG |
TCAACCTTCT |
CTGTAAGTGC |
CTGGTAAGCT |
TGCCCTTGCT |
TAAGAACTCA |
AAACATAGCT |
|
2321 |
GTGCTCTATT |
TTTTTGTTGT |
TGTTGTGACT |
GACAGAGTGA |
GATTCCGTCT |
CCCAGGCTGG |
AGTGCAGTGG |
CGCCTTCTCA |
|
2401 |
GCTCACTGCA |
ACCTGCAGCC |
TCCTAGATTC |
AAGCGATTCT |
CCTGCTTCAG |
CCTTCCGAGT |
AGCTGGGATG |
ACAGGCACTC |
|
2481 |
ACCAATATGC |
CTGGGTAATT |
TTTGTATTTT |
TAAGTACATA |
CAGGATTTCA |
CCATGTTGGC |
CAGGCTAGTT |
TCAAACTCCC |
|
2561 |
GGCCTCAGGT |
GGTCTGCCTG |
CCTCAGCCTC |
CCAAAGTGTT |
GGGATTACAG |
GCGTGAGCCA |
CTGGGCCCTG |
CCTGTATTTT |
|
2641 |
TTATCAGCCA |
CAAATCCAGC |
AACAAGCTGA |
GGATTCAGCT |
CATAAAACAG |
GCTTGGTGTC |
TTGGTGATCT |
CACATAACCA |
|
2721 |
AGATGCTACC |
CCGTGGGGAA |
CCACATCCCC |
CTGGATGCCC |
TCCAGCCTTG |
GTTTGGGCTG |
GAGTCAGGGC |
CTGTATACAG |
|
2801 |
TATTTTGAAT |
TTGTATGCCA |
CTGGTTTGCA |
TTGCTGGTCA |
GGAACTCTAG |
TGCTTTGCAT |
AGCCCTGGTT |
TAGAAACATG |
|
2881 |
TTATAGCAGT |
TCTTGGTATA |
GAGCAAACTA |
GAAGAACCAG |
CAATCATTCC |
ACTGTCCTGC |
CAAGGTACAC |
CTCAGTACTC |
|
2961 |
CCCTTCCCAA |
CTGAAGTGGT |
ATGAGGCTAG |
CTCTTTCCAA |
AAGCATTCAA |
GTTTGGCTTC |
TGATGTGACT |
CAGAATTTAG |
|
3041 |
GAACCAGATG |
CTAGATCAAA |
TAAGCTCTGA |
AAATCTGAGG |
AACATTGTAG |
GAAAGGTTTG |
TTAAGCATCT |
CTTAAGTGCC |
|
3121 |
ATGATGAGCA |
TAACAGCCGG |
CCGTCGTGGC |
TCACGCCTGT |
AATCCCAGCA |
CTTTGGGAGG |
CCAAGGTGGG |
AGGATGACAA |
|
3201 |
GGTCAGGAGT |
TCAAGACCAG |
CCTGGCCAAC |
ATGCTGAAAC |
CTCACCTCTA |
CTAAAAATAC |
AAAAATTAGC |
TGGGCATGGT |
|
3281 |
GGCACATGCC |
TGTAATCCCA |
GCTACTTGGG |
AGGCTGAGGC |
AGGAGAATCG |
CTTGAACCCG |
GGAGGCGGAG |
GTTGCAGTGA |
|
3361 |
GCCAAGACAG |
TGCCAGTGCA |
CTCCAGCCTC |
GGTGACAGCG |
CAAGGCTCCG |
TCTCAATAAT |
TAAAAAAAAA |
AAAAAAAAAA |
|
3441 |
AAAAGGCCGG |
GCGCAGTGGC |
TCAAGCCTGT |
AATCCCAGCA |
CTTTGGGAGG |
CTGAGGCGGG |
CAGATCACCT |
GAGGTCAGGA |
|
3521 |
GTTTTGAGAT |
CAGCCTTGGC |
AACACGGTGA |
AACCCCATCT |
CTACTAAAAA |
TACAAAATTA |
GCCAAGCATG |
CTGGCACATG |
|
3601 |
CCTGTAATCC |
CAGCTACTCG |
GGAGGCTGAG |
GTACGAGAAT |
CGCTTGAACC |
TGGGAGGCAG |
AGGATGCAGT |
GAGCCGAGAT |
|
3681 |
CACGCCATTG |
CACTCCAGCC |
TGGGGGACAA |
GAGTGAATCT |
GTGTCTCACC |
AAAAAAAAAA |
AGAAAAAGAA |
AGATGCTTAA |
|
3761 |
CAAAGGTTAC |
CATAAGCCAC |
AAATTCATAA |
CCACTTATCC |
TTCCAGTTTC |
AAGTAGAATA |
TATTCATAAC |
CTCAATAAAG |
|
3841 |
TTCTCCCTGC |
TCCCAAA |
|
>gi|66346650|ref|NM_147783.2|Cathepsin B (CTSB), transcript variant 5
GGGGCGGGGCCGGGAGGGTACTTAGGGCCGGGGCTGGCCCAGGCTACGGCGGCTGCAGGGCTCCGGCAACCGCTCCGGCA
ACGCCAACCGCTCCGCTGCGCGCAGGCTGGGCTGCAGGCTCTCGGCTGCAGCGCTGGGTGTCTTCAGGCCTATGGAGAGC
AGCTTGCGTGGGCTGGGCCTGCAGTACCTGGTTTGCATAGATGATTGGCAGGTGGATCTAGGATCCGGCTTCCAACATGT
GGCAGCTCTGGGCCTCCCTCTGCTGCCTGCTGGTGTTGGCCAATGCCCGGAGCAGGCCCTCTTTCCATCCCCTGTCGGAT
GAGCTGGTCAACTATGTCAACAAACGGAATACCACGTGGCAGGCCGGGCACAACTTCTACAACGTGGACATGAGCTACTT
GAAGAGGCTATGTGGTACCTTCCTGGGTGGGCCCAAGCCACCCCAGAGAGTTATGTTTACCGAGGACCTGAAGCTGCCTG
CAAGCTTCGATGCACGGGAACAATGGCCACAGTGTCCCACCATCAAAGAGATCAGAGACCAGGGCTCCTGTGGCTCCTGC
TGGGCCTTCGGGGCTGTGGAAGCCATCTCTGACCGGATCTGCATCCACACCAATGCGCACGTCAGCGTGGAGGTGTCGGC
GGAGGACCTGCTCACATGCTGTGGCAGCATGTGTGGGGACGGCTGTAATGGTGGCTATCCTGCTGAAGCTTGGAACTTCT
GGACAAGAAAAGGCCTGGTTTCTGGTGGCCTCTATGAATCCCATGTAGGGTGCAGACCGTACTCCATCCCTCCCTGTGAG
CACCACGTCAACGGCTCCCGGCCCCCATGCACGGGGGAGGGAGATACCCCCAAGTGTAGCAAGATCTGTGAGCCTGGCTA
CAGCCCGACCTACAAACAGGACAAGCACTACGGATACAATTCCTACAGCGTCTCCAATAGCGAGAAGGACATCATGGCCG
AGATCTACAAAAACGGCCCCGTGGAGGGAGCTTTCTCTGTGTATTCGGACTTCCTGCTCTACAAGTCAGGAGTGTACCAA
CACGTCACCGGAGAGATGATGGGTGGCCATGCCATCCGCATCCTGGGCTGGGGAGTGGAGAATGGCACACCCTACTGGCT
GGTTGCCAACTCCTGGAACACTGACTGGGGTGACAATGGCTTCTTTAAAATACTCAGAGGACAGGATCACTGTGGAATCG
AATCAGAAGTGGTGGCTGGAATTCCACGCACCGATCAGTACTGGGAAAAGATCTAATCTGCCGTGGGCCTGTCGTGCCAG
TCCTGGGGGCGAGATCGGGGTAGAAATGCATTTTATTCTTTAAGTTCACGTAAGATACAAGTTTCAGACAGGGTCTGAAG
GACTGGATTGGCCAAACATCAGACCTGTCTTCCAAGGAGACCAAGTCCTGGCTACATCCCAGCCTGTGGTTACAGTGCAG
ACAGGCCATGTGAGCCACCGCTGCCAGCACAGAGCGTCCTTCCCCCTGTAGACTAGTGCCGTAGGGAGTACCTGCTGCCC
CAGCTGACTGTGGCCCCCTCCGTGATCCATCCATCTCCAGGGAGCAAGACAGAGACGCAGGAATGGAAAGCGGAGTTCCT
AACAGGATGAAAGTTCCCCCATCAGTTCCCCCAGTACCTCCAAGCAAGTAGCTTTCCACATTTGTCACAGAAATCAGAGG
AGAGACGGTGTTGGGAGCCCTTTGGAGAACGCCAGTCTCCCAGGCCCCCTGCATCTATCGAGTTTGCAATGTCACAACCT
CTCTGATCTTGTGCTCAGCATGATTCTTTAATAGAAGTTTTATTTTTTCGTGCACTCTGCTAATCATGTGGGTGAGCCAG
TGGAACAGCGGGAGACCTGTGCTAGTTTTACAGATTGCCTCCTTATGACGCGGCTCAAAAGGAAACCAAGTGGTCAGGAG
TTGTTTCTGACCCACTGATCTCTACTACCACAAGGAAAATAGTTTAGGAGAAACCAGCTTTTACTGTTTTTGAAAAATTA
CAGCTTCACCCTGTCAAGTTAACAAGGAATGCCTGTGCCAATAAAAGTTTTCTCCAACTTGAAGTCTACTCTGATGGGAT
CTCAGATCCTTTGTCACTGCCTATAGACTTGTAGCTGCTGTCTCTCTTTGTCCCTGCAGAGAATCACGTCCTGGAACTGC
ATGTTCTTGCGACTCTTGGGACTTCATCTTAACTTCTCGCTGCCCCAGCCATGTTTTCAACCATGGCATCCCTCCCCCAA
TTAGTTCCCTGTCATCCTCGTCAACCTTCTCTGTAAGTGCCTGGTAAGCTTGCCCTTGCTTAAGAACTCAAAACATAGCT
GTGCTCTATTTTTTTGTTGTTGTTGTGACTGACAGAGTGAGATTCCGTCTCCCAGGCTGGAGTGCAGTGGCGCCTTCTCA
GCTCACTGCAACCTGCAGCCTCCTAGATTCAAGCGATTCTCCTGCTTCAGCCTTCCGAGTAGCTGGGATGACAGGCACTC
ACCAATATGCCTGGGTAATTTTTGTATTTTTAAGTACATACAGGATTTCACCATGTTGGCCAGGCTAGTTTCAAACTCCC
GGCCTCAGGTGGTCTGCCTGCCTCAGCCTCCCAAAGTGTTGGGATTACAGGCGTGAGCCACTGGGCCCTGCCTGTATTTT
TTATCAGCCACAAATCCAGCAACAAGCTGAGGATTCAGCTCATAAAACAGGCTTGGTGTCTTGGTGATCTCACATAACCA
AGATGCTACCCCGTGGGGAACCACATCCCCCTGGATGCCCTCCAGCCTTGGTTTGGGCTGGAGTCAGGGCCTGTATACAG
TATTTTGAATTTGTATGCCACTGGTTTGCATTGCTGGTCAGGAACTCTAGTGCTTTGCATAGCCCTGGTTTAGAAACATG
TTATAGCAGTTCTTGGTATAGAGCAAACTAGAAGAACCAGCAATCATTCCACTGTCCTGCCAAGGTACACCTCAGTACTC
CCCTTCCCAACTGAAGTGGTATGAGGCTAGCTCTTTCCAAAAGCATTCAAGTTTGGCTTCTGATGTGACTCAGAATTTAG
GAACCAGATGCTAGATCAAATAAGCTCTGAAAATCTGAGGAACATTGTAGGAAAGGTTTGTTAAGCATCTCTTAAGTGCC
ATGATGAGCATAACAGCCGGCCGTCGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGAGGATGACAA
GGTCAGGAGTTCAAGACCAGCCTGGCCAACATGCTGAAACCTCACCTCTACTAAAAATACAAAAATTAGCTGGGCATGGT
GGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCGGAGGTTGCAGTGA
GCCAAGACAGTGCCAGTGCACTCCAGCCTCGGTGACAGCGCAAGGCTCCGTCTCAATAATTAAAAAAAAAAAAAAAAAAA
AAAAGGCCGGGCGCAGTGGCTCAAGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACCTGAGGTCAGGA
GTTTTGAGATCAGCCTTGGCAACACGGTGAAACCCCATCTCTACTAAAAATACAAAATTAGCCAAGCATGCTGGCACATG
CCTGTAATCCCAGCTACTCGGGAGGCTGAGGTACGAGAATCGCTTGAACCTGGGAGGCAGAGGATGCAGTGAGCCGAGAT
CACGCCATTGCACTCCAGCCTGGGGGACAAGAGTGAATCTGTGTCTCACCAAAAAAAAAAAGAAAAAGAAAGATGCTTAA
CAAAGGTTACCATAAGCCACAAATTCATAACCACTTATCCTTCCAGTTTCAAGTAGAATATATTCATAACCTCAATAAAG
TTCTCCCTGCTCCCAAA
|
Exon number | 3b |
Length | 119 nt |
Location | Chromosome 8 (NC_000008.10) : 11718869...11718987 (-) |
Is part of | CTSB, mRNA isoform 3 (NM_147781.2) |
Sequence |
GTGTCTTCAGGCCTATGGAGAGCAGCTTGCGTGGGCTGGGCCTGCAGTACCTGGTTTGCATAGATGATTGGCAGGTGGGC
AGCACGGGGAAGGACCTGTGAGTGGCCAACCTGGTTCAG
|
Exon number | 12 |
Length | 2699 nt |
Location | Chromosome 8 (NC_000008.10) : 11700032...11702730 (-) |
Is part of | CTSB, mRNA isoform 2 (NM_147780.2); CTSB, mRNA isoform 5 (NM_147783.2); CTSB, mRNA isoform 4 (NM_147782.2); CTSB, mRNA isoform 3 (NM_147781.2); CTSB, mRNA isoform 1 (NM_001908.3) |
Sequence |
GCTTCTTTAAAATACTCAGAGGACAGGATCACTGTGGAATCGAATCAGAAGTGGTGGCTGGAATTCCACGCACCGATCAG
TACTGGGAAAAGATCTAATCTGCCGTGGGCCTGTCGTGCCAGTCCTGGGGGCGAGATCGGGGTAGAAATGCATTTTATTC
TTTAAGTTCACGTAAGATACAAGTTTCAGACAGGGTCTGAAGGACTGGATTGGCCAAACATCAGACCTGTCTTCCAAGGA
GACCAAGTCCTGGCTACATCCCAGCCTGTGGTTACAGTGCAGACAGGCCATGTGAGCCACCGCTGCCAGCACAGAGCGTC
CTTCCCCCTGTAGACTAGTGCCGTAGGGAGTACCTGCTGCCCCAGCTGACTGTGGCCCCCTCCGTGATCCATCCATCTCC
AGGGAGCAAGACAGAGACGCAGGAATGGAAAGCGGAGTTCCTAACAGGATGAAAGTTCCCCCATCAGTTCCCCCAGTACC
TCCAAGCAAGTAGCTTTCCACATTTGTCACAGAAATCAGAGGAGAGACGGTGTTGGGAGCCCTTTGGAGAACGCCAGTCT
CCCAGGCCCCCTGCATCTATCGAGTTTGCAATGTCACAACCTCTCTGATCTTGTGCTCAGCATGATTCTTTAATAGAAGT
TTTATTTTTTCGTGCACTCTGCTAATCATGTGGGTGAGCCAGTGGAACAGCGGGAGACCTGTGCTAGTTTTACAGATTGC
CTCCTTATGACGCGGCTCAAAAGGAAACCAAGTGGTCAGGAGTTGTTTCTGACCCACTGATCTCTACTACCACAAGGAAA
ATAGTTTAGGAGAAACCAGCTTTTACTGTTTTTGAAAAATTACAGCTTCACCCTGTCAAGTTAACAAGGAATGCCTGTGC
CAATAAAAGTTTTCTCCAACTTGAAGTCTACTCTGATGGGATCTCAGATCCTTTGTCACTGCCTATAGACTTGTAGCTGC
TGTCTCTCTTTGTCCCTGCAGAGAATCACGTCCTGGAACTGCATGTTCTTGCGACTCTTGGGACTTCATCTTAACTTCTC
GCTGCCCCAGCCATGTTTTCAACCATGGCATCCCTCCCCCAATTAGTTCCCTGTCATCCTCGTCAACCTTCTCTGTAAGT
GCCTGGTAAGCTTGCCCTTGCTTAAGAACTCAAAACATAGCTGTGCTCTATTTTTTTGTTGTTGTTGTGACTGACAGAGT
GAGATTCCGTCTCCCAGGCTGGAGTGCAGTGGCGCCTTCTCAGCTCACTGCAACCTGCAGCCTCCTAGATTCAAGCGATT
CTCCTGCTTCAGCCTTCCGAGTAGCTGGGATGACAGGCACTCACCAATATGCCTGGGTAATTTTTGTATTTTTAAGTACA
TACAGGATTTCACCATGTTGGCCAGGCTAGTTTCAAACTCCCGGCCTCAGGTGGTCTGCCTGCCTCAGCCTCCCAAAGTG
TTGGGATTACAGGCGTGAGCCACTGGGCCCTGCCTGTATTTTTTATCAGCCACAAATCCAGCAACAAGCTGAGGATTCAG
CTCATAAAACAGGCTTGGTGTCTTGGTGATCTCACATAACCAAGATGCTACCCCGTGGGGAACCACATCCCCCTGGATGC
CCTCCAGCCTTGGTTTGGGCTGGAGTCAGGGCCTGTATACAGTATTTTGAATTTGTATGCCACTGGTTTGCATTGCTGGT
CAGGAACTCTAGTGCTTTGCATAGCCCTGGTTTAGAAACATGTTATAGCAGTTCTTGGTATAGAGCAAACTAGAAGAACC
AGCAATCATTCCACTGTCCTGCCAAGGTACACCTCAGTACTCCCCTTCCCAACTGAAGTGGTATGAGGCTAGCTCTTTCC
AAAAGCATTCAAGTTTGGCTTCTGATGTGACTCAGAATTTAGGAACCAGATGCTAGATCAAATAAGCTCTGAAAATCTGA
GGAACATTGTAGGAAAGGTTTGTTAAGCATCTCTTAAGTGCCATGATGAGCATAACAGCCGGCCGTCGTGGCTCACGCCT
GTAATCCCAGCACTTTGGGAGGCCAAGGTGGGAGGATGACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGCTGAA
ACCTCACCTCTACTAAAAATACAAAAATTAGCTGGGCATGGTGGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAG
GCAGGAGAATCGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGACAGTGCCAGTGCACTCCAGCCTCGGTGACAG
CGCAAGGCTCCGTCTCAATAATTAAAAAAAAAAAAAAAAAAAAAAAGGCCGGGCGCAGTGGCTCAAGCCTGTAATCCCAG
CACTTTGGGAGGCTGAGGCGGGCAGATCACCTGAGGTCAGGAGTTTTGAGATCAGCCTTGGCAACACGGTGAAACCCCAT
CTCTACTAAAAATACAAAATTAGCCAAGCATGCTGGCACATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTACGAGA
ATCGCTTGAACCTGGGAGGCAGAGGATGCAGTGAGCCGAGATCACGCCATTGCACTCCAGCCTGGGGGACAAGAGTGAAT
CTGTGTCTCACCAAAAAAAAAAAGAAAAAGAAAGATGCTTAACAAAGGTTACCATAAGCCACAAATTCATAACCACTTAT
CCTTCCAGTTTCAAGTAGAATATATTCATAACCTCAATAAAGTTCTCCCTGCTCCCAAA
|
Primary source | Uniprot : P07858 |
Name | Cathepsin B |
Alternative name(s) | APP secretase; Cathepsin B1 |
Synonym(s) | APPS |
Organism | Homo sapiens |
Length | 339 aa |
Protein existence | --- |
|
General annotation (Comments)
|
Catalytic activity
|
Hydrolysis of proteins with broad specificity for peptide bonds. Preferentially cleaves -Arg-Arg-|-Xaa bonds in small molecule substrates (thus differing from cathepsin L). In addition to being an endopeptidase, shows peptidyl-dipeptidase activity, liberating C-terminal dipeptides.
|
Function
|
Thiol protease which is believed to participate in intracellular degradation and turnover of proteins. Has also been implicated in tumor invasion and metastasis.
|
Similarity
|
Belongs to the peptidase C1 family.
|
Subcellular location
|
Lysosome. Melanosome. Note: Identified by mass spectrometry in melanosome fractions from stage I to stage IV.
|
Subunit
|
Dimer of a heavy chain and a light chain cross-linked by a disulfide bond. Interacts with SRPX2.
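The Catalytic activity entry above describes two behaviours: a preference for -Arg-Arg-|-Xaa bonds and a peptidyl-dipeptidase activity that liberates C-terminal dipeptides. The Python sketch below is a toy illustration of those two statements on one-letter peptide strings. It is not a predictor of real cleavage, and the function names and the example peptide are assumptions made for the example.

```python
import re


def dibasic_cleavage_sites(peptide):
    """Return 1-based positions of the P1 arginine in each Arg-Arg-|-Xaa motif."""
    # A lookahead lets overlapping motifs (e.g. 'RRR') be reported separately.
    # m.start() is the 0-based index of the first Arg, so the second Arg (P1)
    # sits at 1-based position m.start() + 2.
    return [m.start() + 2 for m in re.finditer(r"(?=RR[A-Z])", peptide)]


def release_c_terminal_dipeptide(peptide):
    """Model one peptidyl-dipeptidase step: split off the last two residues."""
    if len(peptide) < 3:
        raise ValueError("peptide too short to release a dipeptide")
    return peptide[:-2], peptide[-2:]


example = "AARRGK"  # hypothetical substrate, not taken from the record
sites = dibasic_cleavage_sites(example)
remainder, dipeptide = release_c_terminal_dipeptide(example)
print(sites, remainder, dipeptide)
```

The scissile bond in this model lies immediately after each reported position, that is, between the second arginine and the following residue.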
|
Biological process
|
proteolysis [GO:0006508]
regulation of apoptosis [GO:0042981]
regulation of catalytic activity [GO:0050790]
|
Cellular component
|
lysosome [GO:0005764]
melanosome [GO:0042470]
|
Molecular function
|
cysteine-type endopeptidase activity [GO:0004197]
|
Alternative product(s)
|
Feature key | Position | Length | Description | Feature identifier |
Molecule processing | | | | |
Propeptide | 18 - 79 | 62 | Activation peptide | PRO_0000026143 |
Propeptide | 334 - 339 | 6 | | PRO_0000026147 |
Signal | 1 - 17 | 17 | Potential | P07858-SIGNAL-1 |
Sites | | | | |
Active site | 108 - 108 | 1 | | P07858-ACT_SITE-108 |
Active site | 278 - 278 | 1 | | P07858-ACT_SITE-278 |
Active site | 298 - 298 | 1 | | P07858-ACT_SITE-298 |
Natural variations | | | | |
Natural variant site | 26 - 26 | 1 | L -> V (in dbSNP:rs12338) | VAR_006724 |
Natural variant site | 53 - 53 | 1 | S -> G (in dbSNP:rs1803250) | VAR_051511 |
Natural variant site | 91 - 91 | 1 | P -> L (in dbSNP:rs11548596) | VAR_051512 |
Natural variant site | 235 - 235 | 1 | S -> N (in dbSNP:rs17573) | VAR_014696 |
Amino acid modifications | | | | |
Disulfide bond | 93 - 122 | 30 | | P07858-DISULFID-93 |
Disulfide bond | 105 - 150 | 46 | | P07858-DISULFID-105 |
Disulfide bond | 141 - 207 | 67 | | P07858-DISULFID-141 |
Disulfide bond | 142 - 146 | 5 | | P07858-DISULFID-142 |
Disulfide bond | 179 - 211 | 33 | | P07858-DISULFID-179 |
Disulfide bond | 187 - 198 | 12 | | P07858-DISULFID-187 |
Glycosylation | 192 - 192 | 1 | N-linked (GlcNAc...) | P07858-CARBOHYD-192 |
|
1 | MWQLWASLCC | LLVLANARSR | PSFHPLSDEL | VNYVNKRNTT | WQAGHNFYNV | DMSYLKRLCG | TFLGGPKPPQ | RVMFTEDLKL |
81 | PASFDAREQW | PQCPTIKEIR | DQGSCGSCWA | FGAVEAISDR | ICIHTNAHVS | VEVSAEDLLT | CCGSMCGDGC | NGGYPAEAWN |
161 | FWTRKGLVSG | GLYESHVGCR | PYSIPPCEHH | VNGSRPPCTG | EGDTPKCSKI | CEPGYSPTYK | QDKHYGYNSY | SVSNSEKDIM |
241 | AEIYKNGPVE | GAFSVYSDFL | LYKSGVYQHV | TGEMMGGHAI | RILGWGVENG | TPYWLVANSW | NTDWGDNGFF | KILRGQDHCG |
321 | IESEVVAGIP | RTDQYWEKI |
|
>sp|P07858|CATB_HUMAN Cathepsin B
MWQLWASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLKL
PASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN
FWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIM
AEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCG
IESEVVAGIPRTDQYWEKI
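The feature positions listed for P07858 (active sites, disulfide bonds, glycosylation site, natural variants) can be checked against the sequence above. The following Python sketch is one way to do that. The helper `residue` and the constant names are assumptions made for the example, and all positions are 1-based as in the feature table.

```python
# P07858 sequence, copied from the FASTA record above (339 residues).
SEQUENCE = (
    "MWQLWASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLKL"
    "PASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN"
    "FWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIM"
    "AEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCG"
    "IESEVVAGIPRTDQYWEKI"
)

# Positions taken from the feature table of the record.
ACTIVE_SITES = (108, 278, 298)
DISULFIDE_BONDS = ((93, 122), (105, 150), (141, 207),
                   (142, 146), (179, 211), (187, 198))
GLYCOSYLATION_SITE = 192
VARIANTS = ((26, "L", "V"), (53, "S", "G"), (91, "P", "L"), (235, "S", "N"))


def residue(position):
    """Return the residue at a 1-based sequence position."""
    return SEQUENCE[position - 1]


active_site_residues = "".join(residue(p) for p in ACTIVE_SITES)
all_bonds_link_cysteines = all(
    residue(a) == "C" and residue(b) == "C" for a, b in DISULFIDE_BONDS
)
variant_refs_match = all(residue(pos) == ref for pos, ref, _ in VARIANTS)
signal_peptide = SEQUENCE[:17]  # feature "Signal", positions 1 - 17

print(len(SEQUENCE), active_site_residues, residue(GLYCOSYLATION_SITE))
print(all_bonds_link_cysteines, variant_refs_match, signal_peptide)
```

A check of this kind is a quick guard against off-by-one errors when feature coordinates and sequence come from different exports of the same entry.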
|