(CTSD) cathepsin D [Homo sapiens] |
|
|
|
|
|
|
Gene
Transcript(s)
Exon(s)
Protein(s)
|
|
Accession
|
6215
|
Official symbol
|
CTSD
|
Official name
|
cathepsin D
|
Gene type
|
gene with protein product
|
Organism
|
Homo sapiens
|
Location
|
Chromosome 11 (NC_000011.9) : 1773984...1785221 (-)
|
Map
|
11p15.5
|
Length
|
11238 nt
|
NM_001909.3
|
CTSD, mRNA isoform 1
|
Accession
|
Name
|
Organism
|
Length
|
P07339
|
Cathepsin D
|
Homo sapiens
|
412 aa
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Synonyms |
MGC2311; CPSD; CLN10
|
Alternative name(s) |
lysosomal aspartyl peptidase; lysosomal aspartyl protease; ceroid-lipofuscinosis, neuronal 10; cathepsin D (lysosomal aspartyl protease)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Summary |
This gene encodes a lysosomal aspartyl protease composed of a dimer of disulfide-linked heavy and light chains, both produced from a single protein precursor. This proteinase, which is a member of the peptidase C1 family, has a specificity similar to but narrower than that of pepsin A. Transcription of this gene is initiated from several sites, including one which is a start site for an estrogen-regulated transcript. Mutations in this gene are involved in the pathogenesis of several diseases, including breast cancer and possibly Alzheimer disease. [provided by RefSeq].
|
|
|
|
|
|
|
|
|
|
|
|
|
Related Articles in PubMed
|
|
|
|
|
|
|
|
|
|
|
|
Go to ensembl
|
|
|
|
|
|
|
|
|
|
|
|
1 |
CGCACGCCGG |
CCGCGCCCAC |
GTGACCGGTC |
CGGGTGCAAA |
CACGCGGGTC |
AGCTGATCCG |
GCCCAACTGC |
GGCGTCATCC |
|
81 |
CGGCTATAAG |
CGCACGGCCT |
CGGCGACCCT |
CTCCGACCCG |
GCCGCCGCCG |
CCATGCAGCC |
CTCCAGCCTT |
CTGCCGCTCG |
|
161 |
CCCTCTGCCT |
GCTGGCTGCA |
CCCGCCTCCG |
CGCTCGTCAG |
GTGAAGCCTC |
AGGGGCCGGG |
GCTCAGGGAC |
GGGCAGGGGT |
|
241 |
CGCGGCGCCG |
AGGTCCCGGG |
GCCTGTGGTG |
ACTTTCGCGC |
TCCCCTGTGG |
CCCCCACGAG |
CCCCTTGCGC |
CCCCCGCGCT |
|
321 |
GGAATGCACC |
TGTGCCGCCC |
TGCGCGGCCT |
CCTGCACGGA |
CCACCCGCCT |
ACGGGGCGCC |
GGGCTCCGGA |
GGTGCAGGGG |
|
401 |
ACCCGGGGCA |
GAGGCGCCAG |
ATGCCTCTCC |
CCCATATGCC |
ACCCTGGGTT |
GTACCTTGAG |
GACTGCAGAC |
TGACCGCAGC |
|
481 |
CTCCCTGGAG |
ACGGGGCGGG |
GCGGGGGGAG |
GTAGTGCTCA |
TTCGGGGCAG |
GTGGAATTGG |
GGTCTGTACT |
GAGCGCCCTT |
|
561 |
GTTGCTGGAG |
ACCTAGGTCA |
GGCCTCAGAG |
CCCCCGAGTC |
TGGGCGAGTC |
CATTTCCTTA |
GGGACCCCTT |
TACCACCTGT |
|
641 |
GAACTGGGGG |
CTTTAAAAGT |
TTGCTCCAGC |
GGCTCTTATC |
ACAGGCCCTG |
GGCTGGGAGA |
CCCCTCGAGA |
CCCTAGGAGT |
|
721 |
TCCCATGTCC |
CTGAGAGAGG |
AGGAGGCATG |
GGGAGTGGGT |
CGGCTCACCC |
ACCCCGGGCC |
TGGGGTTGTG |
CTGTAGTGAG |
|
801 |
GCCCACACGC |
TCCTCAGGCC |
GATCCCCTGT |
GCCAGGTGAG |
GCCACCGATT |
GGGCCTGGAT |
GGGATGGGGC |
CCGGCCATGC |
|
881 |
CTGACCAGCT |
GGGCAGAGGA |
GGGCCATGCT |
GCAGTCTGCT |
TTCTTGACCC |
CCTCCCCAGC |
CCTTGCAAGG |
CAGCCCGCAT |
|
961 |
TCCCAGGAGG |
GGTATGCTGA |
CCCATCCCAT |
TGGGCACCTG |
CCCCACCCTT |
GCTCTGGGCC |
TTTGTGGGAG |
ACCTGGGATC |
|
1041 |
TGCGATGGGT |
CCACTGCCTT |
TTGGCAGGTG |
GGTGAGGTCA |
GAAGGCTGCA |
GGGGCTGGAG |
CTGGCTGGGC |
CAGCTGGGTA |
|
1121 |
GGACTGAGCC |
TCACCAAAGG |
CTGTGGGGAA |
TGGCCCGGGG |
GCGGGTAGCC |
CCAATTAAAG |
TCGTTGTGGG |
GGAGTAGCCA |
|
1201 |
CAAGCCTGAG |
CCTGCCTTGA |
CCTTGCCAGC |
CTATCCACAG |
GCCTCCCCTC |
TCCAAGGAGG |
ACAGACACAG |
CAGAGGGGAA |
|
1281 |
ACGATCCTGG |
GGCTTCTTGG |
AGGGAAGGGT |
AGCTGAATCC |
AAGCCCTCAC |
CCGATTCCAG |
CTCTTGTGCG |
ACTGATACTA |
|
1361 |
TTACACCTGC |
TTCCTGGTCC |
CTGGAGGGCG |
TGTCCCTCCC |
CCAGGACAAA |
ACCTGGAGCT |
CTTCCAGCCC |
ACCAGCTCTT |
|
1441 |
AGGCAATAAT |
CTCATCTTCC |
GGGATCACGC |
CCCTGACAAG |
CCAGGAAAAG |
CCAGCTATGA |
CCTTGTACTC |
TCAAGTCCCT |
|
1521 |
GGGGCAGGGA |
AGAGGTTTTA |
TTTAAGTGAT |
TAAAAGCCCA |
GGGGAGCTTC |
CTTGGAACAA |
GGAGTGGGTT |
CACACCAAGG |
|
1601 |
GGAAGGCCAG |
TGGCCCTGGG |
GGAGGAGCAG |
GGACCCCTCT |
CTCTCTTACT |
CGCTTCCTGG |
GTTTAGAACT |
CAGGACCCCG |
|
1681 |
ATCTCAGTCT |
GGAGCTCCCT |
CCTGCACCCT |
GGCTGGCGGT |
GTGCTGGGTG |
ACAGGACTCT |
GGAGGGGTAC |
CCTGAGTGCA |
|
1761 |
GCTGTCGGAG |
GAGGCAGGGC |
GGTGGGGGGG |
GCAGCACAGA |
AGCCTCTAAG |
GCCCCAGGTG |
CAGTCCTGGA |
CCTCGTGGAG |
|
1841 |
CCGCATGGAG |
TGAGGAGAGG |
TGCGGATGCC |
CAGAAACAGA |
TGTGGGATGA |
GGGCACTGGG |
CAGCCACAGG |
GTCCATGTGG |
|
1921 |
AGGAGGACAG |
GTAGTCAAGG |
AGGGCTTCTG |
GAGGTGGTGT |
GGAGGGCCCA |
TCTGATGGCC |
AGAGGAGGCC |
AGGCAGAGCT |
|
2001 |
GCCAGTGCCA |
GCCTGGAGGT |
GGGGCCACCT |
TCGTGCAGGT |
GTCTGGGGGT |
GGAGAGCAGG |
TGTGATGGGG |
GCTGGGTACA |
|
2081 |
GTGGGCTGCC |
TCAGAGCACT |
TTGGGCAGGA |
GTGACAGGTA |
CCCGCCAGCA |
CCCTGAGCAG |
CCATGCTGGC |
CACCATCCTG |
|
2161 |
GAAGAGACCA |
GCGCAGGTGC |
AGAGGGAGGA |
CGGGAGACCC |
TTGGGGGCTT |
TGGAGCCTCC |
AGAATGGCTG |
AGGAGAGGAG |
|
2241 |
GGTTGGGCAC |
ACTGGCCCCC |
AGGTTGGGTG |
TGTGGGGCTG |
AGGTGGGTGG |
CGGGTGACCA |
CTTCTTAGGA |
CTGTGGCCTG |
|
2321 |
TGCAACCTGG |
CGGGGGGCAA |
TGGGTTGCCA |
TTCACTGACT |
TGGGGGAGCT |
GGGCCAGCAG |
TTTCCAGGTG |
ACAGGCAGGA |
|
2401 |
GTTTGGTTTT |
GGCTGTGGCG |
ACTCTGAGAT |
TCCCCAGGGG |
CCTCCAGGTG |
GATGTGCAGC |
ATTGGCAGCG |
TCGGCCGGCA |
|
2481 |
GGCGGGAGGG |
CCTCCCTGAT |
ATGCCCCGAC |
CCGTGGTTGA |
CAGGATCCCG |
CTGCACAAGT |
TCACGTCCAT |
CCGCCGGACC |
|
2561 |
ATGTCGGAGG |
TTGGGGGCTC |
TGTGGAGGAC |
CTGATTGCCA |
AAGGCCCCGT |
CTCAAAGTAC |
TCCCAGGCGG |
TGCCAGCCGT |
|
2641 |
GACCGAGGGG |
CCCATTCCCG |
AGGTGCTCAA |
GAACTACATG |
GACGTGAGTA |
TGAGGTCTTA |
GCCCTGCTGC |
AAGCGGAGCC |
|
2721 |
ACTGTCAGGA |
GAGCTCCGTG |
GCAGTATGGG |
GAACATTCCC |
ACCTCCTGTT |
CTCAGCAGCG |
CAGTTTGAGT |
GGCTCACCTG |
|
2801 |
GCCACGGTGG |
CGTCCCGGCT |
CAGCCACACT |
CCTTTCTTCC |
CCATTCTGGA |
ACCTCCTGCC |
ACTGGCCCCT |
GTCATTGCAG |
|
2881 |
GGTCTTGGTC |
CTGCTAGGGC |
CTGGGAGGGT |
GATTGGGAGT |
GGCCTCGGGC |
CTTGGTGCCA |
GCTCGCCTGA |
GGGGGCGGTC |
|
2961 |
CTTGGGCCTG |
CCTGAGTGTG |
GCCGGCTGAC |
CTGGAGCCTT |
TCACTGCTGT |
CCTGCTGGGA |
GGCCTGTGGT |
GACTCGTGGC |
|
3041 |
CTCCCCAGCC |
CCTCCCCATC |
TTCTTCCTCT |
CGCAACAGCC |
CCTCCTGTGC |
CCACTACTCC |
TTCAGGGGGA |
AGCAGGATCC |
|
3121 |
AAGGTGGAGC |
ACTCTGGAAG |
CCACCTAGCA |
AGCTGGGGCT |
TGGTCAGCCT |
GGTCCCAGCT |
CTATGGGGTC |
AGTTCGAGGC |
|
3201 |
CAGGGCCTAC |
TGTCCAGCCT |
CGGGGCTCTG |
GCCCACTGTG |
GGGGAGCCTT |
GCCCTCTGTC |
CTGCTTGGCC |
CGAGTCCTGG |
|
3281 |
CTGTGACAGG |
AAGCCCAAGA |
CTCACAGGCA |
TGTGACTGGG |
CCAGGGGGGC |
CCTGGGGGGA |
ACCAGGCGCC |
GTGTGCCTGG |
|
3361 |
TCCCGACTGG |
CCAGTTCCTG |
ATTGTCCTAG |
CGCGCGAGCA |
AGCAGATGCA |
CGCATGCGCA |
CACATGCACA |
CACACACATG |
|
3441 |
GAAATTTGCT |
GAGTGTCCCC |
CTGCCAGTGG |
TCACTTCTGT |
GGGGCCATGA |
GTCAGCAGCT |
GCTGCTGCTC |
CGTGAAGCCA |
|
3521 |
GGCGGTTGGA |
GAAACATTGG |
GCTGGGCTGG |
TGCCAGGAAT |
CTGGTGGCTG |
ACGGTGGCCC |
AGGCTCCAAT |
CCTGGGGGAA |
|
3601 |
GCGGGCGCCC |
AGGCGACCCC |
AAACTCCAGG |
ACCCTCTTAC |
TCTCTGCCTC |
CTGAGAGGCT |
CGGCGGCATG |
GGACCCCTTG |
|
3681 |
TACTTGCCTG |
ATCCCTGAGT |
CAGCACCCCC |
ACCTGCGCCT |
GCTATCCCTG |
TGTACACACG |
GGGAAACTGA |
GGCCCTGGGC |
|
3761 |
CATTAAGGCA |
GGGTTGCTGG |
GCAGGCAGTG |
ATTGTAAGGC |
TTACCTTCTC |
CCCTTGACCA |
GCTGAACCCC |
TCTGCCTGGA |
|
3841 |
AGCCCTCCAA |
GCCTGGGGTT |
CACCCTGGAG |
GGCAGGGCAG |
GCACTGAGAC |
ACCCAGAATC |
AGACCTTGAC |
ATGGCCCCAA |
|
3921 |
CCTGGGGAGG |
AAGTCACTTC |
CTTTCTCAGA |
CCTTGGACTG |
CGGGCCACCA |
GGGGAGTCTT |
CCAGGCCGGG |
CTTTCCTGCA |
|
4001 |
CCCGGGGCTC |
AGGCTGAGGG |
CCACGTCTGT |
CCCCACCCTG |
GCTTGACCTC |
TGGCCTTGTC |
TTTTCCGTGG |
GGAAGTCCTC |
|
4081 |
AGCCTCACCA |
TGTATTGAGC |
AGGGGTAGGT |
GACAGAAGCC |
AGGGGTCTAG |
AGACCCCAAG |
TACCCGTGGG |
CATTAGGACC |
|
4161 |
GAGGGCTGGG |
GACCTGGCCC |
ATCTTCCCTG |
CAGGCTGAGC |
AGGTGGGAGT |
GGGTGGCTGT |
TGGCAGCTGT |
GGGCCCCGTG |
|
4241 |
ATGCCCCTCC |
CTCCAGTATG |
GGCCTTGGCT |
CTGGGGACAG |
CCGGGCCTTC |
TGAGGCCCAG |
TGGGGAGATG |
GGGCCCCCTC |
|
4321 |
TCCCATCCCT |
GACGGACCCT |
GTCCCCTGCC |
AGGCCCAGTA |
CTACGGGGAG |
ATTGGCATCG |
GGACGCCCCC |
CCAGTGCTTC |
|
4401 |
ACAGTCGTCT |
TCGACACGGG |
CTCCTCCAAC |
CTGTGGGTCC |
CCTCCATCCA |
CTGCAAACTG |
CTGGACATCG |
CTTGCTGTGA |
|
4481 |
GTCACGAACC |
CTGGCCCCGT |
CGCCCAGGTC |
CTGCCCTTCC |
GGGATGTCGC |
TGCAGGGCTG |
CCTCAGGAAT |
CACTTGGGCA |
|
4561 |
ACGCAATTCT |
CCTGCCTCTT |
GGCCCCGTGA |
GCCAGGCCAG |
CCCTCCACCC |
TGCTCCAGCC |
ACTGACTTTC |
TGGGTGAACC |
|
4641 |
ACCAGCTGTG |
GTCTTGCTCT |
CAGCAGGGCT |
GGGGCTGGGG |
TGGCCACAGA |
GGGAGCCGGC |
TGTGGCTGGG |
AGGGAGGCCC |
|
4721 |
GGGGTCACAG |
CCCACAGTCC |
CGGGGCTCTG |
GCATGATGGT |
GGGCCTCAGA |
TCCCCTCCAA |
ATCCCACCCT |
GGGGAGGCAG |
|
4801 |
CTTGGGGGGT |
GCGGGCTCCA |
TGGTAGATTG |
TAGGGGGATG |
GGAGGGCTAA |
GGCCGTGGCG |
TGGGCTGCGG |
GATGACCGGC |
|
4881 |
GGGCCCCCTT |
GTCGCCCGGG |
GCAGGGATCC |
ACCACAAGTA |
CAACAGCGAC |
AAGTCCAGCA |
CCTACGTGAA |
GAATGGTACC |
|
4961 |
TCGTTTGACA |
TCCACTATGG |
CTCGGGCAGC |
CTCTCCGGGT |
ACCTGAGCCA |
GGACACTGTG |
TCGGTGAGTC |
CCTCTGGGGC |
|
5041 |
CTTTCCCAGG |
ACTCGAGGGT |
GCCAGGGTGT |
GGGGTTCACC |
CACCGTGGGT |
ATGTGGTTGA |
AAGGGAGGGC |
TCGGTGATCC |
|
5121 |
CAGCCCAACC |
CCAGCCCTCA |
GGTGGCCGGG |
GCAGTCAGCA |
GGGTGAGGAG |
GGGTCTGGGA |
ATGGGCCTGG |
TTGTGTGCAG |
|
5201 |
GCTGGAGGGA |
CAGCCTCAAA |
CCCAGGGGGT |
ACAGGGGCAG |
GGGTCCCCGG |
AGTCAGGCCA |
CAATGAGTGG |
GAGGGAGGAC |
|
5281 |
AGGGCAGATC |
GATCGGGCTC |
TTTTTGGCAC |
ATTGGGTTTG |
AGGTGCCAGC |
AGGTGTTGAG |
GGCTGGACCT |
GGGGCTGCAC |
|
5361 |
AGGGTCCCTG |
CAGCAGGCCG |
AGGGCTTGGG |
GAGGCTTGGG |
GTTGGGAGGA |
TGTACCAGGA |
ACCCGCTGTG |
GGGGCTGCTG |
|
5441 |
GCGGAGAGTC |
ACCGGCAGGA |
GCCTGGGAGG |
GCAAGGGAGG |
GGGAGCAGCA |
GTGGCTGTTG |
GGGGCCCCGC |
CTGCATCCAC |
|
5521 |
CATCCTCAGG |
GCCCTGTGCA |
GAGCAACTCC |
CTGCCCTAGG |
AGAGGGTGAG |
CTGGCCCTTG |
TCACCCTGCC |
TGGGCCTCAG |
|
5601 |
TGAGTTCTCA |
CCTCGGAGCT |
GTCTGCTGGG |
GGTGGACCAG |
GCCACAAGGG |
GTTCAGGAGT |
TACGGGATGG |
TGACACAGCC |
|
5681 |
CCCAGCTCTG |
GAGCGGGCCA |
GGAGGGCAGC |
AGAGCCCCCC |
TGCAGGCCCA |
GGGACCCGGG |
AAGAGGGCCC |
CTTCCTCCAG |
|
5761 |
CTGAAGCTGC |
TCCTAACAGT |
TCCCTGCTGG |
GCTGGAGTCC |
AGTCTGTACT |
GGGGGCTCCT |
CAGAGCCCTC |
CTGTCTGGAC |
|
5841 |
CCTGCTCAGC |
CTGAACAGGA |
AATTGCCCCG |
TCTGCCCTCT |
TCCGCCCTCT |
TCTGCCCCCT |
CCGCCCCGTG |
TCAGCCTCAA |
|
5921 |
GCTGTCGTTC |
CCTTCCCACA |
TCCTGCTCTG |
GCCGTGTTCT |
CTCTCTGCAG |
CCTCATCCAG |
GGCTGGGGGA |
GGGGACAGGC |
|
6001 |
AGAGAAGGGG |
AAGGGGGCAG |
TATGGCTGTG |
AGCTTGGGAT |
GGGGTCGGCA |
GGTTCCCCAT |
CTCTCGGGCT |
CCTGGCCCAG |
|
6081 |
GCTGTGTCTT |
GTTCCCAGCG |
CTGAGGGCAG |
GAGCAGGACC |
TGCCTGTCAT |
TGGTGGTGGG |
AGTAAGAGGT |
CGGAGCAGGG |
|
6161 |
AGCGGAGCAA |
GGGGCCTGCC |
CGTCTGTGCT |
CCTGCCTGGT |
TCCCATCTCG |
TGTAAACCGA |
GCCCTGATGA |
CTTCCACGAA |
|
6241 |
GAGGGCCCCG |
CCATACCCCG |
TGTCCCCATC |
CCCAGCGGTG |
TTCAGCTCAG |
GGTTTCCAGC |
CCTTTCTTCT |
GGGCCCTCTG |
|
6321 |
GGCCCCATCG |
TGTGTGGGAT |
GGGCATCCAT |
TGAACTGGGT |
TTTGTAGCCT |
CATGCTCAGG |
GAGTGGTGTA |
GGGCTCAGCC |
|
6401 |
TGTCTGCTGC |
CCACTGACTC |
TGCCCTGGCC |
TGCAGGTGCC |
CTGCCAGTCA |
GCGTCGTCAG |
CCTCTGCCCT |
GGGCGGTGTC |
|
6481 |
AAAGTGGAGA |
GGCAGGTCTT |
TGGGGAGGCC |
ACCAAGCAGC |
CAGGCATCAC |
CTTCATCGCA |
GCCAAGTTCG |
ATGGCATCCT |
|
6561 |
GGGCATGGCC |
TACCCCCGCA |
TCTCCGTCAA |
CAACGTGCTG |
CCCGTCTTCG |
ACAACCTGAT |
GCAGCAGAAG |
CTGGTGGACC |
|
6641 |
AGAACATCTT |
CTCCTTCTAC |
CTGAGCAGGT |
GGGCGTGTGG |
GTTCCCTCTC |
GCTCCGCGTT |
GCTGGGAGGC |
AGGGCGGGGC |
|
6721 |
TGGACGGGGA |
GCCTTCTAGG |
CACCCCCTCT |
CAGTGCTGCC |
CCCTCCCTGC |
TGCTGTGCCA |
GAGCTCCTGA |
CCTCTGACCT |
|
6801 |
CAGGGCATCC |
GGGAGGCGGG |
GGTTGGCCGC |
CCTTCTGCAG |
AGGAGTAAGC |
GGCAGCACAG |
AGAAAGCTGT |
TTGGCCGGGG |
|
6881 |
TCTCCCAGTG |
GGAGGGGCTC |
GGGCCAGGCT |
GTGGGTCCTG |
GGACCTTGGC |
AGCTGGGCCT |
CCCTCCTATG |
AGAATGGACC |
|
6961 |
CAGGTCAGGG |
GTCGTGGCAG |
TTCCTCTTGG |
TTCAGTCGGC |
CCGTCGGTCC |
CCAGACCTGG |
TGATGGGCAC |
ATGTGGTCCT |
|
7041 |
GGTCCCGGGG |
TTGCTGATGG |
TGGAGAGGGT |
CATGGTGCCC |
AGGGGACCGG |
GGATCCCCGG |
GGAGGTGACC |
TTGGGTGCGT |
|
7121 |
ATGGGTCCCC |
GGCACCACGC |
TGCGAGCAGC |
TCTGTGGGCG |
TCCCCAGCAG |
GTGCCGGCTG |
CCCTCGAGGA |
GGAACATAGG |
|
7201 |
GGGCCCTTCC |
TTCTGGGAAA |
AGGGTTCTCC |
CCCAGGGCCC |
CCACCTGCAG |
CCGCTGCTCC |
AGCTTGGGTG |
ACCCATGGTC |
|
7281 |
AGGTGACCTT |
GGGAAGGGGA |
CGTCGGACTC |
AGTTAGTTTC |
CTCTTCTCGG |
GGCAGACCTG |
AGTGGAGGGT |
TCCACATAAG |
|
7361 |
AGGGGTGACT |
GGGGGTTGGG |
GGCTTTTGTT |
TTGGGGGTGG |
GCAGCGTGTC |
AGCCGGCGGC |
CTCCTGCCCT |
GAGCTGCGCC |
|
7441 |
TGGTCACTGC |
CCCACACTCT |
CCAGGGCTGA |
CAGGGGCAGG |
TTTACCTCAC |
GTGGCTCACT |
TGGCACTTGG |
AGTCTCTGGG |
|
7521 |
CCTGACCCCT |
AACCTTGGAT |
CGCTTCCGGC |
AGAATTCCGG |
CTGTAGGGCT |
GGCTGGGCCT |
CTTTGCTCTG |
CCCCTTGCCT |
|
7601 |
GGGCAGTGAA |
TACTCCTTAG |
CAACACCCAG |
GGCTAAGCTG |
CTCACTAGAG |
GCCAGGACAC |
AGGTGAACCG |
GCTGGGCCAT |
|
7681 |
TTCCCCAGGA |
GCCTTGGGGC |
ACAGGGGAGG |
CAGCCAGGTG |
AAAAGGGGTC |
CCCTGAGGGC |
AAGAGGGCAT |
GCAGGCATGA |
|
7761 |
GTGCCCACAT |
GGGGAGGGGG |
CACACAGCCT |
GAGTACCCAG |
GTCAGGGAGG |
GGGACACAGG |
CATGAATGCC |
CATGTGGGGG |
|
7841 |
AGGGGCACAC |
AGCATGAGTG |
CCCATGTGGG |
GGAGGGGCAC |
ACGGTATGAG |
TACCCAGGTC |
GGGGGAGGCA |
TACACAGGCA |
|
7921 |
TGAGTGCCAG |
GTGGAAGGAG |
GGGCACACAG |
AGTGAGTGCC |
CAGGTGGGGG |
AGGTGCACAT |
GGTTGTGAGT |
GCCCAGGTGT |
|
8001 |
AGTCTCAAGG |
CAGCAGCTTG |
GGCAGGAAGT |
GGAGCCAGGC |
AGGAGGGAGG |
GTCTGAGGGC |
TTCTGAAAGC |
ATGTTTGGTG |
|
8081 |
AGGAGAGGAG |
GGGTGGGAGG |
CGCTGATCAG |
GTTTCTACAC |
TTGGGATTGC |
AGAGGTGTTG |
ACAAGAGGCA |
AAGGCGGAGG |
|
8161 |
AGGCTGGAGG |
AGGGCGGAGG |
CCCCAATCGG |
TGTTGGGAGG |
ACTGGGTCAG |
GCCTGGCACT |
GCCTCGAGTG |
ACAGGCAGTG |
|
8241 |
GGATGGTGGC |
CAGCTTAGCT |
GCAGACGCTC |
TGGCGGGATG |
GCAGAACCGC |
CCCAGACACA |
GAGAGCTTCT |
CTACCAGGAC |
|
8321 |
CGGCAGGATT |
TGCTGCGTTG |
AAAGCTGTAC |
TTGAGCAATG |
TTTAGAAACA |
AACCCGGGCG |
ACATGGGTTG |
CAGGTCCTAG |
|
8401 |
GAAGTGCAGT |
GCGCTCCTGC |
CCAGGAGCAC |
CTTGGCTGGC |
CATCAGTGGT |
CTGGATGAGG |
GGGAGATGAG |
CGGACGTGGC |
|
8481 |
TCGGGGATGC |
AGGTGGAGGG |
TGTTCCCAGG |
AGCAGCCAGT |
GCAGAGGCCC |
TGCGGCCAGA |
ACCAGCCATC |
CCAACTTCCC |
|
8561 |
AGATTGTGCC |
ATCACTCCCT |
CCTGGAAGCC |
TTCTTTGGTT |
TTTTCTTCCA |
GACAGACAGA |
CAGGGTCCTC |
CCCAGTGGAG |
|
8641 |
CTCCTGGCAC |
TCACTTCCTT |
GTCTGCCGCT |
AGCCTTGACC |
CTGATGTCTG |
GCTGAACCTT |
GGCTCCTGAG |
CAGACCAGGA |
|
8721 |
GAACCTGAGG |
GTTGAGGAAA |
ACCTCTTCTG |
GGCCAGGCTG |
GGCCCACGCA |
GAGCAGCCCT |
ACCCCGGGAA |
GAAGGGAGTC |
|
8801 |
TGTCCCTCCT |
CCTGCCTTCT |
GGGCCTTTCC |
ATCCCTGCAG |
TTTCAGAAAG |
GCCCCTCCTT |
CAGGAAGCTC |
TCCCTGATTG |
|
8881 |
CCACATTCAC |
CCTCTTACTC |
CTGACCTGGC |
ACTCTTGTGC |
TTGGGGGGTG |
GCGTGGCAGC |
TGACTCCTCC |
CTTTTCCTCC |
|
8961 |
TAGGGACCCA |
GATGCGCAGC |
CTGGGGGTGA |
GCTGATGCTG |
GGTGGCACAG |
ACTCCAAGTA |
TTACAAGGGT |
TCTCTGTCCT |
|
9041 |
ACCTGAATGT |
CACCCGCAAG |
GCCTACTGGC |
AGGTCCACCT |
GGACCAGTGA |
GTAGTGGCTG |
CAGTCGGCTC |
CCCTGGGTTC |
|
9121 |
TGTGGGCGGG |
GGCGGTGTGC |
GGAGACCCTG |
GAGGACCCCG |
GTTCTGCAGG |
TGGGGGTTGC |
ATGTGGGGAG |
TAGTGGGAGC |
|
9201 |
TGGGCAAGAA |
AGAGATGGGG |
TCAGACCAGC |
CCTCCATGCC |
CCTCCTTGCC |
CCTCCATGCT |
CCCCATCACC |
TCCATCCCCT |
|
9281 |
CTATTCCCTC |
TATCCCTCCA |
TCCCTCCATT |
TCCTCCATGC |
CTCTGTGACT |
CTCCATGACC |
CACCATCCCT |
TCTGTCCATC |
|
9361 |
CCTCCATGCC |
CTCCATCCCC |
TCCATCCCCT |
CATCCCTCTG |
TGACTCTTCA |
TGACCCTCCA |
TCTCCTCCAT |
CCCTCCATCC |
|
9441 |
CCTCCATCCC |
TCCATCCCTC |
CATCCCCTCC |
ATCCCTCCAT |
CCCTCCATCC |
CCACATCCCT |
CTGTGACTCT |
CCATGACCCT |
|
9521 |
CCATCCCCTC |
CATCTGTCTA |
TCCCTCCATC |
CCTCCATCCC |
TCCATGCCCC |
TCCATCCCTT |
CATCCCCTCC |
ATCCCCTCCA |
|
9601 |
TCCCTCTATG |
CCCCTCCATC |
CCCTCCATCC |
CTCCATGCCC |
CTCACCACCA |
CCTGAGGGTC |
TCCCACCCCC |
TCTACCACTC |
|
9681 |
TGTGTCTCCT |
CTCCCACCCT |
CTTCCCTGGA |
GGGCTTACAG |
CCGGCTGTGC |
TTCCAGGAGC |
CCTGAGGGGA |
GGAGAGTGCA |
|
9761 |
GCCCAGCCAG |
GGGAGGGGCT |
CCCAGGGAGG |
GGCACTGGGC |
CCCCAGGGCA |
CACTCCAGTC |
CCGGCAGGGG |
CTTCACGCCC |
|
9841 |
TGACTCCCCG |
CAGGGTGGAG |
GTGGCCAGCG |
GGCTGACCCT |
GTGCAAGGAG |
GGCTGTGAGG |
CCATTGTGGA |
CACAGGCACT |
|
9921 |
TCCCTCATGG |
TGGGCCCGGT |
GGATGAGGTG |
CGCGAGCTGC |
AGAAGGCCAT |
CGGGGCCGTG |
CCGCTGATTC |
AGGGCGAGGT |
|
10001 |
GAGCGCCGGG |
GGCTGGGGCT |
GGGGCTGGGG |
CTGGCAGGGG |
GAGCCCCAAG |
GCCACCACTA |
CCACCCTGAC |
ACTGCTGTGA |
|
10081 |
CCCCTCTTAG |
TACATGATCC |
CCTGTGAGAA |
GGTGTCCACC |
CTGCCCGCGA |
TCACACTGAA |
GCTGGGAGGC |
AAAGGCTACA |
|
10161 |
AGCTGTCCCC |
AGAGGACTAC |
ACGCTCAAGG |
TGAGCGGGCA |
ATGGGGTGCC |
GCACGCCCCA |
GGTGAGCGGG |
CGGTGAGGGG |
|
10241 |
GCGCACGCTC |
CAGGTGAGCG |
GGCAACAGGT |
GGGGGGGCGG |
GTGGTGCTAG |
GCCTGGGTAC |
TGACCACCAG |
GGCCGTCCCA |
|
10321 |
GGTGTCGCAG |
GCCGGGAAGA |
CCCTCTGCCT |
GAGCGGCTTC |
ATGGGCATGG |
ACATCCCGCC |
ACCCAGCGGG |
CCACTCTGGA |
|
10401 |
TCCTGGGCGA |
CGTCTTCATC |
GGCCGCTACT |
ACACTGTGTT |
TGACCGTGAC |
AACAACAGGG |
TGGGCTTCGC |
CGAGGCTGCC |
|
10481 |
CGCCTCTAGT |
TCCCAAGGCG |
TCCGCGCGCC |
AGCACAGAAA |
CAGAGGAGAG |
TCCCAGAGCA |
GGAGGCCCCT |
GGCCCAGCGG |
|
10561 |
CCCCTCCCAC |
ACACACCCAC |
ACACTCGCCC |
GCCCACTGTC |
CTGGGCGCCC |
TGGAAGCCGG |
CGGCCCAAGC |
CCGACTTGCT |
|
10641 |
GTTTTGTTCT |
GTGGTTTTCC |
CCTCCCTGGG |
TTCAGAAATG |
CTGCCTGCCT |
GTCTGTCTCT |
CCATCTGTTT |
GGTGGGGGTA |
|
10721 |
GAGCTGATCC |
AGAGCACAGA |
TCTGTTTCGT |
GCATTGGAAG |
ACCCCACCCA |
AGCTTGGCAG |
CCGAGCTCGT |
GTATCCTGGG |
|
10801 |
GCTCCCTTCA |
TCTCCAGGGA |
GTCCCCTCCC |
CGGCCCTACC |
AGCGCCCGCT |
GGGCTGAGCC |
CCTACCCCAC |
ACCAGGCCGT |
|
10881 |
CCTCCCGGGC |
CCTCCCTTGG |
AAACCTGCCC |
TGCCTGAGGG |
CCCCTCTGCC |
CAGCTTGGGC |
CCAGCTGGGC |
TCTGCCACCC |
|
10961 |
TACCTGTTCA |
GTGTCCCGGG |
CCCGTTGAGG |
ATGAGGCCGC |
TAGAGGCCTG |
AGGATGAGCT |
GGAAGGAGTG |
AGAGGGGACA |
|
11041 |
AAACCCACCT |
TGTTGGAGCC |
TGCAGGGTGG |
TGCTGGGACT |
GAGCCAGTCC |
CAGGGGCATG |
TATTGGCCTG |
GAGGTGGGGT |
|
11121 |
TGGGATTGGG |
GGCTGGTGCC |
AGCCTTCCTC |
TGCAGCTGAC |
CTCTGTTGTC |
CTCCCCTTGG |
GCGGCTGAGA |
GCCCCAGCTG |
|
11201 |
ACATGGAAAT |
ACAGTTGTTG |
GCCTCCGGCC |
TCCCCTCT |
|
|
|
|
|
|
|
|
|
|
|
|
>ref|Gene_ID:1509|CTSD|NC_000011.9:1773984...1785221 (-)
CGCACGCCGGCCGCGCCCACGTGACCGGTCCGGGTGCAAACACGCGGGTCAGCTGATCCGGCCCAACTGCGGCGTCATCC
CGGCTATAAGCGCACGGCCTCGGCGACCCTCTCCGACCCGGCCGCCGCCGCCATGCAGCCCTCCAGCCTTCTGCCGCTCG
CCCTCTGCCTGCTGGCTGCACCCGCCTCCGCGCTCGTCAGGTGAAGCCTCAGGGGCCGGGGCTCAGGGACGGGCAGGGGT
CGCGGCGCCGAGGTCCCGGGGCCTGTGGTGACTTTCGCGCTCCCCTGTGGCCCCCACGAGCCCCTTGCGCCCCCCGCGCT
GGAATGCACCTGTGCCGCCCTGCGCGGCCTCCTGCACGGACCACCCGCCTACGGGGCGCCGGGCTCCGGAGGTGCAGGGG
ACCCGGGGCAGAGGCGCCAGATGCCTCTCCCCCATATGCCACCCTGGGTTGTACCTTGAGGACTGCAGACTGACCGCAGC
CTCCCTGGAGACGGGGCGGGGCGGGGGGAGGTAGTGCTCATTCGGGGCAGGTGGAATTGGGGTCTGTACTGAGCGCCCTT
GTTGCTGGAGACCTAGGTCAGGCCTCAGAGCCCCCGAGTCTGGGCGAGTCCATTTCCTTAGGGACCCCTTTACCACCTGT
GAACTGGGGGCTTTAAAAGTTTGCTCCAGCGGCTCTTATCACAGGCCCTGGGCTGGGAGACCCCTCGAGACCCTAGGAGT
TCCCATGTCCCTGAGAGAGGAGGAGGCATGGGGAGTGGGTCGGCTCACCCACCCCGGGCCTGGGGTTGTGCTGTAGTGAG
GCCCACACGCTCCTCAGGCCGATCCCCTGTGCCAGGTGAGGCCACCGATTGGGCCTGGATGGGATGGGGCCCGGCCATGC
CTGACCAGCTGGGCAGAGGAGGGCCATGCTGCAGTCTGCTTTCTTGACCCCCTCCCCAGCCCTTGCAAGGCAGCCCGCAT
TCCCAGGAGGGGTATGCTGACCCATCCCATTGGGCACCTGCCCCACCCTTGCTCTGGGCCTTTGTGGGAGACCTGGGATC
TGCGATGGGTCCACTGCCTTTTGGCAGGTGGGTGAGGTCAGAAGGCTGCAGGGGCTGGAGCTGGCTGGGCCAGCTGGGTA
GGACTGAGCCTCACCAAAGGCTGTGGGGAATGGCCCGGGGGCGGGTAGCCCCAATTAAAGTCGTTGTGGGGGAGTAGCCA
CAAGCCTGAGCCTGCCTTGACCTTGCCAGCCTATCCACAGGCCTCCCCTCTCCAAGGAGGACAGACACAGCAGAGGGGAA
ACGATCCTGGGGCTTCTTGGAGGGAAGGGTAGCTGAATCCAAGCCCTCACCCGATTCCAGCTCTTGTGCGACTGATACTA
TTACACCTGCTTCCTGGTCCCTGGAGGGCGTGTCCCTCCCCCAGGACAAAACCTGGAGCTCTTCCAGCCCACCAGCTCTT
AGGCAATAATCTCATCTTCCGGGATCACGCCCCTGACAAGCCAGGAAAAGCCAGCTATGACCTTGTACTCTCAAGTCCCT
GGGGCAGGGAAGAGGTTTTATTTAAGTGATTAAAAGCCCAGGGGAGCTTCCTTGGAACAAGGAGTGGGTTCACACCAAGG
GGAAGGCCAGTGGCCCTGGGGGAGGAGCAGGGACCCCTCTCTCTCTTACTCGCTTCCTGGGTTTAGAACTCAGGACCCCG
ATCTCAGTCTGGAGCTCCCTCCTGCACCCTGGCTGGCGGTGTGCTGGGTGACAGGACTCTGGAGGGGTACCCTGAGTGCA
GCTGTCGGAGGAGGCAGGGCGGTGGGGGGGGCAGCACAGAAGCCTCTAAGGCCCCAGGTGCAGTCCTGGACCTCGTGGAG
CCGCATGGAGTGAGGAGAGGTGCGGATGCCCAGAAACAGATGTGGGATGAGGGCACTGGGCAGCCACAGGGTCCATGTGG
AGGAGGACAGGTAGTCAAGGAGGGCTTCTGGAGGTGGTGTGGAGGGCCCATCTGATGGCCAGAGGAGGCCAGGCAGAGCT
GCCAGTGCCAGCCTGGAGGTGGGGCCACCTTCGTGCAGGTGTCTGGGGGTGGAGAGCAGGTGTGATGGGGGCTGGGTACA
GTGGGCTGCCTCAGAGCACTTTGGGCAGGAGTGACAGGTACCCGCCAGCACCCTGAGCAGCCATGCTGGCCACCATCCTG
GAAGAGACCAGCGCAGGTGCAGAGGGAGGACGGGAGACCCTTGGGGGCTTTGGAGCCTCCAGAATGGCTGAGGAGAGGAG
GGTTGGGCACACTGGCCCCCAGGTTGGGTGTGTGGGGCTGAGGTGGGTGGCGGGTGACCACTTCTTAGGACTGTGGCCTG
TGCAACCTGGCGGGGGGCAATGGGTTGCCATTCACTGACTTGGGGGAGCTGGGCCAGCAGTTTCCAGGTGACAGGCAGGA
GTTTGGTTTTGGCTGTGGCGACTCTGAGATTCCCCAGGGGCCTCCAGGTGGATGTGCAGCATTGGCAGCGTCGGCCGGCA
GGCGGGAGGGCCTCCCTGATATGCCCCGACCCGTGGTTGACAGGATCCCGCTGCACAAGTTCACGTCCATCCGCCGGACC
ATGTCGGAGGTTGGGGGCTCTGTGGAGGACCTGATTGCCAAAGGCCCCGTCTCAAAGTACTCCCAGGCGGTGCCAGCCGT
GACCGAGGGGCCCATTCCCGAGGTGCTCAAGAACTACATGGACGTGAGTATGAGGTCTTAGCCCTGCTGCAAGCGGAGCC
ACTGTCAGGAGAGCTCCGTGGCAGTATGGGGAACATTCCCACCTCCTGTTCTCAGCAGCGCAGTTTGAGTGGCTCACCTG
GCCACGGTGGCGTCCCGGCTCAGCCACACTCCTTTCTTCCCCATTCTGGAACCTCCTGCCACTGGCCCCTGTCATTGCAG
GGTCTTGGTCCTGCTAGGGCCTGGGAGGGTGATTGGGAGTGGCCTCGGGCCTTGGTGCCAGCTCGCCTGAGGGGGCGGTC
CTTGGGCCTGCCTGAGTGTGGCCGGCTGACCTGGAGCCTTTCACTGCTGTCCTGCTGGGAGGCCTGTGGTGACTCGTGGC
CTCCCCAGCCCCTCCCCATCTTCTTCCTCTCGCAACAGCCCCTCCTGTGCCCACTACTCCTTCAGGGGGAAGCAGGATCC
AAGGTGGAGCACTCTGGAAGCCACCTAGCAAGCTGGGGCTTGGTCAGCCTGGTCCCAGCTCTATGGGGTCAGTTCGAGGC
CAGGGCCTACTGTCCAGCCTCGGGGCTCTGGCCCACTGTGGGGGAGCCTTGCCCTCTGTCCTGCTTGGCCCGAGTCCTGG
CTGTGACAGGAAGCCCAAGACTCACAGGCATGTGACTGGGCCAGGGGGGCCCTGGGGGGAACCAGGCGCCGTGTGCCTGG
TCCCGACTGGCCAGTTCCTGATTGTCCTAGCGCGCGAGCAAGCAGATGCACGCATGCGCACACATGCACACACACACATG
GAAATTTGCTGAGTGTCCCCCTGCCAGTGGTCACTTCTGTGGGGCCATGAGTCAGCAGCTGCTGCTGCTCCGTGAAGCCA
GGCGGTTGGAGAAACATTGGGCTGGGCTGGTGCCAGGAATCTGGTGGCTGACGGTGGCCCAGGCTCCAATCCTGGGGGAA
GCGGGCGCCCAGGCGACCCCAAACTCCAGGACCCTCTTACTCTCTGCCTCCTGAGAGGCTCGGCGGCATGGGACCCCTTG
TACTTGCCTGATCCCTGAGTCAGCACCCCCACCTGCGCCTGCTATCCCTGTGTACACACGGGGAAACTGAGGCCCTGGGC
CATTAAGGCAGGGTTGCTGGGCAGGCAGTGATTGTAAGGCTTACCTTCTCCCCTTGACCAGCTGAACCCCTCTGCCTGGA
AGCCCTCCAAGCCTGGGGTTCACCCTGGAGGGCAGGGCAGGCACTGAGACACCCAGAATCAGACCTTGACATGGCCCCAA
CCTGGGGAGGAAGTCACTTCCTTTCTCAGACCTTGGACTGCGGGCCACCAGGGGAGTCTTCCAGGCCGGGCTTTCCTGCA
CCCGGGGCTCAGGCTGAGGGCCACGTCTGTCCCCACCCTGGCTTGACCTCTGGCCTTGTCTTTTCCGTGGGGAAGTCCTC
AGCCTCACCATGTATTGAGCAGGGGTAGGTGACAGAAGCCAGGGGTCTAGAGACCCCAAGTACCCGTGGGCATTAGGACC
GAGGGCTGGGGACCTGGCCCATCTTCCCTGCAGGCTGAGCAGGTGGGAGTGGGTGGCTGTTGGCAGCTGTGGGCCCCGTG
ATGCCCCTCCCTCCAGTATGGGCCTTGGCTCTGGGGACAGCCGGGCCTTCTGAGGCCCAGTGGGGAGATGGGGCCCCCTC
TCCCATCCCTGACGGACCCTGTCCCCTGCCAGGCCCAGTACTACGGGGAGATTGGCATCGGGACGCCCCCCCAGTGCTTC
ACAGTCGTCTTCGACACGGGCTCCTCCAACCTGTGGGTCCCCTCCATCCACTGCAAACTGCTGGACATCGCTTGCTGTGA
GTCACGAACCCTGGCCCCGTCGCCCAGGTCCTGCCCTTCCGGGATGTCGCTGCAGGGCTGCCTCAGGAATCACTTGGGCA
ACGCAATTCTCCTGCCTCTTGGCCCCGTGAGCCAGGCCAGCCCTCCACCCTGCTCCAGCCACTGACTTTCTGGGTGAACC
ACCAGCTGTGGTCTTGCTCTCAGCAGGGCTGGGGCTGGGGTGGCCACAGAGGGAGCCGGCTGTGGCTGGGAGGGAGGCCC
GGGGTCACAGCCCACAGTCCCGGGGCTCTGGCATGATGGTGGGCCTCAGATCCCCTCCAAATCCCACCCTGGGGAGGCAG
CTTGGGGGGTGCGGGCTCCATGGTAGATTGTAGGGGGATGGGAGGGCTAAGGCCGTGGCGTGGGCTGCGGGATGACCGGC
GGGCCCCCTTGTCGCCCGGGGCAGGGATCCACCACAAGTACAACAGCGACAAGTCCAGCACCTACGTGAAGAATGGTACC
TCGTTTGACATCCACTATGGCTCGGGCAGCCTCTCCGGGTACCTGAGCCAGGACACTGTGTCGGTGAGTCCCTCTGGGGC
CTTTCCCAGGACTCGAGGGTGCCAGGGTGTGGGGTTCACCCACCGTGGGTATGTGGTTGAAAGGGAGGGCTCGGTGATCC
CAGCCCAACCCCAGCCCTCAGGTGGCCGGGGCAGTCAGCAGGGTGAGGAGGGGTCTGGGAATGGGCCTGGTTGTGTGCAG
GCTGGAGGGACAGCCTCAAACCCAGGGGGTACAGGGGCAGGGGTCCCCGGAGTCAGGCCACAATGAGTGGGAGGGAGGAC
AGGGCAGATCGATCGGGCTCTTTTTGGCACATTGGGTTTGAGGTGCCAGCAGGTGTTGAGGGCTGGACCTGGGGCTGCAC
AGGGTCCCTGCAGCAGGCCGAGGGCTTGGGGAGGCTTGGGGTTGGGAGGATGTACCAGGAACCCGCTGTGGGGGCTGCTG
GCGGAGAGTCACCGGCAGGAGCCTGGGAGGGCAAGGGAGGGGGAGCAGCAGTGGCTGTTGGGGGCCCCGCCTGCATCCAC
CATCCTCAGGGCCCTGTGCAGAGCAACTCCCTGCCCTAGGAGAGGGTGAGCTGGCCCTTGTCACCCTGCCTGGGCCTCAG
TGAGTTCTCACCTCGGAGCTGTCTGCTGGGGGTGGACCAGGCCACAAGGGGTTCAGGAGTTACGGGATGGTGACACAGCC
CCCAGCTCTGGAGCGGGCCAGGAGGGCAGCAGAGCCCCCCTGCAGGCCCAGGGACCCGGGAAGAGGGCCCCTTCCTCCAG
CTGAAGCTGCTCCTAACAGTTCCCTGCTGGGCTGGAGTCCAGTCTGTACTGGGGGCTCCTCAGAGCCCTCCTGTCTGGAC
CCTGCTCAGCCTGAACAGGAAATTGCCCCGTCTGCCCTCTTCCGCCCTCTTCTGCCCCCTCCGCCCCGTGTCAGCCTCAA
GCTGTCGTTCCCTTCCCACATCCTGCTCTGGCCGTGTTCTCTCTCTGCAGCCTCATCCAGGGCTGGGGGAGGGGACAGGC
AGAGAAGGGGAAGGGGGCAGTATGGCTGTGAGCTTGGGATGGGGTCGGCAGGTTCCCCATCTCTCGGGCTCCTGGCCCAG
GCTGTGTCTTGTTCCCAGCGCTGAGGGCAGGAGCAGGACCTGCCTGTCATTGGTGGTGGGAGTAAGAGGTCGGAGCAGGG
AGCGGAGCAAGGGGCCTGCCCGTCTGTGCTCCTGCCTGGTTCCCATCTCGTGTAAACCGAGCCCTGATGACTTCCACGAA
GAGGGCCCCGCCATACCCCGTGTCCCCATCCCCAGCGGTGTTCAGCTCAGGGTTTCCAGCCCTTTCTTCTGGGCCCTCTG
GGCCCCATCGTGTGTGGGATGGGCATCCATTGAACTGGGTTTTGTAGCCTCATGCTCAGGGAGTGGTGTAGGGCTCAGCC
TGTCTGCTGCCCACTGACTCTGCCCTGGCCTGCAGGTGCCCTGCCAGTCAGCGTCGTCAGCCTCTGCCCTGGGCGGTGTC
AAAGTGGAGAGGCAGGTCTTTGGGGAGGCCACCAAGCAGCCAGGCATCACCTTCATCGCAGCCAAGTTCGATGGCATCCT
GGGCATGGCCTACCCCCGCATCTCCGTCAACAACGTGCTGCCCGTCTTCGACAACCTGATGCAGCAGAAGCTGGTGGACC
AGAACATCTTCTCCTTCTACCTGAGCAGGTGGGCGTGTGGGTTCCCTCTCGCTCCGCGTTGCTGGGAGGCAGGGCGGGGC
TGGACGGGGAGCCTTCTAGGCACCCCCTCTCAGTGCTGCCCCCTCCCTGCTGCTGTGCCAGAGCTCCTGACCTCTGACCT
CAGGGCATCCGGGAGGCGGGGGTTGGCCGCCCTTCTGCAGAGGAGTAAGCGGCAGCACAGAGAAAGCTGTTTGGCCGGGG
TCTCCCAGTGGGAGGGGCTCGGGCCAGGCTGTGGGTCCTGGGACCTTGGCAGCTGGGCCTCCCTCCTATGAGAATGGACC
CAGGTCAGGGGTCGTGGCAGTTCCTCTTGGTTCAGTCGGCCCGTCGGTCCCCAGACCTGGTGATGGGCACATGTGGTCCT
GGTCCCGGGGTTGCTGATGGTGGAGAGGGTCATGGTGCCCAGGGGACCGGGGATCCCCGGGGAGGTGACCTTGGGTGCGT
ATGGGTCCCCGGCACCACGCTGCGAGCAGCTCTGTGGGCGTCCCCAGCAGGTGCCGGCTGCCCTCGAGGAGGAACATAGG
GGGCCCTTCCTTCTGGGAAAAGGGTTCTCCCCCAGGGCCCCCACCTGCAGCCGCTGCTCCAGCTTGGGTGACCCATGGTC
AGGTGACCTTGGGAAGGGGACGTCGGACTCAGTTAGTTTCCTCTTCTCGGGGCAGACCTGAGTGGAGGGTTCCACATAAG
AGGGGTGACTGGGGGTTGGGGGCTTTTGTTTTGGGGGTGGGCAGCGTGTCAGCCGGCGGCCTCCTGCCCTGAGCTGCGCC
TGGTCACTGCCCCACACTCTCCAGGGCTGACAGGGGCAGGTTTACCTCACGTGGCTCACTTGGCACTTGGAGTCTCTGGG
CCTGACCCCTAACCTTGGATCGCTTCCGGCAGAATTCCGGCTGTAGGGCTGGCTGGGCCTCTTTGCTCTGCCCCTTGCCT
GGGCAGTGAATACTCCTTAGCAACACCCAGGGCTAAGCTGCTCACTAGAGGCCAGGACACAGGTGAACCGGCTGGGCCAT
TTCCCCAGGAGCCTTGGGGCACAGGGGAGGCAGCCAGGTGAAAAGGGGTCCCCTGAGGGCAAGAGGGCATGCAGGCATGA
GTGCCCACATGGGGAGGGGGCACACAGCCTGAGTACCCAGGTCAGGGAGGGGGACACAGGCATGAATGCCCATGTGGGGG
AGGGGCACACAGCATGAGTGCCCATGTGGGGGAGGGGCACACGGTATGAGTACCCAGGTCGGGGGAGGCATACACAGGCA
TGAGTGCCAGGTGGAAGGAGGGGCACACAGAGTGAGTGCCCAGGTGGGGGAGGTGCACATGGTTGTGAGTGCCCAGGTGT
AGTCTCAAGGCAGCAGCTTGGGCAGGAAGTGGAGCCAGGCAGGAGGGAGGGTCTGAGGGCTTCTGAAAGCATGTTTGGTG
AGGAGAGGAGGGGTGGGAGGCGCTGATCAGGTTTCTACACTTGGGATTGCAGAGGTGTTGACAAGAGGCAAAGGCGGAGG
AGGCTGGAGGAGGGCGGAGGCCCCAATCGGTGTTGGGAGGACTGGGTCAGGCCTGGCACTGCCTCGAGTGACAGGCAGTG
GGATGGTGGCCAGCTTAGCTGCAGACGCTCTGGCGGGATGGCAGAACCGCCCCAGACACAGAGAGCTTCTCTACCAGGAC
CGGCAGGATTTGCTGCGTTGAAAGCTGTACTTGAGCAATGTTTAGAAACAAACCCGGGCGACATGGGTTGCAGGTCCTAG
GAAGTGCAGTGCGCTCCTGCCCAGGAGCACCTTGGCTGGCCATCAGTGGTCTGGATGAGGGGGAGATGAGCGGACGTGGC
TCGGGGATGCAGGTGGAGGGTGTTCCCAGGAGCAGCCAGTGCAGAGGCCCTGCGGCCAGAACCAGCCATCCCAACTTCCC
AGATTGTGCCATCACTCCCTCCTGGAAGCCTTCTTTGGTTTTTTCTTCCAGACAGACAGACAGGGTCCTCCCCAGTGGAG
CTCCTGGCACTCACTTCCTTGTCTGCCGCTAGCCTTGACCCTGATGTCTGGCTGAACCTTGGCTCCTGAGCAGACCAGGA
GAACCTGAGGGTTGAGGAAAACCTCTTCTGGGCCAGGCTGGGCCCACGCAGAGCAGCCCTACCCCGGGAAGAAGGGAGTC
TGTCCCTCCTCCTGCCTTCTGGGCCTTTCCATCCCTGCAGTTTCAGAAAGGCCCCTCCTTCAGGAAGCTCTCCCTGATTG
CCACATTCACCCTCTTACTCCTGACCTGGCACTCTTGTGCTTGGGGGGTGGCGTGGCAGCTGACTCCTCCCTTTTCCTCC
TAGGGACCCAGATGCGCAGCCTGGGGGTGAGCTGATGCTGGGTGGCACAGACTCCAAGTATTACAAGGGTTCTCTGTCCT
ACCTGAATGTCACCCGCAAGGCCTACTGGCAGGTCCACCTGGACCAGTGAGTAGTGGCTGCAGTCGGCTCCCCTGGGTTC
TGTGGGCGGGGGCGGTGTGCGGAGACCCTGGAGGACCCCGGTTCTGCAGGTGGGGGTTGCATGTGGGGAGTAGTGGGAGC
TGGGCAAGAAAGAGATGGGGTCAGACCAGCCCTCCATGCCCCTCCTTGCCCCTCCATGCTCCCCATCACCTCCATCCCCT
CTATTCCCTCTATCCCTCCATCCCTCCATTTCCTCCATGCCTCTGTGACTCTCCATGACCCACCATCCCTTCTGTCCATC
CCTCCATGCCCTCCATCCCCTCCATCCCCTCATCCCTCTGTGACTCTTCATGACCCTCCATCTCCTCCATCCCTCCATCC
CCTCCATCCCTCCATCCCTCCATCCCCTCCATCCCTCCATCCCTCCATCCCCACATCCCTCTGTGACTCTCCATGACCCT
CCATCCCCTCCATCTGTCTATCCCTCCATCCCTCCATCCCTCCATGCCCCTCCATCCCTTCATCCCCTCCATCCCCTCCA
TCCCTCTATGCCCCTCCATCCCCTCCATCCCTCCATGCCCCTCACCACCACCTGAGGGTCTCCCACCCCCTCTACCACTC
TGTGTCTCCTCTCCCACCCTCTTCCCTGGAGGGCTTACAGCCGGCTGTGCTTCCAGGAGCCCTGAGGGGAGGAGAGTGCA
GCCCAGCCAGGGGAGGGGCTCCCAGGGAGGGGCACTGGGCCCCCAGGGCACACTCCAGTCCCGGCAGGGGCTTCACGCCC
TGACTCCCCGCAGGGTGGAGGTGGCCAGCGGGCTGACCCTGTGCAAGGAGGGCTGTGAGGCCATTGTGGACACAGGCACT
TCCCTCATGGTGGGCCCGGTGGATGAGGTGCGCGAGCTGCAGAAGGCCATCGGGGCCGTGCCGCTGATTCAGGGCGAGGT
GAGCGCCGGGGGCTGGGGCTGGGGCTGGGGCTGGCAGGGGGAGCCCCAAGGCCACCACTACCACCCTGACACTGCTGTGA
CCCCTCTTAGTACATGATCCCCTGTGAGAAGGTGTCCACCCTGCCCGCGATCACACTGAAGCTGGGAGGCAAAGGCTACA
AGCTGTCCCCAGAGGACTACACGCTCAAGGTGAGCGGGCAATGGGGTGCCGCACGCCCCAGGTGAGCGGGCGGTGAGGGG
GCGCACGCTCCAGGTGAGCGGGCAACAGGTGGGGGGGCGGGTGGTGCTAGGCCTGGGTACTGACCACCAGGGCCGTCCCA
GGTGTCGCAGGCCGGGAAGACCCTCTGCCTGAGCGGCTTCATGGGCATGGACATCCCGCCACCCAGCGGGCCACTCTGGA
TCCTGGGCGACGTCTTCATCGGCCGCTACTACACTGTGTTTGACCGTGACAACAACAGGGTGGGCTTCGCCGAGGCTGCC
CGCCTCTAGTTCCCAAGGCGTCCGCGCGCCAGCACAGAAACAGAGGAGAGTCCCAGAGCAGGAGGCCCCTGGCCCAGCGG
CCCCTCCCACACACACCCACACACTCGCCCGCCCACTGTCCTGGGCGCCCTGGAAGCCGGCGGCCCAAGCCCGACTTGCT
GTTTTGTTCTGTGGTTTTCCCCTCCCTGGGTTCAGAAATGCTGCCTGCCTGTCTGTCTCTCCATCTGTTTGGTGGGGGTA
GAGCTGATCCAGAGCACAGATCTGTTTCGTGCATTGGAAGACCCCACCCAAGCTTGGCAGCCGAGCTCGTGTATCCTGGG
GCTCCCTTCATCTCCAGGGAGTCCCCTCCCCGGCCCTACCAGCGCCCGCTGGGCTGAGCCCCTACCCCACACCAGGCCGT
CCTCCCGGGCCCTCCCTTGGAAACCTGCCCTGCCTGAGGGCCCCTCTGCCCAGCTTGGGCCCAGCTGGGCTCTGCCACCC
TACCTGTTCAGTGTCCCGGGCCCGTTGAGGATGAGGCCGCTAGAGGCCTGAGGATGAGCTGGAAGGAGTGAGAGGGGACA
AAACCCACCTTGTTGGAGCCTGCAGGGTGGTGCTGGGACTGAGCCAGTCCCAGGGGCATGTATTGGCCTGGAGGTGGGGT
TGGGATTGGGGGCTGGTGCCAGCCTTCCTCTGCAGCTGACCTCTGTTGTCCTCCCCTTGGGCGGCTGAGAGCCCCAGCTG
ACATGGAAATACAGTTGTTGGCCTCCGGCCTCCCCTCT
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Primary source |
RefSeq : NM_001909.3 (GI:23110949)
|
Name |
Cathepsin D (CTSD)
|
Type of transcript |
mRNA
|
Organism |
Homo sapiens
|
Length |
2205 nt
|
Map |
11p15.5
|
Location |
Chromosome 11 (NC_000011.9) strand : -
1773984...1774899 | 1775032...1775130 | 1775223...1775367 | 1776135...1776257 |
1778553...1778785 | 1780198...1780316 | 1780745...1780868 | 1782538...1782697 |
1785021...1785221 |
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon(s) |
Start |
End |
Length |
is coding |
Exon 1
|
1
|
201
|
201
|
1
|
Exon 2
|
202
|
361
|
160
|
1
|
Exon 3
|
362
|
485
|
124
|
1
|
Exon 4
|
486
|
604
|
119
|
1
|
Exon 5
|
605
|
837
|
233
|
1
|
Exon 6
|
838
|
960
|
123
|
1
|
Exon 7
|
961
|
1105
|
145
|
1
|
Exon 8
|
1106
|
1204
|
99
|
1
|
Exon 9
|
1205
|
2120
|
916
|
1
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Primary source |
NCBI : CCDS7725
|
Nucleotide |
CTSD, mRNA isoform 1[NM_001909.3] : 134...1372
|
Length |
1239
|
Location |
Chromosome 11 (NC_000011.9) strand : -
1774732...1774899 | 1775032...1775130 | 1775223...1775367 | 1776135...1776257 |
1778553...1778785 | 1780198...1780316 | 1780745...1780868 | 1782538...1782697 |
1785021...1785088 |
|
Start codon |
1
|
Translation |
NP_001900.1 |
|
|
|
|
|
|
|
|
|
|
|
|
1 |
GCGCACGCCG |
GCCGCGCCCA |
CGTGACCGGT |
CCGGGTGCAA |
ACACGCGGGT |
CAGCTGATCC |
GGCCCAACTG |
CGGCGTCATC |
|
81 |
CCGGCTATAA |
GCGCACGGCC |
TCGGCGACCC |
TCTCCGACCC |
GGCCGCCGCC |
GCCATGCAGC |
CCTCCAGCCT |
TCTGCCGCTC |
|
161 |
GCCCTCTGCC |
TGCTGGCTGC |
ACCCGCCTCC |
GCGCTCGTCA |
GGATCCCGCT |
GCACAAGTTC |
ACGTCCATCC |
GCCGGACCAT |
|
241 |
GTCGGAGGTT |
GGGGGCTCTG |
TGGAGGACCT |
GATTGCCAAA |
GGCCCCGTCT |
CAAAGTACTC |
CCAGGCGGTG |
CCAGCCGTGA |
|
321 |
CCGAGGGGCC |
CATTCCCGAG |
GTGCTCAAGA |
ACTACATGGA |
CGCCCAGTAC |
TACGGGGAGA |
TTGGCATCGG |
GACGCCCCCC |
|
401 |
CAGTGCTTCA |
CAGTCGTCTT |
CGACACGGGC |
TCCTCCAACC |
TGTGGGTCCC |
CTCCATCCAC |
TGCAAACTGC |
TGGACATCGC |
|
481 |
TTGCTGGATC |
CACCACAAGT |
ACAACAGCGA |
CAAGTCCAGC |
ACCTACGTGA |
AGAATGGTAC |
CTCGTTTGAC |
ATCCACTATG |
|
561 |
GCTCGGGCAG |
CCTCTCCGGG |
TACCTGAGCC |
AGGACACTGT |
GTCGGTGCCC |
TGCCAGTCAG |
CGTCGTCAGC |
CTCTGCCCTG |
|
641 |
GGCGGTGTCA |
AAGTGGAGAG |
GCAGGTCTTT |
GGGGAGGCCA |
CCAAGCAGCC |
AGGCATCACC |
TTCATCGCAG |
CCAAGTTCGA |
|
721 |
TGGCATCCTG |
GGCATGGCCT |
ACCCCCGCAT |
CTCCGTCAAC |
AACGTGCTGC |
CCGTCTTCGA |
CAACCTGATG |
CAGCAGAAGC |
|
801 |
TGGTGGACCA |
GAACATCTTC |
TCCTTCTACC |
TGAGCAGGGA |
CCCAGATGCG |
CAGCCTGGGG |
GTGAGCTGAT |
GCTGGGTGGC |
|
881 |
ACAGACTCCA |
AGTATTACAA |
GGGTTCTCTG |
TCCTACCTGA |
ATGTCACCCG |
CAAGGCCTAC |
TGGCAGGTCC |
ACCTGGACCA |
|
961 |
GGTGGAGGTG |
GCCAGCGGGC |
TGACCCTGTG |
CAAGGAGGGC |
TGTGAGGCCA |
TTGTGGACAC |
AGGCACTTCC |
CTCATGGTGG |
|
1041 |
GCCCGGTGGA |
TGAGGTGCGC |
GAGCTGCAGA |
AGGCCATCGG |
GGCCGTGCCG |
CTGATTCAGG |
GCGAGTACAT |
GATCCCCTGT |
|
1121 |
GAGAAGGTGT |
CCACCCTGCC |
CGCGATCACA |
CTGAAGCTGG |
GAGGCAAAGG |
CTACAAGCTG |
TCCCCAGAGG |
ACTACACGCT |
|
1201 |
CAAGGTGTCG |
CAGGCCGGGA |
AGACCCTCTG |
CCTGAGCGGC |
TTCATGGGCA |
TGGACATCCC |
GCCACCCAGC |
GGGCCACTCT |
|
1281 |
GGATCCTGGG |
CGACGTCTTC |
ATCGGCCGCT |
ACTACACTGT |
GTTTGACCGT |
GACAACAACA |
GGGTGGGCTT |
CGCCGAGGCT |
|
1361 |
GCCCGCCTCT |
AGTTCCCAAG |
GCGTCCGCGC |
GCCAGCACAG |
AAACAGAGGA |
GAGTCCCAGA |
GCAGGAGGCC |
CCTGGCCCAG |
|
1441 |
CGGCCCCTCC |
CACACACACC |
CACACACTCG |
CCCGCCCACT |
GTCCTGGGCG |
CCCTGGAAGC |
CGGCGGCCCA |
AGCCCGACTT |
|
1521 |
GCTGTTTTGT |
TCTGTGGTTT |
TCCCCTCCCT |
GGGTTCAGAA |
ATGCTGCCTG |
CCTGTCTGTC |
TCTCCATCTG |
TTTGGTGGGG |
|
1601 |
GTAGAGCTGA |
TCCAGAGCAC |
AGATCTGTTT |
CGTGCATTGG |
AAGACCCCAC |
CCAAGCTTGG |
CAGCCGAGCT |
CGTGTATCCT |
|
1681 |
GGGGCTCCCT |
TCATCTCCAG |
GGAGTCCCCT |
CCCCGGCCCT |
ACCAGCGCCC |
GCTGGGCTGA |
GCCCCTACCC |
CACACCAGGC |
|
1761 |
CGTCCTCCCG |
GGCCCTCCCT |
TGGAAACCTG |
CCCTGCCTGA |
GGGCCCCTCT |
GCCCAGCTTG |
GGCCCAGCTG |
GGCTCTGCCA |
|
1841 |
CCCTACCTGT |
TCAGTGTCCC |
GGGCCCGTTG |
AGGATGAGGC |
CGCTAGAGGC |
CTGAGGATGA |
GCTGGAAGGA |
GTGAGAGGGG |
|
1921 |
ACAAAACCCA |
CCTTGTTGGA |
GCCTGCAGGG |
TGGTGCTGGG |
ACTGAGCCAG |
TCCCAGGGGC |
ATGTATTGGC |
CTGGAGGTGG |
|
2001 |
GGTTGGGATT |
GGGGGCTGGT |
GCCAGCCTTC |
CTCTGCAGCT |
GACCTCTGTT |
GTCCTCCCCT |
TGGGCGGCTG |
AGAGCCCCAG |
|
2081 |
CTGACATGGA |
AATACAGTTG |
TTGGCCTCCG |
GCCTCCCCTC |
AAAAAAAAAA |
AAAAAAAAAA |
AAAAAAAAAA |
AAAAAAAAAA |
|
2161 |
AAAAAAAAAA |
AAAAAAAAAA |
AAAAAAAAAA |
AAAAAAAAAA |
AAAAA |
|
|
|
|
|
|
|
|
|
|
|
|
>gi|23110949|ref|NM_001909.3|Cathepsin D (CTSD)
GCGCACGCCGGCCGCGCCCACGTGACCGGTCCGGGTGCAAACACGCGGGTCAGCTGATCCGGCCCAACTGCGGCGTCATC
CCGGCTATAAGCGCACGGCCTCGGCGACCCTCTCCGACCCGGCCGCCGCCGCCATGCAGCCCTCCAGCCTTCTGCCGCTC
GCCCTCTGCCTGCTGGCTGCACCCGCCTCCGCGCTCGTCAGGATCCCGCTGCACAAGTTCACGTCCATCCGCCGGACCAT
GTCGGAGGTTGGGGGCTCTGTGGAGGACCTGATTGCCAAAGGCCCCGTCTCAAAGTACTCCCAGGCGGTGCCAGCCGTGA
CCGAGGGGCCCATTCCCGAGGTGCTCAAGAACTACATGGACGCCCAGTACTACGGGGAGATTGGCATCGGGACGCCCCCC
CAGTGCTTCACAGTCGTCTTCGACACGGGCTCCTCCAACCTGTGGGTCCCCTCCATCCACTGCAAACTGCTGGACATCGC
TTGCTGGATCCACCACAAGTACAACAGCGACAAGTCCAGCACCTACGTGAAGAATGGTACCTCGTTTGACATCCACTATG
GCTCGGGCAGCCTCTCCGGGTACCTGAGCCAGGACACTGTGTCGGTGCCCTGCCAGTCAGCGTCGTCAGCCTCTGCCCTG
GGCGGTGTCAAAGTGGAGAGGCAGGTCTTTGGGGAGGCCACCAAGCAGCCAGGCATCACCTTCATCGCAGCCAAGTTCGA
TGGCATCCTGGGCATGGCCTACCCCCGCATCTCCGTCAACAACGTGCTGCCCGTCTTCGACAACCTGATGCAGCAGAAGC
TGGTGGACCAGAACATCTTCTCCTTCTACCTGAGCAGGGACCCAGATGCGCAGCCTGGGGGTGAGCTGATGCTGGGTGGC
ACAGACTCCAAGTATTACAAGGGTTCTCTGTCCTACCTGAATGTCACCCGCAAGGCCTACTGGCAGGTCCACCTGGACCA
GGTGGAGGTGGCCAGCGGGCTGACCCTGTGCAAGGAGGGCTGTGAGGCCATTGTGGACACAGGCACTTCCCTCATGGTGG
GCCCGGTGGATGAGGTGCGCGAGCTGCAGAAGGCCATCGGGGCCGTGCCGCTGATTCAGGGCGAGTACATGATCCCCTGT
GAGAAGGTGTCCACCCTGCCCGCGATCACACTGAAGCTGGGAGGCAAAGGCTACAAGCTGTCCCCAGAGGACTACACGCT
CAAGGTGTCGCAGGCCGGGAAGACCCTCTGCCTGAGCGGCTTCATGGGCATGGACATCCCGCCACCCAGCGGGCCACTCT
GGATCCTGGGCGACGTCTTCATCGGCCGCTACTACACTGTGTTTGACCGTGACAACAACAGGGTGGGCTTCGCCGAGGCT
GCCCGCCTCTAGTTCCCAAGGCGTCCGCGCGCCAGCACAGAAACAGAGGAGAGTCCCAGAGCAGGAGGCCCCTGGCCCAG
CGGCCCCTCCCACACACACCCACACACTCGCCCGCCCACTGTCCTGGGCGCCCTGGAAGCCGGCGGCCCAAGCCCGACTT
GCTGTTTTGTTCTGTGGTTTTCCCCTCCCTGGGTTCAGAAATGCTGCCTGCCTGTCTGTCTCTCCATCTGTTTGGTGGGG
GTAGAGCTGATCCAGAGCACAGATCTGTTTCGTGCATTGGAAGACCCCACCCAAGCTTGGCAGCCGAGCTCGTGTATCCT
GGGGCTCCCTTCATCTCCAGGGAGTCCCCTCCCCGGCCCTACCAGCGCCCGCTGGGCTGAGCCCCTACCCCACACCAGGC
CGTCCTCCCGGGCCCTCCCTTGGAAACCTGCCCTGCCTGAGGGCCCCTCTGCCCAGCTTGGGCCCAGCTGGGCTCTGCCA
CCCTACCTGTTCAGTGTCCCGGGCCCGTTGAGGATGAGGCCGCTAGAGGCCTGAGGATGAGCTGGAAGGAGTGAGAGGGG
ACAAAACCCACCTTGTTGGAGCCTGCAGGGTGGTGCTGGGACTGAGCCAGTCCCAGGGGCATGTATTGGCCTGGAGGTGG
GGTTGGGATTGGGGGCTGGTGCCAGCCTTCCTCTGCAGCTGACCTCTGTTGTCCTCCCCTTGGGCGGCTGAGAGCCCCAG
CTGACATGGAAATACAGTTGTTGGCCTCCGGCCTCCCCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
1
|
Length |
201 nt
|
Location |
Chromosome 11 (NC_000011.9) : 1785021...1785221 (-)
|
Is part of |
CTSD, mRNA isoform 1
(NM_001909.3)
|
Sequence |
Show
|
|
GCGCACGCCGGCCGCGCCCACGTGACCGGTCCGGGTGCAAACACGCGGGTCAGCTGATCCGGCCCAACTGCGGCGTCATC
CCGGCTATAAGCGCACGGCCTCGGCGACCCTCTCCGACCCGGCCGCCGCCGCCATGCAGCCCTCCAGCCTTCTGCCGCTC
GCCCTCTGCCTGCTGGCTGCACCCGCCTCCGCGCTCGTCAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
2
|
Length |
160 nt
|
Location |
Chromosome 11 (NC_000011.9) : 1782538...1782697 (-)
|
Is part of |
CTSD, mRNA isoform 1
(NM_001909.3)
|
Sequence |
Show
|
|
GATCCCGCTGCACAAGTTCACGTCCATCCGCCGGACCATGTCGGAGGTTGGGGGCTCTGTGGAGGACCTGATTGCCAAAG
GCCCCGTCTCAAAGTACTCCCAGGCGGTGCCAGCCGTGACCGAGGGGCCCATTCCCGAGGTGCTCAAGAACTACATGGAC
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
3
|
Length |
124 nt
|
Location |
Chromosome 11 (NC_000011.9) : 1780745...1780868 (-)
|
Is part of |
CTSD, mRNA isoform 1
(NM_001909.3)
|
Sequence |
Show
|
|
GCCCAGTACTACGGGGAGATTGGCATCGGGACGCCCCCCCAGTGCTTCACAGTCGTCTTCGACACGGGCTCCTCCAACCT
GTGGGTCCCCTCCATCCACTGCAAACTGCTGGACATCGCTTGCT
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
4
|
Length |
119 nt
|
Location |
Chromosome 11 (NC_000011.9) : 1780198...1780316 (-)
|
Is part of |
CTSD, mRNA isoform 1
(NM_001909.3)
|
Sequence |
Show
|
|
GGATCCACCACAAGTACAACAGCGACAAGTCCAGCACCTACGTGAAGAATGGTACCTCGTTTGACATCCACTATGGCTCG
GGCAGCCTCTCCGGGTACCTGAGCCAGGACACTGTGTCG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
5
|
Length |
233 nt
|
Location |
Chromosome 11 (NC_000011.9) : 1778553...1778785 (-)
|
Is part of |
CTSD, mRNA isoform 1
(NM_001909.3)
|
Sequence |
Show
|
|
GTGCCCTGCCAGTCAGCGTCGTCAGCCTCTGCCCTGGGCGGTGTCAAAGTGGAGAGGCAGGTCTTTGGGGAGGCCACCAA
GCAGCCAGGCATCACCTTCATCGCAGCCAAGTTCGATGGCATCCTGGGCATGGCCTACCCCCGCATCTCCGTCAACAACG
TGCTGCCCGTCTTCGACAACCTGATGCAGCAGAAGCTGGTGGACCAGAACATCTTCTCCTTCTACCTGAGCAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
6
|
Length |
123 nt
|
Location |
Chromosome 11 (NC_000011.9) : 1776135...1776257 (-)
|
Is part of |
CTSD, mRNA isoform 1
(NM_001909.3)
|
Sequence |
Show
|
|
GGACCCAGATGCGCAGCCTGGGGGTGAGCTGATGCTGGGTGGCACAGACTCCAAGTATTACAAGGGTTCTCTGTCCTACC
TGAATGTCACCCGCAAGGCCTACTGGCAGGTCCACCTGGACCA
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
7
|
Length |
145 nt
|
Location |
Chromosome 11 (NC_000011.9) : 1775223...1775367 (-)
|
Is part of |
CTSD, mRNA isoform 1
(NM_001909.3)
|
Sequence |
Show
|
|
GGTGGAGGTGGCCAGCGGGCTGACCCTGTGCAAGGAGGGCTGTGAGGCCATTGTGGACACAGGCACTTCCCTCATGGTGG
GCCCGGTGGATGAGGTGCGCGAGCTGCAGAAGGCCATCGGGGCCGTGCCGCTGATTCAGGGCGAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
8
|
Length |
99 nt
|
Location |
Chromosome 11 (NC_000011.9) : 1775032...1775130 (-)
|
Is part of |
CTSD, mRNA isoform 1
(NM_001909.3)
|
Sequence |
Show
|
|
TACATGATCCCCTGTGAGAAGGTGTCCACCCTGCCCGCGATCACACTGAAGCTGGGAGGCAAAGGCTACAAGCTGTCCCC
AGAGGACTACACGCTCAAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
9
|
Length |
916 nt
|
Location |
Chromosome 11 (NC_000011.9) : 1773984...1774899 (-)
|
Is part of |
CTSD, mRNA isoform 1
(NM_001909.3)
|
Sequence |
Show
|
|
GTGTCGCAGGCCGGGAAGACCCTCTGCCTGAGCGGCTTCATGGGCATGGACATCCCGCCACCCAGCGGGCCACTCTGGAT
CCTGGGCGACGTCTTCATCGGCCGCTACTACACTGTGTTTGACCGTGACAACAACAGGGTGGGCTTCGCCGAGGCTGCCC
GCCTCTAGTTCCCAAGGCGTCCGCGCGCCAGCACAGAAACAGAGGAGAGTCCCAGAGCAGGAGGCCCCTGGCCCAGCGGC
CCCTCCCACACACACCCACACACTCGCCCGCCCACTGTCCTGGGCGCCCTGGAAGCCGGCGGCCCAAGCCCGACTTGCTG
TTTTGTTCTGTGGTTTTCCCCTCCCTGGGTTCAGAAATGCTGCCTGCCTGTCTGTCTCTCCATCTGTTTGGTGGGGGTAG
AGCTGATCCAGAGCACAGATCTGTTTCGTGCATTGGAAGACCCCACCCAAGCTTGGCAGCCGAGCTCGTGTATCCTGGGG
CTCCCTTCATCTCCAGGGAGTCCCCTCCCCGGCCCTACCAGCGCCCGCTGGGCTGAGCCCCTACCCCACACCAGGCCGTC
CTCCCGGGCCCTCCCTTGGAAACCTGCCCTGCCTGAGGGCCCCTCTGCCCAGCTTGGGCCCAGCTGGGCTCTGCCACCCT
ACCTGTTCAGTGTCCCGGGCCCGTTGAGGATGAGGCCGCTAGAGGCCTGAGGATGAGCTGGAAGGAGTGAGAGGGGACAA
AACCCACCTTGTTGGAGCCTGCAGGGTGGTGCTGGGACTGAGCCAGTCCCAGGGGCATGTATTGGCCTGGAGGTGGGGTT
GGGATTGGGGGCTGGTGCCAGCCTTCCTCTGCAGCTGACCTCTGTTGTCCTCCCCTTGGGCGGCTGAGAGCCCCAGCTGA
CATGGAAATACAGTTGTTGGCCTCCGGCCTCCCCTC
|
|
|
|
|
|
|
|
|
|
|
|
|
Primary source |
Uniprot : P07339
|
Name |
Cathepsin D
|
Alternative name(s) |
|
Synonym(s) |
|
Organism |
Homo sapiens
|
Length |
412 aa
|
Protein existence |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
General annotation (Comments)
|
top
|
|
|
|
|
|
|
Catalytic activity
|
Specificity similar to, but narrower than, that of pepsin A. Does not cleave the 4-Gln-|-His-5 bond in B chain of insulin.
|
Disease
|
Defects in CTSD are the cause of neuronal ceroid lipofuscinosis type 10 (CLN10) [MIM:610127]; also known as neuronal ceroid lipofuscinosis due to cathepsin D deficiency. A form of neuronal ceroid lipofuscinosis with onset at birth or early childhood. Neuronal ceroid lipofuscinoses are progressive neurodegenerative, lysosomal storage diseases characterized by intracellular accumulation of autofluorescent liposomal material, and clinically by seizures, dementia, visual loss, and/or cerebral atrophy.
|
Function
|
Acid protease active in intracellular protein breakdown. Involved in the pathogenesis of several diseases such as breast cancer and possibly Alzheimer disease.
|
Polymorphism
|
The Val-58 allele is significantly overrepresented in demented patients (11.8%) compared with non-demented controls (4.9%). Carriers of the Val-58 allele have a 3.1-fold increased risk for developing AD than non-carriers.
|
Similarity
|
Belongs to the peptidase A1 family.
|
Subcellular location
|
Lysosome. Melanosome. Note=Identified by mass spectrometry in melanosome fractions from stage I to stage IV.
|
Subunit
|
Consists of a light chain and a heavy chain.
|
|
|
|
|
|
|
|
|
|
|
|
|
Biological process
|
proteolysis [GO:0006508]
cell death [GO:0008219]
|
Cellular component
|
extracellular region [GO:0005576]
lysosome [GO:0005764]
melanosome [GO:0042470]
|
Molecular function
|
aspartic-type endopeptidase activity [GO:0004190]
|
|
|
|
|
|
|
|
|
|
|
|
|
With
|
Uniprot accession
|
IntAct
|
|
|
|
|
|
|
|
Alternative product(s)
|
top
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Feature key |
Position
|
Length
|
Description
|
Feature identifier
|
Molecule processing |
|
|
|
|
Propeptide
|
19 - 64
|
46
|
Activation peptide
|
PRO_0000025949
|
Signal
|
1 - 18
|
18
|
|
P07339-SIGNAL-1
|
Sites |
|
|
|
|
Active site
|
97 - 97
|
1
|
|
P07339-ACT_SITE-97
|
Active site
|
295 - 295
|
1
|
|
P07339-ACT_SITE-295
|
Natural variations |
|
|
|
|
Natural variant site
|
58 - 58
|
1
|
A -> V (associated with increased risk in AD; possibly influences secretion and intracellular maturation; dbSNP:rs17571)
|
VAR_011621
|
Natural variant site
|
229 - 229
|
1
|
F -> I (in CLN10)
|
VAR_029362
|
Natural variant site
|
282 - 282
|
1
|
G -> R
|
VAR_058490
|
Natural variant site
|
383 - 383
|
1
|
W -> C (in CLN10)
|
VAR_029363
|
Amino acid modifications |
|
|
|
|
Disulfide bond
|
91 - 160
|
70
|
|
P07339-DISULFID-91
|
Disulfide bond
|
110 - 117
|
8
|
|
P07339-DISULFID-110
|
Disulfide bond
|
286 - 290
|
5
|
|
P07339-DISULFID-286
|
Disulfide bond
|
329 - 366
|
38
|
|
P07339-DISULFID-329
|
Glycosylation
|
134 - 134
|
1
|
N-linked (GlcNAc...)
|
P07339-CARBOHYD-134
|
Glycosylation
|
263 - 263
|
1
|
N-linked (GlcNAc...)
|
P07339-CARBOHYD-263
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
MQPSSLLPLA |
LCLLAAPASA |
LVRIPLHKFT |
SIRRTMSEVG |
GSVEDLIAKG |
PVSKYSQAVP |
AVTEGPIPEV |
LKNYMDAQYY |
|
81 |
GEIGIGTPPQ |
CFTVVFDTGS |
SNLWVPSIHC |
KLLDIACWIH |
HKYNSDKSST |
YVKNGTSFDI |
HYGSGSLSGY |
LSQDTVSVPC |
|
161 |
QSASSASALG |
GVKVERQVFG |
EATKQPGITF |
IAAKFDGILG |
MAYPRISVNN |
VLPVFDNLMQ |
QKLVDQNIFS |
FYLSRDPDAQ |
|
241 |
PGGELMLGGT |
DSKYYKGSLS |
YLNVTRKAYW |
QVHLDQVEVA |
SGLTLCKEGC |
EAIVDTGTSL |
MVGPVDEVRE |
LQKAIGAVPL |
|
321 |
IQGEYMIPCE |
KVSTLPAITL |
KLGGKGYKLS |
PEDYTLKVSQ |
AGKTLCLSGF |
MGMDIPPPSG |
PLWILGDVFI |
GRYYTVFDRD |
|
401 |
NNRVGFAEAA |
RL |
|
|
|
|
|
|
|
|
|
|
|
|
>sp|P07339|CATD_human Cathepsin D
MQPSSLLPLALCLLAAPASALVRIPLHKFTSIRRTMSEVGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY
GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPC
QSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQ
PGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPL
IQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPLWILGDVFIGRYYTVFDRD
NNRVGFAEAARL
|
|
|
| |
|
|
|
|
|
|