(CTSL1) cathepsin L1 [Homo sapiens]
|
Gene | Transcript(s) | Exon(s) | Protein(s)
|
Accession | 5180
Official symbol | CTSL1
Official name | cathepsin L1
Gene type | gene with protein product
Organism | Homo sapiens
Location | Chromosome 9 (NC_000009.11) : 90340973...90346383 (+)
Map | 9q21.33
Length | 5411 nt
|
NM_001912.4 | CTSL1, mRNA isoform 1
NM_145918.2 | CTSL1, mRNA isoform 2
|
Accession | Name | Organism | Length
P07711 | Cathepsin L1 | Homo sapiens | 333 aa
|
Synonyms |
MEP; CATL; FLJ31037; CTSL
|
Alternative name(s) |
major excreted protein; cathepsin L
|
Summary |
The protein encoded by this gene is a lysosomal cysteine proteinase that plays a major role in intracellular protein catabolism. Its substrates include collagen and elastin, as well as alpha-1 protease inhibitor, a major controlling element of neutrophil elastase activity. The encoded protein has been implicated in several pathologic processes, including myofibril necrosis in myopathies and in myocardial ischemia, and in the renal tubular response to proteinuria. This protein, which is a member of the peptidase C1 family, is a dimer composed of disulfide-linked heavy and light chains, both produced from a single protein precursor. At least two transcript variants encoding the same protein have been found for this gene. [provided by RefSeq].
|
>ref|Gene_ID:1514|CTSL1|NC_000009.11:90340973...90346383 (+)
GGGCGGTGCCGGCCGAACCCAGACCCGAGGTTTTAGAAGCAGAGTCAGGCGAAGCTGGGCCAGAACCGCGACCTCCGCAA
CCTTGAGCGGCATCCGTGGAGTGCGCCTGCGCAGCTACGACCGCAGCAGGAAAGCGCCGCCGGCCAGGCCCAGCTGTGGC
CGGACAGGGACTGGAAGAGAGGACGCGGTCGAGTAGGTGTGCACCAGCCCTGGCAACGAGAGCGTCTACCCCGAACTCTG
CTGGCCTTGAGGTGGGGAAGCCGGGGAGGGCAGTTGAGGACCCCGCGGAGGCGCGTGACTGGTTGAGCGGGCAGGCCAGC
CTCCGAGCCGGGTGGACACAGGTACCGCAGCCAGGCCGCGCCGCGCCGACTCAGGGCCTGGCCCGGCCAGACAGGGAAGC
TCAGTCCCCGCACGCCAGACAGCGGTACTCCTGCTGGCGTCACCGCAAACATCCTCTGACCGCTACAGCCAGTGTGTGGC
GCAGGCGTCATGTCCCCGGCCCTGCCACGCCTGGAGCCCTGGAAGCTGGCTGCAGGGCGCTGGCTTCCCGCGTGCGGCCA
TATGACCCCGTCCCTGATTTAGGGGAGCAGTTTGGGGTGTCGGCAGCACAGGCCCAAGTGAATGAAGGAGGGAGCAGTGC
GTGCTCTCCTTCCCAGTTTTTCCTGGGAAAGCATTTCAGAAAGGTTTCATTTAAGGAGAGGTTGGGGCGGCGCGGTGGCT
CACTCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGGATCACCTGAGGTCAGTAGTTCGAGACCAGCCTGGCCAA
CATGGTGAAACCCCGTCTCTACTGAAAATACAAAATTAGACGGGCGAGGCGGCGCACGCCTGTAGTTCCAGCTATTCAAG
AGGCTGAGGAAGAATGGCTTGAACCCGGGAGGCAGAGGTTGCTGTGAGCCGATATCGCGCCGTTGAACTCCAGCCTGGGC
CACAGAGCAAGACTCCATCTCAAAAAATAAATAAATAAATAAATAAATAAATAAATAAATAAATAGGAGAGATTGGAAAA
CTTATCTCAGCTTTTGGTGTTTGTTAGTCAGGAAGATGTGTGAAGGCCTCCTAACTCTTGGGGATCTCTTTGTCCCTACT
TGGGAATCCCACCTTATCATTAGTGAGGTTTTGCCTGGGCACGAAACCTGGATTTTTTGCGATTGGTACAAAACCTGGAT
CAACCGTTTCCCGGTTTCCTAGTTGTTGCCTTAAGCTTCTCACACACAAGGTAGTTTCATACCGTTCTCATAACCTAAAT
TGTCATCGCATAAACTGTTTCAGCTCCTACAGCTCTGGACAGGCTGCTTTTCATTTTGGTGAGTCCATCCAGTACCTCCA
CGTGCCCTGTTTTTCTCCAGGCACATCCTTGGCCTCTTCCACAGTCCTTGGGTAAATGCTTGGGAGAATAATTTAAATAT
TTTTATTCTACCATGGTGGCCCTAATTTTTCAGGGGGCAGTAAGATGGCTTTTTAGGATTGGTCTAATCAGATCCTCATT
TTTGTTCCCTTCCTAGGTTTTAAAACATGAATCCTACACTCATCCTTGCTGCCTTTTGCCTGGGAATTGCCTCAGCTACT
CTAACATTTGATCACAGTTTAGAGGCACAGTGGACCAAGTGGAAGGCGATGCACAACAGATTATACGGCATGGTTAGTGA
AACTTCCCCAGAAAGAATAGTCCTGGCTGTTGAGAAGTTTTAGTCAGAGAGTAGCTTCTAGAGGCCAGCTTTTACCAATA
GCCTAATGTAATAACCTAATGGCGTGGATTATGAGCACAATGTGGACATTCATCCTTGTTGTGTCTCAGTTTGGAGAACA
GCATCCCCAGAGGTGTCAAGCCTTCCCTTGCCATGGTTTCTCTTCCATCTCTGTCTGCAGATTCACTTGGTGAGGATGAG
TTGGGTTTTAGGTAGAAGTAAAGAGCATCAGTTACATGTTTGCCTCTAGAATGAAGAAGGATGGAGGAGAGCAGTGTGGG
AGAAGAACATGAAGATGATTGAACTGCACAATCAGGAATACAGGGAAGGGAAACACAGCTTCACAATGGCCATGAACGCC
TTTGGAGACATGGTAAGTGTGCTGTGGACTGCCGAGCTCTGTGCTTCCTCTCTTCGGTTCTTTACTAAGGTAATCTCTTG
CTTTTCAACATTTTATTTCCTTTTCCTTGAAGACCAGTGAAGAATTCAGGCAGGTGATGAATGGCTTTCAAAACCGTAAG
CCCAGGAAGGGGAAAGTGTTCCAGGAACCTCTGTTTTATGAGGCCCCCAGATCTGTGGATTGGAGAGAGAAAGGCTACGT
GACTCCTGTGAAGAATCAGGTGAGACAGTGTCAGGTTCAGACCTCCCATCTTCCCAGGAAAGCCAAGAAGTGATTGACAT
CTTTGTTTTGGTAGACTTTAAAGTGATGTACAGTTCACTTTTTAACAGTATTCAGATGTGTGAGCTGTTGTCAAAGTCTT
ATTATTTTTTTTTTGTGGATGACAGCTTTTTTTAATTCCCTTTTCAGGGTCAGTGTGGTTCTTGTTGGGCTTTTAGTGCT
ACTGGTGCTCTTGAAGGACAGATGTTCCGGAAAACTGGGAGGCTTATCTCACTGAGTGAGCAGAATCTGGTAGACTGCTC
TGGGCCTCAAGGCAATGAAGGCTGCAATGGTGGCCTAATGGATTATGCTTTCCAGTATGTTCAGGATAATGGAGGCCTGG
ACTCTGAGGAATCCTATCCATATGAGGCAACAGTAAGTGGAGCTCCTTGTCATCATTCCAGCTCAGCTTTTGGAAGGTGG
ACACTTTAAGAGATAACAGATACCTTTTTCGGAATTCATATATTAGGGCTGGGTGGGGTGGCTCACGCCTGTAATCCCAG
CCCTTTAGGAGGCTGAGCCAGGCAGATGGCTTAAGCTCAGGAGTTTGAGACCAGCCTGGGCAACAAGGTGAAACCTTGAC
TCTATCAAAAATACAACGAAAATTAGCCGGCTGTGGTGGTGTGCGTCTGTGGTCCCAGCTATGCTGAGGTAGGGCGATTG
CTTGAGTGTGGGAGGTGGTTGTGCAAATGCACTCTCCAGTCTGAGTGACAGAATGAGACCCTGTCTCCAAAAAAAAAAAA
AAGTTCACATATTAGATGGCAGAATATATATCTGCAAATTGTCAGTACTGTCATTATAAATTATTAGCCTTTGCACAATC
CTGTGATTAGATGGGTAACATATAGCTGTTCTTGCTTCTATGTAACGTGAAGAATATACAAGCCATTCTAAATATACACA
TTTCTCTATTGTGAAAAATGTCTTATGGAAGTAAACCCAGAGGTCTCATTGAAGAGACTGGAGTCAATTTTCAGTTTCAG
ATTAACTAATAGCATTTCACTTCCTGGGATGATTTATAACAGTGGTGACTTGGAAATACTAAGCTGCACACTGCTGAGTG
TTGTGGTTCTAACTCATGTTCTCCAGGAGGAGGACAGCAGTAATCAAGTCTTTTTTTTCCCTTTACCTTTGAAAGGAAGA
ATCCTGTAAGTACAATCCCAAGTATTCTGTTGCTAATGACACCGGCTTTGTGGACATCCCTAAGCAGGAGAAGGCCCTGA
TGAAGGCAGTTGCAACTGTGGGGCCCATTTCTGTTGCTATTGATGCAGGTCATGAGTCCTTCCTGTTCTATAAAGAAGGT
AAGCATATTTTTCTTTGTAGAAATTGATGCAGAAAAAGAGTATCATGACATACGAGCATGATAGACTCTGGTTTGCAAGT
AAAGTTCAGTGTAGATATTTAGGTGCTTAGCATCATAGTTGTTCATTCTGTACATGCAATATTTATACCTGATGTTTCCA
TTTGACATTATGTTGCGTATATCTCCATGAATATAAGTACTTCATAACTTTTTATAAATAGCTATATTTACTGTGTAGTT
TAATATACTTAAAATATTCTGCAATTGTTGGATTTTTATAATTTTTCCAAAGATTTTTACTTCTATTATTAAATCCGAAA
AGAGCCTTTTGATTCAAGGATTTTAGTGTTCTGTGCTATTTTTTTAGATTGATATCCTAGAGTAGAATTACAAGCTTACA
AAGAGGGGTCTTTTTTTCCTCTAGGAAAGCTTTTTTTTAATGAAGGATTTATGCTGATTTCTATTTCTGCTAGTTGTAAA
TGAGAGCTTAAATCCCTGGCCTTCTTATTGTTGTTAAGTACTATCTGGAATGTGTGTCACTTCACTGAATTCAGAGATGC
CTCATTCCCTGTGGGTGACAGGATGGGTTACTGTCATGTGTCCTCTGGAGCTTCTCACCCCAGCCCTCATTTTACCATCC
CAGGCATTTATTTTGAGCCAGACTGTAGCAGTGAAGACATGGATCATGGTGTGCTGGTGGTTGGCTACGGATTTGAAAGC
ACAGAATCAGATAACAATAAATATTGGCTGGTGAAGAACAGGTATAAATTGCCAGAAATACTTACATTTGAAATTCAAAA
GAGAATACTTATTTGCAAACAGTGTTTGGATACAGTTCCACATGCCCTTAGGCATACTTCTGAAATCCCCAGAAGTCTAA
GTTGAACATATAATATTAATGTTTGATTATAAAAGTAAGAAACTGTCACAAATATCTTGGGAGTATGAATATAGTATTTG
TTCCATTGTATAAAGGTGTATCTAAATCTGAACAGTTTTTAAATTCTGAGGCATATCTGGCTGCAAAGACATCCTATTGT
TTTCCTGTCTCTGATCTGTAGTTCTGTGAAAGGTCACAGTTTGGAGGCTGGGATGGAAATTATGTCCAGGTTATGTCAAG
AATTTTTTAAGGATAAACAGGAACATTGCCTGTCCTGATTCTTTCTGTTAAGTCTATGGCCTCTGGCACAGTGTTTAAGT
TGGGTGTTCCTGTTAATAACCCTTTGGGCTTTATTCTTTCTCTGAAATGGAAAACCTGCTCTTTTTTCAGCTGGGGTGAA
GAATGGGGCATGGGTGGCTACGTAAAGATGGCCAAAGACCGGAGAAACCATTGTGGAATTGCCTCAGCAGCCAGCTACCC
CACTGTGTGAGCTGGTGGACGGTGATGAGGAAGGACTTGACTGGGGATGGCGCATGCATGGGAGGAATTCATCTTCAGTC
TACCAGCCCCCGCTGTGTCGGATACACACTCGAATCATTGAAGATCCGAGTGTGATTTGAATTCTGTGATATTTTCACAC
TGGTAAATGTTACCTCTATTTTAATTACTGCTATAAATAGGTTTATATTATTGATTCACTTACTGACTTTGCATTTTCGT
TTTTAAAAGGATGTATAAATTTTTACCTGTTTAAATAAAATTTAATTTCAAATGTAGTGGTGGGGCTTCTTTCTATTTTT
GATGCACTGAATTTTTGTGTAATAAAGAACATAATTGGGCTCTAAGCCATA
|
Primary source |
RefSeq : NM_001912.4 (GI:209364548)
|
Name |
Cathepsin L1 (CTSL1), transcript variant 1
|
Type of transcript |
mRNA
|
Organism |
Homo sapiens
|
Length |
1730 nt
|
Map |
9q21-q22
|
Location |
Chromosome 9 (NC_000009.11) strand : +
90340973...90341312 | 90342508...90342643 | 90342941...90343063 | 90343164...90343310 |
90343499...90343723 | 90344487...90344649 | 90345295...90345412 | 90345922...90346383 |
|
Exon(s) | Start | End | Length | is coding
Exon 1b | 1 | 340 | 340 | 1
Exon 2 | 341 | 476 | 136 | 1
Exon 3 | 477 | 599 | 123 | 1
Exon 4 | 600 | 746 | 147 | 1
Exon 5 | 747 | 971 | 225 | 1
Exon 6 | 972 | 1134 | 163 | 1
Exon 7 | 1135 | 1252 | 118 | 1
Exon 8 | 1253 | 1714 | 462 | 1
|
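The Start, End and Length columns of the exon table follow from the genomic exon coordinates given in the Location field. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates on the + strand; the function name `transcript_coordinates` is illustrative and not part of the database:

```python
# Genomic exon coordinates (1-based, inclusive) copied from the Location
# field of NM_001912.4 on NC_000009.11, + strand.
EXONS_GENOMIC = [
    (90340973, 90341312),
    (90342508, 90342643),
    (90342941, 90343063),
    (90343164, 90343310),
    (90343499, 90343723),
    (90344487, 90344649),
    (90345295, 90345412),
    (90345922, 90346383),
]


def transcript_coordinates(exons):
    """Return (start, end, length) in transcript coordinates for each exon."""
    rows = []
    offset = 0  # number of transcript bases contributed by earlier exons
    for g_start, g_end in exons:
        length = g_end - g_start + 1
        rows.append((offset + 1, offset + length, length))
        offset += length
    return rows


rows = transcript_coordinates(EXONS_GENOMIC)
for start, end, length in rows:
    print(start, end, length)
```

The exon lengths sum to 1714 nt, which is shorter than the listed transcript length of 1730 nt; the remaining 16 nt correspond to the poly(A) run at the end of the transcript sequence.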
|
ID | Class | Location | Mutation | Length | is synonymous | Source
|
Primary source |
NCBI : CCDS6675
|
Nucleotide |
CTSL1, mRNA isoform 1[NM_001912.4] : 351...1352
|
Length |
1002
|
Location |
Chromosome 9 (NC_000009.11) strand : +
90342518...90342643 | 90342941...90343063 | 90343164...90343310 | 90343499...90343723 |
90344487...90344649 | 90345295...90345412 | 90345922...90346021 |
|
Start codon |
1
|
Translation |
NP_001903.1 |
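The CDS position on the transcript (351...1352) and the protein length can be cross-checked against the genomic coordinates. The following is a sketch under the assumption that all coordinates are 1-based and inclusive; `genomic_to_transcript` is a hypothetical helper written for this illustration:

```python
# Exons of NM_001912.4 on NC_000009.11 (+ strand), 1-based inclusive.
EXONS = [
    (90340973, 90341312),
    (90342508, 90342643),
    (90342941, 90343063),
    (90343164, 90343310),
    (90343499, 90343723),
    (90344487, 90344649),
    (90345295, 90345412),
    (90345922, 90346383),
]
# First and last genomic positions of the CDS (from the CDS Location field).
CDS_GENOMIC = (90342518, 90346021)


def genomic_to_transcript(pos, exons):
    """Map a genomic position inside an exon to its transcript position."""
    offset = 0
    for g_start, g_end in exons:
        if g_start <= pos <= g_end:
            return offset + (pos - g_start) + 1
        offset += g_end - g_start + 1
    raise ValueError(f"position {pos} is not inside any exon")


cds_start = genomic_to_transcript(CDS_GENOMIC[0], EXONS)
cds_end = genomic_to_transcript(CDS_GENOMIC[1], EXONS)
cds_length = cds_end - cds_start + 1
protein_length = cds_length // 3 - 1  # the final codon is the stop codon

print(cds_start, cds_end, cds_length, protein_length)  # 351 1352 1002 333
```

The result agrees with the 1002 nt CDS length listed here and with the 333 aa length of P07711.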
|
>gi|209364548|ref|NM_001912.4|Cathepsin L1 (CTSL1), transcript variant 1
GGCGGTGCCGGCCGAACCCAGACCCGAGGTTTTAGAAGCAGAGTCAGGCGAAGCTGGGCCAGAACCGCGACCTCCGCAAC
CTTGAGCGGCATCCGTGGAGTGCGCCTGCGCAGCTACGACCGCAGCAGGAAAGCGCCGCCGGCCAGGCCCAGCTGTGGCC
GGACAGGGACTGGAAGAGAGGACGCGGTCGAGTAGGTGTGCACCAGCCCTGGCAACGAGAGCGTCTACCCCGAACTCTGC
TGGCCTTGAGGTGGGGAAGCCGGGGAGGGCAGTTGAGGACCCCGCGGAGGCGCGTGACTGGTTGAGCGGGCAGGCCAGCC
TCCGAGCCGGGTGGACACAGGTTTTAAAACATGAATCCTACACTCATCCTTGCTGCCTTTTGCCTGGGAATTGCCTCAGC
TACTCTAACATTTGATCACAGTTTAGAGGCACAGTGGACCAAGTGGAAGGCGATGCACAACAGATTATACGGCATGAATG
AAGAAGGATGGAGGAGAGCAGTGTGGGAGAAGAACATGAAGATGATTGAACTGCACAATCAGGAATACAGGGAAGGGAAA
CACAGCTTCACAATGGCCATGAACGCCTTTGGAGACATGACCAGTGAAGAATTCAGGCAGGTGATGAATGGCTTTCAAAA
CCGTAAGCCCAGGAAGGGGAAAGTGTTCCAGGAACCTCTGTTTTATGAGGCCCCCAGATCTGTGGATTGGAGAGAGAAAG
GCTACGTGACTCCTGTGAAGAATCAGGGTCAGTGTGGTTCTTGTTGGGCTTTTAGTGCTACTGGTGCTCTTGAAGGACAG
ATGTTCCGGAAAACTGGGAGGCTTATCTCACTGAGTGAGCAGAATCTGGTAGACTGCTCTGGGCCTCAAGGCAATGAAGG
CTGCAATGGTGGCCTAATGGATTATGCTTTCCAGTATGTTCAGGATAATGGAGGCCTGGACTCTGAGGAATCCTATCCAT
ATGAGGCAACAGAAGAATCCTGTAAGTACAATCCCAAGTATTCTGTTGCTAATGACACCGGCTTTGTGGACATCCCTAAG
CAGGAGAAGGCCCTGATGAAGGCAGTTGCAACTGTGGGGCCCATTTCTGTTGCTATTGATGCAGGTCATGAGTCCTTCCT
GTTCTATAAAGAAGGCATTTATTTTGAGCCAGACTGTAGCAGTGAAGACATGGATCATGGTGTGCTGGTGGTTGGCTACG
GATTTGAAAGCACAGAATCAGATAACAATAAATATTGGCTGGTGAAGAACAGCTGGGGTGAAGAATGGGGCATGGGTGGC
TACGTAAAGATGGCCAAAGACCGGAGAAACCATTGTGGAATTGCCTCAGCAGCCAGCTACCCCACTGTGTGAGCTGGTGG
ACGGTGATGAGGAAGGACTTGACTGGGGATGGCGCATGCATGGGAGGAATTCATCTTCAGTCTACCAGCCCCCGCTGTGT
CGGATACACACTCGAATCATTGAAGATCCGAGTGTGATTTGAATTCTGTGATATTTTCACACTGGTAAATGTTACCTCTA
TTTTAATTACTGCTATAAATAGGTTTATATTATTGATTCACTTACTGACTTTGCATTTTCGTTTTTAAAAGGATGTATAA
ATTTTTACCTGTTTAAATAAAATTTAATTTCAAATGTAGTGGTGGGGCTTCTTTCTATTTTTGATGCACTGAATTTTTGT
GTAATAAAGAACATAATTGGGCTCTAAGCCATAAAAAAAAAAAAAAAAAA
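As a quick consistency check, the start of the CDS (transcript position 351) can be translated and compared with the N-terminus of P07711. This sketch uses the standard genetic code and only the fifth FASTA line of NM_001912.4 (positions 321 to 400); the variable and function names are chosen for illustration:

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Fifth FASTA line of NM_001912.4: transcript positions 321..400.
LINE_321_400 = (
    "TCCGAGCCGGGTGGACACAGGTTTTAAAACATGAATCCTACACTCATCCT"
    "TGCTGCCTTTTGCCTGGGAATTGCCTCAGC"
)
LINE_START = 321
CDS_START = 351  # from the CDS annotation (351...1352)


def translate(dna):
    """Translate complete codons of a DNA string with the standard code."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


offset = CDS_START - LINE_START
first_codons = LINE_321_400[offset:offset + 30]  # first ten codons of the CDS
peptide = translate(first_codons)

print(first_codons[:3], peptide)  # ATG MNPTLILAAF
```

The translated peptide matches the first ten residues of the protein sequence given for P07711.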
|
Primary source |
RefSeq : NM_145918.2 (GI:125987604)
|
Name |
Cathepsin L1 (CTSL1), transcript variant 2
|
Type of transcript |
mRNA
|
Organism |
Homo sapiens
|
Length |
1587 nt
|
Map |
9q21-q22
|
Location |
Chromosome 9 (NC_000009.11) strand : +
90340973...90341167 | 90342508...90342643 | 90342941...90343063 | 90343164...90343310 |
90343499...90343723 | 90344487...90344649 | 90345295...90345412 | 90345922...90346383 |
|
Exon(s) | Start | End | Length | is coding
Exon 1a | 1 | 195 | 195 | 1
Exon 2 | 196 | 331 | 136 | 1
Exon 3 | 332 | 454 | 123 | 1
Exon 4 | 455 | 601 | 147 | 1
Exon 5 | 602 | 826 | 225 | 1
Exon 6 | 827 | 989 | 163 | 1
Exon 7 | 990 | 1107 | 118 | 1
Exon 8 | 1108 | 1569 | 462 | 1
|
|
Primary source |
NCBI : CCDS6675
|
Nucleotide |
CTSL1, mRNA isoform 2[NM_145918.2] : 206...1207
|
Length |
1002
|
Location |
Chromosome 9 (NC_000009.11) strand : +
90342518...90342643 | 90342941...90343063 | 90343164...90343310 | 90343499...90343723 |
90344487...90344649 | 90345295...90345412 | 90345922...90346021 |
|
Start codon |
1
|
Translation |
NP_666023.1 |
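Both transcript variants share exons 2 to 8 and the same genomic CDS coordinates, so the CDS position on variant 2 should differ from that on variant 1 only by the difference in first-exon length. A small sketch of that check, using only values listed in this record (the variable names are illustrative):

```python
# First-exon lengths from the exon tables (exon 1b and exon 1a).
FIRST_EXON_LENGTH = {"NM_001912.4": 340, "NM_145918.2": 195}
# CDS position on transcript variant 1, as listed in its CDS annotation.
CDS_ON_VARIANT_1 = (351, 1352)

# Every position downstream of the first exon shifts by this amount.
shift = FIRST_EXON_LENGTH["NM_001912.4"] - FIRST_EXON_LENGTH["NM_145918.2"]
cds_on_variant_2 = tuple(pos - shift for pos in CDS_ON_VARIANT_1)

print(shift, cds_on_variant_2)  # 145 (206, 1207)
```

This reproduces the 206...1207 interval listed for NM_145918.2, consistent with both variants encoding the same protein.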
|
>gi|125987604|ref|NM_145918.2|Cathepsin L1 (CTSL1), transcript variant 2
GGCGGTGCCGGCCGAACCCAGACCCGAGGTTTTAGAAGCAGAGTCAGGCGAAGCTGGGCCAGAACCGCGACCTCCGCAAC
CTTGAGCGGCATCCGTGGAGTGCGCCTGCGCAGCTACGACCGCAGCAGGAAAGCGCCGCCGGCCAGGCCCAGCTGTGGCC
GGACAGGGACTGGAAGAGAGGACGCGGTCGAGTAGGTTTTAAAACATGAATCCTACACTCATCCTTGCTGCCTTTTGCCT
GGGAATTGCCTCAGCTACTCTAACATTTGATCACAGTTTAGAGGCACAGTGGACCAAGTGGAAGGCGATGCACAACAGAT
TATACGGCATGAATGAAGAAGGATGGAGGAGAGCAGTGTGGGAGAAGAACATGAAGATGATTGAACTGCACAATCAGGAA
TACAGGGAAGGGAAACACAGCTTCACAATGGCCATGAACGCCTTTGGAGACATGACCAGTGAAGAATTCAGGCAGGTGAT
GAATGGCTTTCAAAACCGTAAGCCCAGGAAGGGGAAAGTGTTCCAGGAACCTCTGTTTTATGAGGCCCCCAGATCTGTGG
ATTGGAGAGAGAAAGGCTACGTGACTCCTGTGAAGAATCAGGGTCAGTGTGGTTCTTGTTGGGCTTTTAGTGCTACTGGT
GCTCTTGAAGGACAGATGTTCCGGAAAACTGGGAGGCTTATCTCACTGAGTGAGCAGAATCTGGTAGACTGCTCTGGGCC
TCAAGGCAATGAAGGCTGCAATGGTGGCCTAATGGATTATGCTTTCCAGTATGTTCAGGATAATGGAGGCCTGGACTCTG
AGGAATCCTATCCATATGAGGCAACAGAAGAATCCTGTAAGTACAATCCCAAGTATTCTGTTGCTAATGACACCGGCTTT
GTGGACATCCCTAAGCAGGAGAAGGCCCTGATGAAGGCAGTTGCAACTGTGGGGCCCATTTCTGTTGCTATTGATGCAGG
TCATGAGTCCTTCCTGTTCTATAAAGAAGGCATTTATTTTGAGCCAGACTGTAGCAGTGAAGACATGGATCATGGTGTGC
TGGTGGTTGGCTACGGATTTGAAAGCACAGAATCAGATAACAATAAATATTGGCTGGTGAAGAACAGCTGGGGTGAAGAA
TGGGGCATGGGTGGCTACGTAAAGATGGCCAAAGACCGGAGAAACCATTGTGGAATTGCCTCAGCAGCCAGCTACCCCAC
TGTGTGAGCTGGTGGACGGTGATGAGGAAGGACTTGACTGGGGATGGCGCATGCATGGGAGGAATTCATCTTCAGTCTAC
CAGCCCCCGCTGTGTCGGATACACACTCGAATCATTGAAGATCCGAGTGTGATTTGAATTCTGTGATATTTTCACACTGG
TAAATGTTACCTCTATTTTAATTACTGCTATAAATAGGTTTATATTATTGATTCACTTACTGACTTTGCATTTTCGTTTT
TAAAAGGATGTATAAATTTTTACCTGTTTAAATAAAATTTAATTTCAAATGTAGTGGTGGGGCTTCTTTCTATTTTTGAT
GCACTGAATTTTTGTGTAATAAAGAACATAATTGGGCTCTAAGCCATAAAAAAAAAAAAAAAAAAAA
|
Exon number |
1b
|
Length |
340 nt
|
Location |
Chromosome 9 (NC_000009.11) : 90340973...90341312 (+)
|
Is part of |
CTSL1, mRNA isoform 1
(NM_001912.4)
|
Sequence |
GGCGGTGCCGGCCGAACCCAGACCCGAGGTTTTAGAAGCAGAGTCAGGCGAAGCTGGGCCAGAACCGCGACCTCCGCAAC
CTTGAGCGGCATCCGTGGAGTGCGCCTGCGCAGCTACGACCGCAGCAGGAAAGCGCCGCCGGCCAGGCCCAGCTGTGGCC
GGACAGGGACTGGAAGAGAGGACGCGGTCGAGTAGGTGTGCACCAGCCCTGGCAACGAGAGCGTCTACCCCGAACTCTGC
TGGCCTTGAGGTGGGGAAGCCGGGGAGGGCAGTTGAGGACCCCGCGGAGGCGCGTGACTGGTTGAGCGGGCAGGCCAGCC
TCCGAGCCGGGTGGACACAG
|
Exon number |
1a
|
Length |
195 nt
|
Location |
Chromosome 9 (NC_000009.11) : 90340973...90341167 (+)
|
Is part of |
CTSL1, mRNA isoform 2
(NM_145918.2)
|
Sequence |
GGCGGTGCCGGCCGAACCCAGACCCGAGGTTTTAGAAGCAGAGTCAGGCGAAGCTGGGCCAGAACCGCGACCTCCGCAAC
CTTGAGCGGCATCCGTGGAGTGCGCCTGCGCAGCTACGACCGCAGCAGGAAAGCGCCGCCGGCCAGGCCCAGCTGTGGCC
GGACAGGGACTGGAAGAGAGGACGCGGTCGAGTAG
|
Exon number |
5
|
Length |
225 nt
|
Location |
Chromosome 9 (NC_000009.11) : 90343499...90343723 (+)
|
Is part of |
CTSL1, mRNA isoform 2
(NM_145918.2)
CTSL1, mRNA isoform 1
(NM_001912.4)
|
Sequence |
GGTCAGTGTGGTTCTTGTTGGGCTTTTAGTGCTACTGGTGCTCTTGAAGGACAGATGTTCCGGAAAACTGGGAGGCTTAT
CTCACTGAGTGAGCAGAATCTGGTAGACTGCTCTGGGCCTCAAGGCAATGAAGGCTGCAATGGTGGCCTAATGGATTATG
CTTTCCAGTATGTTCAGGATAATGGAGGCCTGGACTCTGAGGAATCCTATCCATATGAGGCAACA
|
Exon number |
8
|
Length |
462 nt
|
Location |
Chromosome 9 (NC_000009.11) : 90345922...90346383 (+)
|
Is part of |
CTSL1, mRNA isoform 2
(NM_145918.2)
CTSL1, mRNA isoform 1
(NM_001912.4)
|
Sequence |
CTGGGGTGAAGAATGGGGCATGGGTGGCTACGTAAAGATGGCCAAAGACCGGAGAAACCATTGTGGAATTGCCTCAGCAG
CCAGCTACCCCACTGTGTGAGCTGGTGGACGGTGATGAGGAAGGACTTGACTGGGGATGGCGCATGCATGGGAGGAATTC
ATCTTCAGTCTACCAGCCCCCGCTGTGTCGGATACACACTCGAATCATTGAAGATCCGAGTGTGATTTGAATTCTGTGAT
ATTTTCACACTGGTAAATGTTACCTCTATTTTAATTACTGCTATAAATAGGTTTATATTATTGATTCACTTACTGACTTT
GCATTTTCGTTTTTAAAAGGATGTATAAATTTTTACCTGTTTAAATAAAATTTAATTTCAAATGTAGTGGTGGGGCTTCT
TTCTATTTTTGATGCACTGAATTTTTGTGTAATAAAGAACATAATTGGGCTCTAAGCCATAA
|
Primary source |
Uniprot : P07711
|
Name |
Cathepsin L1
|
Alternative name(s) |
Major excreted protein
|
Synonym(s) |
MEP
|
Organism |
Homo sapiens
|
Length |
333 aa
|
Protein existence |
---
|
General annotation (Comments)
|
Catalytic activity
|
Specificity close to that of papain. Compared with cathepsin B, cathepsin L exhibits higher activity toward protein substrates, but has little activity on Z-Arg-Arg-NHMec and no peptidyl-dipeptidase activity.
|
Function
|
Important for the overall degradation of proteins in lysosomes.
|
Similarity
|
Belongs to the peptidase C1 family.
|
Subcellular location
|
Lysosome.
|
Subunit
|
Dimer of a heavy and a light chain linked by disulfide bonds.
|
Biological process
|
proteolysis [GO:0006508]
|
Cellular component
|
extracellular region [GO:0005576]
lysosome [GO:0005764]
|
Molecular function
|
cysteine-type endopeptidase activity [GO:0004197]
protein binding [GO:0005515]
|
Alternative product(s)
|
Feature key | Position | Length | Description | Feature identifier
|
Molecule processing
Propeptide | 18 - 113 | 96 | Activation peptide | PRO_0000026244
Propeptide | 289 - 291 | 3 | | PRO_0000026246
Signal | 1 - 17 | 17 | Potential | P07711-SIGNAL-1
|
Sites
Active site | 138 - 138 | 1 | | P07711-ACT_SITE-138
Active site | 276 - 276 | 1 | | P07711-ACT_SITE-276
Active site | 300 - 300 | 1 | | P07711-ACT_SITE-300
|
Amino acid modifications
Disulfide bond | 135 - 178 | 44 | | P07711-DISULFID-135
Disulfide bond | 169 - 211 | 43 | | P07711-DISULFID-169
Disulfide bond | 269 - 322 | 54 | Interchain (between heavy and light chains) | P07711-DISULFID-269
Glycosylation | 221 - 221 | 1 | N-linked (GlcNAc...) | P07711-CARBOHYD-221
|
>sp|P07711|CATL1_human Cathepsin L1
MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF
GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS
LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVA
TVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRN
HCGIASAASYPTV
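The feature positions listed for P07711 can be checked directly against the sequence. The sketch below assumes 1-based residue numbering. The two intervals it reports as not covered by the signal peptide or propeptides are an inference from the listed features; they presumably correspond to the heavy and light chains mentioned under Subunit, but the record does not list them explicitly:

```python
# P07711 sequence, copied from the FASTA record above.
SEQ = (
    "MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF"
    "GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS"
    "LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVA"
    "TVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRN"
    "HCGIASAASYPTV"
)

# Feature positions (1-based) from the feature table.
ACTIVE_SITES = (138, 276, 300)
DISULFIDE_BONDS = ((135, 178), (169, 211), (269, 322))
GLYCOSYLATION_SITE = 221
PROCESSED = [(1, 17), (18, 113), (289, 291)]  # signal and propeptides


def residue(pos):
    """Return the residue at a 1-based position."""
    return SEQ[pos - 1]


def uncovered(segments, length):
    """Return the 1-based intervals of 1..length not covered by segments."""
    gaps, cursor = [], 1
    for start, end in sorted(segments):
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= length:
        gaps.append((cursor, length))
    return gaps


active = "".join(residue(p) for p in ACTIVE_SITES)
bonded = [(residue(a), residue(b)) for a, b in DISULFIDE_BONDS]
sequon = SEQ[GLYCOSYLATION_SITE - 1:GLYCOSYLATION_SITE + 2]
chains = uncovered(PROCESSED, len(SEQ))

print(len(SEQ), active, bonded, sequon, chains)
```

With this sequence the active-site residues are Cys, His and Asn, every disulfide position is a cysteine, and residue 221 sits in an N-x-T motif. The interchain bond 269 - 322 spans the two uncovered intervals, which is consistent with a disulfide-linked two-chain enzyme.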