(HGS) hepatocyte growth factor-regulated tyrosine kinase substrate [Homo sapiens] |
|
|
|
|
|
|
Gene
Transcript(s)
Exon(s)
Protein(s)
|
|
Accession
|
9296
|
Official symbol
|
HGS
|
Official name
|
hepatocyte growth factor-regulated tyrosine kinase substrate
|
Gene type
|
gene with protein product
|
Organism
|
Homo sapiens
|
Location
|
Chromosome 17 (NC_000017.10) : 79651019...79669147 (+)
|
Map
|
17q25
|
Length
|
18129 nt
|
NM_004712.3
|
HGS, mRNA isoform 1
|
Accession
|
Name
|
Organism
|
Length
|
O14964
|
Hepatocyte growth factor-regulated tyrosine kinase substrate
|
Homo sapiens
|
777 aa
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Synonyms |
ZFYVE8; Vps27; Hrs
|
Alternative name(s) |
human growth factor-regulated tyrosine kinase substrate
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Related Articles in PubMed
|
|
|
|
|
|
|
|
|
|
|
|
Go to ensembl
|
|
|
|
|
|
|
|
|
|
|
|
1 |
GCGGAAGCGG |
AAGTCGGGGG |
GCGCGCCAGC |
TCGTAGCAGG |
GGAGCGCCCG |
CGGCGTCGGG |
TTTGGGCTGG |
AGGTCGCCAT |
|
81 |
GGGGCGAGGC |
AGCGGCACCT |
TCGAGCGTCT |
CCTAGGTAAC |
GCGTCCCCAC |
CCGACGGCTC |
GGCCTTGGTC |
AGCCCGCAGC |
|
161 |
CTCCGCCCGG |
CGCTGAGTCA |
GCGCGGCCCC |
GAGGGCCCGA |
GGGGCGCGGA |
CCGGCCGCCC |
TGCCCGCCTC |
TCCCCGGCCC |
|
241 |
GCTGTCCTCC |
CGGGTTCGGA |
GCCGCCGATC |
CTGGACCCTC |
GGCGCGGCCG |
TGGGGTCGGC |
GTCCGTCGGG |
CGTTGGCAAA |
|
321 |
TGGGGCTCTG |
CGAGGGTCCG |
CGCAGGGCGC |
CAGGGGCGCT |
CGGCGACCTC |
GCTCCTCCGC |
GGGCCCGGGC |
ATTCGGCCGA |
|
401 |
TTTCCTCTTG |
ACCGACGAGC |
ACCCGGCCGA |
GTGGGCCACA |
AGGCCTGAGT |
CATGAAAGTG |
GGTGACTCAG |
TCGGATGTTA |
|
481 |
GGCTCGCACT |
TGGGCTTCTG |
ACCCAAAACA |
CTGGGCCTCG |
AAGGGTGGAG |
CCGGCAGTGT |
TCTCTCTGGG |
ACCTTCGAGC |
|
561 |
TCTCTTGGGG |
GCTGCGGGTC |
CCAGACCCAC |
CTTTTGTTCC |
CACACTGGCT |
TCGGGAAGTG |
CCTAATGGTT |
TAGGAAGGCG |
|
641 |
TTTTCTCACG |
TTTTTGGAGG |
TGAAAGCTGA |
TCCTATCTGA |
GCCTCCTGGA |
ATTGACGCGT |
TTTGGGAGGG |
AAACGAATGT |
|
721 |
TGATGTGCAC |
GTGCCCAGCA |
GCACGTGGAA |
AATGCGTCTA |
ATGTTTGGGG |
GTAGGAGCTT |
GGGTGGTCGA |
ATGGGCTGCC |
|
801 |
CAAGCTTGCC |
TTTTTGTCAA |
GTAGGGTTTT |
CAAGGAGACT |
TCAAGTGAGA |
CACCTTCCTT |
GCCAAGGGAA |
CCAGCAAGCA |
|
881 |
GACTGAGTAT |
CTCAGCCACA |
CAGGTAGTTC |
AGTGCTTCAG |
GGATGAATCC |
AGAGGTTAAC |
TACTGAATCA |
GTATAGAAAG |
|
961 |
TGGACGTGGC |
TTGGCTTGTT |
AGATCGTTTC |
TGTATCCGGG |
AAAGGCGATG |
TGGGCAGAGC |
TGGAGCTGCC |
CCCCCAGAGG |
|
1041 |
GGCCAGTTGT |
TGGAGACAGC |
AGTGATGTGG |
TATGATGACA |
GGTATGCTGG |
CGAGTCCTCC |
TTGGAGCCCC |
GTTCTGCCCT |
|
1121 |
TATTTCCAAA |
GCACTTGGCT |
TGGAGCAAGG |
GGCCTGGAGT |
CTACACTGCA |
GGCCTCTCTA |
GCCCTTGCCC |
AAGCTAAGGT |
|
1201 |
CCCATTGAAA |
CCGGGAGCAT |
CTGGGCGAAC |
GGGGACTCCA |
AGGGACGGGT |
GATAGGGTTC |
TTTGTTGAGA |
AGCAGCCGAG |
|
1281 |
CCTGATGTTT |
AAGAGTGAAT |
CAGCCTCTGT |
CTCTGGTGAG |
CTGTGTGACT |
CACGGCAGTG |
CCCTCGCCTC |
TCTGCTGAGT |
|
1361 |
GTCTTTACCT |
CGCTCGCGGG |
TTGTGGGAGG |
GCTTGGACCA |
GCACCTGGTG |
TGCTCTTACG |
CTGTTTGCTG |
CTTCCCTGGG |
|
1441 |
GAGGATGGCA |
CTTCCCTGCC |
CATCAGCACG |
CTCAGGAGGA |
TTCTGTCATG |
CTTTTTCATT |
TGGTCCAAAT |
GTCTAAAGGG |
|
1521 |
GAAGGAGAGA |
GTCCCCCCCA |
GCTGCTGCCA |
AGCTGGGGTG |
CTGCACGGGG |
CGTCCAGCAG |
GATAACCCCT |
CCCTTTCATG |
|
1601 |
GCATTTTCTC |
TCTCAGACAA |
GGCGACCAGC |
CAGCTCCTGT |
TGGAGACAGA |
TTGGGAGTCC |
ATTTTGCAGA |
TCTGCGACCT |
|
1681 |
GATCCGCCAA |
GGGGACACAC |
AGTGAGTTAG |
CGGGGCCTGT |
GCCCTGATGC |
GGAGGAGCAG |
CCGTGCACTA |
AGCTGGGCTC |
|
1761 |
GCTTGGTGCC |
CGTTGGGTCT |
CCACGACGTA |
GCTGACCGTG |
TTAGGCTCTT |
ATGTGGAGAA |
CTGCATGGGC |
TACATGCTAG |
|
1841 |
GATTTTTTGA |
CATCTTGGAG |
GATTAAGGGG |
TTTTCTCTGT |
ACAGCTTGCA |
GATACCTGGT |
TATGTTTGTA |
TTTGCTTCGG |
|
1921 |
GGTTCTCAAA |
GACTAAGATG |
GCAGACAGAC |
TTTTTTTTTT |
AAGACGGGGC |
CTTGCTTTGT |
CACCCAGGCT |
GAAGTGCAGT |
|
2001 |
GGCGTGATCT |
CAGCTCACTG |
CAGCCCCTGC |
CTCCCAAGCT |
CAAACGATCG |
TCCCACCTCA |
ACCTCCTGAG |
TAGCTGGGAC |
|
2081 |
TACAGGCATT |
CACCATCACG |
CCCGGCTGAT |
TTTTTGTATT |
TTGGTAGAGA |
TGGGTTTTTG |
CCGTAATGCA |
CAGGCTGGTC |
|
2161 |
TCAGACTCCT |
GGGCTCAAGC |
AATCCTCCTG |
CCTTGGCCTC |
CCAAAGCACT |
GGGATTACAG |
GCGTGAGCCA |
CTGCACCTGG |
|
2241 |
CAGCAGATGA |
GCTTTTCTCA |
TCTTAGTTGC |
TGAGACTATT |
TTTTTCCCGT |
TTTTCTCTGC |
TTTTATCAAT |
GTTTCCTTTT |
|
2321 |
CAGAGCAAAA |
TATGCTGTGA |
ATTCCATCAA |
GAAGAAAGTC |
AACGACAAGA |
ACCCACACGT |
CGCCTTGTAT |
GCCCTGGAGG |
|
2401 |
TAAGCAGACC |
CCCGTGCCTC |
AGTGGCCCCC |
AGGGTCCCTA |
CCCCCTACTA |
GTCACGACAG |
CATGAGAAGA |
GTTTGTCCTG |
|
2481 |
TGGCCTTGCC |
CACCTCCCTG |
GGCATACATC |
AAGCAGGAGA |
GTTTCCACTC |
AGCCTTGAGT |
GCCTGTGTGA |
ACAGTGCCAC |
|
2561 |
CTAGGAGAGG |
GTCACGCTGA |
GGGAGAGACA |
GGAGATAGGG |
GAGGGAGTGG |
GCACCACTTC |
TTCAGCCGGT |
CCTGCGTGAG |
|
2641 |
TAAACACCAG |
AAGGCGGATG |
TCATTGCTGT |
GCTTGGCAGT |
GAGACTTGGC |
GTCACGGAGC |
GGTGCTGTGC |
AGGCTGCTGA |
|
2721 |
GTCTCAGATG |
TGGGTTCCCC |
ACTGGCCTGA |
CGGGCCTGTT |
GAAGGGCTGC |
TGTCCCACCG |
GGTTCCCCAC |
CGGCCACTGA |
|
2801 |
CGGGCCTGTC |
GAAGGGCTCT |
GCTGTCCCAC |
CGGGTTCCCC |
TCCGGCCACT |
GACGGGCCTG |
TCGAAGGGCT |
CTGCTGTCCC |
|
2881 |
ACCGGGTTCC |
CCACCGGCCA |
CTGACGGGCC |
TGTCGAAGGG |
CTCTGCTGTC |
CCACAGGGAG |
GTGGGTCTGT |
CCGGGAGGAG |
|
2961 |
GAGGGGCCCT |
GCCCTGACCA |
CTGGCCCAAC |
CCTTCCCTTC |
CTGGGCATCT |
GCAGGTCATG |
GAATCTGTGG |
TAAAGAACTG |
|
3041 |
TGGCCAGACA |
GTTCATGATG |
AGGTGGCCAA |
CAAGCAGACC |
ATGGAGGAGC |
TGAAGGACCT |
GCTGAAGGTG |
GGTGAGACGG |
|
3121 |
GGGCATGCGG |
GTGGCCACCC |
AGGCTGGCAC |
CTTTGCTTCT |
CCGGGTGTTT |
ACTGGGCACT |
GATGACAGAA |
CAGCACCAGG |
|
3201 |
TGTGGCGGAA |
AATGTGCAGG |
TGCAGATCGG |
TGGTCCCTGC |
ACTGCGCAAG |
CTCGCACTCA |
TGAGTCGGAG |
AGACAGGGCC |
|
3281 |
GTGAGCCCCA |
ATCCAGGGAG |
GTCAGGGTTT |
AGCAGGAGCC |
GGCCTGGGGT |
GCTGTGTCTG |
GAGGACCAAG |
GTGACAGCGA |
|
3361 |
CCCTGGGGCC |
TGGGGGAAGC |
TTCCGGGAGG |
CAGGGCTTGG |
GCACCTGCTG |
TCTTAGGGAA |
GGTGAGTAGA |
TGCCACTGGG |
|
3441 |
TGGGCAGGGC |
AGGTGGTGGC |
TATCGTGTGT |
GAGAGTGGCT |
GTGGTCCGGA |
GGGCAGGCTG |
GCCACATCCC |
ACAGGCAGGG |
|
3521 |
GCCTGGGAGG |
GCTCTGAGCG |
GGGGCTTGTA |
AGATCGGATG |
TGTCTGGCGG |
GGATAAGAGC |
TGTTTTGGAG |
AGCGAGGGCT |
|
3601 |
TCATGCTTTC |
CTGGGCAGAG |
GAGGAAAAGC |
CCTCCAGATC |
CCCAGGAGAT |
TGTGCCGGGG |
ACATCTGCAG |
GGATTGTGCT |
|
3681 |
GGGGACACCT |
GCAGGGACTG |
TGATGTGGAC |
TGTGATGCAG |
TGGGGATGCT |
GTTGCAGGTG |
TCAAGGAGAA |
AGAGTGTCCC |
|
3761 |
ATGTAGTCAG |
GCTGCTCCCC |
TTCTGTCCCA |
ACCTGTGGGC |
CAGACCTGGG |
TCTCATAGGA |
GACTCCCTGG |
ACTCACACAG |
|
3841 |
CTGAGAACCT |
GCCCGGGGAT |
CCCGGTGGAC |
AGCCAGGCCT |
CTCAAAGGCC |
CTCACTGGCC |
ACTTTGGCTG |
GCTTCCTGTC |
|
3921 |
CCGTCCTGTC |
CTGGCTGAGA |
GGCGGAAGGT |
GCCGGAGTCT |
GGCCCTGCCC |
GACCCTGGTG |
CTCCTGAGCT |
CCTTCAGTCA |
|
4001 |
GGCTGCAGAT |
CCTGAGCCGG |
TTCGCTCCTC |
TGGGGTGAGG |
CAGGGCTGTG |
AAGGGAAGGC |
GCCGCTGAGG |
ATGCAGAGGG |
|
4081 |
GTTCTGGAAA |
GGAGCTTAGG |
CTGAGGTGCA |
AGTGCCCCGG |
AGCCTGCTTG |
CAGCAGCCTC |
TGATCGCACG |
CCGTCCTGCA |
|
4161 |
CGTCACACCG |
GGACGGCGTC |
CCCGAGAGGG |
CCGTAGACGC |
GAGGCCAGGC |
TGCGCTCTAA |
CCGGGGGAGA |
GGATGGCTGC |
|
4241 |
AGGGGGTCTG |
GAGCCCGCTC |
TGAATTTGGA |
GGCATTTCTG |
GAGCGGCTGC |
CCCCCGCGTC |
TCCCCGGCTG |
TGACTGGAAG |
|
4321 |
GAGGGGCGTG |
GCCAGAGCCC |
CGCAGTACCC |
CTGTGTTCTC |
AGCCCCAACA |
CCACGCTGCC |
ACCGTCAGGC |
CTGCCGGGTC |
|
4401 |
CTGCCTGTTG |
GAGGCTGACG |
GGAACCGGGG |
ATCCTGGGGT |
GGGCAGGTCC |
CGTGGGGAAA |
GGGAGAGACA |
GATGGTCAGG |
|
4481 |
GGCACAGAGA |
CGCGGCGTTG |
TTTGGCTCCA |
GCAAGTAGAG |
GCAGGCGTGA |
GGTTTGAACT |
CAGGGTCTGG |
GAACAGGTTG |
|
4561 |
GTGCATGGCC |
CCCACGCCCC |
ACTCGGGGGG |
CCCTCCCTGG |
CCTCTGTGTC |
AGCCCCATCT |
GCTCAGAGGC |
TGAGGTCTGC |
|
4641 |
CAGCTGCTTC |
CTCACACCGA |
GGCCTCTGGC |
GCCTGGAGTG |
TCCTGCGACC |
CTCACCCCCT |
TCTCCCTGCC |
TGCAGAGACA |
|
4721 |
AGTGGAGGTA |
AACGTCCGTA |
ACAAGATCCT |
GTACCTGATC |
CAGGCCTGGG |
CGCATGCCTT |
CCGGAACGAG |
CCCAAGTACA |
|
4801 |
AGGTGGTCCA |
GGACACCTAC |
CAGATCATGA |
AGGTGGAGGG |
TGAGTCAGGA |
CTGAGGTTGG |
GACCAGGTTG |
AGGCTTGGAA |
|
4881 |
CTGCTGGGCA |
GTCAGGCTCA |
ACGGGCACAG |
TGGCGAGGGG |
CCTGGGAAGA |
TGGGTTGTTC |
CCTGTGTTGG |
GAGGGAGGAG |
|
4961 |
GGCGGTGGCC |
TGGAGCCAGG |
GAAGACCGTG |
CATGTGAGGG |
CGGGTTCAGA |
CCGGCTCCTG |
CCTTTCTGGA |
CCCACGGCCC |
|
5041 |
ACGGGCAGGG |
GCAGGCAGTG |
ATGGCAGACA |
ACACACAGGT |
GAAAAGGCAG |
CAGACAGCCT |
GTTTCAGAGT |
GGCAGGACTC |
|
5121 |
CTGGGGGCCG |
GGTTGGGCTG |
TGCAGTCAGG |
GAAGGCCTCT |
GAAGGAGCTG |
TCCCAAGTGT |
CGAGACTGGA |
ACAGAAGCCA |
|
5201 |
GAGCCGGCGT |
GAGCAGGCAG |
GCAGCACCCG |
TGGGCAGACA |
CTCAGGGTTG |
AGGCCCCAGG |
GCAGAAAGGT |
GGCCAGCGTG |
|
5281 |
CCAGAGGAGA |
GTGCAGCCGG |
GCCTGTGCGC |
CGAGGGCCTC |
CAGGCTGTTG |
GGCTGGAGTT |
CAGGGTTGCT |
GTCAGCGCAG |
|
5361 |
CAGGAGGCCT |
CTGGTGGGTT |
TTCGTCTAGG |
TCGTGGCTGC |
CAAGGTGTGT |
TTCTTAAGGG |
TCCCTCTCGG |
CCATTGTGGC |
|
5441 |
CGTGGAGAAC |
TAGGGGATTG |
TGTAGAGAGA |
GGTGGGGGCT |
TCAGGATGCA |
TTTGGGAGTC |
TTGGCTGGTG |
CTCAGGGCCC |
|
5521 |
ACTCAGTGAG |
AGAGAGGTGG |
GCATGGCTGC |
TCACATCTGT |
GCGGGTTCTC |
TGTGGCCCTG |
AGTCTGCTGA |
AGACGAGGCT |
|
5601 |
GAGAGCGTGG |
GGAATGGGGC |
AGGCTCCAGG |
GGTGGTGCGT |
CCCCGCTGCA |
GGAGTTGGAC |
GCTTGCCCTG |
TGGCTGCAGG |
|
5681 |
GTGGGCAGGA |
CCAGTGTGGA |
GAACACGGGA |
AGAACGCAGG |
GGGCGCTGCA |
CGAGCTGCCT |
TGGACCCTCA |
ACTTTGGGGT |
|
5761 |
GGTGGATGCT |
GGTGGTCTGT |
GCTCCTGTCT |
CCTTCCTGGG |
GTGGGAGCTC |
TCCTGGATTT |
TTTCTGGGGT |
CGTGTGTCAA |
|
5841 |
GAATAGCTCA |
AGGTCTTGTC |
CTCAGGATCA |
GTGGGAGCCC |
TGGCCAGAGA |
GCAGAGGGTG |
AGGGAGCCAC |
TCAGCAGGCG |
|
5921 |
GGGCCACGCA |
GGGGCCTCAT |
GCCCTGATGG |
AGTGGGAGCA |
GTGTCTCAGG |
AAGAGGGCAG |
CAGGGCCTGG |
CTCTTGTCTC |
|
6001 |
CCGCCACAGT |
GAGGACAGAG |
GCCCCATGGG |
GACACCCAAT |
CTTGGGGGCA |
GGAGGCTGTC |
TCGGTGCCCC |
CAGACACCTT |
|
6081 |
CACTTGGGTG |
CACGGTCACT |
GCTGCGTTGC |
AGCTGCCGAG |
AGTGAGGAAG |
GGGCTCCCGG |
AAGTCTTGCA |
GGCCAGGTGG |
|
6161 |
GAGCAGGCAG |
TGACTATGGC |
TTCATCTCTC |
CAGGGCACGT |
CTTTCCAGAA |
TTCAAAGAGA |
GCGATGCCAT |
GTTTGCTGCC |
|
6241 |
GAGAGAGTGA |
GTGTGGGCGG |
CCGCCAGGGG |
TTCTGGAGTC |
GGGCTGCTCA |
GGAAGCGTGA |
AGGGGAGTGC |
TGGGAGCCCG |
|
6321 |
GCTTGTTTGA |
GGGTTGGTGG |
CTGAGCTCTC |
GTCTGTCTGG |
GACCTGAGGA |
TGCCTGTGTG |
TCCTGGGCGG |
AGGCGTAGCA |
|
6401 |
GCTGTCTCTG |
CAGGGCGAGG |
TGGAGACGTG |
GCTTGAGGGC |
CTTTAGAGTG |
GCTAGGGGGC |
TCGGGAAAGG |
AAAACAAGCA |
|
6481 |
CCTTTGATGA |
GGAAGGAAGT |
CCCTTCCTCA |
GGCCTCGGCA |
GCTTCACCCC |
AGGCCACGCC |
GCTGGGAAGG |
GCCGCCCGGG |
|
6561 |
GCATTGCTCT |
CGCTCGTGCT |
GGGGTCCTCT |
CTGGGTCCGT |
TGCCTTCTGT |
GGGGCAGGGG |
AGGCTCTGTG |
CCTCTGGGGC |
|
6641 |
TGGTGGGGCG |
GGAAGCGTGT |
GGTCCTGACT |
GCTGCCCCTC |
CTCAGGCCCC |
AGACTGGGTG |
GACGCTGAGG |
AATGCCACCG |
|
6721 |
CTGCAGGGTG |
CAGTTCGGGG |
TGATGACCCG |
TAAGGTGAGT |
TCCCACCTGG |
GGGGCTCTAC |
AGCCCCGGCC |
AGACACCAGG |
|
6801 |
TCCCCTGCCG |
TGCACAAGGC |
CACCGTCCCT |
GCTGGGCCTT |
TCCTTCCTGG |
TGGGTGGTTC |
AGGAGGTGAG |
TGCAGCTCGC |
|
6881 |
AGCGGGTGGC |
AGTGTGGCGT |
CAGGAGGGGG |
ACAGTGAGGG |
CCCACGTAAG |
GGCCTGATGC |
CACTGCCTGG |
GCCGGGCCCA |
|
6961 |
GCCTCACTCT |
GGAATTATAA |
TGTTTTAAAC |
CTAGGGCTAC |
AGGCCTTGCC |
ACAGTCACTC |
TGCTGCTTGC |
CAGGGCCGTG |
|
7041 |
GGGCCCCCTT |
CTCTTTGTGG |
CTTAGCCCAT |
CTGGATTCTT |
ACCCTCGAGG |
CATCTTCACA |
TCAAAACCCC |
CTCCAGGCTG |
|
7121 |
GCAACCGGCA |
GTGCTTGGGG |
TTTGGGGTGT |
CAGCAGCTTC |
CAGCACCCAC |
TGCACTCACA |
AAAGCTCTTG |
TTTTATCAGC |
|
7201 |
AGAATTGATG |
TGTATTTTTT |
CCTTGCCCTT |
ACTTTTAACT |
TACCTTATTT |
TCCCCAAAAC |
GGTGGCTGGC |
GTTGAGACTC |
|
7281 |
CCGGGAGCAT |
GTCCAGGTTC |
CCCGGCCTTA |
GGGTCTTCCC |
AGGCACTTGT |
TCTGCTTGTC |
CCTTGCCTTC |
CCCCACCTGT |
|
7361 |
GAGGCCCAGC |
TTCGGCATCG |
TACGGGGTGG |
TTCTGGGCCG |
GGTGGCGCAT |
CAGGGTCCCC |
CAGTGCCTGT |
GACCAGGCCC |
|
7441 |
GCCCGCCCCA |
TCTTACAGCA |
CCACTGCCGG |
GCGTGTGGGC |
AGATATTCTG |
TGGAAAGTGT |
TCTTCCAAGT |
ACTCCACCAT |
|
7521 |
CCCCAAGTTT |
GGCATCGAGA |
AGGAGGTGCG |
CGTGTGTGAG |
CCCTGCTACG |
AGCAGCTGAA |
CAGGTGAGTC |
CCCGCCCCCC |
|
7601 |
ATTTGGGCTG |
CAGGTGGGGC |
AGGCTCTCCA |
GGCTGGGTTT |
TCTGTCCCTC |
TTGGCCATGG |
TGCCTGAGGC |
CTGCAGACCC |
|
7681 |
CAGAGGACCC |
TCACAGCACA |
GCAGCTGGAA |
GGTCAAGGGA |
AACCCAGGGT |
GGCCGCATGC |
CCTCGGACCC |
TGCCCCACAC |
|
7761 |
TAGGGCAGGT |
GGGTGTGAGA |
GACAGGGCGC |
CGCGGCTCCA |
GGGACCGAGG |
CTGCCCCGAC |
AAACCTGTTG |
CTTGGGTTTG |
|
7841 |
GGTTTGGGTT |
TGTTTGCATT |
TCAACTTTCG |
GAATAAAACT |
TACAGAAAAG |
TTGCAAGAGT |
AGCACAGAGA |
AGCTGCGGGG |
|
7921 |
CCGCGGCCAG |
TGCCTCAGCG |
GTGGGAACCT |
GCGGGGCCGC |
GGCCGGTGCC |
TCGGCGGTCG |
GTGTTTTGTC |
GCAGAGTTTA |
|
8001 |
CTCTGCCTCC |
CCTCTCCTGC |
GCGTGCGTGT |
GTTCACACAG |
GTTTTATTTT |
GAATTTGCAT |
GAGTGCAGAT |
GTCATGCCCC |
|
8081 |
TTGGCACTCA |
GATCTTCGCC |
TGTGATCTCT |
GAGAGGGACA |
GTGCTCTCAT |
GGCCACAGCA |
GTCACTCGGG |
ACTGCGCTGC |
|
8161 |
CACCCAGTGC |
TGGCTTCTGC |
TCTGTGGTCC |
ACGTTCCATT |
TCTGCCGTGG |
TCCCAGCAGC |
GTCGCTGTGG |
GTCTGGCCTG |
|
8241 |
GGTTGCGTGT |
GTTTCGTATG |
TGGGCCGTGC |
TCCCTGCTTG |
GTTCCCTTTT |
CCTGGAACGT |
GTCACTGCCT |
CCCTGTCTCG |
|
8321 |
CTCCGTGGAC |
ATTTCTGGGA |
GGTCAGGCCG |
TGGCCACCTG |
GCCCCCTGTT |
CAGGTCTGAG |
GCTCCCACCT |
GCTTAGGTTC |
|
8401 |
GGGAAGCTCA |
GGAGTGAGGC |
CATGCCCTCC |
TCAGGACATC |
CCATCCAAGC |
CAGCCATGTC |
CGGTGATGGG |
CCGCTGCCCG |
|
8481 |
GAAAGTTCCT |
TTTCCTTCTT |
GTAACTGAGA |
AGAACTTGCC |
TTGAGCCACG |
TCAAGTCCCG |
TCCGTCGCAG |
CCACTGCCCA |
|
8561 |
CAAGCGTGAG |
TCTGCTGTGA |
GCCAGCGGCT |
CCATGGCAGG |
GCATCCCAGC |
GCCATTCCTG |
CCTTCACACA |
CACTTGCTGC |
|
8641 |
CGTTTCCCTG |
TGCTGGGGGC |
TGTGCAGGTC |
TGCCTCGGTG |
TGGACTTTTC |
TCTTAGGAAA |
GAGCCCCAGG |
TCGGCCGAGC |
|
8721 |
ACGGTGGCTC |
ATGCCTGTAA |
TCCCAGCACT |
TTGGGAGGCT |
GAGGCGGGCA |
GATCACGAGG |
CCAAGAGATC |
AAGACAATCC |
|
8801 |
TGGCCAACAT |
GGTGAAATCC |
CGTCTCTACT |
TTTTAAGTAT |
TTTATACTTA |
AAATTTTTGT |
ATTTTATACA |
AAAATTAGCG |
|
8881 |
GGCTTGGTGG |
CAGATGCCTG |
TAGTCCCAGC |
TACTCGGGAG |
GCTGAGGCAG |
GAAAATCACT |
TGAACCTGAG |
AGGCGGAGAT |
|
8961 |
TGCAGTGAGC |
CAAGATGGCG |
CCACTGCATT |
CCAGCCTGGG |
CGACAGAGCA |
AGACTCTATC |
TCAAATAAAA |
AAAAAAGAAA |
|
9041 |
AGAGCCCCAG |
GTCAAAGGTC |
AGCGTGCGTG |
GTTGGAAATT |
GCGACCATGC |
TGTGAGCCTG |
CGGCCCTTGG |
ACTGCCTGGG |
|
9121 |
GCGCCCCGGA |
AGAGCTTACT |
GGGCATGACA |
GTCACTGACC |
TGTTTGCCCC |
TTCTTGGGTG |
GGGCCCGTCC |
CACAGTGGCG |
|
9201 |
GTGCTGTGTT |
TGGTTTCCCG |
AGTGCCGTAG |
GAAGGTCTTC |
GCAGGCCGGG |
CTGGTGTCCT |
GAATGTCCAT |
TTTCAGGAGA |
|
9281 |
GGAACTGGGG |
CTTACCCTGA |
GGGACTCACC |
CGAGCCCTTG |
TGGCTGGTGT |
GATGCCCTCG |
AGTCCTGCCC |
TGCTCTGCTC |
|
9361 |
TGGAGTGTGG |
CCATTCGGAC |
ACCTGTGGCT |
GTGGAACCAC |
TGTCCTCACT |
CTGTGGGGAC |
ACTTAGAGGA |
GCCCGCAGAG |
|
9441 |
GTGTGGTTGA |
GAGCTTTGGC |
GGGGGCAGGG |
CGGTGTCAGC |
GCATGGTGAC |
CTGCAGCATT |
CCTTGTGCCC |
ACAGGAAAGC |
|
9521 |
GGAGGGAAAG |
GCCACTTCCA |
CCACTGAGCT |
GCCCCCCGAG |
TACCTGACCA |
GCCCCCTGTC |
TCAGCAGTCC |
CAGGTACTCA |
|
9601 |
GCCCCCTCCG |
TCCCGTGGGC |
ACCTCTTCCC |
CGGCGCCCCC |
CCTCACCCTC |
CCCGCTTGTC |
CTCAGCTGCC |
CCCCAAGAGG |
|
9681 |
GACGAGACGG |
CCCTGCAGGA |
GGAGGAGGAG |
CTGCAGCTGG |
CCCTGGCGCT |
GTCACAGTCA |
GAGGCGGAGG |
AGAAGGAGAG |
|
9761 |
GCTGGTAAGC |
CGGGTGGGGC |
GGGGCGGCCT |
CAGGAGGGGC |
CCAGCTCCCC |
TGGATGTGCT |
GCGGTGGGGC |
CGGAGGGGCG |
|
9841 |
TCACGTGCAC |
CCAAGTGACG |
CCCCTTCTGA |
TTCTGCCTCA |
GAGACAGAAG |
TCCACGTACA |
CTTCGTACCC |
CAAGGCGGAG |
|
9921 |
CCCATGCCCT |
CGGCCTCCTC |
AGCGCCCCCC |
GCCAGCAGCC |
TGTACTCTTC |
ACCTGTGGTG |
AGCGGCCCTT |
GGGCTGGAGC |
|
10001 |
TCCCTCTCCT |
GGAAGGCAGT |
AGGGTTGATG |
GGGGACGCGG |
GTCCCTGAGC |
TGATTTAGTC |
AGGTTGGTGG |
GGGATGCGGG |
|
10081 |
TCCCCGGGCT |
TCCCAGAGAG |
GACAGCCCCA |
GCACACGGGC |
GGACATCAGG |
GCAGAGCCCC |
ACGGCTCCAG |
GCACCCGTTG |
|
10161 |
CTCCTGGTCC |
CTGGTTCGGC |
CCTTGGGACT |
GAGGACATGG |
GACAGGCTGT |
GGGGGAGGCG |
CCTTGCTGGT |
GAGACCCCGT |
|
10241 |
CTCTTCCACA |
CCCCTCCCGC |
CCCACAGGAA |
CTGAAGGACT |
TGCTTTCTGG |
GGCATCCACG |
TCACCTCTCC |
CTGGCCTCAG |
|
10321 |
CCCCGCTCTC |
AGTAGAGGGT |
GAAGAGGCAG |
CCTGTTTTGC |
AGGGGGTTGG |
GTCGGGGACA |
GCAGGTTGAG |
AAAGTCATTT |
|
10401 |
TGTAGGTTCC |
GTGGAGTGGC |
CAGGCCGCTG |
CAGGCCCAGG |
GGTGCAGAAC |
AGCCCTGCCC |
CAGTGAGCCA |
CCCCTTCCCT |
|
10481 |
TCTTCCCTTC |
CTTCCGGGCT |
GTCGGCTGGG |
GCCCCACACG |
CTGGACCGTG |
GGGGCTGTCC |
AGTGTCCACC |
TTGAGGCCCC |
|
10561 |
AGGGCTGGCC |
CAGTGCCACC |
CTGTTCCTCC |
CATAGCAAGG |
TTAGGAGCCT |
TGTAGGACGG |
CAGTGCTGTC |
AGCCTCTGTG |
|
10641 |
GAGTTCTAAG |
CATCTGGAAG |
AGGAGAAAAG |
ATGGCTGCTC |
AGATGCCAGG |
AACCAGAGGA |
GGGCACGGAG |
GGAGGGCCCA |
|
10721 |
GGCTGCGGCT |
CTGGTCTCCA |
GGATCTGTCG |
CACTGGGGAC |
ATCCCTGTCC |
CTGCCGAAGC |
AACTGGCTCT |
GTCACCTGTG |
|
10801 |
AGACTCAGAT |
GCCCTTTTCT |
CCCCAGAACT |
CGTCGGCGCC |
TCTGGCTGAG |
GACATCGACC |
CTGAGGTAAG |
GCCCAGCATG |
|
10881 |
GGGTGCATCC |
TCTCACGGTT |
TCTGGCCTTG |
GGAGTGACCC |
CCTCATTGCC |
TGCAGCTCGC |
ACGGTATCTC |
AACCGGAACT |
|
10961 |
ACTGGGAGAA |
GAAGCAGGAG |
GAGGCTCGCA |
AGAGCCCCAC |
GCCATCTGCG |
CCCGTGCCCC |
TGACGGAGCC |
GGCTGCACAG |
|
11041 |
CCTGGGGAAG |
GGCACGCAGC |
CCCCACCAAC |
GTGGTGGAGG |
TGAGGGGGCC |
ACTCCCGGCA |
TTCCTAGTGG |
CAGGGTCCCT |
|
11121 |
TGGAAGGGGT |
GGATGCGGGA |
CAGGTTGGAG |
GCCCCACTCA |
TTCTCTCTCT |
TCCAGAACCC |
CCTCCCGGAG |
ACAGACTCTC |
|
11201 |
AGCCCATTCC |
TCCCTCTGGT |
GGCCCCTTTA |
GTGAGGTAAG |
CTGTGGCTCC |
CTCCACGGGC |
CAGGGCAAAA |
CATGGCCTCC |
|
11281 |
TGGCCCACAG |
CGCCAGGCAC |
ATGGCACAGG |
TGCCTGCCCT |
AACCAGAGGG |
CCGTGCTAGA |
GCAAGGGTGT |
CTGCCCCAGC |
|
11361 |
CCAGCCCTGG |
CCTGCCCTGC |
CCTGCCCTTT |
GTGGCCTCTC |
CCAATGGAAA |
CTCTACACCA |
GGCTGTGGTC |
CAGAGCTCGG |
|
11441 |
GCCACTCTCT |
GTGGACCTAA |
CATGGACCTG |
AATATTCCAG |
AGCAGCCTAG |
AACCATCAGA |
TGAGTCCTTG |
GCGTTGCCTG |
|
11521 |
GGGCTTGTGG |
GCTGGTTGGC |
TGTCAGAGCA |
CAGGTCCCTG |
GAGAGGGAGC |
TGGCAGTGGG |
GCCCTGAGCC |
AGCTCCGTCC |
|
11601 |
TGACCAGGGC |
TTGCAGCTGG |
ACAAGGACCC |
CGCCTCCAGG |
GCCTCGCCTT |
CCTCAGCTGT |
AGAAGGGGCT |
GCTTGCATAA |
|
11681 |
GGAGCAGATG |
GACTCTGCTC |
CAGGCTTGAG |
TATAGCTGGG |
TGCCTCCATC |
CCAGGCCCCA |
CCAGGGAGGC |
TGGCTGGGGC |
|
11761 |
GTGGCCGCAC |
TCATCCAGAA |
CCCTGCTCTG |
CCTGCAGCCA |
CAGTTCCACA |
ATGGCGAGTC |
TGAGGAGAGC |
CACGAGCAGT |
|
11841 |
TCCTGAAGGC |
GCTGCAGAAC |
GCCGTCACCA |
CCTTCGTGAA |
CCGCATGAAG |
AGTAACCACA |
TGCGGGGCCG |
CAGCATCACC |
|
11921 |
AATGACTCGG |
CCGTGCTCTC |
ACTCTTCCAG |
TCCATCAACG |
GCATGCACCC |
GCAGCTGCTG |
GAGCTGCTCA |
ACCAGCTGGA |
|
12001 |
CGAGCGCAGG |
CGTAGGTGCC |
CGCGCCACGG |
GGCCTCGGCT |
CAGGGGCAGC |
CAGGTGTTGT |
GAGCGCCATC |
CTGGGCCAGG |
|
12081 |
GCCTCCCCTG |
AGGGTGCTGA |
GCTCTTGTGA |
GTCCTTCATT |
GGGGCCGTGG |
CTTCCTCCAG |
AGAGTGTCAA |
CAGGAGGTGG |
|
12161 |
TCTCAGTGAT |
GACCATGCCC |
TGCCCTGCCC |
TGCCCTGCCC |
TGCCCTGCCC |
TGCCCAGGAC |
CCTCTGCCTG |
CCTCCCCCAG |
|
12241 |
AGCCCAGCAC |
CTTCAGAGCC |
CTCTGCAGAA |
GGTTGGGCAC |
AGGGCGGCCC |
ATCTGCGTGT |
CCGTTCCACC |
CAGGAGCTTC |
|
12321 |
TCGGCACTGT |
GCCGGAGTGG |
TCAGGGTTGC |
TCTGTCATCT |
GCCCACAGTG |
TACTATGAGG |
GGCTGCAGGA |
CAAGCTGGCA |
|
12401 |
CAGATCCGCG |
ATGCCCGGGG |
GGCGCTGAGT |
GCCCTGCGCG |
AAGAGCACCG |
GGAGAAGCTT |
CGCCGGGCAG |
CCGAGGAGGC |
|
12481 |
AGAGCGCCAG |
CGCCAGATCC |
AGCTGGCCCA |
GAAGCTGGAG |
ATAATGCGGC |
AGAAGAAGCA |
GGTGCAGTGG |
CTGCCCAGCC |
|
12561 |
ACAGGCCGGG |
GCCGGCTGGG |
GGACCTCGCA |
GCATAACCAG |
CATGTTTTTG |
CCGCACAGGA |
GTACCTGGAG |
GTGCAGAGGC |
|
12641 |
AGCTGGCCAT |
CCAGCGCCTG |
CAGGAGCAGG |
AGAAGGAGCG |
GCAGATGCGG |
CTGGAGCAGC |
AGAAGCAGAC |
GGTCCAGATG |
|
12721 |
CGCGCGCAGA |
TGCCCGCCTT |
CCCCCTGCCC |
TACGCCCAGG |
CATGTGCCAT |
CCTCCCGCCA |
CCCAGAGGCT |
TGTGGGCTGA |
|
12801 |
GGACCAACTC |
TCACCGCTGT |
CTCTTTTGTC |
CCCAGCTCCA |
GGCCATGCCC |
GCAGCCGGAG |
GTGTGCTCTA |
CCAGCCCTCG |
|
12881 |
GGACCAGCCA |
GCTTCCCCAG |
CACCTTCAGC |
CCTGCCGGCT |
CGGTGGAGGG |
CTCCCCAATG |
CACGGCGTGT |
ACATGAGCCA |
|
12961 |
GCCGGCCCCT |
GCCGCTGGCC |
CCTACCCCAG |
CATGCCCAGC |
ACTGCGGCTG |
GTAAGGACGG |
GTCGGGGCAG |
AGACCATGCC |
|
13041 |
TTTTATCCCT |
CGTCTTTATT |
TTAGCCGAAT |
TTACAGAAAA |
GCAGTAAGAA |
TGGTGCGAAG |
GGTTTTCCCA |
GTTGCCCCGA |
|
13121 |
TGTGAATGTT |
GTCCCTGCTT |
TTTCCTGTTT |
CCATGCAAAC |
GCACTCACAC |
AGGCTTTCCT |
GCTGGAGCCG |
TGTTTCAGTC |
|
13201 |
CGTGCGTGGC |
AGGTACTGCC |
CTCTCCCCTG |
AATGCGACAG |
CGAGCATTTC |
CTGAAACCAG |
CAGGTGCTGG |
GCTGCCGATC |
|
13281 |
CTCGTCTGTA |
GCAGGCTCAG |
CCCTCATGGA |
CTGTCCCAGT |
GGTGTCCTGT |
AAACAGCACG |
CCCCCTGCTG |
CGTGGCATTT |
|
13361 |
GCCCCCCCCC |
CCCCCGCCTT |
TAATCTAGAG |
TGTTCCCCAG |
TCAGCTTCTG |
TGTTTTGTGA |
CATCTACTGT |
CCCAAGAGCA |
|
13441 |
CTGGCCTGTC |
ATTCGGCAGA |
CCGTCCTCAG |
CTCAGGTTTG |
TCTGGTGTCT |
CCACGCGGTG |
ACCATAAGGC |
TGTGCTTTTC |
|
13521 |
TCAGAGTGGC |
AGCCGAGGTG |
ACGTGCATCC |
CGTCAGTCCC |
ATTGCTGGCC |
GTTCGGCTGA |
GACGGGGTCT |
GCCAGGGGTT |
|
13601 |
TTCACTGTGA |
AGTGACTGTT |
TTGCCCTTTA |
CAGATGTCTC |
TTGGGGGAGG |
CCTTCCAAAT |
ACCCTGGGAC |
TCCTCAAGGT |
|
13681 |
CTAGCATTTT |
TTGCTAATGT |
TGCCTGTGTT |
TATTACGGTG |
TTTGCCAAAT |
AGCGCCTTCT |
AATTCTGCTG |
TTGTCTTTCC |
|
13761 |
ACTGGCGGGA |
GGTACTCTTC |
CCCACTCTCC |
TGCTTATCGT |
CAGTCACAGC |
AGCGTGGAGA |
CGTGTTTATT |
TTTTCTTTCA |
|
13841 |
AAGGTGACAC |
TCCTCTGTCA |
TCACTCTTTA |
TTTTTAGAGT |
CTCGCTGTCA |
TCCAGGCTGG |
AGTGCAGTGC |
CATGATCATG |
|
13921 |
GTTCACTGCA |
GCCTCGACCT |
CCTGGGCTCA |
AGCAATCCTC |
CCATCTCAGC |
CTCTTGAGTA |
GCTGGGACTA |
CAGGATTGCA |
|
14001 |
CCACCATGCC |
TGGCTAATTA |
AAAAAAAAAG |
GGCTGGGTGC |
GGTGGCTCAC |
GCCTGTAGTC |
CCAGCACATT |
GGGAGGCTGA |
|
14081 |
GGCAGGTGGG |
TCATGAGGTC |
AGGAGTTCAA |
GACCAGCCTG |
CCAGGATGGT |
GAGACCCTGT |
CTCTACTAAA |
CATACAAACA |
|
14161 |
TTAGGTGGGC |
GTGGTGGCAC |
GTGCCTGTAA |
TCCCAGCTAT |
TCGGGAGGCA |
GAGGCAGAGA |
ATGAATTGAG |
CCCGGGAAGT |
|
14241 |
GGAGGTTGCA |
GTGAGCCGAG |
GTTGCACCAC |
CACACTCTAG |
CCTGAGCAAC |
AGAGTGAAAC |
TCCATCTCAA |
AAAAAATGTA |
|
14321 |
GAAATGAGGT |
CTTTCTGTGT |
TGCCCAAGCT |
GACCTCAAAC |
TCCTGACCTC |
GAGTGATCCG |
CCCTTCTCGG |
CCTTCCAAAG |
|
14401 |
TGCTGGGATT |
ACAGGTGTGA |
GCCACCGTGC |
CCAGCCCCGT |
TACTTTGATG |
CTCAAAGTTT |
CCCAGGTGTG |
GCCATGGGAG |
|
14481 |
GCGTCAGTGG |
CTTCTCTGTT |
GCGTAGGGCC |
ATTGGTTTGG |
GTTGCTCCTG |
AGTTTCTGTT |
GCTAGATACT |
CTGTGCTCGT |
|
14561 |
CTTGTATGTT |
CTCAACCCTG |
GAAGCAGCCA |
TTTCTCCAAG |
GAGCCCTTGT |
TCCCTTTAGT |
GGAGGGTGGT |
CTGGATCTTG |
|
14641 |
GTTAGAAGCC |
AAGATCAAGA |
TTCTGGGTGT |
GCTCACAGCT |
CTAGGTTGTT |
AGCAGACAGA |
GCTGGAAAGT |
GTGTATGTGA |
|
14721 |
ATACAAGTGC |
ACAGTTGTGG |
GTGCGTCTGT |
GTATGTGCGA |
ATACATGCGT |
CTTTGTATAT |
GATAAAAACC |
GTGAGTGCAG |
|
14801 |
CCAGGTGCGG |
TGGCTCATGC |
CTGTAATCCC |
AACACTTTGG |
GAGGCCAAGG |
CGCGTGGCTC |
ACCTGAGGTC |
AGGAGTTGGA |
|
14881 |
GACTGGCCTG |
GCCAACATGG |
TGAAACCCCG |
TCTACTAAAA |
ATACAAAAAT |
TAGCCAGGCG |
TGATGGTGCA |
CACCTGTAAT |
|
14961 |
CCCAGCTACT |
TGGGAGGCTG |
AGGCAAGAGA |
ATCACTGGAA |
CCTGGGAGGT |
GGAACTTGCA |
GTGAGCCAAG |
ATCATGCCAT |
|
15041 |
TGCCCTCCAG |
CCTGGGTGAC |
AGAGCGAGAC |
TCCGTCTCAA |
AAAAAAAAAA |
AAAATCCATG |
AGTTAATTCC |
AGTACTTGCA |
|
15121 |
ATTTTAATCC |
AATAGCACAG |
TTTTCTTTTG |
CATTGTTTTC |
AATATACTCA |
ATTGCTCAGC |
CCTAGAATAC |
ACAAAAAGTA |
|
15201 |
GTTTTAAGAA |
TGCTAACCTG |
GCCCACTGTG |
GAGACGATGC |
CTCCTAACTG |
GAGTGTAACA |
TTTGTCTGTA |
GTTCTTGTCA |
|
15281 |
TTTGTAGCCT |
GAGGCCGTGG |
AGTCCAGATT |
TTGGGTTCAG |
AAGTTACTGG |
AATTCACTAT |
CCCATCCCAT |
TGGTCAGACC |
|
15361 |
ATGTCACTCA |
TTTCAAATAC |
AGTTTGGTTC |
ATTTTTTCTG |
ATTTCATTCC |
ATTCTAGGAT |
TCTCCTCCCA |
TCCTTGTTGG |
|
15441 |
TTTATCTTCT |
TTTTTGAGAC |
GAAGTCTCGC |
TCTGTCGCCC |
AGGCTGGAGT |
GCAGTGGTGC |
GACGTCAGCT |
CACGGCATTC |
|
15521 |
TCCGCCTCCC |
AGGTTCAAGC |
AATTCTCCTG |
CCTCAGCCTC |
CTGCGTAGCT |
GGGATTACAG |
GCGCCTGCCA |
CCATGCCCAG |
|
15601 |
CTAGTTTTTG |
TATTTTTATT |
AGAGACAGGG |
TTTTACTATG |
TTAGACAGCC |
TGGTCTCGAA |
CTCCTGCCAT |
CATGATCCGC |
|
15681 |
CTGCCTCGGC |
CTCCCAAAGT |
GCTGGGATTA |
CAAGCATGAG |
CCACCACGCC |
CCGCCTATGT |
TCCTTTTTTT |
TAGAAACGTG |
|
15761 |
AAACCTGAAC |
TTGGTTCCAA |
AAGTCACAAC |
AAAAAGGAAA |
ACTCCTGGCC |
GGGCGGTGGC |
TCACGCCTAT |
AATCCCAGCA |
|
15841 |
CTTCGGGAGG |
CCGAGGCGGG |
TGGATCACCT |
GAGGTCAGGA |
GTTTGACACC |
AGCCTGACCA |
ACATGGTGAA |
ACCCGTCTCT |
|
15921 |
CCTAAAAATA |
CAAAAACTTA |
GCTGGGTGTG |
GTGGTGCACG |
CCTGTAATCC |
CAGCTGCTCG |
GGAGGCTGAG |
GCAGGAGAAT |
|
16001 |
TGCTTCAACC |
CAGGAAGCAG |
AGGTTGCAGT |
GAGCTGAGAT |
CGCGCCACTG |
CACTCCAGCC |
TGCGTGACAG |
ACTATCTCAA |
|
16081 |
AAAAAAAGTA |
AAACTCGGCC |
GGGCGCGGTG |
GCTCACGCCT |
GTAATCCCAG |
CACTTTGGGA |
GGCCGAGGCG |
GGCGGATCAC |
|
16161 |
AAGGTCAGGA |
GATCAAGACA |
ATCCTGGCCA |
ACATGATGAA |
ACCCCATCTT |
TATTAAAAGT |
ACAAAAAAAA |
ATTAGCCGGG |
|
16241 |
CGTAGTGGCG |
TACGCCCGTA |
ATCCCAGCTA |
CTCGGGAGGC |
TGAGGCAGGA |
GAATCGCTTG |
AACCCGGGAG |
GCAGAGGTTG |
|
16321 |
CAGTGAGCCA |
GGATTGTGCC |
ACTGCACTCC |
AGCCTGGTGA |
CAGAGCAGGA |
CTCCATCTCA |
AAAAAAAAAA |
AAAAAAAGTA |
|
16401 |
AAACTCAAGA |
GTAGGGTGTC |
CCTTCCTCTC |
CTCCCTGTTG |
CCCTGGCTGA |
ACCATCTCCC |
CTGTCTTGTT |
TGTCACAGAT |
|
16481 |
CCCAGCATGG |
TGAGTGCCTA |
CATGTACCCA |
GCAGGGGCCA |
CTGGGGCGCA |
GGCGGCCCCC |
CAGGCCCAGG |
CCGGACCCAC |
|
16561 |
CGCCAGCCCC |
GCTTACTCAT |
CCTACCAGCC |
TACTCCCACA |
GCGGGCTACC |
AGGTACACAG |
GAAGGCCGCT |
CCTCTCCTTC |
|
16641 |
CAGGGCCAGC |
CCCAGCCCCA |
GCCCCAGCCC |
CAGCCCCTTC |
TCCCATGGCA |
CTCATTCCCT |
CCGCAGAACG |
TGGCCTCCCA |
|
16721 |
GGCCCCACAG |
AGCCTCCCGG |
CCATCTCTCA |
GCCTCCGCAG |
TCCAGCACCA |
TGGGCTACAT |
GGGGAGCCAG |
TCAGTCTCCA |
|
16801 |
TGGGCTACCA |
GCCTTACAAC |
ATGCAGGTAC |
AGTGACCTCC |
AGGCCCTGCT |
GGGGGCCAGG |
GTGGGGGAGC |
AGTTGATGAT |
|
16881 |
GCTGAGGGTC |
CTTTGGTGAG |
GCTGGCAGGC |
ACTGGGTGGG |
CTCCACCCCT |
TCTGCTCCTC |
CCCCTCCACT |
CTCTGGGTGC |
|
16961 |
TCCCTGTGGT |
CACCTTTGGA |
TTGTTGCAAG |
CCAGAAACAT |
CCCCGCCTGC |
CTGGTCACAG |
GGCTACTCTC |
TCACATCTGA |
|
17041 |
CGTCTTCTCA |
CAACAGAATC |
TCATGACCAC |
CCTCCCAAGC |
CAGGATGCGT |
CTCTGCCACC |
CCAGCAGCCC |
TACATCGCGG |
|
17121 |
GGCAGCAGCC |
CATGTACCAG |
CAGGTGAGCC |
ATTCCCGGGG |
CCTCACAGCG |
GCACCCGCAG |
GGCACCCTCA |
GGCTTCACGG |
|
17201 |
TTTAGCAATG |
GGACCCCTGG |
CCCCAGTGGG |
GATTGTCCCG |
TACGTCTGCT |
CACAGGGGGA |
GACGCATACC |
CGAGGCCCCT |
|
17281 |
TCTCAGCAGA |
CAGAACGAGG |
ACACAAGTCT |
CAGGGCAGAT |
ACGCGCATCC |
GCAAGTGTAG |
ACAGGAAAAA |
CCGTGAATTT |
|
17361 |
ACTTGAAAGG |
CAAAGGCCGC |
ATGAGCTGGC |
TGCATGAGCT |
GCAGTCCGTG |
GACATGGATT |
ACAAGCACTT |
GTGGGTCTGT |
|
17441 |
GGCTCTGCTG |
GGACAAAAGC |
CTTCCTCCCT |
GGGCTGGGGA |
AGGGAGGACC |
AGGGCCATGC |
CTGCTTTCCT |
CCTGCACAGA |
|
17521 |
TGGCACCCTC |
TGGCGGTCCC |
CCCCAGCAGC |
AGCCCCCCGT |
GGCCCAGCAA |
CCGCAGGCAC |
AGGGGCCGCC |
GGCACAGGGC |
|
17601 |
AGCGAGGCCC |
AGCTCATTTC |
ATTCGACTGA |
CCCAGGCCAT |
GCTCACGTCC |
GGAGTAACAC |
TACATACAGT |
TCACCTGAAA |
|
17681 |
CGCCTCGTCT |
CTAACTGCCG |
TCGTCCTGCC |
TCCCTGTCCT |
CTACTGCCGG |
TAGTGTCCCT |
TCTCTGCGAG |
TGAGGGGGGG |
|
17761 |
CCTTCACCCC |
AAGCCCACCT |
CCCTTGTCCT |
CAGCCTACTG |
CAGTCCCTGA |
GTTAGTCTCT |
GCTTTCTTTC |
CCCAGGGCTG |
|
17841 |
GGCCATGGGG |
AGGGAAGGAC |
TTTCTCCCAG |
GGGAAGCCCC |
CAGCCCTGTG |
GGTCATGGTC |
TGTGAGAGGT |
GGCAGGAATG |
|
17921 |
GGGACCCTCA |
CCCCCCAAGC |
AGCCTGTGCC |
CTCTGGCCGC |
ACTGTGAGCT |
GGCTGTGGTG |
TCTGGGTGTG |
GCCTGGGGCT |
|
18001 |
CCCTCTGCAG |
GGGCCTCTCT |
CGGCAGCCAC |
AGCCAAGGGT |
GGAGGCTTCA |
GGTCTCCAGC |
TTCTCTGCTT |
CTCAGCTGCC |
|
18081 |
ATCTCCAGTG |
CCCCAGAATG |
GTACAGCGAT |
AATAAAATGT |
ATTTCAGAA |
|
|
|
|
|
|
|
|
|
|
|
|
>ref|Gene_ID:9146|HGS|NC_000017.10:79651019...79669147 (+)
GCGGAAGCGGAAGTCGGGGGGCGCGCCAGCTCGTAGCAGGGGAGCGCCCGCGGCGTCGGGTTTGGGCTGGAGGTCGCCAT
GGGGCGAGGCAGCGGCACCTTCGAGCGTCTCCTAGGTAACGCGTCCCCACCCGACGGCTCGGCCTTGGTCAGCCCGCAGC
CTCCGCCCGGCGCTGAGTCAGCGCGGCCCCGAGGGCCCGAGGGGCGCGGACCGGCCGCCCTGCCCGCCTCTCCCCGGCCC
GCTGTCCTCCCGGGTTCGGAGCCGCCGATCCTGGACCCTCGGCGCGGCCGTGGGGTCGGCGTCCGTCGGGCGTTGGCAAA
TGGGGCTCTGCGAGGGTCCGCGCAGGGCGCCAGGGGCGCTCGGCGACCTCGCTCCTCCGCGGGCCCGGGCATTCGGCCGA
TTTCCTCTTGACCGACGAGCACCCGGCCGAGTGGGCCACAAGGCCTGAGTCATGAAAGTGGGTGACTCAGTCGGATGTTA
GGCTCGCACTTGGGCTTCTGACCCAAAACACTGGGCCTCGAAGGGTGGAGCCGGCAGTGTTCTCTCTGGGACCTTCGAGC
TCTCTTGGGGGCTGCGGGTCCCAGACCCACCTTTTGTTCCCACACTGGCTTCGGGAAGTGCCTAATGGTTTAGGAAGGCG
TTTTCTCACGTTTTTGGAGGTGAAAGCTGATCCTATCTGAGCCTCCTGGAATTGACGCGTTTTGGGAGGGAAACGAATGT
TGATGTGCACGTGCCCAGCAGCACGTGGAAAATGCGTCTAATGTTTGGGGGTAGGAGCTTGGGTGGTCGAATGGGCTGCC
CAAGCTTGCCTTTTTGTCAAGTAGGGTTTTCAAGGAGACTTCAAGTGAGACACCTTCCTTGCCAAGGGAACCAGCAAGCA
GACTGAGTATCTCAGCCACACAGGTAGTTCAGTGCTTCAGGGATGAATCCAGAGGTTAACTACTGAATCAGTATAGAAAG
TGGACGTGGCTTGGCTTGTTAGATCGTTTCTGTATCCGGGAAAGGCGATGTGGGCAGAGCTGGAGCTGCCCCCCCAGAGG
GGCCAGTTGTTGGAGACAGCAGTGATGTGGTATGATGACAGGTATGCTGGCGAGTCCTCCTTGGAGCCCCGTTCTGCCCT
TATTTCCAAAGCACTTGGCTTGGAGCAAGGGGCCTGGAGTCTACACTGCAGGCCTCTCTAGCCCTTGCCCAAGCTAAGGT
CCCATTGAAACCGGGAGCATCTGGGCGAACGGGGACTCCAAGGGACGGGTGATAGGGTTCTTTGTTGAGAAGCAGCCGAG
CCTGATGTTTAAGAGTGAATCAGCCTCTGTCTCTGGTGAGCTGTGTGACTCACGGCAGTGCCCTCGCCTCTCTGCTGAGT
GTCTTTACCTCGCTCGCGGGTTGTGGGAGGGCTTGGACCAGCACCTGGTGTGCTCTTACGCTGTTTGCTGCTTCCCTGGG
GAGGATGGCACTTCCCTGCCCATCAGCACGCTCAGGAGGATTCTGTCATGCTTTTTCATTTGGTCCAAATGTCTAAAGGG
GAAGGAGAGAGTCCCCCCCAGCTGCTGCCAAGCTGGGGTGCTGCACGGGGCGTCCAGCAGGATAACCCCTCCCTTTCATG
GCATTTTCTCTCTCAGACAAGGCGACCAGCCAGCTCCTGTTGGAGACAGATTGGGAGTCCATTTTGCAGATCTGCGACCT
GATCCGCCAAGGGGACACACAGTGAGTTAGCGGGGCCTGTGCCCTGATGCGGAGGAGCAGCCGTGCACTAAGCTGGGCTC
GCTTGGTGCCCGTTGGGTCTCCACGACGTAGCTGACCGTGTTAGGCTCTTATGTGGAGAACTGCATGGGCTACATGCTAG
GATTTTTTGACATCTTGGAGGATTAAGGGGTTTTCTCTGTACAGCTTGCAGATACCTGGTTATGTTTGTATTTGCTTCGG
GGTTCTCAAAGACTAAGATGGCAGACAGACTTTTTTTTTTAAGACGGGGCCTTGCTTTGTCACCCAGGCTGAAGTGCAGT
GGCGTGATCTCAGCTCACTGCAGCCCCTGCCTCCCAAGCTCAAACGATCGTCCCACCTCAACCTCCTGAGTAGCTGGGAC
TACAGGCATTCACCATCACGCCCGGCTGATTTTTTGTATTTTGGTAGAGATGGGTTTTTGCCGTAATGCACAGGCTGGTC
TCAGACTCCTGGGCTCAAGCAATCCTCCTGCCTTGGCCTCCCAAAGCACTGGGATTACAGGCGTGAGCCACTGCACCTGG
CAGCAGATGAGCTTTTCTCATCTTAGTTGCTGAGACTATTTTTTTCCCGTTTTTCTCTGCTTTTATCAATGTTTCCTTTT
CAGAGCAAAATATGCTGTGAATTCCATCAAGAAGAAAGTCAACGACAAGAACCCACACGTCGCCTTGTATGCCCTGGAGG
TAAGCAGACCCCCGTGCCTCAGTGGCCCCCAGGGTCCCTACCCCCTACTAGTCACGACAGCATGAGAAGAGTTTGTCCTG
TGGCCTTGCCCACCTCCCTGGGCATACATCAAGCAGGAGAGTTTCCACTCAGCCTTGAGTGCCTGTGTGAACAGTGCCAC
CTAGGAGAGGGTCACGCTGAGGGAGAGACAGGAGATAGGGGAGGGAGTGGGCACCACTTCTTCAGCCGGTCCTGCGTGAG
TAAACACCAGAAGGCGGATGTCATTGCTGTGCTTGGCAGTGAGACTTGGCGTCACGGAGCGGTGCTGTGCAGGCTGCTGA
GTCTCAGATGTGGGTTCCCCACTGGCCTGACGGGCCTGTTGAAGGGCTGCTGTCCCACCGGGTTCCCCACCGGCCACTGA
CGGGCCTGTCGAAGGGCTCTGCTGTCCCACCGGGTTCCCCTCCGGCCACTGACGGGCCTGTCGAAGGGCTCTGCTGTCCC
ACCGGGTTCCCCACCGGCCACTGACGGGCCTGTCGAAGGGCTCTGCTGTCCCACAGGGAGGTGGGTCTGTCCGGGAGGAG
GAGGGGCCCTGCCCTGACCACTGGCCCAACCCTTCCCTTCCTGGGCATCTGCAGGTCATGGAATCTGTGGTAAAGAACTG
TGGCCAGACAGTTCATGATGAGGTGGCCAACAAGCAGACCATGGAGGAGCTGAAGGACCTGCTGAAGGTGGGTGAGACGG
GGGCATGCGGGTGGCCACCCAGGCTGGCACCTTTGCTTCTCCGGGTGTTTACTGGGCACTGATGACAGAACAGCACCAGG
TGTGGCGGAAAATGTGCAGGTGCAGATCGGTGGTCCCTGCACTGCGCAAGCTCGCACTCATGAGTCGGAGAGACAGGGCC
GTGAGCCCCAATCCAGGGAGGTCAGGGTTTAGCAGGAGCCGGCCTGGGGTGCTGTGTCTGGAGGACCAAGGTGACAGCGA
CCCTGGGGCCTGGGGGAAGCTTCCGGGAGGCAGGGCTTGGGCACCTGCTGTCTTAGGGAAGGTGAGTAGATGCCACTGGG
TGGGCAGGGCAGGTGGTGGCTATCGTGTGTGAGAGTGGCTGTGGTCCGGAGGGCAGGCTGGCCACATCCCACAGGCAGGG
GCCTGGGAGGGCTCTGAGCGGGGGCTTGTAAGATCGGATGTGTCTGGCGGGGATAAGAGCTGTTTTGGAGAGCGAGGGCT
TCATGCTTTCCTGGGCAGAGGAGGAAAAGCCCTCCAGATCCCCAGGAGATTGTGCCGGGGACATCTGCAGGGATTGTGCT
GGGGACACCTGCAGGGACTGTGATGTGGACTGTGATGCAGTGGGGATGCTGTTGCAGGTGTCAAGGAGAAAGAGTGTCCC
ATGTAGTCAGGCTGCTCCCCTTCTGTCCCAACCTGTGGGCCAGACCTGGGTCTCATAGGAGACTCCCTGGACTCACACAG
CTGAGAACCTGCCCGGGGATCCCGGTGGACAGCCAGGCCTCTCAAAGGCCCTCACTGGCCACTTTGGCTGGCTTCCTGTC
CCGTCCTGTCCTGGCTGAGAGGCGGAAGGTGCCGGAGTCTGGCCCTGCCCGACCCTGGTGCTCCTGAGCTCCTTCAGTCA
GGCTGCAGATCCTGAGCCGGTTCGCTCCTCTGGGGTGAGGCAGGGCTGTGAAGGGAAGGCGCCGCTGAGGATGCAGAGGG
GTTCTGGAAAGGAGCTTAGGCTGAGGTGCAAGTGCCCCGGAGCCTGCTTGCAGCAGCCTCTGATCGCACGCCGTCCTGCA
CGTCACACCGGGACGGCGTCCCCGAGAGGGCCGTAGACGCGAGGCCAGGCTGCGCTCTAACCGGGGGAGAGGATGGCTGC
AGGGGGTCTGGAGCCCGCTCTGAATTTGGAGGCATTTCTGGAGCGGCTGCCCCCCGCGTCTCCCCGGCTGTGACTGGAAG
GAGGGGCGTGGCCAGAGCCCCGCAGTACCCCTGTGTTCTCAGCCCCAACACCACGCTGCCACCGTCAGGCCTGCCGGGTC
CTGCCTGTTGGAGGCTGACGGGAACCGGGGATCCTGGGGTGGGCAGGTCCCGTGGGGAAAGGGAGAGACAGATGGTCAGG
GGCACAGAGACGCGGCGTTGTTTGGCTCCAGCAAGTAGAGGCAGGCGTGAGGTTTGAACTCAGGGTCTGGGAACAGGTTG
GTGCATGGCCCCCACGCCCCACTCGGGGGGCCCTCCCTGGCCTCTGTGTCAGCCCCATCTGCTCAGAGGCTGAGGTCTGC
CAGCTGCTTCCTCACACCGAGGCCTCTGGCGCCTGGAGTGTCCTGCGACCCTCACCCCCTTCTCCCTGCCTGCAGAGACA
AGTGGAGGTAAACGTCCGTAACAAGATCCTGTACCTGATCCAGGCCTGGGCGCATGCCTTCCGGAACGAGCCCAAGTACA
AGGTGGTCCAGGACACCTACCAGATCATGAAGGTGGAGGGTGAGTCAGGACTGAGGTTGGGACCAGGTTGAGGCTTGGAA
CTGCTGGGCAGTCAGGCTCAACGGGCACAGTGGCGAGGGGCCTGGGAAGATGGGTTGTTCCCTGTGTTGGGAGGGAGGAG
GGCGGTGGCCTGGAGCCAGGGAAGACCGTGCATGTGAGGGCGGGTTCAGACCGGCTCCTGCCTTTCTGGACCCACGGCCC
ACGGGCAGGGGCAGGCAGTGATGGCAGACAACACACAGGTGAAAAGGCAGCAGACAGCCTGTTTCAGAGTGGCAGGACTC
CTGGGGGCCGGGTTGGGCTGTGCAGTCAGGGAAGGCCTCTGAAGGAGCTGTCCCAAGTGTCGAGACTGGAACAGAAGCCA
GAGCCGGCGTGAGCAGGCAGGCAGCACCCGTGGGCAGACACTCAGGGTTGAGGCCCCAGGGCAGAAAGGTGGCCAGCGTG
CCAGAGGAGAGTGCAGCCGGGCCTGTGCGCCGAGGGCCTCCAGGCTGTTGGGCTGGAGTTCAGGGTTGCTGTCAGCGCAG
CAGGAGGCCTCTGGTGGGTTTTCGTCTAGGTCGTGGCTGCCAAGGTGTGTTTCTTAAGGGTCCCTCTCGGCCATTGTGGC
CGTGGAGAACTAGGGGATTGTGTAGAGAGAGGTGGGGGCTTCAGGATGCATTTGGGAGTCTTGGCTGGTGCTCAGGGCCC
ACTCAGTGAGAGAGAGGTGGGCATGGCTGCTCACATCTGTGCGGGTTCTCTGTGGCCCTGAGTCTGCTGAAGACGAGGCT
GAGAGCGTGGGGAATGGGGCAGGCTCCAGGGGTGGTGCGTCCCCGCTGCAGGAGTTGGACGCTTGCCCTGTGGCTGCAGG
GTGGGCAGGACCAGTGTGGAGAACACGGGAAGAACGCAGGGGGCGCTGCACGAGCTGCCTTGGACCCTCAACTTTGGGGT
GGTGGATGCTGGTGGTCTGTGCTCCTGTCTCCTTCCTGGGGTGGGAGCTCTCCTGGATTTTTTCTGGGGTCGTGTGTCAA
GAATAGCTCAAGGTCTTGTCCTCAGGATCAGTGGGAGCCCTGGCCAGAGAGCAGAGGGTGAGGGAGCCACTCAGCAGGCG
GGGCCACGCAGGGGCCTCATGCCCTGATGGAGTGGGAGCAGTGTCTCAGGAAGAGGGCAGCAGGGCCTGGCTCTTGTCTC
CCGCCACAGTGAGGACAGAGGCCCCATGGGGACACCCAATCTTGGGGGCAGGAGGCTGTCTCGGTGCCCCCAGACACCTT
CACTTGGGTGCACGGTCACTGCTGCGTTGCAGCTGCCGAGAGTGAGGAAGGGGCTCCCGGAAGTCTTGCAGGCCAGGTGG
GAGCAGGCAGTGACTATGGCTTCATCTCTCCAGGGCACGTCTTTCCAGAATTCAAAGAGAGCGATGCCATGTTTGCTGCC
GAGAGAGTGAGTGTGGGCGGCCGCCAGGGGTTCTGGAGTCGGGCTGCTCAGGAAGCGTGAAGGGGAGTGCTGGGAGCCCG
GCTTGTTTGAGGGTTGGTGGCTGAGCTCTCGTCTGTCTGGGACCTGAGGATGCCTGTGTGTCCTGGGCGGAGGCGTAGCA
GCTGTCTCTGCAGGGCGAGGTGGAGACGTGGCTTGAGGGCCTTTAGAGTGGCTAGGGGGCTCGGGAAAGGAAAACAAGCA
CCTTTGATGAGGAAGGAAGTCCCTTCCTCAGGCCTCGGCAGCTTCACCCCAGGCCACGCCGCTGGGAAGGGCCGCCCGGG
GCATTGCTCTCGCTCGTGCTGGGGTCCTCTCTGGGTCCGTTGCCTTCTGTGGGGCAGGGGAGGCTCTGTGCCTCTGGGGC
TGGTGGGGCGGGAAGCGTGTGGTCCTGACTGCTGCCCCTCCTCAGGCCCCAGACTGGGTGGACGCTGAGGAATGCCACCG
CTGCAGGGTGCAGTTCGGGGTGATGACCCGTAAGGTGAGTTCCCACCTGGGGGGCTCTACAGCCCCGGCCAGACACCAGG
TCCCCTGCCGTGCACAAGGCCACCGTCCCTGCTGGGCCTTTCCTTCCTGGTGGGTGGTTCAGGAGGTGAGTGCAGCTCGC
AGCGGGTGGCAGTGTGGCGTCAGGAGGGGGACAGTGAGGGCCCACGTAAGGGCCTGATGCCACTGCCTGGGCCGGGCCCA
GCCTCACTCTGGAATTATAATGTTTTAAACCTAGGGCTACAGGCCTTGCCACAGTCACTCTGCTGCTTGCCAGGGCCGTG
GGGCCCCCTTCTCTTTGTGGCTTAGCCCATCTGGATTCTTACCCTCGAGGCATCTTCACATCAAAACCCCCTCCAGGCTG
GCAACCGGCAGTGCTTGGGGTTTGGGGTGTCAGCAGCTTCCAGCACCCACTGCACTCACAAAAGCTCTTGTTTTATCAGC
AGAATTGATGTGTATTTTTTCCTTGCCCTTACTTTTAACTTACCTTATTTTCCCCAAAACGGTGGCTGGCGTTGAGACTC
CCGGGAGCATGTCCAGGTTCCCCGGCCTTAGGGTCTTCCCAGGCACTTGTTCTGCTTGTCCCTTGCCTTCCCCCACCTGT
GAGGCCCAGCTTCGGCATCGTACGGGGTGGTTCTGGGCCGGGTGGCGCATCAGGGTCCCCCAGTGCCTGTGACCAGGCCC
GCCCGCCCCATCTTACAGCACCACTGCCGGGCGTGTGGGCAGATATTCTGTGGAAAGTGTTCTTCCAAGTACTCCACCAT
CCCCAAGTTTGGCATCGAGAAGGAGGTGCGCGTGTGTGAGCCCTGCTACGAGCAGCTGAACAGGTGAGTCCCCGCCCCCC
ATTTGGGCTGCAGGTGGGGCAGGCTCTCCAGGCTGGGTTTTCTGTCCCTCTTGGCCATGGTGCCTGAGGCCTGCAGACCC
CAGAGGACCCTCACAGCACAGCAGCTGGAAGGTCAAGGGAAACCCAGGGTGGCCGCATGCCCTCGGACCCTGCCCCACAC
TAGGGCAGGTGGGTGTGAGAGACAGGGCGCCGCGGCTCCAGGGACCGAGGCTGCCCCGACAAACCTGTTGCTTGGGTTTG
GGTTTGGGTTTGTTTGCATTTCAACTTTCGGAATAAAACTTACAGAAAAGTTGCAAGAGTAGCACAGAGAAGCTGCGGGG
CCGCGGCCAGTGCCTCAGCGGTGGGAACCTGCGGGGCCGCGGCCGGTGCCTCGGCGGTCGGTGTTTTGTCGCAGAGTTTA
CTCTGCCTCCCCTCTCCTGCGCGTGCGTGTGTTCACACAGGTTTTATTTTGAATTTGCATGAGTGCAGATGTCATGCCCC
TTGGCACTCAGATCTTCGCCTGTGATCTCTGAGAGGGACAGTGCTCTCATGGCCACAGCAGTCACTCGGGACTGCGCTGC
CACCCAGTGCTGGCTTCTGCTCTGTGGTCCACGTTCCATTTCTGCCGTGGTCCCAGCAGCGTCGCTGTGGGTCTGGCCTG
GGTTGCGTGTGTTTCGTATGTGGGCCGTGCTCCCTGCTTGGTTCCCTTTTCCTGGAACGTGTCACTGCCTCCCTGTCTCG
CTCCGTGGACATTTCTGGGAGGTCAGGCCGTGGCCACCTGGCCCCCTGTTCAGGTCTGAGGCTCCCACCTGCTTAGGTTC
GGGAAGCTCAGGAGTGAGGCCATGCCCTCCTCAGGACATCCCATCCAAGCCAGCCATGTCCGGTGATGGGCCGCTGCCCG
GAAAGTTCCTTTTCCTTCTTGTAACTGAGAAGAACTTGCCTTGAGCCACGTCAAGTCCCGTCCGTCGCAGCCACTGCCCA
CAAGCGTGAGTCTGCTGTGAGCCAGCGGCTCCATGGCAGGGCATCCCAGCGCCATTCCTGCCTTCACACACACTTGCTGC
CGTTTCCCTGTGCTGGGGGCTGTGCAGGTCTGCCTCGGTGTGGACTTTTCTCTTAGGAAAGAGCCCCAGGTCGGCCGAGC
ACGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACGAGGCCAAGAGATCAAGACAATCC
TGGCCAACATGGTGAAATCCCGTCTCTACTTTTTAAGTATTTTATACTTAAAATTTTTGTATTTTATACAAAAATTAGCG
GGCTTGGTGGCAGATGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAAAATCACTTGAACCTGAGAGGCGGAGAT
TGCAGTGAGCCAAGATGGCGCCACTGCATTCCAGCCTGGGCGACAGAGCAAGACTCTATCTCAAATAAAAAAAAAAGAAA
AGAGCCCCAGGTCAAAGGTCAGCGTGCGTGGTTGGAAATTGCGACCATGCTGTGAGCCTGCGGCCCTTGGACTGCCTGGG
GCGCCCCGGAAGAGCTTACTGGGCATGACAGTCACTGACCTGTTTGCCCCTTCTTGGGTGGGGCCCGTCCCACAGTGGCG
GTGCTGTGTTTGGTTTCCCGAGTGCCGTAGGAAGGTCTTCGCAGGCCGGGCTGGTGTCCTGAATGTCCATTTTCAGGAGA
GGAACTGGGGCTTACCCTGAGGGACTCACCCGAGCCCTTGTGGCTGGTGTGATGCCCTCGAGTCCTGCCCTGCTCTGCTC
TGGAGTGTGGCCATTCGGACACCTGTGGCTGTGGAACCACTGTCCTCACTCTGTGGGGACACTTAGAGGAGCCCGCAGAG
GTGTGGTTGAGAGCTTTGGCGGGGGCAGGGCGGTGTCAGCGCATGGTGACCTGCAGCATTCCTTGTGCCCACAGGAAAGC
GGAGGGAAAGGCCACTTCCACCACTGAGCTGCCCCCCGAGTACCTGACCAGCCCCCTGTCTCAGCAGTCCCAGGTACTCA
GCCCCCTCCGTCCCGTGGGCACCTCTTCCCCGGCGCCCCCCCTCACCCTCCCCGCTTGTCCTCAGCTGCCCCCCAAGAGG
GACGAGACGGCCCTGCAGGAGGAGGAGGAGCTGCAGCTGGCCCTGGCGCTGTCACAGTCAGAGGCGGAGGAGAAGGAGAG
GCTGGTAAGCCGGGTGGGGCGGGGCGGCCTCAGGAGGGGCCCAGCTCCCCTGGATGTGCTGCGGTGGGGCCGGAGGGGCG
TCACGTGCACCCAAGTGACGCCCCTTCTGATTCTGCCTCAGAGACAGAAGTCCACGTACACTTCGTACCCCAAGGCGGAG
CCCATGCCCTCGGCCTCCTCAGCGCCCCCCGCCAGCAGCCTGTACTCTTCACCTGTGGTGAGCGGCCCTTGGGCTGGAGC
TCCCTCTCCTGGAAGGCAGTAGGGTTGATGGGGGACGCGGGTCCCTGAGCTGATTTAGTCAGGTTGGTGGGGGATGCGGG
TCCCCGGGCTTCCCAGAGAGGACAGCCCCAGCACACGGGCGGACATCAGGGCAGAGCCCCACGGCTCCAGGCACCCGTTG
CTCCTGGTCCCTGGTTCGGCCCTTGGGACTGAGGACATGGGACAGGCTGTGGGGGAGGCGCCTTGCTGGTGAGACCCCGT
CTCTTCCACACCCCTCCCGCCCCACAGGAACTGAAGGACTTGCTTTCTGGGGCATCCACGTCACCTCTCCCTGGCCTCAG
CCCCGCTCTCAGTAGAGGGTGAAGAGGCAGCCTGTTTTGCAGGGGGTTGGGTCGGGGACAGCAGGTTGAGAAAGTCATTT
TGTAGGTTCCGTGGAGTGGCCAGGCCGCTGCAGGCCCAGGGGTGCAGAACAGCCCTGCCCCAGTGAGCCACCCCTTCCCT
TCTTCCCTTCCTTCCGGGCTGTCGGCTGGGGCCCCACACGCTGGACCGTGGGGGCTGTCCAGTGTCCACCTTGAGGCCCC
AGGGCTGGCCCAGTGCCACCCTGTTCCTCCCATAGCAAGGTTAGGAGCCTTGTAGGACGGCAGTGCTGTCAGCCTCTGTG
GAGTTCTAAGCATCTGGAAGAGGAGAAAAGATGGCTGCTCAGATGCCAGGAACCAGAGGAGGGCACGGAGGGAGGGCCCA
GGCTGCGGCTCTGGTCTCCAGGATCTGTCGCACTGGGGACATCCCTGTCCCTGCCGAAGCAACTGGCTCTGTCACCTGTG
AGACTCAGATGCCCTTTTCTCCCCAGAACTCGTCGGCGCCTCTGGCTGAGGACATCGACCCTGAGGTAAGGCCCAGCATG
GGGTGCATCCTCTCACGGTTTCTGGCCTTGGGAGTGACCCCCTCATTGCCTGCAGCTCGCACGGTATCTCAACCGGAACT
ACTGGGAGAAGAAGCAGGAGGAGGCTCGCAAGAGCCCCACGCCATCTGCGCCCGTGCCCCTGACGGAGCCGGCTGCACAG
CCTGGGGAAGGGCACGCAGCCCCCACCAACGTGGTGGAGGTGAGGGGGCCACTCCCGGCATTCCTAGTGGCAGGGTCCCT
TGGAAGGGGTGGATGCGGGACAGGTTGGAGGCCCCACTCATTCTCTCTCTTCCAGAACCCCCTCCCGGAGACAGACTCTC
AGCCCATTCCTCCCTCTGGTGGCCCCTTTAGTGAGGTAAGCTGTGGCTCCCTCCACGGGCCAGGGCAAAACATGGCCTCC
TGGCCCACAGCGCCAGGCACATGGCACAGGTGCCTGCCCTAACCAGAGGGCCGTGCTAGAGCAAGGGTGTCTGCCCCAGC
CCAGCCCTGGCCTGCCCTGCCCTGCCCTTTGTGGCCTCTCCCAATGGAAACTCTACACCAGGCTGTGGTCCAGAGCTCGG
GCCACTCTCTGTGGACCTAACATGGACCTGAATATTCCAGAGCAGCCTAGAACCATCAGATGAGTCCTTGGCGTTGCCTG
GGGCTTGTGGGCTGGTTGGCTGTCAGAGCACAGGTCCCTGGAGAGGGAGCTGGCAGTGGGGCCCTGAGCCAGCTCCGTCC
TGACCAGGGCTTGCAGCTGGACAAGGACCCCGCCTCCAGGGCCTCGCCTTCCTCAGCTGTAGAAGGGGCTGCTTGCATAA
GGAGCAGATGGACTCTGCTCCAGGCTTGAGTATAGCTGGGTGCCTCCATCCCAGGCCCCACCAGGGAGGCTGGCTGGGGC
GTGGCCGCACTCATCCAGAACCCTGCTCTGCCTGCAGCCACAGTTCCACAATGGCGAGTCTGAGGAGAGCCACGAGCAGT
TCCTGAAGGCGCTGCAGAACGCCGTCACCACCTTCGTGAACCGCATGAAGAGTAACCACATGCGGGGCCGCAGCATCACC
AATGACTCGGCCGTGCTCTCACTCTTCCAGTCCATCAACGGCATGCACCCGCAGCTGCTGGAGCTGCTCAACCAGCTGGA
CGAGCGCAGGCGTAGGTGCCCGCGCCACGGGGCCTCGGCTCAGGGGCAGCCAGGTGTTGTGAGCGCCATCCTGGGCCAGG
GCCTCCCCTGAGGGTGCTGAGCTCTTGTGAGTCCTTCATTGGGGCCGTGGCTTCCTCCAGAGAGTGTCAACAGGAGGTGG
TCTCAGTGATGACCATGCCCTGCCCTGCCCTGCCCTGCCCTGCCCTGCCCTGCCCAGGACCCTCTGCCTGCCTCCCCCAG
AGCCCAGCACCTTCAGAGCCCTCTGCAGAAGGTTGGGCACAGGGCGGCCCATCTGCGTGTCCGTTCCACCCAGGAGCTTC
TCGGCACTGTGCCGGAGTGGTCAGGGTTGCTCTGTCATCTGCCCACAGTGTACTATGAGGGGCTGCAGGACAAGCTGGCA
CAGATCCGCGATGCCCGGGGGGCGCTGAGTGCCCTGCGCGAAGAGCACCGGGAGAAGCTTCGCCGGGCAGCCGAGGAGGC
AGAGCGCCAGCGCCAGATCCAGCTGGCCCAGAAGCTGGAGATAATGCGGCAGAAGAAGCAGGTGCAGTGGCTGCCCAGCC
ACAGGCCGGGGCCGGCTGGGGGACCTCGCAGCATAACCAGCATGTTTTTGCCGCACAGGAGTACCTGGAGGTGCAGAGGC
AGCTGGCCATCCAGCGCCTGCAGGAGCAGGAGAAGGAGCGGCAGATGCGGCTGGAGCAGCAGAAGCAGACGGTCCAGATG
CGCGCGCAGATGCCCGCCTTCCCCCTGCCCTACGCCCAGGCATGTGCCATCCTCCCGCCACCCAGAGGCTTGTGGGCTGA
GGACCAACTCTCACCGCTGTCTCTTTTGTCCCCAGCTCCAGGCCATGCCCGCAGCCGGAGGTGTGCTCTACCAGCCCTCG
GGACCAGCCAGCTTCCCCAGCACCTTCAGCCCTGCCGGCTCGGTGGAGGGCTCCCCAATGCACGGCGTGTACATGAGCCA
GCCGGCCCCTGCCGCTGGCCCCTACCCCAGCATGCCCAGCACTGCGGCTGGTAAGGACGGGTCGGGGCAGAGACCATGCC
TTTTATCCCTCGTCTTTATTTTAGCCGAATTTACAGAAAAGCAGTAAGAATGGTGCGAAGGGTTTTCCCAGTTGCCCCGA
TGTGAATGTTGTCCCTGCTTTTTCCTGTTTCCATGCAAACGCACTCACACAGGCTTTCCTGCTGGAGCCGTGTTTCAGTC
CGTGCGTGGCAGGTACTGCCCTCTCCCCTGAATGCGACAGCGAGCATTTCCTGAAACCAGCAGGTGCTGGGCTGCCGATC
CTCGTCTGTAGCAGGCTCAGCCCTCATGGACTGTCCCAGTGGTGTCCTGTAAACAGCACGCCCCCTGCTGCGTGGCATTT
GCCCCCCCCCCCCCCGCCTTTAATCTAGAGTGTTCCCCAGTCAGCTTCTGTGTTTTGTGACATCTACTGTCCCAAGAGCA
CTGGCCTGTCATTCGGCAGACCGTCCTCAGCTCAGGTTTGTCTGGTGTCTCCACGCGGTGACCATAAGGCTGTGCTTTTC
TCAGAGTGGCAGCCGAGGTGACGTGCATCCCGTCAGTCCCATTGCTGGCCGTTCGGCTGAGACGGGGTCTGCCAGGGGTT
TTCACTGTGAAGTGACTGTTTTGCCCTTTACAGATGTCTCTTGGGGGAGGCCTTCCAAATACCCTGGGACTCCTCAAGGT
CTAGCATTTTTTGCTAATGTTGCCTGTGTTTATTACGGTGTTTGCCAAATAGCGCCTTCTAATTCTGCTGTTGTCTTTCC
ACTGGCGGGAGGTACTCTTCCCCACTCTCCTGCTTATCGTCAGTCACAGCAGCGTGGAGACGTGTTTATTTTTTCTTTCA
AAGGTGACACTCCTCTGTCATCACTCTTTATTTTTAGAGTCTCGCTGTCATCCAGGCTGGAGTGCAGTGCCATGATCATG
GTTCACTGCAGCCTCGACCTCCTGGGCTCAAGCAATCCTCCCATCTCAGCCTCTTGAGTAGCTGGGACTACAGGATTGCA
CCACCATGCCTGGCTAATTAAAAAAAAAAGGGCTGGGTGCGGTGGCTCACGCCTGTAGTCCCAGCACATTGGGAGGCTGA
GGCAGGTGGGTCATGAGGTCAGGAGTTCAAGACCAGCCTGCCAGGATGGTGAGACCCTGTCTCTACTAAACATACAAACA
TTAGGTGGGCGTGGTGGCACGTGCCTGTAATCCCAGCTATTCGGGAGGCAGAGGCAGAGAATGAATTGAGCCCGGGAAGT
GGAGGTTGCAGTGAGCCGAGGTTGCACCACCACACTCTAGCCTGAGCAACAGAGTGAAACTCCATCTCAAAAAAAATGTA
GAAATGAGGTCTTTCTGTGTTGCCCAAGCTGACCTCAAACTCCTGACCTCGAGTGATCCGCCCTTCTCGGCCTTCCAAAG
TGCTGGGATTACAGGTGTGAGCCACCGTGCCCAGCCCCGTTACTTTGATGCTCAAAGTTTCCCAGGTGTGGCCATGGGAG
GCGTCAGTGGCTTCTCTGTTGCGTAGGGCCATTGGTTTGGGTTGCTCCTGAGTTTCTGTTGCTAGATACTCTGTGCTCGT
CTTGTATGTTCTCAACCCTGGAAGCAGCCATTTCTCCAAGGAGCCCTTGTTCCCTTTAGTGGAGGGTGGTCTGGATCTTG
GTTAGAAGCCAAGATCAAGATTCTGGGTGTGCTCACAGCTCTAGGTTGTTAGCAGACAGAGCTGGAAAGTGTGTATGTGA
ATACAAGTGCACAGTTGTGGGTGCGTCTGTGTATGTGCGAATACATGCGTCTTTGTATATGATAAAAACCGTGAGTGCAG
CCAGGTGCGGTGGCTCATGCCTGTAATCCCAACACTTTGGGAGGCCAAGGCGCGTGGCTCACCTGAGGTCAGGAGTTGGA
GACTGGCCTGGCCAACATGGTGAAACCCCGTCTACTAAAAATACAAAAATTAGCCAGGCGTGATGGTGCACACCTGTAAT
CCCAGCTACTTGGGAGGCTGAGGCAAGAGAATCACTGGAACCTGGGAGGTGGAACTTGCAGTGAGCCAAGATCATGCCAT
TGCCCTCCAGCCTGGGTGACAGAGCGAGACTCCGTCTCAAAAAAAAAAAAAAAATCCATGAGTTAATTCCAGTACTTGCA
ATTTTAATCCAATAGCACAGTTTTCTTTTGCATTGTTTTCAATATACTCAATTGCTCAGCCCTAGAATACACAAAAAGTA
GTTTTAAGAATGCTAACCTGGCCCACTGTGGAGACGATGCCTCCTAACTGGAGTGTAACATTTGTCTGTAGTTCTTGTCA
TTTGTAGCCTGAGGCCGTGGAGTCCAGATTTTGGGTTCAGAAGTTACTGGAATTCACTATCCCATCCCATTGGTCAGACC
ATGTCACTCATTTCAAATACAGTTTGGTTCATTTTTTCTGATTTCATTCCATTCTAGGATTCTCCTCCCATCCTTGTTGG
TTTATCTTCTTTTTTGAGACGAAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGGTGCGACGTCAGCTCACGGCATTC
TCCGCCTCCCAGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGCGTAGCTGGGATTACAGGCGCCTGCCACCATGCCCAG
CTAGTTTTTGTATTTTTATTAGAGACAGGGTTTTACTATGTTAGACAGCCTGGTCTCGAACTCCTGCCATCATGATCCGC
CTGCCTCGGCCTCCCAAAGTGCTGGGATTACAAGCATGAGCCACCACGCCCCGCCTATGTTCCTTTTTTTTAGAAACGTG
AAACCTGAACTTGGTTCCAAAAGTCACAACAAAAAGGAAAACTCCTGGCCGGGCGGTGGCTCACGCCTATAATCCCAGCA
CTTCGGGAGGCCGAGGCGGGTGGATCACCTGAGGTCAGGAGTTTGACACCAGCCTGACCAACATGGTGAAACCCGTCTCT
CCTAAAAATACAAAAACTTAGCTGGGTGTGGTGGTGCACGCCTGTAATCCCAGCTGCTCGGGAGGCTGAGGCAGGAGAAT
TGCTTCAACCCAGGAAGCAGAGGTTGCAGTGAGCTGAGATCGCGCCACTGCACTCCAGCCTGCGTGACAGACTATCTCAA
AAAAAAAGTAAAACTCGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCAC
AAGGTCAGGAGATCAAGACAATCCTGGCCAACATGATGAAACCCCATCTTTATTAAAAGTACAAAAAAAAATTAGCCGGG
CGTAGTGGCGTACGCCCGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTG
CAGTGAGCCAGGATTGTGCCACTGCACTCCAGCCTGGTGACAGAGCAGGACTCCATCTCAAAAAAAAAAAAAAAAAAGTA
AAACTCAAGAGTAGGGTGTCCCTTCCTCTCCTCCCTGTTGCCCTGGCTGAACCATCTCCCCTGTCTTGTTTGTCACAGAT
CCCAGCATGGTGAGTGCCTACATGTACCCAGCAGGGGCCACTGGGGCGCAGGCGGCCCCCCAGGCCCAGGCCGGACCCAC
CGCCAGCCCCGCTTACTCATCCTACCAGCCTACTCCCACAGCGGGCTACCAGGTACACAGGAAGGCCGCTCCTCTCCTTC
CAGGGCCAGCCCCAGCCCCAGCCCCAGCCCCAGCCCCTTCTCCCATGGCACTCATTCCCTCCGCAGAACGTGGCCTCCCA
GGCCCCACAGAGCCTCCCGGCCATCTCTCAGCCTCCGCAGTCCAGCACCATGGGCTACATGGGGAGCCAGTCAGTCTCCA
TGGGCTACCAGCCTTACAACATGCAGGTACAGTGACCTCCAGGCCCTGCTGGGGGCCAGGGTGGGGGAGCAGTTGATGAT
GCTGAGGGTCCTTTGGTGAGGCTGGCAGGCACTGGGTGGGCTCCACCCCTTCTGCTCCTCCCCCTCCACTCTCTGGGTGC
TCCCTGTGGTCACCTTTGGATTGTTGCAAGCCAGAAACATCCCCGCCTGCCTGGTCACAGGGCTACTCTCTCACATCTGA
CGTCTTCTCACAACAGAATCTCATGACCACCCTCCCAAGCCAGGATGCGTCTCTGCCACCCCAGCAGCCCTACATCGCGG
GGCAGCAGCCCATGTACCAGCAGGTGAGCCATTCCCGGGGCCTCACAGCGGCACCCGCAGGGCACCCTCAGGCTTCACGG
TTTAGCAATGGGACCCCTGGCCCCAGTGGGGATTGTCCCGTACGTCTGCTCACAGGGGGAGACGCATACCCGAGGCCCCT
TCTCAGCAGACAGAACGAGGACACAAGTCTCAGGGCAGATACGCGCATCCGCAAGTGTAGACAGGAAAAACCGTGAATTT
ACTTGAAAGGCAAAGGCCGCATGAGCTGGCTGCATGAGCTGCAGTCCGTGGACATGGATTACAAGCACTTGTGGGTCTGT
GGCTCTGCTGGGACAAAAGCCTTCCTCCCTGGGCTGGGGAAGGGAGGACCAGGGCCATGCCTGCTTTCCTCCTGCACAGA
TGGCACCCTCTGGCGGTCCCCCCCAGCAGCAGCCCCCCGTGGCCCAGCAACCGCAGGCACAGGGGCCGCCGGCACAGGGC
AGCGAGGCCCAGCTCATTTCATTCGACTGACCCAGGCCATGCTCACGTCCGGAGTAACACTACATACAGTTCACCTGAAA
CGCCTCGTCTCTAACTGCCGTCGTCCTGCCTCCCTGTCCTCTACTGCCGGTAGTGTCCCTTCTCTGCGAGTGAGGGGGGG
CCTTCACCCCAAGCCCACCTCCCTTGTCCTCAGCCTACTGCAGTCCCTGAGTTAGTCTCTGCTTTCTTTCCCCAGGGCTG
GGCCATGGGGAGGGAAGGACTTTCTCCCAGGGGAAGCCCCCAGCCCTGTGGGTCATGGTCTGTGAGAGGTGGCAGGAATG
GGGACCCTCACCCCCCAAGCAGCCTGTGCCCTCTGGCCGCACTGTGAGCTGGCTGTGGTGTCTGGGTGTGGCCTGGGGCT
CCCTCTGCAGGGGCCTCTCTCGGCAGCCACAGCCAAGGGTGGAGGCTTCAGGTCTCCAGCTTCTCTGCTTCTCAGCTGCC
ATCTCCAGTGCCCCAGAATGGTACAGCGATAATAAAATGTATTTCAGAA
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Primary source |
RefSeq : NM_004712.3 (GI:24496766)
|
Name |
Hepatocyte growth factor-regulated tyrosine kinase substrate (HGS)
|
Type of transcript |
mRNA
|
Organism |
Homo sapiens
|
Length |
2926 nt
|
Map |
17q25
|
Location |
Chromosome 17 (NC_000017.10) strand : +
79651019...79651132 | 79652634...79652718 | 79653341...79653416 | 79654032...79654124 |
79655733...79655856 | 79657211...79657263 | 79657703...79657771 | 79658476...79658600 |
79660532...79660610 | 79660683...79660781 | 79660899...79660994 | 79661844...79661882 |
79661953...79662096 | 79662193...79662252 | 79662815...79663028 | 79663386...79663558 |
79663636...79663776 | 79663853...79664027 | 79667496...79667629 | 79667724...79667843 |
79668074...79668160 | 79668537...79669147 |
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon(s) |
Start |
End |
Length |
is coding |
Exon 1
|
1
|
114
|
114
|
1
|
Exon 2
|
115
|
199
|
85
|
1
|
Exon 3
|
200
|
275
|
76
|
1
|
Exon 4
|
276
|
368
|
93
|
1
|
Exon 5
|
369
|
492
|
124
|
1
|
Exon 6
|
493
|
545
|
53
|
1
|
Exon 7
|
546
|
614
|
69
|
1
|
Exon 8
|
615
|
739
|
125
|
1
|
Exon 9
|
740
|
818
|
79
|
1
|
Exon 10
|
819
|
917
|
99
|
1
|
Exon 11
|
918
|
1013
|
96
|
1
|
Exon 12
|
1014
|
1052
|
39
|
1
|
Exon 13
|
1053
|
1196
|
144
|
1
|
Exon 14
|
1197
|
1256
|
60
|
1
|
Exon 15
|
1257
|
1470
|
214
|
1
|
Exon 16
|
1471
|
1643
|
173
|
1
|
Exon 17
|
1644
|
1784
|
141
|
1
|
Exon 18
|
1785
|
1959
|
175
|
1
|
Exon 19
|
1960
|
2093
|
134
|
1
|
Exon 20
|
2094
|
2213
|
120
|
1
|
Exon 21
|
2214
|
2300
|
87
|
1
|
Exon 22
|
2301
|
2911
|
611
|
1
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Primary source |
NCBI : CCDS11784
|
Nucleotide |
HGS, mRNA isoform 1[NM_004712.3] : 78...2411
|
Length |
2334
|
Location |
Chromosome 17 (NC_000017.10) strand : +
79651096...79651132 | 79652634...79652718 | 79653341...79653416 | 79654032...79654124 |
79655733...79655856 | 79657211...79657263 | 79657703...79657771 | 79658476...79658600 |
79660532...79660610 | 79660683...79660781 | 79660899...79660994 | 79661844...79661882 |
79661953...79662096 | 79662193...79662252 | 79662815...79663028 | 79663386...79663558 |
79663636...79663776 | 79663853...79664027 | 79667496...79667629 | 79667724...79667843 |
79668074...79668160 | 79668537...79668647 |
|
Start codon |
1
|
Translation |
NP_004703.1 |
|
|
|
|
|
|
|
|
|
|
|
|
1 |
CGGAAGCGGA |
AGTCGGGGGG |
CGCGCCAGCT |
CGTAGCAGGG |
GAGCGCCCGC |
GGCGTCGGGT |
TTGGGCTGGA |
GGTCGCCATG |
|
81 |
GGGCGAGGCA |
GCGGCACCTT |
CGAGCGTCTC |
CTAGACAAGG |
CGACCAGCCA |
GCTCCTGTTG |
GAGACAGATT |
GGGAGTCCAT |
|
161 |
TTTGCAGATC |
TGCGACCTGA |
TCCGCCAAGG |
GGACACACAA |
GCAAAATATG |
CTGTGAATTC |
CATCAAGAAG |
AAAGTCAACG |
|
241 |
ACAAGAACCC |
ACACGTCGCC |
TTGTATGCCC |
TGGAGGTCAT |
GGAATCTGTG |
GTAAAGAACT |
GTGGCCAGAC |
AGTTCATGAT |
|
321 |
GAGGTGGCCA |
ACAAGCAGAC |
CATGGAGGAG |
CTGAAGGACC |
TGCTGAAGAG |
ACAAGTGGAG |
GTAAACGTCC |
GTAACAAGAT |
|
401 |
CCTGTACCTG |
ATCCAGGCCT |
GGGCGCATGC |
CTTCCGGAAC |
GAGCCCAAGT |
ACAAGGTGGT |
CCAGGACACC |
TACCAGATCA |
|
481 |
TGAAGGTGGA |
GGGGCACGTC |
TTTCCAGAAT |
TCAAAGAGAG |
CGATGCCATG |
TTTGCTGCCG |
AGAGAGCCCC |
AGACTGGGTG |
|
561 |
GACGCTGAGG |
AATGCCACCG |
CTGCAGGGTG |
CAGTTCGGGG |
TGATGACCCG |
TAAGCACCAC |
TGCCGGGCGT |
GTGGGCAGAT |
|
641 |
ATTCTGTGGA |
AAGTGTTCTT |
CCAAGTACTC |
CACCATCCCC |
AAGTTTGGCA |
TCGAGAAGGA |
GGTGCGCGTG |
TGTGAGCCCT |
|
721 |
GCTACGAGCA |
GCTGAACAGG |
AAAGCGGAGG |
GAAAGGCCAC |
TTCCACCACT |
GAGCTGCCCC |
CCGAGTACCT |
GACCAGCCCC |
|
801 |
CTGTCTCAGC |
AGTCCCAGCT |
GCCCCCCAAG |
AGGGACGAGA |
CGGCCCTGCA |
GGAGGAGGAG |
GAGCTGCAGC |
TGGCCCTGGC |
|
881 |
GCTGTCACAG |
TCAGAGGCGG |
AGGAGAAGGA |
GAGGCTGAGA |
CAGAAGTCCA |
CGTACACTTC |
GTACCCCAAG |
GCGGAGCCCA |
|
961 |
TGCCCTCGGC |
CTCCTCAGCG |
CCCCCCGCCA |
GCAGCCTGTA |
CTCTTCACCT |
GTGAACTCGT |
CGGCGCCTCT |
GGCTGAGGAC |
|
1041 |
ATCGACCCTG |
AGCTCGCACG |
GTATCTCAAC |
CGGAACTACT |
GGGAGAAGAA |
GCAGGAGGAG |
GCTCGCAAGA |
GCCCCACGCC |
|
1121 |
ATCTGCGCCC |
GTGCCCCTGA |
CGGAGCCGGC |
TGCACAGCCT |
GGGGAAGGGC |
ACGCAGCCCC |
CACCAACGTG |
GTGGAGAACC |
|
1201 |
CCCTCCCGGA |
GACAGACTCT |
CAGCCCATTC |
CTCCCTCTGG |
TGGCCCCTTT |
AGTGAGCCAC |
AGTTCCACAA |
TGGCGAGTCT |
|
1281 |
GAGGAGAGCC |
ACGAGCAGTT |
CCTGAAGGCG |
CTGCAGAACG |
CCGTCACCAC |
CTTCGTGAAC |
CGCATGAAGA |
GTAACCACAT |
|
1361 |
GCGGGGCCGC |
AGCATCACCA |
ATGACTCGGC |
CGTGCTCTCA |
CTCTTCCAGT |
CCATCAACGG |
CATGCACCCG |
CAGCTGCTGG |
|
1441 |
AGCTGCTCAA |
CCAGCTGGAC |
GAGCGCAGGC |
TGTACTATGA |
GGGGCTGCAG |
GACAAGCTGG |
CACAGATCCG |
CGATGCCCGG |
|
1521 |
GGGGCGCTGA |
GTGCCCTGCG |
CGAAGAGCAC |
CGGGAGAAGC |
TTCGCCGGGC |
AGCCGAGGAG |
GCAGAGCGCC |
AGCGCCAGAT |
|
1601 |
CCAGCTGGCC |
CAGAAGCTGG |
AGATAATGCG |
GCAGAAGAAG |
CAGGAGTACC |
TGGAGGTGCA |
GAGGCAGCTG |
GCCATCCAGC |
|
1681 |
GCCTGCAGGA |
GCAGGAGAAG |
GAGCGGCAGA |
TGCGGCTGGA |
GCAGCAGAAG |
CAGACGGTCC |
AGATGCGCGC |
GCAGATGCCC |
|
1761 |
GCCTTCCCCC |
TGCCCTACGC |
CCAGCTCCAG |
GCCATGCCCG |
CAGCCGGAGG |
TGTGCTCTAC |
CAGCCCTCGG |
GACCAGCCAG |
|
1841 |
CTTCCCCAGC |
ACCTTCAGCC |
CTGCCGGCTC |
GGTGGAGGGC |
TCCCCAATGC |
ACGGCGTGTA |
CATGAGCCAG |
CCGGCCCCTG |
|
1921 |
CCGCTGGCCC |
CTACCCCAGC |
ATGCCCAGCA |
CTGCGGCTGA |
TCCCAGCATG |
GTGAGTGCCT |
ACATGTACCC |
AGCAGGGGCC |
|
2001 |
ACTGGGGCGC |
AGGCGGCCCC |
CCAGGCCCAG |
GCCGGACCCA |
CCGCCAGCCC |
CGCTTACTCA |
TCCTACCAGC |
CTACTCCCAC |
|
2081 |
AGCGGGCTAC |
CAGAACGTGG |
CCTCCCAGGC |
CCCACAGAGC |
CTCCCGGCCA |
TCTCTCAGCC |
TCCGCAGTCC |
AGCACCATGG |
|
2161 |
GCTACATGGG |
GAGCCAGTCA |
GTCTCCATGG |
GCTACCAGCC |
TTACAACATG |
CAGAATCTCA |
TGACCACCCT |
CCCAAGCCAG |
|
2241 |
GATGCGTCTC |
TGCCACCCCA |
GCAGCCCTAC |
ATCGCGGGGC |
AGCAGCCCAT |
GTACCAGCAG |
ATGGCACCCT |
CTGGCGGTCC |
|
2321 |
CCCCCAGCAG |
CAGCCCCCCG |
TGGCCCAGCA |
ACCGCAGGCA |
CAGGGGCCGC |
CGGCACAGGG |
CAGCGAGGCC |
CAGCTCATTT |
|
2401 |
CATTCGACTG |
ACCCAGGCCA |
TGCTCACGTC |
CGGAGTAACA |
CTACATACAG |
TTCACCTGAA |
ACGCCTCGTC |
TCTAACTGCC |
|
2481 |
GTCGTCCTGC |
CTCCCTGTCC |
TCTACTGCCG |
GTAGTGTCCC |
TTCTCTGCGA |
GTGAGGGGGG |
GCCTTCACCC |
CAAGCCCACC |
|
2561 |
TCCCTTGTCC |
TCAGCCTACT |
GCAGTCCCTG |
AGTTAGTCTC |
TGCTTTCTTT |
CCCCAGGGCT |
GGGCCATGGG |
GAGGGAAGGA |
|
2641 |
CTTTCTCCCA |
GGGGAAGCCC |
CCAGCCCTGT |
GGGTCATGGT |
CTGTGAGAGG |
TGGCAGGAAT |
GGGGACCCTC |
ACCCCCCAAG |
|
2721 |
CAGCCTGTGC |
CCTCTGGCCG |
CACTGTGAGC |
TGGCTGTGGT |
GTCTGGGTGT |
GGCCTGGGGC |
TCCCTCTGCA |
GGGGCCTCTC |
|
2801 |
TCGGCAGCCA |
CAGCCAAGGG |
TGGAGGCTTC |
AGGTCTCCAG |
CTTCTCTGCT |
TCTCAGCTGC |
CATCTCCAGT |
GCCCCAGAAT |
|
2881 |
GGTACAGCGA |
TAATAAAATG |
TATTTCAGAA |
AAAAAAAAAA |
AAAAAA |
|
|
|
|
|
|
|
|
|
|
|
|
>gi|24496766|ref|NM_004712.3|Hepatocyte growth factor-regulated tyrosine kinase substrate (HGS)
CGGAAGCGGAAGTCGGGGGGCGCGCCAGCTCGTAGCAGGGGAGCGCCCGCGGCGTCGGGTTTGGGCTGGAGGTCGCCATG
GGGCGAGGCAGCGGCACCTTCGAGCGTCTCCTAGACAAGGCGACCAGCCAGCTCCTGTTGGAGACAGATTGGGAGTCCAT
TTTGCAGATCTGCGACCTGATCCGCCAAGGGGACACACAAGCAAAATATGCTGTGAATTCCATCAAGAAGAAAGTCAACG
ACAAGAACCCACACGTCGCCTTGTATGCCCTGGAGGTCATGGAATCTGTGGTAAAGAACTGTGGCCAGACAGTTCATGAT
GAGGTGGCCAACAAGCAGACCATGGAGGAGCTGAAGGACCTGCTGAAGAGACAAGTGGAGGTAAACGTCCGTAACAAGAT
CCTGTACCTGATCCAGGCCTGGGCGCATGCCTTCCGGAACGAGCCCAAGTACAAGGTGGTCCAGGACACCTACCAGATCA
TGAAGGTGGAGGGGCACGTCTTTCCAGAATTCAAAGAGAGCGATGCCATGTTTGCTGCCGAGAGAGCCCCAGACTGGGTG
GACGCTGAGGAATGCCACCGCTGCAGGGTGCAGTTCGGGGTGATGACCCGTAAGCACCACTGCCGGGCGTGTGGGCAGAT
ATTCTGTGGAAAGTGTTCTTCCAAGTACTCCACCATCCCCAAGTTTGGCATCGAGAAGGAGGTGCGCGTGTGTGAGCCCT
GCTACGAGCAGCTGAACAGGAAAGCGGAGGGAAAGGCCACTTCCACCACTGAGCTGCCCCCCGAGTACCTGACCAGCCCC
CTGTCTCAGCAGTCCCAGCTGCCCCCCAAGAGGGACGAGACGGCCCTGCAGGAGGAGGAGGAGCTGCAGCTGGCCCTGGC
GCTGTCACAGTCAGAGGCGGAGGAGAAGGAGAGGCTGAGACAGAAGTCCACGTACACTTCGTACCCCAAGGCGGAGCCCA
TGCCCTCGGCCTCCTCAGCGCCCCCCGCCAGCAGCCTGTACTCTTCACCTGTGAACTCGTCGGCGCCTCTGGCTGAGGAC
ATCGACCCTGAGCTCGCACGGTATCTCAACCGGAACTACTGGGAGAAGAAGCAGGAGGAGGCTCGCAAGAGCCCCACGCC
ATCTGCGCCCGTGCCCCTGACGGAGCCGGCTGCACAGCCTGGGGAAGGGCACGCAGCCCCCACCAACGTGGTGGAGAACC
CCCTCCCGGAGACAGACTCTCAGCCCATTCCTCCCTCTGGTGGCCCCTTTAGTGAGCCACAGTTCCACAATGGCGAGTCT
GAGGAGAGCCACGAGCAGTTCCTGAAGGCGCTGCAGAACGCCGTCACCACCTTCGTGAACCGCATGAAGAGTAACCACAT
GCGGGGCCGCAGCATCACCAATGACTCGGCCGTGCTCTCACTCTTCCAGTCCATCAACGGCATGCACCCGCAGCTGCTGG
AGCTGCTCAACCAGCTGGACGAGCGCAGGCTGTACTATGAGGGGCTGCAGGACAAGCTGGCACAGATCCGCGATGCCCGG
GGGGCGCTGAGTGCCCTGCGCGAAGAGCACCGGGAGAAGCTTCGCCGGGCAGCCGAGGAGGCAGAGCGCCAGCGCCAGAT
CCAGCTGGCCCAGAAGCTGGAGATAATGCGGCAGAAGAAGCAGGAGTACCTGGAGGTGCAGAGGCAGCTGGCCATCCAGC
GCCTGCAGGAGCAGGAGAAGGAGCGGCAGATGCGGCTGGAGCAGCAGAAGCAGACGGTCCAGATGCGCGCGCAGATGCCC
GCCTTCCCCCTGCCCTACGCCCAGCTCCAGGCCATGCCCGCAGCCGGAGGTGTGCTCTACCAGCCCTCGGGACCAGCCAG
CTTCCCCAGCACCTTCAGCCCTGCCGGCTCGGTGGAGGGCTCCCCAATGCACGGCGTGTACATGAGCCAGCCGGCCCCTG
CCGCTGGCCCCTACCCCAGCATGCCCAGCACTGCGGCTGATCCCAGCATGGTGAGTGCCTACATGTACCCAGCAGGGGCC
ACTGGGGCGCAGGCGGCCCCCCAGGCCCAGGCCGGACCCACCGCCAGCCCCGCTTACTCATCCTACCAGCCTACTCCCAC
AGCGGGCTACCAGAACGTGGCCTCCCAGGCCCCACAGAGCCTCCCGGCCATCTCTCAGCCTCCGCAGTCCAGCACCATGG
GCTACATGGGGAGCCAGTCAGTCTCCATGGGCTACCAGCCTTACAACATGCAGAATCTCATGACCACCCTCCCAAGCCAG
GATGCGTCTCTGCCACCCCAGCAGCCCTACATCGCGGGGCAGCAGCCCATGTACCAGCAGATGGCACCCTCTGGCGGTCC
CCCCCAGCAGCAGCCCCCCGTGGCCCAGCAACCGCAGGCACAGGGGCCGCCGGCACAGGGCAGCGAGGCCCAGCTCATTT
CATTCGACTGACCCAGGCCATGCTCACGTCCGGAGTAACACTACATACAGTTCACCTGAAACGCCTCGTCTCTAACTGCC
GTCGTCCTGCCTCCCTGTCCTCTACTGCCGGTAGTGTCCCTTCTCTGCGAGTGAGGGGGGGCCTTCACCCCAAGCCCACC
TCCCTTGTCCTCAGCCTACTGCAGTCCCTGAGTTAGTCTCTGCTTTCTTTCCCCAGGGCTGGGCCATGGGGAGGGAAGGA
CTTTCTCCCAGGGGAAGCCCCCAGCCCTGTGGGTCATGGTCTGTGAGAGGTGGCAGGAATGGGGACCCTCACCCCCCAAG
CAGCCTGTGCCCTCTGGCCGCACTGTGAGCTGGCTGTGGTGTCTGGGTGTGGCCTGGGGCTCCCTCTGCAGGGGCCTCTC
TCGGCAGCCACAGCCAAGGGTGGAGGCTTCAGGTCTCCAGCTTCTCTGCTTCTCAGCTGCCATCTCCAGTGCCCCAGAAT
GGTACAGCGATAATAAAATGTATTTCAGAAAAAAAAAAAAAAAAAA
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
1
|
Length |
114 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79651019...79651132 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
CGGAAGCGGAAGTCGGGGGGCGCGCCAGCTCGTAGCAGGGGAGCGCCCGCGGCGTCGGGTTTGGGCTGGAGGTCGCCATG
GGGCGAGGCAGCGGCACCTTCGAGCGTCTCCTAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
2
|
Length |
85 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79652634...79652718 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
ACAAGGCGACCAGCCAGCTCCTGTTGGAGACAGATTGGGAGTCCATTTTGCAGATCTGCGACCTGATCCGCCAAGGGGAC
ACACA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
4
|
Length |
93 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79654032...79654124 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
GTCATGGAATCTGTGGTAAAGAACTGTGGCCAGACAGTTCATGATGAGGTGGCCAACAAGCAGACCATGGAGGAGCTGAA
GGACCTGCTGAAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
5
|
Length |
124 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79655733...79655856 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
AGACAAGTGGAGGTAAACGTCCGTAACAAGATCCTGTACCTGATCCAGGCCTGGGCGCATGCCTTCCGGAACGAGCCCAA
GTACAAGGTGGTCCAGGACACCTACCAGATCATGAAGGTGGAGG
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
8
|
Length |
125 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79658476...79658600 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
CACCACTGCCGGGCGTGTGGGCAGATATTCTGTGGAAAGTGTTCTTCCAAGTACTCCACCATCCCCAAGTTTGGCATCGA
GAAGGAGGTGCGCGTGTGTGAGCCCTGCTACGAGCAGCTGAACAG
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
10
|
Length |
99 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79660683...79660781 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
CTGCCCCCCAAGAGGGACGAGACGGCCCTGCAGGAGGAGGAGGAGCTGCAGCTGGCCCTGGCGCTGTCACAGTCAGAGGC
GGAGGAGAAGGAGAGGCTG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
11
|
Length |
96 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79660899...79660994 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
AGACAGAAGTCCACGTACACTTCGTACCCCAAGGCGGAGCCCATGCCCTCGGCCTCCTCAGCGCCCCCCGCCAGCAGCCT
GTACTCTTCACCTGTG
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
13
|
Length |
144 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79661953...79662096 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
CTCGCACGGTATCTCAACCGGAACTACTGGGAGAAGAAGCAGGAGGAGGCTCGCAAGAGCCCCACGCCATCTGCGCCCGT
GCCCCTGACGGAGCCGGCTGCACAGCCTGGGGAAGGGCACGCAGCCCCCACCAACGTGGTGGAG
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
15
|
Length |
214 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79662815...79663028 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
CCACAGTTCCACAATGGCGAGTCTGAGGAGAGCCACGAGCAGTTCCTGAAGGCGCTGCAGAACGCCGTCACCACCTTCGT
GAACCGCATGAAGAGTAACCACATGCGGGGCCGCAGCATCACCAATGACTCGGCCGTGCTCTCACTCTTCCAGTCCATCA
ACGGCATGCACCCGCAGCTGCTGGAGCTGCTCAACCAGCTGGACGAGCGCAGGC
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
16
|
Length |
173 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79663386...79663558 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
TGTACTATGAGGGGCTGCAGGACAAGCTGGCACAGATCCGCGATGCCCGGGGGGCGCTGAGTGCCCTGCGCGAAGAGCAC
CGGGAGAAGCTTCGCCGGGCAGCCGAGGAGGCAGAGCGCCAGCGCCAGATCCAGCTGGCCCAGAAGCTGGAGATAATGCG
GCAGAAGAAGCAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
17
|
Length |
141 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79663636...79663776 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
GAGTACCTGGAGGTGCAGAGGCAGCTGGCCATCCAGCGCCTGCAGGAGCAGGAGAAGGAGCGGCAGATGCGGCTGGAGCA
GCAGAAGCAGACGGTCCAGATGCGCGCGCAGATGCCCGCCTTCCCCCTGCCCTACGCCCAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
18
|
Length |
175 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79663853...79664027 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
CTCCAGGCCATGCCCGCAGCCGGAGGTGTGCTCTACCAGCCCTCGGGACCAGCCAGCTTCCCCAGCACCTTCAGCCCTGC
CGGCTCGGTGGAGGGCTCCCCAATGCACGGCGTGTACATGAGCCAGCCGGCCCCTGCCGCTGGCCCCTACCCCAGCATGC
CCAGCACTGCGGCTG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
19
|
Length |
134 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79667496...79667629 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
ATCCCAGCATGGTGAGTGCCTACATGTACCCAGCAGGGGCCACTGGGGCGCAGGCGGCCCCCCAGGCCCAGGCCGGACCC
ACCGCCAGCCCCGCTTACTCATCCTACCAGCCTACTCCCACAGCGGGCTACCAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
20
|
Length |
120 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79667724...79667843 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
AACGTGGCCTCCCAGGCCCCACAGAGCCTCCCGGCCATCTCTCAGCCTCCGCAGTCCAGCACCATGGGCTACATGGGGAG
CCAGTCAGTCTCCATGGGCTACCAGCCTTACAACATGCAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
21
|
Length |
87 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79668074...79668160 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
AATCTCATGACCACCCTCCCAAGCCAGGATGCGTCTCTGCCACCCCAGCAGCCCTACATCGCGGGGCAGCAGCCCATGTA
CCAGCAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
22
|
Length |
611 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79668537...79669147 (+)
|
Is part of |
HGS, mRNA isoform 1
(NM_004712.3)
|
Sequence |
Show
|
|
ATGGCACCCTCTGGCGGTCCCCCCCAGCAGCAGCCCCCCGTGGCCCAGCAACCGCAGGCACAGGGGCCGCCGGCACAGGG
CAGCGAGGCCCAGCTCATTTCATTCGACTGACCCAGGCCATGCTCACGTCCGGAGTAACACTACATACAGTTCACCTGAA
ACGCCTCGTCTCTAACTGCCGTCGTCCTGCCTCCCTGTCCTCTACTGCCGGTAGTGTCCCTTCTCTGCGAGTGAGGGGGG
GCCTTCACCCCAAGCCCACCTCCCTTGTCCTCAGCCTACTGCAGTCCCTGAGTTAGTCTCTGCTTTCTTTCCCCAGGGCT
GGGCCATGGGGAGGGAAGGACTTTCTCCCAGGGGAAGCCCCCAGCCCTGTGGGTCATGGTCTGTGAGAGGTGGCAGGAAT
GGGGACCCTCACCCCCCAAGCAGCCTGTGCCCTCTGGCCGCACTGTGAGCTGGCTGTGGTGTCTGGGTGTGGCCTGGGGC
TCCCTCTGCAGGGGCCTCTCTCGGCAGCCACAGCCAAGGGTGGAGGCTTCAGGTCTCCAGCTTCTCTGCTTCTCAGCTGC
CATCTCCAGTGCCCCAGAATGGTACAGCGATAATAAAATGTATTTCAGAAA
|
|
|
|
|
|
|
|
|
|
|
|
|
Primary source |
Uniprot : O14964
|
Name |
Hepatocyte growth factor-regulated tyrosine kinase substrate
|
Alternative name(s) |
Hrs Protein pp110
|
Synonym(s) |
|
Organism |
Homo sapiens
|
Length |
777 aa
|
Protein existence |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
General annotation (Comments)
|
top
|
|
|
|
|
|
|
Domain
|
Has a double-sided UIM that can bind 2 ubiquitin molecules, one on each side of the helix.
|
Function
|
Involved in intracellular signal transduction mediated by cytokines and growth factors. When associated with STAM, it suppresses DNA signaling upon stimulation by IL-2 and GM-CSF. Could be a direct effector of PI3-kinase in vesicular pathway via early endosomes and may regulate trafficking to early and late endosomes by recruiting clathrin. May concentrate ubiquitinated receptors within clathrin-coated regions. Involved in down- regulation of receptor tyrosine kinase via multivesicular body (MVBs) when complexed with STAM (ESCRT-0 complex). The ESCRT-0 complex binds ubiquitin and acts as sorting machinery that recognizes ubiquitinated receptors and transfers them to further sequential lysosomal sorting/trafficking processes. May contribute to the efficient recruitment of SMADs to the activin receptor complex. Involved in receptor recycling via its association with the CART complex, a multiprotein complex required for efficient transferrin receptor recycling but not for EGFR degradation.
|
Ptm
|
Phosphorylated on Tyr-334. A minor site of phosphorylation on Tyr-329 is detected (By similarity). Phosphorylation occurs in response to EGF, IL-2, GM-CSF and HGF.
|
Similarity
|
Contains 1 FYVE-type zinc finger.
Contains 1 UIM (ubiquitin-interacting motif) repeat.
Contains 1 VHS domain.
|
Subcellular location
|
Cytoplasm. Early endosome membrane; Peripheral membrane protein; Cytoplasmic side. Endosome, multivesicular body membrane; Peripheral membrane protein (By similarity).
|
Subunit
|
Interacts with TRAK1. Interacts with TRAK2 (By similarity). Interacts with TRAK1. Component of the ESCRT-0 complex composed of STAM or STAM2 and HGS. Part of a complex at least composed of HSG, STAM2 (or probably STAM) and EPS15. Interacts with STAM. Interacts with STAM2. Interacts with EPS15; the interaction is direct, calcium-dependent and inhibited by SNAP25. Interacts with NF2; the interaction is direct. Interacts with ubiquitin; the interaction is direct. Interacts with VPS37C. Interacts with SMAD1, SMAD2 and SMAD3. Interacts with TSG101; the interaction mediates the association with the ESCRT-I complex. Interacts with SNAP25; the interaction is direct and decreases with addition of increasing concentrations of free calcium. Interacts with SNX1; the interaction is direct. Component of a 550 kDa membrane complex at least composed of HGS and SNX1 but excluding EGFR. Component of the CART complex, at least composed of ACTN4, HGS/HRS, MYO5B and TRIM3.
|
Tissue specificity
|
Ubiquitous expression in adult and fetal tissues with higher expression in testis and peripheral blood leukocytes.
|
|
|
|
|
|
|
|
|
|
|
|
|
Biological process
|
intracellular protein transport [GO:0006886]
signal transduction [GO:0007165]
negative regulation of cell proliferation [GO:0008285]
endosome transport [GO:0016197]
regulation of protein catabolic process [GO:0042176]
negative regulation of JAK-STAT cascade [GO:0046426]
|
Cellular component
|
cytosol [GO:0005829]
early endosome membrane [GO:0031901]
multivesicular body membrane [GO:0032585]
|
Molecular function
|
zinc ion binding [GO:0008270]
protein domain specific binding [GO:0019904]
|
|
|
|
|
|
|
|
|
|
|
|
|
With
|
Uniprot accession
|
IntAct
|
Stam2 (xeno)
|
O88811
|
EBI-740220,
|
MED7
|
O43513
|
EBI-740220,EBI-394632
|
USHBP1
|
Q8N6Y0
|
EBI-740220,EBI-739895
|
BEGAIN
|
Q9BUH8
|
EBI-740220,EBI-742722
|
CCDC33
|
Q8N5R6
|
EBI-740220,EBI-740841
|
C1orf190
|
Q96LR2
|
EBI-740220,EBI-741355
|
EXOC7
|
Q9UPT5
|
EBI-740220,EBI-720048
|
???
|
Q81M57
|
EBI-740220,EBI-2809848
|
???
|
Q8D065
|
EBI-740220,EBI-2847402
|
EXOC8
|
Q8IYI6
|
EBI-740220,EBI-742102
|
EHMT2
|
Q96KQ7
|
EBI-740220,EBI-744366
|
MIF4GD
|
A9UHW6
|
EBI-740220,EBI-373498
|
LDOC1
|
O95751
|
EBI-740220,EBI-740738
|
STAM2
|
O75886
|
EBI-740220,EBI-373258
|
???
|
Q8ZGA1
|
EBI-740220,EBI-2840375
|
GKAP1
|
Q5VSY0
|
EBI-740220,EBI-743722
|
UBQLN1
|
Q9UMX0
|
EBI-740220,EBI-741480
|
???
|
Q8CZJ9
|
EBI-740220,EBI-2847413
|
DAZAP2
|
Q15038
|
EBI-740220,EBI-724310
|
|
|
|
|
|
|
|
Alternative product(s)
|
top
|
|
|
|
|
|
|
  |
Uniprot Identifier
|
Difference(s) with the 'canonical' sequence
|
Notes
|
Sequences
|
Isoform 1
|
O14964-1
|
---
|
'Canonical' sequence
|
Get Fasta
|
Isoform 2
|
O14964-2
|
518-604 : Missing
|
---
|
Get Fasta
|
|
|
|
|
|
|
|
|
|
|
|
|
Feature key |
Position
|
Length
|
Description
|
Feature identifier
|
Region |
|
|
|
|
Compositional bias
|
346 - 394
|
49
|
Pro-rich
|
O14964-COMPBIAS-346
|
Compositional bias
|
505 - 772
|
268
|
Gln-rich
|
O14964-COMPBIAS-505
|
Domain
|
15 - 143
|
129
|
VHS
|
O14964-DOMAIN-15
|
Region
|
225 - 543
|
319
|
Interaction with SNX1 (By similarity)
|
O14964-REGION-225
|
Region
|
445 - 543
|
99
|
Interaction with SNAP25 and TRAK2 (By similarity)
|
O14964-REGION-445
|
Region
|
454 - 572
|
119
|
Interaction with STAM1 (By similarity)
|
O14964-REGION-454
|
Region
|
480 - 777
|
298
|
Interaction with NF2
|
O14964-REGION-480
|
Repeat region
|
258 - 277
|
20
|
UIM
|
O14964-REPEAT-UIM
|
Zinc finger
|
160 - 220
|
61
|
FYVE-type
|
O14964-ZN_FING-160
|
Natural variations |
|
|
|
|
Natural variant site
|
7 - 7
|
1
|
T -> S
|
VAR_054154
|
Natural variant site
|
400 - 400
|
1
|
E -> D (in dbSNP:rs34868130)
|
VAR_052981
|
Natural variant site
|
733 - 733
|
1
|
A -> S (in dbSNP:rs56058441)
|
VAR_061991
|
Alternative sequence
|
518 - 604
|
87
|
Missing
|
VSP_036172
|
Amino acid modifications |
|
|
|
|
Modified residue
|
132 - 132
|
1
|
Phosphotyrosine
|
O14964-MOD_RES-132
|
Modified residue
|
207 - 207
|
1
|
N6-acetyllysine
|
O14964-MOD_RES-207
|
Modified residue
|
216 - 216
|
1
|
Phosphotyrosine
|
O14964-MOD_RES-216
|
Modified residue
|
240 - 240
|
1
|
Phosphoserine
|
O14964-MOD_RES-240
|
Modified residue
|
286 - 286
|
1
|
Phosphotyrosine
|
O14964-MOD_RES-286
|
Modified residue
|
289 - 289
|
1
|
Phosphotyrosine
|
O14964-MOD_RES-289
|
Modified residue
|
308 - 308
|
1
|
Phosphotyrosine
|
O14964-MOD_RES-308
|
Modified residue
|
329 - 329
|
1
|
Phosphotyrosine (By similarity)
|
O14964-MOD_RES-329
|
Modified residue
|
334 - 334
|
1
|
Phosphotyrosine (By similarity)
|
O14964-MOD_RES-334
|
Experimental info |
|
|
|
|
Mutagenesis
|
266 - 266
|
1
|
A->Q: Strongly reduced ubiquitin-binding. Reduced degradation of ubiquitinated EGFR
|
O14964-MUTAGEN-266
|
Mutagenesis
|
268 - 268
|
1
|
A->Q: Strongly reduced ubiquitin-binding. Reduced degradation of ubiquitinated EGFR
|
O14964-MUTAGEN-268
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
MGRGSGTFER |
LLDKATSQLL |
LETDWESILQ |
ICDLIRQGDT |
QAKYAVNSIK |
KKVNDKNPHV |
ALYALEVMES |
VVKNCGQTVH |
|
81 |
DEVANKQTME |
ELKDLLKRQV |
EVNVRNKILY |
LIQAWAHAFR |
NEPKYKVVQD |
TYQIMKVEGH |
VFPEFKESDA |
MFAAERAPDW |
|
161 |
VDAEECHRCR |
VQFGVMTRKH |
HCRACGQIFC |
GKCSSKYSTI |
PKFGIEKEVR |
VCEPCYEQLN |
RKAEGKATST |
TELPPEYLTS |
|
241 |
PLSQQSQLPP |
KRDETALQEE |
EELQLALALS |
QSEAEEKERL |
RQKSTYTSYP |
KAEPMPSASS |
APPASSLYSS |
PVNSSAPLAE |
|
321 |
DIDPELARYL |
NRNYWEKKQE |
EARKSPTPSA |
PVPLTEPAAQ |
PGEGHAAPTN |
VVENPLPETD |
SQPIPPSGGP |
FSEPQFHNGE |
|
401 |
SEESHEQFLK |
ALQNAVTTFV |
NRMKSNHMRG |
RSITNDSAVL |
SLFQSINGMH |
PQLLELLNQL |
DERRLYYEGL |
QDKLAQIRDA |
|
481 |
RGALSALREE |
HREKLRRAAE |
EAERQRQIQL |
AQKLEIMRQK |
KQEYLEVQRQ |
LAIQRLQEQE |
KERQMRLEQQ |
KQTVQMRAQM |
|
561 |
PAFPLPYAQL |
QAMPAAGGVL |
YQPSGPASFP |
STFSPAGSVE |
GSPMHGVYMS |
QPAPAAGPYP |
SMPSTAADPS |
MVSAYMYPAG |
|
641 |
ATGAQAAPQA |
QAGPTASPAY |
SSYQPTPTAG |
YQNVASQAPQ |
SLPAISQPPQ |
SSTMGYMGSQ |
SVSMGYQPYN |
MQNLMTTLPS |
|
721 |
QDASLPPQQP |
YIAGQQPMYQ |
QMAPSGGPPQ |
QQPPVAQQPQ |
AQGPPAQGSE |
AQLISFD |
|
|
|
|
|
|
|
|
|
|
|
|
>sp|O14964|HGS_human Hepatocyte growth factor-regulated tyrosine kinase substrate
MGRGSGTFERLLDKATSQLLLETDWESILQICDLIRQGDTQAKYAVNSIKKKVNDKNPHVALYALEVMESVVKNCGQTVH
DEVANKQTMEELKDLLKRQVEVNVRNKILYLIQAWAHAFRNEPKYKVVQDTYQIMKVEGHVFPEFKESDAMFAAERAPDW
VDAEECHRCRVQFGVMTRKHHCRACGQIFCGKCSSKYSTIPKFGIEKEVRVCEPCYEQLNRKAEGKATSTTELPPEYLTS
PLSQQSQLPPKRDETALQEEEELQLALALSQSEAEEKERLRQKSTYTSYPKAEPMPSASSAPPASSLYSSPVNSSAPLAE
DIDPELARYLNRNYWEKKQEEARKSPTPSAPVPLTEPAAQPGEGHAAPTNVVENPLPETDSQPIPPSGGPFSEPQFHNGE
SEESHEQFLKALQNAVTTFVNRMKSNHMRGRSITNDSAVLSLFQSINGMHPQLLELLNQLDERRLYYEGLQDKLAQIRDA
RGALSALREEHREKLRRAAEEAERQRQIQLAQKLEIMRQKKQEYLEVQRQLAIQRLQEQEKERQMRLEQQKQTVQMRAQM
PAFPLPYAQLQAMPAAGGVLYQPSGPASFPSTFSPAGSVEGSPMHGVYMSQPAPAAGPYPSMPSTAADPSMVSAYMYPAG
ATGAQAAPQAQAGPTASPAYSSYQPTPTAGYQNVASQAPQSLPAISQPPQSSTMGYMGSQSVSMGYQPYNMQNLMTTLPS
QDASLPPQQPYIAGQQPMYQQMAPSGGPPQQQPPVAQQPQAQGPPAQGSEAQLISFD
|
|
|
| |
|
|
|
|
|
|