(P4HB) prolyl 4-hydroxylase, beta polypeptide [Homo sapiens] |
|
|
|
|
|
|
Gene
Transcript(s)
Exon(s)
Protein(s)
|
|
Accession
|
9577
|
Official symbol
|
P4HB
|
Official name
|
prolyl 4-hydroxylase, beta polypeptide
|
Gene type
|
gene with protein product
|
Organism
|
Homo sapiens
|
Location
|
Chromosome 17 (NC_000017.10) : 79801033...79818543 (-)
|
Map
|
17q25
|
Length
|
17511 nt
|
NM_000918.3
|
P4HB, mRNA isoform 1
|
Accession
|
Name
|
Organism
|
Length
|
P07237
|
Protein disulfide-isomerase
|
Homo sapiens
|
508 aa
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Synonyms |
PHDB; PROHB; PO4HB; PO4DB; PDIA1; PDI; P4Hbeta; GIT; ERBA2L; DSI
|
Alternative name(s) |
glutathione-insulin transhydrogenase; thyroid hormone-binding protein p55; protein disulfide isomerase/oxidoreductase; protocollagen hydroxylase; v-erb-a avian erythroblastic leukemia viral oncogene homolog 2-like; protein disulfide isomerase-associated 1; protein disulfide isomerase family A, member 1; procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), beta polypeptide (protein disulfide isomerase; thyroid hormone binding protein p55); procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), beta polypeptide (protein disulfide isomerase-associated 1); procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), beta polypeptide; collagen prolyl 4-hydroxylase beta
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Summary |
This gene encodes the beta subunit of prolyl 4-hydroxylase, a highly abundant multifunctional enzyme that belongs to the protein disulfide isomerase family. When present as a tetramer consisting of two alpha and two beta subunits, this enzyme is involved in hydroxylation of prolyl residues in preprocollagen. This enzyme is also a disulfide isomerase containing two thioredoxin domains that catalyze the formation, breakage and rearrangement of disulfide bonds. Other known functions include its ability to act as a chaperone that inhibits aggregation of misfolded proteins in a concentration-dependent manner, its ability to bind thyroid hormone, its role in both the influx and efflux of S-nitrosothiol-bound nitric oxide, and its function as a subunit of the microsomal triglyceride transfer protein complex. [provided by RefSeq].
|
|
|
|
|
|
|
|
|
|
|
|
|
Related Articles in PubMed
|
|
|
|
|
|
|
|
|
|
|
|
Go to ensembl
|
|
|
|
|
|
|
|
|
|
|
|
1 |
AGCCTCGAAG |
TCCGCCGGCC |
AATCGAAGGC |
GGGCCCCAGC |
GGCGCGTGCG |
CGCCGCGGCC |
AGCGCGCGCG |
GGCGGGGGGG |
|
81 |
CAGGCGCGCC |
CCGGACCCAG |
GATTTATAAA |
GGCGAGGCCG |
GGACCGGCGC |
GCGCTCTCGT |
CGCCCCCGCT |
GTCCCGGCGG |
|
161 |
CGCCAACCGA |
AGCGCCCCGC |
CTGATCCGTG |
TCCGACATGC |
TGCGCCGCGC |
TCTGCTGTGC |
CTGGCCGTGG |
CCGCCCTGGT |
|
241 |
GCGCGCCGAC |
GCCCCCGAGG |
AGGAGGACCA |
CGTCCTGGTG |
CTGCGGAAAA |
GCAACTTCGC |
GGAGGCGCTG |
GCGGCCCACA |
|
321 |
AGTACCTGCT |
GGTGGAGTTC |
TGTGAGCGCC |
GGGCCTGGCG |
GGCGGGCGGG |
GCTCGGGGCC |
GCTGAGCCAG |
GCTCTTGGGA |
|
401 |
CGCAGGCACG |
GCCCGGCAGC |
CCCCGGGGTC |
GGGACCCCCG |
GCGCGCCCGG |
AACTGAAGGG |
CTCCCTTGCT |
GCCCGCCTTG |
|
481 |
GGGGCCATGT |
CAGGGGCTCC |
CTGGGGGTGA |
GAGCCGGGCT |
GGGAGCCGGG |
GGGCCGTCCC |
GATGCCCGCC |
CTGCGCACGG |
|
561 |
CAGGATCTTC |
CTGTTAGTGC |
GAGAAGAACA |
GGATCGACCT |
GTGTGAGCAA |
ACAGAGCAAG |
CCAGTGTCCA |
GCCTTGCCGA |
|
641 |
GTCCTCCCCG |
GGGGCCGACC |
CGGAAGGCGC |
CCCGCTCCTG |
CCCACCCCAC |
CTTCAGCTTG |
CCACAGCAGT |
CCTTCCGATG |
|
721 |
GTGTCTCAGA |
GGAGAATCCC |
AAGCCTTCCT |
CACGTCCATA |
AGGATTTGAG |
CTCTGTCCCT |
GGGGTGGGAA |
CCTTGTGCCT |
|
801 |
GACACAGCTG |
CTAACTCTAT |
TGACAGATCG |
TCTTATTCTC |
ACCCTAAGCA |
ACTCAGGGTC |
TGTGTGCACG |
GATGTGCATC |
|
881 |
TCATACTCAC |
AGCTAGTGCT |
GGTACAGGCC |
GCCACGGTCA |
CTTCTTCACT |
TTTGTCTGTA |
GTCTTAAAGG |
GAAAGTCTAG |
|
961 |
GGGAGATCCT |
TGCCTTAGTT |
GCCTGTGGGG |
AATAAGAAGT |
CAGCAACCAT |
TGAAAGGTTT |
GTCTGGCTGT |
GCTGATGGTG |
|
1041 |
ACATAGCAGA |
GTGGGGGCTG |
GTGTTGGTTG |
GTGGTAGTTT |
GTCGTTCACG |
GCCTCTGTCC |
TGATATTGCC |
CAAGAACACC |
|
1121 |
AGGCTATCAG |
CTCAGCCTTG |
TGCGTTAGGA |
GGGGTTATCT |
TGGTGAGTAG |
ATAAGGTTTT |
TATGAAGGGA |
AATGCCAGAG |
|
1201 |
GAAAAAGGGA |
AGCACTGCTG |
AGGGATCAAG |
GCTGCTTTAG |
GGATCAAGGC |
TGCTTTCTAG |
CATCTCATTT |
CCGCTTCCAG |
|
1281 |
ATGCCCCTTG |
GTGTGGCCAC |
TGCAAGGCTC |
TGGCCCCTGA |
GTATGCCAAA |
GCCGCTGGGA |
AGCTGAAGGC |
AGAAGGTTCC |
|
1361 |
GAGATCAGGT |
TGGCCAAGGT |
GGACGCCACG |
GAGGAGTCTG |
ACCTGGCCCA |
GCAGTACGGC |
GTGCGCGGCT |
ATCCCACCAT |
|
1441 |
CAAGTTCTTC |
AGGAATGGAG |
ACACGGCTTC |
CCCCAAGGAA |
TATACAGGTG |
TGGCTGTGGC |
ACTGCCCTTG |
AACTGTCTTT |
|
1521 |
AGAGAGGGAC |
TGGCCTGCCG |
ACTTGGAAGG |
GCAGGGGCAG |
CTGTCTGAGC |
GGTGGGGAGG |
CCGGGTTCCA |
GAGGCTTGGA |
|
1601 |
GGGATCCCGT |
TCCTCAGGCC |
GGGTTCTCTG |
TCCACTTGTG |
CCCTGAGGGT |
CTTGTGAGGA |
GTTTTCATCC |
TGGAAGAACC |
|
1681 |
TTCGAATGGA |
GAACCTGTCT |
TGTCTGGTGC |
CATCCTGGGG |
GCGGCTGGCT |
GAGGTCTGTT |
TGGAGACTGA |
CCAGCGAGGA |
|
1761 |
GAGGGAGACT |
GGTGCCTTTC |
TGGCCACATG |
GGCTGTTTGT |
GCTGCCCCCT |
GTTCCTAGGA |
GCAGCAGCAT |
GGCTGTAGCT |
|
1841 |
CTCGGGGTGA |
CACCTGCTAG |
CTGTCGTGTG |
TAGCGTGCCA |
GCTGCCGCGT |
CACGTGCTTG |
CGTGATTTCT |
CCTCGCATCC |
|
1921 |
TCTCTGTGAG |
GCGGGCTTTT |
CCCAGACAAA |
GGGCTGAGGC |
TTAGAGCAGC |
TGAGTAACAG |
CATCGAGCCT |
GTCGGTAGCA |
|
2001 |
GACCTCGAAT |
CCAAGCCCGC |
CCTGCTCCGT |
CTGTAGCACG |
GGCCCTCAGC |
TGCTCGTCTG |
CCGCCTCTGA |
AGCTGCCGTA |
|
2081 |
GTGGAGGCCG |
AGCCCCTGTG |
TAGCCAGATC |
AGTGACGGTG |
GCAAAAGAAA |
ACTCGTCTAT |
AAAACCGGTT |
CTGTGGCATG |
|
2161 |
TCAGATGCTC |
ATGGTGGGTA |
GCCCGAAAGC |
AGCTCATTCC |
AGTGTTAGCA |
GGGTTCTCCC |
CTTAAGCAGG |
TGTGCCCTGT |
|
2241 |
CCTGTTCTAG |
TGGCAGCAAC |
TCCGTGGTTG |
CCTCTCACTT |
GGCTGTTTTG |
ATGTTTCCTT |
TTGTCCCCTT |
AAGCAGGTGT |
|
2321 |
GCCCTGTCCT |
GTTCTAGTGG |
CAGCAGCTCT |
GTGGTTGCCT |
CTCACTTGGC |
TGTTTTGATG |
TTTCCTTTTT |
TTTTTTTTTT |
|
2401 |
TTTTTTTTTT |
GAGACAGTCT |
CGCTCTGTCG |
TGCAGTGGTG |
CGATCTCTGC |
TCACTGCAAC |
CTCTGCCTCC |
CGGGTACAAG |
|
2481 |
TGATTCTCCT |
GCCTCAGCCT |
CTTGAGTAGC |
TGGGGCTACA |
GGTGTGCACC |
ACCACGCCTG |
GCTAATTTTT |
GTATTTTTAG |
|
2561 |
TAGAGACGGG |
GTTTCACCAT |
GTTGGCCAGG |
ATGGTCTCGA |
TCGTGACCTC |
GTGATCCACC |
CGTCTCGGCC |
TCCTAGAGTG |
|
2641 |
CTAGGATTAC |
AGGCGTGATT |
GATGTTCTCA |
TTTTTTACTC |
ATGTTCACTT |
TTGGAACGGG |
AAGTGCTCTG |
TCCAAAGTCA |
|
2721 |
CCGTGGTTCT |
GTCACTAACA |
CTCAGGTTTG |
CAGATGAGAT |
GAGCACTCCT |
AAATCCACTT |
GTCACTGGTG |
ACCGTCTTGT |
|
2801 |
TAGCACTGGT |
GGAATCCTTC |
TACATCTGAA |
TGGGGTTTTC |
CCGATTCGGA |
CCAGGGAGTC |
AAGTTCTAGG |
AGGGAAAAAG |
|
2881 |
GAGAGGCATC |
ATTCCTTAGC |
CTCAGTCTCC |
CAGGAGGAGG |
AAGTTTCTTT |
CCTGTCAGTT |
GACCGCCTTT |
GGTGGCATAA |
|
2961 |
AAGTGTGTCA |
TCTTCTTCCT |
GGTCTGGGAG |
GCATGGAGGG |
TGGGCCATTG |
GCGAGAACTC |
CCGAAACGCC |
TGGGATAAGT |
|
3041 |
TAGGAGGCGC |
GGGAGTCAGG |
TGGGCGGGGC |
GTCAGCCTGT |
TCCCAGAGGT |
AGAAAACTGG |
CACTGAGTTT |
TGAGTTACTG |
|
3121 |
TCCACCTCTC |
TAAAATGAGG |
CCTGCTGTCA |
GACTCCTGGG |
TGAGAACATG |
TGTTTGATGG |
ATGTACATGT |
CAAAGAAAAT |
|
3201 |
TTAGATAACG |
CAGAGGACAG |
GCTGGGTGCG |
GTGGCTCATG |
CCTGTAATCC |
CAGCACTTTG |
GGAGGCTGAG |
ACGGGTGGAT |
|
3281 |
CACCTGAGGT |
CAGGAGTTCG |
AGGCCAGCAT |
GGCCAATATG |
GTGAAACCCC |
CGTCTCTACT |
AAAAATACAA |
AAATTAGCCG |
|
3361 |
GGCGTGGTGG |
TGCACGTCTG |
TAATCCCAGC |
TACTTGGGAG |
GCTGACACAG |
GAGAATCCCT |
TGAACCTGGG |
AGGTAAGGTT |
|
3441 |
GCAGTGAGCT |
GAGATCGTGC |
CACCGCACTC |
CAGCCTGGGT |
GACAGAGTGA |
GACTTCGTTT |
CAAAAAATAA |
AATTTTTAAA |
|
3521 |
ATGCAGAGGG |
CCATCCTGGG |
CAACATGGTG |
AAACCCTGTC |
TACAAAAAAT |
ACAAAAATTA |
GCTAGGGCTG |
GGCACAGTGG |
|
3601 |
TTCATCCCGG |
TAATCCCAGC |
ACTTTGGAAG |
GCCGAGGTGG |
GCGGACTGCT |
TAATCCCAGG |
AGTTTGAGAC |
CATCCTAGGC |
|
3681 |
AACGTGGCAA |
AACCCCATCT |
CTACAAAAAA |
TAGAAAAATT |
GGCCGGGCAT |
GGTGGCTCAC |
GCATGTAATC |
CCAGCACTTT |
|
3761 |
GGGAGGCCGA |
GGCGGGCGGA |
TCACGAGGTC |
AGGAGTTCAA |
GACCAGCCTG |
GCCAACACAG |
TGAAACCCCG |
TCTCTACTAA |
|
3841 |
AAATACAAAA |
ATTAGCTGGG |
CATGGTGGCG |
GGCGCCTGTA |
ATACCAGCTA |
TTTGGGAGGC |
CGAGGCAGGA |
GAATCACTTG |
|
3921 |
AACCTGGGAG |
ATGGAGGTTG |
CAGTGAGCCA |
GGATGGCGCC |
ACTGCATTCT |
CCAGCCTGGA |
CGACAGAGGT |
AGATTCCGTC |
|
4001 |
TCAAAAAAAA |
AAAAAAAAAA |
GAAGAAACAT |
TGCTGGGCGC |
AGTGGCTCAC |
CTCTGTAATC |
CTAGCACTTT |
GGGAGACCGC |
|
4081 |
TGAGGCAGGT |
GTATCACCTG |
AGGTCAGGAG |
TGAGACCAGC |
CTGGCCAACA |
TGGGGAACCC |
TGTCTCTGCT |
AAACATACAA |
|
4161 |
AAATTAGCCG |
GGTGTGGTGG |
CGGGCACCTA |
TAATCCTAGC |
TACGCGGGAG |
GGTCAGGCAG |
GAGAATTGCT |
TGAACCCGGG |
|
4241 |
AGGTGGAGGT |
TGCAGTGAGC |
CGAGATCACA |
CTATTGCACT |
CCAGCCTGAG |
CAACAAGAGC |
AAAACTCCGC |
CTCAAAAAAA |
|
4321 |
AAAAAAAAAA |
AAAGTTAGTT |
GGGTGTGGTG |
GCACATGCCT |
GTGGTCCCAG |
CTACTTGGGA |
GCCTGAGGTA |
GGAGGATTGC |
|
4401 |
TTGAGCCCAG |
AAGTTCGAGG |
TTGCAGTGAG |
CCATGATCAT |
GCCACTGCAC |
TCTAACCTGG |
GTGACAGAGC |
AAGACCCTGT |
|
4481 |
CTTCCAAGAA |
AAAAAAAAGG |
GCTGGGATTG |
GTGGCTCATG |
CCTGTAATCC |
CAGCACTTTG |
GGAAGTCGTG |
GTGGGCAAAC |
|
4561 |
TGCTTGAGCA |
CAGGAGTTAA |
AGACCAGCCT |
GGGCAACGTG |
GCAAAACCCC |
GTCTCTACAA |
TAAATACAAA |
AATGAGCTGG |
|
4641 |
GTGTGGTAGC |
GTGCATCTGT |
ACCCCAGATA |
CTCAGGAGGC |
TGAGGTTGGG |
AAGATTGCTT |
GAGCCCAGTG |
GGTGGAGGCT |
|
4721 |
GCAGTGAGCC |
AAGATTCTGC |
CACTACCCTC |
CAGCTGGGTA |
ACAGAGTGAG |
CCCCTGTGTC |
AAAAGAAAAA |
AGAAAATGCA |
|
4801 |
GAGAGATCAG |
AGAAAAAAAA |
TGGGTTTGTG |
ATCCTGTTAT |
TCAGAGACAG |
AATCACCCTC |
TTCCCTCTCC |
ACTCCCCATA |
|
4881 |
AACACAACCA |
CAGTCTCACG |
TACACTGGGG |
TTTAGGAGGC |
GGGCGGACGG |
CACCGGGCCT |
CTTCCATGGT |
GGGAGAACAT |
|
4961 |
CTTACTTAGG |
TTGATTCAGG |
TGTCCTCAGC |
CTGGCTGGCA |
GAGAGCAGAG |
ACGGGAGCAG |
GCGCGGCAGG |
GCAGGCGGCA |
|
5041 |
CTCCCTGTTT |
GAGAGTAGAA |
CCTACATTTG |
TTTGGGGTTA |
GCTGGCAGAG |
AGGCTGATGA |
CATCGTGAAC |
TGGCTGAAGA |
|
5121 |
AGCGCACGGG |
CCCGGCTGCC |
ACCACCCTGC |
CTGACGGCGC |
AGCTGCAGAG |
TCCTTGGTGG |
AGTCCAGCGA |
GGTGGCTGTC |
|
5201 |
ATCGGCTTCT |
TCAAGGTAGA |
GACCAGAGCA |
TTCCAGTCTT |
CCTTCCTTCC |
GTCATCCATT |
GAGGAGGGGT |
CCACAGCCTG |
|
5281 |
CAGCTGGCAG |
TTCTGCACGT |
GTCCCCCCTG |
GCCTGGGCTC |
TACAGGGTCT |
GCCCTACTGC |
TTGGTGCTGC |
GGGATCATGA |
|
5361 |
CGACCTCTGC |
TTCTCCCTCC |
TCATTCAGGA |
CGTGGAGTCG |
GACTCTGCCA |
AGCAGTTTTT |
GCAGGCAGCA |
GAGGCCATCG |
|
5441 |
ATGACATACC |
ATTTGGGATC |
ACTTCCAACA |
GTGACGTGTT |
CTCCAAATAC |
CAGCTCGACA |
AAGATGGGGT |
TGTCCTCTTT |
|
5521 |
AAGAAGGTGA |
GTGGCCCCAG |
GCAGCTCTGC |
CCAGGTTAGT |
TCTGGGGTTG |
GTCTGCAGAG |
GGTGGCGTTG |
CTCTCCTTAT |
|
5601 |
CGTTAAGGGA |
ACCTTGCTCC |
TGGCACCTTT |
GGCCCAATAG |
GTATGTTCTG |
CAGCTTTGCC |
AAGTTGGGGT |
GTTGCGCTGA |
|
5681 |
TGCCTGTGGC |
CTGGTCTTCG |
CATTCAGCTG |
TGGGCTTTCT |
GGCCACTCTC |
CTGCCACAGC |
CGCCTCAGAA |
CTGCTTTTGA |
|
5761 |
CTGTTAGCTT |
TTTTTTTTTT |
TTCAATTATG |
ATAAAACATG |
TAACAACATT |
TGCCCTTTTT |
TTTTTTTTGG |
GAGAGTCTTG |
|
5841 |
CTCTGTCACC |
CATGCTGGAG |
TGCAGTTGCG |
TGATCTCGGC |
TCACTGCACC |
CTCCGCCTCC |
TGGGTTCAAG |
TGATTCTCCT |
|
5921 |
GCCTCGGCCT |
CCCGAGTAGC |
TGGGACTACA |
GGCATGCACT |
ACCATGCCCA |
GCTAATCTTT |
GTTTTAGTAG |
AGATGAGATT |
|
6001 |
TCACCATTTT |
GGCCAGGATG |
GTATCAATCT |
CTTGACCTCG |
TGATCCACCC |
ACCTCAGCCT |
CCCAAAGTGC |
TGGGATTACA |
|
6081 |
GATGTGAGCC |
ACCGCGCCTG |
GCCCCGTTTG |
CCTTTTTTTT |
TTTTGAGATA |
GGGTTTCTCA |
CTCTGCCGCC |
CAGGCTGGAG |
|
6161 |
TGCGGTGGTG |
CATTCTTAGC |
TCACTGCAAC |
TTCTGCTTCC |
CAGGCTCAAG |
TGATCCTCTA |
ACCTCAGCCT |
CCCGAGCAGC |
|
6241 |
TGGAACTACA |
GGCGCGTGCC |
ACCACGCTGG |
GCTGAGTTTT |
TGTATTTTTT |
GGTAGACATG |
GGGTTTTGCT |
GTGTTGTCCA |
|
6321 |
GGCTGGTCTT |
GAACTCCTGA |
GCTCCGAGTG |
GTCTGCCTGC |
CTCAGCCTCC |
CAAAATGGTG |
GGATCACAGG |
CGTGAGCCAC |
|
6401 |
CGCGCCCGGC |
CACGTTTTAA |
CTATTCTAAA |
GTATTAACTA |
CATTCAGAGC |
GTCCTGCAGC |
ATCCCTATTT |
CCAGAACGTT |
|
6481 |
TTCCTTACCC |
AGAGGTCTGG |
AGTATTGGCG |
CTCTGGGCGT |
GTTCCCACCC |
AGGTGGCACC |
CACACTTCGT |
ATGCCCCTCA |
|
6561 |
GCCCCAAGGG |
CATTATGACC |
CCTAAGGATC |
CACTGTGAAA |
ACTGCCATCC |
TGATCCCGCT |
GTCCTTCTCC |
TGGGGAAGGG |
|
6641 |
CTCCTGGACA |
GTGTGGCCTC |
TTAGCCGCCT |
CTCCCCGTTA |
CCAGGACACA |
GCAGGGTCCT |
TGTTTGCACA |
GTGTCAAGAT |
|
6721 |
GGGGCCTGTG |
GGTTTCTATT |
GTCTCTTTTC |
TCGGTCACCA |
GGCAGGTGGG |
GCTGGGGCCG |
CACACTTGTG |
CTCTAGGGAC |
|
6801 |
AGCTGAACCT |
GGATGTGGGT |
GATGGTGGGT |
TTGTGCCGCT |
CAGAGCCAGG |
GTAGTGTGGA |
TAGGAGAGAA |
AGTCTCGGGA |
|
6881 |
GAGGAGGGAC |
CTGGGGAGTG |
AGGGGCAGGA |
GGGCGGCCGG |
GGGCGATCGG |
ACCTTCAGCT |
CCTGTCCTGT |
GTCGTTTCCA |
|
6961 |
GAGGGGAGAA |
GAACCTTTTA |
GACAGCAGCA |
TCATAAGCCA |
AACGGTCAGA |
CGGGCAGCCT |
CTGGGATTGG |
GGCTGAGTGG |
|
7041 |
TTTTGGACAC |
AAGGAACCCA |
TTCACTTTCT |
TTTTTTTTTT |
CTTTTTCTTT |
TTTTGAGACA |
GAGTCTGTCT |
CTGTCGCCCA |
|
7121 |
GGCTGGAGTG |
CAGTGGCGCG |
ATCTCAGCTC |
ACTGCAAGCT |
CCGCCTCACG |
GGTTCACGCC |
ATTCTCCTGC |
CTCAGCCTCC |
|
7201 |
CGAGTAGCTG |
GGACTACAGG |
GCACCCGCCA |
CCACACCCAG |
CTAATTTTTT |
TGTATTTTTA |
GTAGAGACGG |
GGTTTCACCG |
|
7281 |
TGTTAGCCAG |
GGTGGTCTCG |
AACTCCTGAC |
TTCATGATCC |
GCCTGCCTTG |
GCCTCCCAAA |
GTGCTGGGAT |
TACAGGTGTG |
|
7361 |
AGCCACCGAC |
CCATTCACTT |
TCTGTAGTAA |
GCACGGAGCA |
TCCAGCACTC |
CCCGGGACCA |
GGTAGGAGGG |
AGGGTAGAGC |
|
7441 |
CAGCTTCCTG |
GAGCGCGGTG |
GTGCGGGTGG |
GTCAGGATGG |
CTTGCTTCCA |
TGTTCTTGTT |
CTTGTCATTT |
AGCTGTTTTT |
|
7521 |
GCAGCCACTT |
CCTCAAGATA |
ACTCAAGACA |
GTAGATGCTC |
TTTACCAGGA |
GACATTCTCT |
TGATGTCCCC |
TGAAGGGAAG |
|
7601 |
AGAGTTGCTA |
AATTAGTCTA |
TTGCCATTGC |
AAGGTGCCCA |
GGAAATTAAA |
CCTCTGACTA |
GAAAAAGCCA |
GTGAAGTGTT |
|
7681 |
GGATAAAGTT |
GAAGGAACCA |
AAGGTCTTTT |
TAGAGCTTTT |
ATTAAAGAGA |
AAGTTCCAAG |
TCTTGTCCTG |
AAACTTGGGT |
|
7761 |
TCTGGGGCAG |
ACTTTCTCCA |
GAGCCCACGA |
AACTCGCGAA |
CCCTGTGTTG |
TCACACAGCC |
CACAGCTCCT |
TCAGTGCCCT |
|
7841 |
GGATGTCTGC |
AGTGGCCGCT |
GTGACTCTAT |
GGACAGTTTG |
GTGATATGTG |
TCCGCGAAGG |
CTGGGAGGGT |
GGTTTCGTTG |
|
7921 |
CTCTTGCCTG |
CAAAGGCTCA |
CACGTGGCCA |
TCCATGTGGG |
CTCCCCACAG |
GTTAACGTTC |
TGTCTCCATT |
CTCTGGAGTC |
|
8001 |
CACTAGAGCT |
GGGCACAGAC |
AAACCCACTT |
GGGTTTGATG |
CCAGTTGGTG |
GGTGGCGCTT |
TCCTGCCCAC |
TGAAGGTGGC |
|
8081 |
TCAGGTGTCT |
GTGATGGCGA |
ACGTGGTCGT |
TGCAGGCAGC |
TCCGGGCAGA |
AGCCCGGCCC |
CCTGTCCGGT |
GCACGTGGTA |
|
8161 |
GGGGCTGCCC |
TGGAAGTCAG |
AGCTCACCCC |
GCAGCTGAGA |
GGATGAGTCC |
AACTTGTTCC |
TTTTCCACCA |
GGCTGTTAAC |
|
8241 |
CCCCATCCCT |
CAAAGTGAGT |
GAGGCCTTCC |
TGTGTGTGCA |
GTGGCCACAG |
TGGGCAGCCT |
GGCTGAGTCG |
GGTGCTCTTG |
|
8321 |
TGTGGCCCGC |
TCTTGCCTGC |
TCTTCCCTTC |
CTTCCCTAGC |
AGGCACTGCA |
ACCCAGGAAA |
ACCTCTGCGG |
GGGCTGCTGG |
|
8401 |
TCACTGGCAC |
CATGGGTGTG |
AGAGCTTCTG |
ATATTTCACT |
TTTTCTTCTC |
CCCCAGCTGG |
CAGCACTGGG |
CCTGTTATTG |
|
8481 |
TTGTTTGAGA |
AAGGGTCTCG |
CTCTGTTACC |
AGGCTGGAGT |
GCGGTGGCAC |
AATCACACTT |
CACTGCAGCT |
TCGACCTCCT |
|
8561 |
GGGCTCAAGC |
AGTCCTCCCA |
CCCCAGCCTG |
CTGAGTAGCT |
GGGACCACAG |
GTGTGCACCA |
CCGCACCTGG |
CTATTTTTTA |
|
8641 |
ATACTTCTGT |
GGAAATGGGG |
TTTCACCGTG |
TTGCCCAGGC |
TGGTCTCGAA |
CTTCTGAGCT |
CAAGCAATCC |
TCCTGCCTCA |
|
8721 |
GCCTGCCAAA |
ACATTGGGGT |
TACAGGCGTG |
AGCCATACCA |
CCCGGCCTGC |
ATTGGGCGTT |
TCTATGCTGT |
TTGTCACCGT |
|
8801 |
GGCCTGGCAG |
GGCTGGCCTC |
CTGCTGCTGT |
GCCCTGCTAA |
GTGGACCAGT |
TTCCCTTACC |
GCAGTAGACC |
GTGATTGAGT |
|
8881 |
GTGGTGTAGC |
ATCTCTGCTG |
AAACGATGGC |
AGCCCTGGGA |
CAGCTGCGGC |
AGTGTCTCCT |
CCTGGCTTCT |
TGGCATGAGA |
|
8961 |
TGGTGTAGGC |
GGCGCCTGAC |
ACTCTGGAGG |
GAGTCAGAGA |
CATTGTTCTG |
CAGGGGTGGG |
GGGATTTCGT |
TTGCCTGCAG |
|
9041 |
TTGCGGCCCG |
GCGTTACTCC |
TGGGCACAGT |
GCGAGCCCAC |
CCGTGCCACT |
GGGAGCACGG |
CTGTGATCAG |
CAGCTCATGT |
|
9121 |
TCCAAAGGAT |
TCCCTCCAAG |
GGCCTCCTGA |
ACTTGGCTTT |
TCTGCCTCTC |
CTTACCCGCC |
TTTGCCCCTC |
TGGGGAAGAG |
|
9201 |
GTAGCAGGCT |
TAGAAGCCTC |
TGCCCCTCGG |
GTTCGGGCAG |
GGTGGTTGGT |
GCTTGTGCCC |
TGGTTTAACA |
TCAGCTCCTT |
|
9281 |
GGCAGTCAGG |
CCAGGCCAGG |
GTGGCCCAGG |
CACAGCGTGC |
ACAAGAGCAG |
GGAGTGGCAC |
CTGGTCTCTA |
AGTTACTCTG |
|
9361 |
CAGACGCATA |
CATGGAAACG |
GCCAGTAAGA |
AAGGATGCGT |
GATTTATTTC |
CCCAAGTCTC |
ATTTAAGAAG |
TTTGAGGAGG |
|
9441 |
CCGGGTGCAG |
GGGCTCATGC |
CTGTAATCCC |
AGCACTTTGG |
GAGGCTGAGG |
TGGGCGGGTC |
ACGAGGTCAG |
GAGATCGAGA |
|
9521 |
CCATCCTGGC |
TAACACGGTG |
AAACCCCGTC |
TCTACTAAAA |
ATACAAAAAA |
TTAGCCGGGC |
GTGGTGGCGG |
GCACCTGTAG |
|
9601 |
TCCCAGCTAC |
TCAGGAGGCT |
GAGGCAGGAG |
AATGGCGTGA |
ACTCAGGAGG |
CGGAGCTTGC |
AGTGAGCCGA |
GAAGCACCAC |
|
9681 |
TGCACTCCAG |
CGTAGGCGAC |
AGAGCAAGAC |
TCTGTCTCAA |
AAAAAAAAAA |
GGTTCAAGGG |
GACCAGGCTT |
GGTGGCTCAC |
|
9761 |
ACCTGTGATC |
CCAGCACTTA |
GGGAGGCCAG |
GACAAGAGGA |
TTGCTTGAGG |
CCAGGAGTCT |
GAGACCAGCC |
TGGGCAACAT |
|
9841 |
AGTAAGACTG |
TCTCTACAAA |
AAGTAAAAAT |
AAAATAAATT |
TTTTTGGCTG |
GACACGCTGG |
CTCACGCCTG |
TAATCTCAGC |
|
9921 |
ACTTTGGGAG |
GCCAAGGCGG |
GCAGATCACC |
TGAAGTCAGG |
AGCTTGAGAC |
CAGCCTGACC |
AACATGGAGA |
AACCCCATCT |
|
10001 |
CTAATAAAAA |
TACAAAATTT |
GCCAGGCGTG |
TTGGCACATG |
CCTATAATCC |
CAGCTACTTG |
CGAGGCTGAG |
GCAGGAGAAT |
|
10081 |
TACTTGAACC |
CAAGAGGCGG |
AGATTGCAAT |
GAGGCAAGAT |
CGAGATCGTG |
CCATTGCACT |
CTAGCCAGGG |
AAACAAGAGC |
|
10161 |
AAAACTCAGT |
CTCAAAATAG |
ATAAAATAAA |
ATAAAATAAA |
TTAAATAAAT |
AAGTATTTTA |
AAAAGTAGTT |
CTGTCCAGGT |
|
10241 |
GCAGTGGCTC |
ACACCTGTAA |
TTCCAGCACT |
TTGGGAGGCC |
GAAGCAGGCC |
GATCACCTGA |
GGTCGGGAGT |
TCAAGACCAG |
|
10321 |
CCTGACCAAC |
ATGGAGAAAG |
AAACTCCATC |
TCTACTAAAA |
AAACAAAATT |
AGCCAGGCGT |
GGTGGCGCGT |
ACCTGTAATC |
|
10401 |
CCAGCTACTT |
GGGAGGCAGA |
GGCAGGAGAA |
TTGCTTTAAC |
CCGGGAGGCA |
GAGGTTGCAG |
TGAGCCAAGA |
TCCTGCCACT |
|
10481 |
GCACTCTAGC |
CTGGGCAACA |
AGAGTGAAAC |
TCCGTCTCAA |
AAAAAAAATA |
AAATAGTTTG |
GGCCGGGCTC |
AGTGGCTCAC |
|
10561 |
ACTTATAATC |
CCAGCACTTT |
AGGAGGCCGA |
GGCAAGTGGA |
TCACCTGGGG |
TCAGGAGTTC |
AAGACCAGCC |
TGGCCAACAT |
|
10641 |
GGTAAAACCC |
TGTCTCTACT |
AAAAATACAA |
AAATTAGCCA |
GGCGTGGTGG |
CGGGTGCCTG |
TGATCCCAGC |
TACTCGGGAG |
|
10721 |
GCTGAGGAAA |
GAGAATTGCT |
GGAACTCGGG |
AGGTGGAGGT |
TGCAGTGAGC |
CGAGATCATG |
CCATTGCACT |
CCAGCCTGGG |
|
10801 |
TGACAATAGC |
AAAACTCCGT |
CTCAAAAAAT |
AAAAAAATAA |
ATAAATTTTA |
AAAATAAATA |
AAAAAGTAGT |
TCGGGGGAAT |
|
10881 |
GATGGAAGGC |
TCACCCAAAA |
TTCTCACTTG |
AAGAGCCTCA |
GCACAGGGGC |
GCCTCTCTGA |
GCCTGAGCCG |
AGGCCGCATC |
|
10961 |
CTCAGCTGCA |
GGCTGACCCC |
TCGTGGTTGG |
GGCGCCACAG |
CCTCTCCATC |
TTGTGGCCTT |
AGGACTCCCT |
GAAGCTGCCG |
|
11041 |
TGCATTATTA |
GGGGTCCCCA |
GGGCCTTACA |
GGCACGTGGA |
TTGCGCCTTT |
CAGGATCTGT |
TGTATTAGAA |
ACTAGAACTA |
|
11121 |
AGAATGTAAA |
AAATCGTTTT |
TTTCTCTCTA |
TTTATTTTTA |
TTTTTTATTT |
TTTGAGATGA |
AGTCTTGCTC |
TGTTGCCAGG |
|
11201 |
CTCGAATGCA |
GTGGCACGAT |
CTGGGCTCAC |
TGCAACCTCC |
GCCTCCTGGG |
TTCAAGCGAT |
TCTCCTGCCT |
CAGCCTCCCA |
|
11281 |
AGTAGTTGGG |
ATTACAGGCA |
CCCGCCACCA |
CGCCCAGCTA |
ATTTTTGTAT |
TTCTAGTAAA |
GACGACGTTT |
TACCATGTTG |
|
11361 |
GATTAGGCTG |
CTCTGAAACT |
CCTGGCTTCA |
GGTGATCCGC |
CCGCCTCAGC |
CTCCCAAAGT |
ACTAGGATTA |
CAGGCGTGAG |
|
11441 |
CCACCACGCC |
TGGCCTGTTT |
CACTCGTCAC |
CCAGGCTGGA |
GTGCAGTGGT |
GCGATCTCAC |
TGCAATGTCT |
ACCTCCCAGA |
|
11521 |
CTCAAGCAAT |
CCTCCTGCCT |
CAGCCTCCCA |
AGTAGCTGGG |
ATTACAGGCA |
GGTGCCACCA |
TACCCAACTG |
ATTTTTGTAT |
|
11601 |
TTTTGTATTT |
TTTTTTTTTT |
TTGAGACAGA |
GTCTTGCTCT |
GTTGCCCAGG |
CTGGAGTGCA |
GTGGCACGAT |
CTTGGCTCAC |
|
11681 |
TGCAACCTCT |
GCCTCCCGGG |
TTCAAGTGTT |
TCTTCTGCCT |
CAGTCTCCCG |
AGCAGCTGGG |
ATTACAGGTG |
TGCACCACCA |
|
11761 |
CACCCAGCTA |
AATTTTTGTA |
TTTTTGGTAG |
AGTCGAGGTT |
TCACCATGTT |
GGCCAGGCTG |
GTCCCAAACT |
CGTGAACTCA |
|
11841 |
AGTGATCCAC |
CTGCCTCAGC |
CTCCCAAACT |
GCTAGGATTA |
CAGGTGTGAG |
CCACTGCACC |
TGGCCTTTTT |
TTTTTTTTTT |
|
11921 |
TTTTTTTTTT |
TGAGACAGAG |
TTTCGCTCTT |
GTCGCCCAGG |
CTGGAGTGCA |
GTGGTGCGAT |
CTCGGCTCAC |
CGCAACCTCC |
|
12001 |
GCCTCCCGGG |
TCCAAGCGAT |
TCTCCTGCCT |
CAGCCTCCCA |
AGTACCTGGG |
ATTACAGGTG |
CCCGCCACCA |
CGCCCAGCTT |
|
12081 |
ATTTTTGTAT |
TTTTAGTAGA |
GATGGGGTTT |
TACCGTGTTG |
GCCAGGCTGG |
TCTCGAACTC |
CTGGTCTCAA |
GCGATCCACC |
|
12161 |
CATCTTGGCC |
TCCCAAAGTG |
CTGGGCTTAC |
AGGCCTGAGC |
CACTGTAGCC |
AAATTTATTT |
TAAAAGAATA |
ATAATCCCAT |
|
12241 |
TACATGTTAT |
CATAAATCAC |
ATCTTTACGG |
AAACTAGTTT |
GTCCCAGACA |
AAAACAATTT |
AGTGCTGAGT |
GATGTTTTAG |
|
12321 |
ATTCCTCCAG |
CAGGATTCTC |
CTCTCCTGCC |
CTCCTGCTCT |
TGCGACACCA |
TGCGCCCAGC |
AGCCTCTGTG |
GGTCTGTGCC |
|
12401 |
GCAGCACGTG |
AGAGGGAGAG |
TGAAAAGGCA |
GAGGCCAGAG |
AAGTGTGAGT |
GTGAAAATAG |
CTTTGACCAG |
CCGGGCGCGG |
|
12481 |
TGGCTCACGC |
CTATAATCCC |
AGCACTTTGG |
GAGGCCGAGG |
CGGGCAGATC |
ACGAGGTAAG |
GAGATTGAGA |
CCATCCTGGC |
|
12561 |
TAACGTGGTG |
AAACCCATCT |
CTACTAAAAA |
TACCAAAAAT |
TAGCCGGTGG |
TGGCGGGCGC |
CTATAGTCCC |
AGCTACTTGG |
|
12641 |
GAGGCTGAGG |
CAGGAGAATG |
GCTTGAATCC |
GGGAGGCGGA |
GCTTGCAGTG |
AGTGGAGATC |
ACGCCACTGC |
ACTCCAGCCT |
|
12721 |
GGGAGACACA |
GCAAGACTCC |
ATCACAAAAA |
AAAAAAAAAA |
AAGATCAAGA |
CCTTCGTGGC |
CAACATGGTA |
ATACCCCATC |
|
12801 |
TCGACTAAAA |
ATACAAAAAA |
AAATTAGCCA |
GGCATGGTGG |
CAGGCGCCTG |
TAGTCCTAGC |
TACTCGGGAG |
GCTAAGGCAG |
|
12881 |
GAAAATCACT |
TGAACCTGGG |
AGGCAGAGGT |
TGCAGTGAGC |
CAAGATCATG |
CCACTGCACT |
CCAGCCTGGG |
CAACAGAGCG |
|
12961 |
AGACTGTCTC |
AAAAACAAAT |
AAAAAAGAAA |
ATAGCTTTGA |
CCACATGGGT |
GCCCTGAAAA |
GGCTAGGGGC |
CCTGGGGATG |
|
13041 |
GGCCACACTT |
CAAGGTTGCT |
GGAGGACCAC |
GTGCGCCAGG |
CTTGCATTGT |
GGATGTGAAG |
CCAGCTGTTG |
AATTCACGGG |
|
13121 |
CAGCCATAAG |
GCCGTGACCA |
GAGCCAGTCT |
TACTGTCGAG |
GAACAGCCAT |
GTCCTGTACC |
CATTGCTGGA |
CCTGTGGCTC |
|
13201 |
GAAACCCCAC |
GGGAGGGCAG |
GGCTGCTGGC |
TGCTGTGACT |
TCCCACGTCC |
AGGGAGACCC |
GGGCCCAGGG |
GCTGTGCTCC |
|
13281 |
CAGCAGCAAT |
GCTCAGTCAG |
GGCCCTCTTT |
CTGTCCACAG |
TTTGATGAAG |
GCCGGAACAA |
CTTTGAAGGG |
GAGGTCACCA |
|
13361 |
AGGAGAACCT |
GCTGGACTTT |
ATCAAACACA |
ACCAGCTGCC |
CCTTGTCATC |
GAGTTCACCG |
AGCAGGTGCG |
GCTGCCCTGC |
|
13441 |
AGCCCCCAGG |
CTGGGGATGG |
GGAGGAGTGG |
GTTGGGGGTG |
GCATGAGGCT |
CCCGGTCACC |
ACTGCTGTCC |
AGCGTCTGAC |
|
13521 |
ATTGGAGGCC |
ATTGGGAGAA |
TCTTCTGCAT |
TAATAGTGTG |
TGACTCAGAC |
CCAGTAACTT |
AAGTTTATAA |
CACAGACAGC |
|
13601 |
CCCGAAGATT |
TTTGGAGGTG |
AAATCAAGAC |
TCACATCCTG |
CTGTTCTTGC |
CCAAGAGTGT |
GTCTGACTAT |
GACGGCAAAC |
|
13681 |
TGAGCAACTT |
CAAAACAGCA |
GCCGAGAGCT |
TCAAGGGCAA |
GGTGAGCTGC |
TCTTCTGGGT |
GGCTCGTGGG |
GCGGTGCCAG |
|
13761 |
GGAGCAGTAC |
AACTCTTCCC |
GAGGTGACAA |
GTGTGGGGCC |
TTCCTGAGGC |
TGCCTGCTGG |
ACTGGATCGG |
GCACCTGATT |
|
13841 |
CAGGACAGCC |
TGGATGTAGG |
CGAGGGGCCA |
GCCAGGCCAG |
ACCTCTGCTC |
CCAGTCACCT |
GCTGGTCTGG |
TTGGGCAACC |
|
13921 |
CCGGAATCCA |
GGTTGGAGGA |
AAAAGGCACC |
GGAGCACGGT |
CTGCTTCCGA |
GCAAAGTCAG |
GGCTGGAGGC |
AGAGGCCAGC |
|
14001 |
CGTCTCCCGC |
ACCTGTGCAA |
CCTCCTTTTC |
TCCCCCAGAT |
CCTGTTCATC |
TTCATCGACA |
GCGACCACAC |
CGACAACCAG |
|
14081 |
CGCATCCTCG |
AGTTCTTTGG |
CCTGAAGAAG |
GAAGAGTGCC |
CGGCCGTGCG |
CCTCATCACC |
CTGGAGGAGG |
AGATGACCAA |
|
14161 |
GTACAAGCCC |
GAATCGGAGG |
AGCTGACGGC |
AGAGAGGATC |
ACAGAGTTCT |
GCCACCGCTT |
CCTGGAGGGC |
AAAATCAAGG |
|
14241 |
TGCAGGCGCG |
TCCCCGGGCC |
AGGGCTGCCT |
CCCGGGTAGA |
GCCGCCACCT |
TGGGGTCTCT |
GGGCTCTCAG |
TGTCCTGGGC |
|
14321 |
TTCGTCCTCA |
AAGTAAGATC |
TTCAGAGTAA |
GAGGACGGAG |
CCCAAGATGG |
TGTGAAGACC |
ACCATTCTTG |
TCACACCTTT |
|
14401 |
GGGGACCTGT |
AGGAGCCCTC |
TTGTCCACAC |
GGCTGGCCCA |
GATGCTCCTC |
AGGGAAACCT |
CCTGGCCGGC |
GAGGGTGCGG |
|
14481 |
CCTGCGTGGG |
CTCATTGGGC |
CCAGCCCACT |
GGCCCAGCCT |
TCTCTTTCTG |
GATTCTTGGC |
ACTGTTGGTG |
TCTGCGGCTG |
|
14561 |
CCACCCTGGT |
GTGCCCAGGG |
CAGCCACGCA |
CCTCAGTCCC |
GGGCAGGGTG |
AGGGTTGGGG |
AGCTCTGTGG |
TCCCCCAGGC |
|
14641 |
ATCGCCGCCC |
CTCACCGTGC |
CCTGCCTGCC |
CTCCAGCCCC |
ACCTGATGAG |
CCAGGAGCTG |
CCGGAGGACT |
GGGACAAGCA |
|
14721 |
GCCTGTCAAG |
GTGCTTGTTG |
GGAAGAACTT |
TGAAGACGTG |
GCTTTTGATG |
AGAAAAAAAA |
CGTCTTTGTG |
GAGTTCTGTA |
|
14801 |
AGTGTTGCCC |
TTTCAGCTCC |
CCAGGTCCGT |
GCCACCAGGG |
CACGCGCAAG |
GTAGGCAGAC |
TCCTGGGGAA |
GCGTCTGCCA |
|
14881 |
GTGGCTTTGT |
GTCCAGGACA |
CACCCTAGAA |
CTGCTTTCTT |
TTCAGATGCC |
CCATGGTGTG |
GTCACTGCAA |
ACAGTTGGCT |
|
14961 |
CCCATTTGGG |
ATAAACTGGG |
AGAGACGTAC |
AAGGACCATG |
AGAACATCGT |
CATCGCCAAG |
ATGGACTCGA |
CTGCCAACGA |
|
15041 |
GGTGGAGGCC |
GTCAAAGTGC |
ACAGCTTCCC |
CACACTCAAG |
TTCTTTCCTG |
CCAGTGCCGA |
CAGGACGGTG |
CGCCTTCCCT |
|
15121 |
CCAGACGGGG |
CTGGGGCGGG |
CTCTGGGAAG |
AGCAGTGGTC |
CCCACGCCAG |
CCTGGGTGCC |
TCTCTGGAAG |
GAGCCTGGGG |
|
15201 |
CTAGTCAGGT |
GGGAGAAGAG |
CTGTGGGAGC |
TCCTTCAGAA |
GCACCCACCT |
TTGTTTTTCG |
AGAGAGATGG |
GATCTCACTA |
|
15281 |
TATTGCCTAG |
GCTGTTTTTG |
AACTCCCAAT |
GTGCTGGATT |
CCGGGTGTGA |
GCCACCGGAC |
CGGGCCTGCT |
CTGCCTTTCT |
|
15361 |
CTAGGGAGGA |
GGAGCCAATG |
CCAGGAGATT |
GGCTCTGGGC |
CGGGACCAGC |
CCCTGCACTC |
ATCCCTGTCT |
TCCACAGGTC |
|
15441 |
ATTGATTACA |
ACGGGGAACG |
CACGCTGGAT |
GGTTTTAAGA |
AATTCCTGGA |
GAGCGGTGGC |
CAGGATGGGG |
CAGGGGATGA |
|
15521 |
TGACGTGAGT |
GGGGTCACAG |
CCCTGGGCTG |
GTCTCTGGGA |
TTCAGCCCCT |
TGGCCCCACG |
GGACATGCCC |
ACTTGGGCTC |
|
15601 |
GGCTTGTGAT |
GGGAATGGTG |
CCATTGGAGG |
CACGTCTGTG |
CGTCCAGGGC |
TCTGGGGCCC |
CACACCTTCG |
CCCAGCAGGG |
|
15681 |
CAAGGGATGT |
GCTCCCAGTG |
CTCTGCTGAA |
ACAGCTCTCC |
CTGGCCAGCA |
CTTCTTTAGG |
GGACGACGTT |
GAAGGCATCT |
|
15761 |
TTAAAAACCA |
GATTTTGCTT |
AGAAAACCTT |
TTCAGACACA |
TGGCACCGGT |
TCCTCACATG |
CTGCTAACTT |
GGACTCAGAG |
|
15841 |
GTCCATCCCC |
AGCCCGCTGG |
CGGGGCCCGG |
GCACAAGGGC |
GCAGACATGG |
GTCTGCCCTC |
GGGAGTTTAG |
TCCTGCCCGT |
|
15921 |
CAGCGCGTGG |
CCAGCAGCAG |
TGGTGACTCT |
GCCGGGGAAG |
CAGGAAGGCT |
GGTGTTGGGA |
GGATGCGGCA |
GGGGCGGGGT |
|
16001 |
GCTTGACCAT |
GGTGAAGGAG |
ATGAGGGCAT |
GGTCAGCCCA |
GGCTGGCAGG |
GCCTGCTGGG |
AGTGCTGGTG |
CACAGTGGGA |
|
16081 |
GGTGGGAATA |
CTCTGCCCAG |
GCTGGGGATG |
GAGGGCCGGC |
CCCTGGTGCT |
GCCTGTGGCC |
TGTCATTGGC |
CAGAGCCCAA |
|
16161 |
GGGTTGGAAG |
TAGAATGGTC |
ACACCCCAGT |
GGTGACTGTG |
CCTGGACTTC |
ATGCTGGGGC |
TGTTGTCCCT |
GCCGAGATGT |
|
16241 |
GGACCCTCTG |
TAGACTCCAT |
GGTGCCAAGC |
TGGGAGACAC |
ACATACGCCT |
TCCATGCCCA |
GTGGAAGGGT |
CCAAGGCTGA |
|
16321 |
GCTGGGATTC |
CCAGGGTTCC |
TGACACAGTG |
GGGGCTCTGA |
TTCATCCTGC |
TATAGGCAGA |
AATCCCCCAG |
GCGTCACTCT |
|
16401 |
TCCTGGGAAG |
ACCCCTGTGT |
AGCTCGTAAA |
GACCAGGGTT |
GGCTGCTCCA |
GTGGCCGGTG |
GCCATGTCCG |
CCGTGGCCGG |
|
16481 |
CTAGTCCCCG |
GTGTGAGCAG |
TGTGGGGTGC |
TGGGGGCCGG |
CAGCCTCAGC |
TGTTGCAGCC |
AACCTGCCCG |
CCCCGCCCCT |
|
16561 |
TTTCCTGTAT |
CCCAGGATCT |
CGAGGACCTG |
GAAGAAGCAG |
AGGAGCCAGA |
CATGGAGGAA |
GACGATGATC |
AGAAAGCTGT |
|
16641 |
GAAAGATGAA |
CTGTAATACG |
CAAAGCCAGA |
CCCGGGCGCT |
GCCGAGACCC |
CTCGGGGGCT |
GCACACCCAG |
CAGCAGCGCA |
|
16721 |
CGCCTCCGAA |
GCCTGCGGCC |
TCGCTTGAAG |
GAGGGCGTCG |
CCGGAAACCC |
AGGGAACCTC |
TCTGAAGTGA |
CACCTCACCC |
|
16801 |
CTACACACCG |
TCCGTTCACC |
CCCGTCTCTT |
CCTTCTGCTT |
TTCGGTTTTT |
GGAAAGGGAT |
CCATCTCCAG |
GCAGCCCACC |
|
16881 |
CTGGTGGGGC |
TTGTTTCCTG |
AAACCATGAT |
GTACTTTTTC |
ATACATGAGT |
CTGTCCAGAG |
TGCTTGCTAC |
CGTGTTCGGA |
|
16961 |
GTCTCGCTGC |
CTCCCTCCCG |
CGGGAGGTTT |
CTCCTCTTTT |
TGAAAATTCC |
GTCTGTGGGA |
TTTTTAGACA |
TTTTTCGACA |
|
17041 |
TCAGGGTATT |
TGTTCCACCT |
TGGCCAGGCC |
TCCTCGGAGA |
AGCTTGTCCC |
CCGTGTGGGA |
GGGACGGAGC |
CGGACTGGAC |
|
17121 |
ATGGTCACTC |
AGTACCGCCT |
GCAGTGTCGC |
CATGACTGAT |
CATGGCTCTT |
GCATTTTTGG |
GTAAATGGAG |
ACTTCCGGAT |
|
17201 |
CCTGTCAGGG |
TGTCCCCCAT |
GCCTGGAAGA |
GGAGCTGGTG |
GCTGCCAGCC |
CTGGGGCCCG |
GCACAGGCCT |
GGGCCTTCCC |
|
17281 |
CTTCCCTCAA |
GCCAGGGCTC |
CTCCTCCTGT |
CGTGGGCTCA |
TTGTGACCAC |
TGGCCTCTCT |
ACAGCACGGC |
CTGTGGCCTG |
|
17361 |
TTCAAGGCAG |
AACCACGACC |
CTTGACTCCC |
GGGTGGGGAG |
GTGGCCAAGG |
ATGCTGGAGC |
TGAATCAGAC |
GCTGACAGTT |
|
17441 |
CTTCAGGCAT |
TTCTATTTCA |
CAATCGAATT |
GAACACATTG |
GCCAAATAAA |
GTTGAAATTT |
TACCACCTGT |
C |
|
|
|
|
|
|
|
|
|
|
|
|
>ref|Gene_ID:5034|P4HB|NC_000017.10:79801033...79818543 (-)
AGCCTCGAAGTCCGCCGGCCAATCGAAGGCGGGCCCCAGCGGCGCGTGCGCGCCGCGGCCAGCGCGCGCGGGCGGGGGGG
CAGGCGCGCCCCGGACCCAGGATTTATAAAGGCGAGGCCGGGACCGGCGCGCGCTCTCGTCGCCCCCGCTGTCCCGGCGG
CGCCAACCGAAGCGCCCCGCCTGATCCGTGTCCGACATGCTGCGCCGCGCTCTGCTGTGCCTGGCCGTGGCCGCCCTGGT
GCGCGCCGACGCCCCCGAGGAGGAGGACCACGTCCTGGTGCTGCGGAAAAGCAACTTCGCGGAGGCGCTGGCGGCCCACA
AGTACCTGCTGGTGGAGTTCTGTGAGCGCCGGGCCTGGCGGGCGGGCGGGGCTCGGGGCCGCTGAGCCAGGCTCTTGGGA
CGCAGGCACGGCCCGGCAGCCCCCGGGGTCGGGACCCCCGGCGCGCCCGGAACTGAAGGGCTCCCTTGCTGCCCGCCTTG
GGGGCCATGTCAGGGGCTCCCTGGGGGTGAGAGCCGGGCTGGGAGCCGGGGGGCCGTCCCGATGCCCGCCCTGCGCACGG
CAGGATCTTCCTGTTAGTGCGAGAAGAACAGGATCGACCTGTGTGAGCAAACAGAGCAAGCCAGTGTCCAGCCTTGCCGA
GTCCTCCCCGGGGGCCGACCCGGAAGGCGCCCCGCTCCTGCCCACCCCACCTTCAGCTTGCCACAGCAGTCCTTCCGATG
GTGTCTCAGAGGAGAATCCCAAGCCTTCCTCACGTCCATAAGGATTTGAGCTCTGTCCCTGGGGTGGGAACCTTGTGCCT
GACACAGCTGCTAACTCTATTGACAGATCGTCTTATTCTCACCCTAAGCAACTCAGGGTCTGTGTGCACGGATGTGCATC
TCATACTCACAGCTAGTGCTGGTACAGGCCGCCACGGTCACTTCTTCACTTTTGTCTGTAGTCTTAAAGGGAAAGTCTAG
GGGAGATCCTTGCCTTAGTTGCCTGTGGGGAATAAGAAGTCAGCAACCATTGAAAGGTTTGTCTGGCTGTGCTGATGGTG
ACATAGCAGAGTGGGGGCTGGTGTTGGTTGGTGGTAGTTTGTCGTTCACGGCCTCTGTCCTGATATTGCCCAAGAACACC
AGGCTATCAGCTCAGCCTTGTGCGTTAGGAGGGGTTATCTTGGTGAGTAGATAAGGTTTTTATGAAGGGAAATGCCAGAG
GAAAAAGGGAAGCACTGCTGAGGGATCAAGGCTGCTTTAGGGATCAAGGCTGCTTTCTAGCATCTCATTTCCGCTTCCAG
ATGCCCCTTGGTGTGGCCACTGCAAGGCTCTGGCCCCTGAGTATGCCAAAGCCGCTGGGAAGCTGAAGGCAGAAGGTTCC
GAGATCAGGTTGGCCAAGGTGGACGCCACGGAGGAGTCTGACCTGGCCCAGCAGTACGGCGTGCGCGGCTATCCCACCAT
CAAGTTCTTCAGGAATGGAGACACGGCTTCCCCCAAGGAATATACAGGTGTGGCTGTGGCACTGCCCTTGAACTGTCTTT
AGAGAGGGACTGGCCTGCCGACTTGGAAGGGCAGGGGCAGCTGTCTGAGCGGTGGGGAGGCCGGGTTCCAGAGGCTTGGA
GGGATCCCGTTCCTCAGGCCGGGTTCTCTGTCCACTTGTGCCCTGAGGGTCTTGTGAGGAGTTTTCATCCTGGAAGAACC
TTCGAATGGAGAACCTGTCTTGTCTGGTGCCATCCTGGGGGCGGCTGGCTGAGGTCTGTTTGGAGACTGACCAGCGAGGA
GAGGGAGACTGGTGCCTTTCTGGCCACATGGGCTGTTTGTGCTGCCCCCTGTTCCTAGGAGCAGCAGCATGGCTGTAGCT
CTCGGGGTGACACCTGCTAGCTGTCGTGTGTAGCGTGCCAGCTGCCGCGTCACGTGCTTGCGTGATTTCTCCTCGCATCC
TCTCTGTGAGGCGGGCTTTTCCCAGACAAAGGGCTGAGGCTTAGAGCAGCTGAGTAACAGCATCGAGCCTGTCGGTAGCA
GACCTCGAATCCAAGCCCGCCCTGCTCCGTCTGTAGCACGGGCCCTCAGCTGCTCGTCTGCCGCCTCTGAAGCTGCCGTA
GTGGAGGCCGAGCCCCTGTGTAGCCAGATCAGTGACGGTGGCAAAAGAAAACTCGTCTATAAAACCGGTTCTGTGGCATG
TCAGATGCTCATGGTGGGTAGCCCGAAAGCAGCTCATTCCAGTGTTAGCAGGGTTCTCCCCTTAAGCAGGTGTGCCCTGT
CCTGTTCTAGTGGCAGCAACTCCGTGGTTGCCTCTCACTTGGCTGTTTTGATGTTTCCTTTTGTCCCCTTAAGCAGGTGT
GCCCTGTCCTGTTCTAGTGGCAGCAGCTCTGTGGTTGCCTCTCACTTGGCTGTTTTGATGTTTCCTTTTTTTTTTTTTTT
TTTTTTTTTTGAGACAGTCTCGCTCTGTCGTGCAGTGGTGCGATCTCTGCTCACTGCAACCTCTGCCTCCCGGGTACAAG
TGATTCTCCTGCCTCAGCCTCTTGAGTAGCTGGGGCTACAGGTGTGCACCACCACGCCTGGCTAATTTTTGTATTTTTAG
TAGAGACGGGGTTTCACCATGTTGGCCAGGATGGTCTCGATCGTGACCTCGTGATCCACCCGTCTCGGCCTCCTAGAGTG
CTAGGATTACAGGCGTGATTGATGTTCTCATTTTTTACTCATGTTCACTTTTGGAACGGGAAGTGCTCTGTCCAAAGTCA
CCGTGGTTCTGTCACTAACACTCAGGTTTGCAGATGAGATGAGCACTCCTAAATCCACTTGTCACTGGTGACCGTCTTGT
TAGCACTGGTGGAATCCTTCTACATCTGAATGGGGTTTTCCCGATTCGGACCAGGGAGTCAAGTTCTAGGAGGGAAAAAG
GAGAGGCATCATTCCTTAGCCTCAGTCTCCCAGGAGGAGGAAGTTTCTTTCCTGTCAGTTGACCGCCTTTGGTGGCATAA
AAGTGTGTCATCTTCTTCCTGGTCTGGGAGGCATGGAGGGTGGGCCATTGGCGAGAACTCCCGAAACGCCTGGGATAAGT
TAGGAGGCGCGGGAGTCAGGTGGGCGGGGCGTCAGCCTGTTCCCAGAGGTAGAAAACTGGCACTGAGTTTTGAGTTACTG
TCCACCTCTCTAAAATGAGGCCTGCTGTCAGACTCCTGGGTGAGAACATGTGTTTGATGGATGTACATGTCAAAGAAAAT
TTAGATAACGCAGAGGACAGGCTGGGTGCGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGACGGGTGGAT
CACCTGAGGTCAGGAGTTCGAGGCCAGCATGGCCAATATGGTGAAACCCCCGTCTCTACTAAAAATACAAAAATTAGCCG
GGCGTGGTGGTGCACGTCTGTAATCCCAGCTACTTGGGAGGCTGACACAGGAGAATCCCTTGAACCTGGGAGGTAAGGTT
GCAGTGAGCTGAGATCGTGCCACCGCACTCCAGCCTGGGTGACAGAGTGAGACTTCGTTTCAAAAAATAAAATTTTTAAA
ATGCAGAGGGCCATCCTGGGCAACATGGTGAAACCCTGTCTACAAAAAATACAAAAATTAGCTAGGGCTGGGCACAGTGG
TTCATCCCGGTAATCCCAGCACTTTGGAAGGCCGAGGTGGGCGGACTGCTTAATCCCAGGAGTTTGAGACCATCCTAGGC
AACGTGGCAAAACCCCATCTCTACAAAAAATAGAAAAATTGGCCGGGCATGGTGGCTCACGCATGTAATCCCAGCACTTT
GGGAGGCCGAGGCGGGCGGATCACGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACACAGTGAAACCCCGTCTCTACTAA
AAATACAAAAATTAGCTGGGCATGGTGGCGGGCGCCTGTAATACCAGCTATTTGGGAGGCCGAGGCAGGAGAATCACTTG
AACCTGGGAGATGGAGGTTGCAGTGAGCCAGGATGGCGCCACTGCATTCTCCAGCCTGGACGACAGAGGTAGATTCCGTC
TCAAAAAAAAAAAAAAAAAAGAAGAAACATTGCTGGGCGCAGTGGCTCACCTCTGTAATCCTAGCACTTTGGGAGACCGC
TGAGGCAGGTGTATCACCTGAGGTCAGGAGTGAGACCAGCCTGGCCAACATGGGGAACCCTGTCTCTGCTAAACATACAA
AAATTAGCCGGGTGTGGTGGCGGGCACCTATAATCCTAGCTACGCGGGAGGGTCAGGCAGGAGAATTGCTTGAACCCGGG
AGGTGGAGGTTGCAGTGAGCCGAGATCACACTATTGCACTCCAGCCTGAGCAACAAGAGCAAAACTCCGCCTCAAAAAAA
AAAAAAAAAAAAAGTTAGTTGGGTGTGGTGGCACATGCCTGTGGTCCCAGCTACTTGGGAGCCTGAGGTAGGAGGATTGC
TTGAGCCCAGAAGTTCGAGGTTGCAGTGAGCCATGATCATGCCACTGCACTCTAACCTGGGTGACAGAGCAAGACCCTGT
CTTCCAAGAAAAAAAAAAGGGCTGGGATTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAAGTCGTGGTGGGCAAAC
TGCTTGAGCACAGGAGTTAAAGACCAGCCTGGGCAACGTGGCAAAACCCCGTCTCTACAATAAATACAAAAATGAGCTGG
GTGTGGTAGCGTGCATCTGTACCCCAGATACTCAGGAGGCTGAGGTTGGGAAGATTGCTTGAGCCCAGTGGGTGGAGGCT
GCAGTGAGCCAAGATTCTGCCACTACCCTCCAGCTGGGTAACAGAGTGAGCCCCTGTGTCAAAAGAAAAAAGAAAATGCA
GAGAGATCAGAGAAAAAAAATGGGTTTGTGATCCTGTTATTCAGAGACAGAATCACCCTCTTCCCTCTCCACTCCCCATA
AACACAACCACAGTCTCACGTACACTGGGGTTTAGGAGGCGGGCGGACGGCACCGGGCCTCTTCCATGGTGGGAGAACAT
CTTACTTAGGTTGATTCAGGTGTCCTCAGCCTGGCTGGCAGAGAGCAGAGACGGGAGCAGGCGCGGCAGGGCAGGCGGCA
CTCCCTGTTTGAGAGTAGAACCTACATTTGTTTGGGGTTAGCTGGCAGAGAGGCTGATGACATCGTGAACTGGCTGAAGA
AGCGCACGGGCCCGGCTGCCACCACCCTGCCTGACGGCGCAGCTGCAGAGTCCTTGGTGGAGTCCAGCGAGGTGGCTGTC
ATCGGCTTCTTCAAGGTAGAGACCAGAGCATTCCAGTCTTCCTTCCTTCCGTCATCCATTGAGGAGGGGTCCACAGCCTG
CAGCTGGCAGTTCTGCACGTGTCCCCCCTGGCCTGGGCTCTACAGGGTCTGCCCTACTGCTTGGTGCTGCGGGATCATGA
CGACCTCTGCTTCTCCCTCCTCATTCAGGACGTGGAGTCGGACTCTGCCAAGCAGTTTTTGCAGGCAGCAGAGGCCATCG
ATGACATACCATTTGGGATCACTTCCAACAGTGACGTGTTCTCCAAATACCAGCTCGACAAAGATGGGGTTGTCCTCTTT
AAGAAGGTGAGTGGCCCCAGGCAGCTCTGCCCAGGTTAGTTCTGGGGTTGGTCTGCAGAGGGTGGCGTTGCTCTCCTTAT
CGTTAAGGGAACCTTGCTCCTGGCACCTTTGGCCCAATAGGTATGTTCTGCAGCTTTGCCAAGTTGGGGTGTTGCGCTGA
TGCCTGTGGCCTGGTCTTCGCATTCAGCTGTGGGCTTTCTGGCCACTCTCCTGCCACAGCCGCCTCAGAACTGCTTTTGA
CTGTTAGCTTTTTTTTTTTTTTCAATTATGATAAAACATGTAACAACATTTGCCCTTTTTTTTTTTTTGGGAGAGTCTTG
CTCTGTCACCCATGCTGGAGTGCAGTTGCGTGATCTCGGCTCACTGCACCCTCCGCCTCCTGGGTTCAAGTGATTCTCCT
GCCTCGGCCTCCCGAGTAGCTGGGACTACAGGCATGCACTACCATGCCCAGCTAATCTTTGTTTTAGTAGAGATGAGATT
TCACCATTTTGGCCAGGATGGTATCAATCTCTTGACCTCGTGATCCACCCACCTCAGCCTCCCAAAGTGCTGGGATTACA
GATGTGAGCCACCGCGCCTGGCCCCGTTTGCCTTTTTTTTTTTTGAGATAGGGTTTCTCACTCTGCCGCCCAGGCTGGAG
TGCGGTGGTGCATTCTTAGCTCACTGCAACTTCTGCTTCCCAGGCTCAAGTGATCCTCTAACCTCAGCCTCCCGAGCAGC
TGGAACTACAGGCGCGTGCCACCACGCTGGGCTGAGTTTTTGTATTTTTTGGTAGACATGGGGTTTTGCTGTGTTGTCCA
GGCTGGTCTTGAACTCCTGAGCTCCGAGTGGTCTGCCTGCCTCAGCCTCCCAAAATGGTGGGATCACAGGCGTGAGCCAC
CGCGCCCGGCCACGTTTTAACTATTCTAAAGTATTAACTACATTCAGAGCGTCCTGCAGCATCCCTATTTCCAGAACGTT
TTCCTTACCCAGAGGTCTGGAGTATTGGCGCTCTGGGCGTGTTCCCACCCAGGTGGCACCCACACTTCGTATGCCCCTCA
GCCCCAAGGGCATTATGACCCCTAAGGATCCACTGTGAAAACTGCCATCCTGATCCCGCTGTCCTTCTCCTGGGGAAGGG
CTCCTGGACAGTGTGGCCTCTTAGCCGCCTCTCCCCGTTACCAGGACACAGCAGGGTCCTTGTTTGCACAGTGTCAAGAT
GGGGCCTGTGGGTTTCTATTGTCTCTTTTCTCGGTCACCAGGCAGGTGGGGCTGGGGCCGCACACTTGTGCTCTAGGGAC
AGCTGAACCTGGATGTGGGTGATGGTGGGTTTGTGCCGCTCAGAGCCAGGGTAGTGTGGATAGGAGAGAAAGTCTCGGGA
GAGGAGGGACCTGGGGAGTGAGGGGCAGGAGGGCGGCCGGGGGCGATCGGACCTTCAGCTCCTGTCCTGTGTCGTTTCCA
GAGGGGAGAAGAACCTTTTAGACAGCAGCATCATAAGCCAAACGGTCAGACGGGCAGCCTCTGGGATTGGGGCTGAGTGG
TTTTGGACACAAGGAACCCATTCACTTTCTTTTTTTTTTTCTTTTTCTTTTTTTGAGACAGAGTCTGTCTCTGTCGCCCA
GGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGCAAGCTCCGCCTCACGGGTTCACGCCATTCTCCTGCCTCAGCCTCC
CGAGTAGCTGGGACTACAGGGCACCCGCCACCACACCCAGCTAATTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCG
TGTTAGCCAGGGTGGTCTCGAACTCCTGACTTCATGATCCGCCTGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGTGTG
AGCCACCGACCCATTCACTTTCTGTAGTAAGCACGGAGCATCCAGCACTCCCCGGGACCAGGTAGGAGGGAGGGTAGAGC
CAGCTTCCTGGAGCGCGGTGGTGCGGGTGGGTCAGGATGGCTTGCTTCCATGTTCTTGTTCTTGTCATTTAGCTGTTTTT
GCAGCCACTTCCTCAAGATAACTCAAGACAGTAGATGCTCTTTACCAGGAGACATTCTCTTGATGTCCCCTGAAGGGAAG
AGAGTTGCTAAATTAGTCTATTGCCATTGCAAGGTGCCCAGGAAATTAAACCTCTGACTAGAAAAAGCCAGTGAAGTGTT
GGATAAAGTTGAAGGAACCAAAGGTCTTTTTAGAGCTTTTATTAAAGAGAAAGTTCCAAGTCTTGTCCTGAAACTTGGGT
TCTGGGGCAGACTTTCTCCAGAGCCCACGAAACTCGCGAACCCTGTGTTGTCACACAGCCCACAGCTCCTTCAGTGCCCT
GGATGTCTGCAGTGGCCGCTGTGACTCTATGGACAGTTTGGTGATATGTGTCCGCGAAGGCTGGGAGGGTGGTTTCGTTG
CTCTTGCCTGCAAAGGCTCACACGTGGCCATCCATGTGGGCTCCCCACAGGTTAACGTTCTGTCTCCATTCTCTGGAGTC
CACTAGAGCTGGGCACAGACAAACCCACTTGGGTTTGATGCCAGTTGGTGGGTGGCGCTTTCCTGCCCACTGAAGGTGGC
TCAGGTGTCTGTGATGGCGAACGTGGTCGTTGCAGGCAGCTCCGGGCAGAAGCCCGGCCCCCTGTCCGGTGCACGTGGTA
GGGGCTGCCCTGGAAGTCAGAGCTCACCCCGCAGCTGAGAGGATGAGTCCAACTTGTTCCTTTTCCACCAGGCTGTTAAC
CCCCATCCCTCAAAGTGAGTGAGGCCTTCCTGTGTGTGCAGTGGCCACAGTGGGCAGCCTGGCTGAGTCGGGTGCTCTTG
TGTGGCCCGCTCTTGCCTGCTCTTCCCTTCCTTCCCTAGCAGGCACTGCAACCCAGGAAAACCTCTGCGGGGGCTGCTGG
TCACTGGCACCATGGGTGTGAGAGCTTCTGATATTTCACTTTTTCTTCTCCCCCAGCTGGCAGCACTGGGCCTGTTATTG
TTGTTTGAGAAAGGGTCTCGCTCTGTTACCAGGCTGGAGTGCGGTGGCACAATCACACTTCACTGCAGCTTCGACCTCCT
GGGCTCAAGCAGTCCTCCCACCCCAGCCTGCTGAGTAGCTGGGACCACAGGTGTGCACCACCGCACCTGGCTATTTTTTA
ATACTTCTGTGGAAATGGGGTTTCACCGTGTTGCCCAGGCTGGTCTCGAACTTCTGAGCTCAAGCAATCCTCCTGCCTCA
GCCTGCCAAAACATTGGGGTTACAGGCGTGAGCCATACCACCCGGCCTGCATTGGGCGTTTCTATGCTGTTTGTCACCGT
GGCCTGGCAGGGCTGGCCTCCTGCTGCTGTGCCCTGCTAAGTGGACCAGTTTCCCTTACCGCAGTAGACCGTGATTGAGT
GTGGTGTAGCATCTCTGCTGAAACGATGGCAGCCCTGGGACAGCTGCGGCAGTGTCTCCTCCTGGCTTCTTGGCATGAGA
TGGTGTAGGCGGCGCCTGACACTCTGGAGGGAGTCAGAGACATTGTTCTGCAGGGGTGGGGGGATTTCGTTTGCCTGCAG
TTGCGGCCCGGCGTTACTCCTGGGCACAGTGCGAGCCCACCCGTGCCACTGGGAGCACGGCTGTGATCAGCAGCTCATGT
TCCAAAGGATTCCCTCCAAGGGCCTCCTGAACTTGGCTTTTCTGCCTCTCCTTACCCGCCTTTGCCCCTCTGGGGAAGAG
GTAGCAGGCTTAGAAGCCTCTGCCCCTCGGGTTCGGGCAGGGTGGTTGGTGCTTGTGCCCTGGTTTAACATCAGCTCCTT
GGCAGTCAGGCCAGGCCAGGGTGGCCCAGGCACAGCGTGCACAAGAGCAGGGAGTGGCACCTGGTCTCTAAGTTACTCTG
CAGACGCATACATGGAAACGGCCAGTAAGAAAGGATGCGTGATTTATTTCCCCAAGTCTCATTTAAGAAGTTTGAGGAGG
CCGGGTGCAGGGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGGGTCACGAGGTCAGGAGATCGAGA
CCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGCGTGGTGGCGGGCACCTGTAG
TCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATGGCGTGAACTCAGGAGGCGGAGCTTGCAGTGAGCCGAGAAGCACCAC
TGCACTCCAGCGTAGGCGACAGAGCAAGACTCTGTCTCAAAAAAAAAAAAGGTTCAAGGGGACCAGGCTTGGTGGCTCAC
ACCTGTGATCCCAGCACTTAGGGAGGCCAGGACAAGAGGATTGCTTGAGGCCAGGAGTCTGAGACCAGCCTGGGCAACAT
AGTAAGACTGTCTCTACAAAAAGTAAAAATAAAATAAATTTTTTTGGCTGGACACGCTGGCTCACGCCTGTAATCTCAGC
ACTTTGGGAGGCCAAGGCGGGCAGATCACCTGAAGTCAGGAGCTTGAGACCAGCCTGACCAACATGGAGAAACCCCATCT
CTAATAAAAATACAAAATTTGCCAGGCGTGTTGGCACATGCCTATAATCCCAGCTACTTGCGAGGCTGAGGCAGGAGAAT
TACTTGAACCCAAGAGGCGGAGATTGCAATGAGGCAAGATCGAGATCGTGCCATTGCACTCTAGCCAGGGAAACAAGAGC
AAAACTCAGTCTCAAAATAGATAAAATAAAATAAAATAAATTAAATAAATAAGTATTTTAAAAAGTAGTTCTGTCCAGGT
GCAGTGGCTCACACCTGTAATTCCAGCACTTTGGGAGGCCGAAGCAGGCCGATCACCTGAGGTCGGGAGTTCAAGACCAG
CCTGACCAACATGGAGAAAGAAACTCCATCTCTACTAAAAAAACAAAATTAGCCAGGCGTGGTGGCGCGTACCTGTAATC
CCAGCTACTTGGGAGGCAGAGGCAGGAGAATTGCTTTAACCCGGGAGGCAGAGGTTGCAGTGAGCCAAGATCCTGCCACT
GCACTCTAGCCTGGGCAACAAGAGTGAAACTCCGTCTCAAAAAAAAAATAAAATAGTTTGGGCCGGGCTCAGTGGCTCAC
ACTTATAATCCCAGCACTTTAGGAGGCCGAGGCAAGTGGATCACCTGGGGTCAGGAGTTCAAGACCAGCCTGGCCAACAT
GGTAAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCAGGCGTGGTGGCGGGTGCCTGTGATCCCAGCTACTCGGGAG
GCTGAGGAAAGAGAATTGCTGGAACTCGGGAGGTGGAGGTTGCAGTGAGCCGAGATCATGCCATTGCACTCCAGCCTGGG
TGACAATAGCAAAACTCCGTCTCAAAAAATAAAAAAATAAATAAATTTTAAAAATAAATAAAAAAGTAGTTCGGGGGAAT
GATGGAAGGCTCACCCAAAATTCTCACTTGAAGAGCCTCAGCACAGGGGCGCCTCTCTGAGCCTGAGCCGAGGCCGCATC
CTCAGCTGCAGGCTGACCCCTCGTGGTTGGGGCGCCACAGCCTCTCCATCTTGTGGCCTTAGGACTCCCTGAAGCTGCCG
TGCATTATTAGGGGTCCCCAGGGCCTTACAGGCACGTGGATTGCGCCTTTCAGGATCTGTTGTATTAGAAACTAGAACTA
AGAATGTAAAAAATCGTTTTTTTCTCTCTATTTATTTTTATTTTTTATTTTTTGAGATGAAGTCTTGCTCTGTTGCCAGG
CTCGAATGCAGTGGCACGATCTGGGCTCACTGCAACCTCCGCCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCA
AGTAGTTGGGATTACAGGCACCCGCCACCACGCCCAGCTAATTTTTGTATTTCTAGTAAAGACGACGTTTTACCATGTTG
GATTAGGCTGCTCTGAAACTCCTGGCTTCAGGTGATCCGCCCGCCTCAGCCTCCCAAAGTACTAGGATTACAGGCGTGAG
CCACCACGCCTGGCCTGTTTCACTCGTCACCCAGGCTGGAGTGCAGTGGTGCGATCTCACTGCAATGTCTACCTCCCAGA
CTCAAGCAATCCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCAGGTGCCACCATACCCAACTGATTTTTGTAT
TTTTGTATTTTTTTTTTTTTTTGAGACAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACGATCTTGGCTCAC
TGCAACCTCTGCCTCCCGGGTTCAAGTGTTTCTTCTGCCTCAGTCTCCCGAGCAGCTGGGATTACAGGTGTGCACCACCA
CACCCAGCTAAATTTTTGTATTTTTGGTAGAGTCGAGGTTTCACCATGTTGGCCAGGCTGGTCCCAAACTCGTGAACTCA
AGTGATCCACCTGCCTCAGCCTCCCAAACTGCTAGGATTACAGGTGTGAGCCACTGCACCTGGCCTTTTTTTTTTTTTTT
TTTTTTTTTTTGAGACAGAGTTTCGCTCTTGTCGCCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCTCACCGCAACCTCC
GCCTCCCGGGTCCAAGCGATTCTCCTGCCTCAGCCTCCCAAGTACCTGGGATTACAGGTGCCCGCCACCACGCCCAGCTT
ATTTTTGTATTTTTAGTAGAGATGGGGTTTTACCGTGTTGGCCAGGCTGGTCTCGAACTCCTGGTCTCAAGCGATCCACC
CATCTTGGCCTCCCAAAGTGCTGGGCTTACAGGCCTGAGCCACTGTAGCCAAATTTATTTTAAAAGAATAATAATCCCAT
TACATGTTATCATAAATCACATCTTTACGGAAACTAGTTTGTCCCAGACAAAAACAATTTAGTGCTGAGTGATGTTTTAG
ATTCCTCCAGCAGGATTCTCCTCTCCTGCCCTCCTGCTCTTGCGACACCATGCGCCCAGCAGCCTCTGTGGGTCTGTGCC
GCAGCACGTGAGAGGGAGAGTGAAAAGGCAGAGGCCAGAGAAGTGTGAGTGTGAAAATAGCTTTGACCAGCCGGGCGCGG
TGGCTCACGCCTATAATCCCAGCACTTTGGGAGGCCGAGGCGGGCAGATCACGAGGTAAGGAGATTGAGACCATCCTGGC
TAACGTGGTGAAACCCATCTCTACTAAAAATACCAAAAATTAGCCGGTGGTGGCGGGCGCCTATAGTCCCAGCTACTTGG
GAGGCTGAGGCAGGAGAATGGCTTGAATCCGGGAGGCGGAGCTTGCAGTGAGTGGAGATCACGCCACTGCACTCCAGCCT
GGGAGACACAGCAAGACTCCATCACAAAAAAAAAAAAAAAAAGATCAAGACCTTCGTGGCCAACATGGTAATACCCCATC
TCGACTAAAAATACAAAAAAAAATTAGCCAGGCATGGTGGCAGGCGCCTGTAGTCCTAGCTACTCGGGAGGCTAAGGCAG
GAAAATCACTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCCAAGATCATGCCACTGCACTCCAGCCTGGGCAACAGAGCG
AGACTGTCTCAAAAACAAATAAAAAAGAAAATAGCTTTGACCACATGGGTGCCCTGAAAAGGCTAGGGGCCCTGGGGATG
GGCCACACTTCAAGGTTGCTGGAGGACCACGTGCGCCAGGCTTGCATTGTGGATGTGAAGCCAGCTGTTGAATTCACGGG
CAGCCATAAGGCCGTGACCAGAGCCAGTCTTACTGTCGAGGAACAGCCATGTCCTGTACCCATTGCTGGACCTGTGGCTC
GAAACCCCACGGGAGGGCAGGGCTGCTGGCTGCTGTGACTTCCCACGTCCAGGGAGACCCGGGCCCAGGGGCTGTGCTCC
CAGCAGCAATGCTCAGTCAGGGCCCTCTTTCTGTCCACAGTTTGATGAAGGCCGGAACAACTTTGAAGGGGAGGTCACCA
AGGAGAACCTGCTGGACTTTATCAAACACAACCAGCTGCCCCTTGTCATCGAGTTCACCGAGCAGGTGCGGCTGCCCTGC
AGCCCCCAGGCTGGGGATGGGGAGGAGTGGGTTGGGGGTGGCATGAGGCTCCCGGTCACCACTGCTGTCCAGCGTCTGAC
ATTGGAGGCCATTGGGAGAATCTTCTGCATTAATAGTGTGTGACTCAGACCCAGTAACTTAAGTTTATAACACAGACAGC
CCCGAAGATTTTTGGAGGTGAAATCAAGACTCACATCCTGCTGTTCTTGCCCAAGAGTGTGTCTGACTATGACGGCAAAC
TGAGCAACTTCAAAACAGCAGCCGAGAGCTTCAAGGGCAAGGTGAGCTGCTCTTCTGGGTGGCTCGTGGGGCGGTGCCAG
GGAGCAGTACAACTCTTCCCGAGGTGACAAGTGTGGGGCCTTCCTGAGGCTGCCTGCTGGACTGGATCGGGCACCTGATT
CAGGACAGCCTGGATGTAGGCGAGGGGCCAGCCAGGCCAGACCTCTGCTCCCAGTCACCTGCTGGTCTGGTTGGGCAACC
CCGGAATCCAGGTTGGAGGAAAAAGGCACCGGAGCACGGTCTGCTTCCGAGCAAAGTCAGGGCTGGAGGCAGAGGCCAGC
CGTCTCCCGCACCTGTGCAACCTCCTTTTCTCCCCCAGATCCTGTTCATCTTCATCGACAGCGACCACACCGACAACCAG
CGCATCCTCGAGTTCTTTGGCCTGAAGAAGGAAGAGTGCCCGGCCGTGCGCCTCATCACCCTGGAGGAGGAGATGACCAA
GTACAAGCCCGAATCGGAGGAGCTGACGGCAGAGAGGATCACAGAGTTCTGCCACCGCTTCCTGGAGGGCAAAATCAAGG
TGCAGGCGCGTCCCCGGGCCAGGGCTGCCTCCCGGGTAGAGCCGCCACCTTGGGGTCTCTGGGCTCTCAGTGTCCTGGGC
TTCGTCCTCAAAGTAAGATCTTCAGAGTAAGAGGACGGAGCCCAAGATGGTGTGAAGACCACCATTCTTGTCACACCTTT
GGGGACCTGTAGGAGCCCTCTTGTCCACACGGCTGGCCCAGATGCTCCTCAGGGAAACCTCCTGGCCGGCGAGGGTGCGG
CCTGCGTGGGCTCATTGGGCCCAGCCCACTGGCCCAGCCTTCTCTTTCTGGATTCTTGGCACTGTTGGTGTCTGCGGCTG
CCACCCTGGTGTGCCCAGGGCAGCCACGCACCTCAGTCCCGGGCAGGGTGAGGGTTGGGGAGCTCTGTGGTCCCCCAGGC
ATCGCCGCCCCTCACCGTGCCCTGCCTGCCCTCCAGCCCCACCTGATGAGCCAGGAGCTGCCGGAGGACTGGGACAAGCA
GCCTGTCAAGGTGCTTGTTGGGAAGAACTTTGAAGACGTGGCTTTTGATGAGAAAAAAAACGTCTTTGTGGAGTTCTGTA
AGTGTTGCCCTTTCAGCTCCCCAGGTCCGTGCCACCAGGGCACGCGCAAGGTAGGCAGACTCCTGGGGAAGCGTCTGCCA
GTGGCTTTGTGTCCAGGACACACCCTAGAACTGCTTTCTTTTCAGATGCCCCATGGTGTGGTCACTGCAAACAGTTGGCT
CCCATTTGGGATAAACTGGGAGAGACGTACAAGGACCATGAGAACATCGTCATCGCCAAGATGGACTCGACTGCCAACGA
GGTGGAGGCCGTCAAAGTGCACAGCTTCCCCACACTCAAGTTCTTTCCTGCCAGTGCCGACAGGACGGTGCGCCTTCCCT
CCAGACGGGGCTGGGGCGGGCTCTGGGAAGAGCAGTGGTCCCCACGCCAGCCTGGGTGCCTCTCTGGAAGGAGCCTGGGG
CTAGTCAGGTGGGAGAAGAGCTGTGGGAGCTCCTTCAGAAGCACCCACCTTTGTTTTTCGAGAGAGATGGGATCTCACTA
TATTGCCTAGGCTGTTTTTGAACTCCCAATGTGCTGGATTCCGGGTGTGAGCCACCGGACCGGGCCTGCTCTGCCTTTCT
CTAGGGAGGAGGAGCCAATGCCAGGAGATTGGCTCTGGGCCGGGACCAGCCCCTGCACTCATCCCTGTCTTCCACAGGTC
ATTGATTACAACGGGGAACGCACGCTGGATGGTTTTAAGAAATTCCTGGAGAGCGGTGGCCAGGATGGGGCAGGGGATGA
TGACGTGAGTGGGGTCACAGCCCTGGGCTGGTCTCTGGGATTCAGCCCCTTGGCCCCACGGGACATGCCCACTTGGGCTC
GGCTTGTGATGGGAATGGTGCCATTGGAGGCACGTCTGTGCGTCCAGGGCTCTGGGGCCCCACACCTTCGCCCAGCAGGG
CAAGGGATGTGCTCCCAGTGCTCTGCTGAAACAGCTCTCCCTGGCCAGCACTTCTTTAGGGGACGACGTTGAAGGCATCT
TTAAAAACCAGATTTTGCTTAGAAAACCTTTTCAGACACATGGCACCGGTTCCTCACATGCTGCTAACTTGGACTCAGAG
GTCCATCCCCAGCCCGCTGGCGGGGCCCGGGCACAAGGGCGCAGACATGGGTCTGCCCTCGGGAGTTTAGTCCTGCCCGT
CAGCGCGTGGCCAGCAGCAGTGGTGACTCTGCCGGGGAAGCAGGAAGGCTGGTGTTGGGAGGATGCGGCAGGGGCGGGGT
GCTTGACCATGGTGAAGGAGATGAGGGCATGGTCAGCCCAGGCTGGCAGGGCCTGCTGGGAGTGCTGGTGCACAGTGGGA
GGTGGGAATACTCTGCCCAGGCTGGGGATGGAGGGCCGGCCCCTGGTGCTGCCTGTGGCCTGTCATTGGCCAGAGCCCAA
GGGTTGGAAGTAGAATGGTCACACCCCAGTGGTGACTGTGCCTGGACTTCATGCTGGGGCTGTTGTCCCTGCCGAGATGT
GGACCCTCTGTAGACTCCATGGTGCCAAGCTGGGAGACACACATACGCCTTCCATGCCCAGTGGAAGGGTCCAAGGCTGA
GCTGGGATTCCCAGGGTTCCTGACACAGTGGGGGCTCTGATTCATCCTGCTATAGGCAGAAATCCCCCAGGCGTCACTCT
TCCTGGGAAGACCCCTGTGTAGCTCGTAAAGACCAGGGTTGGCTGCTCCAGTGGCCGGTGGCCATGTCCGCCGTGGCCGG
CTAGTCCCCGGTGTGAGCAGTGTGGGGTGCTGGGGGCCGGCAGCCTCAGCTGTTGCAGCCAACCTGCCCGCCCCGCCCCT
TTTCCTGTATCCCAGGATCTCGAGGACCTGGAAGAAGCAGAGGAGCCAGACATGGAGGAAGACGATGATCAGAAAGCTGT
GAAAGATGAACTGTAATACGCAAAGCCAGACCCGGGCGCTGCCGAGACCCCTCGGGGGCTGCACACCCAGCAGCAGCGCA
CGCCTCCGAAGCCTGCGGCCTCGCTTGAAGGAGGGCGTCGCCGGAAACCCAGGGAACCTCTCTGAAGTGACACCTCACCC
CTACACACCGTCCGTTCACCCCCGTCTCTTCCTTCTGCTTTTCGGTTTTTGGAAAGGGATCCATCTCCAGGCAGCCCACC
CTGGTGGGGCTTGTTTCCTGAAACCATGATGTACTTTTTCATACATGAGTCTGTCCAGAGTGCTTGCTACCGTGTTCGGA
GTCTCGCTGCCTCCCTCCCGCGGGAGGTTTCTCCTCTTTTTGAAAATTCCGTCTGTGGGATTTTTAGACATTTTTCGACA
TCAGGGTATTTGTTCCACCTTGGCCAGGCCTCCTCGGAGAAGCTTGTCCCCCGTGTGGGAGGGACGGAGCCGGACTGGAC
ATGGTCACTCAGTACCGCCTGCAGTGTCGCCATGACTGATCATGGCTCTTGCATTTTTGGGTAAATGGAGACTTCCGGAT
CCTGTCAGGGTGTCCCCCATGCCTGGAAGAGGAGCTGGTGGCTGCCAGCCCTGGGGCCCGGCACAGGCCTGGGCCTTCCC
CTTCCCTCAAGCCAGGGCTCCTCCTCCTGTCGTGGGCTCATTGTGACCACTGGCCTCTCTACAGCACGGCCTGTGGCCTG
TTCAAGGCAGAACCACGACCCTTGACTCCCGGGTGGGGAGGTGGCCAAGGATGCTGGAGCTGAATCAGACGCTGACAGTT
CTTCAGGCATTTCTATTTCACAATCGAATTGAACACATTGGCCAAATAAAGTTGAAATTTTACCACCTGTC
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Primary source |
RefSeq : NM_000918.3 (GI:121256637)
|
Name |
Prolyl 4-hydroxylase, beta polypeptide (P4HB)
|
Type of transcript |
mRNA
|
Organism |
Homo sapiens
|
Length |
2596 nt
|
Map |
17q25
|
Location |
Chromosome 17 (NC_000017.10) strand : -
79801033...79801967 | 79803019...79803105 | 79803436...79803617 | 79803746...79803866 |
79804304...79804504 | 79804822...79804947 | 79805118...79805222 | 79813017...79813154 |
79813328...79813461 | 79817056...79817262 | 79818202...79818543 |
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon(s) |
Start |
End |
Length |
is coding |
Exon 1
|
1
|
342
|
342
|
1
|
Exon 2
|
343
|
549
|
207
|
1
|
Exon 3
|
550
|
683
|
134
|
1
|
Exon 4
|
684
|
821
|
138
|
1
|
Exon 5
|
822
|
926
|
105
|
1
|
Exon 6
|
927
|
1052
|
126
|
1
|
Exon 7
|
1053
|
1253
|
201
|
1
|
Exon 8
|
1254
|
1374
|
121
|
1
|
Exon 9
|
1375
|
1556
|
182
|
1
|
Exon 10
|
1557
|
1643
|
87
|
1
|
Exon 11
|
1644
|
2578
|
935
|
1
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Primary source |
NCBI : CCDS11787
|
Nucleotide |
P4HB, mRNA isoform 1[NM_000918.3] : 198...1724
|
Length |
1527
|
Location |
Chromosome 17 (NC_000017.10) strand : -
79801887...79801967 | 79803019...79803105 | 79803436...79803617 | 79803746...79803866 |
79804304...79804504 | 79804822...79804947 | 79805118...79805222 | 79813017...79813154 |
79813328...79813461 | 79817056...79817262 | 79818202...79818346 |
|
Start codon |
1
|
Translation |
NP_000909.2 |
|
|
|
|
|
|
|
|
|
|
|
|
1 |
GAGCCTCGAA |
GTCCGCCGGC |
CAATCGAAGG |
CGGGCCCCAG |
CGGCGCGTGC |
GCGCCGCGGC |
CAGCGCGCGC |
GGGCGGGGGG |
|
81 |
GCAGGCGCGC |
CCCGGACCCA |
GGATTTATAA |
AGGCGAGGCC |
GGGACCGGCG |
CGCGCTCTCG |
TCGCCCCCGC |
TGTCCCGGCG |
|
161 |
GCGCCAACCG |
AAGCGCCCCG |
CCTGATCCGT |
GTCCGACATG |
CTGCGCCGCG |
CTCTGCTGTG |
CCTGGCCGTG |
GCCGCCCTGG |
|
241 |
TGCGCGCCGA |
CGCCCCCGAG |
GAGGAGGACC |
ACGTCCTGGT |
GCTGCGGAAA |
AGCAACTTCG |
CGGAGGCGCT |
GGCGGCCCAC |
|
321 |
AAGTACCTGC |
TGGTGGAGTT |
CTATGCCCCT |
TGGTGTGGCC |
ACTGCAAGGC |
TCTGGCCCCT |
GAGTATGCCA |
AAGCCGCTGG |
|
401 |
GAAGCTGAAG |
GCAGAAGGTT |
CCGAGATCAG |
GTTGGCCAAG |
GTGGACGCCA |
CGGAGGAGTC |
TGACCTGGCC |
CAGCAGTACG |
|
481 |
GCGTGCGCGG |
CTATCCCACC |
ATCAAGTTCT |
TCAGGAATGG |
AGACACGGCT |
TCCCCCAAGG |
AATATACAGC |
TGGCAGAGAG |
|
561 |
GCTGATGACA |
TCGTGAACTG |
GCTGAAGAAG |
CGCACGGGCC |
CGGCTGCCAC |
CACCCTGCCT |
GACGGCGCAG |
CTGCAGAGTC |
|
641 |
CTTGGTGGAG |
TCCAGCGAGG |
TGGCTGTCAT |
CGGCTTCTTC |
AAGGACGTGG |
AGTCGGACTC |
TGCCAAGCAG |
TTTTTGCAGG |
|
721 |
CAGCAGAGGC |
CATCGATGAC |
ATACCATTTG |
GGATCACTTC |
CAACAGTGAC |
GTGTTCTCCA |
AATACCAGCT |
CGACAAAGAT |
|
801 |
GGGGTTGTCC |
TCTTTAAGAA |
GTTTGATGAA |
GGCCGGAACA |
ACTTTGAAGG |
GGAGGTCACC |
AAGGAGAACC |
TGCTGGACTT |
|
881 |
TATCAAACAC |
AACCAGCTGC |
CCCTTGTCAT |
CGAGTTCACC |
GAGCAGACAG |
CCCCGAAGAT |
TTTTGGAGGT |
GAAATCAAGA |
|
961 |
CTCACATCCT |
GCTGTTCTTG |
CCCAAGAGTG |
TGTCTGACTA |
TGACGGCAAA |
CTGAGCAACT |
TCAAAACAGC |
AGCCGAGAGC |
|
1041 |
TTCAAGGGCA |
AGATCCTGTT |
CATCTTCATC |
GACAGCGACC |
ACACCGACAA |
CCAGCGCATC |
CTCGAGTTCT |
TTGGCCTGAA |
|
1121 |
GAAGGAAGAG |
TGCCCGGCCG |
TGCGCCTCAT |
CACCCTGGAG |
GAGGAGATGA |
CCAAGTACAA |
GCCCGAATCG |
GAGGAGCTGA |
|
1201 |
CGGCAGAGAG |
GATCACAGAG |
TTCTGCCACC |
GCTTCCTGGA |
GGGCAAAATC |
AAGCCCCACC |
TGATGAGCCA |
GGAGCTGCCG |
|
1281 |
GAGGACTGGG |
ACAAGCAGCC |
TGTCAAGGTG |
CTTGTTGGGA |
AGAACTTTGA |
AGACGTGGCT |
TTTGATGAGA |
AAAAAAACGT |
|
1361 |
CTTTGTGGAG |
TTCTATGCCC |
CATGGTGTGG |
TCACTGCAAA |
CAGTTGGCTC |
CCATTTGGGA |
TAAACTGGGA |
GAGACGTACA |
|
1441 |
AGGACCATGA |
GAACATCGTC |
ATCGCCAAGA |
TGGACTCGAC |
TGCCAACGAG |
GTGGAGGCCG |
TCAAAGTGCA |
CAGCTTCCCC |
|
1521 |
ACACTCAAGT |
TCTTTCCTGC |
CAGTGCCGAC |
AGGACGGTCA |
TTGATTACAA |
CGGGGAACGC |
ACGCTGGATG |
GTTTTAAGAA |
|
1601 |
ATTCCTGGAG |
AGCGGTGGCC |
AGGATGGGGC |
AGGGGATGAT |
GACGATCTCG |
AGGACCTGGA |
AGAAGCAGAG |
GAGCCAGACA |
|
1681 |
TGGAGGAAGA |
CGATGATCAG |
AAAGCTGTGA |
AAGATGAACT |
GTAATACGCA |
AAGCCAGACC |
CGGGCGCTGC |
CGAGACCCCT |
|
1761 |
CGGGGGCTGC |
ACACCCAGCA |
GCAGCGCACG |
CCTCCGAAGC |
CTGCGGCCTC |
GCTTGAAGGA |
GGGCGTCGCC |
GGAAACCCAG |
|
1841 |
GGAACCTCTC |
TGAAGTGACA |
CCTCACCCCT |
ACACACCGTC |
CGTTCACCCC |
CGTCTCTTCC |
TTCTGCTTTT |
CGGTTTTTGG |
|
1921 |
AAAGGGATCC |
ATCTCCAGGC |
AGCCCACCCT |
GGTGGGGCTT |
GTTTCCTGAA |
ACCATGATGT |
ACTTTTTCAT |
ACATGAGTCT |
|
2001 |
GTCCAGAGTG |
CTTGCTACCG |
TGTTCGGAGT |
CTCGCTGCCT |
CCCTCCCGCG |
GGAGGTTTCT |
CCTCTTTTTG |
AAAATTCCGT |
|
2081 |
CTGTGGGATT |
TTTAGACATT |
TTTCGACATC |
AGGGTATTTG |
TTCCACCTTG |
GCCAGGCCTC |
CTCGGAGAAG |
CTTGTCCCCC |
|
2161 |
GTGTGGGAGG |
GACGGAGCCG |
GACTGGACAT |
GGTCACTCAG |
TACCGCCTGC |
AGTGTCGCCA |
TGACTGATCA |
TGGCTCTTGC |
|
2241 |
ATTTTTGGGT |
AAATGGAGAC |
TTCCGGATCC |
TGTCAGGGTG |
TCCCCCATGC |
CTGGAAGAGG |
AGCTGGTGGC |
TGCCAGCCCT |
|
2321 |
GGGGCCCGGC |
ACAGGCCTGG |
GCCTTCCCCT |
TCCCTCAAGC |
CAGGGCTCCT |
CCTCCTGTCG |
TGGGCTCATT |
GTGACCACTG |
|
2401 |
GCCTCTCTAC |
AGCACGGCCT |
GTGGCCTGTT |
CAAGGCAGAA |
CCACGACCCT |
TGACTCCCGG |
GTGGGGAGGT |
GGCCAAGGAT |
|
2481 |
GCTGGAGCTG |
AATCAGACGC |
TGACAGTTCT |
TCAGGCATTT |
CTATTTCACA |
ATCGAATTGA |
ACACATTGGC |
CAAATAAAGT |
|
2561 |
TGAAATTTTA |
CCACCTGTAA |
AAAAAAAAAA |
AAAAAA |
|
|
|
|
|
|
|
|
|
|
|
|
>gi|121256637|ref|NM_000918.3|Prolyl 4-hydroxylase, beta polypeptide (P4HB)
GAGCCTCGAAGTCCGCCGGCCAATCGAAGGCGGGCCCCAGCGGCGCGTGCGCGCCGCGGCCAGCGCGCGCGGGCGGGGGG
GCAGGCGCGCCCCGGACCCAGGATTTATAAAGGCGAGGCCGGGACCGGCGCGCGCTCTCGTCGCCCCCGCTGTCCCGGCG
GCGCCAACCGAAGCGCCCCGCCTGATCCGTGTCCGACATGCTGCGCCGCGCTCTGCTGTGCCTGGCCGTGGCCGCCCTGG
TGCGCGCCGACGCCCCCGAGGAGGAGGACCACGTCCTGGTGCTGCGGAAAAGCAACTTCGCGGAGGCGCTGGCGGCCCAC
AAGTACCTGCTGGTGGAGTTCTATGCCCCTTGGTGTGGCCACTGCAAGGCTCTGGCCCCTGAGTATGCCAAAGCCGCTGG
GAAGCTGAAGGCAGAAGGTTCCGAGATCAGGTTGGCCAAGGTGGACGCCACGGAGGAGTCTGACCTGGCCCAGCAGTACG
GCGTGCGCGGCTATCCCACCATCAAGTTCTTCAGGAATGGAGACACGGCTTCCCCCAAGGAATATACAGCTGGCAGAGAG
GCTGATGACATCGTGAACTGGCTGAAGAAGCGCACGGGCCCGGCTGCCACCACCCTGCCTGACGGCGCAGCTGCAGAGTC
CTTGGTGGAGTCCAGCGAGGTGGCTGTCATCGGCTTCTTCAAGGACGTGGAGTCGGACTCTGCCAAGCAGTTTTTGCAGG
CAGCAGAGGCCATCGATGACATACCATTTGGGATCACTTCCAACAGTGACGTGTTCTCCAAATACCAGCTCGACAAAGAT
GGGGTTGTCCTCTTTAAGAAGTTTGATGAAGGCCGGAACAACTTTGAAGGGGAGGTCACCAAGGAGAACCTGCTGGACTT
TATCAAACACAACCAGCTGCCCCTTGTCATCGAGTTCACCGAGCAGACAGCCCCGAAGATTTTTGGAGGTGAAATCAAGA
CTCACATCCTGCTGTTCTTGCCCAAGAGTGTGTCTGACTATGACGGCAAACTGAGCAACTTCAAAACAGCAGCCGAGAGC
TTCAAGGGCAAGATCCTGTTCATCTTCATCGACAGCGACCACACCGACAACCAGCGCATCCTCGAGTTCTTTGGCCTGAA
GAAGGAAGAGTGCCCGGCCGTGCGCCTCATCACCCTGGAGGAGGAGATGACCAAGTACAAGCCCGAATCGGAGGAGCTGA
CGGCAGAGAGGATCACAGAGTTCTGCCACCGCTTCCTGGAGGGCAAAATCAAGCCCCACCTGATGAGCCAGGAGCTGCCG
GAGGACTGGGACAAGCAGCCTGTCAAGGTGCTTGTTGGGAAGAACTTTGAAGACGTGGCTTTTGATGAGAAAAAAAACGT
CTTTGTGGAGTTCTATGCCCCATGGTGTGGTCACTGCAAACAGTTGGCTCCCATTTGGGATAAACTGGGAGAGACGTACA
AGGACCATGAGAACATCGTCATCGCCAAGATGGACTCGACTGCCAACGAGGTGGAGGCCGTCAAAGTGCACAGCTTCCCC
ACACTCAAGTTCTTTCCTGCCAGTGCCGACAGGACGGTCATTGATTACAACGGGGAACGCACGCTGGATGGTTTTAAGAA
ATTCCTGGAGAGCGGTGGCCAGGATGGGGCAGGGGATGATGACGATCTCGAGGACCTGGAAGAAGCAGAGGAGCCAGACA
TGGAGGAAGACGATGATCAGAAAGCTGTGAAAGATGAACTGTAATACGCAAAGCCAGACCCGGGCGCTGCCGAGACCCCT
CGGGGGCTGCACACCCAGCAGCAGCGCACGCCTCCGAAGCCTGCGGCCTCGCTTGAAGGAGGGCGTCGCCGGAAACCCAG
GGAACCTCTCTGAAGTGACACCTCACCCCTACACACCGTCCGTTCACCCCCGTCTCTTCCTTCTGCTTTTCGGTTTTTGG
AAAGGGATCCATCTCCAGGCAGCCCACCCTGGTGGGGCTTGTTTCCTGAAACCATGATGTACTTTTTCATACATGAGTCT
GTCCAGAGTGCTTGCTACCGTGTTCGGAGTCTCGCTGCCTCCCTCCCGCGGGAGGTTTCTCCTCTTTTTGAAAATTCCGT
CTGTGGGATTTTTAGACATTTTTCGACATCAGGGTATTTGTTCCACCTTGGCCAGGCCTCCTCGGAGAAGCTTGTCCCCC
GTGTGGGAGGGACGGAGCCGGACTGGACATGGTCACTCAGTACCGCCTGCAGTGTCGCCATGACTGATCATGGCTCTTGC
ATTTTTGGGTAAATGGAGACTTCCGGATCCTGTCAGGGTGTCCCCCATGCCTGGAAGAGGAGCTGGTGGCTGCCAGCCCT
GGGGCCCGGCACAGGCCTGGGCCTTCCCCTTCCCTCAAGCCAGGGCTCCTCCTCCTGTCGTGGGCTCATTGTGACCACTG
GCCTCTCTACAGCACGGCCTGTGGCCTGTTCAAGGCAGAACCACGACCCTTGACTCCCGGGTGGGGAGGTGGCCAAGGAT
GCTGGAGCTGAATCAGACGCTGACAGTTCTTCAGGCATTTCTATTTCACAATCGAATTGAACACATTGGCCAAATAAAGT
TGAAATTTTACCACCTGTAAAAAAAAAAAAAAAAAA
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
1
|
Length |
342 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79818202...79818543 (-)
|
Is part of |
P4HB, mRNA isoform 1
(NM_000918.3)
|
Sequence |
Show
|
|
GAGCCTCGAAGTCCGCCGGCCAATCGAAGGCGGGCCCCAGCGGCGCGTGCGCGCCGCGGCCAGCGCGCGCGGGCGGGGGG
GCAGGCGCGCCCCGGACCCAGGATTTATAAAGGCGAGGCCGGGACCGGCGCGCGCTCTCGTCGCCCCCGCTGTCCCGGCG
GCGCCAACCGAAGCGCCCCGCCTGATCCGTGTCCGACATGCTGCGCCGCGCTCTGCTGTGCCTGGCCGTGGCCGCCCTGG
TGCGCGCCGACGCCCCCGAGGAGGAGGACCACGTCCTGGTGCTGCGGAAAAGCAACTTCGCGGAGGCGCTGGCGGCCCAC
AAGTACCTGCTGGTGGAGTTCT
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
2
|
Length |
207 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79817056...79817262 (-)
|
Is part of |
P4HB, mRNA isoform 1
(NM_000918.3)
|
Sequence |
Show
|
|
ATGCCCCTTGGTGTGGCCACTGCAAGGCTCTGGCCCCTGAGTATGCCAAAGCCGCTGGGAAGCTGAAGGCAGAAGGTTCC
GAGATCAGGTTGGCCAAGGTGGACGCCACGGAGGAGTCTGACCTGGCCCAGCAGTACGGCGTGCGCGGCTATCCCACCAT
CAAGTTCTTCAGGAATGGAGACACGGCTTCCCCCAAGGAATATACAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
3
|
Length |
134 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79813328...79813461 (-)
|
Is part of |
P4HB, mRNA isoform 1
(NM_000918.3)
|
Sequence |
Show
|
|
CTGGCAGAGAGGCTGATGACATCGTGAACTGGCTGAAGAAGCGCACGGGCCCGGCTGCCACCACCCTGCCTGACGGCGCA
GCTGCAGAGTCCTTGGTGGAGTCCAGCGAGGTGGCTGTCATCGGCTTCTTCAAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
4
|
Length |
138 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79813017...79813154 (-)
|
Is part of |
P4HB, mRNA isoform 1
(NM_000918.3)
|
Sequence |
Show
|
|
GACGTGGAGTCGGACTCTGCCAAGCAGTTTTTGCAGGCAGCAGAGGCCATCGATGACATACCATTTGGGATCACTTCCAA
CAGTGACGTGTTCTCCAAATACCAGCTCGACAAAGATGGGGTTGTCCTCTTTAAGAAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
5
|
Length |
105 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79805118...79805222 (-)
|
Is part of |
P4HB, mRNA isoform 1
(NM_000918.3)
|
Sequence |
Show
|
|
TTTGATGAAGGCCGGAACAACTTTGAAGGGGAGGTCACCAAGGAGAACCTGCTGGACTTTATCAAACACAACCAGCTGCC
CCTTGTCATCGAGTTCACCGAGCAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
6
|
Length |
126 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79804822...79804947 (-)
|
Is part of |
P4HB, mRNA isoform 1
(NM_000918.3)
|
Sequence |
Show
|
|
ACAGCCCCGAAGATTTTTGGAGGTGAAATCAAGACTCACATCCTGCTGTTCTTGCCCAAGAGTGTGTCTGACTATGACGG
CAAACTGAGCAACTTCAAAACAGCAGCCGAGAGCTTCAAGGGCAAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
7
|
Length |
201 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79804304...79804504 (-)
|
Is part of |
P4HB, mRNA isoform 1
(NM_000918.3)
|
Sequence |
Show
|
|
ATCCTGTTCATCTTCATCGACAGCGACCACACCGACAACCAGCGCATCCTCGAGTTCTTTGGCCTGAAGAAGGAAGAGTG
CCCGGCCGTGCGCCTCATCACCCTGGAGGAGGAGATGACCAAGTACAAGCCCGAATCGGAGGAGCTGACGGCAGAGAGGA
TCACAGAGTTCTGCCACCGCTTCCTGGAGGGCAAAATCAAG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
8
|
Length |
121 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79803746...79803866 (-)
|
Is part of |
P4HB, mRNA isoform 1
(NM_000918.3)
|
Sequence |
Show
|
|
CCCCACCTGATGAGCCAGGAGCTGCCGGAGGACTGGGACAAGCAGCCTGTCAAGGTGCTTGTTGGGAAGAACTTTGAAGA
CGTGGCTTTTGATGAGAAAAAAAACGTCTTTGTGGAGTTCT
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
9
|
Length |
182 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79803436...79803617 (-)
|
Is part of |
P4HB, mRNA isoform 1
(NM_000918.3)
|
Sequence |
Show
|
|
ATGCCCCATGGTGTGGTCACTGCAAACAGTTGGCTCCCATTTGGGATAAACTGGGAGAGACGTACAAGGACCATGAGAAC
ATCGTCATCGCCAAGATGGACTCGACTGCCAACGAGGTGGAGGCCGTCAAAGTGCACAGCTTCCCCACACTCAAGTTCTT
TCCTGCCAGTGCCGACAGGACG
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
10
|
Length |
87 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79803019...79803105 (-)
|
Is part of |
P4HB, mRNA isoform 1
(NM_000918.3)
|
Sequence |
Show
|
|
GTCATTGATTACAACGGGGAACGCACGCTGGATGGTTTTAAGAAATTCCTGGAGAGCGGTGGCCAGGATGGGGCAGGGGA
TGATGAC
|
|
|
|
|
|
|
|
|
|
|
|
|
Exon number |
11
|
Length |
935 nt
|
Location |
Chromosome 17 (NC_000017.10) : 79801033...79801967 (-)
|
Is part of |
P4HB, mRNA isoform 1
(NM_000918.3)
|
Sequence |
Show
|
|
GATCTCGAGGACCTGGAAGAAGCAGAGGAGCCAGACATGGAGGAAGACGATGATCAGAAAGCTGTGAAAGATGAACTGTA
ATACGCAAAGCCAGACCCGGGCGCTGCCGAGACCCCTCGGGGGCTGCACACCCAGCAGCAGCGCACGCCTCCGAAGCCTG
CGGCCTCGCTTGAAGGAGGGCGTCGCCGGAAACCCAGGGAACCTCTCTGAAGTGACACCTCACCCCTACACACCGTCCGT
TCACCCCCGTCTCTTCCTTCTGCTTTTCGGTTTTTGGAAAGGGATCCATCTCCAGGCAGCCCACCCTGGTGGGGCTTGTT
TCCTGAAACCATGATGTACTTTTTCATACATGAGTCTGTCCAGAGTGCTTGCTACCGTGTTCGGAGTCTCGCTGCCTCCC
TCCCGCGGGAGGTTTCTCCTCTTTTTGAAAATTCCGTCTGTGGGATTTTTAGACATTTTTCGACATCAGGGTATTTGTTC
CACCTTGGCCAGGCCTCCTCGGAGAAGCTTGTCCCCCGTGTGGGAGGGACGGAGCCGGACTGGACATGGTCACTCAGTAC
CGCCTGCAGTGTCGCCATGACTGATCATGGCTCTTGCATTTTTGGGTAAATGGAGACTTCCGGATCCTGTCAGGGTGTCC
CCCATGCCTGGAAGAGGAGCTGGTGGCTGCCAGCCCTGGGGCCCGGCACAGGCCTGGGCCTTCCCCTTCCCTCAAGCCAG
GGCTCCTCCTCCTGTCGTGGGCTCATTGTGACCACTGGCCTCTCTACAGCACGGCCTGTGGCCTGTTCAAGGCAGAACCA
CGACCCTTGACTCCCGGGTGGGGAGGTGGCCAAGGATGCTGGAGCTGAATCAGACGCTGACAGTTCTTCAGGCATTTCTA
TTTCACAATCGAATTGAACACATTGGCCAAATAAAGTTGAAATTTTACCACCTGT
|
|
|
|
|
|
|
|
|
|
|
|
|
Primary source |
Uniprot : P07237
|
Name |
Protein disulfide-isomerase
|
Alternative name(s) |
Cellular thyroid hormone-binding protein Prolyl 4-hydroxylase subunit beta p55
|
Synonym(s) |
PDI
|
Organism |
Homo sapiens
|
Length |
508 aa
|
Protein existence |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
General annotation (Comments)
|
top
|
|
|
|
|
|
|
Catalytic activity
|
Catalyzes the rearrangement of -S-S- bonds in proteins.
|
Function
|
This multifunctional protein catalyzes the formation, breakage and rearrangement of disulfide bonds. At the cell surface, seems to act as a reductase that cleaves disulfide bonds of proteins attached to the cell. May therefore cause structural modifications of exofacial proteins. Inside the cell, seems to form/rearrange disulfide bonds of nascent proteins. At high concentrations, functions as a chaperone that inhibits aggregation of misfolded proteins. At low concentrations, facilitates aggregation (anti-chaperone activity). May be involved with other chaperones in the structural modification of the TG precursor in hormone biogenesis. Also acts a structural subunit of various enzymes such as prolyl 4-hydroxylase and microsomal triacylglycerol transfer protein MTTP.
|
Similarity
|
Belongs to the protein disulfide isomerase family.
Contains 2 thioredoxin domains.
|
Subcellular location
|
Endoplasmic reticulum lumen. Melanosome. Cell membrane; Peripheral membrane protein (Potential). Note=Highly abundant. In some cell types, seems to be also secreted or associated with the plasma membrane, where it undergoes constant shedding and replacement from intracellular sources (Probable). Localizes near CD4-enriched regions on lymphoid cell surfaces. Identified by mass spectrometry in melanosome fractions from stage I to stage IV.
|
Subunit
|
Homodimer. Monomers and homotetramers may also occur. Also constitutes the structural subunit of prolyl 4-hydroxylase and of the microsomal triacylglycerol transfer protein MTTP in mammalian cells. Stabilizes both enzymes and retain them in the ER without contributing to the catalytic activity (By similarity). Binds UBQLN1. Binds to CD4, and upon HIV-1 binding to the cell membrane, is part of a P4HB/PDI-CD4-CXCR4-gp120 complex.
|
|
|
|
|
|
|
|
|
|
|
|
|
Biological process
|
peptidyl-proline hydroxylation to 4-hydroxy-L-proline [GO:0018401]
cell redox homeostasis [GO:0045454]
|
Cellular component
|
extracellular region [GO:0005576]
endoplasmic reticulum lumen [GO:0005788]
ER-Golgi intermediate compartment [GO:0005793]
plasma membrane [GO:0005886]
cell surface [GO:0009986]
melanosome [GO:0042470]
|
Molecular function
|
protein disulfide isomerase activity [GO:0003756]
procollagen-proline 4-dioxygenase activity [GO:0004656]
protein binding [GO:0005515]
|
|
|
|
|
|
|
|
|
|
|
|
|
With
|
Uniprot accession
|
IntAct
|
???
|
Q81JM5
|
EBI-395883,EBI-2815808
|
???
|
Q5NH25
|
EBI-395883,EBI-2801721
|
EIF4A2
|
Q14240
|
EBI-395883,EBI-73473
|
TAP1
|
Q03518
|
EBI-395883,EBI-747259
|
TRIP6
|
Q15654
|
EBI-395883,
|
PUF60
|
Q9UHX1
|
EBI-395883,
|
ELF3
|
P78545
|
EBI-395883,
|
PTN
|
P21246
|
EBI-395883,EBI-473725
|
FEZ1
|
Q99689
|
EBI-395883,EBI-396435
|
CD2BP2
|
O95400
|
EBI-395883,
|
CUL2
|
Q13617
|
EBI-395883,
|
|
|
|
|
|
|
|
Alternative product(s)
|
top
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Feature key |
Position
|
Length
|
Description
|
Feature identifier
|
Molecule processing |
|
|
|
|
Signal
|
1 - 17
|
17
|
|
P07237-SIGNAL-1
|
Region |
|
|
|
|
Domain
|
18 - 134
|
117
|
Thioredoxin 1
|
P07237-DOMAIN-18
|
Domain
|
349 - 475
|
127
|
Thioredoxin 2
|
P07237-DOMAIN-349
|
Motif
|
505 - 508
|
4
|
Prevents secretion from ER
|
P07237-MOTIF-505
|
Sites |
|
|
|
|
Active site
|
53 - 53
|
1
|
Nucleophile
|
P07237-ACT_SITE-53
|
Active site
|
56 - 56
|
1
|
Nucleophile
|
P07237-ACT_SITE-56
|
Active site
|
397 - 397
|
1
|
Nucleophile (By similarity)
|
P07237-ACT_SITE-397
|
Active site
|
400 - 400
|
1
|
Nucleophile (By similarity)
|
P07237-ACT_SITE-400
|
Site
|
54 - 54
|
1
|
Contributes to redox potential value
|
P07237-SITE-54
|
Site
|
55 - 55
|
1
|
Contributes to redox potential value
|
P07237-SITE-55
|
Site
|
120 - 120
|
1
|
Lowers pKa of C-terminal Cys of first active site
|
P07237-SITE-120
|
Site
|
398 - 398
|
1
|
Contributes to redox potential value (By similarity)
|
P07237-SITE-398
|
Site
|
399 - 399
|
1
|
Contributes to redox potential value (By similarity)
|
P07237-SITE-399
|
Site
|
461 - 461
|
1
|
Lowers pKa of C-terminal Cys of second active site (By similarity)
|
P07237-SITE-461
|
Amino acid modifications |
|
|
|
|
Disulfide bond
|
53 - 56
|
4
|
Redox-active
|
P07237-DISULFID-53
|
Disulfide bond
|
397 - 400
|
4
|
Redox-active (By similarity)
|
P07237-DISULFID-397
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
MLRRALLCLA |
VAALVRADAP |
EEEDHVLVLR |
KSNFAEALAA |
HKYLLVEFYA |
PWCGHCKALA |
PEYAKAAGKL |
KAEGSEIRLA |
|
81 |
KVDATEESDL |
AQQYGVRGYP |
TIKFFRNGDT |
ASPKEYTAGR |
EADDIVNWLK |
KRTGPAATTL |
PDGAAAESLV |
ESSEVAVIGF |
|
161 |
FKDVESDSAK |
QFLQAAEAID |
DIPFGITSNS |
DVFSKYQLDK |
DGVVLFKKFD |
EGRNNFEGEV |
TKENLLDFIK |
HNQLPLVIEF |
|
241 |
TEQTAPKIFG |
GEIKTHILLF |
LPKSVSDYDG |
KLSNFKTAAE |
SFKGKILFIF |
IDSDHTDNQR |
ILEFFGLKKE |
ECPAVRLITL |
|
321 |
EEEMTKYKPE |
SEELTAERIT |
EFCHRFLEGK |
IKPHLMSQEL |
PEDWDKQPVK |
VLVGKNFEDV |
AFDEKKNVFV |
EFYAPWCGHC |
|
401 |
KQLAPIWDKL |
GETYKDHENI |
VIAKMDSTAN |
EVEAVKVHSF |
PTLKFFPASA |
DRTVIDYNGE |
RTLDGFKKFL |
ESGGQDGAGD |
|
481 |
DDDLEDLEEA |
EEPDMEEDDD |
QKAVKDEL |
|
|
|
|
|
|
|
|
|
|
|
|
>sp|P07237|PDIA1_human Protein disulfide-isomerase
MLRRALLCLAVAALVRADAPEEEDHVLVLRKSNFAEALAAHKYLLVEFYAPWCGHCKALAPEYAKAAGKLKAEGSEIRLA
KVDATEESDLAQQYGVRGYPTIKFFRNGDTASPKEYTAGREADDIVNWLKKRTGPAATTLPDGAAAESLVESSEVAVIGF
FKDVESDSAKQFLQAAEAIDDIPFGITSNSDVFSKYQLDKDGVVLFKKFDEGRNNFEGEVTKENLLDFIKHNQLPLVIEF
TEQTAPKIFGGEIKTHILLFLPKSVSDYDGKLSNFKTAAESFKGKILFIFIDSDHTDNQRILEFFGLKKEECPAVRLITL
EEEMTKYKPESEELTAERITEFCHRFLEGKIKPHLMSQELPEDWDKQPVKVLVGKNFEDVAFDEKKNVFVEFYAPWCGHC
KQLAPIWDKLGETYKDHENIVIAKMDSTANEVEAVKVHSFPTLKFFPASADRTVIDYNGERTLDGFKKFLESGGQDGAGD
DDDLEDLEEAEEPDMEEDDDQKAVKDEL
|
|
|
| |
|
|
|
|
|
|