(CXCR4) chemokine (C-X-C motif) receptor 4 [Homo sapiens]

Gene | Transcript(s) | Exon(s) | Protein(s)

Accession | 1830
Official symbol | CXCR4
Official name | chemokine (C-X-C motif) receptor 4
Gene type | gene with protein product
Organism | Homo sapiens
Location | Chromosome 2 (NC_000002.11) : 136871918...136875724 (-)
Map | 2q21
Length | 3807 nt

NM_001008540.1 | CXCR4, mRNA isoform 1
NM_003467.2 | CXCR4, mRNA isoform 2

Accession | Name | Organism | Length
P61073 | C-X-C chemokine receptor type 4 | Homo sapiens | 352 aa
Q53S69 | Chemokine (C-X-C motif) receptor 4 [submitted name] | Homo sapiens | 352 aa

Synonyms | WHIM; NPYRL; LCR1; LAP3; FB22; NPYY3R; NPYR; NPY3R; LESTR; HSY3RR; HM89; fusin; D2S201E; CD184
Alternative name(s) | seven transmembrane helix receptor; C-X-C chemokine receptor type 4; CD184 antigen; stromal cell-derived factor 1 receptor; lipopolysaccharide-associated protein 3; seven-transmembrane-segment receptor, spleen; leukocyte-derived seven-transmembrane-domain receptor; fusin; neuropeptide Y receptor Y3; chemokine (C-X-C motif), receptor 4 (fusin)

Summary
This gene encodes a CXC chemokine receptor specific for stromal cell-derived factor-1. The protein has 7 transmembrane regions and is located on the cell surface. It acts with the CD4 protein to support HIV entry into cells and is also highly expressed in breast cancer cells. Mutations in this gene have been associated with WHIM (warts, hypogammaglobulinemia, infections, and myelokathexis) syndrome. Alternate transcriptional splice variants, encoding different isoforms, have been characterized. [provided by RefSeq].
>ref|Gene_ID:7852|CXCR4|NC_000002.11:136871918...136875724 (-)
ACTTCAGTTTGTTGGCTGCGGCAGCAGGTAGCAAAGTGACGCCGAGGGCCTGAGTGCTCCAGTAGCCACCGCATCTGGAG
AACCAGCGGTTACCATGGAGGGGATCAGTGTAAGTCCAGTTTCAACCTGCTTTGTCATAAATGTACAAACGTTTGAACTT
AGAGCGCAGCCCCTCTCCGAGCGGGCAGAAGCGGCCAGGACATTGGAGGTACCCGTACTCCAAAAAAGGGTCACCGAAAG
GAGTTTTCTTGACCATGCCTATATAGTGCGGGTGGGTGGGGGGGGAGCAGGATTGGAATCTTTTTCTCTGTGAGTCGAGG
AGAAACGACTGGAAAGAGCGTTCCAGTGGCTGCATGTGTCTCCCCCTTGAGTCCCGCCGCGCGCGGCGGCTTGCACGCTG
TTTGCAAACGTAAGAACATTCTGTGCACAAGTGCAGAGAAGGCGTGCGCGCTGCCTCGGGACTCAGACCACCGGTCTCTT
CCTTGGGGAAGCGGGGATGTCTTGGAGCGAGTTACATTGTCTGAATTTAGAGGCGGAGGGCGGCGTGCCTGGGCTGAGTT
CCCAGGAGGAGATTGCGCCCGCTTTAACTTCGGGGTTAAGCGCCTGGTGACTGTTCTTGACACTGGGTGCGTGTTTGTTA
AACTCTGTGCGGCCGACGGAGCTGTGCCAGTCTCCCAGCACAGTAGGCAGAGGGCGGGAGAGGCGGGTGGACCCACCGCG
CCGATCCTCTGAGGGGATCGAGTGGTGGCAGCAGCTAGGAGTTGATCCGCCCGCGCGCTTTGGGTTTGAGGGGGAAAACC
TTCCCGCCGTCCGAAGCGCGCCTCTTCCCCACGGCCGCGAGTGGGTCCTGCAGTTCGAGAGTTTGGGGTCGTGCAGAGGT
CAGCGGAGTGGTTTGACCTCCCCTTTGACACCGCGCAGCTGCCAGCCCTGAGATTTGCGCTCCGGGGATAGGAGCGGGTA
CGGGGTGAGGGGCGGGGGCGGTTAAGACCGCACCTGGGCTGCCAGGTCGCCGCCGCGAAGACTGGCAGGTGCAAGTGGGG
AAACCGTTTGGCTCTCTCCGAGTCCAGTTGTGATGTTTAACCGTCGGTGGTTTCCAGAAACCTTTTGAAACCCTCTTGCT
AGGGAGTTTTTGGTTTCCTGCAGCGGCGCGCAATTCAAAGACGCTCGCGGCGGAGCCGCCCAGTCGCTCCCCAGCACCCT
GTGGGACAGAGCCTGGCGTGTCGCCCAGCGGAGCCCCTGCAGCGCTGCTTGCGGGCGGTTGGCGTGGGTGTAGTGGGCAG
CCGCGGCGGCCCGGGGCTGGACGACCCGGCCCCCCGCGTGCCCACCGCCTGGAGGCTTCCAGCTGCCCACCTCCGGCCGG
GTTAACTGGATCAGTGGCGGGGTAATGGGAAGCCACCCGGGAGAGTGAGGAAATGAAACTTGGGGCGAGGACCACGGGTG
CAGACCCCGTTACCTTCTCCACCCAGGAAAATGCCCCGCTCCCTAACGTCCCAAACGCGCCAAGTGATAAACACGAGGAT
GGCAAGAGACCCACACACCGGAGGAGCGCCCGCTTGGGGGAGGAGGTGCCGTTTGTTCATTTTCTGACACTCCCGCCCAA
TATACCCCAAGCACCGAAGGGCCTTCGTTTTAAGACCGCATTCTCTTTACCCACTACAAGTTGCTTGAAGCCCAGAATGG
TTTGTATTTAGGCAGGCGTGGGAAAATTAAGTTTTTGCGCTTTAGGAGAATGAGTCTTTGCAACGCCCCCGCCCTCCCCC
CGTGATCCTCCCTTCTCCCCTCTTCCCTCCCTGGGCGAAAAACTTCTTACAAAAAGTTAATCACTGCCCCTCCTAGCAGC
ACCCACCCCACCCCCCACGCCGCCTGGGAGTGGCCTCTTTGTGTGTATTTTTTTTTTCCTCCTAAGGAAGGTTTTTTTTC
TTCCCTCTAGTGGGCGGGGCAGAGGAGTTAGCCAAGATGTGACTTTGAAACCCTCAGCGTCTCAGTGCCCTTTTGTTCTA
AACAAAGAATTTTGTAATTGGTTCTACCAAAGAAGGATATAATGAAGTCACTATGGGAAAAGATGGGGAGGAGAGTTGTA
GGATTCTACATTAATTCTCTTGTGCCCTTAGCCCACTACTTCAGAATTTCCTGAAGAAAGCAAGCCTGAATTGGTTTTTT
AAATTGCTTTAAAAATTTTTTTTAACTGGGTTAATGCTTGCTGAATTGGAAGTGAATGTCCATTCCTTTGCCTCTTTTGC
AGATATACACTTCAGATAACTACACCGAGGAAATGGGCTCAGGGGACTATGACTCCATGAAGGAACCCTGTTTCCGTGAA
GAAAATGCTAATTTCAATAAAATCTTCCTGCCCACCATCTACTCCATCATCTTCTTAACTGGCATTGTGGGCAATGGATT
GGTCATCCTGGTCATGGGTTACCAGAAGAAACTGAGAAGCATGACGGACAAGTACAGGCTGCACCTGTCAGTGGCCGACC
TCCTCTTTGTCATCACGCTTCCCTTCTGGGCAGTTGATGCCGTGGCAAACTGGTACTTTGGGAACTTCCTATGCAAGGCA
GTCCATGTCATCTACACAGTCAACCTCTACAGCAGTGTCCTCATCCTGGCCTTCATCAGTCTGGACCGCTACCTGGCCAT
CGTCCACGCCACCAACAGTCAGAGGCCAAGGAAGCTGTTGGCTGAAAAGGTGGTCTATGTTGGCGTCTGGATCCCTGCCC
TCCTGCTGACTATTCCCGACTTCATCTTTGCCAACGTCAGTGAGGCAGATGACAGATATATCTGTGACCGCTTCTACCCC
AATGACTTGTGGGTGGTTGTGTTCCAGTTTCAGCACATCATGGTTGGCCTTATCCTGCCTGGTATTGTCATCCTGTCCTG
CTATTGCATTATCATCTCCAAGCTGTCACACTCCAAGGGCCACCAGAAGCGCAAGGCCCTCAAGACCACAGTCATCCTCA
TCCTGGCTTTCTTCGCCTGTTGGCTGCCTTACTACATTGGGATCAGCATCGACTCCTTCATCCTCCTGGAAATCATCAAG
CAAGGGTGTGAGTTTGAGAACACTGTGCACAAGTGGATTTCCATCACCGAGGCCCTAGCTTTCTTCCACTGTTGTCTGAA
CCCCATCCTCTATGCTTTCCTTGGAGCCAAATTTAAAACCTCTGCCCAGCACGCACTCACCTCTGTGAGCAGAGGGTCCA
GCCTCAAGATCCTCTCCAAAGGAAAGCGAGGTGGACATTCATCTGTTTCCACTGAGTCTGAGTCTTCAAGTTTTCACTCC
AGCTAACACAGATGTAAAAGACTTTTTTTTATACGATAAATAACTTTTTTTTAAGTTACACATTTTTCAGATATAAAAGA
CTGACCAATATTGTACAGTTTTTATTGCTTGTTGGATTTTTGTCTTGTGTTTCTTTAGTTTTTGTGAAGTTTAATTGACT
TATTTATATAAATTTTTTTTGTTTCATATTGATGTGTGTCTAGGCAGGACCTGTGGCCAAGTTCTTAGTTGCTGTATGTC
TCGTGGTAGGACTGTAGAAAAGGGAACTGAACATTCCAGAGCGTGTAGTGAATCACGTAAAGCTAGAAATGATCCCCAGC
TGTTTATGCATAGATAATCTCTCCATTCCCGTGGAACGTTTTTCCTGTTCTTAAGACGTGATTTTGCTGTAGAAGATGGC
ACTTATAACCAAAGCCCAAAGTGGTATAGAAATGCTGGTTTTTCAGTTTTCAGGAGTGGGTTGATTTCAGCACCTACAGT
GTACAGTCTTGTATTAAGTTGTTAATAAAAGTACATGTTAAACTTAC
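
The FASTA header above carries the same genomic interval as the Location field, and the Length field (3807 nt) follows from it. A minimal Python sketch, assuming the header keeps the `accession:start...end (strand)` layout shown here; the names `parse_location` and `span_length` are illustrative and not part of any database API:

```python
import re

# Header copied verbatim from the FASTA record above.
HEADER = ">ref|Gene_ID:7852|CXCR4|NC_000002.11:136871918...136875724 (-)"

def parse_location(header: str) -> tuple[str, int, int, str]:
    """Extract (accession, start, end, strand) from the location field."""
    match = re.search(r"([A-Z]{2}_\d+\.\d+):(\d+)\.\.\.(\d+) \(([+-])\)", header)
    if match is None:
        raise ValueError("no 'accession:start...end (strand)' field found")
    accession, start, end, strand = match.groups()
    return accession, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1

accession, start, end, strand = parse_location(HEADER)
print(accession, strand, span_length(start, end))  # NC_000002.11 - 3807
```

The interval is treated as 1-based and inclusive, which is why one is added to the difference of the coordinates.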

Primary source | RefSeq : NM_001008540.1 (GI:56790926)
Name | Chemokine (C-X-C motif) receptor 4 (CXCR4), transcript variant 1
Type of transcript | mRNA
Organism | Homo sapiens
Length | 1912 nt
Map | 2q21
Location | Chromosome 2 (NC_000002.11) strand : -

Exon(s) | Start | End | Length | is coding
Exon 2a | 1 | 1895 | 1895 | 1

Primary source | NCBI : CCDS33295
Nucleotide | CXCR4, mRNA isoform 1[NM_001008540.1] : 305...1375
Length | 1071
Location | Chromosome 2 (NC_000002.11) strand : -
Start codon | 1
Translation | NP_001008540.1

>gi|56790926|ref|NM_001008540.1|Chemokine (C-X-C motif) receptor 4 (CXCR4), transcript variant 1
TTTTTTTTCTTCCCTCTAGTGGGCGGGGCAGAGGAGTTAGCCAAGATGTGACTTTGAAACCCTCAGCGTCTCAGTGCCCT
TTTGTTCTAAACAAAGAATTTTGTAATTGGTTCTACCAAAGAAGGATATAATGAAGTCACTATGGGAAAAGATGGGGAGG
AGAGTTGTAGGATTCTACATTAATTCTCTTGTGCCCTTAGCCCACTACTTCAGAATTTCCTGAAGAAAGCAAGCCTGAAT
TGGTTTTTTAAATTGCTTTAAAAATTTTTTTTAACTGGGTTAATGCTTGCTGAATTGGAAGTGAATGTCCATTCCTTTGC
CTCTTTTGCAGATATACACTTCAGATAACTACACCGAGGAAATGGGCTCAGGGGACTATGACTCCATGAAGGAACCCTGT
TTCCGTGAAGAAAATGCTAATTTCAATAAAATCTTCCTGCCCACCATCTACTCCATCATCTTCTTAACTGGCATTGTGGG
CAATGGATTGGTCATCCTGGTCATGGGTTACCAGAAGAAACTGAGAAGCATGACGGACAAGTACAGGCTGCACCTGTCAG
TGGCCGACCTCCTCTTTGTCATCACGCTTCCCTTCTGGGCAGTTGATGCCGTGGCAAACTGGTACTTTGGGAACTTCCTA
TGCAAGGCAGTCCATGTCATCTACACAGTCAACCTCTACAGCAGTGTCCTCATCCTGGCCTTCATCAGTCTGGACCGCTA
CCTGGCCATCGTCCACGCCACCAACAGTCAGAGGCCAAGGAAGCTGTTGGCTGAAAAGGTGGTCTATGTTGGCGTCTGGA
TCCCTGCCCTCCTGCTGACTATTCCCGACTTCATCTTTGCCAACGTCAGTGAGGCAGATGACAGATATATCTGTGACCGC
TTCTACCCCAATGACTTGTGGGTGGTTGTGTTCCAGTTTCAGCACATCATGGTTGGCCTTATCCTGCCTGGTATTGTCAT
CCTGTCCTGCTATTGCATTATCATCTCCAAGCTGTCACACTCCAAGGGCCACCAGAAGCGCAAGGCCCTCAAGACCACAG
TCATCCTCATCCTGGCTTTCTTCGCCTGTTGGCTGCCTTACTACATTGGGATCAGCATCGACTCCTTCATCCTCCTGGAA
ATCATCAAGCAAGGGTGTGAGTTTGAGAACACTGTGCACAAGTGGATTTCCATCACCGAGGCCCTAGCTTTCTTCCACTG
TTGTCTGAACCCCATCCTCTATGCTTTCCTTGGAGCCAAATTTAAAACCTCTGCCCAGCACGCACTCACCTCTGTGAGCA
GAGGGTCCAGCCTCAAGATCCTCTCCAAAGGAAAGCGAGGTGGACATTCATCTGTTTCCACTGAGTCTGAGTCTTCAAGT
TTTCACTCCAGCTAACACAGATGTAAAAGACTTTTTTTTATACGATAAATAACTTTTTTTTAAGTTACACATTTTTCAGA
TATAAAAGACTGACCAATATTGTACAGTTTTTATTGCTTGTTGGATTTTTGTCTTGTGTTTCTTTAGTTTTTGTGAAGTT
TAATTGACTTATTTATATAAATTTTTTTTGTTTCATATTGATGTGTGTCTAGGCAGGACCTGTGGCCAAGTTCTTAGTTG
CTGTATGTCTCGTGGTAGGACTGTAGAAAAGGGAACTGAACATTCCAGAGCGTGTAGTGAATCACGTAAAGCTAGAAATG
ATCCCCAGCTGTTTATGCATAGATAATCTCTCCATTCCCGTGGAACGTTTTTCCTGTTCTTAAGACGTGATTTTGCTGTA
GAAGATGGCACTTATAACCAAAGCCCAAAGTGGTATAGAAATGCTGGTTTTTCAGTTTTCAGGAGTGGGTTGATTTCAGC
ACCTACAGTGTACAGTCTTGTATTAAGTTGTTAATAAAAGTACATGTTAAACTTAAAAAAAAAAAAAAAAAA
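
The coding spans quoted for the two transcripts (305...1375 on NM_001008540.1 and 96...1154 on NM_003467.2) can be turned into expected protein lengths. The sketch below assumes the annotated span includes the stop codon, which is the usual RefSeq convention but is not stated in this record; `protein_length` is a hypothetical helper name.

```python
def protein_length(cds_start: int, cds_end: int) -> int:
    """Residue count for a 1-based, inclusive CDS span that ends with a stop codon."""
    nucleotides = cds_end - cds_start + 1
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon is the stop codon and contributes no residue (assumption).
    return nucleotides // 3 - 1

# Spans copied from the Nucleotide fields of the two transcripts in this record.
CDS_SPANS = {
    "NM_001008540.1": (305, 1375),
    "NM_003467.2": (96, 1154),
}

for accession, (start, end) in CDS_SPANS.items():
    print(accession, end - start + 1, protein_length(start, end))
```

Under that assumption the 1071 nt span corresponds to 356 residues and the 1059 nt span to 352 residues, the latter matching the 352 aa listed for P61073.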

Primary source | RefSeq : NM_003467.2 (GI:56790928)
Name | Chemokine (C-X-C motif) receptor 4 (CXCR4), transcript variant 2
Type of transcript | mRNA
Organism | Homo sapiens
Length | 1691 nt
Map | 2q21
Location | Chromosome 2 (NC_000002.11) strand : - ; 136871918...136873481, 136875615...136875724

Exon(s) | Start | End | Length | is coding
Exon 1 | 1 | 110 | 110 | 1
Exon 2b | 111 | 1674 | 1564 | 1

ID | Class | Location | Mutation | Length | is synonymous | Source

Primary source | NCBI : CCDS46420
Nucleotide | CXCR4, mRNA isoform 2[NM_003467.2] : 96...1154
Length | 1059
Location | Chromosome 2 (NC_000002.11) strand : - ; 136872438...136873481, 136875615...136875629
Start codon | 1
Translation | NP_003458.1

>gi|56790928|ref|NM_003467.2|Chemokine (C-X-C motif) receptor 4 (CXCR4), transcript variant 2
AACTTCAGTTTGTTGGCTGCGGCAGCAGGTAGCAAAGTGACGCCGAGGGCCTGAGTGCTCCAGTAGCCACCGCATCTGGA
GAACCAGCGGTTACCATGGAGGGGATCAGTATATACACTTCAGATAACTACACCGAGGAAATGGGCTCAGGGGACTATGA
CTCCATGAAGGAACCCTGTTTCCGTGAAGAAAATGCTAATTTCAATAAAATCTTCCTGCCCACCATCTACTCCATCATCT
TCTTAACTGGCATTGTGGGCAATGGATTGGTCATCCTGGTCATGGGTTACCAGAAGAAACTGAGAAGCATGACGGACAAG
TACAGGCTGCACCTGTCAGTGGCCGACCTCCTCTTTGTCATCACGCTTCCCTTCTGGGCAGTTGATGCCGTGGCAAACTG
GTACTTTGGGAACTTCCTATGCAAGGCAGTCCATGTCATCTACACAGTCAACCTCTACAGCAGTGTCCTCATCCTGGCCT
TCATCAGTCTGGACCGCTACCTGGCCATCGTCCACGCCACCAACAGTCAGAGGCCAAGGAAGCTGTTGGCTGAAAAGGTG
GTCTATGTTGGCGTCTGGATCCCTGCCCTCCTGCTGACTATTCCCGACTTCATCTTTGCCAACGTCAGTGAGGCAGATGA
CAGATATATCTGTGACCGCTTCTACCCCAATGACTTGTGGGTGGTTGTGTTCCAGTTTCAGCACATCATGGTTGGCCTTA
TCCTGCCTGGTATTGTCATCCTGTCCTGCTATTGCATTATCATCTCCAAGCTGTCACACTCCAAGGGCCACCAGAAGCGC
AAGGCCCTCAAGACCACAGTCATCCTCATCCTGGCTTTCTTCGCCTGTTGGCTGCCTTACTACATTGGGATCAGCATCGA
CTCCTTCATCCTCCTGGAAATCATCAAGCAAGGGTGTGAGTTTGAGAACACTGTGCACAAGTGGATTTCCATCACCGAGG
CCCTAGCTTTCTTCCACTGTTGTCTGAACCCCATCCTCTATGCTTTCCTTGGAGCCAAATTTAAAACCTCTGCCCAGCAC
GCACTCACCTCTGTGAGCAGAGGGTCCAGCCTCAAGATCCTCTCCAAAGGAAAGCGAGGTGGACATTCATCTGTTTCCAC
TGAGTCTGAGTCTTCAAGTTTTCACTCCAGCTAACACAGATGTAAAAGACTTTTTTTTATACGATAAATAACTTTTTTTT
AAGTTACACATTTTTCAGATATAAAAGACTGACCAATATTGTACAGTTTTTATTGCTTGTTGGATTTTTGTCTTGTGTTT
CTTTAGTTTTTGTGAAGTTTAATTGACTTATTTATATAAATTTTTTTTGTTTCATATTGATGTGTGTCTAGGCAGGACCT
GTGGCCAAGTTCTTAGTTGCTGTATGTCTCGTGGTAGGACTGTAGAAAAGGGAACTGAACATTCCAGAGCGTGTAGTGAA
TCACGTAAAGCTAGAAATGATCCCCAGCTGTTTATGCATAGATAATCTCTCCATTCCCGTGGAACGTTTTTCCTGTTCTT
AAGACGTGATTTTGCTGTAGAAGATGGCACTTATAACCAAAGCCCAAAGTGGTATAGAAATGCTGGTTTTTCAGTTTTCA
GGAGTGGGTTGATTTCAGCACCTACAGTGTACAGTCTTGTATTAAGTTGTTAATAAAAGTACATGTTAAACTTAAAAAAA
AAAAAAAAAAA
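
The exon table for transcript variant 2 can be derived from the genomic intervals in its Location field. On the minus strand the exon with the highest genomic coordinates comes first in the transcript. This is a small sketch with an illustrative function name (`transcript_exons`), not the database's own procedure:

```python
def transcript_exons(genomic_exons, strand):
    """Map 1-based inclusive genomic exon intervals to transcript coordinates.

    Returns a list of (transcript_start, transcript_end, length) in 5'->3' order.
    """
    # On the minus strand, transcription runs from high to low coordinates.
    ordered = sorted(genomic_exons, reverse=(strand == "-"))
    result = []
    position = 1
    for g_start, g_end in ordered:
        length = g_end - g_start + 1
        result.append((position, position + length - 1, length))
        position += length
    return result

# Intervals copied from the Location fields of NM_003467.2 in this record.
EXONS = [(136871918, 136873481), (136875615, 136875724)]
CDS_PARTS = [(136872438, 136873481), (136875615, 136875629)]

print(transcript_exons(EXONS, "-"))
print(sum(length for _, _, length in transcript_exons(CDS_PARTS, "-")))
```

The exons account for 1674 nt of the 1691 nt transcript; the remaining 17 nt match the run of A residues at the end of the FASTA record, presumably the poly(A) tail.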

Exon number | 1
Length | 110 nt
Location | Chromosome 2 (NC_000002.11) : 136875615...136875724 (-)
Is part of | CXCR4, mRNA isoform 2 (NM_003467.2)
Sequence |
AACTTCAGTTTGTTGGCTGCGGCAGCAGGTAGCAAAGTGACGCCGAGGGCCTGAGTGCTCCAGTAGCCACCGCATCTGGA
GAACCAGCGGTTACCATGGAGGGGATCAGT
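
The coding region of NM_003467.2 is listed as starting at transcript position 96, which falls inside this exon. Translating the exon from that position should therefore give the first residues of the protein. A minimal sketch using the standard genetic code; the helper `translate` and the constant names are illustrative:

```python
# Exon 1 of NM_003467.2, copied from the record above (110 nt).
EXON_1 = (
    "AACTTCAGTTTGTTGGCTGCGGCAGCAGGTAGCAAAGTGACGCCGAGGGCCTGAGTGCTCCAGTAGCCACCGCATCTGGA"
    "GAACCAGCGGTTACCATGGAGGGGATCAGT"
)

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate complete codons from the start of dna; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

CDS_START = 96  # 1-based transcript position from the Nucleotide field
coding_part = EXON_1[CDS_START - 1:]
print(len(EXON_1), coding_part, translate(coding_part))
```

The 15 coding nucleotides in this exon translate to MEGIS, the N-terminal residues given for the canonical sequence in the alternative-products table of this record.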

Exon number | 2a
Length | 1895 nt
Location | Chromosome 2 (NC_000002.11) : 136871918...136873812 (-)
Is part of | CXCR4, mRNA isoform 1 (NM_001008540.1)
Sequence |
TTTTTTTTCTTCCCTCTAGTGGGCGGGGCAGAGGAGTTAGCCAAGATGTGACTTTGAAACCCTCAGCGTCTCAGTGCCCT
TTTGTTCTAAACAAAGAATTTTGTAATTGGTTCTACCAAAGAAGGATATAATGAAGTCACTATGGGAAAAGATGGGGAGG
AGAGTTGTAGGATTCTACATTAATTCTCTTGTGCCCTTAGCCCACTACTTCAGAATTTCCTGAAGAAAGCAAGCCTGAAT
TGGTTTTTTAAATTGCTTTAAAAATTTTTTTTAACTGGGTTAATGCTTGCTGAATTGGAAGTGAATGTCCATTCCTTTGC
CTCTTTTGCAGATATACACTTCAGATAACTACACCGAGGAAATGGGCTCAGGGGACTATGACTCCATGAAGGAACCCTGT
TTCCGTGAAGAAAATGCTAATTTCAATAAAATCTTCCTGCCCACCATCTACTCCATCATCTTCTTAACTGGCATTGTGGG
CAATGGATTGGTCATCCTGGTCATGGGTTACCAGAAGAAACTGAGAAGCATGACGGACAAGTACAGGCTGCACCTGTCAG
TGGCCGACCTCCTCTTTGTCATCACGCTTCCCTTCTGGGCAGTTGATGCCGTGGCAAACTGGTACTTTGGGAACTTCCTA
TGCAAGGCAGTCCATGTCATCTACACAGTCAACCTCTACAGCAGTGTCCTCATCCTGGCCTTCATCAGTCTGGACCGCTA
CCTGGCCATCGTCCACGCCACCAACAGTCAGAGGCCAAGGAAGCTGTTGGCTGAAAAGGTGGTCTATGTTGGCGTCTGGA
TCCCTGCCCTCCTGCTGACTATTCCCGACTTCATCTTTGCCAACGTCAGTGAGGCAGATGACAGATATATCTGTGACCGC
TTCTACCCCAATGACTTGTGGGTGGTTGTGTTCCAGTTTCAGCACATCATGGTTGGCCTTATCCTGCCTGGTATTGTCAT
CCTGTCCTGCTATTGCATTATCATCTCCAAGCTGTCACACTCCAAGGGCCACCAGAAGCGCAAGGCCCTCAAGACCACAG
TCATCCTCATCCTGGCTTTCTTCGCCTGTTGGCTGCCTTACTACATTGGGATCAGCATCGACTCCTTCATCCTCCTGGAA
ATCATCAAGCAAGGGTGTGAGTTTGAGAACACTGTGCACAAGTGGATTTCCATCACCGAGGCCCTAGCTTTCTTCCACTG
TTGTCTGAACCCCATCCTCTATGCTTTCCTTGGAGCCAAATTTAAAACCTCTGCCCAGCACGCACTCACCTCTGTGAGCA
GAGGGTCCAGCCTCAAGATCCTCTCCAAAGGAAAGCGAGGTGGACATTCATCTGTTTCCACTGAGTCTGAGTCTTCAAGT
TTTCACTCCAGCTAACACAGATGTAAAAGACTTTTTTTTATACGATAAATAACTTTTTTTTAAGTTACACATTTTTCAGA
TATAAAAGACTGACCAATATTGTACAGTTTTTATTGCTTGTTGGATTTTTGTCTTGTGTTTCTTTAGTTTTTGTGAAGTT
TAATTGACTTATTTATATAAATTTTTTTTGTTTCATATTGATGTGTGTCTAGGCAGGACCTGTGGCCAAGTTCTTAGTTG
CTGTATGTCTCGTGGTAGGACTGTAGAAAAGGGAACTGAACATTCCAGAGCGTGTAGTGAATCACGTAAAGCTAGAAATG
ATCCCCAGCTGTTTATGCATAGATAATCTCTCCATTCCCGTGGAACGTTTTTCCTGTTCTTAAGACGTGATTTTGCTGTA
GAAGATGGCACTTATAACCAAAGCCCAAAGTGGTATAGAAATGCTGGTTTTTCAGTTTTCAGGAGTGGGTTGATTTCAGC
ACCTACAGTGTACAGTCTTGTATTAAGTTGTTAATAAAAGTACATGTTAAACTTA

Exon number | 2b
Length | 1564 nt
Location | Chromosome 2 (NC_000002.11) : 136871918...136873481 (-)
Is part of | CXCR4, mRNA isoform 2 (NM_003467.2)
Sequence |
ATATACACTTCAGATAACTACACCGAGGAAATGGGCTCAGGGGACTATGACTCCATGAAGGAACCCTGTTTCCGTGAAGA
AAATGCTAATTTCAATAAAATCTTCCTGCCCACCATCTACTCCATCATCTTCTTAACTGGCATTGTGGGCAATGGATTGG
TCATCCTGGTCATGGGTTACCAGAAGAAACTGAGAAGCATGACGGACAAGTACAGGCTGCACCTGTCAGTGGCCGACCTC
CTCTTTGTCATCACGCTTCCCTTCTGGGCAGTTGATGCCGTGGCAAACTGGTACTTTGGGAACTTCCTATGCAAGGCAGT
CCATGTCATCTACACAGTCAACCTCTACAGCAGTGTCCTCATCCTGGCCTTCATCAGTCTGGACCGCTACCTGGCCATCG
TCCACGCCACCAACAGTCAGAGGCCAAGGAAGCTGTTGGCTGAAAAGGTGGTCTATGTTGGCGTCTGGATCCCTGCCCTC
CTGCTGACTATTCCCGACTTCATCTTTGCCAACGTCAGTGAGGCAGATGACAGATATATCTGTGACCGCTTCTACCCCAA
TGACTTGTGGGTGGTTGTGTTCCAGTTTCAGCACATCATGGTTGGCCTTATCCTGCCTGGTATTGTCATCCTGTCCTGCT
ATTGCATTATCATCTCCAAGCTGTCACACTCCAAGGGCCACCAGAAGCGCAAGGCCCTCAAGACCACAGTCATCCTCATC
CTGGCTTTCTTCGCCTGTTGGCTGCCTTACTACATTGGGATCAGCATCGACTCCTTCATCCTCCTGGAAATCATCAAGCA
AGGGTGTGAGTTTGAGAACACTGTGCACAAGTGGATTTCCATCACCGAGGCCCTAGCTTTCTTCCACTGTTGTCTGAACC
CCATCCTCTATGCTTTCCTTGGAGCCAAATTTAAAACCTCTGCCCAGCACGCACTCACCTCTGTGAGCAGAGGGTCCAGC
CTCAAGATCCTCTCCAAAGGAAAGCGAGGTGGACATTCATCTGTTTCCACTGAGTCTGAGTCTTCAAGTTTTCACTCCAG
CTAACACAGATGTAAAAGACTTTTTTTTATACGATAAATAACTTTTTTTTAAGTTACACATTTTTCAGATATAAAAGACT
GACCAATATTGTACAGTTTTTATTGCTTGTTGGATTTTTGTCTTGTGTTTCTTTAGTTTTTGTGAAGTTTAATTGACTTA
TTTATATAAATTTTTTTTGTTTCATATTGATGTGTGTCTAGGCAGGACCTGTGGCCAAGTTCTTAGTTGCTGTATGTCTC
GTGGTAGGACTGTAGAAAAGGGAACTGAACATTCCAGAGCGTGTAGTGAATCACGTAAAGCTAGAAATGATCCCCAGCTG
TTTATGCATAGATAATCTCTCCATTCCCGTGGAACGTTTTTCCTGTTCTTAAGACGTGATTTTGCTGTAGAAGATGGCAC
TTATAACCAAAGCCCAAAGTGGTATAGAAATGCTGGTTTTTCAGTTTTCAGGAGTGGGTTGATTTCAGCACCTACAGTGT
ACAGTCTTGTATTAAGTTGTTAATAAAAGTACATGTTAAACTTA

Primary source | UniProt : P61073
Name | C-X-C chemokine receptor type 4
Alternative name(s) | FB22; Fusin; HM89; LCR1; Leukocyte-derived seven transmembrane domain receptor; NPYRL; Stromal cell-derived factor 1 receptor
Synonym(s) | CXC-R4; CXCR-4; LESTR; SDF-1 receptor
Organism | Homo sapiens
Length | 352 aa
Protein existence | ---

General annotation (Comments)

Disease
Defects in CXCR4 are a cause of WHIM syndrome [MIM:193670], also known as warts, hypogammaglobulinemia, infections and myelokathexis. WHIM syndrome is an immunodeficiency disease characterized by neutropenia, hypogammaglobulinemia and extensive human papillomavirus (HPV) infection. Despite the peripheral neutropenia, bone marrow aspirates from affected individuals contain abundant mature myeloid cells, a condition termed myelokathexis.

Domain
The amino-terminus is critical for ligand binding. Residues in all four extracellular regions contribute to HIV-1 coreceptor activity.

Function
Receptor for the C-X-C chemokine CXCL12/SDF-1. Transduces a signal by increasing the intracellular calcium ion level. Involved in haematopoiesis and in cardiac ventricular septum formation. Also plays an essential role in vascularization of the gastrointestinal tract, probably by regulating vascular branching and/or remodeling processes in endothelial cells. Could be involved in cerebellar development. In the CNS, could mediate hippocampal-neuron survival. Acts as a coreceptor (CD4 being the primary receptor) for HIV-1 X4 isolates and as a primary receptor for some HIV-2 isolates. Promotes Env-mediated fusion of the virus.

PTM
Sulfation on Tyr-21 is required for efficient binding of CXCL12/SDF-1alpha and promotes its dimerization.
O- and N-glycosylated. Asn-11 is the principal site of N-glycosylation. There appears to be very little or no glycosylation on Asn-176. N-glycosylation masks coreceptor function in both X4 and R5 laboratory-adapted and primary HIV-1 strains by inhibiting interaction with their Env glycoproteins. The O-glycosylation (chondroitin sulfate attachment) affects neither the interaction with CXCL12/SDF-1alpha nor the coreceptor activity.

Similarity
Belongs to the G-protein coupled receptor 1 family.

Subcellular location
Cell membrane; Multi-pass membrane protein.

Subunit
Monomer. Can form dimers. Interacts with CD164. Interacts with HIV-1 surface protein gp120 and Tat. Interacts with ARRB2.

Tissue specificity
Expressed in numerous tissues, such as peripheral blood leukocytes, spleen, thymus, spinal cord, heart, placenta, lung, liver, skeletal muscle, kidney, pancreas, cerebellum, cerebral cortex and medulla (in microglia as well as in astrocytes), brain microvascular, coronary artery and umbilical cord endothelial cells. Isoform 1 is predominant in all tissues tested.

Biological process
activation of MAPK activity [GO:0000187]
response to hypoxia [GO:0001666]
apoptosis [GO:0006915]
inflammatory response [GO:0006954]
G-protein coupled receptor protein signaling pathway [GO:0007186]
elevation of cytosolic calcium ion concentration [GO:0007204]
response to virus [GO:0009615]
initiation of viral infection [GO:0019059]
interspecies interaction between organisms [GO:0044419]
regulation of chemotaxis [GO:0050920]

Cellular component
plasma membrane [GO:0005886]
cell surface [GO:0009986]
integral to membrane [GO:0016021]
cytoplasmic membrane-bounded vesicle [GO:0016023]
cell leading edge [GO:0031252]

Molecular function
actin binding [GO:0003779]
coreceptor activity [GO:0015026]
C-C chemokine receptor activity [GO:0016493]
C-X-C chemokine receptor activity [GO:0016494]
myosin light chain binding [GO:0032027]

Alternative product(s)

Isoform | UniProt Identifier | Difference(s) with the 'canonical' sequence | Notes
Isoform 1 | P61073-1, P30991-1 | --- | 'Canonical' sequence
Isoform 2 | P61073-2, P30991-2 | 1-5 : MEGIS -> MSIPLPLLQ | ---
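
The isoform 2 entry replaces residues 1-5 of the canonical sequence (MEGIS) with MSIPLPLLQ. The sketch below applies such a replacement to a sequence fragment and computes the resulting length. Only the five N-terminal residues are used because the full protein sequence is not part of this record; the function names are illustrative.

```python
def apply_alternative_sequence(sequence: str, start: int, end: int,
                               original: str, replacement: str) -> str:
    """Replace residues start..end (1-based, inclusive) after checking the original residues."""
    if sequence[start - 1:end] != original:
        raise ValueError("sequence does not carry the expected original residues")
    return sequence[:start - 1] + replacement + sequence[end:]

def variant_length(canonical_length: int, start: int, end: int, replacement: str) -> int:
    """Length of the isoform after swapping residues start..end for replacement."""
    return canonical_length - (end - start + 1) + len(replacement)

# N-terminal fragment of the canonical sequence, taken from the table above.
n_terminus = "MEGIS"
print(apply_alternative_sequence(n_terminus, 1, 5, "MEGIS", "MSIPLPLLQ"))
print(variant_length(352, 1, 5, "MSIPLPLLQ"))
```

With the canonical length of 352 aa, the replacement gives a 356-residue isoform. That is the length implied by the 1071 nt coding span of NM_001008540.1 if the span includes a stop codon, which suggests that the transcript numbering and the UniProt isoform numbering in this record do not line up one-to-one.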

Feature key | Position | Length | Description | Feature identifier

Region
Motif | 133 - 135 | 3 | Important for signaling | P61073-MOTIF-133
Topological domain | 1 - 39 | 39 | Extracellular (Potential) | P61073-TOPO_DOM-1
Topological domain | 64 - 79 | 16 | Cytoplasmic (Potential) | P61073-TOPO_DOM-64
Topological domain | 100 - 110 | 11 | Extracellular (Potential) | P61073-TOPO_DOM-100
Topological domain | 133 - 154 | 22 | Cytoplasmic (Potential) | P61073-TOPO_DOM-133
Topological domain | 176 - 200 | 25 | Extracellular (Potential) | P61073-TOPO_DOM-176
Topological domain | 221 - 240 | 20 | Cytoplasmic (Potential) | P61073-TOPO_DOM-221
Topological domain | 262 - 285 | 24 | Extracellular (Potential) | P61073-TOPO_DOM-262
Topological domain | 306 - 352 | 47 | Cytoplasmic (Potential) | P61073-TOPO_DOM-306
Transmembrane | 40 - 63 | 24 | Helical; Name=1; (Potential) | P61073-TRANSMEM-40
Transmembrane | 80 - 99 | 20 | Helical; Name=2; (Potential) | P61073-TRANSMEM-80
Transmembrane | 111 - 132 | 22 | Helical; Name=3; (Potential) | P61073-TRANSMEM-111
Transmembrane | 155 - 175 | 21 | Helical; Name=4; (Potential) | P61073-TRANSMEM-155
Transmembrane | 201 - 220 | 20 | Helical; Name=5; (Potential) | P61073-TRANSMEM-201
Transmembrane | 241 - 261 | 21 | Helical; Name=6; (Potential) | P61073-TRANSMEM-241
Transmembrane | 286 - 305 | 20 | Helical; Name=7; (Potential) | P61073-TRANSMEM-286
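
The topological domains and transmembrane helices listed under Region should cover the 352-residue protein without gaps or overlaps, with the non-membrane segments alternating between the extracellular and cytoplasmic sides. A small consistency check, with segment labels shortened for illustration and hypothetical function names:

```python
# (start, end, kind) copied from the Region rows above; 1-based, inclusive.
SEGMENTS = [
    (1, 39, "extracellular"),
    (40, 63, "transmembrane"),
    (64, 79, "cytoplasmic"),
    (80, 99, "transmembrane"),
    (100, 110, "extracellular"),
    (111, 132, "transmembrane"),
    (133, 154, "cytoplasmic"),
    (155, 175, "transmembrane"),
    (176, 200, "extracellular"),
    (201, 220, "transmembrane"),
    (221, 240, "cytoplasmic"),
    (241, 261, "transmembrane"),
    (262, 285, "extracellular"),
    (286, 305, "transmembrane"),
    (306, 352, "cytoplasmic"),
]

def tiles_protein(segments, protein_length):
    """True if the segments cover 1..protein_length exactly once."""
    next_start = 1
    for start, end, _ in sorted(segments):
        if start != next_start or end < start:
            return False
        next_start = end + 1
    return next_start == protein_length + 1

def sides_alternate(segments):
    """True if consecutive non-membrane segments lie on opposite sides."""
    sides = [kind for _, _, kind in sorted(segments) if kind != "transmembrane"]
    return all(a != b for a, b in zip(sides, sides[1:]))

helices = sum(1 for _, _, kind in SEGMENTS if kind == "transmembrane")
print(tiles_protein(SEGMENTS, 352), sides_alternate(SEGMENTS), helices)
```

The check confirms seven helices, in line with the seven transmembrane regions mentioned in the gene summary, with an extracellular N-terminus and a cytoplasmic C-terminus.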

Natural variations
Alternative sequence | 1 - 5 | 5 | MEGIS -> MSIPLPLLQ | VSP_001890

Amino acid modifications
Disulfide bond | 109 - 186 | 78 | By similarity | P61073-DISULFID-109
Glycosylation | 11 - 11 | 1 | N-linked (GlcNAc...) | P61073-CARBOHYD-11
Glycosylation | 18 - 18 | 1 | O-linked (Xyl...) (chondroitin sulfate) | P61073-CARBOHYD-18
Glycosylation | 176 - 176 | 1 | N-linked (GlcNAc...) (Potential) | P61073-CARBOHYD-176
Modified residue | 21 - 21 | 1 | Sulfotyrosine | P61073-MOD_RES-21
Modified residue | 319 - 319 | 1 | Phosphoserine | P61073-MOD_RES-319
Modified residue | 348 - 348 | 1 | Phosphoserine | P61073-MOD_RES-348
Modified residue | 351 - 351 | 1 | Phosphoserine | P61073-MOD_RES-351
Experimental info |
Mutagenesis | 7 - 7 | 1 | Y->F: Sulfate incorporation greatly reduced; when associated with F-12 and F-21. Moderate reduction in sulfate incorporation; when associated with F-12 and A-18. No sulfate incorporation and binding of SDF-1alpha greatly reduced; when associated with F-12, A-18 and F-21 | P61073-MUTAGEN-7
Mutagenesis | 8 - 8 | 1 | T->A: No effect on sulfate incorporation; when associated with A-9 and A-13 | P61073-MUTAGEN-8
Mutagenesis | 9 - 9 | 1 | S->A: No effect on sulfate incorporation; when associated with A-8 and A-13 | P61073-MUTAGEN-9
Mutagenesis | 11 - 11 | 1 | N->A: Reduced molecular weight. Enhanced coreceptor activity on R5 HIV-1 isolate Envs. Slight further enhancement of coreceptor activity; when associated with A-13 | P61073-MUTAGEN-11
Mutagenesis | 12 - 12 | 1 | Y->F: Sulfate incorporation greatly reduced; when associated with F-7 and F-21. Moderate reduction in sulfate incorporation; when associated with F-7 and A-18. No sulfate incorporation and binding of SDF-1alpha greatly reduced; when associated with F-7, A-18 and F-21 | P61073-MUTAGEN-12
Mutagenesis | 13 - 13 | 1 | T->A: Enhanced coreceptor activity on R5 HIV-1 isolate Envs. No effect on sulfate incorporation; when associated with A-8 and A-9 | P61073-MUTAGEN-13
Mutagenesis | 18 - 18 | 1 | S->A: Sulfate incorporation greatly reduced; when associated with F-21. Moderate reduction in sulfate incorporation; when associated with F-7 and F-12. No sulfate incorporation and binding of SDF-1alpha greatly reduced; when associated with F-7, F-12 and F-21 | P61073-MUTAGEN-18
Mutagenesis | 21 - 21 | 1 | Y->F: Sulfate incorporation greatly reduced; when associated with F-7 and F-12. Sulfate incorporation greatly reduced; when associated with A-18. No sulfate incorporation and binding of SDF-1alpha greatly reduced; when associated with F-7, F-12 and A-18 | P61073-MUTAGEN-21
Mutagenesis | 176 - 176 | 1 | N->A: Enhanced coreceptor activity on R5 HIV-1 isolate Envs; when associated with A-11 | P61073-MUTAGEN-176
|
1 | MEGISIYTSD | NYTEEMGSGD | YDSMKEPCFR | EENANFNKIF | LPTIYSIIFL | TGIVGNGLVI | LVMGYQKKLR | SMTDKYRLHL
81 | SVADLLFVIT | LPFWAVDAVA | NWYFGNFLCK | AVHVIYTVNL | YSSVLILAFI | SLDRYLAIVH | ATNSQRPRKL | LAEKVVYVGV
161 | WIPALLLTIP | DFIFANVSEA | DDRYICDRFY | PNDLWVVVFQ | FQHIMVGLIL | PGIVILSCYC | IIISKLSHSK | GHQKRKALKT
241 | TVILILAFFA | CWLPYYIGIS | IDSFILLEII | KQGCEFENTV | HKWISITEAL | AFFHCCLNPI | LYAFLGAKFK | TSAQHALTSV
321 | SRGSSLKILS | KGKRGGHSSV | STESESSSFH | SS
|
>sp|P61073|CXCR4_human C-X-C chemokine receptor type 4
MEGISIYTSDNYTEEMGSGDYDSMKEPCFREENANFNKIFLPTIYSIIFLTGIVGNGLVILVMGYQKKLRSMTDKYRLHL
SVADLLFVITLPFWAVDAVANWYFGNFLCKAVHVIYTVNLYSSVLILAFISLDRYLAIVHATNSQRPRKLLAEKVVYVGV
WIPALLLTIPDFIFANVSEADDRYICDRFYPNDLWVVVFQFQHIMVGLILPGIVILSCYCIIISKLSHSKGHQKRKALKT
TVILILAFFACWLPYYIGISIDSFILLEIIKQGCEFENTVHKWISITEALAFFHCCLNPILYAFLGAKFKTSAQHALTSV
SRGSSLKILSKGKRGGHSSVSTESESSSFHSS
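
The FASTA record above can be used to confirm that annotated positions point at the expected residues. The following is a minimal Python sketch under stated assumptions: the helper names `parse_fasta` and `residues` are hypothetical, the FASTA text is pasted in from this record rather than fetched from any service, and positions are treated as 1-based and inclusive, as in the feature table.

```python
FASTA_TEXT = """>sp|P61073|CXCR4_human C-X-C chemokine receptor type 4
MEGISIYTSDNYTEEMGSGDYDSMKEPCFREENANFNKIFLPTIYSIIFLTGIVGNGLVILVMGYQKKLRSMTDKYRLHL
SVADLLFVITLPFWAVDAVANWYFGNFLCKAVHVIYTVNLYSSVLILAFISLDRYLAIVHATNSQRPRKLLAEKVVYVGV
WIPALLLTIPDFIFANVSEADDRYICDRFYPNDLWVVVFQFQHIMVGLILPGIVILSCYCIIISKLSHSKGHQKRKALKT
TVILILAFFACWLPYYIGISIDSFILLEIIKQGCEFENTVHKWISITEALAFFHCCLNPILYAFLGAKFKTSAQHALTSV
SRGSSLKILSKGKRGGHSSVSTESESSSFHSS
"""


def parse_fasta(text):
    """Parse FASTA text into a dict mapping header line to sequence string."""
    chunks = {}
    header = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            chunks[header] = []
        elif header is not None:
            chunks[header].append(line)
    return {name: "".join(parts) for name, parts in chunks.items()}


def residues(sequence, start, end=None):
    """Return residues start..end (1-based, inclusive); one residue if end is omitted."""
    if end is None:
        end = start
    return sequence[start - 1:end]


if __name__ == "__main__":
    records = parse_fasta(FASTA_TEXT)
    sequence = next(iter(records.values()))
    print("length:", len(sequence))
    print("motif 133-135:", residues(sequence, 133, 135))
    print("disulfide partners 109 and 186:", residues(sequence, 109), residues(sequence, 186))
    print("glycosylation sites 11 and 176:", residues(sequence, 11), residues(sequence, 176))
```

A mismatch between an annotated residue and the letter found at that position usually indicates an off-by-one error or a different isoform.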
|
Primary source | UniProt : Q53S69
Name | Chemokine (C-X-C motif) receptor 4 [submitted name]
Alternative name(s) | Putative uncharacterized protein CXCR4 [submitted name]
Synonym(s) |
Organism | Homo sapiens
Length | 352 aa
Protein existence | ---
|
General annotation (Comments)
|
Similarity
|
Belongs to the G-protein coupled receptor 1 family.
|
Biological process
|
signal transduction [GO:0007165]
G-protein coupled receptor protein signaling pathway [GO:0007186]
|
Cellular component
|
plasma membrane [GO:0005886]
integral to membrane [GO:0016021]
|
Molecular function
|
C-C chemokine receptor activity [GO:0016493]
C-X-C chemokine receptor activity [GO:0016494]
|
With
|
Uniprot accession
|
IntAct
|
Alternative product(s)
|
Feature key | Position | Length | Description | Feature identifier
|
1 | MEGISIYTSD | NYTEEMGSGD | YDSMKEPCFR | EENANFNKIF | LPTIYSIIFL | TGIVGNGLVI | LVMGYQKKLR | SMTDKYRLHL
81 | SVADLLFVIT | LPFWAVDAVA | NWYFGNFLCK | AVHVIYTVNL | YSSVLILAFI | SLDRYLAIVH | ATNSQRPRKL | LAEKVVYVGV
161 | WIPALLLTIP | DFIFANVSEA | DDRYICDRFY | PNDLWVVVFQ | FQHIMVGLIL | PGIVILSCYC | IIISKLSHSK | GHQKRKALKT
241 | TVILILAFFA | CWLPYYIGIS | IDSFILLEII | KQGCEFENTV | HKWISITEAL | AFFHCCLNPI | LYAFLGAKFK | TSAQHALTSV
321 | SRGSSLKILS | KGKRGGHSSV | STESESSSFH | SS
|
>sp|Q53S69|Q53S69_human Chemokine (C-X-C motif) receptor 4 [submitted name]
MEGISIYTSDNYTEEMGSGDYDSMKEPCFREENANFNKIFLPTIYSIIFLTGIVGNGLVILVMGYQKKLRSMTDKYRLHL
SVADLLFVITLPFWAVDAVANWYFGNFLCKAVHVIYTVNLYSSVLILAFISLDRYLAIVHATNSQRPRKLLAEKVVYVGV
WIPALLLTIPDFIFANVSEADDRYICDRFYPNDLWVVVFQFQHIMVGLILPGIVILSCYCIIISKLSHSKGHQKRKALKT
TVILILAFFACWLPYYIGISIDSFILLEIIKQGCEFENTVHKWISITEALAFFHCCLNPILYAFLGAKFKTSAQHALTSV
SRGSSLKILSKGKRGGHSSVSTESESSSFHSS
|