DNA Data File for Human Sex Hormone-Binding Globulin (SHBG)


LOCUS       HUMSHBGA     6087 bp ds-DNA             PRI       15-JUN-1990

DEFINITION  Human sex hormone-binding globulin (SHBG) gene, complete cds.

ACCESSION   M31651

KEYWORDS    human sex hormone-binding globulin.

SOURCE      Human adult testis DNA, clone hgSHBG-6.

  ORGANISM  Homo sapiens

            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;

            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.

REFERENCE   1  (bases 1 to 6087)

  AUTHORS   Hammond,G.L., Underhill,D.A., Rykse,H.M. and Smith,C.L.

  TITLE     The human sex hormone-binding globulin gene contains exons for

            androgen-binding protein and two other testicular messenger RNAs

  JOURNAL   Mol. Endocrinol. 3, 1869-1876 (1989)

  STANDARD  simple staff_review

COMMENT     A region of SHBGr-3 mRNA contains an inverted exon.  However, the

            paper does not state the boundaries. This exon is related to a

            palindromic sequence 'atcttggctcagtctccacctccaagat' located at

            positions 4455-4482.



BASE COUNT     1400 a   1632 c   1640 g   1415 t

ORIGIN      1 bp upstream of EcoRI site.



M31651  Length: 6087  January 18, 1992  09:30  Type: N  Check: 9104  ..



       1  GAATTCGGCT AGCTCCTAAG GCGTGGGTAC GGAAGCTAGA TTAGAGCAGA 



      51  AGGGCCCCGC TGCTCCCCGA GCAGGTTCCC AAGGCGAGCC CCTCCCCCTG 



     101  CCCCCGCCTC CTACGACCCC GCTCTGGCCG CGCCACTCTG ACCCCCGGGT 



     151  TACCGGCCTG CAGTCTTCAC CCGAATCAGC CTCAGGATAT CTCCACAGTC 



     201  TCCCTCCTTG GCCTCTCGGA TCCGCACGGA AGCCATCCGG ATCCCCGCTG 



     251  TCTGGGACCA AAGTCCCAGG GCCTCGCAAA CGGCAACTAG ACCCCTTAAA 



     301  GGGCCTACGG ACTTGGATCC TGAAGAGCCT GAGAGAGCGG GGTGGCGGGA 



     351  GTCGGGGGGG ACGGCGGGGT AGCCGCGGCC TGGTAAGTGG AGCTGGGATT 



     401  CCGGCGCCGT ACGGGAGGAG AGAGTAGGCC AGCGAGGCGA TCCTCTGTCC 



     451  GGCATAGCCC CACCCCCTCG AATTCTGTCG CAGCAGGGGG CACAACTGTC 



     501  AGCCAATCAG CTTGGAGAAC AGGCACGGCC GCGTCCCCCC CAAGCCCCAC 



     551  CCCCGACAGC TGGATCTTGT GACTGGGCTC CTGGGTAGAG TTCAAGGTTG 



     601  GAGTGAAGCG GCTTCCTTGC GGTTGTGTGG GTGTCCCAAC CTGGGTCGAG 



     651  ATACCCCGCG GTTCAAAGGC TCCCCCGCAG TGCTTTTTAA ATTGACATAT 



     701  GCAGTGATAA CCTGCTTTAG CCTCAGGCTC ACTCACCCGC CCAGACCCTG 



     751  GGTAAGCCTT AAGACCCTCA GCTCTGAAAG CTGTTTCCTG CAGCTCTTGA 



     801  GTAGCATGAA GTGTTACCTC TTGGGGGCAT TTGCATTTTT AAATGTTTTA 



     851  TTTTATATTT ATTTATTTAT TTATTTTTGG AGATGGTGTA TTGCTTTGTC 



     901  GCCCAGGCTG GGGTGTAGTG GCGCGATCTC TGCTCACTGC AGCCTCCACC 



     951  TCCCGAGTTC AAGCGATCCT CCTGTCTCAG CCTCCGGAGT AGCTGGAACT 



    1001  ACAGTCGCGC ACCGGCACGC CCGGCTAATT TTTTTTTCTT TTTTCTTCTC 



    1051  TTTTTTTTTG TTTTAACGGA TTCTCACTCT GTCACTCAGG CTGGAGTGCA 



    1101  GTGGCGCGAT CTCGGCTCAC TGCAACCTCT CCCTCCTGGG TTCAAGCGAT 



    1151  TCTCCTGCCT CACCCTGGAG ATAGCTGGGA TTACAGGCAT GGGCAACCAT 



    1201  ACCTGGCTAA TTTTTGTATT TTTAGTAGAG ACGGGGTTTC ACTATGTTGG 



    1251  CCAGGCTGGT CTGGAACTCC TGACCTCAAG TGATCTGCCC GCCTCAGCCT 



    1301  TCTAAAGTGC GGGGATTACA GATGTGACCC ACCAAGCCCG GTCTGTCATT 



    1351  TGCATTTTAA AATGGGTCAT GGGGTGGGCA CAGTGGCTCA CACCTGTAAT 



    1401  CCCAGCATTT TGGGGAGGCA GAGGCAGGCG GATCACTGAG ATCAGGAATT 



    1451  TGAGACCAGC CTGACCAACA TGGTGAAACC CGTCTCTACT AAAATACAAA 



    1501  ATTAGACAGG TGTGGTGGCG CATGCCTTTA CTCCCAGCTA CACGGGAGGC 



    1551  TGAGACAGGA GAATCGCTTG AACCTGGGAG GTGGAGGTTA CAGTGAGCCG 



    1601  AGATCGTGCC ATTGCACTCC AGCCTGGGCA ACAAAAGCGA AACTCCATCT 



    1651  CAAAATAAAT AAATAAAATA AAATGCGTCA GGGAGGGTCG GGCCTTGTGG 



    1701  CTAATGCCTG TAATCCAGGT ACTTTGGGAG GCTGAGGTGG GCGGATCATT 



    1751  TCAGGTCAGA GGTTCGAGAT CAGCCTGGGC AACGTGGTGA AATCCCCGTC 



    1801  TCTACTAAAA ACACAAAAAA ATTTGCTGGG CGTGGTGGTG CGTGCACCTG 



    1851  TAGTCCCAAC TACTAAGGAG GCTGAGACAG GAGGATCGCT TGAACTCGAG 



    1901  AGGCAGAGGC AGCAGTGAGC CGAGATCACT CCACTGCACT CCAGCCTGGG 



    1951  TGATAGAGCA AGACTCTGTC TAAAATAAAA TAAAATAAAA TAAAATAAAA 



    2001  TTGGGTCAGG GAGTGGGTGA TTTCTACTGC TAGACTGTTT AGGCCCTGTA 



    2051  ATAAATGGAT AAGGGAAGAT AACTGAGAGG CGGGGGGCAG GTCCCTTCTT 



    2101  AATATTCACT GAATCATACA CACAGACAAT ACCTTCTTGG GAGACAGGCC 



    2151  TCAGAGGCTG GGAAAAGACT GGGGGAGGAG TTCAGACCAG ATGCCAGGCA 



    2201  CTGTGCCTGC ATTTTCTCAA TGAACCCTCT TTCACAGTCA CCCCGTAAAG 



    2251  TATTATTTCC TCATTTTACA GCAAGGACAC TGAAGCACAA AGGTGAAGTG 



    2301  ACTTGGCCCA AGGTCACTCA GGGACAGAAA TCTTGGAGGA CCTAGATCAG 



    2351  GCCCTAGAGG AGGAGAGGGG AGATGGAATA TCCTCTCCCA GTTCAGAAAC 



    2401  TTTCTCGGCA GTGGAGGATG ATAGTGGAGG GACTCTGTCC TTCACCCCAT 



    2451  TGATCCCCAG AGGGGTGATA GCTGAGTCTT GTGACTGGGC CCCTGGGCAG 



    2501  GGGTCAAGGG TCAGTGCCCC TGTTTCCTTT ACCCCCTCCT CCCCGGGCAA 



    2551  CCTTTAACCC TCCACCGCCC ACACGCAAGG CTGCCTGCCT CTACACATTC 



    2601  TCCCAAGAGT TGTCTGAGCC GCCGAGTGGA CAGTGGCTGA TTATGGAGAG 



    2651  CAGAGGCCCA CTGGCTACCT CGCGCCTGCT GCTGTTGCTG CTGTTGCTAC 



    2701  TACTGCGTCA CACCCGCCAG GGATGGGCCC TGAGACCTGT TCTCCCCACC 



    2751  CAGGTGCAGG AGCGGGACAG GGCACTCAGC TCATGCAGTC TTCCCTTCTC 



    2801  TCCTCTGGCC CTGTAGCAGG GCCTCTCCCT CTGTCTGTCT CTGACATGTC 



    2851  CCTACTCAGC TTTGTTTGTT TTCTCTTTCT GATAGAGTGC CCACGACCCT 



    2901  CCGGCTGTCC ACCTCAGCAA TGGCCCAGGA CAAGAGCCTA TCGCTGTCAT 



    2951  GACCTTTGAC CTCACCAAGA TCACAAAGTA TGGGGTTGGC CTAGCCCTTG 



    3001  ACCCAGTCCC CTGGTTCTGC CCTCTCTCCA TCAGCTCTTC TCTTTTCCCT 



    3051  GTCTTCCTTT CCTTATCTGT GAACACCATC TCCCCCAAAC CCACACTGGT 



    3101  TCTCAAAGGA CACATGACAT ACACAATCTT TCCTTCTGTG TCCTTCCAGA 



    3151  ACCTCCTCCT CCTTTGAGGT TCGAACCTGG GACCCAGAGG GAGTGATTTT 



    3201  TTATGGGGAT ACCAACCCTA AGGATGACTG GTTTATGCTG GGACTTCGAG 



    3251  ACGGCAGGCC TGAGATCCAA CTGCACAATC ACTGGGCCCA GCTTACGGTG 



    3301  GGTGCTGGAC CACGGCTGGA TGATGGGAGA TGGCACCAGG TAAGCTAGCT 



    3351  CTGGTCCTCA GGGGAGGGAT GTCTGGAGCT GGTCTGAGGA AAGGGAACAA 



    3401  AACCAAGTTA TTGGGCATCC CTTTACCACT GTCATCTCGT TTAATCCACA 



    3451  CGAACCCCCA CAAAGTAGCT ATTCTTGGCC CCATCTTTTC TGATGGGAAT 



    3501  TCTAAGGCTC AGTCAGTATA TAAGTGACAA GAGCTGAGTG ACCCAAGGCC 



    3551  AAGGATGCTA GCTGCTTCTT TAAGGCATGT TCTTTCCACT ATAGTACTAG 



    3601  GCTGCCTCAC AGGAAGGTGG CAGAAACAGA TCCCAGGGGC CTCTGATTTT 



    3651  GCTTCCCACC TTCCTGCAGG TGGAAGTCAA GATGGAGGGG GACTCTGTGC 



    3701  TGCTGGAGGT GGATGGGGAG GAGGTGCTGC GCCTGAGACA GGTCTCTGGG 



    3751  CCCCTGACCA GCAAACGCCA TCCCATCATG AGGATTGCGC TTGGGGGGCT 



    3801  GCTCTTCCCC GCTTCCAACC TTCGGTTGCC GGTAACTACA CCCCAGGGGT 



    3851  GGAACCCTAG CCAAGACTTG GTAAAGCACT GCTGGGTGGC TGGCCGTGGG 



    3901  AATCTAAGTC CACACTTTTA GGGAGAAGGG AAGGGTTGAG AGCTGCAAGG 



    3951  GGGAGGCCAA ATGCTCAGAG GGGAGTCAAC TGAGGGCAGG GAGGTCGGGA 



    4001  CTGCGCCTCC GATGCCCTGA TTTCTACATC CCCGTATCTT ATCTCTGTCA 



    4051  CACTCCAGCT GGTTCCTGCC CTGGATGGCT GCCTGCGCCG GGATTCCTGG 



    4101  CTGGACAAAC AGGCCGAGAT CTCAGCATCT GCCCCCACTA GCCTCAGAAG 



    4151  CTGTGATGTA GAATCAAATC CCGGGATATT TCTCCCTCCA GGGACTCAGG 



    4201  CAGAATTCAA TCTCCGAGGT AGATTTCCTC GGAGTCTATT TTTCCCACCC 



    4251  TGGCCAGCTC AGCCTGCCTC TGTCCCCCTC TACCACTGGC CCCTTTCCTC 



    4301  CTTGAGACCC CAGCTTTGAG GCCTCAGGAT AATCATTTCT CCCCACAGAC 



    4351  ATTCCCCAGC CTCATGCAGA GCCCTGGGCC TTCTCTTTGG ACCTGGGACT 



    4401  CAAGCAGGCA GCAGGCTCAG GCCACCTCCT TGCTCTTGGG ACACCAGAGA 



    4451  ACCCATCTTG GCTCAGTCTC CACCTCCAAG ATCAAGTAAG GGACAGTGGG 



    4501  CATTGCCTGT ATTCAGTGGA GCCTGGAGCA ATGAGGAAGA GGGAGTCCAA 



    4551  CATGTCAATA TTAGGAAGGT TTCCAGCCCA GGGAACATAA CAAGACTGGC 



    4601  TCCACAGAAT TGTTTTTCAT TAATAATTAG CCAGGCATGG TGGTGGTGCT 



    4651  TGCCTGTAAT CCCAGGTGCT GGAGGCCAAG ACCAGAGGAT CACTTGAGGC 



    4701  CAGGAGTTTG ACACCAGCCT GGGCAACATA GCAGAGACCT CTGTCTAAAA 



    4751  AAAAAAAAAA ATTAGCCAGG CATGGTAGCA CATGTCTGCT GCCCTAGCTA 



    4801  TTTAGGAGCC TGAGGCAGGA GGTTCACTTG AGCCCAGGAG TTTGAAGCTG 



    4851  CAGTGAGCTA TGATGTGCCA CTGCACTCTG ACCTGGGCCA CAGTGAGACC 



    4901  CTGTCTCAAA AAATAAAAAT AAAAATAAGG CTTATGGATG GCACTCAGGT 



    4951  GGGTGGTAGG GGCGAGGGAC ATATCTTGAA GCTCCCCACA GCAAGCAAAC 



    5001  AGTTTTGACT TAGACTGCAT ATTTACTTGG GGCAGGTGTG GTTTCAAAAA 



    5051  GGGTCAAGCC AAAAAAAATT GGGGCAGGAT TTAAGTGGTG AGAATGGCCA 



    5101  GTAGGTGGAG GCATAGCGAA GAGGCAGAAT TAAGGCAGCT AGGGGTGAGG 



    5151  CCACAGGCAG TAGGCCCGGC TCATTCTTCC CTCTCTCTCT ACCGTCCCTT 



    5201  TCCCACACAC TCTGCAGAAG GTGGTGTTGT CTTCTGGGTC GGGGCCAGGG 



    5251  CTGGATCTGC CCCTGGTCTT GGGACTCCCT CTTCAGCTGA AGCTGAGTAT 



    5301  GTCCAGGGTG GTCTTGAGCC AAGGGTCGAA GATGAAGGCC CTTGCCCTGC 



    5351  CTCCCTTAGG CCTGCTCCCC TCCCTTAACC TCTGGGCCAA GCCTCAAGGG 



    5401  CGTCTCTTCC TGGGGGCTTT ACCAGGTAAG AGAGAATGAT GTTCAAGTTC 



    5451  ATGAGCACAA CATTGGAAAC AGCTCAAGGG AGGCGGCACA TTTTGAGGGG 



    5501  AAGGAAACCT CTGGGAGGGA AGAAGAATAG GCCACAAGAA GAAGATATGG 



    5551  GGGCAGTGGA AGGTAGTGCT TTTGCAAACT CAGGTTGGAG GAGTGGAAAA 



    5601  GTGGGGAGAA GATTCTGGAT CCGAGCCACC TTAATGCTCT AATGCCACCT 



    5651  TTGCACTACC TCCCTCTAGG AGAAGACTCT TCCACCTCTT TTTGCCTGAA 



    5701  TGGCCTTTGG GCACAAGGTC AGAGGCTGGA TGTGGACCAG GCCCTGAACA 



    5751  GAAGCCATGA GATCTGGACT CACAGCTGCC CCCAGAGCCC AGGCAATGGC 



    5801  ACTGACGCTT CCCATTAAAG CTCCACCTAA GAACCCCCTT TGAAAGTTAC 



    5851  TGATTATTCA TTTATTCAAC AAATATTCAC TGTGCACTAG CAATGTACCA 



    5901  GGCACTGTGC CAAGTATTGA GTTGTCTTAA TGAGCAAAAA CACTCTGGTT 



    5951  CCTACCCTCT TGGTGCCCAC AGTCCCATAG GGAAGCAGAC ATCCATCAAA 



    6001  GGCTAACTAA TAAGTGGATA GTTGGAAGCA CTGATAAAGA AGAATTGGAG 



    6051  AGTTGTGAAA ACATGGAGAC TGGCGGGCGT GTGGCTC