LOCUS       SPHISH34     2506 bp    DNA             INV       12-SEP-1993
DEFINITION  Sea urchin (S. purpuratus) late embryonic H3 and H4 histone genes.
ACCESSION   X03952
NID         g10256
KEYWORDS    histone; histone H3; histone H4; inverted repeat.
SOURCE      purple urchin.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Echinodermata;
            Echinozoa; Echinoidea; Euechinoidea; Echinacea; Echinoida;
            Strongylocentrotidae; Strongylocentrotus.
REFERENCE   1  (bases 1 to 2506)
  AUTHORS   Kaumeyer,J.F. and Weinberg,E.S.
  TITLE     Sequence, organization and expression of late embryonic H3 and H4
            histone genes from the sea urchin, Strongylocentrotus purpuratus
  JOURNAL   Nucleic Acids Res. 14 (11), 4557-4576 (1986)
  MEDLINE   86232591
FEATURES             Location/Qualifiers
     source          1..2506
                     /organism="Strongylocentrotus purpuratus"
                     /db_xref="taxon:7668"
     repeat_unit     609..618
                     /note="inverted repeat A"
     repeat_unit     625..647
                     /note="inverted repeat B"
     CDS             complement(678..989)
                     /note="histone H4 (aa 1-103)"
                     /codon_start=1
                     /db_xref="PID:g10257"
                     /db_xref="SWISS-PROT:P02306"
                     /translation="MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGV
                     KRISGLIYEETRGVLKVFLENVIRDAVTYCEHAKRKTVTAMDVVYALKRQGRTLYGFG
                     G"
     repeat_region   1034..1038
                     /note="direct repeat 1"
     promoter        complement(1048..1055)
                     /note="TATA-like sequence"
     repeat_region   1071..1075
                     /note="direct repeat 1"
     promoter        1865..1871
                     /note="CAAT-like sequence"
     promoter        1898..1904
                     /note="CAAT-like sequence"
     promoter        1924..1930
                     /note="CAAT-like sequence"
     promoter        1947..1954
                     /note="TATA-like sequence"
     CDS             2006..2416
                     /note="histone H3 (aa 1-136)"
                     /codon_start=1
                     /db_xref="PID:g10258"
                     /db_xref="SWISS-PROT:P06352"
                     /translation="MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRP
                     GTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTELRFQSSAVMALQEASEAYLV
                     RLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA"
     repeat_unit     2444..2466
                     /note="inverted repeat B'"
     repeat_unit     2473..2481
                     /note="inverted repeat A'"
BASE COUNT      720 a    592 c    486 g    708 t
ORIGIN      
        1 aattcttagg ggagggggcc caattttttt ttttttttac cccctacgac gaccacccca
       61 tgggacactt accccccgga caactaggct aggcctagtg ggtagttatc atttatgaca
      121 tgaaattgta catattatat agactataaa taattcataa aaatagtgtg tggcaaaaat
      181 agtcaactgt gaccgattgc caaacaaaaa caaatcaaga agaagaatat cctaggccta
      241 ttttgagaat gagaaatcca agggttaggc ctactgttag tgagcactaa ttttttgttt
      301 cttccctcaa tcaccccccc cccccccttt tgtttaaaaa aaatggcatg taacaggttc
      361 cctggccctg cctgccttac gagtgagaac attggggaaa gttgggtaat tatcaggcct
      421 ggaaatcaca gatcatttct tatcactctc cagagtagtc tagctctaga cctagactag
      481 atctagtttt agactcactt tcgaattaat actttcaaaa gccccgcatt cttcggtcac
      541 cacacacact tcatttatac tagactctag atcatgtgct gtcaaactta gttagagatt
      601 cttattgatt ctttcttgga tatttggtgg ctctaaaaag agccgtttga tatgccgagt
      661 taagaggtat ttccttctta accgccgaat ccgtacaggg tacgtccttg gcgcttcaga
      721 gcgtagacga cgtccatggc agtgacggtc tttctcttgg cgtgctcgca gtaagtgacg
      781 gcatcacgga tgacgttctc aaggaagacc ttgaggaccc cacgggtctc ttcgtagatg
      841 agaccgctga tacgcttgac acctccacgt cgggcaagac ggcgaatggc gggcttggtg
      901 attccttgga tgttatcacg caaaaccttg cgatgacgct tggctcctcc ctttccgagt
      961 ccttttcctc ctttaccacg tccagacata ttgacttgta gatttgacaa ttgaaaaaat
     1021 ctgtactgaa aatcgagttt cggcgggtat atataactcc tttgcggacg cgagtgtact
     1081 aaataatttc tattgagagt gagccgccta cggtcaaggg gactaaaatc tcgtcgcttc
     1141 gtcgatgcaa tatttgcata aactatcgca cgttcgttat gaacaaatcg tctttactca
     1201 gatcggtata aaaaatcgtc aagaacagag ttccggttat caaatactat ttaaacactt
     1261 taattcatct caatacagta tttaagcgta ttagtttaca tattgcatag tagacaagag
     1321 aacattaata tattttactt gaccaaaatc gtttacaaca ggtcgccagc accttatgaa
     1381 taattcatca ggactccttg aagtcgtttg ctcgccaaaa tagaaaacaa cgtggaaata
     1441 ttctttcaat ttctactttt gtttggtgaa aaaatagtcc atattatttt ctattcaata
     1501 tcgatcattt cattttcaat tttgatggat gcttttattg ataaatgata acatacttct
     1561 caaaaagcca aaatgtcgac gacaggaaag ccggtttgtt aatgaattat tcatttttac
     1621 agcgatctcg aaaatccaat ccagtacgat ctttcttctc ttaaaaccta atgaatatta
     1681 cgtattagcg tataaatttc tgtaatacat ttacaaatac tactttacag cgataacgat
     1741 gcaatttagg atgtaattaa gtttaatatt tcataatctt ataacgttta ctacaatgac
     1801 catgtacaaa atcacacgac gaggcccgaa gaaatcatga atatattaag aaagcggaca
     1861 gtacccaatc acatactgtg cttgatatag cgaatggcca atcactgctt gtcgcacgac
     1921 taaccaatca tcttcgtcaa ttttgatata aatacgagtg cgggattttt gaaacatcag
     1981 ttgatatcac attcagcaaa tcaaaatggc ccgtaccaag cagaccgctc gcaagtccac
     2041 cggaggaaag gctcctcgca agcagctggc caccaaggca gctcgcaagt ccgccccagc
     2101 cactggcgga gtcaagaagc cccatcgtta caggcccggt accgtcgctc tccgtgagat
     2161 ccgtcgctac cagaagagta ccgagctgct catccgcaag ctccccttcc agcgtctggt
     2221 ccgtgagatc gctcaggact tcaagaccga gctccgtttc cagagctctg ccgtcatggc
     2281 tcttcaggag gccagcgaag cctacttggt ccgtcttttc gaggacacca acctttgcgc
     2341 catccacgcc aagcgtgtta ccatcatgcc aaaggacatc cagctggccc gccgcatccg
     2401 tggcgagcga gcttagattg tcagcttgac atctaaataa accaacggct ctttttagag
     2461 ccaccacatt tccaagaaag atcaaattct aaactctgcg tagatc
//