LOCUS       SUSEGFIII    1980 bp    mRNA            INV       15-JUN-1993
DEFINITION  Strongylocentrotus purpuratus fibropellin c (SpEGF III) mRNA,
            complete cds.
ACCESSION   L07045
NID         g310659
KEYWORDS    epidermal growth factor; fibropellin c.
SOURCE      Strongylocentrotus purpuratus (library: lambda gt10) gastrula cDNA
            to mRNA.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Echinodermata;
            Echinozoa; Echinoidea; Euechinoidea; Echinacea; Echinoida;
            Strongylocentrotidae; Strongylocentrotus.
REFERENCE   1  (bases 1 to 1980)
  AUTHORS   Bisgrove,B.W. and Raff,R.A.
  TITLE     The SpEGF III gene encodes a member of the fibropellins: EGF
            repeat-containing proteins that form the apical lamina of the sea
            urchin embryo
  JOURNAL   Dev. Biol. 157, 526-538 (1993)
  MEDLINE   93273088
FEATURES             Location/Qualifiers
     source          1..1980
                     /organism="Strongylocentrotus purpuratus"
                     /db_xref="taxon:7668"
                     /dev_stage="gastrula"
                     /tissue_lib="lambda gt10"
     sig_peptide     112..165
                     /gene="EGF III"
                     /note="putative"
     CDS             112..1824
                     /gene="EGF III"
                     /note="Avidin-like domain AA 442-570; Cis-like domain AA
                     57-175; N-linked glycosylation AA 30-32; 136-138; 357-359;
                     EGF-like repeats AA 19-56; 176-213;214-251;252-289;
                     290-327; 328-365; 366-403; 404-441"
                     /codon_start=1
                     /product="fibropellin c"
                     /db_xref="PID:g310660"
                     /translation="MKVSLLAVLLLSIVAATYGQGECGSNPCENGSVCRDGEGTYICE
                     CQMGYDGQNCDRFTGANCGYNIFESTGVIESPNYPANYNNRADCLYLVRIKGARVITF
                     TIEDFATEIFKDAVEYGVGPVADFNQALATFEGNLTANNQVPPPFSVQGEQAWFIFST
                     DRNIPRKGFRITFSSDGDDCTPNPCLNGATCVDQVNDYQCICAPGFTGDNCETDIDEC
                     ASAPCRNGGACVDQVNGYTCNCIPGFNGVNCENNINECASIPCLNGGICVDGINQFAC
                     TCLPGYTGILCETDINECASSPCQNGGSCTDAVNRYTCDCRAGFTGSNCETNINECAS
                     SPCLNGGSCLDGVDGYVCQCLPNYTGTHCEISLDACASLPCQNGGVCTNVGGDYVCEC
                     LPGYTGINCEIDINECASLPCQNGGECINGIAMYICQCRQGYAGVNCEEVGFCDLEGV
                     WFNECNDQITIIKTSTGMMLGDHMTFTERELGVAAPTVMVGYPSNNYDFPSFGITVVR
                     DNGRTTTSWTGQCHLCDGQEVLYTTWIESSMVSTCEEIKRANKVGQDKWTRYEQSFAP
                     QPDA"
     gene            112..1824
                     /gene="EGF III"
     mat_peptide     166..1821
                     /gene="EGF III"
                     /product="fibropellin c"
BASE COUNT      535 a    441 c    498 g    506 t
ORIGIN      
        1 gttgagggtc cgagagcgtc ttggaaacgt agtgaagaca tagcgcaagt acaaggcatc
       61 gcagctttct ccgctttcga cttgagacgt tgactcctct gaggcttcga catgaaggtg
      121 tctttactag ccgttttgct tctcagtatt gttgctgcaa catacgggca aggtgaatgt
      181 ggcagtaacc catgtgagaa tggctcagtg tgtcgagacg gagaagggac atacatctgt
      241 gaatgccaga tgggctatga tgggcaaaac tgcgatcgtt tcacaggtgc aaactgcgga
      301 tataatatct tcgaatcgac tggagtgatc gagtctccca actacccggc caattacaat
      361 aaccgagccg actgtctcta cctcgtccgg atcaagggtg cacgtgtcat aacctttacc
      421 atcgaggatt tcgcgaccga gatctttaaa gatgctgttg agtacggtgt gggccctgtc
      481 gctgatttca accaagctct ggcaaccttc gaaggaaacc tgactgcaaa taaccaggtc
      541 ccacctccgt tctcagtcca gggagaacaa gcctggttca tcttctcgac agatcgtaac
      601 atcccccgga agggattcag aattacattc tcatcagatg gagacgactg tacccctaac
      661 ccctgtctga atggcgccac ttgcgttgac caggtcaatg attatcaatg tatctgcgct
      721 cctggattca ccggagataa ctgcgaaaca gatattgatg agtgtgctag cgccccttgt
      781 cgtaacggtg gtgcatgcgt agaccaggtc aatggataca cctgtaactg tattcctgga
      841 ttcaatggag tcaactgcga aaacaatatc aacgaatgtg ccagcattcc ttgtctgaat
      901 ggagggatct gtgtggatgg tatcaaccag ttcgcctgca cctgtctccc tggatacacg
      961 ggaatccttt gtgaaaccga cattaacgaa tgtgcaagca gcccatgcca aaatggcggt
     1021 tcctgtactg acgctgtgaa cagatataca tgtgattgtc gtgctggatt cactggaagt
     1081 aactgtgaga caaatatcaa cgagtgtgcc agcagccctt gtcttaatgg aggctcgtgc
     1141 ttggatggag ttgatggtta cgtctgccaa tgtcttccaa actacacggg gactcactgc
     1201 gagatatcac tggatgcatg cgcgagtctg ccgtgccaaa atggcggagt atgtacgaac
     1261 gttggtggcg attacgtttg tgaatgtcta ccaggatata ctggcataaa ttgcgaaatc
     1321 gatattaacg aatgcgctag tctaccgtgc cagaacggtg gcgaatgtat caacggtata
     1381 gccatgtaca tctgtcaatg tcgccaagga tacgccggtg tgaactgcga ggaagttgga
     1441 ttctgtgact tggaaggcgt gtggtttaac gagtgtaatg atcagatcac catcatcaag
     1501 acctcgacag gaatgatgct tggagatcat atgaccttca ctgaacgtga actcggagtc
     1561 gcagccccta ccgtgatggt cggctacccc agcaacaatt acgatttccc atcatttgga
     1621 atcaccgttg tccgtgataa tggacgtacg accaccagct ggactggtca gtgtcatcta
     1681 tgtgatggtc aagaggtctt atacaccaca tggatcgaga gcagcatggt cagtacctgt
     1741 gaggagatca agagagctaa caaggttgga caggataaat ggacgaggta cgagcagagc
     1801 tttgctccac agccagatgc ataaagagat atagttcacc tcaatttcaa tataaaaaac
     1861 atagttgcgg gtgggaacaa atctgcgtgt gttttaataa aaatcatttt aaacagtctt
     1921 tatattcaat actaatgtat tttatcgttt tttaacacat gaaaataata gattagaatt
//