LOCUS       SUSEGFI      3362 bp    DNA             INV       09-FEB-1995
DEFINITION  Strongylocentrotus purpuratus fibropellin Ia and alternatively
            spliced fibropellin Ib (EGFI) mRNA, complete cds.
ACCESSION   L08692
NID         g161465
KEYWORDS    epidermal growth factor (EGF) repeat-containing protein;
            extracellular matrix protein; fibropellin.
SOURCE      Strongylocentrotus purpuratus (tissue library: lambda gt11/lambda
            EMBL3) gastrula DNA.
  ORGANISM  Strongylocentrotus purpuratus
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Echinodermata;
            Echinozoa; Echinoidea; Euechinoidea; Echinacea; Echinoida;
            Strongylocentrotidae; Strongylocentrotus.
REFERENCE   1  (bases 1 to 3362)
  AUTHORS   Delgadillo-Reynoso,M.G., Rollo,D.R., Hursh,D.A. and Raff,R.A.
  TITLE     Structural analysis of the uEGF gene in the sea urchin
            Strongylocentrotus purpuratus reveals more similarity to vertebrate
            than to invertebrate genes with EGF-like repeats
  JOURNAL   J. Mol. Evol. 29 (4), 314-327 (1989)
  MEDLINE   90112459
FEATURES             Location/Qualifiers
     source          1..3362
                     /organism="Strongylocentrotus purpuratus"
                     /db_xref="taxon:7668"
                     /dev_stage="gastrula"
                     /tissue_lib="lambda gt11/lambda EMBL3"
     gene            join(133..1570,2483..3327)
                     /gene="EGFI"
     sig_peptide     133..186
                     /gene="EGFI"
     CDS             133..3327
                     /gene="EGFI"
                     /note="domains: C1s-like (aa. 57..175), avidin-like (aa.
                     980..1108); glycosylation sites: (aa. 74..76, 180..182, &
                     895..897); EGF-like repeats: #1 (aa. 19..56) #2-21 (aa.
                     176..976)"
                     /codon_start=1
                     /product="fibropellin Ia"
                     /db_xref="PID:g161467"
                     /translation="MRTWLLAVLLLSVIAVTYGQGECDSDPCENGSTCQEGEGSYICQ
                     CPMGYDGQNCDRFTGSNCGYNVFDANGMIDSPNYPAMYNNRADCLYLVRITKARSITF
                     TIEDFMTEVFKDVVEYGIGPEADFNQALGSFEGNLTQDDVIPAPFTVQGDQAWFIFST
                     DRNIVNRGFRITFSSDGDDCDPNLCQNGAACTDLVNDYACTCPPGFTGRNCEIDIDEC
                     ASDPCQNGGACVDGVNGYVCNCVPGFDGDECENNINECASSPCLNGGICVDGVNMFEC
                     TCLAGFTGVRCEVNIDECASAPCQNGGICIDGINGYTCSCPLGFSGDNCENNDDECSS
                     IPCLNGGTCVDLVNAYMCVCAPGWTGPTCADNIDECASAPCQNGGVCIDGVNGYMCDC
                     QPGYTGTHCETDIDECARPPCQNGGDCVDGVNGYVCICAPGFDGLNCENNIDECASRP
                     CQNGAVCVDGVNGFVCTCSAGYTGVLCETDINECASMPCLNGGVCTDLVNGYICTCAA
                     GFEGTNCETDTDECASFPCQNGATCTDQVNGYVCTCVPGYTGVLCETDINECASFPCL
                     NGGTCNDQVNGYVCVCAQDTSVSTCETDRDECASAPCLNGGACMDVVNGFVCTCLPGW
                     EGTNCEINTDECASSPCMNGGLCVDQVNSYVCFCLPGFTGIHCGTEIDECASSPCLNG
                     GQCIDRVDSYECVCAAGYTAVRCQINIDECASAPCQNGGVCVDGVNGYVCNCAPGYTG
                     DNCETEIDECASMPCLNGGACIEMVNGYTCQCVAGYTGVICETDIDECASAPCQNGGV
                     CTDTINGYICACVPGFTGSNCETNIDECASDPCLNGGICVDGVNGFVCQCPPNYSGTY
                     CEISLDACRSMPCQNGATCVNVGADYVCECVPGYAGQNCEIDINECASLPCQNGGLCI
                     DGIAGYTCQCRLGYIGVNCEEVGFCDLEGMWYNECNDQVTITKTSTGMMLGDYMTYNE
                     RALGYAAPTVVVGYASNNYDFPSFGFTVVRDNGQSTTSWTGQCHLCDGEEVLYTTWIN
                     TNMVSTCQDIKKSNMVGQDKWTRYEQSIAPQPDA"
     CDS             join(133..1570,2483..3327)
                     /gene="EGFI"
                     /note="alternatively spliced product not containing
                     EGF-like repeats 10-17."
                     /codon_start=1
                     /product="fibropellin Ib"
                     /db_xref="PID:g161466"
                     /translation="MRTWLLAVLLLSVIAVTYGQGECDSDPCENGSTCQEGEGSYICQ
                     CPMGYDGQNCDRFTGSNCGYNVFDANGMIDSPNYPAMYNNRADCLYLVRITKARSITF
                     TIEDFMTEVFKDVVEYGIGPEADFNQALGSFEGNLTQDDVIPAPFTVQGDQAWFIFST
                     DRNIVNRGFRITFSSDGDDCDPNLCQNGAACTDLVNDYACTCPPGFTGRNCEIDIDEC
                     ASDPCQNGGACVDGVNGYVCNCVPGFDGDECENNINECASSPCLNGGICVDGVNMFEC
                     TCLAGFTGVRCEVNIDECASAPCQNGGICIDGINGYTCSCPLGFSGDNCENNDDECSS
                     IPCLNGGTCVDLVNAYMCVCAPGWTGPTCADNIDECASAPCQNGGVCIDGVNGYMCDC
                     QPGYTGTHCETDIDECARPPCQNGGDCVDGVNGYVCICAPGFDGLNCENNIDECASRP
                     CQNGAVCVDGVNGFVCTCSAGYTGVLCETDIDECASAPCQNGGVCTDTINGYICACVP
                     GFTGSNCETNIDECASDPCLNGGICVDGVNGFVCQCPPNYSGTYCEISLDACRSMPCQ
                     NGATCVNVGADYVCECVPGYAGQNCEIDINECASLPCQNGGLCIDGIAGYTCQCRLGY
                     IGVNCEEVGFCDLEGMWYNECNDQVTITKTSTGMMLGDYMTYNERALGYAAPTVVVGY
                     ASNNYDFPSFGFTVVRDNGQSTTSWTGQCHLCDGEEVLYTTWINTNMVSTCQDIKKSN
                     MVGQDKWTRYEQSIAPQPDA"
BASE COUNT      822 a    770 c    891 g    879 t
ORIGIN      
        1 ccttggtata ttgtggacta cagcgcttga agcaggttcg tttgttggac attgttcagg
       61 tgccggtttc ctcatccatc acgctcctta ccagtgactt ttgttttctt cgctggaaaa
      121 gaacgcttca aaatgaggac gtggttacta gctgtattgc ttctcagcgt gatagctgtt
      181 acatacgggc aaggtgaatg tgacagcgat ccctgtgaaa atggatcaac ctgtcaggag
      241 ggtgaagggt cgtatatctg ccagtgtccc atgggatacg atggacaaaa ctgcgaccgt
      301 ttcacaggtt caaactgcgg atacaatgtc ttcgatgcca acggtatgat cgattcacct
      361 aactacccgg ccatgtacaa caaccgtgcc gattgtcttt atcttgttcg tatcaccaag
      421 gctcgcagca tcactttcac aatcgaagac ttcatgactg aggtcttcaa agacgttgtc
      481 gagtatggta ttgggccaga ggcagacttc aaccaggctc tcggttcgtt cgaaggtaac
      541 ctgacacaag acgacgtcat cccagctcct ttcactgtcc agggcgatca ggcttggttc
      601 attttcagta ctgatcgtaa tatcgtcaac aggggattca gaattacatt ctcatcagat
      661 ggagacgatt gtgatcccaa cctttgtcag aatggcgctg cctgtactga cctcgtgaat
      721 gattatgctt gtacctgccc tccaggattc acgggtagaa actgcgaaat cgatattgac
      781 gaatgtgcca gtgatccctg tcagaatggt ggcgcctgtg tcgatggagt caacggctat
      841 gtctgtaact gtgtcccagg attcgacgga gatgaatgtg aaaacaatat caatgagtgt
      901 gcaagcagcc cttgtcttaa cggaggaatc tgtgttgatg gcgttaacat gttcgagtgt
      961 acctgtttag ccggcttcac tggcgtacga tgtgaagtca acattgatga atgtgcaagt
     1021 gccccttgtc agaatggtgg tatctgtatt gatggtatca atggatacac ctgctcatgt
     1081 ccgctcggct tctctggaga taactgtgaa aacaatgatg atgaatgctc cagcatccct
     1141 tgtttaaatg gtggaacctg tgtggatctt gttaacgcct acatgtgtgt ctgtgccccc
     1201 ggctggaccg gccctacctg cgctgacaac attgacgagt gtgctagtgc cccttgccag
     1261 aacggaggtg tgtgcattga cggtgtgaac ggatacatgt gtgactgtca acctggatac
     1321 accggaaccc attgcgaaac tgatatcgac gagtgcgcaa ggcccccttg ccaaaatgga
     1381 ggtgactgtg tggatggagt caacggatac gtctgcatct gcgctcctgg attcgacgga
     1441 ctcaactgcg agaacaatat tgacgaatgc gccagccgtc cctgccagaa cggagctgtc
     1501 tgcgttgatg gtgtaaacgg gttcgtctgc acctgctctg ctggctacac aggagtcctt
     1561 tgtgaaaccg atatcaacga atgtgctagc atgccttgtc tgaatggtgg tgtttgcacg
     1621 gacctagtga acgggtacat ctgcacatgc gcagcaggct tcgagggaac taattgcgag
     1681 acagacaccg acgaatgtgc ttcattccca tgtcaaaacg gagccacgtg tacagaccag
     1741 gttaatggat acgtgtgcac atgtgttcca ggatacacgg gagtcctctg cgaaacagat
     1801 attaacgaat gtgcctcatt tccttgtctg aatggaggta cttgtaacga tcaagtcaat
     1861 ggatacgtgt gcgtgtgcgc acaggatact tcggtgtcaa cctgtgaaac agatcgtgac
     1921 gagtgtgcat ctgccccatg tttgaatggt ggagcttgta tggacgtagt gaatggattt
     1981 gtatgtactt gcttacctgg atgggaggga accaattgtg aaatcaacac ggacgagtgt
     2041 gcaagctctc catgcatgaa tggtggtctc tgtgttgacc aggtcaatag ctacgtctgc
     2101 ttctgtctcc ctggtttcac tggcattcat tgcggaaccg aaattgacga gtgtgcaagc
     2161 agcccatgtc taaacggagg acagtgtatc gaccgagttg actcgtacga gtgcgtttgc
     2221 gctgctggct acactgctgt cagatgccaa atcaatatcg acgaatgtgc ttctgcccct
     2281 tgtcaaaatg gcggagtgtg tgttgatgga gttaatggtt acgtgtgtaa ttgtgcacca
     2341 ggctacactg gcgataactg tgaaactgaa atcgacgaat gtgcttccat gccttgtttg
     2401 aacggaggag cgtgcattga aatggttaac ggatacacct gtcagtgtgt agctggctac
     2461 actggggtta tttgcgagac tgatattgac gagtgtgcca gtgccccttg ccagaatggt
     2521 ggtgtgtgta ctgataccat taacggatat atctgtgcct gtgtgccagg attcaccgga
     2581 agcaactgcg agactaacat cgacgagtgt gctagcgacc cctgtctaaa tggaggtatc
     2641 tgtgtggatg gagtcaatgg tttcgtctgc cagtgccctc ccaactactc tggaacttat
     2701 tgtgaaatct cacttgatgc atgcaggagt atgccatgcc agaatggcgc cacgtgcgta
     2761 aacgttggag ccgactacgt ctgcgaatgc gtaccaggat atgctggaca aaactgtgaa
     2821 attgacatca acgagtgtgc tagtcttcca tgccaaaacg gcggtctatg tattgatggt
     2881 attgctggat acacctgtca gtgccgtcta ggatacatcg gtgtcaactg cgaggaagtt
     2941 ggtttctgcg acttggaggg tatgtggtac aacgagtgca atgatcaggt caccatcacc
     3001 aagacctcta caggaatgat gcttggagat tacatgactt acaatgaacg tgccctcgga
     3061 tacgcagccc caaccgtcgt ggtcggttac gccagcaaca actatgactt cccatctttc
     3121 ggtttcacgg tggtccgtga caatggtcag tctactacca gttggaccgg tcagtgccat
     3181 ctatgtgacg gtgaagaggt tctctacacc acctggatca acaccaacat ggtcagcacc
     3241 tgccaggaca tcaagaaatc aaacatggtt ggccaggaca aatggacacg ttatgaacag
     3301 agcatcgcac ctcagcccga tgcataggca atttaactac attaatattg taacatgaat
     3361 ac
//