This web page was produced as an assignment for an undergraduate course at Davidson College.

My Favorite Yeast Genes:

Cox17 and the Non-annotated YLL007C

Introduction:

This web page investigates two yeast genes, Cox17, a previously annotated yeast gene, and YLL007C, a non-annotated yeast gene on the same chromosome. Both genes are located on the Crick strand of Chromosome XII in the S. Cerevisiae genome. The goal of this web page is to combine information from databases such as NCBI, NCBI Blast, Enterez, and PREDATOR to fully characterize the Cox17 yeast gene. In addition, this web page seeks to use those same databases to make inferences about the cellular role of the non-annotated YLL007C gene.

Comprehensive information about Cox17 and YLL007C can be found at the Saccharomyces Genome Database.

Characterization of Cox17

Cox17 Gene Ontology:

Molecular Function:

S. Cerevisiae Cox17 is a copper metallochaperone that transports copper from the cytosol to the mitochondrial inter-membrane space for delivery to cytochrom c oxidase. Cox17 transfers Cu to Sco1 as an intermediate step in Cu donation to cytochrome oxidase. Cytochrome c oxidase is the terminal complex of mitochondrial and bacterial respiratory chains. The cytochrome c oxidase utilizes the central iron of the Heme A prosthetic groups of cytochromes a and a3, each of which has an associated copper center. The synthesis of Heme A occurs in the inner mitochondrial membrane, however, copper must be imported from the cytoplasm. It is unlikely that copper ever exists as a free solute in yeast and it is known that external copper is imported into the cell via an ATP-dependent pump endoded by the CTR1 gene. It is believed that Cox17 is responsible for the delivery of this copper in the cytoplasm to the mitochondria (Glerum et al., 1996). Cox 17 transports this copper by binding as a binuclear cuprous-thiolate cluster.

Figure 1: Candidate models for the binuclear Cu(I)-thiolate cluster in Cox17, (Permission pending from Srinivasan et al. 1998).

Biological Process:

Cox17 is involved in cytochrome c oxidase biogenesis and intra-cellular copper ion transport. Cox17 has been shown to be essential for the expression of cytochrome oxidase in Saccharomyces cerevisiae (Beers et al. 1997).

Figure 2: The mechanism of importation of copper ions from the extra-cellular space to cytochrome c oxidase in the mitochondria. Cox17 binds the copper ions once they have entered the cytoplasm. Permission pending from www.genetics.unimelb.edu.au/.../ projAfig.html

Cellular Location:

Cox17 is a soluble protein found in the cytoplasm and intermembrace space of the mitochondria in yeast.

Other Useful Gene Information:

Comprehensive gene information about Cox17 can be found at Enterez Gene.

Gene ID for Cox17: 850651

Nucleotide Sequence:

ATGACTGAAACTGACAAGAAACAAGAACAAGAAAACCACGCGGAGTGCGAGGACAAACCT
AAGCCATGTTGCGTTTGTAAGCCAGAAAAGGAGGAGCGGGATACATGCATCTTATTCAAT
GGACAAGACTCTGAAAAATGCAAGGAATTCATTGAAAAGTACAAAGAGTGCATGAAGGGT
TATGGCTTCGAAGTTCCAAGTGCAAATTAG

Complete Chromosome XII Sequence: NC_001144

Cox17 Chromosomal Location:

Cox17 is located on the Crick strand of chromosome XII in S. Cerevisiae.

Figure 3: Chromosomal location of Cox17 on the Crick strand of Chromosome XII. Cox17 is flanked by two neighboring genes: PSR1, a plasma membrane Sodium Response 1 protein, and YLLOO7C, an non-annotated yeast gene. Figure obtained from Saccharomyces Genome Database.

Blastn Search:

A Blastn search was performed using the Cox17 genomic DNA sequence shown above. The results are seen below:

 

Figure 4: Blastn results for the genomic DNA sequence of Cox17. Figure obtained using the Blastn database.

The only sequences producing significant alignments with Cox17 genomic DNA were those within the yeast genome that actually contained the Cox17 sequence. For example, one of the hits was the genomic sequence for all of Chromosome XII.

Protein Information:

Cox17 is a low molecular weight protein found in both yeast an mammalian cells (Beers et al., 1997). The Cox17 ORF in yeast encodes an 8057 Dalton polypeptide (Srinivasan et al. 1998). The yeast Cox17 contains seven cysteinal residues, six of which are conserved in the human Cox17 protein.

Cox17 Amino Acid Sequence:

MTETDKKQEQENHAECEDKPKPCCVCKPEKEERDTCILFNGQDSEKCKEFIEKYKECMKG
YGFEVPSAN

NCBI Protein Accession Number: NP_013092

Blastp Searches:

Using the Cox17 amino acid sequence shown above, a search of the Blastp database yielded the following results:

Figure 5: Blastp results for the Cox17 protein sequence. Figure was obtained using the Blastp database.

Most of the significant alignments between the S. Cerevisiae Cox17 protein and other proteins are either unnamed or hypotheitcal protein products of various fungal species. The Blastp results also indicated that the Cox17 protein contained only one conserved domain:

Figure 6: NCBI conserved domain search for the Cox17 protein. Figured obtained from NCBI Conserved Domain.

The Cox17 domain is conserved in 24 eukaryotic species and is known to be involved in copper transport from the cytosol to the inter-membrane space of the mitochondria in all of these speices.

In order to determine if there were any more broadly related proteins, a PSI-Blast analysis of the Cox17 yeast protein was performed. PSI-Blast yields the following dendogram:

Figure 7: Dedogram generated using PSI Blast analysis of the Cox17 protein. PSI blast identifies broadly related protein families. This figure was obtained from the Saccharomyce Genome Database under the information for Cox17.

This indicates that Cox17 is most closely related to other yeast and fungal proteins, showing some similarity with other unclassified/unknown sequences.

Mutations in Cox17:

Yeast cells that harbor a mutant Cox17 gene are respiratory-deficient but are still able to synthesize mitochondially and nuclearly encoded cytochrome oxidase units. This respiratory defect can be partially rescued by high exogenous Cu(II) levels, providing further evidence that Cox17 in involved with copper ion delivery to the mitochondria. A study of the effects of various mutations in the Cox17 gene revealed that three of the seven cysteinyl residues are critical for Cox17 function, and these three residues are present in a Cys-Cys-Xaa-Cys sequence motif. Single substitution of any of the three cysteines with serines results in a nonfunctional cytochrome oxidase complex. Mutant proteins that lack only one of the critical Cys residues retain the ability to bind three Cu(I) ions and are imported within the mitochondria. In contrast, Cox17 molecules with a double Cys-Ser mutation exhibit no Cu(I) binding but are still localized to the mitochondria (Heaton et al. 2000).

Homologs of Cox17

There are several known homologs of yeast Cox17 in various other organisms. Some of their protein accession numbers are listed below:

Human: NP_005685

Mouse: P56394

Rat: NP_445992

Arabidopsis thaliana: NP_566508

Cox17 in Mammalian Cells:

A study was completed by Amaravadi et al. in 1997 to determine if Cox17 is present in mammalian cells. To make this determination, the group used a yeast straicontaining a null mutation in Cox17 and transformed this with a human cDNA expresion library. All of the clones obtained from the transformations contained a common cDNA sequence homologous to yeast Cox17. This cloning of the mammalian Cox17 homolog suggests that this protein plays a role in copper recruitment in alleukaryotic cells. This gene is also a target being investigated in cytochrome oxidase definciencies in humans.

Cox23:

Another yeast protein, Cox23, has been shown to be required for the expression of cytochrome oxidase. Cox 23 is homologous to Cox17 and null mutants can be rescued by high exogenous copper concentrations when the mutant contains Cox17 on a high copy plasmid. Like Cox17, Cox23 is found in the intermembrane space of the mitochondria, suggesting that they function in a common pathway (Barros et al. 2004).

Characterization of YLL007C

YLL007C is considered to be a hypothetical ORF on chromosome XII of S. Cerevisiae.

YLL007C Gene Ontology:

Molecular Function:

According to information provided by the Saccharomyces Genome Database, YLL007C has no known molecular function.

Cellular Process:

Similarly, the cellular processes of the YLL007C gene are also unknown.

Cellular Location:

According to the SGD, YLL007C is a cytoplasmic protein.

Other Useful Gene Information:

Comprehensive Gene Information about YLL007C can be found at Enterez Gene

Genomic DNA Sequence of YLL007C :

ATGAAGCATAACAGGCAGATGCCAGATGAAAGCTTGGAAAACATAAAAGTGCTACTGAAT
CCTAAATTAGGAAAGCCAGTCAAGTCACTTACGAGTGCACAATCAAAAGCTTGCTATCAT
ACGCTGATCAGTAATAAAAATTTGAACAAGACAAGTGATGAATACGAAAAGTTATTGGCA
AACTATATCCTCCTTTGTGATGAAAAATACCTATGTAAAACCGTAATTCCAGATTCCCGA
TTTTGGGCTATATTGTGTGATAATTGCCAAAAGCTAAGGTCAGAAACGTTGGTAGCAAAT
CTCATTAGGATTTTTAATGTAGCCCTTAAATGCCAAGATTCTAATAAGAATGAAGTTATT
GTGAGCATCTGCCACATCTCCAGAGAGAACTCTCAGCTCATTGGAATACTTTTACAACTT
TTGTCCCAAAGACCTATTCATATACCGTTGTTTACGGACACTATACTATGCATAACCCTA
TTTTTAAAATGCTCACTTACATTATGTGAGACATCACTATCACACGCCGTAGAATTTGTA
CCCAGAATCCTCATTTTGCTATTTCAGTATAACTTTCCAGCTTCCATGTCTGAATTACTT
TACATAGAAGACTTGCAACCACTAATCCTTGAAGAATTTGTACCGTTAAAGCAGAGGCTA
ATAAATTTTCTATCTTCGGTTAGTATTGATGATTATTCCTGTTCGCTTAAAGCTGATCTT
CTTACAGCCATCAAAGATAACTCCGTATTCCAAAAAGGGTTAGAAATGGAAATGGGCGAT
CTTCCCAGTATAAATTTATTAAATGCGTATGATACATTCACATTCTTAAACAGTCCTAAT
GGGTCTTTCAAGAGACTTTACACTGAACAATTGTTGTTTGGCGAGAACGATTTCCCTCTT
TATGAAGCAATATTTAAACTTTCAGACCAATTTAGAAGACTTTTCAATCTAAGTGGGAAA
AAAGAGAACCAATACTCAGACTCAGAGCGTGATTTAAAATTACAAATTGCTACCGCTGTT
CTCAATCGACAAACATGTTTTTACAAAACACTCGAGTTGTTTTTGAGATTTTGGATAGAA
TCATTAGCAAAATCGCAAAGTGATTTGGTTTCACTGTTAAATTTAGCTATTATCACTTTA
AAATATGTTTGTTTGTCATCCAGTGATTTGGAAGCAGCAATACAGACGAAATCCCTTCTC
AAGACCCAAGTTGTTGCACTTGACTCTATGAGATATAAATTTGCAAGAACATTACAATTA
GACTCAATCAAAAAAGAACAGTATCGAACCTGGTCATCTAGTATCGCTAGCTTCGATACC
ATGCTATCAGGCCAGGTTCGTGATTACGTTCGTCATCAAAGACTTCTTCAATTGCAAAAG
GGAACGTGGGTTTACGCAGAGAACCCACTGAATCCTGAAGCTGGCACGCCTAAAGTCTAC
TTCCTCATAGTTTCAGATAACCACGCAAATTTATTAGCCAGAGAATTCGAAACACAAACA
AACGATTTACCTTATCTATTCGACAATAAAATTTTAACAAGTCCAGGTTCTGAGGCATTA
GCTAATGGCAGAACAAAAGTTGTTGTTTTGAAACATATAACCTCTTTCAAAAGCATAGAA
CTTACTACACCGAGTCGTCGGACGAGCAGTAATGTTTACATCAAATTAGATGAAGCCAAT
GTGTACACTGGAGTTGAATTAAAAGATCGTAACGACAGGACAGTTTTGAAATTTTATTTA
GACACAGAAGAGGGAAGATATATATGGCTAGACGGATTGAAATTGATATCTCCATTCCAG
CATGAAGATATATCAGAAGACACGAAAGAACAAATAGACACTTTATTTGATTTGAGGAAA
AATGTTCAGATGATAAATTTGAACGTTCGTCAGGACATCATAGTGCCTCCACCTGAACCA
AGCGACGAGGACGAAGATGAAGAATTTTATAACTTGGAAACGTTAAAGAAAGTAACCCAA
AATTTTTATTTTGATTAA

Chromosomal Location:

YLL007C is located on the Crick strand of Chromosome XII. See Figure 3 above.

Blastn Search:

Using the genomic DNA sequence for YLL007C listed above, I performed a Blastn search. The results can be seen below:

Figure 8 : Blastn results for the genomic DNA sequence of YLL007C. This figure was obtained using the Blastn database.

Just as with the sequence for the Cox17 gene, the only significant matches were those within the yeast genome. Once again, these hits contained overlapping regions of S. Cerevisiae Chromosome XII.

Protein Information:

YLL007C Amino Acid Sequence:

1 mkhnrqmpde slenikvlln pklgkpvksl tsaqskacyh tlisnknlnk tsdeyeklla
61 nyillcdeky lcktvipdsr fwailcdncq klrsetlvan lirifnvalk cqdsnknevi
121 vsichisren sqligillql lsqrpihipl ftdtilcitl flkcsltlce tslshavefv
181 prilillfqy nfpasmsell yiedlqplil eefvplkqrl inflssvsid dyscslkadl
241 ltaikdnsvf qkglememgd lpsinllnay dtftflnspn gsfkrlyteq llfgendfpl
301 yeaifklsdq frrlfnlsgk kenqysdser dlklqiatav lnrqtcfykt lelflrfwie
361 slaksqsdlv sllnlaiitl kyvclsssdl eaaiqtksll ktqvvaldsm rykfartlql
421 dsikkeqyrt wsssiasfdt mlsgqvrdyv rhqrllqlqk gtwvyaenpl npeagtpkvy
481 flivsdnhan llarefetqt ndlpylfdnk iltspgseal angrtkvvvl khitsfksie
541 lttpsrrtss nvyikldean vytgvelkdr ndrtvlkfyl dteegryiwl dglklispfq
601 hedisedtke qidtlfdlrk nvqminlnvr qdiivpppep sdededeefy nletlkkvtq
661 nfyfd

Protein Accession Number: NP_013094

Blastp Searches:

With the amino acid sequence for YLL007C listed above, I performed a Blastp search:

 

Figure 9 : Blastp results for YLL007C amino acid sequence. This figure was obtained using the Blastp database.

The only significant protein similarities that I found were unnamed or hypothetical proteins that had no known functions. With little information provdided from the Blastp results, I decided to perform a conserved domain search for this protein. The results of this conserved domain search can be seen below:

Figure 10: Results of the Conserved Domain search using the amino acid sequence from YLL007C. This figured was obtained from NCBI's Conserved Domain Database.

Unfortunately, there were no hits found. Next, I decided to investigate the PSI Blast data for YLL007C on the Saccharomyces Genome Database website.

PSI Blast Results:

Figure 11: Results obtained from PSI Blast analysis of the YLL007C protein. This figure was obtained from the Saccharomyces Genome Database.

Although some of the proteins showed only 65.3% alignment, the PSI Blast data indicate that yeast YLL007C has some similarity to the human and mouse engulfment and cell motility isoform 1 protein.

To gain some more insight into the structural identity of the YLL007C gene, I entered the amino acid sequence into the PREDATOR database:

Figure 12: PREDATOR DATA generated using the YLL007C amino acid sequence. This figure was obtained from the PREDATOR database.

Predator data indicated that the protein is largely composed of alpha helices and random coils. To further investigate the structure of this protein, I entered the amino acid sequence into a Kyte-Doolittle Hydropathy Plot to identify regions of hydrophobicity. The results of the Hydropathy Plot can be seen below:

Figure 13: Data from the Kyte-Doolittle Hydropathy Plot analysis of the YLL007C amino acid sequence. This figure was generated using the Kyte-Doolittle Hydropathy Plot.

The Kyte Doolittle Data indicates that this protein is likely a transmembrane protein, as three separate regions of the protein cross the hydropathy score of 1.8.

Mutations in YLL007C:

A study conducted by Giaever G, et al. in 2002 indicated that mutant yeast with a systemic deletion of YLL007C were still viable. However, mutations that result in null YLL007C alleles along with null alleles of pac10, gim4, or gim5 resulted in a slow growth phenotype (Tong et al. 2004).

Another study using genetic footprinting analysis of the S. Cerevisiae genome used the Ty1 transposon as a mutagenizing agent. The tranposon was inserted into various ORF's throughout the genome as a means of rendering the ORF non-functional. After mutagenesis, each cell within the population contained approximately one ORF with an inserted transposon. Portions of the mutagenized population were subjected to five alternative (non-mutagenizing) growth situations: 20 doublings in YPD (YPD20), 60 doublings in YPD (YPD60); 20 doublings in minimal medium (YNB), 20 doublings in YP plus 2% Lactate (LAC), and 20 doublings in YP plus 0.9M NaCl (NACL). The proportion of cells carrying Ty insertions in a given gene that remained in the population were measured and this proportion was considered to be a measure of that gene's contribution to fitness in each growth environment. Mutations involving the YLL007C ORF resulted in several apparent "severe growth defects" after 20 generations (Dunn et. al 2004).

Homologs of YLL007C:

In order to determine if there were any known homologs of YLL007C, I searched NCBI's Homologene Database. The results of this search can be seen below:

Figure 15: Homologene results for S. Cerevisiae YLL007C. This figured was obtained from NCBI's Homologene Database.

All that is reported about the only homologous gene that was found is that it is a syntenic homolog of Saccharomyces cerevisiae YLL007C.

Conclusions:

From the data I have gathered, it is difficult to accurately predict a cellular role for the YLL007C yeast gene. Based upon Kyte-Doolittle Hydropathy Plot Analysis, it is highly likely that YLL007C is a transmembrance protein. Since homoolgy and sequence data yielded no conclusive results, I must base my prediction solely on studies involving mutant forms of the YLL007C gene. I hypothesize that the S. Cerevisiae YLL007C gene is involved in cellular growth, thus potentially playing a role in the regulation of the cell cycle.

References:

Barros MH, et al. (2004) COX23, a homologue of COX17, is required for cytochrome oxidase assembly. J Biol Chem 279(30):31943-7.

Beers J, et al. (1997) Purification, characterization, and localization of yeast Cox17p, a mitochondrial copper shuttle. J Biol Chem 272(52):33191-6.

Heaton D, et al. (2000) Mutational analysis of the mitochondrial copper metallochaperone Cox17. J Biol Chem 275(48):37582-7.

Giaever G. et al. 2002. Functional profiling of the Saccharomyces cerevisiae genome. Nature 418: 387-91.

Glerum DM, et al. 1996. Characterization of COX17, a yeast gene involved in copper metabolism and assembly of cytochrome oxidase. J Biol Chem 271(24):14504-9.

Srinivasan C, et al. (1998) Characterization of the copper chaperone Cox17 of Saccharomyces cerevisiae. Biochemistry 37(20):7572-7

Tong, A.H., et. al. 2004. Global mapping of the yeast genetic interaction network. Science 303: 808-13.

Databases Used:

Kyte-Doolittle Hydropathy Plot. 2004. <http://occawlonline.pearsoned.com/bookbind/pubbooks/bc_mcampbell_genomics_1/medialib/activities/kd/kyte-doolittle.htm > Accessed 2004 6 Oct. Original reference: Kyte, J. and Doolittle, R. 1982. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157: 105-132.

NCBI. 2004. National Center for Biotechnology Information. <http://www.ncbi.nih.gov/> Accessed 2004 6 Oct.

PREDATOR Database 2004. <http://npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_preda.html> Accessed 2004 6 Oct.

Saccharomyces Genome Database. 2004. <http://www.yeastgenome.org/> Accessed 2004 6 Oct.

SwissProt. 2004. Swiss-prot protein knowledgebase. <http://us.expasy.org/sprot/> Accessed 2004 6 Oct.

 

 

Davidson College Home

Davidson College Biology Department

Genomics Course Home Page

Erin Cobain's Genomics Home Page

Email Questions or Comments to Erin Cobain