GTC
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE ADVANCED SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Genes to Cells (2009) 14, 499-509. doi:10.1111/j.1365-2443.2009.01284.x
© 2009 Blackwell Publishing or its licensors

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hiraoka, Y.
Right arrow Articles by Chikashige, Y.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Hiraoka, Y.
Right arrow Articles by Chikashige, Y.

Codon usage bias is correlated with gene expression levels in the fission yeast Schizosaccharomyces pombe

Yasushi Hiraoka1,2,3, Kenichi Kawamata1, Tokuko Haraguchi1,2 and Yuji Chikashige1,2

1 Department of Biology, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, 560-0043, Japan
2 Kobe Advanced ICT Research Center, National Institute of Information and Communications Technology, 588-2 Iwaoka, Iwaoka-cho, Nishi-ku, Kobe 651-2492, Japan
3 Graduate School of Frontier Biosciences, Osaka University, 1-3 Yamadaoka, Suita, 565-0871, Japan


    Abstract
 Top
 Abstract
 Introduction
 Results
 Discussion
 Experimental procedures
 References
 
Usage of synonymous codons represents a characteristic pattern of preference in each organism. It has been inferred that such bias of codon usage has evolved as a result of adaptation for efficient synthesis of proteins. Here we examined synonymous codon usage in genes of the fission yeast Schizosaccharomyces pombe, and compared codon usage bias with expression levels of the gene. In this organism, synonymous codon usage bias was correlated with expression levels of the gene; the bias was most obvious in two-codon amino acids. A similar pattern of the codon usage bias was also observed in Saccharomyces cerevisiae, Arabidopsis thaliana and Caenorhabditis elegans, but was not obvious in Oryza sativa, Drosophila melanogaster, Takifugu rubripes and Homo sapiens. As codons of the highly expressed genes have greater influence on translational efficiency than codons of genes expressed at lower levels, it is likely that codon usage in the S. pombe genome has been optimized by translational selection through evolution.


    Introduction
 Top
 Abstract
 Introduction
 Results
 Discussion
 Experimental procedures
 References
 
In protein synthesis, a triplet codon of mRNA is translated to a respective amino acid. In 18 out of 20 amino acids (exceptions are Met and Trp), multiple codons are assigned for one amino acid in their translation. Such synonymous codons are not used equally, but are instead biased. One of the models explaining codon usage bias relies on selection for translational efficiency. It has been shown in some species that codon usage preference is correlated with the abundance of the respective tRNA (Ikemura 1985; Andersson & Kurland 1990; Sharp et al. 1993; Akashi 2001; Kanaya et al. 2001) which is expected to affect translational efficiency of the gene product (Akashi 2003).

Although the question of codon usage bias has been extensively studied in many organisms, including humans as well as human infectious viruses (e.g. Plotkin & Dushoff 2003; Meintjes & Rodrigo 2005; Ren et al. 2007; van Hemert et al. 2007), unicellular organisms can provide a simpler model system for examining whether codon usage bias is related to cellular metabolism. In Escherichia coli, clear evidence has been obtained for correlation between codon usage preference and tRNA abundance (Ikemura 1981a,b). Also, in the budding yeast Saccharomyces cerevisiae, it has been shown that codon usage bias is correlated with mRNA copy numbers of the gene (Sharp & Cowe 1991; Akashi 2003). Translational efficiency can be directly related to growth rates, and thus it is expected that codon usage has been optimized for growth through evolution in such situations.

Here we report codon usage in another unicellular eukaryote, the fission yeast Schizosaccharomyces pombe, in which the entire genome sequence has been determined (Wood et al. 2002). The usage of synonymous codons has been reported previously for three highly expressed genes and another 153 genes that are expressed at a lower level (Forsburg 1994). We examined codon usage for genes in the S. pombe genome, and found significant difference in codon usage bias with respect to expression levels of the gene. Thus, we propose that codon usage of the gene can predict its expression levels in S. pombe.


    Results
 Top
 Abstract
 Introduction
 Results
 Discussion
 Experimental procedures
 References
 
Codon usage profiles in Schizosaccharomyces pombe

Expression levels of 4932 open reading frames (ORFs) in the S. pombe genome were determined based on DNA microarray analysis (Fig. 1A). Among these ORFs, genes for ribosomal proteins showed high levels of expression (Fig. 1B). We first examined the usage of synonymous codons in the ribosomal protein genes, and found a significant difference compared with the genome average (Table 1). The difference can be seen clearly in two-codon amino acids (Lys, Asn, Phe, His, Glu, Cys, Tyr, Asp and Gln) where the usage bias of two codons for ribosomal proteins was opposite to that of total proteins (Fig. 2). In contrast, codon usage in meiotic proteins was similar to that in total proteins (Table 1; Fig. 2). The use of codons with G or C at the third position approximately corresponds to the GC content in the S. pombe genome (38%) in total proteins and in meiotic proteins, but significantly higher in ribosomal proteins, with an exception of Gln (Fig. 2).


Figure 1
View larger version (18K):
[in this window]
[in a new window]

 
Figure 1  Histogram of mRNA copy numbers. The majority of genes showed low levels of expression (A) whereas ribosomal proteins were highly expressed (B). Thus, average codon usage is representative of proteins with low expression levels.

 

View this table:
[in this window]
[in a new window]

 
Table 1  Codon usage
 

Figure 2
View larger version (36K):
[in this window]
[in a new window]

 
Figure 2  Codon usage in two-codon amino acids. Occurrence of G or C nucleotides at the third position of the codon was approximately equal to the average GC content in the S. pombe genome (38%; indicated by the broken line). This was also the case in meiotic proteins. In contrast, the G/C frequency was significantly higher (except for Gln) in ribosomal proteins.

 
Codon usage and gene expression in Schizosaccharomyces pombe

To examine whether the codon usage bias is related to levels of gene expression or not, we compared codon usage in selected genes at seven levels of expression. We found that the codon usage varied according to level of gene expression (Table 2, Figs 35), and that codon usage in ribosomal proteins was characteristic of that in highly-expressed genes.


View this table:
[in this window]
[in a new window]

 
Table 2  Codon usage and gene expression levels
 

Figure 3
View larger version (57K):
[in this window]
[in a new window]

 
Figure 3  Codon usage as a function of expression levels in two-codon amino acids.

 

Figure 5
View larger version (37K):
[in this window]
[in a new window]

 
Figure 5  Codon usage as a function of expression levels in six-codon amino acids.
Usage patterns of two-codon and four-codon components are shown separately in the upper and lower panels.

 
Correlation of codon usage with gene expression was particularly high in the two-codon amino acids (Fig. 3). Similar tendencies for correlation were also observed in a three-codon amino acid (Ile) and four-codon amino acids (Gly, Val, Ala, Thr and Pro) (Fig. 4). In Ile, the ATC codon showed codon usage increasing as a function of expression level whereas the ATT codon was used most frequently at all levels of gene expression. In Val, Ala, Thr and Pro, one of the four codons was most frequently used at all levels of gene expression although there was a tendency for a slight increase as a function of expression. One of the other three codons showed a significant increase in codon usage as a function of the level of gene expression. In contrast, in Gly, the most frequently used codon, GGT, showed a significant increase as a function of the expression.


Figure 4
View larger version (37K):
[in this window]
[in a new window]

 
Figure 4  Codon usage as a function of expression levels in three- and four-codon amino acids.

 
In six-codon amino acids, increasing usage as a function of gene expression was seen in codons CGT (Arg), TCG and TCC (Ser), and TTG, CTT and CTC (Leu) (Table 2). In these cases, six codons can be divided into two-codon and four-codon components depending on the first nucleotide of the codon. The usage of two-codon components showed a pattern similar to that in the two-codon amino acids (Fig. 5). Among four-codon components (Fig. 5), Ser and Leu showed a pattern similar to that in Val, Ala, Thr and Pro, in which one of the four codons was most frequently used at all levels of gene expression, and another codon showed an increase as a function of expression. In contrast, four-codon components in Arg showed a codon usage pattern similar to that in Gly, in which only one of the four codons increased as a function of expression. Taken together, these results showed that the usage of synonymous codons is correlated with expression levels of the gene in all amino acids in S. pombe.

Codon usage profiles in other organisms

The codon usage of ribosomal protein genes was compared to the genomic average in Saccharomyces cerevisiae, Arabidopsis thaliana, Caenorhabditis elegans, Oryza sativa, Drosophila melanogaster, Takifugu rubripes and Homo sapiens (Supporting Table S1). Codon usage of the ribosomal proteins in Saccharomyces cerevisiae, A. thaliana and C. elegans was significantly different to the genomic average, which is especially clear in two-codon amino acids. In contrast, no significant difference in codon usage was observed between the ribosomal protein genes and the genomic average in O. sativa, D. melanogaster, T. rubripes and H. sapiens (Fig. 6). Codon usage of two-codon amino acids in ribosomal proteins was plotted against that in total proteins. Plots were lined along the orthogonal line for O. sativa, D. melanogaster, T. rubripes and H. sapiens (Fig. 6A), indicating that these two groups of proteins have a similar usage of codons. In contrast, in Saccharomyces cerevisiae, A. thaliana and C. elegans, plots showed a scattered distribution from the orthogonal line (Fig. 6B), indicating that codon usage of ribosomal proteins differs from that of total proteins as in S. pombe. In contrast, a recent report shows that synonymous codon usage bias is only moderately correlated with expression levels of the gene in A. thaliana (dos Reis & Wernisch 2009). This apparent discrepancy may be because our results are based on only ribosomal protein genes. We examined ribosomal protein genes as representatives of highly-expressed genes to avoid possible complexity associated with tissue-specific expression levels in multi-cellular organisms.


Figure 6
View larger version (25K):
[in this window]
[in a new window]

 
Figure 6  Codon usage profiles in ribosomal proteins vs. total proteins. Codon usage of two-codon amino acids (vertical axis) is plotted against that of total proteins (horizontal axis). If their codon usage biases were close to each other, the plots were lined up along the orthogonal line.

 

    Discussion
 Top
 Abstract
 Introduction
 Results
 Discussion
 Experimental procedures
 References
 
We demonstrated that codon usage is correlated with expression levels of the genes in S. pombe. In S. pombe, the majority of genes express less than 32 copies of mRNA; only a small number of the genes (12%) express more than 32 copies of mRNA, but the total number of mRNA copies expressed from these genes occupies 50% of mRNA in the cell (Fig. 1A). Thus, the S. pombe genes can be divided into two groups that produce one half of mRNA copies in the cell: 12% of genes with high levels of expression (above 32 copies of mRNA) and the rest of genes with low levels of expression (below 32 copies of mRNA). Interestingly, codon usage preference switches at an expression level above and below 32 copies of mRNA. An implication of this result is that major codons are allocated for the group of genes with high expression levels to allow efficient translation of the mRNA, and that genes with low expression levels use minor codons to avoid competition with highly expressed genes. This usage allocation of codons may optimize overall translational efficiency to provide the best growth rate in S. pombe, and thus such codon usage bias may have been selected through evolution.

From these results, we formulated a set of coefficients for each codon to determine a type of codon usage of a gene of interest (Table 3). This formula calculates codon usage score which provides a criterion for judgment of two types of codon usage: high or low expression type (above or below 32 copies of mRNA, respectively) (see Experimental procedures). It should be mentioned that the formula does not predict expression levels of the gene, but instead predicts the types of codon usage. Examples for two S. pombe genes, cdc2 and act1, are shown in Table 3. The positive and negative values represent high- and low-expression types of codon usage, respectively.


View this table:
[in this window]
[in a new window]

 
Table 3  Prediction of the types of codon usage
 
This formula may also be used to judge a type of codon usage of a foreign gene introduced to S. pombe cells. There are caveats to the analysis of data from host cells that have been transfected with a foreign gene. Examples for two versions of GFP are shown in Table 3. Aequorea victoria GFP has codon usage similar to that in low expression genes in S. pombe whereas humanized EGFP has codon usage of the high-expression type. Introduction of an extremely biased codon usage may affect growth rates through translational efficiency. High translational efficiency of foreign genes may not always be appreciated in the host cell as it may compete with synthesis of ribosomal proteins. In choosing versions of GFP, translational efficiency might be optimized by matching the codon usage type with the target gene of fusion.

Mutations causing amino acid substitution can directly affect functions of the protein, and thus have been effectively selected through evolution. In contrast, mutations only changing codon usage may be less effectively selected through evolution. However, in unicellular organisms such as yeasts, translational efficiency can be directly related to growth rates, and thus optimization of codon usage is expected to occur as a consequence of evolutional pressures.


    Experimental procedures
 Top
 Abstract
 Introduction
 Results
 Discussion
 Experimental procedures
 References
 
Databases for codon usage and ribosomal proteins

Codon usages in the genome of the species examined were obtained from the web site "Codon Usage Database", http://www.kazusa.or.jp/codon/ (Nakamura et al. 2000). Genes for ribosomal proteins were obtained from the web site "Ribosomal Protein Gene Database", http://ribosome.med.miyazaki-u.ac.jp/ (Nakao et al. 2004).

Analysis of codon usage and gene expression in Schizosaccharomyces pombe

Levels of gene expression in S. pombe were determined using a DNA microarray containing 4932 ORFs (Chikashige et al. 2007). Relative amounts of mRNA for each ORF were measured by DNA microarray using genomic DNA as a reference, and the copy number of mRNA was calculated using an estimate of the total mRNA number in the cell as 100 000 copies. The entire DNA microarray data were deposited in gene expression omnibus (GEO, <http://www.ncbi.nlm.nih.gov/geo/index.cgi>; accession number GSE13554). Expression levels of the ORFs are available through the web site: <http://www2.nict.go.jp/w/w103/w131103/CellMagic/index.html>.

Thirty ORFs with the highest expression levels express an average of 256 copies of mRNA, and contain a total of 7326 codons. Other groups of ORFs that express an average of 128, 64, 32, 16, 8 and 4 copies of mRNA were selected such that each group contained at least 7000 codons or 20 ORFs (7263 codons of 34 ORFs, 7291 codons of 20 ORFs, 7234 codons of 20 ORFs, 12 182 codons of 20 ORFs, 10 964 codons of 20 ORFs, and 10 075 codons of 20 ORFs, respectively). For S. pombe ribosomal proteins, 141 ORFs containing 233 320 codons were selected. For S. pombe meiotic proteins, 18 ORFs containing 9039 codons were selected. These genes selected are listed in Supporting Table S2.

Analysis of codon usage in other organisms

Genes for ribosomal proteins were obtained in "Ribosomal Protein Gene Database", <http://ribosome.med.miyazaki-u.ac.jp/>, and codon usage of these genes was examined by the Countcodon program in "Codon Usage Database", <http://www.kazusa.or.jp/codon/>.

Formulation for codon usage score

Codon usage score (S) was defined as an average of expression coefficients over all codons contained in the gene of interest:



Formula 1

(1)

N is the number of codons in the gene, Ec is an expression coefficient calculated for each triplet codon as listed in Table 3, and the suffix r represents the array number of triplet codons from 1 to N. The expression coefficients were obtained by subtracting the codon usage of 32 copies of mRNA from that of 256 copies of mRNA for each triplet codon, and were normalized to make the SD32 value equal to unity where the SD32 is a standard deviation of the S-value for the group of genes expressing 32 copies of mRNA. Positive and negative signs of codon usage score indicate the high-expression type (above 32 copies) and low-expression type (below 32 copies) of codon usage, respectively; its absolute value indicates a fold multiple of SD32, which provides a measure for the likelihood of the prediction.


    Acknowledgements
 
We thank Chihiro Tsutsumi for her technical support in DNA microarray analysis. This work was supported by grants from the Ministry of Education, Culture, Sports, Science and Technology of Japan (MEXT) to YH, TH and CY.


    Footnotes
 
Communicated by: Masayuki Yamamoto (The University of Tokyo)

* Correspondence: hiraoka{at}fbs.osaka-u.ac.jp


    References
 Top
 Abstract
 Introduction
 Results
 Discussion
 Experimental procedures
 References
 
Akashi, H. (2001) Gene expression and molecular evolution. Curr. Opin. Genet. Dev. 11, 660–666.[CrossRef][Medline]

Akashi, H. (2003) Translational selection and yeast proteome evolution. Genetics 164, 1291–1303.[Abstract/Free Full Text]

Andersson, S.G. & Kurland, C.G. (1990) Codon preferences in free-living microorganisms. Microbiol. Rev. 54, 198–210.[Abstract/Free Full Text]

Chikashige, Y., Tsutsumi, C., Okamasa, K., Yamane, M., Nakayama, J., Niwa, O., Haraguchi, T. & Hiraoka, Y. (2007) Gene expression and distribution of Swi6 in partial aneuploids of the fission yeast Schizosaccharomyces pombe. Cell Struct. Funct. 32, 149–161.[CrossRef][Medline]

Forsburg, S.L. (1994) Codon usage table for Schizosaccharomyces pombe. Yeast 10, 1045–1047.[CrossRef][Medline]

van Hemert, F.J., Berkhout, B. & Lukashov, V.V. (2007) Host-related nucleotide composition and codon usage as driving forces in the recent evolution of the Astroviridae. Virology 361, 447–454.[CrossRef][Medline]

Ikemura, T. (1981a) Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes. J. Mol. Biol. 146, 1–21.[CrossRef][Medline]

Ikemura, T. (1981b) Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the E. coli translational system. J. Mol. Biol. 151, 389–409.[CrossRef][Medline]

Ikemura, T. (1985) Codon usage and tRNA content in unicellular and multicellular organisms. Mol. Biol. Evol. 2, 13–34.[Abstract]

Kanaya, S., Yamada, Y., Kinouchi, M., Kudo, Y. & Ikemura, T. (2001) Codon usage and tRNA genes in eukaryotes: correlation of codon usage diversity with translation efficiency and with CG-dinucleotide usage as assessed by multivariate analysis. J. Mol. Evol. 53, 290–298.[CrossRef][Medline]

Meintjes, P.L. & Rodrigo, A.G. (2005) Evolution of relative synonymous codon usage in Human Immunodeficiency Virus type-1. J. Bioinform. Comput. Biol. 3, 157–168.[CrossRef][Medline]

Nakamura, Y., Gojobori, T. & Ikemura, T. (2000) Codon usage tabulated from international DNA sequence databases: status for the year 2000. Nucleic Acids Res. 28, 292.[Abstract/Free Full Text]

Nakao, A., Yoshihama, M. & Kenmochi, N. (2004) RPG: the Ribosomal Protein Gene database. Nucleic Acids Res. 32, D168–D170.[Abstract/Free Full Text]

Plotkin, J.B. & Dushoff, J. (2003) Codon bias and frequency-dependent selection on the hemagglutinin epitopes of influenza A virus. Proc. Natl. Acad. Sci. USA 100, 7152–7157.[Abstract/Free Full Text]

dos Reis, M. & Wernisch, L. (2009) Estimating translational selection in Eukaryotic genomes. Mol. Biol. Evol. 26, 451–461.[Abstract/Free Full Text]

Ren, L., Gao, G., Zhao, D., Ding, M., Luo, J. & Deng, H. (2007) Developmental stage related patterns of codon usage and genomic GC content: searching for evolutionary fingerprints with models of stem cell differentiation. Genome. Biol. 8, R35.[CrossRef][Medline]

Sharp, P.M. & Cowe, E. (1991) Synonymous codon usage in Saccharomyces cerevisiae. Yeast 7, 657–658.[CrossRef][Medline]

Sharp, P.M., Stenico, M., Peden, J.F. & Lloyd, A.T. (1993) Codon usage: mutational bias, translational selection, or both? Biochem. Soc. Trans. 21, 835–841.[Medline]

Wood, V., Gwilliam, R., Rajandream, M.A. et al. (2002) The genome sequence of Schizosaccharomyces pombe. Nature 415, 871–880.[CrossRef][Medline]

Accepted: 15 January 2009





This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hiraoka, Y.
Right arrow Articles by Chikashige, Y.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Hiraoka, Y.
Right arrow Articles by Chikashige, Y.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE ADVANCED SEARCH TABLE OF CONTENTS