Optimizing multicopy chromosomal integration for stable high-performing strains.
Abstract
The copy number of genes in chromosomes can be modified by chromosomal integration to construct efficient microbial cell factories but the resulting genetic systems are prone to failure or instability from triggering homologous recombination in repetitive DNA sequences. Finding the optimal copy number of each gene in a pathway is also time and labor intensive. To overcome these challenges, we applied a multiple nonrepetitive coding sequence calculator that generates sets of coding DNA sequence (CDS) variants. A machine learning method was developed to calculate the optimal copy number combination of genes in a pathway. We obtained an engineered Yarrowia lipolytica strain for eicosapentaenoic acid biosynthesis in 6 months, producing the highest titer of 27.5 g l(-1) in a 50-liter bioreactor. Moreover, the lycopene production in Escherichia coli was also greatly improved. Importantly, all engineered strains of Y. lipolytica, E. coli and Saccharomyces cerevisiae constructed with nonrepetitive CDSs maintained genetic stability.