First Bacterial Genome Created Entirely with a Computer

All the genome sequences of organisms known throughout the world are stored in a database belonging to the National Center for Biotechnology Information in the United States. As of today, the database has an additional entry: Caulobacter ethensis-2.0. It is the world's first fully computer-generated genome of a living organism, developed by scientists at ETH Zurich. However, it must be emphasised that although the genome for C. ethensis-2.0 was physically produced in the form of a very large DNA molecule, a corresponding organism does not yet exist.

C. ethensis-2.0 is based on the genome of a well-studied and harmless freshwater bacterium, Caulobacter crescentus, which is a naturally occurring bacterium found in spring water, rivers and lakes around the globe. It does not cause any diseases. C. crescentus is also a model organism commonly used in research laboratories to study the life of bacteria. The genome of this bacterium contains 4,000 genes. Scientists previously demonstrated that only about 680 of these genes are crucial to the survival of the species in the lab. Bacteria with this minimal genome are viable under laboratory conditions.

Beat Christen, Professor of Experimental Systems Biology at ETH Zurich, and his brother, Matthias Christen, a chemist at ETH Zurich, took the minimal genome of C. crescentus as a starting point. They set out to chemically synthesise this genome from scratch, as a continuous ring-shaped chromosome. Such a task was previously seen as a true tour de force: The chemically synthesised bacterial genome presented eleven years ago by the American genetics pioneer Craig Venter was the result of ten years of work by 20 scientists, according to media reports. The cost of the project is said to have totalled 40 million dollars.

Rationalising the production process

While Venter's team made an exact copy of a natural genome, the researchers at ETH Zurich radically altered their genome using a computer algorithm. Their motivation was twofold: one, to make it much easier to produce genomes, and two, to address fundamental questions of biology.

To create a DNA molecule as large as a bacterial genome, scientists must proceed step by step. In the case of the Caulobacter genome, the scientists at ETH Zurich synthesised 236 genome segments, which they subsequently pieced together. "The synthesis of these segments is not always easy," explains Matthias Christen. "DNA molecules not only possess the ability to stick to other DNA molecules, but depending on the sequence, they can also twist themselves into loops and knots, which can hamper the production process or render manufacturing impossible," explains Matthias Christen.

Simplified DNA sequences

To synthesise the genome segments in the simplest possible way, and then piece together all segments in the most streamlined manner, the scientists radically simplified the genome sequence without modifying the actual genetic information (at the protein level). There is ample latitude for the simplification of genomes, because biology has built-in redundancies for storing genetic information. For example, for many protein components (amino acids), there are two, four or even more possibilities to write their information into DNA.

The algorithm developed by the scientists at ETH Zurich makes optimal use of this redundancy of the genetic code. Using this algorithm, the researchers computed the ideal DNA sequence for the synthesis and construction of the genome, which they ultimately utilised for their work.

As a result, the scientists seeded many small modifications into the minimal genome, which in their entirety are, however, impressive: more than a sixth of all of the 800,000 DNA letters in the artificial genome were replaced, compared to the "natural" minimal genome. "Through our algorithm, we have completely rewritten our genome into a new sequence of DNA letters that no longer resembles the original sequence. However, the biological function at the protein level remains the same," says Beat Christen.

Litmus test for genetics

The rewritten genome is also interesting from a biological perspective. "Our method is a litmus test to see whether we biologists have correctly understood genetics, and it allows us to highlight possible gaps in our knowledge," explains Beat Christen. Naturally, the rewritten genome can contain only information that the researchers have actually understood. Possible "hidden" additional information that is located in the DNA sequence, and has not yet been understood by scientists, would have been lost in the process of creating the new code.

For research purposes, the scientists produced strains of bacteria that contained both the naturally occurring Caulobacter genome and also segments of the new artificial genome. By turning off certain natural genes in these bacteria, the researchers were able to test the functions of the artificial genes. They tested each one of the artificial genes in a multistep process.

In these experiments, the researchers found out that only about 580 of the 680 artificial genes were functional. "With the knowledge we have gained, it will, however, be possible for us to improve our algorithm and develop a fully functional genome version 3.0," says Beat Christen.

Enormous potential for biotechnology

"Even though the current version of the genome is not yet perfect, our work nevertheless shows that biological systems are constructed in such a simple manner that in the future, we will be able to work out the design specifications on the computer according to our goals, and then build them," says Matthias Christen. And this can be accomplished in a comparatively straightforward way, as Beat Christen emphasises: "What took ten years with Craig Venter's approach, our small group achieved with our new technology within the time frame of one year with manufacturing costs of 120,000 Swiss francs."

"We believe that it will also soon be possible to produce functional bacterial cells with such a genome," says Beat Christen. Such a development would hold great potential. Among the possible future applications are synthetic microorganisms that could be utilised in biotechnology for the production of complex pharmaceutically active molecules or vitamins, for example. The technology can be employed universally for all microorganisms, not just Caulobacter. Another possibility would be the production of DNA vaccines.

"As promising as the research results and possible applications may be, they demand a profound discussion in society about the purposes for which this technology can be used and, at the same time, about how abuses can be prevented," says Beat Christen. It is still not clear when the first bacterium with an artificial genome will be produced - but it is now clear that it can and will be developed. "We must use the time we have for intensive discussions among scientists, and also in society as a whole. We stand ready to contribute to that discussion, with all of the know-how we possess."

Jonathan E. Venetz, Luca Del Medico, Alexander Wölfle, Philipp Schächle, Yves Bucher, Donat Appert, Flavia Tschan, Carlos E Flores-Tinoco, Mariëlle van Kooten, Rym Guennoun, Samuel Deutsch, Matthias Christen, Beat Christen.
Chemical synthesis rewriting of a bacterial genome to achieve design flexibility and biological functionality.
Proceedings of the National Academy of Sciences Apr 2019. doi: 10.1073/pnas.1818259116.

Most Popular Now

Artificial Intelligence Detects a New Cl…

Many mutations in DNA that contribute to disease are not in actual genes but instead lie in the 99% of the genome once considered "junk." Even though scientists have recently...

Essen University Medicine and Siemens He…

The Essen University Medicine, Germany's leading hospital company for digitalized medicine, and Siemens Healthineers, a world-leading medical technology company, plan to work together to develop the hospital of the future...

Final Cohort Selected For Pioneering Dig…

Propel@YH has announced the six start-ups to participate in its inaugural digital health accelerator programme, aimed at navigating the complex healthcare landscape and building an NHS-relevant business case. The Leeds-based...

New Kid on the Block: The Doctrina Acade…

The Doctrina Academy is a new service, developed by Doctrina, a video e-learning platform that reaches over 1,000,000 healthcare professionals (HCPs) from around the globe, helping them to switch ...

Efficacy and SilverCloud Health Supports…

A leading mental health service is using the digital mental health platform developed by SilverCloud Health to support a confidential NHS service for GPs in England. London-based Efficacy is offering...

Gloucestershire Hospitals to Deliver EPR…

Gloucestershire Hospitals NHS Foundation Trust is moving to a 'clinical wrap' strategy to deliver an electronic patient record (EPR) and achieve the highest levels of digital maturity within five-years. The...

Mobile Phone App Designed to Boost Physi…

Activity trackers and mobile phone apps are all the rage, but do they really help users increase and maintain physical activity? A new study has found that one mobile phone...

Application Period for an International …

The Master of Science (M.Sc.)in Medical Informatics (MMI) at European Campus Rottal-Inn (ECRI) in Pfarrkirchen - a branch of the Deggendorf University of Applied Sciences (THD - Technische Hochschule Deggendorf)...

NHS Prescribing is Back in the Spotlight…

Opinion Article by Dr Simon Hendricks, Product Innovation Manager and Clinical Strategy Lead, FDB (First Databank) Political will for better and safer prescribing in the NHS has fast gained momentum in...

Sectra to Provide Large Centralized Regi…

International medical imaging IT and cybersecurity company Sectra (STO: SECT B) has signed a five-year contract with North Tees and Hartlepool NHS Foundation Trust for the delivery of a regional...

IMI to Boost Patient Involvement in its …

The Innovative Medicines Initiative (IMI) is creating a 'pool' of patient experts to strengthen the role and voice of patients in IMI activities at both strategic and operational levels. IMI...