The origin of life is a chemical problem

Overview

One of the most persistent confusions in public discussions of evolution is the conflation of two distinct scientific questions: how life changes over time (biological evolution) and how life first arose from non-living matter (abiogenesis). These are separate research programs with different methods, different histories, and different states of evidence. Darwin's theory of evolution by natural selection, first published in 1859, describes the mechanism by which populations of organisms change across generations—it begins with life already in place.1 The question of how the first self-replicating, metabolizing systems emerged from prebiotic chemistry is a younger field, rooted in biochemistry, geochemistry, and synthetic chemistry rather than in population genetics or paleontology.2, 3 Importantly, the validity of evolutionary theory does not depend on solving the origin of life, just as the theory of gravity does not depend on knowing how the universe began. Nevertheless, the chemical origin of life is one of the most active and productive areas of modern science, and the progress made over the past seven decades has been remarkable.3

Abiogenesis and evolution are distinct

The distinction between abiogenesis and evolution is not merely semantic. Biological evolution operates on populations of organisms that already possess heritable genetic information, variation, and differential reproduction. It requires, at minimum, entities that can replicate with occasional errors and that face selective pressures from their environment.1 Abiogenesis, by contrast, asks how such entities came into existence in the first place—how chemistry became biology. The two fields share certain conceptual tools (notably, the idea that chemical selection can operate on populations of molecules before true biological evolution begins), but their core questions are fundamentally different.2, 4

The modern study of abiogenesis traces its intellectual origins to the 1920s, when the Russian biochemist Alexander Oparin and the British geneticist J. B. S. Haldane independently proposed that life could have arisen through chemical processes in a reducing atmosphere on the early Earth.5, 6 Oparin published his initial ideas in a 1924 pamphlet titled The Origin of Life, while Haldane articulated a similar hypothesis in a 1929 essay, coining the evocative phrase "hot dilute soup" to describe the primordial ocean in which organic molecules might have accumulated.5, 6 Their framework—that simple inorganic molecules could give rise to complex organic ones, which could in turn assemble into self-replicating systems—laid the conceptual groundwork for all subsequent experimental work on the origin of life.3

Prebiotic chemistry: building blocks from simple molecules

The experimental study of abiogenesis began in earnest in 1953, when Stanley Miller, a 23-year-old graduate student working under Nobel laureate Harold Urey at the University of Chicago, demonstrated that amino acids—the building blocks of proteins—could be synthesized from a mixture of simple gases. Miller sealed methane, ammonia, hydrogen, and water vapor in a closed glass apparatus, passed electrical sparks through the gas mixture to simulate lightning, and within a week detected several amino acids in the resulting solution, including glycine and alanine.7 The results, published in Science under the title "A Production of Amino Acids Under Possible Primitive Earth Conditions," electrified the scientific community and inaugurated the modern era of origin-of-life research.7, 3

Miller conducted additional experiments in the following years that were never published during his lifetime. After his death in 2007, his former student Jeffrey Bada and colleagues reanalyzed sealed vials from these unpublished experiments using modern analytical techniques including high-performance liquid chromatography and mass spectrometry. The 2008 reanalysis, published in Science, revealed that one experiment simulating volcanic spark discharge conditions had produced 22 amino acids and 5 amines—a far richer inventory of organic molecules than Miller had originally reported.8 These results demonstrated that volcanic environments on the early Earth, with their combination of reducing gases, water, and electrical energy, could have been especially productive sources of prebiotic organic molecules.8

Since Miller's pioneering work, prebiotic chemists have shown that virtually all the major classes of biological molecules can be synthesized under plausible early-Earth conditions. Juan Oró demonstrated in 1961 that the nucleobase adenine—a component of both DNA and RNA—could be produced from hydrogen cyanide and ammonia in aqueous solution.9 Subsequent work has generated sugars, lipids, and other nucleobases from simple precursors under various prebiotic scenarios.3 A landmark advance came in 2009, when John Sutherland and colleagues at the University of Manchester showed that activated pyrimidine ribonucleotides—building blocks of RNA—could be synthesized under prebiotically plausible conditions from cyanamide, cyanoacetylene, glycolaldehyde, glyceraldehyde, and inorganic phosphate, bypassing the long-standing assumption that nucleotides had to be assembled from pre-existing sugars and bases.10

Amino acids produced in Miller-Urey experiments (1953 and 2008 reanalysis)7, 8

Original (1953, published)
5
Original (1953, reanalyzed 2008)
14
Volcanic spark (unpublished, reanalyzed 2008)
22

The RNA world hypothesis

One of the central puzzles of the origin of life is the chicken-and-egg problem of information and catalysis. In modern cells, DNA stores genetic information and proteins carry out catalytic functions, but DNA requires proteins to replicate and proteins require DNA to encode their sequences. Which came first? The RNA world hypothesis, first articulated by Carl Woese in 1967, Francis Crick in 1968, and Leslie Orgel in 1968, proposes an elegant resolution: RNA came first, because RNA molecules can serve as both information carriers and catalysts.11, 12

The discovery that clinched the plausibility of this idea came in the early 1980s, when Thomas Cech and Sidney Altman independently discovered ribozymes—RNA molecules that catalyze chemical reactions. Cech showed that a ribosomal RNA intron in the ciliate Tetrahymena could splice itself out of its precursor without any protein assistance, while Altman demonstrated that the RNA component of the enzyme RNase P was responsible for its catalytic activity.13, 14 Both received the Nobel Prize in Chemistry in 1989 for these discoveries. Subsequent work revealed that the ribosome itself—the molecular machine that translates genetic information into proteins in all living cells—is fundamentally a ribozyme, with its catalytic core composed of RNA rather than protein.15 This finding strongly suggests that RNA-catalyzed protein synthesis preceded the modern DNA-protein system.15

In 2009, Tracey Lincoln and Gerald Joyce at the Scripps Research Institute achieved a critical milestone by creating a pair of RNA enzymes that catalyze each other's synthesis from a total of four short RNA substrates. These cross-replicating ribozymes underwent self-sustained exponential amplification—with a doubling time of approximately one hour—and could continue replicating indefinitely without any protein involvement.16 When populations of different cross-replicating enzymes were allowed to compete for shared substrates, recombinant replicators arose and came to dominate the population, demonstrating a rudimentary form of Darwinian evolution at the molecular level.16

More recent work has continued to strengthen the RNA world framework. In 2024, researchers at the Salk Institute reported an RNA enzyme capable of making accurate copies of other functional RNA strands while allowing new variants to emerge over time, providing further evidence that the earliest forms of evolution may have operated on populations of RNA molecules before the advent of DNA or proteins.17

Protocells: membranes and compartments

A self-replicating molecule in open solution faces a fundamental problem: any useful products it generates will diffuse away, benefiting competitors as much as the replicator itself. For natural selection to operate effectively, replicating molecules need to be enclosed in compartments—protocells—that keep their products nearby and create a unit of selection.18 Research by Jack Szostak and colleagues at Harvard (later the University of Chicago) has shown that simple fatty acid vesicles provide a remarkably plausible model for the earliest cell membranes.18, 19

Unlike the phospholipid membranes of modern cells, which are stable but relatively impermeable and require complex enzymatic machinery to assemble, fatty acid membranes form spontaneously when single-chain amphiphilic molecules are placed in water at appropriate concentrations. These vesicles are dynamic: fatty acid molecules enter and leave the membrane on timescales of seconds to minutes, making the membrane permeable to small molecules such as nucleotides—exactly the kind of permeability that a primitive genetic system would need to import building blocks from the environment.18, 19

Szostak's group demonstrated that fatty acid vesicles can grow by incorporating additional fatty acids from their surroundings, and can divide when subjected to mild physical forces such as passage through small pores—no protein machinery required.19 They further showed that encapsulated double-stranded DNA could be denatured by heating and reannealed upon cooling without destroying the vesicle, suggesting that thermal cycling in natural environments (such as near hydrothermal vents or in sun-warmed pools) could have driven primitive cycles of genetic replication inside protocells.18 In competitive experiments, protocells with higher internal osmotic pressure (from encapsulated RNA or other solutes) grew at the expense of neighboring vesicles with lower osmotic pressure, demonstrating a primitive analogue of Darwinian competition between protocells even before the evolution of complex metabolism.18

Hydrothermal vents and energy gradients

While the "warm little pond" envisioned by Darwin and elaborated by Oparin and Haldane remains one plausible setting for the origin of life, an alternative hypothesis has gained substantial support since the 1990s: that life began at alkaline hydrothermal vents on the ocean floor. This idea was pioneered by Michael Russell, who predicted the existence of low-temperature alkaline vent systems more than a decade before their discovery, arguing that the chemical and thermodynamic conditions at such sites were ideally suited to drive the emergence of life.20

The discovery of the Lost City hydrothermal field in 2000 provided dramatic confirmation of Russell's predictions. Located on the Mid-Atlantic Ridge, Lost City is powered by serpentinization—the exothermic reaction of ultramafic minerals from the upper mantle with seawater—which produces large volumes of hydrogen gas and warm (40–90°C), highly alkaline fluids.21 Where these alkaline fluids meet the acidic, CO2-rich Hadean ocean, steep gradients of pH, temperature, and reduction potential are established across thin mineral barriers within a microporous labyrinth of precipitated rock. Russell and William Martin proposed that these natural electrochemical gradients could have driven the reduction of CO2 to organic molecules on iron-nickel-sulfide mineral surfaces—essentially the same chemistry used by modern acetogens and methanogens, which are among the most ancient lineages of life on Earth.20, 22

The alkaline vent hypothesis is attractive for several reasons. It provides a continuous source of chemical energy (the proton gradient across vent walls) that closely resembles the chemiosmotic mechanism used by virtually all modern cells to generate ATP.22 It concentrates reactants within mineral pores that function as natural reaction vessels. And it connects the origin of life directly to the geological processes of the early Earth, avoiding the need for special or exotic conditions.20 The hypothesis also aligns well with phylogenomic reconstructions of the last universal common ancestor (LUCA), which suggest that LUCA was an anaerobic organism that used the Wood-Ljungdahl (acetyl-CoA) pathway of carbon fixation—precisely the metabolism predicted by the vent hypothesis.22, 23

The last universal common ancestor

While the origin of life itself left no fossil record, the biology of the last universal common ancestor—LUCA, the organism from which all extant life descends—can be partially reconstructed by comparing the genomes of bacteria and archaea. LUCA is not the first living organism; it is simply the most recent ancestor shared by all life on Earth today, separated from the actual origin of life by an unknown span of evolution. Nevertheless, understanding LUCA helps constrain what the earliest forms of life may have looked like.23

A comprehensive 2024 study by Edmund Moody and colleagues, published in Nature Ecology & Evolution, used phylogenetic reconciliation across 9,365 protein families to reconstruct LUCA's genome and estimate its age. The analysis placed LUCA at approximately 4.2 billion years ago (confidence interval: 4.09–4.33 Ga), implying that life originated remarkably early in Earth's 4.54-billion-year history—perhaps within the first few hundred million years after the planet's formation.23 The reconstructed LUCA genome contained an estimated 2,657 protein-coding genes across approximately 2.5 megabases, comparable in complexity to a modern prokaryote.23 LUCA appears to have been a complex anaerobic acetogen that already possessed an early form of the CRISPR-Cas immune system, indicating that it lived within a pre-existing ecosystem of other organisms—including viruses—rather than being a solitary pioneer.23

Independent lines of geological evidence corroborate this early date. Biologically fractionated carbon isotopes (in which organisms preferentially incorporate the lighter carbon-12 isotope over carbon-13) have been detected in graphite inclusions within 4.1-billion-year-old zircon crystals from the Jack Hills of Western Australia, though the biogenicity of this signal remains debated.24 Less controversial are the stromatolites—layered structures produced by microbial communities—found in the 3.48-billion-year-old Dresser Formation of the Pilbara Craton in Western Australia, which represent the oldest widely accepted direct evidence of life.25

Timeline of key evidence for early life on Earth23, 24, 25

Age (Ga) Evidence Location Status
~4.2 LUCA (genomic reconstruction) Global (phylogenetic inference) Modeled
~4.1 Biogenic carbon isotopes in zircon Jack Hills, Western Australia Debated
~3.7 Possible stromatolites Isua, Greenland Debated
~3.48 Stromatolites in geyserite Pilbara, Western Australia Accepted
~3.4 Microfossils and carbon isotopes Strelley Pool, Western Australia Accepted

The current state of research

Origin-of-life research in the 2020s is characterized by an increasingly integrated approach that draws on synthetic chemistry, geochemistry, biophysics, and computational biology. No single "complete" scenario for the origin of life has yet been established, and honest researchers acknowledge that major gaps remain—particularly in understanding how the transition from prebiotic chemistry to self-sustaining Darwinian evolution actually occurred.3, 4 Nevertheless, the field has moved far beyond speculation. Laboratory experiments have demonstrated plausible prebiotic routes to all four classes of biological macromolecules (nucleic acids, proteins, lipids, and carbohydrates), shown that RNA can both store information and catalyze its own replication, and constructed protocells capable of growth, division, and rudimentary competition.7, 10, 16, 18

Several major research programs are currently converging. Sutherland's group and others have developed "systems chemistry" approaches that show how networks of prebiotically plausible reactions can produce nucleotides, amino acids, and lipid precursors from common feedstock molecules, suggesting that the building blocks of life may have been generated together rather than independently.10 The hydrothermal vent hypothesis continues to gain support from both laboratory simulations and phylogenomic evidence about the metabolism of LUCA.20, 23 And protocell research is bridging the gap between replicating chemistry and cellular biology, showing how simple physical and chemical processes can generate the compartmentalization, replication, and competition that are prerequisites for natural selection.18, 19

What is clear is that the origin of life is a chemical problem with chemical answers. Every step in the transition from simple molecules to living systems—the formation of organic building blocks, the emergence of self-replicating polymers, the assembly of membrane-bounded compartments, the coupling of information to function—involves processes that can be studied, tested, and reproduced in the laboratory. The question is no longer whether chemistry can produce the components of life, but how these components came together in the specific conditions of the early Earth to cross the threshold into biology.3, 4

expand_less

References

1

On the Origin of Species by Means of Natural Selection

Darwin, C. · John Murray, 1859

open_in_new
2

The origin of life: what we know, what we can know and what we will never know

Luisi, P. L. · Open Biology 4(4): 140049, 2014

open_in_new
3

The 1953 Stanley L. Miller Experiment: Fifty Years of Prebiotic Organic Chemistry

Bada, J. L. & Lazcano, A. · Origins of Life and Evolution of Biospheres 33: 235–242, 2003

open_in_new
4

Abiogenesis as a theoretical challenge: some reflections

Peretó, J. · Journal of Theoretical Biology 434: 1–4, 2017

open_in_new
5

Pioneers of Origin of Life Studies—Darwin, Oparin, Haldane, Miller, Oró—And the Oldest Known Records of Life

Allwood, A. C. et al. · Life 14(10): 1307, 2024

open_in_new
6

Oparin-Haldane theory

Encyclopaedia Britannica, 2024

open_in_new
7

A Production of Amino Acids Under Possible Primitive Earth Conditions

Miller, S. L. · Science 117(3046): 528–529, 1953

open_in_new
8

The Miller Volcanic Spark Discharge Experiment

Johnson, A. P. et al. · Science 322(5900): 404, 2008

open_in_new
9

Mechanism of Synthesis of Adenine from Hydrogen Cyanide under Possible Primitive Earth Conditions

Oró, J. · Nature 191: 1193–1194, 1961

open_in_new
10

Synthesis of activated pyrimidine ribonucleotides in prebiotically plausible conditions

Powner, M. W., Gerland, B. & Sutherland, J. D. · Nature 459: 239–242, 2009

open_in_new
11

The Genetic Code: The Molecular Basis for Genetic Expression

Woese, C. R. · Harper & Row, 1967

open_in_new
12

The origin of the genetic code

Crick, F. H. C. · Journal of Molecular Biology 38(3): 367–379, 1968

open_in_new
13

Self-splicing RNA: autoexcision and autocyclization of the ribosomal RNA intervening sequence of Tetrahymena

Kruger, K. et al. · Cell 31(1): 147–157, 1982

open_in_new
14

The RNA moiety of ribonuclease P is the catalytic subunit of the enzyme

Guerrier-Takada, C. et al. · Cell 35(3): 849–857, 1983

open_in_new
15

The structural basis of ribosome activity in peptide bond synthesis

Nissen, P. et al. · Science 289(5481): 920–930, 2000

open_in_new
16

Self-sustained replication of an RNA enzyme

Lincoln, T. A. & Joyce, G. F. · Science 323(5918): 1229–1232, 2009

open_in_new
17

Modeling the origins of life: New evidence for an “RNA World”

Salk Institute for Biological Studies, 2024

open_in_new
18

The Origins of Cellular Life

Szostak, J. W., Bartel, D. P. & Luisi, P. L. · Nature 409: 387–390, 2001

open_in_new
19

From Self-Assembled Vesicles to Protocells

Zhu, T. F. & Szostak, J. W. · Journal of the American Chemical Society 131(15): 5705–5713, 2009

open_in_new
20

The Origin of Life in Alkaline Hydrothermal Vents

Sojo, V. et al. · Astrobiology 16(2): 181–197, 2016

open_in_new
21

Hydrothermal vents and the origin of life

Martin, W., Baross, J., Kelley, D. & Russell, M. J. · Nature Reviews Microbiology 6: 805–814, 2008

open_in_new
22

On the origin of biochemistry at an alkaline hydrothermal vent

Martin, W. & Russell, M. J. · Philosophical Transactions of the Royal Society B 362(1486): 1887–1926, 2007

open_in_new
23

The nature of the last universal common ancestor and its impact on the early Earth system

Moody, E. R. R. et al. · Nature Ecology & Evolution 8: 1654–1666, 2024

open_in_new
24

Earliest known life forms

Wikipedia, 2025

open_in_new
25

Earliest Evidence of Life

National Center for Science Education, 2018

open_in_new
arrow_upward