Protein
In 1838, the Swedish chemist Jöns Jacob Berzelius proposed a new name for substances that had been studied since the 1700s. Earlier researchers like Antoine Fourcroy called these materials albumins or albuminous materials. They observed gluten in wheat and various animal proteins around 1747. Berzelius derived the word protein from the Greek term meaning primary or standing in front. This naming followed work by Gerardus Johannes Mulder who analyzed common proteins. Mulder found nearly all proteins shared an empirical formula of C400H620N100O120P1S1. He incorrectly concluded they were single large molecules rather than chains.
A linear chain of amino acid residues forms what scientists call a polypeptide. The Dutch chemists Franz Hofmeister and Hermann Emil Fischer established this understanding in 1902. Each residue connects to its neighbor via peptide bonds between amino and carboxyl groups. These bonds create a backbone containing carbon nitrogen and oxygen atoms. The genetic code specifies twenty standard amino acids for most organisms. Some archaea include pyrrolysine while certain organisms add selenocysteine to their lists. Proteins always biosynthesize from the N-terminus toward the C-terminus. Short chains under thirty residues are usually termed peptides instead of proteins.
Biochemists describe four distinct aspects of protein organization starting with the primary structure. This level refers simply to the specific sequence of amino acids within the chain. Secondary structures emerge as regular local patterns stabilized by hydrogen bonds. Alpha helices and beta sheets represent the most common examples found in nature. Tertiary structure defines the overall three-dimensional shape of a single molecule. Hydrophobic cores and salt bridges often stabilize these complex folds. Quaternary structure describes complexes formed when multiple polypeptide subunits associate. A single protein may shift between related conformations during function. These transitions allow enzymes to bind substrates or transport molecules effectively.
Proteins assemble from amino acids using information encoded directly into genes. DNA sequences first transcribe into pre-messenger RNA via RNA polymerase. Most organisms process this transcript through post-transcriptional modifications before translation begins. Ribosomes read mRNA three nucleotides at a time matching codons to anticodons. Transfer RNA molecules carry specific amino acids corresponding to each recognized codon. The enzyme aminoacyl tRNA synthetase charges these transfer RNAs correctly. Prokaryotes synthesize proteins faster reaching up to twenty residues per second. Eukaryotic cells move mature mRNA across the nuclear membrane for cytoplasmic synthesis. Average protein size increases from Archaea to Bacteria to Eukaryote species.
Enzymes accelerate chemical reactions by as much as 10^17-fold compared to uncatalyzed rates. Orotate decarboxylase converts substrates in eighteen milliseconds without an enzyme taking seventy-eight million years. About four thousand known reactions rely on enzymatic catalysis within living systems. Only three to four residues typically participate directly in the active site of an enzyme. These enzymes manipulate DNA during replication repair and transcription processes. Some act on other proteins adding or removing chemical groups through modification. Dirigent proteins dictate stereochemistry of compounds synthesized by other enzymes. This acceleration allows metabolism to proceed rapidly enough to sustain life.
Collagen provides critical stiffness to connective tissue like cartilage and bones. Keratin forms hard structures including hair nails feathers hooves and animal shells. Elastin gives elasticity to blood vessels pulmonary tissue and bladder walls. Young's modulus measures relative stiffness where collagen reaches five to seven point five gigapascals. Keratin ranges between one point five and ten gigapascals depending on cross-linking. Globular proteins like bovine serum albumin show moduli around twenty-five kilopascals. Motor proteins such as myosin generate mechanical forces for muscle contraction. Actin and tubulin polymerize into stiff fibers maintaining cell shape and size.
X-ray crystallography solved the first protein structures in 1958 using hemoglobin and myoglobin. John Kendrew worked with Max Perutz to determine these complex three-dimensional arrangements. The Protein Data Bank now contains over one hundred eighty thousand X-ray structures. Cryo-electron microscopy uses frozen samples rather than crystals to analyze large assemblies. Electron beams cause less damage allowing scientists to obtain more information from fragile complexes. Nuclear magnetic resonance spectroscopy produces structural data at atomic resolution for smaller molecules. Computational methods predict molecular formations when experimental data remains unavailable. Distributed computing projects like Folding@home facilitate modeling of biological-scale systems.
Up Next
Common questions
Who named proteins in 1838 and what was the origin of the word?
Jöns Jacob Berzelius named proteins in 1838 by deriving the term from a Greek word meaning primary or standing in front. Earlier researchers like Antoine Fourcroy had called these materials albumins since the 1700s.
When did scientists establish that proteins are linear chains of amino acid residues?
Scientists established this understanding on the 2nd of May 1902 when Franz Hofmeister and Hermann Emil Fischer defined polypeptides as linear chains. Each residue connects to its neighbor via peptide bonds between amino and carboxyl groups.
What are the four levels of protein organization described by biochemists?
Biochemists describe primary structure as the specific sequence of amino acids within the chain. Secondary structures emerge as regular local patterns stabilized by hydrogen bonds such as alpha helices and beta sheets. Tertiary structure defines the overall three-dimensional shape while quaternary structure describes complexes formed when multiple polypeptide subunits associate.
How fast do prokaryotes synthesize proteins compared to eukaryotic cells?
Prokaryotes synthesize proteins faster reaching up to twenty residues per second. Eukaryotic cells move mature mRNA across the nuclear membrane for cytoplasmic synthesis after processing transcripts through post-transcriptional modifications.
Which enzyme converts substrates in eighteen milliseconds without catalysis taking seventy-eight million years?
Orotate decarboxylase converts substrates in eighteen milliseconds without an enzyme taking seventy-eight million years. Enzymes accelerate chemical reactions by as much as 10^17-fold compared to uncatalyzed rates.
When was the first protein structure solved using X-ray crystallography?
X-ray crystallography solved the first protein structures on the 2nd of May 1958 using hemoglobin and myoglobin. John Kendrew worked with Max Perutz to determine these complex three-dimensional arrangements.