Scientific paper: Sequences, segments, structures and interactions of proteins and nucleic acids
Background and scope
Frederick Sanger presents a reflective synthesis of the experimental strategies and conceptual lessons that emerged from efforts to determine the linear sequences of proteins and nucleic acids. The account traces how chemical and enzymatic approaches were adapted to break large biological polymers into analyzable fragments, how overlapping segmental data were assembled into full chains, and how sequence information began to illuminate structure, function and evolutionary relationships. Emphasis falls on practical problems of sample preparation, separation and identification, together with broader implications for molecular biology.
Approaches to protein sequencing
Protein sequencing advanced through selective derivatization of terminal residues, controlled fragmentation and meticulous separation of peptides. Techniques that label and remove N-terminal groups, combined with exhaustive chemical cleavage at specific amino acids, allowed ordered sets of short peptides to be generated. Overlapping fragments provided the means to reconstruct the primary sequence of a polypeptide chain, while chromatographic and electrophoretic methods served to resolve and characterize the component peptides. These methodological threads converged in the determination of many complete protein sequences, demonstrating the power of iterative chemical analysis and fragment assembly.
Strategies for nucleic acid sequencing
Sequencing nucleic acids required analogous principles adapted to the chemistries of ribo- and deoxyribonucleotides. Controlled enzymatic and chemical cleavage methods produced defined oligonucleotide fragments whose composition and order could be deduced by separation and degradation analyses. Mapping techniques using specific nucleases, partial hydrolysis and two-dimensional separation made it possible to generate overlapping segments and thereby infer longer sequence stretches. Early successes with small RNA species and defined viral genomes illustrated that the same logic of fragmentation plus overlap could be applied to nucleic acids, although different reagents and detection methods were needed.
Segments, structure and interaction
Sanger highlights that sequence fragments are not merely technical intermediate products but also carry structural and functional meaning. Recurrent motifs and conserved segments point to domains with particular chemical or binding properties, and the arrangement of segments governs higher-order folding and intermolecular contacts. Comparisons of protein and nucleic acid sequences reveal both shared organizational features, such as repeating units and conserved cores, and distinct constraints imposed by folding requirements and base-pairing interactions. Recognition of these patterns began to inform hypotheses about how sequence determines three-dimensional form and biological activity.
Practical challenges and outlook
Critical obstacles included achieving sufficient purity and quantity of starting material, obtaining resolvable fragment patterns, and interpreting complex mixtures of overlapping pieces. Improvements in chromatographic resolution, selective cleavage chemistries and sensitive detection were portrayed as essential drivers of progress. Looking forward, greater automation, more refined separation techniques and systematic mapping of sequence variation promised to accelerate accumulation of sequence data and deepen understanding of molecular mechanisms. The narrative conveys optimism that methodological refinement will increasingly connect linear sequence information to the architecture and interactions of macromolecules.
Legacy and significance
The synthesis underscores sequencing as a foundational activity that transforms biological polymers into decipherable information. By tracing methodological ingenuity and conceptual advances, the account situates sequence determination as central to the emerging molecular view of life and as a stimulus for new questions about structure, function and evolution. The work's pragmatic focus on how to obtain and interpret sequence data helped set priorities for subsequent technical developments and framed sequencing as a pathway to uncovering the rules that govern biomolecular behavior.
Frederick Sanger presents a reflective synthesis of the experimental strategies and conceptual lessons that emerged from efforts to determine the linear sequences of proteins and nucleic acids. The account traces how chemical and enzymatic approaches were adapted to break large biological polymers into analyzable fragments, how overlapping segmental data were assembled into full chains, and how sequence information began to illuminate structure, function and evolutionary relationships. Emphasis falls on practical problems of sample preparation, separation and identification, together with broader implications for molecular biology.
Approaches to protein sequencing
Protein sequencing advanced through selective derivatization of terminal residues, controlled fragmentation and meticulous separation of peptides. Techniques that label and remove N-terminal groups, combined with exhaustive chemical cleavage at specific amino acids, allowed ordered sets of short peptides to be generated. Overlapping fragments provided the means to reconstruct the primary sequence of a polypeptide chain, while chromatographic and electrophoretic methods served to resolve and characterize the component peptides. These methodological threads converged in the determination of many complete protein sequences, demonstrating the power of iterative chemical analysis and fragment assembly.
Strategies for nucleic acid sequencing
Sequencing nucleic acids required analogous principles adapted to the chemistries of ribo- and deoxyribonucleotides. Controlled enzymatic and chemical cleavage methods produced defined oligonucleotide fragments whose composition and order could be deduced by separation and degradation analyses. Mapping techniques using specific nucleases, partial hydrolysis and two-dimensional separation made it possible to generate overlapping segments and thereby infer longer sequence stretches. Early successes with small RNA species and defined viral genomes illustrated that the same logic of fragmentation plus overlap could be applied to nucleic acids, although different reagents and detection methods were needed.
Segments, structure and interaction
Sanger highlights that sequence fragments are not merely technical intermediate products but also carry structural and functional meaning. Recurrent motifs and conserved segments point to domains with particular chemical or binding properties, and the arrangement of segments governs higher-order folding and intermolecular contacts. Comparisons of protein and nucleic acid sequences reveal both shared organizational features, such as repeating units and conserved cores, and distinct constraints imposed by folding requirements and base-pairing interactions. Recognition of these patterns began to inform hypotheses about how sequence determines three-dimensional form and biological activity.
Practical challenges and outlook
Critical obstacles included achieving sufficient purity and quantity of starting material, obtaining resolvable fragment patterns, and interpreting complex mixtures of overlapping pieces. Improvements in chromatographic resolution, selective cleavage chemistries and sensitive detection were portrayed as essential drivers of progress. Looking forward, greater automation, more refined separation techniques and systematic mapping of sequence variation promised to accelerate accumulation of sequence data and deepen understanding of molecular mechanisms. The narrative conveys optimism that methodological refinement will increasingly connect linear sequence information to the architecture and interactions of macromolecules.
Legacy and significance
The synthesis underscores sequencing as a foundational activity that transforms biological polymers into decipherable information. By tracing methodological ingenuity and conceptual advances, the account situates sequence determination as central to the emerging molecular view of life and as a stimulus for new questions about structure, function and evolution. The work's pragmatic focus on how to obtain and interpret sequence data helped set priorities for subsequent technical developments and framed sequencing as a pathway to uncovering the rules that govern biomolecular behavior.
Sequences, segments, structures and interactions of proteins and nucleic acids
The paper summarizes achievements and experiences in sequencing both nucleic acids and protein sequences.
- Publication Year: 1969
- Type: Scientific paper
- Genre: Scientific
- Language: English
- Awards: Nobel Prize in Chemistry 1980
- View all works by Frederick Sanger on Amazon
Author: Frederick Sanger

More about Frederick Sanger
- Occup.: Scientist
- From: United Kingdom
- Other works:
- The arrangement of the amino acids in proteins (1957 Scientific paper)
- The insulin molecule (1960 Scientific paper)