I – History of DNA sequencing
DNA sequencing as an innovation was very much incremental. Several milestones had to be reached to get
to the technology we have today. Listing each one would be too long, and it is not the point of this essay, so
we’ll only name the most important ones.
An important figure in DNA sequencing is Frederick Sanger, a British biochemist who received two Nobel
Prizes: the first for “his work on the structure of proteins, especially that of insulin”, the second, he shared
with Walter Gilbert for “their contributions concerning the determination of base sequences in nucleic acids”
– that is to say, DNA sequencing. Sanger did not discover DNA sequencing methods immediately, he first had
to sequence a protein, insulin, then RNA, and finally DNA.
Sequencing of Insulin
Frederick Sanger obtained his Ph.D in 1943 in Cambridge, working on the metabolism of the amino acid
lysine in the animal body. The same year, he got his first job, a position in the Biochemistry department of
Cambridge offered by A.C. Chibnall. Chibnall was interested in insulin, of which he determined, with
colleagues, the amino acid composition of its bovine form. The structure of insulin, that is to say, the exact
order of these amino acids, was however still a mystery to biologists.
Chibnall put Sanger up to the task of investigating the end groups of the
protein, which would later lead to research on its structure. Sanger tried several
techniques and molecules before he managed, thanks to his connections, to get
his hands on dinitrofluorobenzene (DNFB), synthesised at the time for the war
effort. It worked, and he published his first paper. It was not the discovery of the
entire sequence yet, but it was a start.
Sanger later found insulin was made of two chains, that he called chain A et
chain B. With Hans Tuppy and later Ted Thompson, Austrian and Australian
biochemists, he gradually pieced together the complete structure of insulin from
overlapping sequences.
The only thing missing was the emplacement of the disulfide bonds
, Sanger located the disulfide
bridges, thus finally completing the structure of insulin.
Sanger therefore proved proteins are well-ordered molecules and that, by analogy, genes must be, too. For
his work, Sanger received his first Nobel Prize in Chemistry in 1958.
Sequencing of RNA
At the time, nucleic acids, that is to say, DNA and RNA, were still a mystery to scientists. Sanger only got
interested in RNA because DNA remained too difficult to sequence yet. We know now DNA and RNA are made
of pieces called nucleotides, of which there are five types: A (adenine), C (cytosine), G (guanine), T (thymine)
and U (Uracil). We know DNA is exclusively composed of A, C, G, and T, and mRNA