|
|
  
Darwinism: Science or Philosophy
Chapter6
Experimental Support for Regarding Functional Classes of Proteins to Be Highly Isolated
from Each Other
Michael J. Behe
[ Previous | Table of Contents | Next ]
Response to this paper.
Author's final comments.
IN WRITING ON THE TOPIC of naturalism and evolution the problem arises of what to call the
contending camps. The difficulty comes from the fact that although the term
evolutionist is often used to refer to persons who demand the unrelenting
application of physical laws to all phenomena in the universe, many other persons who are
opposed to this view are perfectly willing to concede that a limited number of phenomena
can be explained by Darwinistic principles. Similarly, although a term like creationist
brings to mind champions of a young-earth theory, it is often applied to persons who do
not defend that thesis but do contend that natural laws have at some point been superseded
by a supernatural agency
Since the focus of this symposium is the sufficiency of natural law, and in order to avoid
the confusing terminology discussed above, in this essay I will use the term believer
for those who believe in the universal application of natural law and the term
skeptic for those who doubt it. This has the advantage of using terms for each
side that the opposite side generally regards positively. Perhaps this will go a little
way toward promoting the good will that this conference strives for.
Introduction
Several years ago the fossilized remains of an extinct species of whale were unearthed in
the Zeuglodon valley of Egypt. The particular aspect of the fossil which excited
archaeologists and science writers was the fact that the whale apparently had functional
legs and feet From the condition of the fossilized leg bones it could be discerned by
trained eyes that the legs were well muscled and thus must have been actively used during
the life of the whale. A Washington Post story describing the discovery included
a drawing of both a modern whale and an ancient whale, showing the differences in their
shapes but the similarities in their lengths. Also included in the illustration, down in
the lower righthand corner, was a drawing of an animal that looked for all the world like
a scruffy dog. Underneath the dog was the caption "Mesonychid, the ancestor of the
whales," in the story it was explained that
Most researchers agree the earliest whales descended from a line
of large carnivorous beasts the size of wolves and bears. These furry land
mammals, known as mesonychids, ran around on four legs. But for unknown
reasons, some mesonychids evolved into forms that returned to the sea, from
which all life originally arose. The legs found on primitive whales are
remnants from their time on land (July 13, 1990).
Even allowing for the enthusiasms of the popular press, the story reflects the way in
which a theory, here evolution, is allowed to supply "facts" which the evidence in no way
justifies. I discussed this article with my students in a course I teach for freshmen,
entitled "Popular Arguments on Evolution." The course is intended to develop critical
reasoning skills, using popular books that have opposing viewpoints on evolution as the
vehicle. This past semester we read, side by side, Richard Dawkins's The Blind
Watchmaker and Michael Denton's Evolution: A Theory in Crisis. This forced
the students to argue over the meaning of observations, without the automatic social
support that usually goes to proponents of evolution in academic settings. The students
themselves, after reading the Post article, pointed out that there is no reason
to suppose that the ancient whale appeared on earth before the modern whale, since modem
whales have vestigial legs that could have developed into the functional legs of the
Zeuglodon whale. For the same reason, the students noted, the discovery does not represent
the development of a new trait or even the loss of an old one. Finally, most glaringly
obvious, if random evolution is true, there must have been a large number of transitional
forms between the Mesonychid and the ancient whale. Where are they? It seems like
quite a coincidence that of all the intermediate species that must have existed between
the Mesonychid and whale, only species that are very similar to the end species
have been found. The students concluded that the fossil whale, although a fascinating
discovery for natural history, was no evidence for the Post's evolutionary
scenario.
I have started my contribution to this symposium with a discussion of the Zeuglodon whale
because it is a paradigmatic example of evolutionary argumentation: a small change in a
preexisting structure is used to argue to massive changes involving completely new
structures or functions. It is like arguing that because a man can jump over a
fissure five-feet wide, then given enough time he could jump over the Grand Canyon. Now, a
believer in the unabating rule of natural law would argue that the man could jump over the
Grand Canyon if there were ledges and buttes for him to use as steppingstones. The skeptic
would ask to be shown the steppingstones.
This essay will examine how the search is going for steppingstones in one area of
biochemistry, that of protein structure. We will see that, without a prior commitment to
naturalism, there is little reason to suppose that steppingstones exist in the canyon
separating functional classes of proteins.
Protein Structure
I ask for the patience of those who already have a working knowledge of protein structure,
but in order to make sure that everyone reading this essay has the necessary background I
will spend a little time discussing some fundamentals.
Although most people think of proteins as something we eat-one of the major food groups-
when they reside in the body of an uneaten animal or plant, proteins serve a different
purpose. Proteins are the machinery of living tissue that builds the structures and
carries out the chemical reactions necessary for life. For example, the conversion of
foodstuffs to biologically usable forms of energy is carried out, step by step, by part of
a group of proteins called enzymes. Skin is made in large measure of a protein called
collagen. When light impinges on your retina it interacts first with a protein called
rhodopsin.
As can be seen even by this limited number of examples, proteins carry out amazingly
diverse functions. In general, however, a given protein can perform only one or a few
functions: rhodopsin cannot form skin, and collagen cannot interact usefully with light.
Therefore a typical cell contains thousands and thousands of different types of proteins
to perform the many tasks necessary for life, much like a carpenter's workshop might
contain many different kinds of tools for various carpentry tasks.
What do these versatile tools look like? The basic structure of proteins is quite simple:
they are formed by hooking together in a chain discrete subunits called amino acids. Now,
although the protein chain can consist of anywhere from about fifty to about one thousand
amino acid links, each position can contain only one of twenty different amino acids. In
this they are much like words: words can come in various lengths but they are made up from
a discrete set of twenty-six letters. As a matter of fact, biochemists often refer to each
amino acid by a single letter abbreviation: G for glycine, S for serine,
H for histidine, and so forth. Each different kind of amino acid has a different
shape and different chemical properties; for example, W is large but A
is small, R carries a positive charge but E carries a negative charge,
S prefers to be dissolved in water but I prefers oil, etc. A protein in
a cell does not float around like a floppy chain; rather, it folds up into a precise
structure that can be quite different for different types of proteins. This is done
automatically through interactions such as a positively charged amino acid trying to get
near a negatively charged one, oil-preferring amino acids trying to huddle together to
exclude water, large amino acids being excluded from small spaces, etc. When all is said
and done, two different amino acid sequences, two different proteins, can be folded to
structures as specific and different from each other as a three-eighths inch wrench and a
jigsaw. Like the household tools, if the shape of the proteins is significantly warped,
they fail to do their jobs.
Proteins and Language
Because amino acid residues are often abbreviated by letters, because there is a similar
number of letters and amino acids (twenty-six vs. twenty, respectively), and because a
small protein consists of about one hundred amino acids, many commentators have likened a
functional protein (i.e., one that has the correct shape to be able to do a particular
job) to a functional sentence (i.e., one that obeys the rules of English grammar) of about
one hundred letters. My students in "Popular Arguments on Evolution" found it interesting
that both believers and skeptics used this kind of analogy in their writings, but that
their reasonings brought them to opposite conclusions. The skeptic typically argues that a
monkey banging away at a typewriter (monkeys and typewriters are very popular) would be
unlikely to produce an intelligible, grammatically correct sentence like "Drop the anchor
in one hour" in a reasonable length of time. Near misses don't count for the skeptic since
the change of even one letter would break a spelling or grammar rule, or change the sense
of the sentence. Needless to say, the hour would most likely pass, and the anchor remain
undropped, before the monkey produced the correct sentence.
Believers in the universal application of physical law take a different approach with
their monkey and typewriter. Their argument generally goes something like this. Suppose in
his first try the monkey typed "bsqm dshcbbbk,RR .nsurlei aknex." Admittedly this
is poor grammar, but it's the only sentence we've got. Since living systems reproduce, and
since there is Darwinian competition, the bad sentence will be reproduced until a better
one comes along. Now suppose in his second try the monkey typed a p in the fourth
position and a u in the penultimate position. Well, since these are closer to the
target sentence we will throw out the original sentence and keep "bsqp dshcbbbk.RR
.nsurlei aknux." After a few more rounds perhaps the monkey has gotten a few more letters
correct, say, a d in the first position and a ch in the thirteenth and
fourteenth positions. Now we have "dsqp dshcbbbchRR .nsurlei aknux." Since this has more
matches with the target sentence we'll keep it and throw out the last sentence. After
perhaps fifty rounds we get to "dsop dhe abehRR in uneei hour." Breed from this. In
another fifty rounds or so we arrive triumphantly at our target "Drop the anchor in one
hour."
The above argument in its pure form can be convincing only to persons already convinced.
It asserts a functional difference between two nonsensical strings of letters. No person,
or machine for that matter, looking for a sentence would notice a difference between "bsqm
dshcbbbk,RR .nsurlei aknex" and "bsqp dshcbbbk,RR .nsurlei aknux." It is only because the
believer has a distant goal in mind that he or she chooses one nonsense character string
over the other. In the believers' argument the analogy of proteins to language is
implicitly abandoned in the first rounds of the monkey's typing, since the character
string does not have to obey any rules of spelling or grammar. The analogy to language is
used simply to try to impress the unwary with the apparent production of sense from
nonsense. My students in "Popular Arguments on Evolution" were uneasy with this argument
when they read it in Dawkins's book, but they could not refute it. It is not easy
for the casual reader to see that the illusion of steady, gradual evolution to a
functional sentence is produced by an intellect, either the believer's directly or in some
cases a computer program written by him, guiding the result to a distant goal. This of
course is the antithesis of Darwinian evolution.
But perhaps there is a middle ground between the skeptic's insistence on absolute
grammatical correctness and the believer's abandonment of grammatical rules. Suppose we
allowed the vowels in the sentence to vary to produce something like "Drep tha enchir on
une hoir." Such a sentence could probably still be recognized by someone, perhaps a
sailor, even though all the words are misspelled. Or, alternatively, suppose we vary some
consonants: "Trof tte ankhow im ode hous. Clearly some misspelled words would be easier to
recognize than others and some letter substitutions (t for d, k
for c) would be easier to follow than others (r for t,
l for g). The ability of a sentence like that to function would depend a
lot on the reader and the context.
To put this back into a protein context, it might be possible for a protein to tolerate a
lot of amino acid substitutions and remain functional. (Again, when talking about
proteins, functional means folded to a discrete, stable structure.) And in fact
it has been known for a long time that this is true. Analogous proteins from different
species- for example, human hemoglobin and horse hemoglobin-have differences between their
amino acid sequences, yet fold to discrete and closely similar structures.
But what is the limit to tolerance for amino acid changes? Are proteins significantly more
tolerant to changes in "spelling" than words are? Is there a point at which, like our
sentences above, further changes will render a protein nonfunctional? What then is the
probability of finding some member of a particular class in a reasonable time in
a nondirected search? These are empirical questions and, although they can be speculated
upon in the absence of relevant data, such speculations must be radically curtailed when
data are available. A direct approach to the question, ''How isolated are functional
protein sequences?" would have been experimentally impossible twenty years ago, before the
molecular biological revolution. But since the development of powerful tools to probe the
molecules of life, an answer to that question appears to be within reach. Progress in this
area is the topic of the following sections.
How Rare are Functional Proteins?
The observation that analogous proteins from different species could differ from each
other, often by quite a bit, and yet retain the same compact shape led workers in the
field to speculate that perhaps the exact identity of an amino acid at a particular
position in a protein was not so important as its overall chemical properties. So, for
example, if one finds an I at position 10 of hedgehog hemoglobin and an
L in position 10 of the analogous protein from skunk, then perhaps the imponant
feature is that both I and L prefer an oily environment, and maybe any
other amino acid, such as W, F, or V, that prefers a similar
environment would also be suitable at that position. This is something like
saying that in a language perhaps all of the vowels are interchangeable. Taking the idea
further, perhaps amino acids, such as S, A, H, and T.
that prefer a watery environment could form an interchangeable group, and perhaps charged
amino acids (E, D, R, and K) another group.
Fifteen years ago a man named Hubert Yockey published an article in the Jourrnal of
Theoretical Biology{1} showing that
these considerations could enormously reduce the odds against finding a functional protein
by trial and error. If we do not insist on the perfect diction of the typical skeptic, but
allow some slurred speech in proteins, then the probability of finding a small, functional
protein of one hundred amino acids in length is reduced from one in ten to the
130th power to one in ten to the 65th power-a reduction of sixty-five orders of magnitude!
Yockey went on to show in the article that his calculation of one in 1065, which he
obtained from theoretical considerations, fit very closely with the number that could be
calculated from considerations of the known sequence variability of the protein cytochrome
c among many different species,
Now, the problem with Yockey's calculation for a believer in the sufficiency of natural
law is that, although 1065 is enormously smaller than 10130, it still is quite a large
number. It has been calculated that there are about 1065 atoms in a galaxy. Thus, if
Yockey was correct, the odds of finding a functional protein are about the same as finding
one particular atom in the Milky Way. Not too likely. Well, if you were a believer, how
might you answer this challenge? One way is through obfuscation, like the production of
sentences from nonsense character strings, as was discussed above. A second way is by
claiming that Yockey's calculation is inaccurate and that the known sequences of
cytochrome c that he used to buttress his work do not reflect all the possible sequences
that could produce a folded protein. The best way, though, in the absence of relevant
data, is to produce your own calculation, starting from a separate set of independent
principles, and show that the odds are not quite so long as Yockey thought. This is what
has been done in an elegant series of calculations from the laboratory of Ken Dill{2} {3}at the
University of California at San Francisco.
Dill's laboratory asked a question that can be paraphrased as follows. Given a ten-by-ten
square matrix (like a big checkerboard) and a string of pearls containing both black beads
and white beads, in how many ways can a string of one hundred pearls be laid on the
checkerboard so that each square contains one and only one pearl, and most of the black
pearls are in the middle spaces of the board? This analogy is intended to represent a
folding protein comprised of two types of amino acids, ones that prefer watery
surroundings and ones that do not. After feeding this scenario into a computer, Dill's
group obtained the surprising result that it wasn't that hard to fit the pearl necklace on
the checkerboard in the right way. They then mathematically extrapolated from the two
dimensional checkerboard to three dimensional space, and finally arrived at the conclusion
that about one in 1010 amino acid sequences would yield a folded protein That is a much
smaller number than Yockey's (the federal government spends 1010 dollars, ten billion
dollars, every three days) and brings the spontaneous generation of functional proteins
into the realm of the credible.
The problem for a skeptic is how to refute Dill's calculation. It isn't easy, since few
people are as mathematically talented as he and since it's hard to disprove the
simplifying assumptions his model contains. Skeptics are free to criticize the
assumptions, but there is enough uncertainty in such things to allow believers to tout
Dill's calculation credibly over Yockey's. To resolve this dilemma, to gain firm ground to
stand on, hard experimental results are required. Fortunately in the past several years
such results have been forthcoming from the laboratory of Robert Sauer{4} {5} {6} in the department of biology at the Massachusetts
Institute of Technology. We now turn to those crucial experiments.
Functional Proteins Are Very Rare
In the past twenty years the science of molecular biology has made enormous strides. It is
now possible, in laboratories with such expertise, to cut up a gene, rearrange it
to suit yourself, and place it back in a functioning biological system. Since genes code
for proteins, one can also produce proteins made-to-order in this manner. Sauer's
laboratory, in order to answer questions about protein structure that interested them,
took the genes for several viral proteins, systematically took out small pieces of them
(corresponding to instructions for three amino acids at a time), and inserted altered
pieces back in the genes. They did this, three amino acids "codons" at a time, for the
whole length of the gene. By clever manipulation of the altered pieces they were able to
screen codons for all twenty amino acids at each position of the protein. This is like
trying all twenty-six letters of the alphabet in turn at each position of a word. The
altered genes were then placed in bacteria, which read the DNA code and produced chains of
amino acids from them. It turns out that bacteria quickly destroy proteins that
are not folded, so Sauer's group looked for the altered proteins that were not destroyed.
By determining their sequences they could tell which amino acids in a given position were
compatible with producing a folded, functional protein.
What did they see? In some positions of the protein, Sauer's group saw that a great deal
of amino acid diversity could be tolerated. Up to fifteen of the twenty amino acids could
occur at some positions and still yield a functional, folded protein. At other positions
in the amino acid sequence, however, very little diversity could be tolerated. Many
positions could accommodate only three or four different amino acids. Other positions had
an absolute requirement for a particular amino acid; this means that if, say, a P
does not appear at position 78 of a given protein, the protein will not fold
regardless of the proxirnity of the rest of the sequence to the natural protein.
In terms of our sentence analogy, this is like saying that, yes, all vowels are
interchangeable, but that if the last r is changed to any other letter, such as
s ("Drop the anchor in one hous"), the protein sentence is no longer
understandable.
Sauer's results can be used to calculate the probability of finding a given protein
structure.{6} We proceed in the following manner. If
any of ten amino acids can appear in the first position of a given functional protein
sequence, then the odds are one in 2 that a nondirected search will place one of the
allowed group there. If any of four amino acids can appear in the second position, then
the odds are one in 5 of finding one of that group, and the odds of finding the correct
amino acids next to each other in the first two positions are one-half times one-fifth,
which is one-tenth. Suppose in the third position there is an absolute requirement for G.
Then the odds of getting a G at that position are one in twenty and the odds of getting
the first three amino acids right are now up to one in two hundred. In this aspect it is
like winning a trifecta in horse racing. Over the course of one hundred amino acids in our
small protein, the odds quickly reach astronomical numbers.
From the actual experimental results of Sauer's group it can easily be calculated that the
odds of finding a folded protein are about one in 10 to the 65th power.{6} To put this fantastic number in perspective, imagine that someone
hid a grain of sand, marked with a tiny X, somewhere in the Sahara Desert. After wandering
blindfolded for several years in the desert you reach down, pick up a grain of sand, take
off your blindfold, and find it has a tiny X. Suspicious, you give the grain of sand to
someone to hide again, again you wander blindfolded into the desert, bend down, and the
grain you pick up again has an X. A third time you repeat this action and a third time you
find the marked grain. The odds of finding that marked grain of sand in the Sahara Desert
three times in a row are about the same as finding one new functional protein structure.
Rather than accept the result as a lucky coincidence, most people would be certain that
the game had been fixed.
The number of one in 1065, arrived at by Sauer's experimental route, is virtually
identical to the results obtained by Yockey's theoretical calculation and his deduction
from natural cytochrome c sequences! It therefore strongly reinforces our confidence that
a correct result has been obtained. Sauer's group obtained closely similar results for two
different proteins: arc repressor{4} and lambda
repressor.{5} {6}
This means that all proteins that have been examined to date, either experimentally or by
comparison of analogous sequences from different species, have been seen to be surrounded
by an almost infinitely wide chasm of unfolded, nonfunctional, useless protein sequences.
There are no ledges, no buttes, no steppingstones to cross the chasm.
The conclusion that a reasonable person draws from this is that the laws of nature are
insufficient to produce functional proteins and, therefore, functional proteins have not
been produced through a nondirected search.
Implications of Protein Sequence isolation
The numerical concreteness of Sauer's and Yockey's results is breathtaking. When a skeptic
sees a drawing of Mesonychid next to the Zeuglodon whale, he or she intuitively
realizes that the transformation is highly improbable. But how improbable? There is no way
to put a quantitative measure on the difference between a doglike animal and a whale, and
believers in the relentless application of physical law take advantage of this by verbally
minimizing the differences.
The situation is otherwise with proteins. Because there is a discrete set of amino acids
and a finite number of positions in a given protein, the odds of attaining a folded,
functional protein can be calculated quite closely, but only if the tolerance of proteins
to amino acid substitution is known. Thanks to Sauer and Yockey we now have such
quantitative data.
It is important to realize that Sauer's and Yockey's results hold whether or not the
system can replicate and is subject to Darwinian selection. The odds against
finding a new functional protein structure remain astronomical in either case. This is
because Darwinian selection can only discriminate based on function and, with the
exception of those found in living organisms, virtually all protein sequences are
functionless. An amino acid sequence can be replicated and mutated in living organisms
till the cows come home, and the odds are still one in 1065 that a new functional protein
class will be produced.
The problem of the isolation of functional protein sequences is a vivid illustration of
the truth of the symposium thesis,
Darwinism and neo-Darwinism as generally held and taught in our society carry with
them an a priori commitment to meta-physical naturalism, which is essential
to make a convincing case on their behalf.
The skeptic can accept Sauer's and Yockey's results with equanimity because his world is
not necessarily limited to those phenomena that can be explained by naturalism.
Furthermore, the skeptic can happily concede that many biological phenomena are
explained by natural laws. He can agree that beak shape and wing color can change
under selective pressure, or that different proteins in the same structural class, such as
the alpha and beta chains of hemoglobin, may have arisen through Darwinistic mechanisms.
But the believer in the universal application of physical law is stuck. He must maintain,
against the evidence, that different protein classes, like cytochromes and
immunoglobulins, found their way by raw luck through the vast, dark sea of nonfunctional
sequences to the tiny islands of function we observe experimentally. He must maintain,
without any evidence, that Mesonychid gave birth over time to the whale.
And why, we ask, must he maintain these positions against impossible odds and without
supporting evidence? Because, he replies, I can measure only material phenomena, and
therefore nothing else exists.
In closing I would like to paraphrase Hubert Yockey,{7} who in his career repeatedly pointed out facts that are not
supposed to be mentioned in polite scientific company: "Since science has not the vaguest
idea how [proteins] originated, it would only be honest to admit this to students, [to]
the agencies funding research, and [to] the public."
NOTES
{1} Yockey, H. P. (1978), "A Calculation of the
Probability of Spontaneous Biogenesis by information Theory," Journal of Theoretical
Biology 67:377-398.
{2} Lau, K. F., & Dill, K. A. (1989), "A Lattice
Statistical Mechanics Model of the Conformational and Sequence Spaces of Proteins,"
Macromolecules 22:3986-3994.
{3} Chan, H. S., & Dill, K. A. (1990),
"Origins of Structure in Globular Proteins," Proceedings of the Natural
Academy of Sciences USA 87:6388-6392.
{4} Bowie, J. U., & Sauer, R. T. (1989),
"Identifying Determinants of Folding and Activity for a Protein of Unknown Structure,"
Proceedings of the National Academy of Sciences USA 86:2152-2156.
{5} Bowie, J. U. Reidhaar-Olson, J. F., Lim, W. A., &
Sauer, R. T. (1990), "Deciphering the Message in Protein Sequences: Tolerance to Amino
Acid Substitution," Science 247:1306-1310.
{6} Reidhaar-Olson, J. F., & Sauer, R. T. (1990),
"Functionally Acceptable Substitutions in Two a-Helical Regions of l Repressor,"
Proteins: Structure, Function, and Genetics 7:306-316.
{7} Yockcy H. P. (1981), "Self Organization Origin of
Life Scenarios and information Theory," Journal of Theoretical Biology
91:13-31.
[ Previous | Table of Contents | Next ]
Copyright © 1994, 1997 Foundation for Thought and Ethics.
Copies of the book Darwinism: Science or Philosophy are available from:
Foundation for Thought and Ethics
P.O. Box 830721
Richardson, TX 75083-0721
Phone: 1-800-669-3410 or 972-661-1661
Fax: 972-661-2989
Email: fte-books@juno.com
Email this to a friend
copyright
© 1995-2008
Leadership U. All rights reserved.
Updated: 14 July 2002
|