Section 7

POPULATION GENETICS OF THE Lap LOCUS IN

WHITE CAMPION (Silene latifolia) POPULATIONS


BACKGROUND

How do biologists study evolution in natural populations? They employ several approaches, and we will use one of these approaches, protein electrophoresis, this week.

Remember that genes code for the amino acid sequences of proteins, and many of these proteins function as enzymes or serve as transporters, structural components, cell recognition factors, or hormones. A diploid organism carries two copies of each gene, one that derived from its mother, the other from the father. If the two copies of a gene (alleles) differ in their nucleotide sequence, this variation may result in the production of two proteins that differ in their amino acid sequences. These differences in amino acid sequence can translate into differences in the mass and/or charge of the proteins specified by the two alleles. Differences in protein mass or charge can cause the two proteins to migrate at different rates in an electric field. As such, we can use an electric field to separate an individual's proteins, and then stain for the particular protein that interests us. This process is called protein electrophoresis and is widely used by evolutionary ecologists and population geneticists to study genetic variation and genetic mechanisms in natural populations.

If a population contains more than one allele for a particular protein, then the proteins that correspond to each allele may move at different rates in the electrical field. Let us consider a population that contains two alleles at a locus for a particular enzyme. Some individuals will be homozygous for the allele that codes for the more rapidly migrating version of the protein, and their protein will stain as a single band that travels further in the electrical field than the protein of individuals who are homozygous for the other allele (Fig 1). Individuals who are heterozygous at this locus will display two protein bands (Fig 1).

Figure 1: Protein banding patterns for Lap-2 locus in the mollusk Dreissena polymorpha.. Each "lane" represents protein from one individual. From left to right, phenotypes are: lanes 1-3 homozygous for the faster migrating protein, lane 4 heterozygous for the fast and slow migrating proteins, lane 5 homozygous for the faster migrating protein, lane 6 heterozygous, lane 7 homozygous for the more slowly migrating protein, lane8 homozygous for the faster migrating protein.


The genes we investigate using this technique usually fall into the category of metabolic enzymes, and most of them are involved in cellular respiration or biosynthesis. In most cases we assume that the different alleles present at a locus are selectively neutral, i.e., we assume that the differences in protein structure that result from the presence of more than one allele at a particular locus do not translate into fitness differences among individuals. This assumption is based upon the observation that 1) the polymorphisms we find at these loci are common in natural populations and, 2) in populations where we know that non selective evolutionary mechanisms are either absent or minimal in their effects, the genotype frequencies for these loci remain in Hardy-Weinberg Equilibrium. As such, protein electrophoresis of the proteins specified by these loci can provide us with valuable information on mating patterns within populations, genetic drift, founder effects, and gene flow among populations.

In this lab segment, you will ask questions about genotype and allele frequencies for the locus that codes for leucyl amino peptidase (Lap), an enzyme that cleaves peptide bonds between leucine and other amino acids, in the plant white campion.

WHITE CAMPION

White campion (Silene latifolia) is a perennial weed in the carnation family (Fig 2). It is native to Europe and was introduced to North America during colonial times,where it has become naturalized in the northeastern part of the continent. Plants are either male or female, and sex is determined chromosomally in a manner similar to sex determination in mammals. The flowers are pollinated primarily by bees and moths. The seeds are about the size of the poppy seeds you find on your bagel, and we know that dispersal is extremely limited (McCauley, Steven, Peroni, and Raveill, 1996).

Figure 2: White campion stems and flowers



The following collections of seedlings or adults plants will be available to you for investigation during this lab.

Whittaker Population - A large population (> 500 plants) located in Eggleston, Giles Co., VA.; plants from the 1992, 1993*, 1994, 1995 seed crops. I also have sets of plants from the 1993 seed crop that represent the portion of that cohort that survived 2.5 yr of burial in the soil*.
Duncan - A large population (> 500 plants) located ~1 km from Whittaker. Seedlings from the 1994 seed crop are available.
Trailer - A small population (<50 plants), located ~ 10 km from Duncan and Whittaker populations in Giles Co., Va. Seedlings from the 1993 seed crop are available.

* I will need at least 24 hr notice if you plan to use these sets of plants.

BEFORE YOU COME TO LAB

Before lab, each group should:

1. Formulate a populations genetics question that the group can address using protein electrophoresis of the Lap locus for one or more of the white campion populations listed above. For example, your group might ask if non random mating occurs in one of these populations while another group could ask if populations from the same general vicinity experience considerable gene flow or operate as a discrete gene pools.

2. Establish research and null hypotheses. Hypotheses make predictions about your findings, and the null hypotheses always predicts that no real differences exist among groups or between observed results and those predicted by theories. For example, if your group asks if random mating occurs in a population, the your hypotheses would be as follows:



Specify the types of results that would lead your group to reject or accept its null hypothesis.

3. Determine the research design for the investigation. This process will include decisions on:

- sample size (40 individuals per population is good for this type of investigation). Note: Given time and equipment constraints, if you want to compare two or more populations, join forces with another lab group.
- sample selection (how will you pick the plants you want to use - haphazardly, systematically, or randomly? )
- sample processing (i.e., will you run all the individuals from onepopulation before you run the individuals from the other population?)

4. Prepare a 5 - 10 minute presentation of your proposed research which will be delivered by a spokes-person from the group. This presentation should clearly state:

- the question and the group's reasons for selecting this question
- the research and null hypotheses
- the research design
- the types of results that would lead to acceptance or rejection of the null hypothesis.

WEEK 1


DATA COLLECTION


This week, you will actually perform cellulose acetate electrophoresis and stain for the Lap enzyme. You will use the data you collect to test your hypotheses Note: the equipment we use for this procedure is very expensive and rather delicate. Please treat it with respect. Students will be billed for equipment damaged due to carelessness.

Protein electrophoresis includes 5 procedures:

1. Extraction of enzymes from the tissues (grinding)
2. Loading the samples onto the gel
3. Running the gel (separation of the enzymes in the electrical field)
4. Staining for the enzyme so we can visualize any polymorphisms for the protein
5. Determining the Lap genotypes of individuals based on their electrophoresis phenotypes (scoring the gel)

Extraction


Loading the gel

Keep your extracts (ground tissue samples) on ice.


Running the gel

Hold the loaded gel by its edges and take it to the electrode chamber.

Staining the gel

Line a gel box with plastic warp and place the gel, coated side up (plastic side down) into the box.


Put on latex gloves.


Scoring the gel



HOMEWORK


Calculations

Calculate the allele and genotype frequencies for each population you investigated. Using your allele frequencies calculate the genotype frequencies predicted by Hardy-Weinberg equilibrium for a population where no evolutionary mechanisms operate*. Bring your calculations to lab next week. At that time we will use the Chi-square statistical test to evaluate your hypotheses.


Let: freq allele 1 = p
freq allele 2 = q
freq allele 3 = r

HW predictions:
freq 1,1 genotype = p2
freq 2,2 genotype = q2
freq 3,3 genotype = r2
freq 1,2 genotype = 2pq
freq 1,3 genotype = 2pr
freq 2,3 genotype = 2qr





Bio 112/ Lap Lab - Data Sheet/ Davidson College

Group: ____________________________ Date: __________________

Gel #1 Gel #2

Lane # Population Genotype Lane # Population Genotype

1 1
2 2
3 3
4 4
5 5
6 Marker A (1,3) 6 Marker A (1,3)
7 Marker B (1,2) 7 Marker B (1,2)
8 8
9 9
10 10
11 11
12 12





Gel #3 Gel #4

Lane # Population Genotype Lane # Population Genotype

1 1
2 2
3 3
4 4
5 5
6 Marker A (1,3) 6 Marker A (1,3)
7 Marker B (1,2) 7 Marker B (1,2)
8 8
9 9
10 10
11 11
12 12



WEEK 2

DATA ANALYSIS
Graphic Representation of Data - Use DeltaGraph to prepare a figure that compares your observed and predicted results (e.g., your observed genotype frequencies with the Hardy Weinberg predictions).
Hypothesis Testing - For virtually every group, the observed genotypes will differ from the Hardy Weinberg predictions. What factors could contribute to these discrepancies?

1. Biased sampling
2. Poor methodology or scoring of gels
3. Operation of evolutionary mechanisms in your population (research hypothesis)
4. Chance (null hypothesis)

Careful planning and attention to detail can minimize the possibility that the first two factors contribute to differences between our observed and predicted values. As such, when we analyze our data, we must determine if discrepancies between observed and predicted variables represent deviations of our population from Hardy-Weinberg assumptions or simply the effects of chance.
We use inferential statistics to determine the probability that the deviations of our observed values from the theoretical predictions could result from chance. If it is very likely that a sample's deviation from Hardy Weinberg predictions resulted from chance alone, then we cannot reject our null hypothesis. In other words, we will only reject our null hypothesis in favor of our research hypothesis in cases were the probability that our deviations from Hardy Weinberg result from chance alone are very low. How low is low? We only reject our null hypothesis in cases where the probability that the deviation between our observed and predicted values results from chance is < 0.05.

So, how do we determine the probability that our sample's deviations from Hardy Weinberg predictions are due to chance? We calculate a test statistic which expresses the magnitude of the differences between our observed and predicted values. For variables such as genotype frequencies we use the Chi-square (X2) test statistic. We calculate our X2 test statistic using the following formula:

X2 = _[ (oi- ei)2 / ei]

Where: oi = the number observed for genotype category i
ei = the number expected for genotype category i, based on Hardy Weinberg predictions
_ = summation - The equation instructs you to calculate (oi- ei)2 / ei for each genotype category, and then sum these values for all the genotypes.

Now, let us examine the equation for X2 carefully. If our observations exactly match Hardy Weinberg expectations, then X2 will equal zero. But, if our observations differ greatly from Hardy Weinberg expectations, then X2 will be a large value. How large must X2 be in order for us to reject our null hypothesis? X2 must be sufficiently large enough so there is < 0.05 chance that we would get such a deviation of observed and expected values due to chance alone. How do we determine the probability (p) that any particular X2 value resulted from chance? We can use a published X2 table or instruct a spreadsheet or statistics software package to calculate the probability for us. In either case, we must calculate the degrees of freedom (abbreviated as df or v) associated with our sample. The degrees of freedom = the number of categories (in our case genotypes) minus the number of pieces of information in our data set that we used to calculate our expected values. In our case, we used the sample size and our estimates of the frequencies of two of the alleles in our populations in order to calculate the number of individuals of each genotype predicted by Hardy Weinberg (once we calculated the estimated frequencies of two of the three alleles we could determine frequency of the third allele by subtraction). As such, our degrees of freedom = 6 genotypes - 3 = 3.
We will use EXCEL, a spreadsheet software package to calculate X2. Consult Fig. 5 for a sample EXCEL data sheet. More detailed instructions regarding the use of EXCEL will be provided in lab.
We will use the CHIDIST function on Excel to determine the probability that any particular X2 value resulted from chance. To do so type:

= CHIDIST(X2,df)

Excel will return the probability. If the probability is < 0.05, then we reject the null hypothesis and conclude that our population probably violates at least one Hardy Weinberg assumption (i.e., at least one evolutionary mechanism operates on our Lap locus in this population).

If the probability (p value) associated with our X2 is > 0.05, then we cannot reject the null hypothesis. We conclude that we do not have enough evidence to argue that evolutionary mechanisms operate on the Lap locus in our population.

I will assist groups who need to compare genotype and allele frequencies for two or more populations.

ACKNOWLEDGEMENTS
Dr. David McCauley at Vanderbilt University inspired the development of the population genetics cellulose acetate electrophoresis lab. He uses this approach with fern and Drosophila populations in his teaching. Dr. McCauley and Dr. Jay Raveill developed the protocols for cellulose acetate electrophoresis of the Lap enzyme in white campion. Dr. Patricia Peroni developed the white campion Lap electrophoresis lab itself and the accompanying material on data analysis.
Fig. 1 was adapted from Hartl and Clark (1989); Fig 2 was copied from Radford, Ahles, and Bell (1968); and Fig 3 was copied from Hebert and Beaton (1993).

REFERENCES



© Copyright 2000 Department of Biology, Davidson College, Davidson, NC 28036
Send comments, questions, and suggestions to: macampbell@davidson.edu