An Investigation of Clathrin-Mediated Endocytosis through Three

An Investigation of Clathrin-Mediated
Endocytosis through Three-Color
Super-Resolution Microscopy
Olivia Waring
University of Tokyo Research Internship Program
Ozawa Laboratory
Abstract
Since the discovery of GFP, fluorescent proteins have revolutionized the field of
molecular imaging, allowing researchers to tag specific molecules with glowing reporters
and thereby track their progress through living cells. The study of subcellular components has also benefited greatly from the advent of super-resolution microscopy, which
exploits the blinking and photobleaching of single molecules to circumvent the diffraction limit of visible light. By combining fluorescent reporting with super-resolution
microscopy, we have at our disposal a powerful methodology for probing subcellular processes. The phenomenon of immediate interest to the Ozawa Lab is clathrin-mediated
endocytosis, through which the plasma membrane engulfs extracellular particles and
pinches off to form a vesicle. A number of proteins are implicated in this process:
most notably clathrin itself, which assembles on the cytosolic surface of the cell during
vesicle formation; but also transferrin receptor, a transmembrane protein that binds
selectively to iron-bearing transferrin; and dynamin, a GTPase thought to mediate
vesicle scission by mechanical twisting. In order to more precisely characterize the
interactions between these three proteins, the Ozawa group sought to tag each target
molecule with a fluorescent reporter and quantify co-localization via super-resolution
microscopy. During my six-week stay, I assisted in the preparation of three fluorescent systems: clathrin-light-chain cloned to PAmCherry; transferrin receptor cloned
to EYFP; and dynamin linked to AlexaFluor 647 via antibody conjugation. The two
plasmids were co-expressed in COS-7 cells, followed by immunostaining with the antibody construct. We first used confocal microscopy to validate the expression of the
target proteins and their associated fluorophores. We then observed the samples on
a total internal reflection microscope at a sampling rate of about 30 milliseconds per
frame. Once data had been acquired, images were processed using ImageJ software
with the Octane plugin. PALM analysis was performed, after which clustering and colocalization were qualitatively verified. In order to subject our data to more rigorous,
quantitative treatment, I proceeded to implement Ripley’s K function and a standard
pair correlation algorithm in MATLAB. These cluster analysis techniques confirmed
that localization had indeed occurred. Throughout the coming months, I plan to improve upon this software, rendering the code more robust, versatile, and conducive to
the Ozawa labs specific requirements.
1
1
1.1
Background
Fluorescence Microscopy
Fluorescent emission is essentially a three-step process: an electron is promoted to an excited
state by an incident photon; the electron relaxes to a lower energy state via non-radiative
decay; and finally, the electron returns to the ground state by emitting a photon of higher
wavelength than the incident photon (see Figure 1). Beginning in 1913 with the work of
Otto Heimstaedt and Heinrich Lehmann, the native fluorescence of certain organisms (such
as autofluorescing bacteria) has been harnessed as a novel visualization technique. In conventional microscopy, the specimens under study simply reflect incident light; in fluorescent
microscopy, however, the fluorescent samples themselves serve as the light source.
Figure 1: A schematic of the fluorescence mechanism
1.1.1
Green Fluorescent Protein: The Revolution
The discovery of Green Fluorescent Protein in 1961 (for which Osamu Shimomura deservedly
snagged the 2008 Nobel Prize) ushered in a new era in the field of microscopy, allowing
researchers to use GFP and related fluorescent proteins as exogenous labels for biological
imaging [1]. Martin Chalfie was the first to express GFP in E. coli, thus importing fluorescent
functionality into a non fluorescent organism. A few years later, Tulle Hazelrigg (who also
happens to be Chalfie’s wife) generated the first GFP fusion protein, which allowed her to
express a fluorescently-tagged exu molecule in Drosophilia melanogaster. This constituted a
significant departure from previous labeling techniques [2], all of which required “exogenously
added substrates or cofactors.” As Chalfie wrote in his groundbreaking 1994 Science paper,
“Because the detection of intracellular GFP requires only irradiation by near UV or blue light,
it is not limited by the availability of substrates. Thus, it should provide an excellent means
for monitoring gene expression and protein localization in living cells.” Chalfie’s predictions
have been borne out in laboratories worldwide ever since. The last decade and a half has seen
an explosive proliferation of fluorescent technology, with new colors, labeling techniques, and
2
applications being pioneered each year. There is even a species of transgenic glowing zebra
fish being sold in pet stores throughout Asia [2].
Figure 2: The barrel-shaped GFP molecule that revolutionized molecular imaging
These developments have given rise to a brand new technique for analyzing subcellular
phenomena: multicolor imaging. Provided their emission channels are spectrally distinct,
we can (in theory) obtain simultaneous images of an arbitrary number of fluorophores. This
provides a powerful tool for analyzing the interactions between molecules in cells [3]. For example, in a groundbreaking 2005 experiment, Koyama-Honda et. al. successfully confirmed
the co-localization of E-cadherin (a transmembrane protein responsible for cell adhesion)
and anti-E-cadherin antibody using two-color fluorescent microscopy [4].
1.1.2
Labeling Strategies
There are two principle methods for applying fluorescent labels to target molecules. In
the case of fluorescent proteins, genetic recombination is the prevailing technology, whereas
immunostaining is the preferred approach for organic fluorescent dyes.
In genetic recombination, the nucleotide sequence encoding the target protein is cloned
to the DNA encoding the fluorescent protein. The resulting recombinant plasmid is then
transfected into the host cells, and upon expression, a chimeric protein with the properties
of both the target molecule and the fluorescent probe is generated. One drawback of this
technique is low transfection efficiency, which is compounded in the case of co-transfection:
in general, only a small fraction of cells can be expected to express both constructs.
With organic fluorescent dyes - which, as a rule, are brighter and more photostable than
fluorescent proteins [1] - genetic recombination is not an option, so immunolabelling is utilized. In this procedure, the molecule to be labelled is tagged with a primary antibody, while
the fluorescent reporter is tagged with a secondary antibody. The primary and secondary
antibodies associate exclusively, giving rise to selective affinity between the target molecule
and its tag. In theory, all cells containing the target molecule are uniformly labeled, and
in this respect, immunostaining is more robust than genetic recombination. However, one
signficant disadvantage of immunostaining is the comparatively large size of the fluorescent
construct, which limits spatial resolution. Immunostaining also precludes the imaging of live
cells, since antibodies are unable to cross the plasma membrane unless it is punctured.
3
Figure 3: Labeling strategies: genetic recombination and immuolabeling
1.2
1.2.1
Super-resolution Imaging
The Diffraction Limit
In an ideal world, visualizing subcellular structures would simply be a matter of arbitrarily
increasing the magnification. Unfortunately, however, there is a certain point beyond which
“zooming in” fails to produce a resolvable image [3]. Conventional optical microscopy is
subject to a fundamental limitation: the finite wavelength of visible light. When a wave
strikes an object, it diffracts around the incident point and, upon returning to the detector,
is registered as an intensity distribution rather than a point. This intensity distribution
(or point-spread function, in mathematical terms) can be characterized by its localization
precision:
0.61λ
(1)
NA
In Equation 1, λ is the wavelength of incident light and N A is the numerical aperture of the
objective lens. Therefore, the spatial resolution of any image is bounded by the illumination
wavelength. For green light striking a standard oil immersion objective, there is a lateral
resolution limit of about 200 nm [3], whereas sub cellular components are on the order of
angstroms.
Equation 1 would imply that decreasing the wavelength of the incident beam is a viable
means of increasing the spatial resolution. In fact, this principle has been harnessed to
devise novel imaging techniques, most notably electron diffraction. A beam of electrons has
a wavelength of approximately 1 to 10 pico meters - about 3 orders of magnitude smaller
than the wavelength of a visible light ray - and can therefore probe matter at much higher
resolution. However, due to the invasivity of this methodology, electron diffraction cannot
be used to investigate living specimens [5]. Another approach is required.
∆loc =
4
1.2.2
General Strategy
By appealing to the temporal domain, researchers have devised a means of circumventing
the diffraction limit without compromising the integrity of living tissues. This technology
harnesses the optical highlighting properties of certain fluorescent probes to generate a composite image from successive frames. The procedure can be summarized as follows:
1. Observe the fluorescence of a few particles at a time, provided they are separated by a
distance larger than the diffraction limit.
2. Fit each diffracted spot to a Gaussian distribution to estimate the centroid location1 .
3. Repeat this procedure for approximately 10,000 frames and superimpose all the processed frames to generate the final image [1].
Figure 4 provides a schematic of super-resolution image generation:
Figure 4: A schematic of super-resolution image reconstruction
The localization precision is no longer limited by wavelength, but rather by the number
of measurements of the centroid location, which is equivalent to the number of photons (N)
emitted by each fluorophore [3] .
σP SF
∆loc = √
(2)
N
We have thus circumvented the wavelength-dependence of spatial resolution. Using this
iterative procedure of activation + visualization + bleaching [1], we are able to achieve
resolutions up to one order of magnitude higher than before (which translates into a lateral
localization of about 20 nm for a standard optical setup). Thus, the sub-cellular regime
is accessible with visible light. Figure 5 demonstrates the amazing capabilities of superresolution microscopy as compared to conventional imaging techniques.
1
Of course, this step relies on the assumption that the point-spread function is perfectly Gaussian, which
is not generally true.
5
Figure 5: Super-resolution microscopy applied to microtubules
1.2.3
Optical Highlighting
As described in the preceding section, super-resolution imaging technology relies on the ability to image single molecules at a time. When two fluorophores are separated by a distance
shorter than the localization precision, their intensity distributions overlap, and centroid
determination is compromised [3]2 . Only when dealing with spatially isolated molecules can
we be confident that each PSF is being generated by a single light source, allowing us to
perform Gaussian fitting to localize the centroid [1]. One prerequisite for suitably separated
single molecules is the use of imaging techniques such as TIRFM, which is described below.
Another is the employment of fluorophores that exhibit optical highlighting: that is, the
ability to be photoactivated, photoconverted, and/or photobleached [1]. In essence, optical
highlighting allows us to increase spatial resolution by invoking the temporal domain. Optical highlighting takes many forms, all of which involve fluorophores that can switch between
a bright and a dark state. The switching mechanisms, however, are crucially different depending on the type of fluorescent probe, and the methods for generating super-resolution
images vary accordingly. Two of the most prominent techniques are PALM - used with
fluorescent proteins - and STORM - used with fluorescent dyes.
Photo-Activated Localization Microscopy (PALM) is predicated on fluorescent
proteins that can undergo a chemical switch from an “OFF” state to an “ON” state upon
2
Compare this to the familiar macroscopic phenomenon of watching the headlights of an approaching car:
they are only resolvable into two separate light sources at distances below a certain threshold.
6
light irradiation. One of the most well-established photoactivatable fluorescent proteins
is PAmCherry, developed by Subach et. al. in 2009. The group performed site-specific
mutagenesis on conventional mCherry in the hopes of identifying mutants that might exhibit
photoactvatable character. Further rounds of random mutagensis improved the fluorescent
properties of this probe [6].
In the OFF state, PAmCherry absorbs at 405 nm; in the ON state, it absorbs at 564
nm (see Figure 9). The absorbance shift involves the formation of a double bond (via a
light-induced Kolbe-type radical pathway) between the beta-carbon atom in the Tyr-67 side
chain and the imidazol-5-ol ring of the chromophore. (This reaction requires the abstraction
of a proton from the beta-carbon atom, which can be accomplished by a lysine residue in
position 70; site-selected mutagenesis confirmed the crucial role played by this neighboring
lysine.) The double bond gives rise to pi-bond conjugation between the imidazol-5-ol ring
and the phenyl ring in tyrosine, which is ultimately responsible for the fluorescence activity
of the activated chromophore. Crucially, exciting the PAmCherry chromophore with 405
nm light under anaerobic conditions does not lead to fluorescence., implying that oxidation
plays a fundamental role in the photo-activation pathway. The mechanism is discussed at
length in [7].
Direct Stochastic Optical Reconstruction Microscopy (dSTORM) relies on the
fluorescent intermittence of certain organic probes: that is to say, these fluorophores oscillate
randomly between dark and bright states under continuous excitation. In order to successfully localize single molecules, we need to minimize the number of actively fluorescing probes
at any given time. In theory, a sparse population of activated probes can be achieved by
stabilizing the OFF states of the fluorophores; but in practice, how is this accomplished?
So-called “switching buffers” provide one solution. Recall that when fluorophores are
stimulated, valence electrons are excited to a singlet state, from which they can either return
radiatively to the ground state or relax nonradiatively to a triplet state. This triplet state
can be quenched by the addition of a reducing agent (such as mercaptoethanol or DTT [8]),
thereby trapping the fluorophore in a long-term OFF state. In the presence of oxygen, the
reduced triplet state is reoxidized and fluorescent emission is once again enabled. AlexaFluor
647, for example, has been shown to blink stochastically in the presence of a switching buffer
alkalinized with KOH. Unfortunately, switching buffers have a finite lifetime: as soon as the
reducing agent is consumed, the fluorophores remain continuously in their ON states [9].
1.2.4
TIRF
One well-established tool for single molecule spectroscopy is the Total Internal Reflection
Fluorescence Microscope. This device illuminates an effectively unicellular slice of sample
(100 to 150 nm) at a time, thereby reducing the number of target molecules within the
field of view. The operating principle is as follows: when a ray of light strikes an interface
at an angle higher than the critical angle (which is determined by the refractive indices of
the media on either side of the interface), the ray is reflected back into the source medium.
At the same time, an evanescent wave is generated at the incident plane, which propagates
away from the interface on the opposite side. The intensity of this evanescent wave decreases
exponentially with distance from the interface. Therefore, only a thin slice of the specimen
is appreciably excited, limiting the region of study to a single plane of cells and increasing
7
the signal-to-noise ratio [8]. Figure 6 elucidates this technique.
Figure 6: TIRF Microscopy setup
1.2.5
Spatial vs. temporal resolution
Super-resolution imaging via optical highlighting illustrates a fundamental trade-off between
spatial and temporal resolution. The generation of a single image requires about 30 ms of
exposure; therefore, in order to acquire the 10,000 frames required for thorough PALM or
STORM processing, approximately 5 minutes of acquisition time is required. This precludes
dynamic imaging of most cellular processes - for example, clathrin-mediated endocytosis
(which will be discussed at length in the subsequent section) proceeds on the order of 30
to 90 seconds. Conceivably, we could increase the temporal resolution by shortening the
exposure time of each frame. However, recall that the localization precision is inversely
proportional to the square root of the number of recorded photons (see Equation 2). The
more time spent acquiring each image, the more photons are used to compute the centroid,
and the higher the spatial resolution. Engineering brighter fluorescent probes can mitigate
this difficulty somewhat, but ultimately the trade-off is unavoidable.
1.3
Clathrin-Mediated Endocytosis: The Subject of Study
Endocytosis is the process by which the plasma membrane engulfs an extracellular particle
and invaginates to form a cargo-containing vesicle. Endocytosis (in conjunction with its
inverse process, exocytosis) is crucial to the maintenance of homeostasis, and is one of the
principle means by which a cell interacts with its environment. Endocytotic mechanisms
can be broadly classed into two main types: facultative and constitutive. Facultative endocytosis (also called phagocytosis in some contexts) exists primarily for the transport of
8
large particles across the plasma membrane, and only occurs when certain membrane receptors are specifically triggered. Constitutive endocytosis (or pinocytosis), on the other hand,
continuously shuttles small particles across the membrane without a concerted stimulus [10].
Clathrin-mediated endocytosis - a constitutive process - is the most common endocytotic
mechanism, and is employed by all known eukaryotes [11]. We use this term to refer specifically to endocytosis occurring at the plasma membrane, though an analogous process can
occur in organelles within the cell [12]. The functions of clathrin-mediated endocytosis are
crucial and varied, and include “regulating the surface expression of proteins, sampling the
cell’s environment for growth and guidance cues, bringing nutrients into cells, controlling
the activation of signaling pathways, retrieving proteins deposited after vesicle fusion, and
turning over membrane components by sending these components for degradation in lysosomes” [12]. Clathrin-mediated endocytosis is a modular pathway consisting of five discrete
stages:
1. Nucleation: preliminary membrane deformation
2. Cargo selection: association of membrane receptors with their intended cargo
3. Coat assembly: recruitment of clathrin to the deforming membrane
4. Scission: pinching off of the vesicle
5. Uncoating: dispersal of clathrin [12]
The entire process (depicted in Figure 7) occurs on a minute time-scale [13].
Figure 7: Clathrin-mediated endocytosis
A number of proteins are implicated in clathrin-mediated endocytosis, any of which might
be suitable target molecules for a fluorescence-based co-localization experiment. The present
investigation, however, is limited to three molecules of immediate interest. Our selection of
these particular targets is motivated by a combination of functional salience, labeling facility,
and scientific precedent.
9
1.3.1
Clathrin
As its name suggests, clathrin-mediated endocytosis depends first and foremost on the protein clathrin, which induces membrane curvature. Each clathrin molecule is comprised of
three so-called “heavy chains” (190 kDa, 450 Å) and three corresponding “light chains”
(23-26 kDa), which adopt a triskelian structure radiating outwards from a central hub [11].
Following nucleation and cargo binding, clathrin molecules are recruited to the cytosolic
surface of the budding membrane, where they polymerize to form a stiff lattice of alternating pentagons and hexagons. A geometrical consequence of this three-dimensional pattern
is that the membrane deforms into a dome, ultimately curving in on itself to form what
McMahon and Boucrot poetically describe as a “[vesicle] in a basket.” The radius of membrane curvature - which can be anywhere from 20-80 nm - depends on the ratio of hexagons
to pentagons in the clathrin coat [12]3 .
1.3.2
Transferrin Receptor
Transferrin receptor is a transmembrane glycoprotein which binds selectively to the ironbearing transferrin molecule [13], and is thus responsible for delivering iron into the cell4 .
Morphologically speaking, its ectodomain is a homodimeric protein comprised of two 11 nm
wing-shaped structures, each of which is composed of three domains: apical, central helical,
and protease-like. Cargo binding occurs through the the latter two domains. Between
rounds of endocytosis, transferrin receptors diffuse freely throughout the membrane; during
endocytosis, they have been shown to localize around budding pits.
1.3.3
Dynamin
Dynamin is a GTPase that readily oligimerizes to form long helical spirals. Its name hints
at its function [5]: dynamin is a mechanochemical enzyme [12] whose dynamic properties
are crucial to its role in endocytic fission. Mettlen et. al. propose one possible mechanism
for dynamin-induced fission: exogenous dynamin in the cytoplasm binds to GTP and then
assembles around the narrowing neck of a clathrin-coated pit. The subsequent hydrolysis of
GTP into GDP results in dynamin constriction, which in turn induces vesicle scission [5].
Other research suggests that constriction alone cannot precipitate the vesicle’s pinching off
from the plasma membrane, but that longitudinal membrane tension is also required [15] to
form a full-fledged vesicle.
1.3.4
Other important molecules
Though not directly involved in our study, a number of auxiliary proteins play a crucial role
in clathrin-mediated endocytosis. Transmembrane receptors such as TfR do not interact
with clathrin directly, but rather through specially tailored adaptor proteins (AP’s). The socalled AP-2 molecule binds to the cytosolic domain of TfR (recall that the ectodomain of TfR
3
Some clathrin coated pits fail to bud into full-fledged vesicles; these abortive structures never associate
with cargo, and typically have lifetimes of about 20 seconds.
4
In fact, overexpression of TfR has been linked to atherosclerosis, an arterial condition thought to result
from excess iron [14].
10
Figure 8: a) A single clathrin triskelion highlighted inside a clathrin cage; b) The ectodomain
of Transferrin Receptor; c) Dynamin polymer
associates with transferrin), thereby sequestering the appropriate cargo in the region primed
for endocytosis [5]. The AP-2 adaptor complex is also responsible for clathrin recruitment
to the cytosolic surface [13]. Following vesicle scission, adaptor proteins are recycled back
into the cytoplasm, where they await future deployment [12].
Recent research has suggested that the actin cytoskeleton might also participate in
clathrin-mediated endocytosis [11]. Though the exact mechanism of this interaction is still
unclear, actin filaments are known to localize around the clathrin-coated pits during endocytosis and seem to play nontrivial roles in “invagination, constriction, and scission” [5].
2
2.1
Results
Experimental Design
Using the protocols outlined in Section 5, we generated two chimeric proteins (transferrin receptor + EYFP and clathrin light chain + PAmCherry) and one immunoconjugate (dynamin
+ AlexaFluor 647). The fluorophores were chosen to have easily distinguishable excitation
and emission properties, as shown in Figure 9.
We co-expressed the two chimeric proteins in simian, fibroblast-like COS-7 cells. After
fixation and permeabilization, we performed immunostaining to introduce the dynamin immunoconjugate into the cells. Using confocal microscopy, we confirmed the expression of
these three constructs and troubleshooted the photo-activation mechanism of PAmCherry;
one set of confocal images is shown in Figure 10. We spent a significant amount of time optimizing the intensity of the activation signal: if too intense, the EYFP and AlexaFluor 647
signals would suffer bleaching, but if too diffuse, the PAmCherry would remain in its OFF
state. Ultimately, we exposed our samples to a 405 nm activation signal at 30% intensity
for 10 seconds.
We then proceeded to collect data on an Olympus Total Internal Reflection Miscroscope.
11
Figure 9: Spectral details of the chosen fluorescent probes
Figure 10: Confocal images validated our expression protocol
Through extensive trial and error, we observed that the strongest signals were obtained
when AlexaFluor 647 was imaged first, followed by EYFP, followed by PAmCherry. We
achieved a great deal of qualitative evidence to support clustering of the individual target
molecules. Figures 11 through 13 show the heterogeneous localization of our three labeled
proteins. Co-localization between TfR and CLC was also observed, as shown in Figure 14.
Dynamin, however, does not seem to interact proliferatively with the other target molecules.
This is not altogether unexpected, since the rate of dynamin’s recruitment and dispersal
during clathrin-mediated endocytosis is faster than the temporal resolution of our imaging
12
techniques.
Figure 11: Clathrin Light Chain clustering
Figure 12: Transferrin Receptor clustering
13
Figure 13: Dynamin co-localization
Figure 14: Putative co-localization
2.2
Cluster Analysis
Achieving qualitative co-localization is undeniably encouraging; however, we cannot set much
store by our results without subjecting them to more quantitative treatment [16]. Tan et. al.
warn against purely qualitative analysis, since apparent clustering might be a mere “artifact
of the human visual system” [17], and Figure 15 illustrates the ambiguity associated with
performing cluster analysis “by hand”. Lippitz et. al. elaborate on the importance of
performing statistical analysis on PALM or dSTORM-processed data: “Both the randomness
14
of single photon counting, and the irreproducibility of random blinking make statistical
evaluations indispensable to establish firm conclusions and fully exploit single-molecule data”
[18]. In this section, we investigate a number of cluster analysis techniques which enable us
to rigorously characterize homogeneous clustering and heterogeneous co-localization.
Figure 15: The difficulty of interpreting clustered data by sight alone [17]
There are a number of ways to define and identify a “cluster,” the most common and
versatile of which are described below:
1. Well-separated: in this idealized scenario, the clusters are spatially distinct and straightforward to identify with the naked eye.
2. Centroid-based: an event belongs to a certain cluster if it is more similar to the centroid
of that cluster than the centroid of any other cluster.
3. Density-based: a cluster is a region of high event density surrounded by a region of
low event density.
4. Conceptual: cluster memberships are assigned based on abstract characteristics.
5. Objective function: different cluster assignments are experimented with until some
objective function is minimized.
Thus, though individual approaches may vary, all clustering algorithms capitalize on the
similarities within groups (cohesion) and the differences between them (separation) [17].
We begin our quantitative analysis by representing our PALM-processed data5 as the
result of a spatial point process. According to this model, an “event” is defined as a single
bright pixel or group of tightly clustered bright pixels [19], which we assume emanates
5
The data could just as easily have been processed using STORM or another super-resolution analysis
technique; the method by which we generated the final image is irrelevant here.
15
from a single fluorescent probe. Our task is to determine whether there is any spatial
dependence governing these events6 . In most cluster analysis scenarios, data is compared
to a condition known as complete spatial randomness (CSR), which serves as our control.
CSR data consists of events resulting from the execution of a homogenous Poisson point
process. Under conditions of CSR, there is no spatial dependence among the points [20];
in the context of this experiment, we could identify CSR with the signals of fluorescently
labeled proteins which do not interact.
2.2.1
K-Nearest Neighbor Approach
One of the most intuitive cluster analysis methods is K-Nearest-Neighbor (KNN) Algorithm.
Though more robust techniques are available, the KNN approach merits discussion because of
its simplicity and historical importance. The naive implementation is as follows: all N points
are assigned a unique class label, C1 through CN . An NxN matrix encoding the Euclidean
distances between each pair of points is computed. Then, in an iterative “machine-learning”
process, each point is considered in turn and assigned the most common class label of its
K nearest neighbors7 . Although relatively easy to understand and remarkably effective in
certain contexts, the KNN approach can be computationally intensive for large values of N,
so it has been supplanted by more efficient algorithms in recent years.
2.2.2
Ripley’s K Function
Ripley’s K function is a density-based cluster analysis technique. In this approach, we define
a “cluster” as a region of high density surrounded by a region of lower density, relative to
CSR. Generally speaking, Ripley’s K function takes the form:
K(d) =
E[number of events occurring within distance d of the given event]
λ
(3)
where λ denotes the density of events and E represents the expectation operator [21]. In
practice, evaluating Ripley’s K Function involves selecting a point, drawing successively
larger concentric circles around that point, and evaluating the number of events encompassed
within each circle. Thus, Equation 3 becomes:
n
n
A XX
δij
K(d) = 2
n i=1 j=1
(4)
where d is the radius of a circle centered around point i, A is the area of the region being
considered, n is the total number of points in area A, and δij = 1 if δij < r and 0 otherwise.
For a homogeneous Poisson process (which results in a CSR-type distribution), the expected
6
In order to represent our data as a series of discrete events, we needed to convert our grayscale image file
into a binary matrix. We accomplished this by applying a threshold to each pixel. The choice of threshold
value is somewhat arbitrary; we followed the lead of Owen et. al. and chose a threshold value of 60% of the
peak pixel intensity [8].
7
If there is a tie among the neighbors, the iteration is repeated with the nearest K-1 neighbors, and so
on.
16
number of events is λ ∗ π ∗ d2 , and therefore K(d) = π ∗ d2 . In the event of clustering, we
expect to see a higher K value than π ∗ d2 for small values of d and a lower K value than
π ∗ d2 for large values of d. Higher K(d) values indicate aggregation, whereas lower K(d)
values correspond to “spatial inhibition.”
Figure 16: Calculating Ripley’s K Function
After implementing a simple version of Ripley’s K Function in MATLAB, we applied the
program to our PALM-processed data (as well as an artificially-generated CSR distribution)
and achieved the results shown in Figure 17. Note that Ripley’s K Function can also be
represented as K(d) - d, where positive values correspond to regions of high density and
negative values correspond to regions of low density [8].
Figure 17: Our clustering data subjected to Ripley’s K Function analysis
17
When coding Ripley’s K function, we ought to bear a few implementation details in mind.
In order to account for edge effects, we define a border region around the perimeter of the
image. Events within this border region are not themselves subjected to cluster analysis,
though they are factored into the analyses of neighboring points. Also, rather than dividing
the area of study into a grid of quadrates, we define a moving “window” of fixed size [20].
This can be implemented by applying a kernel weighting function to the PALM or dSTORMprocessed image, which “zeros out” the signals outside the region of immediate interest [20].
Ripley’s K function can also be made to accommodate temporal dependence, rendering it
suitable for live-cell applications.
2.2.3
Pair Correlation Analysis
Perhaps the technique most commonly used to evaluate PALM or dSTORM data is Pair
Correlation Analysis (PCA), which relies on the use of correlation functions. As its name
implies, a correlation function describes the spatial or temporal correlation between two
sets of data. For a standard correlation function, nonzero values represent some sort of
mutual dependence between the variables, whereas a value near zero indicates that the
variables are independent. Cross-correlation functions quantify the relationship between
two different signals (in our case, two different proteins); an autocorrelation function is
simply the cross-correlation function of one signal with itself. Therefore, we can use an
autocorrelation function to quantify the localization of a single target molecule and a crosscorrelation function to quantify the co-localization of multiple target molecules.
Pair Correlation Analysis is an extremely powerful tool for cluster assignment. Unlike the
methods previously described, PCA allows us to distinguish between genuine clusters (which
represent separate proteins interacting with one another) and spurious clusters (which arise
from conflicting localizations of a single protein at different time points). The apparent
motion of a solitary protein between frames is a stochastic phenomenon, arising because of
uncertainty in centroid localization [22]. This effect - which causes both the human observer
and cluster analysis algorithms to perceive multiple molecules where there is, in fact, only
one - can severely distort our results if not taken into account.
The total pairwise correlation function is expressed as:
g(r)peaks = (g(r)centroid + g(r)protein ) ∗ g(r)P SF
= g(r)stoch + g(r)protein ∗ g(r)P SF
(5)
(6)
where g(r)peaks represents the probability of finding another protein at a distance r from a
given protein; g(r)centroid represents the protein correlation function at r = 0; g(r)protein represents the protein correlation function at r > 0; g(r)P SF represents localization uncertainty;
and g(r)stoch accounts for “multiple appearances of the same molecule” [19]. In essence, this
equation relates observed patterns in the binarized data (g(r)peaks ) to the protein interactions
these patterns represent (g(r)protein ). Under CSF conditions, we expect to see:
g(r)peaks = g(r)stock + 1
18
(7)
For clustered data, we observe an equation of the form:
g(r)peaks = g(r)stoch + (Ae−r/ξ + 1) ∗ g(r)P SF )
(8)
Fortunately, “the cluster of peaks belonging to a particular protein has a well-defined spatial signature” [22]. This allows us to estimate g(r)stoch based on the average protein density
and the width of the Gaussian function being fitted. By subtracting away the stochastic contribution to g(r)peaks , solving for g(r)protein , and fitting the protein correlation function to the
second term of Equation 8, we can extract the parameters A and ξ, which provide estimates
of cluster size and density. Thus, PCA allows for the removal of stochastic misinformation
and the precise characterization of the clusters’ physical properties8 . Figure 18 shows the
results of our own PCA implementation in MATLAB. The calculated parameters are only
rough estimates of cluster radius and population; however, the analysis clearly confirms the
existence of large, stable clusters of all three target molecules.
Figure 18: Our data subjected to pair correlation analysis
2.2.4
Cluster Validation
No cluster analysis technique is ironclad. In order to ensure that we have not simply identified
serendipitous patterns in signal noise, we must subject our results to cluster validation.
According to Tan et. al., there are three types of cluster validation:
8
Another benefit of PCA is its ability to account for noise-induced over-counting [23].
19
1. Relative validation: compare the results of two or more clustering algorithms.
2. Internal validation: analyze the cluster analysis results using only the data at hand (i.e.
evaluate the cohesion and separation of the putative clusters using the matrix-based
technique described in the following paragraph).
3. External validation: cross-check the cluster analysis results against exogenous information (i.e. a known number of clusters) [17].
We have already performed relative validation by implementing both Ripley’s K Function
and a simple PCA algorithm and comparing their respective outcomes. One potentially
informative internal validation scheme would be the following: first, we would generate an
N x N proximity matrix, which encodes the Euclidean distance between each pair of points.
Following cluster analysis, we would assemble the N x N incidence matrix, where an entry of
1 means points i and j have been classified into the same cluster and an entry of 0 means they
have been classified into different clusters. We would then calculate the correlation between
these two matrices: if it is high, the cluster analysis would be deemed successful [17]. The
implementation of this validation technique is currently in progress.
3
Discussion and Future Directions
Super-resolution imaging through photoactivated localization microscopy provides a powerful means for circumventing the diffraction limit. In this study, chimeric protein expression
and antibody conjugation were used to fluorescently label clathrin light chain, transferrin
receptor, and dynamin, three proteins thought to collectively mediate endocytosis. The fluorescent probes were imaged with confocal and TIRF microscopes using 3 distinct laser lines.
Cursory investigation of the data seemed to corroborate the clustering and co-localization
hypotheses, and more rigorous analysis using Ripley’s K Function and PCA confirmed our
hunch that all three target molecules exhibit heterogeneous clustering during endocytosis.
3.1
Correcting for Chromatic Aberration
One persistent difficulty in optical microscopy is a phenomenon known as chromatic aberration. Refractive indices are wavelength dependent; therefore, when polychromatic light
passes through a lens, the individual wavelengths converge on slightly different focal points,
giving rise to a blurry image. In multicolored fluorescent imaging, each excited fluorophore
emits a different wavelength of light. Chromatic aberration renders the resulting images
non-superimposable, thus precluding co-localization analysis. We can mitigate this effect by
incorporating “fluorescent beads” into our experimental set-up. These minute polystyrene
spheres - available from a number of laboratory suppliers - are conjugated with fluorescent
dyes that emit at a known wavelength, and can be used to assist in spatial calibration.
3.2
Improved Cluster Analysis
The cluster analysis algorithms used to analyze our results were rather crudely implemented.
For example, the parameters extracted from PCA were disconcertingly dependent upon ini20
Figure 19: Fluorescent beads can be used to correct for chromatic aberration
tial conditions, and computation times were prohibitively long. Refinement of our customized
MATLAB code is currently underway. Future efforts may also involve implementation of DBSCAN (Density-Based Spatial Clustering of Applications with Noise), one of the most sophisticated cluster analysis algorithms currently available. DBSCAN is a density-based approach
which can accommodate noise and arbitrarily-shaped clusters [17]. We also plan to perform
cross-correlation analysis in order to rigorously characterize TfR-CLC co-localization.
3.3
Computational Docking with HADDOCK
As Vries et. al. eloquently write, “to fully understand how the various units work together
to fulfill their tasks, structural knowledge at an atomic level is required” [24]. in silico
docking simulations are a powerful complement to experimental techniques, offering a highresolution glimpse into protein-protein interactions. Armed with atomic insights into the
binding conformations of our target molecules, we can modify individual residues to enhance
or abrogate native interactions.
Computational docking software iteratively adjusts protein positions to find the binding
conformations that minimize the system’s total energy. There are many docking packages
available, but HADDOCK (High Ambiguity Driven protein-protein DOCKing) bimolecular
docking software is among the most powerful, thanks to its data-driven approach: rather
than assessing binding conformations in an ab initio fashion, it employs existing experimental
data to preemptively narrow configurational space. For example, the user can input a list
of putative “active” and “passive” residues - identified through random mutagenesis - which
gives the algorithm a substantial head-start [24]9 . Once primed with experimental data,
HADDOCK employs a three-part algorithm to probe configurational space and estimate the
best docking orientation:
1. Rigid body minimization, in which the individual molecules are treated as immobile
structures
9
If such data is unavailable, the user can use an interface prediction algorithm such as CPORT (Consensus
Prediction Of interface Residues in Transient complexes) to identify salient residues [25].
21
2. Semi-flexible refinement, in which the side chains are allowed to fluctuate
3. Fine-tuning in explicit solvent [24]
Future endeavors may involve performing docking simulations on all-atom models of TfR,
CLC, and dynamin, using PDB crystal structures and CPORT Prediction Interface results
as inputs.
4
Acknowledgements
I am sincerely grateful to Professor Takeaki Ozawa, for being such an accessible and inspiring
teacher; to Yusuke Nasu, for his tireless patience and unflagging mentorship; to the entire
Ozawa lab, for welcoming me so warmly into their midst; to Sachiko Soeda and the other
coordinators of the UTRIP program for ensuring such a smooth and rewarding study-abroad
experience; and to Friends of Todai, Inc. for funding my stay in Japan.
5
Materials and Methods
5.1
5.1.1
Sample Preparation
Subculture
A subculture is performed on stock cells approximately every three days in order to eliminate metabolic
wastes, refresh the nutrient supply, and ensure that the population does not grow too dense. For COS-7
cells, the optimum confluency for transfection is approximately 90%.
1.
2.
3.
4.
5.
6.
7.
5.1.2
Remove old medium by aspiration.
Wash dead cells away with phosphate-buffered saline (PBS).
Add Trypsin-EDTA to unstick live cells from dish10 .
Incubate sample at 37 degrees C for 5 minutes.
Centrifuge sample until a pellet forms (about 2 minutes), and remove supernatant by aspiration.
Resuspend pellet in 1 mL pre-warmed DMEM glucose medium.
Add a few hundred microliters of cell solution and 10 mL fresh medium to a clean dish and mix gently.
Co-Transfection
The transfection protocol is designed to introduce custom-made genetic constructs into host cells. The
readily-available lipofectamine kit promotes engulfment of plasmid DNA into the cell.
1. Combine 200 micro liters Opti-MEM reduced serum medium and 1 µg of each plasmid to be cotransfected in an Eppendorf tube (the plasmid concentrations can be adjusted to maximize expression).
2. Add 2 µL Lipofectamine 1025 reagent 1 (orange cap), microcentrifuge, and incubate at room temperature for 5 minutes.
3. Add 6 µL Lipofectamine 1025 reagent 2 (green cap), microcentrifuge, and incubate at room temperature for 20 minutes.
4. Add resulting solution to sample cells and incubate overnight.
10
COS-7 cells are belong to a class known as “adherent cells.”
22
5.1.3
Fixation and Permeabilization
The cell fixation procedure preserves biological tissues so as to prolong observation time. When immunostaining is involved, the plasma membranes must also be permeabalized to permit entry of the antibodies
into the cell.
1.
2.
3.
4.
5.
5.1.4
Prepare a sample dish for observation.
Wash twice with 2 mL PBS.
Add 1 mL of 4% paraformaldehyde (PFA) solution and incubate sample at 37 degrees for 10 minutes.
Wash sample three times with PBS.
Add permeablization buffer (1.4 mL PBS + 2.8 µL Triton X-100) and incubate sample at room
temperature for 5 minutes.
Immunostaining
In this procedure, we add the primary antibody (tagged with the target molecule) and the secondary antibody
(tagged with the fluorescent reporter) to the host cells. The purpose of the blocking buffer is to eliminate
non-specific binding of the antibodies, which could lead to spurious signal.
1.
2.
3.
4.
5.
6.
7.
8.
9.
Add 1mL of blocking buffer (0.2% gel from cold water fish skin + PBS) and 20 µL primary antibody.
Incubate sample on spinner (40 rpm) at room temperature for 1 hour.
Remove medium and save for later (the primary antibody can be reused up to 5 times).
Wash sample three times with 1 mL blocking buffer.
Add 0.75 µL secondary antibody.
Incubate sample on spinner (40 rpm) at room temperature for 1 hour.
Remove and discard medium.
Wash sample three times with 1 mL blocking buffer.
Add 1 mL blocking buffer and deposit on labeled slide.
5.2
Image Acquisition
5.2.1
Confocal Microscopy
We used EYFP emission to identify viable imaging candidates: i.e. cells that exhibit strong fluorescence and
clear nuclear structure. We defined a photoactivation region and irradiated this area with 405 nm light at
30% intensity for 10 seconds to switch PAmCherry into its ON state. We then acquired PAmCherry, EYFP,
AlexaFluor 647, and bright field images.
5.2.2
TIRF Microscopy
Images were acquired on an Olympus Total Internal Reflection Microscope. EYFP signal (488 nm excitation)
was measured with a 525/45 nm filter. PAmCherry signal (561 nm excitation following 405 nm activation)
was measured with a 609/59 filter. EYFP and PAmCherry images were captured at an exposure rate of 30
ms per frame; AlexaFluor 647, however, emits at a higher intensity, so an exposure rate of 20 ms per frame
was sufficient.
5.3
Image Analysis
Freely available ImageJ software was used to perform PALM and dSTORM analysis on TIRFM images. The
raw data was processed with a Gaussian blur low-pass filter (σ = 0.75). The Octane plugin was used to
analyze the image stack (size = 2, σ = 0.94, threshold = 2000). For the Ozawa laboratory’s optical system,
1 pixel corresponds to approximately 80 nm.
23
References
[1] G. Patterson, M. Davidson, S. Manley, and J. Lippincott-Schwartz, “Superresolution imaging using single-molecule localization,” Annual Review of Physical Chemistry,
vol. 61, pp. 345–367, 2010.
[2] M. Zimmer, Glowing Genes: A Revolution in Biotechnology. Prometheus Books, 2005.
[3] B. Huang, M. Bates, and X. Zhuang, “Super-resolution fluorescence microscopy,” Annual Review of Biochemistry, vol. 78, pp. 993–1016, 2009.
[4] I. Koyama-Honda, K. Ritchie, T. Fujiwara, R. Iino, H. Murakoshi, R. S. Kasai, and
A. Kusumi, “Fluorescence imaging for monitoring the colocalization of two single
molecules in living cells,” Biophysical Journal, vol. 88, pp. 2126–2136, March 2005.
[5] M. Kaksonen, C. P. Toret, and D. G. Drubin, “Harnassing actin dynamics for clathrinmediated endocytosis,” Nature Reviews: Molecular Cell Biology, vol. 7, June 2006.
[6] F. V. Subach, G. H. Patterson, S. Manley, J. M. Gillette, J. Lippincott-Schwartz, and
V. V. Verkhusha, “Photoactivatable mcherry for high-resolution two-color fluorescence
microscopy,” Nature Methods, vol. 6, pp. 153–158, February 2009.
[7] F. V. Subach, V. N. Malashkevich, W. D. Zencheck, H. Xiao, G. S. Filonov, S. C.
Almo, and V. V. Verkhusha, “Photoactivation mechanism of pamcherry based on crystal
structures of the protein in the dark and fluorescent states,” PNAS, vol. 106, pp. 21097–
21102, December 2009.
[8] D. M. Owen, C. Rentero, J. Rossy, A. Magenau, D. Williamson, M. Rodriguez, and
K. Gaus, “Palm imaging and cluster analysis of protein heterogeneity at the cell surface,”
Journal of Biophotonics, vol. 3, no. 7, pp. 446–454, 2010.
[9] M. Heilemann, S. van de Linde, A. Mukherjee, and M. Sauer, “Super-resolution imaging
with small organic fluorophores,” Angewandte Chemie, vol. 48, pp. 6903–6908, August
2009.
[10] B. Alberts, A. Johnson, J. Lewis, M. Raff, K. Roberts, and P. Walter, Molecular Biology
of the Cell. Garland Science, 5th ed., 2008.
[11] T. Kirchhausen, “Imaging endocytic clathrin structures in living cells,” Trends in Cell
Biology, vol. 19, no. 11, 2009.
[12] H. T. McMahon and E. Boucrot, “Molecular mechanism and physiological functions
of clathrin-mediated endocytosis,” Nature Reviews: Molecular Cell Biology, vol. 12,
pp. 517–533, August 2011.
[13] D. K. Cureton, C. E. Harbison, E. Cocucci, C. R. Parrish, and T. Kirchhausen, “Limited
transferrin receptor clustering allows rapid diffusion of canine parvovirus into clathrin
endocytic structures,” Journal of Virology, vol. 86, pp. 5330–5340, May 2012.
24
[14] W. Li, L.-H. Xu, C. Forssell, J. L. Sullivan, and X.-M. Yuan, “Overexpression of transferrin receptor and feritin related to clinical symptoms and destabilization of human
carotid plaques,” Experimental Biology and Medicine, vol. 233, pp. 818–826, July 2008.
[15] A. Roux, K. Uyhazi, A. Frost, and P. D. Camilli, “Gtp-dependent twisting of dynamin
implicates constriction and tension in membrane fission,” Nature, vol. 441, May 2006.
[16] J. C. Waters and J. R. Swedlow, Evaluating Techniques in Biochemical Research, ch. Interpreting Fluorescence Microscopy Images and Measurements. Cell Press, 2007.
[17] P.-N. Tan, M. Steinbach, and V. Kumar, Introduction to Data Mining.
[18] M. Lippitz, F. Kulzer, and M. Orrit, “Statistical evaluation of single nano-object fluorescence,” Chemical Physical Chemistry, vol. 6, pp. 770–789, 2005.
[19] P. Sengupta, T. Jovanovic-Talisman, D. Skoko, M. Renz, S. L. Veatch, and J. LippincottSchwartz, “Probing protein heterogeneity in the plasma membrane using palm and pair
correlation analysis,” Nature Methods, vol. Advance Online Publication, 2011.
[20] A. C. Gatrell, T. C. Bailey, P. J. Diggle, and B. S. Rowlingson, “Spatial point pattern
analysis and its application in geographical epidemiology,” Transactions of the Institute
of British Geographers, vol. 21, no. 256-274, 1996.
[21] P. M. Dixon, Encyclopedia of Environmentrics, vol. 3, ch. Ripley’s K Function, pp. 1796–
1803. John Wiley and Sons, Ltd, 2002.
[22] P. Sengupta and J. Lippincott-Schwartz, “Quantitative analysis of photoactivated localization microscopy (palm) datasets using pair-correlation analysis,” Bioessays, vol. 34,
pp. 396–405, 2012.
[23] S. L. Veatch, B. B. Machta, S. A. Shelby, E. N. Chiang, and D. A. Holowka, “Correlation
functions quantify super-resolution images and estimate apparent clustering due to overcounting,” Public Library of Science ONE, vol. 7, February 2012.
[24] S. J. de Vries, M. van Dijk, and A. M. J. J. Bonvin, “The haddock web server for
data-driven biomolecular docking,” Nature Protocols, vol. 5, no. 5, pp. 883–896, 2010.
[25] S. J. de Vries and A. M. J. J. Bonvin, “Cport: A consensus interface predictor and
its performance in prediction-driven docking with haddock,” Public Library of Science
ONE, vol. 6, no. 3, 2011.
25