Two-factor reprogramming of somatic cells to pluripotent stem cells

Cell Death and Differentiation (2012) 19, 1268–1276
& 2012 Macmillan Publishers Limited All rights reserved 1350-9047/12
www.nature.com/cdd
Two-factor reprogramming of somatic cells to
pluripotent stem cells reveals partial functional
redundancy of Sox2 and Klf4
A Nemajerova*,1, SY Kim2, O Petrenko1 and UM Moll1
Ectopic expression of defined sets of transcription factors in somatic cells enables them to adopt the qualities of pluripotency.
Mouse embryonic fibroblasts (MEFs) are the classic target cell used to elucidate the core principles of nuclear reprogramming.
However, their phenotypic and functional heterogeneity represents a major hurdle for mechanistic studies aimed at defining the
molecular nature of cellular plasticity. We show that reducing the complexity of MEFs by flow cytometry allows the isolation of
discrete cell subpopulations that can be efficiently reprogrammed to pluripotency with fewer genes. Using these FACS-sorted
cells, we performed a systematic side-by-side analysis of the reprogramming efficiency with different two- and three-factor
combinations of Oct4, Sox2 and Klf4. We show that introduction of exogenous Oct4 with either Sox2 or Klf4 does not directly
convert MEFs to a pluripotent state. Instead, each combination of factors disrupts the normal cellular homeostasis and
establishes transient states characterized by the concurrent expression of mixed lineage markers. These cells convert into
induced pluripotent stem cells in a stochastic fashion. Our data suggest that there is a partial functional redundancy between
Sox2 and Klf4 in the disruption of cellular homeostasis and activation of regulatory networks that define pluripotency.
Cell Death and Differentiation (2012) 19, 1268–1276; doi:10.1038/cdd.2012.45; published online 27 April 2012
Reprogramming somatic cells into induced pluripotent stem
(iPS) cells represents a valuable resource for the development of general and patient-specific therapies. The crucial
issues facing the clinical translation of iPS technology are
(i) choosing the most appropriate cell type for induction of
pluripotency; (ii) improving the reprogramming efficiency; and
(iii) gaining a mechanistic understanding of the reprogramming process. To date, multiple cell types have been used for
iPS production, including embryonic and adult fibroblasts,
adipocytes, cardiomyocytes, epithelial, vascular, hematopoietic and neuronal progenitors.1,2 Reprogramming has
been achieved through the expression of various combinations of transcription factors, most often consisting of Oct4,
Klf4, Sox2 and c-Myc (OKSM).3–6 An important observation
from these studies is that the type of somatic cells chosen has
a significant effect on the efficiency of iPS generation and the
extent of manipulation required. Thus, less-differentiated cells
reprogram more efficiently than differentiated cells. Moreover,
partially differentiated cells, for example, neural stem cells,
can be reprogrammed with fewer factors.7–10
It is accepted that the ectopic expression of pluripotency
factors establishes an embryonic-like transcriptome that is
sustained by the subsequent reactivation of endogenous
gene-regulatory networks.1,2 However, determining the order
of critical events that take place during reprogramming and
their timing has proven difficult, in part because the process is
neither efficient (less than 0.1%), nor rapid (about 2–3 weeks),
nor homogeneous in terms of transgene integration. The
advent of a ‘secondary’ reprogrammable system based on
doxycycline-inducible lentiviruses circumvented some of
these problems.11–13 The system was further improved by
integration of a single inducible cassette with four reprogramming factors into mouse genome.14,15 Although the reprogramming efficiency of cells from such secondary mice is
higher (1–10%), several important questions regarding the
mechanistic underpinning of the process remain unanswered:
Why do only a small fraction of cells reprogram, while others
don’t? Which cell types can be effectively reprogrammed into
a pluripotent state? Is reprogramming a fundamentally
deterministic process that occurs in a well-defined temporal
order, or is it driven by random chains of stochastic events
triggered by the aberrant levels of transcription factors?16–18
To date, the majority of mechanistic studies of iPS cells
have used mouse embryonic fibroblasts (MEFs) as the
starting cell population, and it was assumed that MEFs are
rather well-defined and fairly homogeneous. The advantages
of fibroblasts are that they are relatively easy to derive and
reprogram. However, it has been shown that fibroblasts
represent a dynamic population of cells, exhibiting functional
heterogeneity among and within tissues.19,20 Moreover,
embryo-derived fibroblasts are expected to contain a wide
spectrum of progenitor cells with diverse properties that are
able to expand prior to terminal differentiation. The transcriptional profiles of partially reprogrammed cells, which exhibit
1
Department of Pathology, Stony Brook University, Stony Brook, NY 11794, USA and 2Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
*Corresponding author: A Nemajerova, Department of Pathology, Stony Brook University, 100 Nicolls Road, BST 9, L9, Room 136, Stony Brook, NY 11794, USA.
Tel: þ 1 631 444 3030; Fax: þ 1 631 444 2434; E-mail: [email protected]
Keywords: somatic cell reprogramming; induced pluripotent stem cells; iPSC
Abbreviations: FACS, fluorescence-activated cell sorting; GFP, green fluorescent protein; iPS cells, induced pluripotent stem cells; KSR, knockout serum replacement;
MEFs, mouse embryonic fibroblasts; OKSM, Oct4, Klf4, Sox2 and c-Myc
Received 30.12.11; revised 16.3.12; accepted 19.3.12; Edited by R De Maria; published online 27.4.12
Two-factor reprogramming of somatic cells
A Nemajerova et al
1269
similar features regardless of the starting cell type,21–24 may
support the notion of a stepwise process with well-defined
intermediate stages. However, the phenotypic complexity of
MEFs still has to be viewed as a confounding variable that
needs to be taken into account when evaluating highthroughput expression data from initially heterogeneous
populations.
In this study, we sought to address these questions and
determine whether different two- and three-factor combinations of Oct4, Sox2 and Klf4 induce the dedifferentiation of
MEFs into a unique progenitor-like state before they acquire
pluripotency. We also sought to prospectively identify and
isolate the most reprogrammable MEF populations. Our
results indicate that forced expression of Oct4 with either
Sox2 or Klf4 does not directly convert MEFs to a pluripotent
stem cell state. Instead, each combination of factors disrupts
the normal cellular homeostasis and establishes transient or
metastable states characterized by the concurrent expression
of mixed lineage markers. These cells convert into iPS cells in
a stochastic fashion.
Results
Efficient induction of iPS cells with two factors using
FACS (fluorescence-activated cell sorting)-sorted MEFs
and an optimized protocol. One of the biggest hurdles in
reprogramming somatic cells into pluripotent cells is the fact
that the process is inefficient and often incomplete. To
overcome this limitation, we generated REBNA episomal
retroviral vectors25 expressing human Myc, Klf4, Oct4 and
Sox2. With this optimized system, we were able to reprogram
into iPS cells up to 2% of freshly isolated MEFs using 4F
(OSKM) or 3F (OSK) combinations. We next assessed
whether the complexity of MEFs may influence the levels of
reprogramming. To this end, E12.5–E13.5 MEFs were
FACS-sorted into four phenotypically distinct subpopulations
based on the surface expression of Thy1 (CD90) and Sca1
(stem cell antigen 1) markers. Previous studies demonstrated that Thy1 expression defines lipofibroblastic or
myofibroblastic lineage,26 while Sca1 is a marker specifying
murine mesenchymal and epithelial progenitor cells.27 Our
microarray analyses confirmed that Sca1-single-positive
cells (SP) express genes broadly involved in embryonic
and tissue morphogenesis, thus indicating their immature
origin (Supplemental Table 1), whereas Thy1/Sca1-doublenegative (DN) cells selectively express markers of myofibroblast-committed progenitors (Supplemental Table 2).
Notably, the Thy1-positive fractions (composed of Thy1single positive (Thy1-SP) and Thy1/Sca1 double-positive
(DP) cells) represent the majority of MEFs (470%), while
Thy1-negative (Sca1-SP and DN) cells are a minority
(10–15% each, Figure 1a).
Freshly sorted MEFs were then seeded at equal densities
and transduced with different combinations of reprogramming
factors. The transduction efficiencies of the sorted MEF
fractions were equal, based on the levels of green fluorescent
protein (GFP) expression (data not shown) and ectopic factor
expression (Figure 1b). However, the frequency of iPS
colonies produced by each subpopulation differed
(Figure 1c). Thus, OSKM- or OSK-transduced DN and
Sca1-SP cells yielded significantly more iPS colonies than
the corresponding Thy1-SP or DP fractions (Figure 1c and
Supplemental Table 3). We found that Myc was fully
dispensable for the reprogramming of DN and Sca1-SP cells
(Figure 1c). Moreover, DN and Sca1-SP cells transduced with
2F combinations (OK or OS) also yielded significantly higher
numbers of iPS colonies compared to Thy1-SP and DP
fractions (Figure 1c). We calculated a reprogramming
efficiency of up to 2% for OSK-transduced, and B0.2% for
OK- and OS-transduced DN and Sca1-SP cells, respectively.
Colonies of iPS cells first appeared in OK- and 3F-transduced
MEFs (4 to 5 days post infection), followed by OS-transduced
cells (9 to 10 days post infection, Figures 1d and e). We
expanded over 100 independent OK and OS colonies for
further analysis. Genotyping of these iPS clones with
transgene-specific primers verified the integration of the
Oct4 and either Klf4 or Sox2 transgenes in OK and OS
infections, respectively (Supplemental Figure 1). None of the
clones examined (n ¼ 23) contained unwarranted integrated
transgenes (e.g., Myc), thereby ruling out the possibility of
cross-contamination (Supplemental Figure 1). Together,
these data indicate that DN and Sca1-SP cells represent
superior cell types for reprogramming and are accessible by
fewer genetic manipulations.
Two-factor-derived iPS cells are pluripotent. Most of our
2F colonies showed well-defined ES cell-like morphology
with sharp colony borders. The established iPS cells were
alkaline phosphatase-positive (100%), predominantly negative for Thy1 (90–100%), Sca1 (80–90%), CD34 (100%) and
CD133 (95–100%), and did express variable levels of ES cell
markers CDH1 (100%), EpCAM (100%), Kit (60–90%) and
SSEA1 (50–60%) (Figure 2a and data not shown). The
observed fractions of KIT-, Sca1- and SSEA1-negative iPS
cells likely reflect heterogeneous expression of these
markers, as previously reported for undifferentiated wild-type
ES cells.28,29 In a separate set of experiments, we generated
2F iPS cells from FACS-sorted MEFs containing the Oct4GFP reporter gene. These iPS cells homogeneously
expressed the GFP, confirming their undifferentiated status
(Figure 2b). Following subcutaneous injection into nude
mice, teratomas were formed by all clonal iPS cell lines
tested (n ¼ 20). Histological analysis revealed tissue
structures indicative of all three germ layers (Figure 2c).
We successfully obtained adult chimeric mice with independently produced OK (1 out of 1) and OS (1 out of 2) iPS cell
lines (Figures 2d and e). The degree of chimerism, as
assessed by coat color contribution, was wide-spread,
ranging from 50 to 90% in individual animals. Taken together,
these in vivo results indicate that 2F-derived iPS cells are
pluripotent.
Gene expression profiles of two-factor-transduced
MEFs. We next examined the expression profiles of each
Thy1/Sca1 subpopulation transduced with the reprogramming factors. Analysis of global gene expression patterns
demonstrated a strong correlation between transcriptomes of
OS-infected cells and those of OK-infected cells from the
Thy1-negative group (Figure 3a, top panels). In contrast, the
Cell Death and Differentiation
Two-factor reprogramming of somatic cells
A Nemajerova et al
1270
OSK
Sca1-SP
DN
Thy1-SP
DP
Sca1-SP
DN
Thy1-SP
DP
Control
18%
Oct4
58%
Thy1
Klf4
Sox2
11%
12%
MAPK
Sca1
OSKM
OSK
OK
OS
60
40
20
0
Unsorted
% iPS colonies
40
Thy1-SP
DP
OK
OSK
OS
30
% Cumulative number
Relative colony number
80
20
10
0
0
5
10
15
Days post infection
20
Sca1-SP
120
DN
OK
OSK
OS
100
80
60
40
20
0
0
5
10
15
Days post infection
20
Figure 1 Efficient induction of iPS cells with two factors. (a) Flow-cytomeric analysis of Thy1 and Sca1 expression by wild-type MEFs at passage 2. (b) Immunoblot
analysis of Oct4, Klf4 and Sox2 expression by sorted MEFs transduced with control or OSK retroviruses. MAPK is a loading control. (c) Relative efficiency of iPS cell
generation from total unsorted MEFs or Thy1-SP, DP, Sca1-SP and DN fractions transduced with the indicated retroviruses. The results represent the average of 10
experiments. Values are normalized relative to unsorted MEFs transduced with OK viruses. The error bars correspond to standard error. (d and e) Reprogramming kinetics of
DN cells measured as the percentage of newly produced iPS colonies per day (d) or the percentage of cumulative iPS colonies per day (e). Cells were transduced with the
indicated retroviruses. Colonies consisting of Z20 cells were counted daily. The results represent an average of two experiments performed in duplicate. The error bars
correspond to standard deviation
correlation was lower when the respective OS- versus OKinfections were compared (Figure 3a, bottom panels). As
expected, microarray analysis revealed that OK- versus OSmediated reprogramming elicited dissimilar effects on the
expression of many endogenous transcription factors. Thus,
combined expression of Oct4 and Klf4 altered the levels of
B120 endogenous transcription factors, while the combined
expression of Oct4 and Sox2 caused changes in the levels of
B90 endogenous transcription factors compared with the
respective uninfected controls (Figures 3b and c and
Supplemental Figures 2a and b). Only a small number of
endogenous transcription factors were similarly induced or
repressed in OK- and OS-transduced cells (Supplemental
Figure 2c). Notably, gene set enrichment analysis30 showed
Cell Death and Differentiation
that the OS combination of factors was more effective in
inducing genes associated with pluripotency (i.e., the previously identified Core, Myc and ESC-like gene modules31,32),
consistent with the notion that these two proteins bind
to DNA together (Supplemental Figure 3). However, expression of either OS or OK also resulted in the concurrent
induction of different or even conflicting lineage-specific
markers (Figure 3d). Thus, infection with OS viruses led to
upregulation of mesenchymal markers ESM1 (endothelial
cell-specific molecule 1), MSC (musculin) and SOSTDC1
(sclerostin domain containing 1, inhibitor of Wnt-mediated
signaling), but also neuronal genes, such as NES (Nestin),
RELN (reelin), SOX21 and TNN (tenascin N). Likewise, cells
infected with OK viruses upregulated mesenchyme-specific
Two-factor reprogramming of somatic cells
A Nemajerova et al
CD34
EpCam
EpCam
1271
CD133
CDH1
CD34
EpCam
EpCam
Oct4-GFP
Oct4-GFP
CD133
EpCam
N
N
C
N
P
P
G
C
R
G
P
N
Figure 2 Assessment of the quality of two-factor-produced iPS cells. (a) FACS analysis of two independent iPS clones derived from OS-transduced DN Oct4-GFP MEFs.
For the two panels on the right, each clone was gated on GFP-positive cells (R2). (b) GFP-positive ES-like colonies derived from OS-transduced Oct4-GFP-expressing MEFs.
(c) Teratoma formation by 2F iPS cell lines. Hematoxylin and eosin staining identified mature tissue derivatives of all three germ layers: squamous epithelium with keratotic
pearl (P, ectoderm); neuroepithelial cells (N, ectoderm); secretory epithelium (R, ciliated respiratory or G, glandular structures) (endoderm); and mature cartilage embedded in
fibrovascular stroma (C, mesoderm). (d and e) Chimeric mice with agouti coat color originating from an OS iPS clone (d) or OK iPS clone (e)
genes CD34, LY6A (Sca1), MSC and TEK (TEK tyrosine
kinase), the neuronal genes RELN and TNN, but also
the epithelial genes KERA, KRT14, KRT16 and
KRTDAP (keratocan, keratins 14, 16 and keratinocyte
differentiation-associated protein, respectively) (Figure 3d).
In sum, these data indicate that cells transduced with OSversus OK-expressing viruses differ in the molecular
mechanisms underlying the induction of pluripotency. The
observed alterations likely reflect the fact that, in addition to
their roles in stem cell maintenance, Oct4, Sox2 and Klf4 are
Cell Death and Differentiation
Two-factor reprogramming of somatic cells
A Nemajerova et al
DN OS
r = 0.989
Sca1-SP OS
r = 0.996
Sca1-SP OK
DN OK
Thy1-SP OS
r = 0.957
r = 0.896
DN OS
Sca1-SP OS
r = 0.944
DN OK
SOX21
NFE2L3
MSC
FOSB
NPAS4
FOXQ1
ETV1
EGR2
MEOX2
PROX1
MKX
MAFF
NR4A2
RB1
KLF4
ATF3
ETV4
CREB5
GLIS3
SIM2
EPAS1
NCOA2
OTX1
ESR1
CLOCK
ISL1
BHLHB5
BACH2
NOBOX
EBF4
MLX
SIX4
SATB2
FOXN1
BATF2
Thy1-SP OK
Thy1-/Thy1+
Sca1-SP OS
DN OS
DP OS
Sca1-SP OK
DN OK
Thy1-SP OK
Thy1-/Thy1+
Sca1-SP OK
DN OK
Thy1-SP OK
Thy1-/Thy1+
Sca1-SP OS
DN OS
DP OS
Sca1-SP OK
r = 0.977
Thy1-SP OS
Sca1-SP OS
1272
OTX1
MSC
CITED1
AHRR
TCFAP2B
MEOX2
EPAS1
TCF7
P42POP
LMX1A
TRP63
MLX
TCF3
GLIS2
TEAD3
ESR1
SOHLH1
NR5A2
NPAS4
NCOA2
ISL1
ZFP367
NFE2L3
ASCL2
CTNNB1
HOXC13
THRA
FOSB
DLX3
POU3F1
SIX4
KLF15
FOXA3
TBX4
TCFEB
MESDC1
ESM1
MSC
SOSTDC1
LY6A
TEK
CD34
CDH13
NGFR
NOTCH1
NES
SOX2
SOX21
TNN
TUBA8
RELN
CDH6
CDH1
TRP63
MET
KERA
KRT14
KRT16
KRTDAP
MYH1
MYH2
MYH11
MYL1
DES
TNNC2
TNNI2
Mesenchymal
Neuronal
Epithelial
Myogenic
Low
High
Expression (log2)
Figure 3 Gene expression profiles of two-factor-transduced MEFs. (a) Scatter plots of global gene expression patterns comparing Sca1-SP, DN and Thy1-SP cells 4 days
post infection with the indicated retroviruses. The degree of correlation was assessed using Pearson’s correlation coefficient (r). The central red line represents equal gene
expression. The outer red lines indicate twofold different expression. (b and c) Heat maps from microarray analyses showing induction of endogenous transcription factors
in Sca1-SP, DN, Thy1-SP and DP MEFs transduced with OS- (b) or OK-expressing retroviruses (c). Values are given relative to their own uninfected controls. Expression
profiles of the corresponding uninfected Sca1-SP and DN (summarily called ‘Thy1 # ’) versus Thy1-SP and DP (summarily called ‘Thy1 þ ’) cells are shown as controls.
(d) Simultaneous induction of lineage-specific genes in OS- and OK-transduced Sca1-SP, DN and Thy1-SP MEFs. Values are given relative to their own uninfected controls
lineage specifiers and promote mesodermal, neural and
epithelial differentiation, respectively.
Generation of iPS cells from Nestin-GFP MEFs. The
experiments described in Figures 1 and 2 show that DN and
Sca1-SP cells yield higher numbers of IPS colonies,
suggesting that Thy1-SP and DP cells represent inferior cell
types for nuclear reprogramming. In support of this, timecourse analysis of sorted MEFs showed that Thy1-SP and
DP fractions represent progressive differentiation stages
Cell Death and Differentiation
within MEF populations (Supplemental Figure 4). These
results suggest that in the course of reprogramming of total
unsorted MEFs, a significant proportion of 2F and 3F iPS
clones might be generated from immature DN and Sca1-SP
cells or their subsets thereof, whose reprogramming is more
efficient than that of Thy1-SP and DP cells. To test this idea,
we followed the reprogramming of embryonic fibroblasts
derived from mice expressing GFP under the control of the
endogenous Nestin promoter.33 Nestin has been extensively
studied as a marker for neural and mesenchymal stem/
Two-factor reprogramming of somatic cells
A Nemajerova et al
1273
progenitor cells.33,34 Our microarray analyses revealed that
both DN and Sca1-SP populations are enriched for Nestinpositive cells (Figure 3d and Supplemental Figure 5a). We
surmised that these Nestin-positive cells might represent a
virtually homogeneous subset of immature cells and thus be
tangible for mechanistic studies of reprogramming.
As reported,33 MEFs from Nestin-GFP embryos could be
sorted into three populations based on the levels of GFP
expression (GFP-bright, -dim and -negative, Figures 4a
and b). In several independent E12.5 MEF cultures, the
GFP-bright cells accounted for B2% of the total number of
cells (Figure 4a). A fraction of these cells expressed NGFR,
another surface marker that distinguishes Thy1-negative from
Thy1-positive cells (Figure 4a and Supplemental Figure 5b).
To test their respective reprogramming competence, each of
the Nestin-GFP fractions was infected with 2F or 3F
Sort Nestin-GFP MEFs
NGFR
Neg Dim Bright
R1
R2
R4
Infect with 3F or 2F viruses
R3
Maintain in culture;
Score and expand iPScolonies;
FACS analysis
IPS colonies/105
cells
2500
2000
1500
200
100
0
IPS colonies/ 105
cells
GFP
2000
total
negative
dim
bright
OSK
OK
OS
negative
GFP-bright
GFP-bright NGFR+
% GFP-positive
cells
1000
200
100
0
100
80
60
40
20
0
OSK
OK
OS
negative
dim
bright
1
OSK
2
OK
3
OS
Figure 4 Generation of iPS cells from Nestin-GFP MEFs. (a) FACS analysis of
GFP and NGFR expression by E12.5 Nestin-GFP MEFs. GFP-negative (R1), -dim
(R2) and -bright (R3 and R4) fractions are indicated. (b) Experimental design and
analysis of iPS clones derived from Nestin-GFP MEFs. (c) Reprogramming
efficiency of 3F- and 2F-transduced GFP-negative, -dim and -bright fractions. The
error bars correspond to standard error. (d) Reprogramming efficiency of 3F- and
2F-transduced GFP-negative, GFP-bright NGFR-positive and GFP-bright NGFRnegative MEFs. (e) Induction of GFP expression 7 days post transduction with OSK,
OK or OS viruses. Transduction with OSK induces strong expression of GFP
(i.e., Nestin) in both GFP-dim and GFP-negative cells. A representative of
two experiments is shown
retroviruses as a reference (Figure 4b). The iPS colonies
were detected based on their ES cell-like morphology and
alkaline phosphatase staining. Notably, the efficiency of 2F
reprogramming of GFP-bright cells was at least 5–10 times
higher than that of GFP-negative cells (Figure 4c). Furthermore, Nestin-SP cells were more reprogrammable compared
to their NGFR-positive counterparts (Figure 4d). In addition,
NGFR expression was virtually lost after transduction of GFPbright cells with OK or OSK retroviruses (see below), whereas
the Nestin-GFP transgene remained transcriptionally active
regardless of which reprogramming factors were used
(Figure 4e). Transduction with OSK retroviruses and, to a
lesser extent, with OS or OK viruses also induced strong
expression of GFP (i.e., Nestin) in both GFP-dim and GFPnegative cells (Figure 4e and Supplemental Figure 6). Thus,
the acquired expression of Nestin by cells mirrors their ability
to convert into iPS cells (Figure 4c).
We next assessed the temporal induction patterns of
lineage-specific markers during the reprogramming process.
To this end, GFP-bright, -dim and -negative cells were
infected with 2F or 3F retroviruses. On days 1, 4, 7 and 11
post infection and after becoming stable iPS cell lines, they
were stained with antibodies for the tissue- and lineagespecific surface proteins and scored by FACS on a single-cell
basis (Figure 5a and data not shown). Consistent with our
earlier results, this analysis again revealed that induction of
reprogramming by OK versus OS viruses generates phenotypically dissimilar cells. Thus, uninfected GFP-bright cells
were predominantly negative for Thy1 (80%) and Sca1 (70%),
and showed no significant expression of CD34, CD133, KIT,
SSEA1, CDH1 (E-cadherin) or EpCAM (Figure 5a). Transduction with OK or OSK viruses caused transient expression
of Thy1, Sca1, CD34 and CD133, but early loss of NGFR
(Figure 5a). Subsets of these cells concurrently expressed
CD34 and CD133 (Figure 5a). In contrast, cells transduced
with OS viruses retained significant NGFR expression but
showed no induction of CD34 or CD133 even after 11 days in
culture (Figure 5a). Only small fractions of these cells (1–5%)
showed expression of KIT, SSEA1, CDH1 and EpCAM prior to
their conversion into iPS cells (Figure 5a).
Western blot analysis confirmed these results. Thus,
transduction of sorted MEFs with OS, OK or OSK viruses
caused little or no changes in the expression levels of CDH1,
EpCAM or KIT genes during initiation of the reprogramming
event (Figure 5b). Whether the small amount of CDH1 protein
induced in OK and OSK cells (but not in OS cells) is present at
the cell surface rather than in the cytosol cannot be
determined with certainty, given the low score by FACS
(Figure 5a). Furthermore, quantitative reverse transcriptionpolymerase chain reaction (qRT-PCR) revealed that mRNAs
for E cadherin, EpCAM and KIT were only partially induced
compared to iPS cells (Figure 5c and Supplemental Figure 7).
On the other hand, the induced expression of pluripotency
transcription factors Sox2 and, to a lesser extent, Sall4 and
Zic3 either preceded or closely paralleled the later induction of
epithelial and ES cell markers (Figure 5c). Accordingly, OSand OSK-transduced cells expressed total (endogenous and
exogenous) Sox2 protein at levels similar to those present in
iPS and wild-type ES cells (Figure 5b). In sum, a side-by-side
comparison of 2F and 3F transductions demonstrates that
Cell Death and Differentiation
Two-factor reprogramming of somatic cells
A Nemajerova et al
1274
ES
1000
800
600
400
40
60
80
100 %
EpCam
CD34
CD133
20
0
300
20
CDH1
40
Relative expression
0
Relative expression
Nestin
Thy1
Sca1
CD34
CD133
NGFR
KIT
CDH1
EpCAM
SSEA1
Thy1+Sca1+
Thy1-Sca1CD34+CD133+
EpCAM+CD133+
KIT+SSEA1+
CD34+NGFR+
CD34+CDH1+
NGFR+CD133+
OSK
OK
OS
iPS
OSK
OK
OS
11d
OSK
OK
OS
Contr
4d
OS OK OSK
OS OK OSK
OS OK OSK
OS OK OSK
Sox2
Zic3
Oct4
Klf4
OS OK OSK
OS OK OSK
OS OK OSK
OS OK OSK
200
100
20
10
0
GFP-neg
GFP-neg (D1)
GFP-neg (D11)
GFP-bright
GFP-bright (D1)
GFP-bright (D11
iPS
Contr
OS
OK
OSK
iPS
ESC
Nestin-GFP Contr
GFP-neg
GFP-neg (D1)
GFP-neg (D11)
GFP-bright
GFP-bright (D1)
GFP-bright (D11
iPS
3000
EpCAM
KIT
Oct4
Relative expression
CDH1
Sall4
Tbx3
Myc
Nanog
2000
1000
20
10
GFP-neg
GFP-neg (D1)
GFP-neg (D11)
GFP-bright
GFP-bright (D1)
GFP-bright (D11
iPS
0
Sox2
OS OK OSK
OS OK OSK
OS OK OSK
OS OK OSK
< Klf4
MAPK
Figure 5 Multilineage pattern of gene expression parallels induction of pluripotency. (a) Schematic heat map showing the percentage of Nestin-GFP-bright cells with
co-expression of lineage-specific genes as determined by FACS analysis. The analysis before (Control) and after transduction with OS, OK or OSK viruses is shown at the
indicated time points. Expression profiles of iPS and WT ES cells are shown for comparison. Color scale indicates the percentage of cells that express the indicated markers.
(b) Immunoblot analysis of Nestin-GFP-bright MEFs before (Control) and 7 days after transduction with OS, OK or OSK viruses. Expression profiles of iPS and WT ES cells are
shown for comparison. MAPK is a control for equal loading. (c) Quantitative real-time reverse transcription–polymerase chain reaction (qRT-PCR) analysis of the indicated
endogenous genes in Nestin-GFP-negative (yellow group) and Nestin-GFP-bright MEFs (blue group) before and 1 or 11 days after transduction with OS, OK or OSK viruses.
Data were normalized to the corresponding HPRT value to obtain relative changes in the indicated mRNAs. Mouse-specific primers that did not cross-react with the exogenous
human counterparts were used for Oct4, Sox2 and Klf4
forced expression of Oct4 with either Sox2 or Klf4 does not
directly convert MEFs to a pluripotent state. Instead, each
combination of factors disrupts the normal cellular homeostasis and establishes transient states characterized by the
concurrent expression of mixed lineage markers. These cells
convert into iPS cells in a stochastic fashion.
Discussion
Recent studies in fibroblasts suggest that reprogramming to
pluripotency follows a well-organized temporal sequence of
molecular and cellular events termed initiation, maturation
and stabilization.29,35,36 This sequence was proposed to entail
a rate-limiting step of mesenchymal-to-epithelial transition
(MET) that marks initiation.23,24 According to this model, the
acquisition of epithelial features is accompanied by the loss of
differentiated cell characteristics and massive downregulation
of somatic genes.23,24,29,35 A subset of cells will then initiate
the embryonic transcriptome, and, finally, upregulated
pluripotency genes establish an ES cell-like expression
Cell Death and Differentiation
program.23,24,29,35 Although this concept is appealing,
the question of why the majority of cells fail to reprogram
persists.
Several insights emerge from our study. We performed a
systematic molecular and functional dissection of primary
MEFs. First, we show that MEFs, independent of strain
background, are heterogeneous with respect to reprogramming competency and can be FACS-sorted into distinct
subpopulations based on differential expression of cell
surface markers. Importantly, these subpopulations yield
significantly different efficiencies in iPS generation with
different combinations of reprogramming factors. Specifically,
Thy1-SP and DP cells, which constitute the vast majority of
the unsorted MEF population, represent a poor target for iPS
production, whereas the preexisting DN and Sca1-SP
subpopulations represent the more reprogrammable cell
types. Thus, Nestin-GFP-positive cells, which represent a
distinct subpopulation of DN and Sca1-SP cells, can be
effectively reprogrammed with only two factors, Oct4/Klf4 and
Oct4/Sox2.
Two-factor reprogramming of somatic cells
A Nemajerova et al
1275
Second, we find that, compared to OKSM fibroblasts, our
three-partite system (OS, OK and OSK) enables a more
comprehensive and unbiased view of the reprogramming
process. We show that Sox2 stands out as the most
responsive endogenous pluripotency factor during early
reprogramming stages. Moreover, the sustained expression
of Sox2 precedes the acquisition of epithelial and ES cell-like
phenotypes in cells that undergo reprogramming. We find it
plausible that a subset of Nestin-GFP-positive cells expresses
sufficiently high levels of endogenous Sox2, and thus requires
fewer exogenous factors. However, it is unlikely that these
cells may still be heterogeneous and contain multiple
subpopulations, one responding to overexpression of Sox2
and the other responding to Klf4. Given that Klf4 is required for
the activation of epithelial gene expression during somatic cell
reprogramming,23,24 we speculate that a proportion of Sox2overexpressing cells are able to bypass an early Klf4dependent phase. Thus, notwithstanding the differences in
microarray profiling, our data suggest that there is a partial
functional redundancy between Sox2 and Klf4 in the disruption of cellular homeostasis and activation of pluripotencyrelated gene networks. The partial overlap between these
factors could explain why Oct4/Sox2 and Oct4/Klf4-mediated
reprogramming had the same degree of efficiency (B0.2%),
but was an order of magnitude higher with three factors
combined.
One important question is whether in our system all
transduced genes were expressed at levels sufficient to
achieve the optimal reprogramming efficiency. We addressed
this in the following way. The distribution of retroviralmediated gene expression is essentially stochastic.37 Simple
calculations show that for 2% of cells to be effectively
reprogrammed with three factors, at least 27% of them would
have contained an optimal amount of each of these factors
(i.e., 0.273 ¼ 0.02). Hence, B7% of cells (i.e., 0.272) would
have expressed optimal levels of each of the two factors,
Oct4/Klf4 or Oct4/Sox2. Given that 0.2% of cells became
reprogrammed, this corresponds to a reprogramming efficiency of B3% of cells with optimal expression of both factors
(i.e., 0.2 $ 100/7). Thus, we conclude that the expression
levels of exogenous factors did not have a rate-limiting role
during iPS induction in this system.
In sum, our results support a version of the stochastic model
in which cells of different degrees of differentiation have the
potential to generate iPS cells, albeit with very different
kinetics and the requirement of three rather than two factors
for reprogramming of more differentiated cells. The main
difference between undifferentiated and differentiated cells, in
this respect, is that the former are less stable at high levels of
internal or external stimuli.38,39 Our results infer that regardless of the cell type the net result of stochastic transitions is the
establishment of a precarious balance between ‘supply’ and
‘demand’ for transcription factors that have explicit lineagespecifying activities.39 Based on this, we conclude that forced
expression of Oct4 with either Sox2 or Klf4 does not directly
convert MEFs to a pluripotent stem cell state. Instead,
each combination of factors disrupts the normal cellular
homeostasis and establishes transient states characterized
by the concurrent expression of mixed lineage markers
(Supplemental Figure 8). Viewed in this light, the MET
observed in four-factor reprogramming might be just one of
many possible transient stages.
Materials and Methods
All cells were cultured on 0.1% gelatinized tissue culture plates. Wild-type mice
(Jackson Laboratory, Bar Harbor, ME, USA) were maintained on the 129/Sv
background. Oct4-GFP mice (Jackson Laboratory) and Nestin-GFP transgenic
mice33 were maintained on a mixed 129/B6 genetic background. MEFs were
derived from E12.5–E13.5 embryos using standard procedures. iPS induction was
performed according to Li et al.40 Retroviral vectors encoding human Oct4, Sox2,
Klf4 and c-Myc were transfected into packaging Phoenix E cells using Lipofectamine
reagent (Invitrogen, Grand Island, NY, USA). Viral supernatants were collected and
filtered through a 0.45-mm cellulose acetate filter. MEFs were plated at a density of
2 $ 105 cells per 6-cm plate and incubated with the viral supernatant overnight.
After four successive infections, cells were switched to knockout serum replacement
(KSR) medium consisting of DMEM/F12 supplemented with 10% KSR (Invitrogen),
1 $ nonessential amino acids, 1 $ Glutamine, 1 $ Pen/Strep, 0.1 mM
b-mercaptoethanol and 1000 U/ml leukemia inhibitory factor (Millipore, Billerica,
MA, USA). The transduction efficiency of MEFs was determined by extrapolation
from infection with a GFP-expressing control virus, which was conducted in parallel.
In order to improve transduction efficiencies, the proviral parts of pMX vectors
contained within the corresponding BspH1 sites were subcloned into episomal
vector REBNA.25 Molecular, histological, microarray and other analyses were performed
as described in Supplemental Experimental Procedures, Figures and Tables.
Conflict of Interest
The authors declare no conflict of interest.
Acknowledgements. This work was supported by NYSTEM contract no.
N08T-040 to UMM and OP. AN was supported by a fellowship from the Lymphoma
Research Foundation.
1. Hanna JH, Saha K, Jaenisch R. Pluripotency and cellular reprogramming: facts,
hypotheses, unresolved issues. Cell 2010; 143: 508–525.
2. Stadtfeld M, Hochedlinger K. Induced pluripotency: history, mechanisms, and applications.
Genes Dev 2010; 24: 2239–2263.
3. Takahashi K, Yamanaka S. Induction of pluripotent stem cells from mouse embryonic and
adult fibroblast cultures by defined factors. Cell 2006; 126: 663–676.
4. Maherali N, Sridharan R, Xie W, Utikal J, Eminli S, Arnold K et al. Directly reprogrammed
fibroblasts show global epigenetic remodeling and widespread tissue contribution. Cell
Stem Cell 2007; 1: 55–70.
5. Wernig M, Meissner A, Foreman R, Brambrink T, Ku M, Hochedlinger K et al. In vitro
reprogramming of fibroblasts into a pluripotent ES-cell-like state. Nature 2007; 448:
318–324.
6. Yu J, Vodyanik MA, Smuga-Otto K, Antosiewicz-Bourget J, Frane JL, Tian S et al.
Induced pluripotent stem cell lines derived from human somatic cells. Science 2007;
318: 1917–1920.
7. Kim JB, Zaehres H, Wu G, Gentile L, Ko K, Sebastiano V et al. Pluripotent stem cells
induced from adult neural stem cells by reprogramming with two factors. Nature 2008; 454:
646–650.
8. Tsai SY, Clavel C, Kim S, Ang YS, Grisanti L, Lee DF et al. Oct4 and klf4 reprogram dermal
papilla cells into induced pluripotent stem cells. Stem Cells 2010; 28: 221–228.
9. Kim JB, Greber B, Arauzo-Bravo MJ, Meyer J, Park KI, Zaehres H et al. Direct
reprogramming of human neural stem cells by OCT4. Nature 2009; 461: 649–3.
10. Tsai SY, Bouwman BA, Ang YS, Kim SJ, Lee DF, Lemischka IR et al. Single transcription
factor reprogramming of hair follicle dermal papilla cells to induced pluripotent stem cells.
Stem Cells 2011; 29: 964–971.
11. Wernig M, Lengner CJ, Hanna J, Lodato MA, Steine E, Foreman R et al. A drug-inducible
transgenic system for direct reprogramming of multiple somatic cell types. Nat Biotechnol
2008; 26: 916–924.
12. Markoulaki S, Hanna J, Beard C, Carey BW, Cheng AW, Lengner CJ et al. Transgenic
mice with defined combinations of drug-inducible reprogramming factors. Nat Biotechnol
2009; 27: 169–171.
13. Eminli S, Utikal J, Arnold K, Jaenisch R, Hochedlinger K. Reprogramming of neural
progenitor cells into induced pluripotent stem cells in the absence of exogenous Sox2
expression. Stem Cells 2008; 26: 2467–2474.
14. Stadtfeld M, Maherali N, Borkent M, Hochedlinger K. A reprogrammable mouse strain from
gene-targeted embryonic stem cells. Nat Methods 2010; 7: 53–55.
Cell Death and Differentiation
Two-factor reprogramming of somatic cells
A Nemajerova et al
1276
15. Carey BW, Markoulaki S, Beard C, Hanna J, Jaenisch R. Single-gene transgenic mouse
strains for reprogramming adult somatic cells. Nat Methods 2010; 7: 56–59.
16. Hanna J, Saha K, Pando B, van Zon J, Lengner CJ, Creyghton MP et al. Direct cell
reprogramming is a stochastic process amenable to acceleration. Nature 2009; 462:
595–601.
17. Hanna J, Carey BW, Jaenisch R. Reprogramming of somatic cell identity. Cold Spring Harb
Symp Quant Biol 2008; 73: 147–155.
18. Nagy A, Nagy K. The mysteries of induced pluripotency: where will they lead? Nat Methods
2010; 7: 22–24.
19. Koumas L, Smith TJ, Feldon S, Blumberg N, Phipps RP. Thy-1 expression in human
fibroblast subsets defines myofibroblastic or lipofibroblastic phenotypes. Am J Pathol 2003;
163: 1291–1300.
20. Phan SH. Biology of fibroblasts and myofibroblasts. Proc Am Thorac Soc 2008; 5:
334–337.
21. Mikkelsen TS, Hanna J, Zhang X, Ku M, Wernig M, Schorderet P et al. Dissecting direct
reprogramming through integrative genomic analysis. Nature 2008; 454: 49–55.
22. Sridharan R, Tchieu J, Mason MJ, Yachechko R, Kuoy E, Horvath S et al. Role
of the murine reprogramming factors in the induction of pluripotency. Cell 2009; 136:
364–377.
23. Li R, Liang J, Ni S, Zhou T, Qing X, Li H et al. A mesenchymal-to-epithelial transition
initiates and is required for the nuclear reprogramming of mouse fibroblasts. Cell Stem Cell
2010; 7: 51–63.
24. Samavarchi-Tehrani P, Golipour A, David L, Sung HK, Beyer TA, Datti A et al. Functional
genomics reveals a BMP-driven mesenchymal-to-epithelial transition in the initiation of
somatic cell reprogramming. Cell Stem Cell 2010; 7: 64–77.
25. Petrenko O, Beavis A, Klaine M, Kittappa R, Godin I, Lemischka IR. The molecular
characterization of the fetal stem cell marker AA4. Immunity 1999; 10: 691–700.
26. Sanders YY, Kumbla P, Hagood JS. Enhanced myofibroblastic differentiation and survival
in Thy-1(-) lung fibroblasts. Am J Respir Cell Mol Biol 2007; 36: 226–235.
27. Holmes C, Stanford WL. Concise review: stem cell antigen-1: expression, function, and
enigma. Stem Cells 2007; 25: 1339–1347.
28. Gu B, Zhang J, Wang W, Mo L, Zhou Y, Chen L et al. Global expression of cell surface
proteins in embryonic stem cells. PLoS One 2010; 5: e15795.
29. Stadtfeld M, Maherali N, Breault DT, Hochedlinger K. Defining molecular cornerstones
during fibroblast to iPS cell reprogramming in mouse. Cell Stem Cell 2008; 2: 230–240.
30. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA et al. Gene set
enrichment analysis: a knowledge-based approach for interpreting genome-wide
expression profiles. Proc Natl Acad Sci USA 2005; 102: 15545–15550.
31. Wong DJ, Liu H, Ridky TW, Cassarino D, Segal E, Chang HY. Module map of stem
cell genes guides creation of epithelial cancer stem cells. Cell Stem Cell 2008; 2:
333–344.
32. Kim J, Woo AJ, Chu J, Snow JW, Fujiwara Y, Kim CG et al. A Myc network accounts for
similarities between embryonic stem and cancer cell transcription programs. Cell 2010;
143: 313–324.
33. Mignone JL, Kukekov V, Chiang AS, Steindler D, Enikolopov G. Neural stem and
progenitor cells in nestin-GFP transgenic mice. J Comp Neurol 2004; 469: 311–324.
34. Mendez-Ferrer S, Michurina TV, Ferraro F, Mazloom AR, Macarthur BD, Lira SA et al.
Mesenchymal and haematopoietic stem cells form a unique bone marrow niche. Nature
2010; 466: 829–834.
35. Brambrink T, Foreman R, Welstead GG, Lengner CJ, Wernig M, Suh H et al. Sequential
expression of pluripotency markers during direct reprogramming of mouse somatic cells.
Cell Stem Cell 2008; 2: 151–159.
36. Smith ZD, Nachman I, Regev A, Meissner A. Dynamic single-cell imaging of direct
reprogramming reveals an early specifying event. Nat Biotechnol 2010; 28: 521–526.
37. Nash KL, Lever AM. Green fluorescent protein: green cells do not always indicate gene
expression. Gene Ther 2004; 11: 882–883.
38. MacArthur BD, Please CP, Oreffo RO. Stochasticity and the molecular mechanisms of
induced pluripotency. PLoS One 2008; 3: e3086.
39. Loh KM, Lim B. A precarious balance: pluripotency factors as lineage specifiers. Cell Stem
Cell 2011; 8: 363–369.
40. Li H, Collado M, Villasante A, Strati K, Ortega S, Canamero M et al. The Ink4/Arf locus is a
barrier for iPS cell reprogramming. Nature 2009; 460: 1136–1139.
Supplementary Information accompanies the paper on Cell Death and Differentiation website (http://www.nature.com/cdd)
Cell Death and Differentiation
Supplemental Experimental Procedures
Mice and cell culture
All animal studies were approved by the Institutional Animal Care and Use Committee at Stony
Brook University. Wild-type mice (Jackson Laboratory) were maintained on the 129/Sv
background. Oct4-GFP mice (Jackson Laboratory) Nestin-GFP transgenic mice were
maintained on a mixed 129/B6 genetic background. MEFs were derived from E12.5-E13.5
embryos using standard procedures. Mouse Oct4-GFP ES cells were kindly provided by
Antonella Galli (Columbia University). Primary MEFs were cultured in DMEM/10%FBS on 0.1%
gelatinized tissue culture plates. ES and iPS cells were cultured inKSR medium consisting of
F12 DME supplemented with 10% KSR (Invitrogen), 1 x nonessential amino acids, 1 x
Glutamine, 1 x Pen/Strep, 0.1 µM ȕ-mercaptoethanol, and 1000 µ/ml LIF (Millipore).
For teratoma assays, 1 x106 iPS cells were resuspended in Matrigel (Invitrogen) and
injected subcutaneously into CD1 athymic nude mice (Harlan). When tumors reached 1 cm3 in
size (6 to 8 weeks), they were harvested and processed histologically. For blastocyst injection,
4-week-old C57BL/6N (Taconic) female mice were superovulated by administration of 5 IU of
pregnant mare serum gonadotropin and 5 IU of human chorionic gonadotrophin, and then
mated with C57Bl/6N males. All blastocysts were collected at day 3.5 after detection of vaginal
plugs and flushed in M2 medium (Millipore). The blastocysts were washed in M2 medium and
cultured in KSOM+AA medium (Millipore) at 370C, 5% CO2 incubator. Five days before
injection, the iPS cells were thawed and passaged once. Ten to fifteen iPS cells were injected
into each blastocyst for generation of chimeric mice.
Flow cytometry
Cells were harvested using Accutase (Sigma), stained with anti-Sca1, Thy1.2, CD34, CD133,
EpCAM, CDH1, SSEA1, NGFR and c-Kit antibodies (BD Pharmingen or eBioscience), and
analyzed using FACS Calibur (Becton-Dickinson) and CellQuest software. Hematopoietic cells
(Mac1- and CD45-positive) were excluded from the sorting gates.
Expression analyses
Total cellular RNA was extracted using the RNeasy Mini Kit (Quiagen). One-color hybridizations
of cRNAs (2 technical replicas) were performed against the mouse Ilumina MouseRef-8
Expression BeadChip 25K microarray according to the manufacturer’’s instructions by Empire
Genomics LLC (Buffalo NY). Microarray signals were processed with GenomeStudio Gene
Expression Module (GSGX) Version 1.6.0. Data were background corrected and quantile
normalized. Differentially expressed genes were defined as > 2-fold change. For scatter plot
analyses, Pearson correlation coefficients between two samples were calculated using
normalized signals. Scatter plots were generated using log2 transformed gene expression
values. The heatmaps were generated by calculating ratios of expression in each sample
versus control. The log2 values were then supplied to the heatmap function of the R statistical
package. Module expression analysis was conducted as describe in (Kim et al., 2010). Average
gene expression values (log2) of all genes were set as baseline 0. The gene expression values
(log2) of each module relative to the overall average were represented as mean ± SEM.
Definition of each module is as follows: the Core module is composed of genes co-occupied by
at least seven factors among nine factors shown in the Core cluster (Smad1, Stat3, Klf4, Oct4,
Nanog, Sox2, Nac1, Zfp281, and Dax1) (Kim et al., 2010); the ESC-like module (Wong et al.,
2008); the Myc module comprises targets of Myc, Max, N-Myc, Dmap1, E2F1, E2F4, and Zfx
(Kim et al., 2010); the PRC module is composed of targets of PRC cluster proteins, Phc1, Rnf2,
Eed and Suz12 (Kim et al., 2010). We confirmed reproducibility of our microarray by quantitative
real-time PCR, using RNA samples that had been independently isolated from newly infected
MEFs. RT-PCR was performed using total RNAs and gene-specific primers. The sequences of
primers are listed in Supplemental Table 4.
Western blotting
Western analyses were performed on a routine basis. Representative results are shown. Briefly,
cells were lysed in buffer containing 10ௗmM Tris (pH 7.4), 150ௗmM NaCl, 10% glycerol, 1%
Triton, 1ௗmM EDTA, 0.1% SDS and protease inhibitor cocktail (Roche). Aliquots of whole-cell
lysates (50––80 ȝg of protein) were separated on SDS––acrylamide gels and blotted onto BA85
nitrocellulose membranes (Schleicher & Schuell). They were then incubated with antibodies
specific for Oct4 (ab19857), Klf4 (ab34814, both from Abcam); Sox2 (AB5603, Millipore), c-Kit
(DI3A2, Cell Signaling), CDH1 (610404, BD Pharmingen), EpCam (E144, Novus), and Erk1,2
(05-157, Upstate).
Supplemental Figures
Suppl. Figure 1.
Genomic PCR to detect exogenous transgene integration in OS-generated (A) or OK-generated
(B) iPS clones. Mouse ES cells were used as negative controls.
Suppl. Figure 2.
A, B. Heat maps from microarray analyses showing induction of endogenous transcription
factors in Sca1-SP, DN, Thy1-SP and DP MEFs transduced with OS- (A) or OK-expressing
retroviruses (B). Values are given relative to their own uninfected controls. Expression in
corresponding uninfected Sca1-SP and DN (summarily called ‘‘Thy1-‘‘) versus Thy1-SP and DP
(summarily called ‘‘Thy1+’’) cells are shown as controls.
C. Heat maps of shared transcription factor genes in OS- and OK-transduced Thy1-negative
(Sca1-SP and DN) and Thy1-positive (Thy1-SP or DP) cells.
Suppl. Figure 3.
Average gene expression values of the ESC-like, Core pluripotency, Polycomb repressive
complex (PRC) and Myc transcriptional modules in control uninfected Thy1-negative (Sca1-SP
and DN) and Thy1-positive (Thy1-SP and DP) cells as well as in Sca1-SP, DN and Thy1-SP
subpopulations transduced with OS or OK retroviruses. Microarray analyses were performed 4
days post infection with the indicated retroviruses. Data are represented as mean +/- SEM.
Suppl. Figure 4.
The Thy1+ fractions (composed of Thy1+ single-positive, Thy1-SP; and Thy1+Sca1+, double
positive, DP cells) represent the majority of MEFs (>70%), while Thy1-negative fractions
(composed of Sca1+ single-positive, Sca1-SP; and Thy1-Sca1-, double negative, DN cells)
represent a relative minority (A). When sorted cells were maintained under standard culture
conditions for 7 days, Sca1-SP MEFs gradually lost their phenotype and converted to DN and
Thy1-SP cells (A). Likewise, DP cells largely lost the expression of Sca1, becoming Thy1-SP
cells, and the same drift was seen in DN cells (A). In contrast, the Thy1-SP fraction was stable
in retaining the corresponding phenotype over extended periods of time (A). Thus, DP, DN and
Sca1-SP cells spontaneously gravitate towards acquisition of Thy1 single positivity. Retroviral
expression of Klf4, alone or in combination with Oct4 and Sox2 rendered all fractions positive for
Sca1, thereby reducing the proportion of DN and Thy1-SP cells (B). On the other hand,
retroviral expression of Oct4 and Sox2 either alone or in combination with each other, produced
no significant changes (4B and data not shown). Thus, the sorted MEF fractions acquire
different phenotypic features depending on whether they were infected with OK versus OS
viruses or other factor combinations.
Suppl. Figure 5.
A. FACS analysis of Nestin-GFP+ MEFs. Nestin-GFP+ cells (left) were gated (R1) and analyzed
for Thy1 expression (right). The majority of Nestin-GFP+ cells are Thy1-negative (81%).
B. FACS analysis of NGFR+ MEFs. Nestin-GFP+NGFR+ cells (left) were gated (R1) and
analyzed for Thy1 expression (right). The majority of NGFR+ cells are Thy1-negative (79%).
Suppl. Figure 6.
A. Western blot analysis of GFP expression in FACS-sorted Nestin-GFP-negative MEFs. Cells
were analyzed 4 days post-infection with empty control retroviruses (Mock) or OS-, OK-, or
OSK-expressing viruses. Note that expression of GFP (i.e., Nestin) is significantly reduced upon
conversion of cells into iPS cells (OS-generated iPS colony is shown). ES cells derived from
Oct4-GFP transgenic mice are shown for control. MAPK is a control for equal loading.
B. Induction of GFP expression in FACS-sorted Nestin-GFP-negative MEFs transduced with
retroviruses as in (A). Cells were analyzed by FACS on days 4, 7, 11, and 14 post-infection.
Suppl. Figure 7.
A. Quantitative real-time RT-PCR analysis of gene expression levels in Nestin-GFP negative
(yellow group) and Nestin-GFP bright MEFs (blue group). Cells were analyzed before and 1 or
11 days after transduction with OS, OK or OSK viruses. Data were normalized to the
corresponding HPRT value to obtain relative changes in the indicated mRNAs.
B. Quantitative real-time RT-PCR analysis of NGFR expression levels in total unsorted NestinGFP MEFs and sorted populations of Nestin-GFP+NGFR+ and Nestin-GFP-NGFR- MEFs.
Cells were analyzed before and 1 or 11 days after transduction with OS, OK or OSK viruses.
Data were normalized to the corresponding HPRT value to obtain relative expression changes.
Suppl. Figure 8.
Schematic of reprogramming stages: disruption of cellular homeostasis (A), cell switch to
transient states characterized by concurrent expression of mixed lineage markers (B), activation
of pluripotency networks (C) and the conversion into IPS cells (D). The OSK and OS
combinations of factors are more efficient than OK in activation of pluripotency networks, while
the OSK and OK combinations are more efficient and faster than OS in inducing ES cell-specific
morphological changes.
Clone ID
B
mES cells
2 OSKM
1 OSKM
10 OK
9 OK
8 OK
7 OK
6 OK
5 OK
4 OK
3 OK
2 OK
H2O
mES cells
2 OSKM
1 OSKM
13 OS
12 OS
11 OS
10 OS
9 OS
8 OS
7 OS
6 OS
5 OS
4 OS
3 OS
2 OS
1 OS
Clone ID
1 OK
A
Oct4
Sox 2
Klf4
Myc
Oct 4
Klf4
Sox 24
Myc
Nemajerova et al., Suppl. Figure 1
Thy1-/Thy1+
Sca1-SP OS
DN OS
DP OS
Thy1-/Thy1+
Sca1-SP OK
DN OK
Thy1-SP OK
B
Shared factors
OK responsive factors
OS responsive factors
A
Thy1-/Thy1+
Sca1-SP OS
DN OS
DP OS
Sca1-SP OK
DN OK
Thy1-SP OK
C
Low
Expression (log2)
High
Nemajerova et al., Suppl. Figure 2
Thy1-SP/OK
DN/OK
Sca1-SP/OK
Thy1-SP/OS
DN/OS
Sca1-SP/OS
Thy1-positive
Thy1-negative
Log2 Average expression
ESC-like
Core
PRC
Myc
0.6
0.4
0.2
0.0
-0.2
-0.4
-0.6
1
2
3
4
5
6
7
8
X Data
Control
Oct4 + Sox2
Oct4 + Klf4
Nemajerova et al., Suppl. Figure 3
Sorted
3%
7d culture
1%
24%
1%
Klf4
5%
Nemajerova et al., Suppl. Figure 4
A
Oct4 + Klf4
10%
8%
9%
DN
93%
72%
2%
3%
27%
58%
38%
46%
1%
2%
25%
2%
3%
14%
6%
12%
2%
95%
65%
7%
13%
70%
26%
56%
15%
82%
81%
9%
16%
74%
27%
64%
2%
1%
10%
0%
4%
6%
6%
3%
97%
1%
93%
2%
28%
50%
36%
42%
1%
0%
5%
0%
7%
15%
9%
13%
Sca1-SP
DP
B
Sorted
3%
7d culture
1%
24%
1%
Sca1
Oct4 + Sox2
17%
2%
DN
93%
72%
3%
2%
76%
6%
1%
2%
25%
2%
19%
2%
2%
95%
65%
7%
70%
8%
15%
82%
81%
9%
65%
19%
2%
1%
10%
0%
14%
2%
97%
1%
93%
2%
78%
7%
1%
0%
5%
0%
14%
1%
Sca1-SP
DP
Thy1
Thy1-SP
Sca1
Thy1
Thy1-SP
A
Thy1
Thy1
R1
6%
13%
68%
13%
GFP
Sca1
R1
GFP
NGFR
NGFR
B
79%
21%
Thy 1
Nemajerova et al., Suppl. Figure 5
Mock
OS
OK
OSK
iPS
ES Oct4GFP
A
GFP-negative
GFP
OSK
OK
OS
4
7
11
14
100
80
60
40
20
0
4
7
11
14
% GFP-positive cells
Mock
4
7
11
14
B
4
7
11
14
MAPK
1
2
3
Days post-infection
4
Nemajerova et al., Suppl. Figure 6
Relative expression
A
KIT
24
TCF3
p53
ARF
GFP-neg
GFP-neg (D1)
GFP-neg (D11)
GFP-bright
GFP-bright (D1)
GFP-bright (D11)
iPS
22
20
4
2
0
0
1
2
3
4
5
6
7
8
OS OK OSK
OS OK OSK
GFP+NGFR+
GFP-NGFR-
X Data
9
10
11
OS OK OSK
12
13
14
15
16
OS OK OSK
Relative expression
B
1.2
Unsorted
Mock
OS
OK
OSK
0.8
Sorted
0.4
0.0
1
D1
2
3
D11 X Data D1
4
D11
Nemajerova et al., Suppl Figure 7
Primary cell
OSK, OS, or OK
Disrupted homeostasis
A
OSK or OK
OS
Transient states
B
OK
Activation of pluripotency networks
C
Acquisition of ES cell properties
D
Nemajerova et al., Suppl. Figure 8
Nemajerova et al., Suppl. Table 1
Term
Count
%
P Value
------------------------------------------------------------------------------------------------------------GO:0048598~embryonic morphogenesis
8
6.1
0.016666071
GO:0048729~tissue morphogenesis
8
6.1
0.001862836
GO:0060284~regulation of cell development
8
6.1
1.69E-04
GO:0048736~appendage development
5
3.8
0.011506968
GO:0035107~appendage morphogenesis
5
3.8
0.010242797
GO:0060348~bone development
5
3.8
0.011181983
GO:0008544~epidermis development
4
3.0
0.064658192
GO:0001654~eye development
5
3.8
0.028622431
GO:0007507~heart development
7
5.3
0.006052495
GO:0003007~heart morphogenesis
5
3.8
0.002142245
GO:0001822~kidney development
4
3.0
0.044328062
GO:0035108~limb morphogenesis
5
3.8
0.010242
GO:0060173~limb development
5
3.8
0.011506968
GO:0014031~mesenchymal cell development
3
2.3
0.046788171
GO:0048762~mesenchymal cell differentiation
3
2.3
0.050424149
GO:0030182~neuron differentiation
10
7.6
0.002676603
GO:0045664~regulation of neuron differentiation
7
5.3
1.04E-04
GO:0007423~sensory organ development
8
6.1
0.002867616
GO:0001501~skeletal system development
9
6.8
0.001192193
GO:0014706~striated muscle tissue development
4
3.0
0.067142518
GO:0001655~urogenital system development
6
4.5
0.004445775
GO:0051216~cartilage development
6
4.5
2.69E-04
GO:0006928~cell motion
8
6.1
0.018575252
GO:0007155~cell adhesion
12
9.1
0.002739769
GO:0030574~collagen catabolic process
4
3.0
2.44E-04
GO:0030198~extracellular matrix organization
5
3.8
0.00653851
GO:0007167~receptor protein signaling pathway
7
5.3
0.015413984
GO:0042981~regulation of apoptosis
9
6.8
0.050942203
GO:0001558~regulation of cell growth
4
3.0
0.030321804
GO:0040008~regulation of growth
6
4.5
0.040578565
GO:0051252~regulation of RNA metabolic process
17
12.9
0.077021932
GO:0006355~regulation of transcription, DNA-dependent
17
12.9
0.068985832
GO:0006357~regulation of transcription from RNA polymerase II promoter
10
7.6
0.037471912
Supplemental Table 1.
Sca1-SP MEFs show increased expression of genes involved in embryonic morphogenesis, tissue morphogenesis and regulation of
cell development.
Nemajerova et al., Suppl Table 2
Fold Change
(DN vs. Thy1-SP)
--------------------------------------------------------------------------------------MYLPF
fast skeletal myosin light chain 2
27.6
MYL1
myosin, light chain 1, alkali; skeletal, fast
11.6
MYL4
myosin, light chain 4, alkali; atrial, embryonic
8.3
MYL6B
myosin, light chain 6B, alkali, smooth muscle and
non-muscle
5.1
MYH1
myosin, heavy chain 1, skeletal muscle, adult
2.48
MYH8
myosin, heavy chain 8, skeletal muscle, perinatal
4.24
MB
myoglobin
2.46
TNNC1
troponin C type 1
8.05
TNNC2
troponin C type 2
19.4
TNNI1
troponin I type 1
12.74
TNNI2
troponin I type 2
16.04
TNNT1
troponin T type 1
5.9
TPM2
tropomyosin 2
7.11
THY1
0.31
Sca1
1
NGFR
(CD271)
11.06
CD82
(Inducible membrane protein R2)
2.41
CD80
(T-lymphocyte activation antigen CD80)
2.37
CD68
(macrophage antigen CD68)
1
CD40
(TNF receptor superfamily member 5)
1
CD96
(T-cell surface protein tactile)
1
CD97
(heterodimeric receptor associated with inflammation)
1
CD55
(decay accelerating factor for complement)
1
Gene
Supplemental Table 2.
DN MEFs are enriched for myofibroblast-committed progenitors.
Supplemental Table 3.
Sorted Sca1-SP and DN MEFs yield significantly more iPS colonies
than the corresponding Thy1-SP or DP fractions.
MEF subpopulations
p value
(OSKM)*
p value
(OSK)*
p value
(OK)*
p value
(OS)*
Sca1-SP vs. Unsorted
<0.0001
<0.0001
<0.0001
<0.0001
Sca1-SP vs. Thy1-SP
<0.0001
<0.0001
<0.0001
<0.0001
Sca1-SP vs. DP
<0.0001
<0.0001
<0.0001
<0.0001
DN vs. Unsorted
<0.0001
<0.0001
<0.0001
<0.0001
DN vs. Thy1-SP
<0.0001
<0.0001
<0.0001
<0.0001
DN vs. DP
<0.0001
<0.0001
<0.0001
0.0002
0.17
0.97
0.039
0.42
DN vs. Sca1-SP
* Cells were transduced with OSKM, OSK, OK or OS combinations of reprogramming factors.
P values were calculated using Student’s t-test.
Nemajerova et al., Suppl Table 3
Supplemental Table 4
Primer sequences for quantitative real time RT-PCR:
Gene
name
HPRT
CDH1
EPCAM
CD34
CD133
c-KIT
Oct4
Sox2
Klf4
c-Myc
Nanog
Sall4
Tbx3
Tcf3
Zic3
Arf
p53
NGFR
Primer sequence 5’’-3’’
F
R
F
R
F
R
F
R
F
R
F
R
F
R
F
R
F
R
F
R
F
R
F
R
F
R
F
R
F
R
F
R
F
R
F
R
GGCTATAAGTTCTTTGCTGACC
CTCCACCAATAACTTTTATGTCC
CAGGTCTCCTCATGGCTTTGC
CTTCCGAAAAGAAGGCTGTCC
GCGGCTCAGAGAGACTGTG
CCAAGCATTTAGACGCCAGTTT
AAGGCTGGGTGAAGACCCTTA
TGAATGGCCGTTTCTGGAAGT
GTTGAGACTGTGCCCATGAAA
GACGGGCTTGTCATAACAGGA
GCCACGTCTCAGCCATCTG
GTCGCCAGCTTCAACTATTAACT
CCATGCATTCAAACTGAGGCACCA
AGCTATCTACTGTGTGTCCCAGTC
CGCCCAGTAGACTGCACAT
CCCTCCCAATTCCCTTGTAT
GTGCCCCGACTAACCGTTG
GTCGTTGAACTCCTCGGTCT
GCTCTCCATCCTATGTTGCGG
TCCAAGTAACTCGGTCATCATCT
TCTTCCTGGTCCCCACAGTTT
GCAAGAATAGTTCTCGGGATGAA
CCCTGGGAACTGCGATGAAG
TCAGAGAGACTAAAGAACTCGGC
GAACCTACCTGTTCCCGGAAA
CAATGCCCAATGTCTCGAAAAC
ACGAGCTGATCCCCTTCCA
CAGGGACGACTTGACCTCAT
TGCTGCCAGTTCAGGCTATG
CGAGAAGGGGTTTTAGTGGTATC
CTTGGTCACTGTGAGGATTCA
CTACGTGAACGTTGCCCATCA
ACAGCGTGGTGGTACCTTAT
GGTTCCCACTGGAGTCTTC
TGGGCTCAGGACTCGTGTT
CAGGGATCTCCTCGCATTCG
Amplicon
size (bp)
126
175
139
157
98
90
243
154
185
116
100
111
121
101
83
125
149
189
All primers are for mouse mRNA detection and have a hybridization temperature of 60°C. Oct4,
Sox2 and KLF4 primers are designed to amplify only the endogenous mRNA, not the transgene.
Relative expression of the target genes was calculated using the ǻCT method described
previously: Relative expression = 2-ǻCT, where ǻCT = CT (Target gene) - CT (HPRT).