Evaluating the Dynamic Employment Effects of Training Programs in East Germany Using Conditional Difference–in–Differences Annette Bergemann+ , Bernd Fitzenberger§ , Stefan Speckesser& Long Version – April 2008 Abstract: This study analyzes the employment effects of training in East Germany. We propose and apply an extension of the widely used conditional differencein-differences estimator. Focusing on transition rates between nonemployment and employment, we take into account that employment is a state and duration dependent process. Our results show that using transition rates is more informative than using unconditional employment rates as commonly done in the literature. Moreover, the results indicate that due to the labor market turbulence during the East German transformation process, the focus on labor market dynamics is important. Training as a first participation in a program of Active Labor Market Policies shows zero to positive effects both on reemployment probabilities and on probabilities of remaining employed with notable variation over the different start dates of the program. Keywords: Evaluation of active labor market policy in East Germany, transition rates, employment dynamics, Ashenfelter’s Dip, nonparametric matching, conditional difference–in–differences, bootstrap JEL classification: C 14, C 23, H 43, J 64, J 68 ∗ This is the unabridged version of our paper which is forthcoming in the Journal of Applied Econometrics. We especially thank Gerard van den Berg, Ed Vytlacil, three anonymous referees, and the editor for their very helpful comments. We are grateful for numerous helpful comments received in numerous seminars at various universities and at various conferences. Thanks goes also to Thomas Ketzmerick of the ZSH for the provision and help with the data. Annette Bergemann acknowledges the support by a Marie Curie Fellowship of the European Community Programme ‘EU Training and Mobility of Researcher’ under contract number HPMF-CT-2002-02047. The usual disclaimer applies. Corresponding Author: Bernd Fitzenberger, Department of Economics, Albert Ludwigs-University Freiburg, 79085 Freiburg, Germany, Email: [email protected] + § Free University Amsterdam and IZA Albert Ludwigs-University Freiburg, ZEW, IZA, and & IFS Westminster Business School, University of Westminster, London 1 Introduction After the formation of the German “Social and Economic Union” in 1990, the East German economy underwent enormous changes. It had to transform from a command driven backward economy to a market economy at an unprecedented speed. The transformation process brought about high unemployment in East Germany. To increase the employment chances of the unemployed, the German government decided to provide on a high scale Active Labor Market Policies (ALMP) in East Germany. These programs mainly consisted of training and temporary employment schemes. Fifteen years after the reunification, the German Federal Employment Service still spends around e 10 Billion for ALMP (Bundesanstalt für Arbeit, 2005). About 50% of this budget is spent in East Germany with a labor force less than one sixth of Germany as a whole. Quite a significant share of the labor force in East Germany has been participating in programs of ALMP since 1990. During the last decade, there were a lot of pessimistic assessments regarding the usefulness of public sector sponsored training programs in raising employment chances of the unemployed (see the surveys in Fay, 1996; Heckman et al., 1999; Martin and Grubb, 2001; Kluve and Schmidt, 2002). These studies doubt that large scale training programs, which are not well targeted, are successful in raising employment. However, evidence for Eastern European transition economies (other than East Germany) has often shown positive effects (Kluve et al., 1999 and 2004; Lubyova and Van Ours, 1999; Puhani, 1999).1 For East Germany, appropriate data for an evaluation of public sector sponsored training were not available for a long time and, until recently, the available evidence has been quite mixed.2 Most studies suffer from data limitations, either from a small number of participants (e.g. Lechner, 2000, using the German Socio-Economic 1 Another exception consists of training programs for prime–aged women in countries with a relatively low female labor force participation (see the survey by Bergemann and Van den Berg, 2007). 2 See Bergemann et al. (2004), Fitzenberger and Prey (2000), Kraus et al. (1999), or Lechner (2000) for exemplary studies based on survey data. Hagen and Steiner (2000), Speckesser (2004, chapter 1) and Wunsch (2006, section 6.5) provide comprehensive surveys of this literature, which is not reviewed here for the sake of brevity, and discuss critically the data used. 1 Panel) or from the data being limited to the early 1990s and lacking the employment history on a monthly basis (e.g. Fitzenberger and Prey, 2000, using the Labor Market Monitor for East Germany). Recently, administrative data brought about more evidence on the effectiveness of further training indicating short-term reductions in the employment outcomes of participants relative to non-participation, but significantly positive effects of ALMP in the medium and long-run, see Fitzenberger and Speckesser (2007) and Lechner et al. (2008). While studies based on administrative data only estimate the effects on employment subject to the mandatory social insurance system our analysis applying employment outcomes reported by survey participants can also consider self-employment or public sector employment, which usually remains unrecorded in administrative data. Lechner et al. (2008) evaluate effects of training programs on employment. They find strong evidence that, on average, most training programs under investigation increase long–term employment prospects. Fitzenberger and Speckesser (2007) estimate the employment effects of one major training program (Provision of Specific Professional Skills and Techniques, SPST) against nonparticipation in SPST for 36 months after the beginning of the treatment. The analysis is performed only for the 1993 inflow sample into unemployment. The analysis finds positive medium–run employment effects. These studies are based on administrative data since 1992 and they only analyze training programs starting in 1993 or later, because there are no administrative data available for the time shortly after the reunification. We focus on the employment effect of public sector sponsored training programs in East Germany starting with the reunification for the group of individuals who belonged to the active labor force in 1990. This group was fully hit by the transformation shock. In the early 90s, training was often considered to be the most effective among the ALMP programs as it was supposed to provide skills that were in demand in a market economy but not in sufficient supply due to the former educational system.3 Training was in term of participants the largest ALMP program. 3 Forecasts of the future labor demand in the early 1990s for both East and West Germany (e.g. Prognos 1993) usually indicated a severe shortage especially for service sector skills in the East if catching up to the economic situation of the West. Human capital transformation was believed to satisfy the changing labor demand and at the same time to reduce unemployment (OECD 1994). 2 We implement a semiparametric conditional difference–in–differences estimator (CDiD) (Heckman, Ichimura, Smith and Todd, 1998). We extend the CDiD approach to using transition rates between different labor market states as outcome variables instead of exclusively using employment rates in levels as is often done in the literature. The focus on transition rates is able to take into account the special economic situation of East Germany that was characterized by a labor market that was in a major restructuring process. An approach that only uses employment rates might not capture sufficiently that the labor market is in a dynamic adjustment process.4 As two benchmarks to estimate the treatment effects on unconditional employment rates, we also implement a matching estimator, which matches on employment history, and a CDiD estimator. Additionally, we also use the matching estimator in order to estimate effects on earnings. We apply propensity score matching in the first stage and then estimate average effects of treatment on the treated. The analysis matches treated individuals to nonparticipants using local linear matching to account for selection on observables. We consider selection on time invariant unobservable characteristics by implementing a conditional difference–in–differences estimator in matched samples. Our inference uses a bootstrap approach taking account of the estimation error in the propensity score. Our results indicate that modeling transition rates is more appropriate than using unconditional employment rates in the given labor market situation and the data at hand. Moreover, the approach is more informative, as we can determine whether ALMP programs help workers to find a job and/or whether they stabilize employment. We find zero to positive employment effects. Depending on the start date of the program we find significant variation concerning job finding rates and employment stability. Next to the extension of the CDiD to using transition rates as outcome variables, our paper involves two further methodological innovations: First, anticipation effects 4 See also Kluve et al. (1999) who use conditional probabilities for the analysis of ALMP effects in Poland. However, they apply a pure matching approach and define the conditional probabilities over several yearly quarters. 3 regarding future participation or eligibility criteria (Ashenfelter’s Dip) requiring a certain elapsed duration of unemployment for participation can affect the results of difference–in–differences estimator (Heckman and Smith, 1999). Using institutional knowledge and data inspection to bound the start of Ashenfelter’s Dip, we suggest for our context a long–run difference–in–differences estimator to take account of possible effects of anticipation and participation rules which might otherwise contaminate the estimation results. Second, we suggest a heuristic cross–validation procedure for the bandwidth choice that is well suited to the estimation of conditional expectations for counterfactual variables. Some recent publications propose a number of extensions to the standard static evaluation approaches to consider dynamic selection issues involved here: Similar to our paper, the timing–of–events approach (Abbring and Van den Berg, 2003; Fredriksson and Johanson, 2003) focuses on transition rates from unemployment to employment by modeling the duration of unemployment as outcome. Sianesi (2004) emphasizes that treatment differs by the elapsed duration of unemployment at the beginning of the program and that future program participants should be used in the comparison group for earlier treatments. We estimate the treatment effects during two time periods and we do not evaluate the effect of training now versus waiting where the latter may include the possibility of training in the future during the same time period. Also, our analysis goes beyond the aforementioned studies in two respects: We model the effects of treatment on both the probability of leaving nonemployment and of remaining employed. Extending the standard CDiD estimator, we allow for unobserved, state dependent individual specific fixed effects. The paper is organized as follows: Section 2 gives a short description of economic development in East Germany. Section 3 summarizes the institutional settings of training. Section 4 proceeds to the microeconomic evaluation approach used here. Section 5 describes the data set used and compares participants and nonparticipants. Section 6 shows the implementation of the estimation approaches and discusses the empirical results. Section 7 concludes. The appendix includes detailed descriptive evidence and results. 4 2 The Economic Development in East Germany During reunification in 1990, the East German economy underwent unprecedented changes. It was rapidly transformed from a command driven backward economy to a market economy facing competition from Western economies. The economic situation in the last years of the socialistic German Democratic Republic was shaped by stagnation, an obsolete capital stock, and production processes which were highly overstaffed. This resulted in high employment rates and a productivity level that fell substantially short of West European standards. Although formally highly qualified, the workforce lacked specific skills necessary in a competitive environment. Moreover, compared to modern market economies, the production structure was heavily biased towards manufacturing goods.5 Following July 01, 1990, the Economic, Monetary and Social Union transferred the West German legal, social and economic regulation to East Germany together with an exchange rate highly overvaluing the East German currency. Furthermore, in 1991 trade union and employer representatives agreed on a wage path quickly converging to West German levels in 1991, which stood in sharp contrast to the productivity level in East Germany. In 1993, however, the idea of a fast wage convergence was abandoned. Production collapsed during the first two years after the reunification. Production in 1991 only reached 2/3 of its 1989 level. Afterwards, real GDP was growing rapidly with annual growth rates between 7.7% and 11.9%. But since 1995, economic growth has ebbed off with growth rates between 1.6 and -0.2%. At the same time, labor productivity increased from 51.2% in 1991 of the West German level to 74.2% in 2004. However, wage increases were even larger. Wages in 1991 amounted to 56.1% of the West German level in 1991 and increased to 81.5% in 2004. The labor market suffered from a dramatic disequilibrium. In 1989 close to 9.6 million individuals were employed. This figure dropped sharply to 6.5 million in 1997 and remained fairly constant since then. To fight the resulting high unemploy5 Unless indicated otherwise, the following overview is based on Akerlof, Rose, Yellen and Hesse- nius (1991), Bundesministerium für Verkehr, Bau und Stadtentwicklung (2005), Hunt (2006), van Hagen and Strauch (2001), and Wurzel (2001). 5 ment, active and passive labor market policies were heavily used in East Germany (see figure 1). These programs should also provide an immediate cushion for the unemployed and support the goal that the standard of living in East Germany should converge quickly to Western levels in order to avoid large scale outmigration and to foster political stability. Although the composition and scale changed markedly over time, training programs played a central role during the 1990’s. It was strongly believed that training programs were important as the East German workforce lacked skills which are necessary in a competitive environment, especially in the light of the likely changes or the sectoral production composition. Compared to modern market economies, production related services were for example strongly under-represented in the pre– reunification period (Prognos, 1993; OECD, 1994; Bundesanstalt für Arbeit, 1991). Shortly after reunification, early retirement programs and short-time work programs were heavily used in order to reduce - at least in the short run - open unemployment (see figure 1). In ALMP, training programs and job creation schemes dominated. In addition to the main objective of increasing the re–employment probability, especially job creation schemes were also regarded as social measures. Participation in ALMP peaked in 1992 with over 800.000 individuals participating on average in full time programs. From 1993 onwards budget constraints forced the labor offices to reduce the number of participants. Open unemployment increased from 1 million in 1991 to 1.6 million individuals in 2005. From 1989 to 1991 migration from East to West Germany was substantial. In 1991, the net migration amounted to 171.000 individuals. However, after uncertainty concerning the political and economic situation disappeared, migration ebbed away and was increasingly matched by migration in the opposite direction. In 1996, net migration to the West reached its minimum with 26.000. Before 1997 net outflow was confined to lower and medium qualified labor. Since 1997, however, the net flow has increased again and the net outflow is highest for highly qualified labor (see Kempe 2001). 6 3 Training in East Germany 3.1 Institutional Background Between 1969 and 1997, training as part of Active Labor Market Policy in Germany was regulated by the Labor Promotion Act (Arbeitsförderungsgesetz, AFG). Despite a number of changes in the regulation over this time period, the basic design of training programs remained almost unchanged until the AFG was replaced by the new Social Law Book (Sozialgesetzbuch, SGB) III in 1998. The German Federal Labor Office (Bundesanstalt für Arbeit, BA) was in charge of implementing these programs in addition to being responsible for job placement and for granting unemployment benefits. In the course of German reunification, these programs were extended to East Germany (§ 249 AFG). Training programs under the AFG legislation consist of the following two types: Further Vocational Training (Fortbildung) and Re–training (Umschulung).6 Further vocational training includes mainly courses for the assessment, maintenance, and extension of skills. The duration of these courses depends on the characteristics of the participants and varies between 2 and 8 months. The courses are mainly offered by private sector training companies. Under the heading of further vocational training also Short–term training courses were implemented. These are courses that provide skill assessment, orientation, and guidance. The courses are intended to increase the placement rate of the unemployed. Mostly, they do not provide occupational skills but aim at maintaining search intensity and increasing the hiring chances. The courses usually last from two weeks to two months. Re–training enables vocational re–orientation if no adequate employment can be found because of skill obsolescence. Re–training is supported by the BA for a period up to 2 years and aims at providing a new certified vocational training degree. Participation in training is basically voluntary. Individuals face a variety of incentives to participate. Next to the potential benefits in terms of increase of the employment chances, participation can also offer some instantaneous utility, 6 By law, also so called Integration Subsidies are part of the the array of training programs, but in reality these programs are closer to wage subsidies and therefore not part of this analysis. 7 especially in this time with high uncertainty (being together with other job seekers, the learning experience, motivation, etc.). Enrollment into training is also attractive from a financial point of view. First of all, participants can be granted an income maintenance payment (Unterhaltsgeld ). The time of participation in the training program does not count towards the limited time of the unemployment benefits. Second, under certain conditions the participants can re–qualify for, or prolong their eligibility period for unemployment benefits. Third, the refusal of participation in a training program despite the recommendation of a caseworker can lead to the imposition of a sanction, which basically meant that their benefit payments would stop completely for between six weeks and three months. 3.2 Changing Incentives for Participation Initiated after 20 years of an unprecedented economic growth in post–war West Germany, the 1969 legislation of Active Labor Market Policies and training programs in particular was fairly generous. The design of these programs provided institutionally built–in incentives to participate because participants benefited from rather long-term programs, offering high levels of benefit and post-participation advantages such as renewed unemployment benefits after participation. As often observed in European welfare states (e.g. Calmfors 1994), the institutional incentives for participation were probably as important as the objective of re-integration into the labor market through human capital investment. As a consequence, the program intake consisted of heterogeneous participants: On the one hand, participants with good prospects might have started training in order to get a decent job. On the other hand, job seekers might have started the program because of the institutional incentives even though the program was not effective to achieve the objective of reintegration into the labor market. With the changing economic situation in the 1990’s, following the German reunification a number of changes concerning the institutional settings of training took place affecting also strongly the incentives to participate. The most important ones are the following: 8 1. Change in provision of training: Shortly after the reunification, training programs were massively supplied. However, mainly due to tighter public budgets, the number of courses provided declined strongly after 1992. 2. Short–term training programs were formally abolished in 1992. In 1993, a new program with the same purpose was established, but participants were no longer considered as taking part in training programs. Before 1993 program participants were not required to prove active job search and job placement while on the program. After 1993, participants in the new program remained openly unemployed, including the requirement for a job seeker on unemployment benefit to continue active job search and to start employment immediately when offered a job corresponding to their previous occupation. 3. Change in income maintenance payments: Participants in training might have been either recipients of unemployment benefit (i.e. those with unemployment of less than one year) or of unemployment assistance (long–term unemployed, receiving lower means–tested benefits). During the early 1990’s, participants, who previously had received unemployment benefits, received income maintenance payments exceeding the standard unemployment benefits by 3 percentage points, while participants entering the program from means–tested benefits continued to receive benefits at the level of 58% of previous net earnings. Legislation reduced the benefits for participants starting training after unemployment benefit in 1994. Since then, the level of income maintenance payments of 67% of previous net earnings for participants with children and 60% for participants without children matches the level of unemployment benefits. This decline in the income maintenance payment for the most important group of participants reduced institutional incentives of participating in training programs (see Eichler/Lechner 2001, 221). 4. Change in eligibility rules for participation: Originally, participation in a training program was open to participants who had not been unemployed before as long as the case worker deemed participation in training as “advisable”. This type of training intended to prevent future unemployment, to increase the la9 bor market prospects in the future, or to foster re–integration of individuals returning to the labor market. In 1994, a reform restricted access to individuals fulfilling the criteria for “necessary” training, i.e. to formerly unemployed participants. However, especially in East Germany, participation under the weak criterion of “being threatened by unemployment” was still possible. These four major changes over the period of investigation of our study reduced both the overall program intake as well as the participation due to institutional incentives. One can expect the outcomes of training programs to be different for the later years compared to the early 1990’s, as the mix of participants in training programs changed particularly following the 1994 reform. We conclude that a credible evaluation strategy has to account for this and our empirical analysis considers both periods separately. The reduction in program intake mainly affected the selection of participants (more targeted towards special problem groups). The end of explicit short–term training programs made the programs longer and more expensive. Hence, we expect the program mix to have become less focused on immediate placement of participants. After the change, there is a stronger focus on providing additional skills and helping participants to signal their skills. On the one hand, these changes result in stronger incentives to participate than before the reform. This and the knowledge, that ALMP would be a permanent feature of the East German labor market, were likely to cause unemployed individuals to decrease their search effort for a new job in anticipation of participation. On the other hand, training programs become less attractive, especially for workers who are still employed. Over time, a change in the selection of the program group occurs, with training increasingly targeting problem groups with a priori significantly lower employment chances. 3.3 Aggregate Participation Training programs were implemented in East Germany immediately after unification (see figure 2): 94,000 persons started to participate during the last three months of 1990. In 1992, the maximum was reached with 774,000 entries. Between 1993 and 1997 the number declined considerably to 155,000 in 1997. Afterwards participation 10 recovered to a level slightly above 180,000 reflecting the ongoing importance of these programs in East Germany. The share of entries into re–training as a percentage of total training varies between 15% in 1991 and 28% in 1993. Separate figures for the subprograms are not available for the time after 1997 due to the change in the regulation. Stocks of participants show a similar pattern (see figure 3). The maximum was reached in 1992, amounting to 492,000 participants on average. Participation has been declining afterwards (2000: 139,700, 2002: 129,000 participants). The trends for the subprograms (not reported in figure 3) are analogous. Direct costs for participation (see figure 3, right axis) – income maintenance, course fees, travel costs etc. – increased continuously over time. In 1991, when short– term training programs still existed, annual costs were at e 8,000 per participant. These cost increased to e 14,600 in 1995 and to e 20,600 in 2002. 4 Evaluation Approach Our empirical analysis is based on the potential outcome approach to causality (Roy, 1951, Rubin, 1974), see the survey Heckman, LaLonde, and Smith (1999). We focus on estimating the average causal effect of treatment on the treated (TT) in the binary treatment case.7 TT is given by E(Y 1 |D = 1) − E(Y 0 |D = 1), where the treatment outcome Y 1 and the nontreatment outcome Y 0 are the two potential outcomes and D denotes the treatment dummy. Our outcome variable of interest Y is an employment dummy, in our preferred approach conditional on the 7 The framework can be extended to allow for multiple, exclusive treatments. Imbens (2000) and Lechner (2001) show the extension of the standard propensity score matching estimators for this purpose. Although this would be a natural extension in our application, we do not think that our data are sufficiently rich enough for this purpose. Our analysis is very demanding since we argue that matching on observable covariates will not suffice to control for selection bias and since we model the effects on transition rates between different labor market states. Therefore, we restrict ourselves to estimating TT for training where the comparison group is the group of all individuals who either do not participate in any program or who only participate in other programs where the latter two are weighted by their sample frequencies. 11 employment status in the previous month resulting in a transition dummy.8 The observed outcome Y is given by Y = DY 1 + (1 − D)Y 0 . The evaluation problem for estimating the TT consists of estimating the counterfactual outcome in the nonparticipation situation for the participating individuals (D = 1). Identifying assumptions are needed to estimate the counterfactual based on the outcomes for nonparticipants (D = 0). One important aspect for the implementation of a treatment effect estimator in our context is connected with the observation that training in East Germany is associated with a disproportionate decline in employment rates shortly before the start of the treatment, as it is typical for training among the unemployed. A similar finding termed Ashenfelter’s Dip was first discovered when evaluating the treatment effects on earnings (Ashenfelter, 1978). Later research demonstrated that the same phenomenon can also occur regarding employment, see Heckman, LaLonde and Smith (1999), Heckman and Smith - henceforth HS -, 1999, and Fitzenberger and Prey (2000). We argue that in our context the decline in employment is caused by participation rules or anticipation effects (HS) and we implement our estimators to allow for this relationship. Regarding participation rules, the target group of public sector sponsored training are the unemployed and employees with a high risk of unemployment. Anticipation effects arise because unemployed individuals and employees with a high risk of losing their jobs reduce their search effort, because they know about the availability of a training program in the near future. Analogous to the finding in the US literature (Ashenfelter and Card, 1985, Heckman, Ichimura, Smith, and Todd - henceforth HIST - , 1999, HS) that earnings decline before training starts, we denote the decline in employment as Ashenfelter’s Dip (or the preprogram dip) in employment. The problem that Ashenfelter’s Dip imposes for evaluation studies is mostly discussed in the context of conditional Difference–in–Difference estimators (henceforth CDiD). HIST and HS for example, emphasize that CDiD estimators are quite sensitive with regard to Ashenfelter’s Dip, a problem which is also summarized under the heading of the “fallacy of alignment” (see Heckman, LaLonde and 8 For the evaluation approach with respect to the outcome variable earnings see section 4.4. 12 Smith, 1999). On the one hand, if the decline in employment before treatment is transient then contrasting the before–after difference in employment for the treated with that of the nontreated will overstate the treatment effect. On the other hand, if the decline in employment is persistent then DiD based on a longer preprogram period will underestimate the treatment effect. This is analogous to the discussion in HS, p. 317, for the dip in earnings. For US data, HS show that the short–run dynamics in unemployment are an important determinant of treatment and that the preprogram dip in earnings is transitory. Matching on the earnings history does not allow to take account of the dip. Rather it proves important to match on the short–run dynamics in unemployment which HIST and HS include as determinants of the propensity score. To account for the dynamics in earnings shortly before and shortly after treatment, the CDiD estimators in HIST and HS are based on before–after differences which are symmetric in time before and after treatment start. This builds on the identifying assumption that the selection bias after matching on the propensity score is the same symmetrically around the treatment start. This approach proves useful to reduce considerably the bias in nonexperimental TT estimates as shown by HIST and HS who have experimental estimates as a benchmark. We think that such a symmetry assumption is not warranted in our application because there is no reason to believe that in the East German context the selection bias in the employment recovery for the time shortly after participation is a mirror image to the preprogram dip in employment. In fact, since HIST and HS match on short–run preprogram unemployment dynamics, such an assumption would not even be justifiable for employment in their case. We will suggest a CDiD estimator based on long–run preprogram differences that lie before the time period of the preprogram dip. This paper discusses and implements three different estimators. First, as a benchmark approach, we introduce a matching estimator for the effects on employment rates and earnings. With respect to matching on the labor force history, it appears naturally to only match on the long–run preprogram history in order to circumvent potential problems with respect to Ashenfelter’s Dip. This comes at the cost 13 of ignoring the short–run preprogram history for the selection into the program. Note that simply matching on the short–run preprogram history would be incorrect resulting in a fallacy of alignment. As the second approach, we consider a static CDiD estimator (henceforth CDiDS) in order to estimate the effect on employment rates that is based on the long–run preprogram differences. Similar to the approach in HIST and HS, the CDiDS estimator takes account of the link between the treatment start and the short–run preprogram employment history by matching on the employment status in the month before program start for those who are nonemployed in that month. This way, the matched nontreated individuals are at similar risk of participating in training, which is necessary for an appropriate control group. As becomes evident below, matching on the employment status in the month before treatment start should not be necessary for those who are employed in that month.9 Note that the short–run preprogram employment history does not enter the before–after differences used for CDiDS estimation. However, this approach requires a stable long–run pre–program difference in employment rates between treated and matched controls, which we do not achieve in our application. Our third preferred estimator uses CDiD in employment rates conditional upon employment state in the previous month (discrete transition rates or ’hazard rates’) and conditional upon duration dependence (henceforth CDiDHR). We develop this estimator on the basis of an employment model with state dependent transition rates, duration dependence, additive unobserved heterogeneity, and heterogeneous treatment effects. CDiDHR is a natural extension of the CDiDS. CDiDHR takes account of short–run dynamics in two ways: First, by conditioning on the employment state in the previous month and on the time in the current employment state, we automatically account for the short–run dynamics in employment when matching treated and nontreated individuals at the time of evaluation. Second, we match on the employment status in the month before program start for those who are nonem9 In East Germany there were a number of transitions from employment to training without an intervening spell of nonemployment. Apparently, these participants are less selective because in the transition process a large fraction of the East German employees workforce was at the risk of becoming unemployed. 14 ployed. With this approach we are able to achieve parallel long–run preprogram differences between treated and matched non–treated. In the following, we develop the three estimation approaches more formally and we provide implementation details. In particular, we develop a new cross–validation rule to choose bandwidths for kernel matching. Last, we discuss applying the cross– sectional matching estimator also for the TT effects of training on earnings taking account of the fact that earnings are only observed in the last three years of our observation period. 4.1 Matching on Employment History and CDiDS Building on the Conditional Mean Independence Assumption (CIA), a standard matching estimator would involve matching the employment history and the relevant characteristics X to contrast the post treatment employment rates for treated and matched nontreated (see Heckman, LaLonde, and Smith, 1999). To account for the preprogram employment dip, it would seem advisable to match on the long–run preprogram employment history, i.e. for a time before Ashenfelter’s Dip starts. We specify the start of Ashenfelter’s Dip conservatively and we let it vary over time (see section 6.2 and 6.4). Shortly after the German reunification individuals could not have expected the huge supply of training programs. Thus, anticipation of program participation could only occur shortly before the beginning of the participation. Likewise, participation rules were only applied in very lax way in 1990 and 1991. With the reduction of supply of training courses, participation rules became stricter but at the same time it became obvious that training courses would be a permanent feature of the labor market. Consequently, we let the time of Ashenfelter’s Dip increase during early 1990’s. We implement the cross–sectional matching estimator as benchmark approach. To estimate the expected nonparticipation outcome for the participants with observable characteristics X, it suffices to take the average outcome for nonparticipants with the same X and the same long–run preprogram employment history. Based on the CIA, the expected nontreatment outcome for a participant i is estimated by the fitted value of a nonparametric regression in the sample of nonparticipants at 15 point X and at the employment history. This is implemented by a bivariate kernel regression on the propensity score and on the employment history. Differences in the long–run preprogram employment history are summarized by the Mahalanobis distance. The nonparametric regression can be represented by a weight function wN0 (i, j) that gives a higher weight to nonparticipants j the stronger his similarity to participant i in terms of X and the employment history using a product kerP nel. For each i, these weights sum up to one over j ( j∈{D=0} wN0 (i, j) = 1). The estimated TT is then (1) 1 N1 X i∈{D=1} X Yi1 − j∈{D=0} wN0 (i, j) Yj0 , with N0 the number of nonparticipants j and N1 the number of participants i. We use local linear matching for the propensity score and the employment history. The product kernel is given by (for ease of notation, we omit the index N0 ) Ã (2) pi − pj K(i, j) = φ hp ! Ã " mdist(i, j) · exp − h2m #! where pi , pj are the estimated propensity scores for individuals i, j, mdist(i, j) is the Mahalanobis distance in the employment history (difference in vector of monthly employment dummies), hp , hm are the two bandwidths, and φ is the Gaussian kernel. The weights are given by wN0 (i, j) = K(i, j)/ P j∈{D=0} K(i, j). As a simple alternative to the matching estimator discussed so far, CDiDS takes account of the short–run dynamics in the selection into training and accounts for time–invariant, additively separable selection bias due to unobserved characteristics. For instance, unobserved characteristics could be due to differences in the motivation of participants or could reflect that programs are targeted to individuals with some particular problems in the labor market. The CDiDS estimator analyzes the before– after change in in employment rate instead of its level in equation (1). Following the approach in HIST,10 we use local linear matching based on the estimated propensity score to match participants i and nonparticipants j in the same time period. In addition, we also match on being nonemployed in the month 10 See also Blundell, Costa Dias, Meghir and Van Reenen (2004) for an application of the CDiD, where age eligibility rules and regional variation in the provision of program are used to take account of selection effects. 16 before treatment start using a product kernel analogous to equation (2), where the distance in the employment status in the month before treatment (only for those participants who are nonemployed in the month before) is used as a one–dimensional Mahalanobis distance. For treatment in period τ , i.e. conditional on τi = τ (τi is the random time individual i first enrolls in training), the simple CDiDS–estimator for the treatment effect on the employment rate in period t = t1 + τ is given by N1 X 1 X 0 0 0 Y 1 − Y − wi,j (Yj,τ i,τ +t0 +t1 − Yj,τ +t0 ) N1 i=1 i,τ +t1 j (3) where period τ + t1 (t1 > 0) lies after and τ + t0 (t0 < 0) before treatment. t1 and t0 are defined relative to the actual beginning of the treatment τ . N1 is the number of participants i for whom the (t1, t0) difference can be determined. Note that we only match comparison individuals j who do not receive treatment during the observation period under consideration. We implement the CDiDS using long–run preprogram differences in the outcome variable after matching to control for remaining unobservable differences. Therefore, t0 must lie before −ad, the start of Ashenfelter’s Dip.11 This is parallel to the matching approach where the existence of the preprogram employment dip precludes that we use the short–run preprogram dynamics in employment to match. For the CDIDS, the preprogram employment dip does not allow the use of the short–run dynamics for the before–after differences. Using the short–run preprogram dynamics in either way would result in a fallacy of alignment. 4.2 Conditional Difference–in–Differences in Hazard Rates (CDiDHR) 4.2.1 Employment Model and Ashenfelter’s Dip We specify an econometric model for employment in order to develop and discuss the the estimator for the effects on transition rates. The model takes account of 11 As discussed below (footnote 15) in greater detail, we do not take symmetric differences t0 = −t1, in contrast to HS. We do not think the employment recovery after the beginning of treatment is likely to be symmetric to the decline in employment before treatment. 17 important features of employment dynamics such as state dependence and duration dependence. The model represents a nonparametric version of a linear probability regression specification for employment probability with a fixed effect. This model allows for a CDiD estimation of the treatment effects on state specific employment rates. As the we allow for state dependency in the employment Yit of individual i in month t and we distinguish between two different labor market states, notably employment and nonemployment, we specify two separate outcome equations depending on the state in the previous month.12 Employed in t − 1: (4) e Yit = ae (Xi , t, Ei,t−1 ) + δi,t,E (τi ) + cei + uei,t i,t−1 for Yi,t−1 = 1 Not employed in t − 1: (5) n Yit = an (Xi , t, Ni,t−1 ) + δi,t,N (τi ) + cni + uni,t i,t−1 for Yi,t−1 = 0 Duration dependence enters equation (4) through Ei,t−1 , the elapsed employment duration in t − 1, and equation (5) through Ni,t−1 , the elapsed nonemployment duration in t − 1. ae (Xi , t, Ei,t−1 ), an (Xi , t, Ni,t−1 ) are functions describing the state dependent employment probabilities as a flexible function of observed time invariant characteristics Xi , month t, and elapsed durations Ei,t−1 , Ni,t−1 . cei , cni are state dependent permanent individual specific effects, and uei,t , uni,t are the idiosyncratic, period specific effects which are conditionally heteroscedastic. Our estimation approach hinges critically on the assumption of additive fixed effects. τi is the actual (random) time individual i first enrolls in a training program with τi > T̄ for nonparticipants and T̄ being the end of the time period analyzed. e n δi,t,E (τi ), δi,t,N (τi ) are the individual specific, state dependent effects of treati,t−1 i,t−1 ment on the individual employment probabilities and represent potential treatment effects of different τi .13 We estimate averages of the employment effects in period e n t, δi,t,E (τ ), δi,t,N (τ ), conditional on receiving treatment in some period τ , i.e. i,t−1 i,t−1 12 In this subsection, the index i denotes any individual whereas in the remainder of the paper i applies only to treated individuals. 13 The link between actual treatment effects operating for treated individual i and potential 18 τi = τ where τ is some fixed time, and conditional on the employment status in the previous period t − 1. Furthermore, we assume that the effect of treatment occurs e n after treatment, i.e. δi,t,E (τi ) = 0 and δi,t,N (τi ) = 0 for t < τi .14 The asi,t−1 i,t−1 sumption implies the absence of deterrence effects, which is plausible since training programs are not mandatory. Next to state and duration dependence, we allow the e n individual potential treatment effects δi,t,E (τ ) and δi,t,N (τ ) (see footnote 13) i,t−1 i,t−1 to depend upon observed characteristics Xi and the individual specific effects cki . They are also allowed to vary by i, t, and τ conditional upon Xi and cki . Regarding the selection into treatment, the evaluation approach allows treatment time τi to be affected by the observed covariates Xi , by the treatment effects e n δi,t,E (τi ), δi,t,N (τi ) , and by the individual specific effects cei , cni . Furthermore, i,t−1 i,t−1 we impose little functional form restrictions on ae (Xi , t,Ei,t−1 ) and an (Xi , t, Ni,t−1 ). However, we choose a parametric model to estimate the propensity score. For the idiosyncratic error terms, we assume that uei,t , uni,t are conditionally mean independent of treatment in the past. Ashenfelter’s Dip in employment (the preprogram employment dip) links treatment and the idiosyncratic error terms uei,t , uni,t shortly before treatment. Formally, the preprogram employment dip can be described by E(uki,t−s |τi = t) < 0 for (k = e, n) and s = 1, . . . , ad, where ad denotes the beginning of Ashenfelter’s Dip. We assume that both anticipation effects and participation rules do not affect the idiosyncratic error term after treatment. Therefore, the preprogram effects are not linked to the outcome variable once treatment has started, i.e. uki,t−s (k = e, n) are not correlated with uki,t+l with s, l ≥ 1 and conditional on τi = t.15 Note that n treatment effects is given by δi,t,N (τi ) = i,t−1 P τ n δi,t,N (τ )I(τi = τ ), where I(τi = τ ) is a i,t−1 dummy variable which is equal to one iff τi = τ . 14 This assumption is similar to the timing–of–events approach (Abbring and Van den Berg, 2003). 15 This is in contrast to HS who model earnings in the recovery process to be expected (based on nontreatment outcomes) after the treatment being symmetric to the deterioration during Ashenfelter’s Dip. HS show empirically that such a pattern holds based on experimental data. In our context, state dependence in employment results in a sluggish recovery process without treatment which in general is not symmetric around Ashenfelter’s Dip. Analyzing transition rates allows us to take account of the sluggishness of recovery. 19 the preprogram employment dip as a temporary shock nevertheless would affect nontreatment outcomes through the state dependence of the employment process. In our empirical analysis, we allow for a maximum length of time (ad months) for Ashenfelter’s Dip (see section 4.1). 4.2.2 Implementation of CDiDHR Based on the employment model in equations (4) and (5), the following Conditional Difference–in–Differences in Hazard Rates (CDiDHR) estimator arises naturally as an extension of a CDiD estimator to a state dependent employment process with duration dependence. We simply estimate the treatment effect on the employment probability via CDiD as in equation (3) conditional on employment status in the previous month and conditional on dummy variables for duration dependence Ei,t−1 or Ni,t−1 . The estimator is calculated for the set of observations N l instead of all treated individuals {D = 1} where l denotes the employment status in the previous month (l = 1 if previously employed and l = 0 if previously nonemployed). N l is then the set of treated individuals for whom Yi,τ +t1−1 = Yi,τ +t0−1 = l, where period t1 lies after and t0 before treatment for individual i. N1 in equation (3) is replaced by N l , the number of individuals in the set N l . Also the weights are normalized accordingly based on this set. Similar to the previous section, we use the average individual employment rate Ym,τ +t0 (m = i, j) in the time period before the preprogram employment dip conditional on the employment status in the month before. Thus, we use individual average observable transition rates for alignment in the preprogram period. Only nonparticipants j for whom Yj,τ +t1−1 = Yj,τ +t0−1 = l and who fall into the same duration categories are matched, i.e. can have a non zero weight wi,j . For l = 0 and l = 1, we estimate the reemployment probability and the probability of remaining employed, respectively. When calculating the expression in equation (3), the set N l changes over the time periods (t1 , t0 ) considered since different sets of individuals are employed or non–employed in the previous month. For some individuals, there exists no pair of time periods in the ’pre’ and ’post’ intervals where they are employed or not employed. Effectively, there is a sorting process in either employment state which 20 respect to observed and unobserved characteristics. Our estimated treatment effect averages over the different set of individuals across time. This implies that we do not estimate the TT for all treated individuals but just the TT for those treated individuals who happen to be in the employment state of interest in the months after treatment start. This way we focus on the effects for those settings which are observed. There is no ready procedure to estimate the unconditional TT by also integrating out both observed and unobserved individual specific effects without imposing further stringent assumptions. To properly account for selection bias in the nonparticipation outcome, CDiDHR only requires the mean difference in the idiosyncratic error terms conditional on Di = 1 and DTi = 0 (both also conditional on Xi ) to coincide, i.e. E(uki,τ +t1 |Di = 1, Xi ) − E(uki,τ +t0 |Di = 1, Xi ) = E(uki,τ +t1 |DTi = 0, Xi ) − E(uki,τ +t0 |DTi = 0, Xi ) for k =e,n , t1 ≥ τ , and t0 < −ad, where Di ≡ D(τi = τ ) (dummy for treatment in period τ ) and DTi ≡ D(T ≤ τi < T ) (dummy for treatment during observation period [T , T ]). The individual specific effects cli do not have to be conditionally mean independent of treatment status and covariates Xi . Also for CDiDHR, we require that t0 lies before −ad, i.e. before anticipation and participation rules can take effect.16 We estimate the TT over the sample distribution of lagged employment status and elapsed (non)employment durations. The state dependent specification allows for a sluggish recovery in employment rates in the nontreatment state as a benchmark. In light of our employment model in equations (4) and (5), our CDiDHR estimator is robust against the preprogram dip in employment provided average long–run employment differences before treatment do not change before the start of the dip (parallel pretreatment outcomes). As in the case of CDiDS, CDiDHR can be modified to account for the dependence of treatment participation and treatment effect on the employment dynamics in the short run before the start of the 16 k If the timing of treatment τi is independent of the person specific gains δi,. (k = e, n) conditional on the unobserved heterogeneity terms, then the estimated treatment effect for some τ is the average effect of treatment on the treated for the entire treated sample in the observation period conditional on the employment state in the previous month. However, the latter condition may not be justifiable in our application. 21 treatment. Indeed, the dynamics shortly before treatment appear to be decisive for participation. We find that we need to match on the employment state in the month before treatment for those who are then nonemployed in order to stabilize the average long–run employment differences. Only this way, we obtain the necessary parallel average long–run preprogram outcomes which are required for CDiD (see Abadie, 2005). Note that despite matching on the short–run preprogram dynamics, our CDiDHR estimator is not invalidated by the preprogram employment dip. The only purpose is to match nontreated individuals who have a similar risk of participating in training as the treated individuals. 4.3 Implementation Details Following HIST and HS, we use a local linear matching in the propensity score, i.e. we run a local linear kernel regression in the dimension of the propensity score, see Pagan and Ullah (1999).17 A kernel function with unbounded support avoids some of the problems involved with local linear kernel regression, namely, that the variance can be extremely high in areas where there is not a lot of data, see Seifert and Gasser (1996) and Frölich (2004) for a critical assessment of local linear kernel regression. We take account of the sampling variability in the estimated propensity score by applying a bootstrap method to construct the standard errors of the estimated treatment effects. To account for autocorrelation over time, we use the entire time path for each individual as the block resampling unit. All the bootstrap results reported in this paper are based on 200 resamples. For the local linear kernel regression in the sample of nonparticipants, we use the Gaussian kernel, see Pagan and Ullah (1999). Standard bandwidth choices (e.g. rules 17 Local linear matching has a number of theoretical advantages compared to the widely used nearest neighbor matching. The asymptotic properties of kernel based methods are straightforward to analyze and it has been shown that bootstrapping provides a consistent estimator of the sampling variability of the estimator in (1) even if matching is based on closeness in the estimated propensity score, see HIST or Ichimura and Linton (2001) for an asymptotic analysis of kernel based treatment estimators. Abadie and Imbens (2006) show that the bootstrap is in general not valid for nearest neighbor matching due its extreme nonsmoothness. 22 of thumb) for pointwise estimation are not advisable here because the estimation of the treatment effect is based on the average expected nonparticipation outcome for the group of participants, possibly after conditioning on some information to capture the heterogeneity of treatment effects. Since averaging pointwise estimates reduces the variance, it is clear that the asymptotically optimal bandwidth should go to zero faster than an optimal bandwidth for a pointwise estimate, see Ichimura and Linton (2001) on such results for a different estimator of treatment effects.18 To choose the bandwidths hp and hm , we suggest the following heuristic leave– one–out cross–validation procedure which mimics the estimation of the average expected nonparticipation outcome for each period. First, for each participant i, we identify the nearest neighbor nn(i) in the sample of nonparticipants, i.e. the nonparticipant whose propensity score is closest to that of i, and we also assign the start of treatment τi for i as the fictitious start for nn(i). Second, we choose the bandwidths hp and hm to minimize the sum of the period–wise squared prediction errors 2 N1,t X 1 X 0 Y 0 wi,j Yj,τ nn(i),τi +t − i +t t=0 N1,t i=1 j∈{D=0}\nn(i) TX −1 where the prediction of employment status for nn(i) is not based on the nearest neighbor nn(i) himself and t = 1, ..., T denotes the month (T = 36 for our data) in the calendar months after the beginning for treatment for treated individual i himself. The optimal bandwidths affecting the weights wi,j are determined by numerical optimization. In some cases, we find a very large value for hp which means that the matching on the propensity score does not have much influence on the results. Since our method for the bandwidth choice is computationally quite expensive, it is not possible to bootstrap it. Instead, we use the bandwidth found for the sample in all resamples. 4.4 Effects on Earnings We also attempt to estimate the TT effects of training on earnings. For earnings as outcome variable, we can only provide cross–sectional matching estimates because 18 This is also the rationale for researchers using nearest neighbor matching with just the closest neighbor thus focusing on minimizing the bias. 23 earnings are only observed for the last three years of the data 1997 to 1999. Since we analyze training in the years 1990 to 1999, data availability precludes a CDiD estimator. We report an estimate of the treatment effects on earnings by matching on the long run employment history before treatment, analogous to what we do for employment effects, and then we control for the time passed since the beginning of treatment. 5 Data Our analysis uses the Labor Market Monitor Sachsen–Anhalt19 (Arbeitsmarktmonitor Sachsen–Anhalt, LMM–SA) for the years 1997, 1998, and 1999. The LMM–SA is a panel survey of the working–age population of the state (Bundesland) of Sachsen– Anhalt with 7,100 participants in 1997, 5,800 in 1998, and 4,760 in 1999. 1999 is the last year in which the survey was conducted. Only in the three years used, retrospective questionnaires on the monthly employment status between 1990 and the interview date were included, covering employment, unemployment, or participation in a program of ALMP, as well as periods in the education system, inactivity, or in the military. Individuals who did not participate in the 1998 survey are recorded until at least September 1997, those who dropped out in 1999 at least until October 1998.20 19 Our data refer to the state of Sachsen–Anhalt, which experienced a slightly worse economic development after 1990 than the average of East Germany (see Eichler and Lechner, 2002). However, the regional economic situation varies within this state and we control for these differences between treated and matched control observations in our matching approach. Therefore, the estimated effect should reflect an uncontaminated microeconomic effect that accounts for differences in the economic situation within the state. Since we focus on the employment effect as an effect of treatment–on–the–treated for a particular cohort on the East German labor market, the results are therefore likely to hold for similar cohorts in other East German areas. Further information on the data set can be found in Ketzmerik (2001). 20 Recall error is unlikely to be of particular importance for these data, see the discussion in the Appendix. 24 5.1 Selection of Sample Unfortunately, in the three survey years used the categories of the labor market status information differ. For compatibility, the data set also includes a combined monthly calendar for the three survey years. This calendar distinguishes the following categories: Education, full–time employed, part–time employed, unemployed, job creation scheme, training, retirement, pregnancy/maternity leave, not in active workforce. Additional information on the individuals that goes beyond the monthly labor market status since 1990 can be retrieved from the cross–sectional dimension of the survey for the years 1997 to 1999. We use static individual characteristics, such as education, area of residence at the time of the interview, and year of birth. As an additional outcome variable, next to employment, we consider monthly net income at the time of the interviews in 1997 to 1999, in case an individual is defined as employed. When nonemployed, we set this outcome variable to zero. Thus, we basically consider monthly earnings because nonlabor income is likely to be negligible in East Germany among employees. We only consider individuals with uninterrupted information on their labor market history between January 1990 and at least September 1997 (i.e. individuals who completed the retrospective question in 1997).21 The individuals are between 25 and 50 years old in January 1990 and employed before the start of the “Economic and Social Union” in June 1990. This way, only individuals are included who belonged to the active labor force of the GDR, who therefore are fully hit by the transformation shock, and who are not too close to retirement. Individuals who are later on in education, on maternity leave or retired are excluded completely from the analysis. The goal is to construct a consistent data base excluding individuals who have left the labor market completely. In addition, we exclude a small number of individuals without valid information on those individual characteristics, on which we build our matching estimator. We aggregate the remaining labor market states to the five categories employment, which comprises part– and full–time employment, 21 See table 1 for the number of observations dropped from the sample for each of the reasons described here. 25 unemployment, out of the labor force, training, and job creation. Our outcome variable employment is defined as a binary outcome variable with nonemployment as comprehensive alternative including participation in ALMP. Modeling transitions between unemployment and being out of the labor force is an impossible task. People move occasionally back and forth between the two states in the data and it is not obvious whether the individuals precisely distinguish between unemployment and being out of labor force, since no formal definition of unemployment is given in the questionnaire. The resulting sample consists of 5,165 individuals, involving both participants and nonparticipants in ALMP. Table 2 summarizes participation in ALMP based on our data. The two most important programs, Training (TR) and Job Creation Schemes (JC), were implemented on a large scale. In total, 27% of our sample participated at least once in one of the two programs. 13% (689 cases) participated at least once in JC, however, TR was by far the most important program with a rate of 20% (1,021 cases).22 Multiple participation is quite common in East Germany. After a first TR, a second treatment in TR or JC occurred in 326 cases, i.e. in more than 36 % of the 889 cases in a first treatment in TR.23 Here we focus on TR as the first treatment in ALMP. We observe 9.8% (495 cases) of our sample to participate in a first TR during the first period from 1990 to 1993 and 7.6% (394 cases) to participate during the second period from 1994 to 1999. Note, that our data do not distinguish between further training and retraining. Therefore, the estimated treatment effects represent an average of the two programs. 22 The question in the LMM–SA on training also includes privately financed training. However, calculations based on the German Socioeconomic Panel for East Germany show that a very high share of training is in fact public sector sponsored training (in 1993 more than 88%). 23 In the working paper version Bergemann, Fitzenberger and Speckesser (2005), we estimate effects both for a first and second treatment. We evaluated sequences or increments of multiple treatments using a specific variant of the evaluation approach suggested under section 4. 26 5.2 Descriptive Analysis of Participants and Non-participants Since our analysis estimates the effects of TR participation for a period up to 36 months after the beginning of the participation, only a small number of observations might remain in the sample for outcomes late in the 1990’s. Whereas attrition is absent for TR programs that started in our first observation period from 1990 to 1993, because of the retrospective sampling procedure, there might be a problem for training in the second period for 1994 to 1999. Table 3 shows for the first period that all monthly status information until 36 months after the beginning of the program can be retrieved from the data, both for the treated and the nontreated individuals. Attrition matters for the second period (see table 4), i.e. participation occurs between 1994 and 1999. There is no longer information on the labour market status for every treated individual for 36 months after the beginning of the treatment. The decreasing numbers result mainly from persons surveyed in the years 1997 and 1998 that were participating relatively late in the 1990’s. We lose about one third of the treated individuals after 24 months and about 50% after 36 months. There is considerably less attrition in the control group (based both on the evolution of the kernel weights used for the CDiDS estimator, whose results are described below, and the actual number of nonparticipants), see columns 3 and 4 of table 4 and the detailed explanations at the bottom of table 3. Thus, there is still a very substantial share of both the treated and the untreated persons available until 36 months after beginning of the treatment. In order to keep a sufficiently large sample size, we use the unbalanced sample for the evaluation analysis. This approach can also be justified based on existing evidence in the literature that attrition from a survey does not influence the results concerning the determinants of labor market transition rates (Van den Berg et al., 1994; Van den Berg and Lindeboom, 1998). Based on the sample selection outlined above, our descriptive analysis shows a stark contrast in the labor market participation of participants in a first TR and non-participants. Figure 4 shows the monthly employment status by participation status. Conditioning on employment in June 1990, we observe only few nonparticipants to leave employment until the end of 1990. Even until two years after, there is still an employment rate of 91.2% in June 1992 and remains relatively stable until 27 the mid 1990’s, when still more than 90% of the original cohort is in employment. After 1995, the unemployment rate of the original cohort doubles to almost 15%, and participation in JC increases. Employment falls steadily to 82.7% of the total population of non-participants at the end of the millennium. Ten years after unification, the share of our sample leaving the labor force remains low with less 0.5%.24 In contrast, the participants in a first TR experience an earlier and more dramatic reduction in their employment rates. Two years after the economic and social union, the employment share of the formerly fully employed cohort is already reduced to as low as 56.6%. Since then, the participants showed relatively stable employment rates, reaching still 53.3% at the end of 1999. The massive decline in employment not only resulted in participation in TR, but also in higher participation in JC and higher unemployment. With 16%, open unemployment was already three times the share as for non-participants in December 1992 when the public programs peaked with a participation of 31.8% in either TR or JC. Since then, the participation in programs declined to 15%. Simultaneously, the unemployment rate among participants increased to more than 30%. As for the non-participants, there is not much drop out, eventually reaching 1% at the end of 1999. A closer look into the dynamics of transitions in and out of employment reveals that the dynamics were completely different for participants and non-participants (see figures 5-6). In 1991, 45% of the first cohort of TR participants left employment for at least one month. Since then, exits to nonemployment declined to around 6%. On the other hand entries into employment reached a maximum of 19% in 1993 and declined thereafter, being, however, on average by 2 percentage points higher than exits from employment. Non–participants show a very different pattern. Exit rates from employment for the first cohort of non–participants fluctuates around 7% points, whereas entry rates to employment are around 2 percentage points lower. With respect to the second training period, the exits from employment for the 24 Note that a small fraction of the non-participants also shows participation in public sector sponsored further training. These are participants in a training program after an earlier participation in a public job creation program and thus are not subject to our evaluation of training as the first program. 28 participants increased until 1993 and remained with about 21% on a relatively high level until 1997. Entries into employment fluctuated until 1994 around 3% and increased thereafter to 19% in 1998, dominating exits from employment for the first time. The second cohort of non–participants display exit rates from employment of around 6% (with the exception of 1991), being mainly one percentage point lower than entry rates. These differences between participants and non–participants suggest that an analysis that only controls for the high static differences in nonemployment would be insufficient. Therefore, our CDiDHR estimator introduced in section 4 controls for the differences in the underlying dynamics of participants and nonparticipants. 6 6.1 Empirical Analysis Propensity Score and Matched Samples The goal is to estimate the effect for participation in training as the first program. The treatment probability (propensity score) is estimated by a parametric probit model. Since the data do not provide time–varying information (except for the labor market status), the regressors are the static observable characteristics education, occupational degree, gender, age, residence (at the time of the survey), and interactions of gender and education or occupational degree. We estimate two different probit models one for participating in a first TR during the time 1990 to 1993 and a second for participating in a first TR during the time 1994 to 1999. The group of “nonparticipants” represents the entire sample of individuals who are not participating in a first treatment program in the time period consider but who might be a participant in another program or in another time period. The results of the probit estimate for the propensity score are reported in table 5 and 6. In addition to the propensity score we also match either on the labor force dynamics in the long–run pre–program time period or, in another version of our estimators, on the employment status in the month before treatment starts for those who are non–employed in this month. We only present the evaluation results jointly for men and women. In our case separate estimations of the program effects did not show 29 significant differences by gender.25 Using a bootstrap estimator for the covariance matrix of the estimated treatment effects, we capture the estimation error in the propensity score. The post-program evaluation period starts with the beginning of the participation in training. This approach views treatment as one realization of the nonemployment state with the treated searching for a job. Since the participant might be enrolled in training for a duration of several months up to two years, the effects after program beginning include a lock–in effect caused by the program itself, i.e. the time spent in the program is likely to cause an increase of the nonemployment probability for the treatment group in the early months of our outcome period. The start of the evaluation period depends upon the outcome variables considered. For employment rates and reemployment probabilities, the evaluation period starts one month after the first month of the treatment. For probabilities of remaining employed, the evaluation period starts one month later than for the other two outcome variables, since we first have to observe employed former participants. We choose the length of the evaluation period to be 36 months (as far as being observed in the data set – otherwise set to missing). With respect to earnings we can not construct such a stringent evaluation period, as we only have earnings information for the time period from 1997 to 1999. However, when estimating earnings effects, we take the elapsed time period since start of training into consideration. For the alignment of the CDiD estimators in the preprogram period, we start 18 months before the beginning of the treatment (excluding Ashenfelter’s Dip). Based on the estimated propensity score and the preprogram employment history, we construct matched samples of participants and comparable “nonparticipants”. Alignment occurs in the same calendar month, i.e. we match individuals in the same calendar month in order to eliminate common time effects. The characteristics and outcomes of matched nonparticipants are the fitted values obtained by the local linear kernel regression of characteristics and outcomes, respectively, in the sample of nonparticipants as a whole. Note that the results of our cross validation procedure for the bandwidth choice involve a bandwidth hp for the propensity score 25 The results of these estimations are available upon request. 30 growing to infinity in all settings for participation in the first time period 1990 to 1993. In contrast, we obtain a finite hp for the second time period 1994 to 1999. When matching on the employment history, we obtained a finite bandwidth hm in all cases. Thus, for the first time period, it turns out that it is not necessary to match upon the propensity score – and therefore not on the observable time–invariant covariates. This is important to keep in mind. Table 7 and 8 provide evidence on the balancing properties in the matched samples when taking account of state and duration dependence. The first column shows the average characteristics in the whole sample. The remaining columns show the average characteristics conditional upon employment state in the previous month during the time period under consideration. For example, when calculating the average characteristics for the previously nonemployed, the individual contribution to the mean characteristics is weighted by the number of months the individual’s state was nonemployment during the time period under consideration. Table 7 and 8 show that participants are younger than the nonparticipants. Higher skilled individuals are more likely to participate in the training period 1990 to 1993 than lower skilled. Similarly women are likely to receive training than men in this time period. There are no clear pattern with respect to skills and gender for training in the time from 1994 to 1999. The matching process balances reasonably well the characteristics of the participants and the matched nonparticipants conditional on state and duration dependence. The balancing works especially well with respect to the short run employment dynamics before training in the second period. For example the employment probability for those nonparticipants who were previously nonemployed is 16%, whereas matching reduces the employment probability of to 3%. This is very close to the employment probability of 2% of the previously nonemployed participants. Balancing also works well with respect to age for the second training period and conditional on being previously nonemployed. Also, the regions seem fairly well balanced. However, the category skilled worker does not seem so well balanced as well as the employment probability in the month before (potential) participation during 1990 to 1993 for those who are previously nonemployed. 31 Furthermore, table 7 and 8 sheds some light on the differences in characteristics across employment states in the previous month. Male participants are more likely to belong to the group of previously employed participants than women. The skill distribution or participants in TR in 1990 to 1993 is quite different depending on whether they are previously employed or nonemployed. Previously employed participants have on average a higher education. Furthermore, previously employed participants in TR in 1994 to 1999 are younger than previously nonemployed. We conclude that our matching approach balances participants and nonparticipants fairly well. This holds even for the first period where the results of our cross–validation procedure implies that matching of the time–invariant covariates is not necessary. 6.2 Specification of Outcome Equation In the matched samples, the CDiDS estimators are based on a flexible linear model for the employment dummy as outcome variable. For CDiDHR, the model is estimated separately depending on the employment state in the month before, thus modeling transition rates. The state of nonemployment includes the participation in ALMP programs so that previous and subsequent participation in a program are both accounted for as nonemployment. We estimate an average employment effect of a program relative to all possible nonemployment states for the treated individuals thus estimating TT (with CDiDHR conditioned on the employment status in the previous month). We assume that participant i starts treatment in period τi and, in the following, we will omit the index i for ease of notation. We consider the employment outcome Y during the evaluation period t1 = 1, . . . , 36. The definition of t1 depends on the success criterion. For the unconditional employment probability or the reemployment probability being the outcome variable, the evaluation starts with the beginning of the program, t1 is measured relative to τ , e.g. t1 = 1 corresponds to month τ + 1 and t1 = −1 corresponds to τ − 1. For the probability of remaining employed, t1 is measured relative to τ + 1 during the evaluation period. The following estimators are applied separately for the two training periods 1990 to 1993 and 1994 to 1999. 32 The matching estimator, which matches on the long–run preprogram employment history, is implemented in the following way: 1 Yi,τ +t1 − (6) X 0 wi,j Yj,τ +t1 = 36 X δs I(t1 = s) + νi,τ +t1 s=1 j where I(.) denotes the indicator function and νi,τ +t1 is the error term. For the first time period, we also add dummies indicating the exact start year of the training as additional regressor in equation (i.e. we let the treatment effect vary by τ ), when the employment rate is the outcome variable. For earnings as outcome variable, we can only implement the matching estimator as in equation (6) just accounting for the years s (instead of months) since the start of the treatment. For CDiDS (sample of all participants) and CDiDHR (separately for the two employment states in the previous month), we estimate the following three steps (only for employment as outcome variable): 1. We calculate the average long–run preprogram difference between participant i (treatment starts in τ ) and comparable nonparticipants as âi,τ = 1 18 − ad(τ ) −ad(τ )−1 X 0 (Yi,τ +t0 − t0=−18 X 0 wi,j Yj,τ +t0 ) . j 2. Then, âi,τ is subtracted from the difference during the evaluation period resulting in the following model to estimate the treatment effects (7) 1 Yi,τ +t1 − X 0 wi,j Yj,τ +t1 − âi,τ = 36 X δs I(t1 = s) + νi,τ +t1 s=1 j Analogous to the matching estimator, we add dummies indicating the exact start year of the training as additional regressor in equation (7) for the first training period. 3. We also report the averages over the long–run preprogram difference âi,τ separately for different training periods to illustrate how the average long–run preprogram differences (≡ residual selection effect due to permanent individual specific effects) between participants and nonparticipants after matching depend upon the timing of the program. 33 We define: ad(τ ) month before the beginning of the program when Ashenfelter’s Dip starts depending upon τ , δs coefficients modeling the TT effect relative to the time since treatment started (s), and wi,j weights implementing local linear kernel regression on the estimated propensity score and, depending on the type of estimator, depending on the short–run or long–run preprogram employment history (see section 4). All three estimators take account of the possibility that the effect of the program depend upon the time since treatment (t1 > 0) and the beginning of the program τ . On the one hand, we estimate the effect δs separately for each month s since treatment. On the other hand, we allow the effect to to depend in a flexible way upon τ , by separately estimating the effects for the two training periods and including yearly dummies for effect estimates for training in the first time period. The length of Ashenfelter’s Dip, ad(τ ), (the preprogram dip in employment) is allowed to depend upon the time when the program starts. During the period shortly after unification, it is likely that the dip is fairly short because program participation could not have been anticipated long before and participation rules were not applied in a strict way. This changed with the occurrence of high unemployment in the mid 90’s. A visual inspection of the average employment differences between treated and matched controls before and after the program as a function of the time when the program starts indicates that the dip lasted at least one to two months in 90/91 and increases over time to at least six months for training. Before November 90, we set ad(τ ) = −1. Between November 1990 and July 1994, ad(τ ) increases linearly in absolute value from 2 months to 6 months, where ad(τ ) is rounded to the nearest integer. After July 1994, ad(τ ) remains constant. In order to avoid a potential ’Fallacy of alignment’, we are conservative and take Ashenfelter’s Dip as fairly long. Our subsequent results imply that our approach to account for Ashenfelter’s Dip does not work in a satisfactory way for the matching estimator and the CDiDS estimator of employment effects. In contrast, as we argue in section 4, the CDiDHR estimates seem more credible in light of Ashenfelter’s Dip. 34 6.3 Employment Differences in Matched Samples Next, we discuss the employment differences in the matched samples for the two different CDiD estimators in order to investigate whether the long–run preprogram differences are fairly stable. The results for the time of Ashenfelter’s Dip are left out because this is excluded for CDiD. Also estimates that are based on less than 20 observations are explicitly discarded (this applies to the lower panel in figure 9). Figure 7 shows the average differences in employment rates for the matched samples that build the basis for CDiDS, separately for the two different training periods. Time relative to the start of TR is depicted at the horizontal axis. Right after the beginning of treatment, employment rates of the participants are between 45 and 25 percentage points (ppoints) lower than for comparable nonparticipants. There is a noticeable recovery for the participants afterwards – basically the time path reflects the changes for participants since employment rates for nonparticipants change fairly little in comparison – but the difference comes nowhere close to zero except at the end of the evaluation period for TR after 1993. For the first period 1990 to 1993, the long–run preprogram differences are slightly below 10 ppoints and fairly stable. For the second period after 1993, they decline considerably from 40 ppoints at month -18 to 20 ppoints at month -7. Thus, for the second period, we do not find parallel long–run preprogram differences which would be a prerequisite for the CDiDS estimator. This suggests that CDiDS estimates for the second period are not valid. It is evident that the CDiDS estimates will imply strong negative employment effects (as we will see in the following). The continuous decline before the program in the first period and the recovery process after the program suggest that employment rates do not adjust instantaneously, suggesting that employment dynamics play an important role. In contrast to the employment rates for the second period, the differences in the transition rates in figures 8 and 9 do not display a trend for the long–run preprogram differences (excluding the period of Ashenfelter’s Dip). The differences in the matched samples for both outcomes, the reemployment probability and the probability of remaining employed, fluctuates around a stable average preprogram difference, indicating that our CDiDHR approach with alignment on an average pre35 program difference is valid. Furthermore, these long–run preprogram differences in transition rates are very small in comparison to the differences in employment levels. 6.4 Estimated Treatment Effects First, we discuss the employment and earnings effects estimated by the matching estimator. We then proceed to the CDiD estimates where the outcome variable consists of either the employment rate (CDiDS) or the transition rates (CDiDHR). We mainly rely on graphical illustrations and we report only point estimates representing at least 20 observations. Further detailed results are available upon request. 6.4.1 Matching Estimator for Employment and Earnings Figure 10 depicts the estimated employment effects based on the matching estimator. This estimator matches on the long–run preprogram employment history. The graphs show a thick line representing the estimated effects on the employment probability. The surrounding dotted lines around this line represent the 95%–confidence interval that is based on bootstrap covariance estimates.26 In both the early as well as the later 1990’s, the employment effects of training prove significantly negative. However, the negative employment effects became weaker over time, whereby the recovery is stronger for training starting in the first period. While in both periods the employment effects are about -70 ppoints in the first months after training started, the effect is less negative at the end of the first period (-18 ppoint) than at the end of the second period (-35 ppoint). All this would imply that training results in a considerable reduction in employment rates, which is a common result found in the literature using survey data when matching is based on observable characteristics (see the survey in Speckesser 2004, chapter 1). Table 9 reports the estimated earnings effects pooled for 1997 to 1999 by the time since the beginning of the treatment based on the matching estimator. Again, we find also negative point estimates up to eight years after the beginning of the 26 When comparing the bootstrap standard errors to conventional heteroscedasticity consistent standard errors, we find that bootstrap standard errors are higher. This is also the case for the CDiDS and CDiDHR estimates. 36 treatment which are significant in most years. The negative effects are reduced over time, but we can not say whether this is due to a recovery process or whether the effects differ by the time of the treatment. Unfortunately our data set does not allow us to distinguish the two. The results based on the matching estimator suggest negative earnings effects which is in line with the estimated employment effects. However, the estimates are subject to the same criticism as the matching estimator for employment because both ignore the dynamics in employment and earnings. 6.4.2 CDiDS Results for Employment Figure 11 displays the CDiDS employment effects. This estimator is able to take account of selection on the basis of unobserved characteristics (see section 4.1). The labor market dynamics are recognized by matching on the nonemployment status in the month before training starts. Figure 11 depicts next to the treatment effect also α, the average long–run preprogram differences. As could be expected from the discussion on the employment differences in the matched sample for CDiDS, α turns out to be positive and the effect estimates negative. If compared to the estimates of the matching approach, the CDiDS effects are similar in two aspect: they are always significantly negative and display a similar recovery process. They differ, however, slightly in the strength of the negative effect. The CDiDS estimates are not as negative as the results received by matching. However, the qualitative nature of results is the same. 6.4.3 CDiDHR Results for Transition Rates The CDiDHR estimates explicitly take into account the state dependence and duration dependence in the employment process. The outcome variable used is either the probability of exiting nonemployment for the previously nonemployed or the probability of remaining employed for the previously employed. Figure 12 illustrate the estimated TT for the reemployment probability. We show employment effects separately for three different time periods in order to give additional insides into possible changes of treatment effect. We distinguish programs 37 that start 1) between 1990 and 1991, 2) in 1993, and 3) after 1993. For 1990 and 1991, we find positive employment effects during the evaluation period, which are mostly significant. For example, one year after the program start the participants have a 5 ppoints higher reemployment probability than they would have had, had they not participated. These positive effects of the participation in training vanish for programs starting after 1993. In the second training period after 1993, the effect sometimes takes negative values, which are significant shortly after the program started. This is not too surprising because one would expect a reduced search effort when the program has just started (lock–in effect). The long–run preprogram difference is significantly negative shortly after reunification (-5 ppoints), becomes less negative over time, and is effectively zero after 1993. This is in contrast to the CDiDS results where the long–run preprogram differences increase from the first to the second period. Figure 13 provides results for the probability of remaining employed where the evaluation period starts two months after the beginning of the program. The estimated effect is close to zero for programs that start between 1990 and 1991. However, for 1994 to 1999, the effect becomes significantly positive in some cases. For example, one year after the program started the probability of remaining employed increases by approximately 6 ppoints. Shortly after reunification, the long–run preprogram difference is slightly negative and significant. It becomes more negative in later periods (-7 ppoints for programs that started in December 1996). The CDiDHR estimates differ strongly from the results for the matching estimator and the CDiDS estimator. The latter ignore the state dependence and the duration dependence of employment and therefore, they do not capture the fact that even without any treatment effect a sluggish employment recovery would have to be expected after the start of training. There is also evidence that the selectivity of the treated individuals regarding permanent differences across individuals differs by employment state. Also this aspect can not be captured by the matching estimator and by the CDiDS estimator. Thus, we conclude that the CDiDHR results are our preferred estimates. Furthermore, these consideration also cast some doubts on the validity of the estimate of the earnings effects of training. 38 Why do the CDiDHR results differ between the two transition rates? This could be driven by changes in the content of the training programs over time. Shortly after unification a large part of training consisted of short courses mainly aiming at increasing the participant’s placement potential, as described in section 3.2. This could be an explanation for the positive effect on the reemployment probability. However, later on, the composition of training courses changed towards longer courses intended to provide substantive skills. These additional skills could improve the quality of the match between participants and employers, thus increasing the employment stability, once a participant finds a job. However, these additional skills do not seem to help in finding a job at a faster rate. Also, changes in the search behavior of East Germans due to a better understanding of the labor market and the benefit system in unified Germany might play a role in the differences. Shortly after unification, unemployed East Germans, not being used to a labor market in a market economy, probably tended to accept new jobs quickly with little regard to the quality of the job (f.e. job stability). As a result, a positive effect of training programs might show up in an increase in their reemployment probability rather than in an increase of the probability of remaining employed. Later on, individuals searching for a job perhaps became more aware of the importance of finding a ‘good’ job, which is not only important for their job stability, but also for the level of potential future unemployment benefits, which depend on the earnings in the last job. In addition, the entitlement for transfer payments is prolonged by taking part in a training program for some time after the program, lowering the opportunity costs of job search for participants compared to other unemployed individuals. Thus, participants tended to search longer to find a ‘better’ job match resulting in a positive effect on the probability of remaining employed. Another feature of the results which should be explained are the changes in the long–run preprogram differences. The CDiDHR estimator matches participants and nonparticipants month by month conditional on having the same employment status in the previous month. Shortly after unification the labor market was quite turbulent. Everybody faced a high unemployment risk , resulting in a relatively small 39 long–run preprogram difference in the probability of remaining employed. However, some individuals quickly found another job and did not participate in a training program, leading to a large long–run preprogram difference in the reemployment probability at the begin of the 90’s. Later on, unemployment became more persistent. The difference in transitions out of nonemployment between participants and nonparticipants became less pronounced.27 The change in the long–run preprogram differences in the probability of remaining employed could reflect the stricter targeting of labor market policy on individuals with previous unemployment experience. Our CDiDHR results are mostly in line with the results in Fitzenberger and Prey (2000) for East Germany in the early 1990’s. The latter study analyzes the effects of training on state dependent employment rates based on a completely parametric model. Recent studies for East Germany using administrative data (Fitzenberger and Speckesser, 2007; Lechner et al., 2008) use matching estimators for the effects of training on employment rates after treatment had started. These studies do not distinguish state dependent transition rates. They estimate employment effects of training for an inflow sample into unemployment in 1993 and 1994 conditional on having been unemployed for a certain time. These analyses capture the employment dynamics before treatment start in a careful way but the studies do not address the issue of Ashenfelter’s dip explicitly. Both studies find a negative lock–in effect, which also involves the recovery period to be expected, and mostly positive long–run employment effects afterwards. The latter are consistent with our positive effects on the probability of remaining employed. Focussing on transition rates allows decomposing the effect on employment directly and seems appropriate in light of the strong labor market dynamics in East Germany, especially in the early 1990’s. Both studies do not provide special treatment for the case that program participants move directly from employment to training and, due to data limitations, they do not analyze training programs starting before 1993. 27 Note that this explanation of the changes in the long–run preprogram difference does not violate the assumption of permanent fixed effects since participants change over time. 40 7 Conclusions This paper investigates the employment effects of first participation in Public Sponsored Training in East Germany after the German reunification. Our study makes methodological progress, particularly regarding modeling the dynamic employment process in the context of program evaluation. Modeling employment as a state–dependent outcome variable, we develop a new semiparametric conditional difference–in–differences estimator for the treatment effect. We use the transition rates between employment and nonemployment as outcome variables. We account for Ashenfelter’s Dip caused by anticipation effects and institutional program participation rules. To start with, we find negative effects of training on unconditional employment rates. However, taking account of state dependency in employment, training shows zero or positive effects. Concerning training programs which took place shortly after reunification, we find positive program effects on the reemployment probability. For programs starting in the mid 1990s, we find some positive program effects on the probability of remaining employed. Our results indicate that modeling transition rates is more appropriate and more informative than using unconditional employment rates in such a turbulent economic environment as in East Germany shortly after reunification. Using only employment rates as an outcome might result in misleading conclusions concerning the effectiveness of ALMP programs. We also find some differences in treatment effects on the probability of remaining employed and on the reemployment probability depending upon the time period under investigation, probably as a result of institutional changes during the 1990s. Overall, our results are more positive than previous results in the literature and it is unlikely that training on average reduces the future employment chances of participants. Unfortunately, we can not assess to what extent general equilibrium effects influence our results. Our partial estimates represent average treatment effects for a randomly selected individual in the treatment group holding treatment status of all others constant. General equilibrium effects (or macro effects) may arise because programs have indirect effects on both participants and nonparticipants (Calmfors, 41 1994). Participation in training programs was very different in the two time periods considered, possibly resulting in diminishing returns in program effects after a time of high participation. The differences in targeting over time changed the group of nonparticipants most affected via substitution effects. In addition to having access to cost data, which we do not have, the knowledge of equilibrium effects is necessary to investigate whether training programs are cost effective (efficient). Albrecht et al. (2008) find for a large adult education program in Sweden that equilibrium effect work in favor of the participants, i.e. the fraction of vacancies tailored toward the medium–skilled workers increased almost one–to–one. In contrast, results for make–work–pay programs in Canada are much less favorable (Lise et al., 2005). Clearly, in a case as ours, where training programs were implemented at a large scale, additional research on general equilibrium effects would be very rewarding, especially regarding policy advice. However, typically the investigation of general equilibrium effects requires the use of structural models which is a challenging task because of the rapidly changing economic environment in East Germany. This important task is left for future research. 42 References Abadie, A. (2005). “Semiparametric Difference–in–Differences.” Review of Economic Studies 72:1-19. Abadie, A. and G. Imbens (2006). “On the Failure of the Bootstrap for Matching Estimators.” NBER Technical Working Paper No. 325. Cambridge (Mass.): NBER. Abbring, J., and G.J. van den Berg (2003). “The Nonparametric Identification of Treatment Effects in Duration Models.” Econometrica 71:1491–1517. Akerlof, G., A. Rose, J. Yellen and H. Hessenius (1991).“East Germany in from the Cold: The Economic Aftermath of Currency Union.” Brookings Papers for Economic Activity 1–101. Albrecht, J., G.J. van den Berg and S. Vroman (2008).“The Aggregate Labor Market Effects of the Swedish Knowledge Lift Program.”Review of Economic Dynamics, forthcoming. Ashenfelter, O. (1978). “Estimating the Effect of Training Program on Earnings.” Review of Economics and Statistics 60:47–57. Ashenfelter, O. and D. Card (1985).“Using the Longitudinal Structure of Earnings to Estimate the Effect of Training Programs.” The Review of Economics and Statistis 67:648–660. Bergemann, A., B. Fitzenberger, B. Schultz and S. Speckesser (2000). “Multiple Active Labor Market Policy Participation in East Germany: An Assessment of Outcomes.” Konjunkturpolitik 51(Suppl.):195–244. Bergemann, A. and G.J. van den Berg (2007).“Active Labor Market Policy Effects for Women in Europe - A Survey.” Annales d’ Economie et de Statistique, forthcoming. Blundell, R., M. Costa Dias, C. Meghir, and J. Van Reenen (2004). “Evaluating the Employment Effects of a Mandatory Job Search Program.” Journal of the European Economic Association 2:569–606. Bundesanstalt für Arbeit (1991, 1993, 1997, 2001). Berufliche Weiterbildung. Nürnberg: Bundesanstalt für Arbeit. Bundesanstalt für Arbeit (2002, 2005). Arbeitsmarkt. Nürnberg: Bundesanstalt für Arbeit. Bundesanstalt für Arbeit (2003). Geschäftsbericht 2002. Nürnberg: Bundeanstalt für Arbeit. Bundesministerium für Verkehr, Bau und Stadtentwicklung (2005). Jahresbericht 2005 zum Stand der Deutschen Einheit. Berlin: Bundesministerium für Verkehr, Bau und Stadtentwicklung. Calmfors, L. (1994). “Active Labour Market Policy and Unemployment - a Framework for the Analysis of Crucial Design Features” OECD Economic Studies 22:7–47 Eichler, M. and M. Lechner (2001). “Public Sector Sponsored Continuous Vocational Training in East Germany: Institutional Arrangements, Participants, and Results of Empirical Evaluations.” In R.T. Riphahn, D. Snower and K. Zimmermann (eds.), Employment Policy in Transition: The Lessons of German Integration for the Labor Market. Heidelberg: Springer, 208–253. 43 Eichler, M. and M. Lechner (2002). “An Evaluation of Public Employment Programmes in the East German State of Sachsen-Anhalt.” Labour Economics 9:143–186 Fay, R. (1996). “Enhancing the Effectiveness of Active Labour Market Policies: Evidence from Programme Evaluations in OECD Countries.” OECD Labour Market and Social Policy Occasional Papers No 18. Paris: OECD. Fitzenberger, B. and H. Prey (2000). “Evaluating Public Sector Sponsored Training in East Germany.” Oxford Economic Papers 52:497–520. Fitzenberger B. and S. Speckesser (2007). “Employment Effects of the Provision of Specific Professional Skills and Techniques in Germany.” Empirical Economics 32:529–573. Fredriksson, P. and P. Johanson (2003) “Program Evaluation and Random Program Starts.” IFAU Working Paper 2003:1. Uppsala: IFAU. Frölich, M. (2004). “Finite Sample Properties of Propensity–Score Matching and Weighting Estimators.” Review of Economics and Statistics 86:77–90. Hagen, T. and V. Steiner (2000). Von der Finanzierung der Arbeitslosigkeit zur Förderung der Arbeit. ZEW Wirtschaftsanalysen, 51. Baden–Baden: Nomos Verlagsgesellschaft. Heckman, J., H. Ichimura, and P. Todd (1998). “Matching as an Econometric Evaluation Estimator.” Review of Economic Studies 65:261–294. Heckman, J., H. Ichimura, J. A. Smith and P. Todd (1998). “Characterizing Selection Bias using Experimental Data.” Econometrica 65:1017–1098. Heckman, J., R. J. LaLonde, and J. A. Smith (1999). “The Economics and Econometrics of Active Labor Market Programs.” In: O. Ashenfelter and D. Card (eds.), Handbook of Labor Economics. Vol. 3 A, Amsterdam: Elsevier Science, 1865–2097. Heckman, J. and J. A. Smith (1999). “The Preprogram Earnings Dip and the Determinants of Participation in a Social Program: Implications for Simple Program Evaluation Strategies.” Economic Journal 108:313–348. Hunt, J. (2006). “The Economics of German Reunification.” In: S. Durlauf and L. Blume (eds.), New Palgrave Dictionary of Economics and Law. London and Basingstoke: Palgrave Macmillan. Ichimura, H. and O. Linton (2001). “Asymptotic Expansions for some Semiparametric Program Evaluation Estimators.” CeMMAP Working Paper CWP04/01. London: CeMMAP. Imbens, G. (2000): The Role of the Propensity Score in Estimating Dose–Response Functions” Biometrica 87:706-710. Institute (2003). Zweiter Fortschrittbericht wirtschaftswissenschaftlicher Institute über die wirtschaftliche Entwicklung in Ostdeutschland. Halle: Institut für Wirtschaftforschung Halle. Kempe W. (2001). “Neuer Trend in der Bildungsstruktur der Ost-WestWanderung?” Wirtschaft im Wandel 9:205–210. 44 Ketzmerik, T. (2001). “Ostdeutsche Frauen mit instabilen Erwerbsverläufen am Beispiel Sachsen–Anhalts.” Forschungsbericht aus dem zsh 01–1. Halle(Saale): ZSH. Kluve, J. H. Lehmann, and C. Schmidt (2004). “Disentangling Treatment Effects of Labor Market Histories: the Role of Employment Histories.” RWI Discussion Paper. Essen: RWI. Kluve, J., H. Lehmann, and C. Schmidt (1999). “Active Labor Market Policies in Poland: Human Capital Enhancement, Stigmatization, or Benefit Churning?” Journal of Comparative Economics 27:61–89. Kluve, J., and C. Schmidt (2002). “Can Training and Employment Subsidies Combat European Unemployment?” Economic Policy. 35:411–448. Kraus, F., P.A. Puhani, and V. Steiner (1999). “Employment Effects of Publicly Financed Training Programs, The East German Experience.” Jahrbücher für Nationalökonomie und Statistik 219:216–248. Lechner, M. (2000). “An Evaluation of Public Sector Sponsored Continuous Vocational Training Programs in East Germany.” The Journal of Human Resources 35: 347–375 Lechner, M. (2001). “Identification and Estimation of Causal Effects of Multiple Treatments under the Conditional Independence Assumption.” In: M. Lechner and F. Pfeifer (eds.), Econometric Evaluation of Active Labor Market Policies in Europe. Heidelberg: Physica–Verlag, 43–58. Lechner, M. and R. Miquel (2001). “A Potential Outcome Approach to Dynamic Programme Evaluation – Part I: Identification.”SIAW Discussion Paper 2001– 07. St. Gallen: SIAW. Lechner, M., R. Miquel, and C. Wunsch (2008). “The Curse and Blessing of Training the Unemployed in a Changing Economy: The Case of East Germany after Unification.” German Economic Review, forthcoming. Lise, J., S. Seitz and J. Smith (2005). “Equilibrium Policy Experiments and the Evaluation of Social Programs.” University of Maryland Working Paper, College Park: University of Maryland. Lubyova, M. and J.C. van Ours (1999). “Effects of Active Labour Market Programs on the Transition Rate from Unemployment into Regular Jobs in the Slovak Republic.” Journal of Comparative Economics. 27:90–112. Martin, J.P. and D. Grubb (2001). “What Works and for Whom: A Review of OECD Countries’ Experiences with Active Labour Market Policies.” Swedish Economic Policy Review 8:9–56. OECD (1994). The OECD Jobs Study. Facts, Analysis, Strategies. Paris: OECD. Pagan A. and A. Ullah (1999). Nonparametric Econometrics. Cambridge: Cambridge University Press. Prognos AG (1993). Die Bundesrepublik Deutschland 2000- 2005- 2010, Entwicklung von Wirtschaft und Gesellschaft. Prognos Deutschland Report, Basel: Prognos. Puhani, P.(1999). “Estimating the Effects of Public Training on Polish Unemployment by Way of the Augmented Matching Function Approach.” ZEW Discussion Paper No. 99-38. Mannheim: ZEW. 45 Rosenbaum, P. R. and D.B. Rubin (1983). “The Central Role of the Propensity Score in Observational Studies for Causal Effects.” Biometrika 70:41–55. Roy, A.D. (1951). “Some Thoughts on the Distribution of Earnings.” Oxford Economic Papers 3:135–146. Rubin, D. B. (1974). “Estimating Causal Effects of Treatments in Randomized and Nonrandomized Studies.” Journal of Educational Psychology 66:688–701. Seifert, B. and T. Gasser (1996). “Finite–Sample Variance of Local Polynomials: Analysis and Solutions.” Journal of the American Statistical Association 91:267–275. Sianesi, B. (2004). “An Evaluation of the Swedish System of Active Labor Market Programs in the 1990s.” Review of Economics and Statistics. 86:133–155. Speckesser, S. (2004). Essays on Evaluation of Active Labour Market Policy, Dissertation, Department of Economics and Law, Mannheim: University of Mannheim. Van den Berg, G.J., M. Lindeboom and G. Ridder (1994).“ Attrition in Longitudinal Panel Data and the Empirical Analysis of Labor Market Behavior.” Journal of Applied Econometrics 9: 421–435. Van den Berg, G.J. and M. Lindeboom (1998).“ Attrition in Panel Survey Data and the Estimation of Multi-State Labor Market Models.” Journal of Human Resources 33: 458–478. van Hagen, J. and R. Strauch (2001). “East Germany: Transition With Unification. Experiments and Experience.” In M. Blejer (ed.) Transition: The First Decade. Boston: MIT Press, 87–120. Wurzel, E. (2001). “The Economic Integration of Germany’s New Länder.” OECD Economics Department Working Paper No. 307. Paris: OECD. Wunsch, C. (2006). “Labour Market Policy in Germany: Institutions, Instruments and Reforms since Unification.” Working Paper of University of St. Gallen. St. Gallen: University of St. Gallen. 46 Appendix Table 1: Sample Selection Selection Criteria Resulting Number of Observations Fully observed labor market history and year of birth 10,715 Aged between 25 and 50 years in January 1990 6,088 Employed in June 1990 5,529 Not in Education after June 1990 5,480 Not in Maternity Leave after June 1990 5,334 Not retired after June 1990 5,224 Final sample: with valid information on relevant covariates 5,165 Table 2: Program Participation in the LMM–SA during 1990 and 1999a One Program At least once As first program As first program Job Creation Scheme Training 13.3 (689) 19.8 (1,021) 9.4 (484) 17.2 (889) Training in 1990-1993 Training in 1994-1999 9.8 (495) 7.6 (394) a The numbers represent the participation rates and in brackets the absolute number of observation. 47 Table 3: Attrition of participation group and naı̈ve control group Period 1990-93 Time relative to beginning of treatment 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 Number of par- % participants Average % of ticipants relative to t1=1 kernel weight relative to t1=1a 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% 482 100% 100% Average % size of control relative to t1=1b 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% a: This column describes the share of the sum of the kernel weights for matching which are available after the beginning of treatment relative to the total sum of kernel weights available at t1 = 1 (≡the beginning). This sum of kernel weights for CDiDS across all nonparticipants j is P P calculated separately for all treated individuals i as i j∈{D6=1} Kij (t1) · Ij (t1) at time t1 since the beginning of treatment, where Ij (t) is a dummy variable for individual j being still observed in the data at t1. b: This column describes the attrition in the actual number of nonparticipants to match with. 48 Table 4: Attrition of participation group and naı̈ve control group Period 1994-99 Time relative to beginning of treatment 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 Number of par- % participants Average % of ticipants relative to t1=1 kernel weight relative to t1=1a 385 100% 100% 381 99% 100% 376 98% 99% 370 96% 99% 368 96% 99% 368 96% 98% 363 94% 97% 359 93% 97% 351 91% 95% 350 91% 95% 346 90% 94% 336 87% 93% 332 86% 92% 325 84% 91% 322 84% 91% 319 83% 91% 314 82% 90% 310 81% 89% 305 79% 88% 297 77% 88% 290 75% 87% 282 73% 87% 273 71% 87% 266 69% 86% 264 69% 84% 255 66% 84% 248 64% 84% 246 64% 84% 242 63% 83% 238 62% 83% 231 60% 81% 223 58% 80% 210 55% 80% 202 52% 79% 197 51% 78% 188 49% 78% a, b: see table 3. 49 Average % size of control relative to t1=1b 100% 100% 99% 99% 99% 98% 97% 96% 95% 94% 94% 93% 91% 90% 90% 90% 90% 89% 88% 87% 86% 86% 86% 86% 84% 83% 84% 83% 82% 82% 81% 80% 79% 79% 78% 77% Table 5: Propensity Score Estimation Variable Participation in Training as a First Program in ALMP during 1990-1993 Coef. (s.e.) mean num. derivative Constant -1.302 ( 0.197 ) Age in 1990: Age 25–34 is omitted category Age 35–44 -0.151 ( 0.055 ) -0.027 Age 45–50 -0.396 ( 0.071 ) -0.061 Labor Market Region: Dessau is missing category Halberstadt -0.228 ( 0.106 ) -0.041 Halle -0.346 ( 0.090 ) -0.058 Magdeburg -0.219 ( 0.083 ) -0.039 Merseburg -0.172 ( 0.094 ) -0.032 Sangerhausen -0.107 ( 0.101 ) -0.021 Stendal -0.187 ( 0.109 ) -0.034 Wittenberg -0.135 ( 0.125 ) -0.026 Professional education (all): Unskilled, semi–skilled or other skills are missing category Skilled Worker 0.028 ( 0.194 ) 0.004 Craftsman -0.164 ( 0.226 ) -0.021 Technical college 0.234 ( 0.214 ) 0.039 University education 0.217 ( 0.196 ) 0.035 Professional education (women) Skilled worker 0.559 ( 0.076 ) 0.093 Craftswoman 0.998 ( 0.215 ) 0.174 Technical college 0.169 ( 0.125 ) 0.028 University education 0.270 ( 0.094 ) 0.047 50 Table 6: Propensity Score Estimation Variable Participation in Training as a First Program in ALMP during 1994-1999 Coef. (s.e.) mean num. derivative Constant -1.622 ( 0.198 ) Age in 1990: Age 25–34 is omitted category Age 35–44 0.012 ( 0.059 ) 0.002 Age 45–50 -0.095 ( 0.071 ) -0.013 Labor Market Region: Dessau is missing category Halberstadt 0.083 ( 0.114 ) 0.012 Halle 0.107 ( 0.097 ) 0.015 Magdeburg 0.046 ( 0.094 ) 0.006 Merseburg 0.022 ( 0.106 ) 0.003 Sangerhausen 0.150 ( 0.110 ) 0.022 Stendal -0.163 ( 0.130 ) -0.019 Wittenberg -0.102 ( 0.147 ) -0.013 Professional education (all): Unskilled, semi–skilled or other skills are missing category Skilled Worker 0.121 ( 0.189 ) 0.016 Craftsman 0.088 ( 0.210 ) 0.011 Technical college 0.209 ( 0.210 ) 0.029 University education 0.115 ( 0.193 ) 0.015 Professional education (women) Skilled worker 0.237 ( 0.076 ) 0.140 Craftswoman 0.326 ( 0.224 ) 0.053 Technical college -0.137 ( 0.129 ) -0.019 University education -0.098 ( 0.108 ) -0.012 51 Table 7: Balancing Properties of Matching for CDiDHR with respect to Participation in Training during 1990-1993 Variable Age 25–34 Age 35–44 Age 45–50 Dessau Halberstadt Halle Magdeburg Merseburg Sangerhausen Stendal Wittenberg Unskilled, semi–skilled and other skills Skilled Worker Craftsman Technical college University education Female Female unskilled worker Female skilled worker Female Craftsman Female and technical college Female and university education Employment probability before (potential) participation Means of Variable in Subgroups All Nonpar– Parti– Matched Nonpar– Parti– Matched ticipants cipants Nonpart. ticipants cipants Nonpart. uncond. averaged over prev– averaged over prev– iously nonemployed iously employed 0.37 0.33 0.51 0.43 0.37 0.45 0.44 0.40 0.38 0.37 0.39 0.40 0.43 0.41 0.23 0.28 0.13 0.19 0.23 0.13 0.17 0.12 0.13 0.13 0.17 0.11 0.16 0.15 0.09 0.08 0.13 0.06 0.09 0.08 0.08 0.19 0.17 0.09 0.13 0.20 0.15 0.15 0.24 0.23 0.19 0.21 0.24 0.23 0.24 0.13 0.15 0.18 0.17 0.13 0.15 0.14 0.10 0.10 0.10 0.13 0.09 0.10 0.11 0.08 0.09 0.10 0.09 0.08 0.08 0.08 0.05 0.05 0.08 0.05 0.05 0.04 0.06 0.02 0.43 0.08 0.19 0.27 0.48 0.01 0.21 0.01 0.13 0.11 0.09 0.53 0.08 0.15 0.16 0.60 0.06 0.35 0.02 0.10 0.07 0.03 0.47 0.05 0.27 0.17 0.75 0.02 0.39 0.03 0.20 0.11 0.05 0.60 0.05 0.16 0.16 0.74 0.03 0.50 0.03 0.11 0.10 0.02 0.44 0.08 0.19 0.27 0.44 0.01 0.19 0.01 0.12 0.10 0.01 0.43 0.05 0.19 0.32 0.56 0.00 0.25 0.02 0.14 0.14 0.02 0.48 0.06 0.19 0.26 0.59 0.01 0.32 0.02 0.13 0.13 n.a. 0.32 0.03 0.31 0.92 0.52 0.65 52 Table 8: Balancing Properties of Matching for CDiDHR with respect to Participation in Training during 1994-1999 Variable Age 25–34 Age 35–44 Age 45–50 Dessau Halberstadt Halle Magdeburg Merseburg Sangerhausen Stendal Wittenberg Unskilled, semi–skilled and other education Skilled Worker Craftsman Technical College University Education Female Female unskilled worker Female skilled worker Female Craftsman Female and technical college Female and university education Employment probability before (potential) participation Means of Variable in Subgroups All Nonpar– Parti– Matched Nonpar– Parti– Matched ticipants cipants Nonpart. ticipants cipants Nonpart. averaged over prev– averaged over prev– uncond. iously nonemployed iously employed 0.37 0.29 0.29 0.30 0.41 0.50 0.40 0.40 0.38 0.42 0.40 0.39 0.40 0.39 0.23 0.33 0.29 0.30 0.21 0.10 0.22 0.12 0.13 0.13 0.13 0.12 0.10 0.12 0.09 0.07 0.12 0.07 0.09 0.08 0.08 0.19 0.15 0.20 0.16 0.20 0.17 0.19 0.24 0.23 0.21 0.22 0.24 0.28 0.25 0.13 0.16 0.13 0.16 0.14 0.12 0.14 0.10 0.11 0.11 0.13 0.11 0.15 0.11 0.08 0.09 0.08 0.08 0.07 0.04 0.07 0.05 0.06 0.03 0.05 0.04 0.06 0.04 0.02 0.43 0.08 0.19 0.27 0.48 0.01 0.21 0.01 0.13 0.11 0.06 0.50 0.06 0.18 0.19 0.60 0.04 0.33 0.02 0.12 0.09 0.02 0.58 0.06 0.12 0.22 0.58 0.01 0.39 0.03 0.07 0.07 0.06 0.58 0.06 0.15 0.16 0.68 0.04 0.45 0.03 0.10 0.08 0.02 0.45 0.08 0.19 0.27 0.44 0.01 0.19 0.01 0.12 0.10 0.01 0.46 0.08 0.21 0.23 0.42 0.00 0.21 0.01 0.13 0.08 0.03 0.51 0.08 0.17 0.22 0.49 0.02 0.27 0.02 0.10 0.09 n.a. 0.16 0.02 0.03 0.67 0.30 0.29 53 Table 9: Earnings Effects of TR 1997 to 1999 by Time since Beginning of Treatment (Matching on Long–run Preprogram Employment History) – Monthly Net Earnings as Outcome Variable Years since beginning of treatment (t1) 0 < t1 ≤ 1 1 < t1 ≤ 2 2 < t1 ≤ 3 3 < t1 ≤ 4 4 < t1 ≤ 5 5 < t1 ≤ 6 6 < t1 ≤ 7 7 < t1 ≤ 8 8 < t1 ≤ 9 TT Estimatea (Standard Error) No. of casesb -831 -784 -438 -139 -526 -243 -332 -230 379 (174) (167) (153) (150) (92) (108) (128) (127) (366) 49 52 56 62 123 136 129 91 14 The estimates are pooled across the years 1997 to 1999. Earnings are set to zero when nonemployed. a: Average earnings differences in matched sample by year since beginning of treatment. b: Number of cases in matched sample. Figure 1: Unemployment and Labor Market Policies in East Germany, in 1000’s∗ ∗ Note: after 2002 data for West-Berlin is included. Source: Institute (2003), Bundesanstalt für Arbeit (2002, 2005), own calculations. 54 Figure 2: Entries into Training in East Germany, Annual Totals ∗ In 1990, training programs took place only in October, November, and December. Following the 1998 reform, training can no longer be subdivided into the categories Further Training and Re-Training. Source: Bundesanstalt für Arbeit (1993, 1997, 2001, 2003), own calculations 30 400 20 200 10 Average participant stocks (1000) 600 0 0 1990* 1992 1994 Participant stocks 1996 1998 2000 Costs per participants-year (1000 €) Figure 3: Participation Stocks in Training and Expenditure per Participant / Year, Annual Average 2002 Expenditure (€ per participants-year) ∗ For 1990 no annual stock can be calculated. Source: Bundesanstalt für Arbeit (1993, 1997, 2001, 2003), own calculations 55 Figure 4: Labor Force Status by Treatment Status Labour force status of non-participants, 1990-99 Percenatges of all valid cases 100% 80% 60% Out of labour force Public Job Creation Program 40% Public sector sponsored training Unemployed Working PT 20% Working FT Dez 99 Jun 99 Dez 98 Jun 98 Dez 97 Jun 97 Dez 96 Jun 96 Dez 95 Jun 95 Dez 94 Jun 94 Dez 93 Jun 93 Dez 92 Jun 92 Dez 91 Jun 91 Dez 90 Jun 90 0% Calendar time Labour force status of participants, 1990-99 80% 60% Out of labour force Public Job Creation Program 40% Public sector sponsored training Unemployed Working PT 20% Working FT Calendar time 56 Dez 99 Jun 99 Dez 98 Jun 98 Dez 97 Jun 97 Dez 96 Jun 96 Dez 95 Jun 95 Dez 94 Jun 94 Dez 93 Jun 93 Dez 92 Jun 92 Dez 91 Jun 91 Dez 90 0% Jun 90 Percenatges of all valid cases 100% Figure 5: Mobility Pattern for Training Period 1990–1993 57 Figure 6: Mobility Pattern for Training Period 1994–1999 58 Figure 7: Differences in the Outcome Variable in the Matched Sample for CDiDS Estimator with Matching on Nonemployment in Month before Start of Training – Employment Probability as Outcome 59 Figure 8: Differences in the Outcome Variable in the Matched Sample for CDiDHR Estimator with Matching on Nonemployment in Month before Start of Training – Probability of Exiting Nonemployment as Outcome 60 Figure 9: Differences in the Outcome Variable in the Matched Sample for CDiDHR Estimator with Matching on Nonemployment in Month before Start of Training – Probability of Remaining Employed as Outcome 61 Figure 10: Employment Effects of TR (Matching on Long–run Preprogram Employment History) - Employment Probability as Outcome 62 Figure 11: Employment Effects of TR (CDiDS with Matching on Nonemployment in Month before Start of Training) - Employment Probability as Outcome 63 Figure 12: Employment Effects of TR (CDiDHR with Matching on Nonemployment in Month before Start of Training) – Probability of Exiting Nonemployment as Outcome 64 Figure 13: Employment Effects of TR (CDiDHR with Matching on Nonemployment in Month before Start of Training) – Probability of Remaining Employed as Outcome 65 Discussion on Recall Error in LMM–SA Retrospective data, which in our case covers at least 8 years, entails the danger of recall errors. In the following, we will argue that recall errors are less problematic in our analysis than is typically the case with retrospective data. First of all, note that the individuals were asked about their employment history starting with the year 1990. This year constitutes a turning point in the biography of East Germans, as the political and economic system changed dramatically. The connection of biographic events with historic events, as done here, typically improves the validity of recall data (Loftus/Marburger, 1983, Robinson, 1986). Additionally, starting with the salient year 1990 the individuals had to answer in chronological order, which is now commonly viewed as the best technique in collecting life history data in a single survey (Sudman/Bradburn, 1987). Second, our broad definition of employment states circumvents some of the recall errors which are present when analyzing more than two labor market states. It helps especially to merge the states unemployment and out of the labor force. For instance, after some time in unemployment, women tend to label this as having been out of the labor force (Dex/McCulloch, 1998). Third, our evaluation design (CDiDHR estimator) allows for recall errors occurring in the same fashion among treatment and matched comparison group. In particular, if both groups forget to mention transitions in a similar way then the errors simply cancel out. Thus, recall errors in our analysis might only increase the standard errors of our estimates. However, if we were estimating individual labor market flows, recall errors would be more worrying (Paull, 2002) and it might be useful to change the methodological approach (e.g. following Magnac/Visser, 1999). References Dex, S. and A. McCulloch (1998). “The Reliability of Retrospective Unemployment History Data.” Work, Employment and Society 12:497–509. Loftus, E.F. and W. Marburger (1983). “Since the Eruption of Mt. St. Helens, Has Anyone Beaten You Up? Improving the Accuracy of Retrospective Reports with Landmark Events.” Memory and Cognition 54:330–345. Magnac, T. and M. Visser (1999). “Transition Models with Measurement Errors.” Review of Economics and Statistics 81:466-474. Paull, G. (2002). “Biases in the Reporting of Labor Market Dynamics.” The Institute for Fiscal Studies Working Paper 02/10. London: The Institute for Fiscal Studies. Robinson, J.A. (1986). “Temporal Reference Systems and Autobiographical Memory.” In: D.C. Rubin (ed.), Autobiographical Memory, Cambridge: University Press, 159– 188. Sudman, S. and N.M. Bradburn (1987). “Effects of Time and Memory Factors on Response in Surveys.” Journal of the American Statistical Association 64:805–815. 66
© Copyright 2025 Paperzz