A Structural Model of Tax Progressivity and Self-Employment Jean-Francois Wen and Daniel V. Gordon Department of Economics University of Calgary Calgary, Alberta T2N 1N4 PRELIMINARY March 8, 2010 Abstract We extend the theoretical model of Rees and Shah (1986) to incorporate the e¤ects of progressive taxation, as arguments in the individual’s utility choice model between self-employment and paid employment. Measures of the degree of progressivity in the tax and transfer system are constructed from the concept of tax convexity (Gentry and Hubbard, 2000, 2004). The earnings equations are corrected for self-selection bias and are combined with the Canadian Tax and Credit Simulator (Milligan, 2007) to calculate tax di¤erentials. The structural model includes both the predicted net-of-tax earnings and the tax convexity variables in the self-employment/employed labor choice decision. The model is estimated using data spanning 19992005 from Statistics Canada’s Survey of Labour and Income Dynamics. The principal …ndings are that upside tax convexity has a negative e¤ect on the probability of self-employment, while downside tax convexity has a positive e¤ect on self-employment, both with high levels of statistical signi…cance. We use the model to estimate the e¤ects of federal tax rate reductions in 2001 on the rate of self-employment in Canada. 1 Introduction Entrepreneurship is regarded as vital for generating employment and innovation. It is, however, a risky occupation with high potential returns and high failure rates. Since the income tax system alters the relationship between risk and reward, its design may be an important determinant of entrepreurial activity. The impact of a proportional income tax with loss o¤set provisions is theoretically to increase risk-taking by risk averse individuals (Domar and Musgrave, 1944). Similarly, a revenue compensated increase in the rate of a linear progressive income tax can raise the share of the workforce engaged in entrepreneurship.1 A plausible view, however, is that progressivity achieved with a rising marginal tax rate schedule is likely to discourage entrepreneurship because the higher tax rates penalize success by more than the lower tax rates provide relief against poor returns. One way to examine this hypothesis empirically is to estimate the e¤ects of the tax and transfer system on the occupational choice between self-employment and employed labor. Self-employment is frequently used as a proxy for entrepreneurship in empirical studies. Hence, we develop a structural model of earnings and tax di¤erentials in a random utility model of a discrete occupational choice. Of particular interest are the “tax convexity” variables measuring the asymmetric …scal treatment of upside and downside income risks as a result of tax progressivity (Gentry and Hubbard, 2000, 2004). The idea is that uncertainty about prospects obliges entrepreneurs to consider income scenarios spanning di¤erent segments of the tax schedule. This is a potentially important interaction between tax policy and entrepreneurship. The degree of tax convexity is a¤ected not only by marginal income tax rates and thresholds, but also by payroll taxes, tax credits and their clawbacks, loss o¤set provisions, and numerous other tax and transfer considerations.2 1 Increasing linear progressivity positively impacts entrepreneurship, innovation, and growth in Garcia-Penalosa and Wen (2008). In Kanbur (1981) the e¤ect on entrepreneurship is ambiguous due to the general equilibrium e¤ects on …rm sizes and wages. 2 Blanch‡ower and Oswald (1998) tabulate the answers to a pertinent question asked in the 1987 National Survey of the Self-Employed in the United Kingdom. 139 individuals who said they were “seriously intending”to become self-employed in the next few months 1 Tax convexity is measured by Gentry and Hubbard as the spread in the e¤ective marginal (or average) tax rates for successful and unsuccessful selfemployment outcomes, de…ned relative to the ability of each individual. It is clear why high tax rates in the event of success may discourage entrepreneurial risk-taking. It is less evident why low tax rates in the case of failure should discourage self-employment. Gentry and Hubbard provide two illustrations of why both the upside and downside components of tax convexity may be negatively correlated with self-employment rates. First, the value of loss o¤sets, in the event of failure, increases with the tax rate. Hence, an entrepreneur with negative pro…ts can bene…t from a high tax rate in the lowest bracket. Second, since the alternative to self-employment is to work for wages, a comparatively high tax rate in the bracket encompassing wage earnings reduces the opportunity cost of self-employment, and most wage earners face marginal tax rates that are below the top rate. In both of these examples, small tax rates in the middle to lower tax brackets discourage self-employment. Although these examples show the potential relevance of Gentry and Hubbard’s tax convexity variable as a negative correlate of self-employment rates, the approach depends on particular con…gurations of self-employment and wage employment earnings. Loss o¤setting, for example, is not relevant to an entrepreneur earning low but positive pro…ts. A more straightforward way to assess the impact of tax progressivity is to calculate the expected value of the tax liability of an entrepreneur facing a distribution of possible returns and to compare this burden with the same individual’s predicted tax liability. If high marginal tax rates on successful entrepreneurs penalize success more than low marginal tax rates provide insurance against failure, then the expected tax liability will exceed the predicted tax liability and self-employment is discouraged. This idea is implemented in our empirical work using proxies for the potential income were asked to select one of 20 possible answers to the question, “What was your biggest concern with becoming self-employed?” Interestingly, the answers “Understanding tax” and “No guaranteed income” were tied at 14.4 percent as the second mostly frequently cited responses, after “Where to get …nance” at 20.1 percent. This suggests to us that individuals are aware that the decision to become self-employed has potentially important tax implications and that income risk is also a factor. 2 shocks. We also generalize the approach by measuring tax convexity separately for upside and downside income realizations relative to the predicted self-employment income. We expect upside tax convexity to discourage risktaking via the penalty on success, and downside tax convexity to encourage risk-taking via the social insurance e¤ect arising from lower marginal tax rates and income support programs.3 The overall e¤ect of tax progressivity on self-employment rates is the sum of the upside and downside tax convexity e¤ects. We calibrate the person-speci…c upside and downside income realizations (relative to the individual’s expected income) using both an ad hoc method and a more principled procedure that uses the the predicted residuals from the econometric earnings equations to determine the spread of the upside and downside income shocks. Our model is estimated using a longitudinal panel dataset with 157,310 observations for Canada spanning the available years 1999 to 2005. The Canadian Tax and Credit Simulator (CTaCS) developed by Milligan (2007) is used to calculate the total federal and provincial tax and transfer implications of the employment/self-employment choice for each individual in our data. The tax parameters include income taxes, payroll taxes, and money transfers and their clawbacks. Previous work on the e¤ects of tax convexity on occupational choice has focused exclusively on the United States. The income tax system in Canada di¤ers in important respects from the U.S. tax system. The personal and corporate income tax systems are integrated in Canada for incorporated businesses earning up to $300,000. This is achieved with a personal dividend tax credit that o¤sets corporate taxes.4 In contrast, in the U.S. the tax structure provides an incentive to shift labor income to corporate income when income realizations are large.5 There exists a $500,000 lifetime capital gains exemption on small business shares in Canada, which has no equivalent in the U.S. Marginal tax rates are higher in Canada and the rates apply to individual taxpayers, rather than to married 3 In contrast, Gentry and Hubbard (2000) hypothesize that downside tax convexity should discourage entrepreneurship, for reasons discussed above. 4 The integration arises only if the corporation pays dividends. 5 See Cullen and Gordon (2007) for the impact of this tax feature on self-employment. 3 couples as in the U.S. These and other comparative features of the Canadian tax system provide an alternative …scal setting to explore the e¤ects of tax progressivity on self-employment. A noteable feature of our sample period is the major tax reform in Canada in 2001. Marginal income tax rates were reduced for the middle and bottom income brackets and the surtax was eliminated. The corporate tax rate on small business income between $200,000 and $300,000 earned by a Canadian-controlled private corporation was cut from 28 to 21 percent (phased in over …ve years) and the capital gains inclusion rate for individuals and corporations was cut from three-quarters to one-half. These reforms decreased the degree of upside tax convexity at the median self-employment income in Canada and the federal government asserted that the new measures would promote entrepreneurship and growth. Cross-sectional studies of self-employment generally take the form of either reduced-form or structural models.6 Reduced-form models specify the self-employment choice as a function of variables thought to determine labor market outcomes, such as education, labor market experience, age, job stability, capital, occupational status, unemployment, location, marital status, spouse’s educational attainment, spouse’s employment status, the number of children and health, and marginal or average tax rates. This is the method used by Long (1982), Schuetze (2000), Gentry and Hubbard (2000), and Hansson (2006) to examine income tax e¤ects. A less common approach utilizes structural models derived formally from the self-employment decision as a function of the earnings di¤erential between self-employment and wage work.7 Only the studies by Bruce (2000), Parker (2003), and Fossen (2007) estimate structural models containing earnings equations and net-of-tax income di¤erentials in place of gross earnings di¤erentials as determinants of self-employment. These studies do not, however, include tax convexity as a determinant of self-employment. As shown by Gentry and Hubbard 6 Le (1999) provides a useful survey of studies on self-employment. Early examples of this approach can be found in Rees and Shah (1986) for the UK labour market, de Wit and van Winden (1989) for the Dutch labour market and Bernhardt (1994) for the Canadian labor market. 7 4 (2000), the omission of tax convexity considerations may be important for understanding how taxation a¤ects occupational choice. In this paper, we extend the theoretical model of Rees and Shah (1986) to include the e¤ects of tax progressivity. We then estimate a reduced-form equation in order to control for sample-selection bias in the estimated earnings equations and we use the corrected predicted earnings together with CTaCS to obtain net-of-tax income di¤erentials between self-employment and paid employment, and to construct tax convexity measures. These are used as explanatory variables in the structural probit equation for selfemployment.8 We …nd that upside tax convexity is negatively related to the rate of self-employment and, conversely, downside tax convexity is positively related to self-employment, both with high statistical signi…cance. Hence, the overall impact of tax progressivity depends on the balance between these e¤ects. These results corroborate some of the …nding of Gentry and Hubbard (2000) for the U.S., which is one of the few studies …nding a negative correlation between income tax rates and self-employment.9 Relatedly, Stabile (2004) and Folster (2002) …nd that self-employment increased after changes in legislation which exempted self-employment from certain payroll taxes. Most of the earlier studies conclude, on the contrary, that higher tax rates increase the level of self-employment, which is attributed to the opportunities provided by self-employment for tax avoidance or tax evasion (e.g. Long, 1982, Blau, 1987, Parker, 1996, Bruce, 2000, and Schuetze, 2000). The paper is organized as follows. Section 2 describes the random utility model of a discrete occupational choice between self-employment and salaried work. Section 3 describes the data and speci…es the econometric equations, the identi…cation techniques and self-selection issues. Section 4 presents the empirical results. Section 5 illustrates the usefulness of the estimated model by using it to analyze the 2001 federal income tax reform in 8 We focus on the rate of self-employment, as in Rees and Shah (1986), rather than on the transitions between wage work and self-employment, in contrast with Gentry and Hubbard (2000), Fossen (2007), Hansson (2006) and Bruce (2000). 9 Other exceptions include Folster (2002), Bruce and Mohsin (2006), Hansson (2006), and Fossen (2007). 5 Canada. Section 6 concludes. 2 A Model of Occupational Choice A simple model is used to illustrate the theoretical e¤ects of taxation on occupational choice between self-employment (S) and employment for …xed pay (E). Denote individual i’s before-tax income in occupation j 2 fS; Eg by yji and after-tax income by xij . Represent income tax progressivity using the function (Musgrave and Thin, 1948, and Benabou, 2000): xij = yji where 1 yb (1) and yb are parameters of the tax system. At the income level yb the taxpayer’s tax liability is zero. The term 1 is the elasticity of after-tax income with respect to before-tax income; hence, can be interpreted as the semi-elasticity of the average tax rate with before-tax income.10 The larger is the greater is the marginal progressivity of the income tax. The expression “tax convexity” conveys the notion that the slope of a graph of the average tax rate against the logarithm of income is positive, if the marginal tax schedule is rising, and in practice the graph may be upward bending.11 Utility is assumed to depend on after-tax income (consumption) and on an hedonic index Qij of non-pecuniary job characteristics associated with occupations S and E. The index is composed as follows: Qij = X i n qnj (2) n 10 In familiar notation, write 1 = d ln x d ln y which in turn equals d ln[y(1 s)] d ln y, where s is the individual’s average tax rate (i.e. the share of income paid in taxes). Express this as 1 1 + d ln(1 s) d ln y and use the approximation ln(1 s) s for small s to obtain ds d ln y as required. 11 We follow Gentry and Hubbard’s use of the expression “tax convexity” even though this may be somewhat confusing since tax progressivity need not imply convexity in the average or marginal tax schedules; rather, more progressivity implies a steeper tax schedule. 6 where qnj is the measure of a non-pecuniary job characteristic in occupation j and i n is its utility weight. As in Rees and Shah (1986), let the utility function be bi uij = U (xij ; Qij ) = xij exp di Qij (3) where di and bi are preference parameters; risk aversion is decreasing with bi and non-pecuniary considerations increase with di . Before-tax income is assumed to be lognormally distributed: yji i j; LN ( i j ): Substituting (1) for xij into (3) gives uij = yji (1 )bi yb bi exp di Qij : (4) An individual i chooses the occupation j 2 fS; Eg that maximizes expected utility E(uij ). Since yji is lognormal, yji is E h yji (1 )bi i (1 )bi is also lognormal and its expected value )bi = exp (1 i j + 1 (1 2 )bi i 2 j (Aitchison and Brown, 1966, Theorem 2.1). Thus maximizing expected utility in (4) is equivalent to maximizing the following utility index: Vji (Qij ; y ij ; cij ) = (1 where y ij 1 (1 2 ) ln y ij E(yji ) = exp i j + 12 ( di Qij ; (5) bi q is the mean of yji and cij = exp( ij )2 )(1 bi (1 i )2 j is the coe¢ cient of variation. )) ln 1 + (cij )2 + Expected utility is higher with self-employment compared with paid employment if C i VSi (QiS ; y iS ; ciS ) VEi (QiE ; y iE ; ciE ) 7 0. Using (5), C i 0 1 is equivalent to (1 ) ln The term (1 y iS y iE 1 (1 2 bi (1 )) ln 1 + (ciS )2 (1 + (ciE )2 + di QiS bi QiE 0: (6) ) appears twice on the left-hand side of (6). First, its appear- ance as a multiplicative factor indicates that greater tax progressivity (higher ) reduces the relative importance of …nancial di¤erences, both reward and risk, in determining occupational choice. The second appearance of the tax term is less familiar. To better understand this e¤ect, suppose individuals are risk-neutral, so bi = 1. In that case, the term, reduces to ln 1 + (ciS )2 1 + (ciE )2 : (1 bi (1 )) ln ( ) (7) We would normally expect ciS > ciE , as self-employment is loosely associated with entrepreneurial risk-taking.12 Assuming this to be the case, (7) shows another distinct way in which tax progressivity discourages self-employment. This e¤ects arises because income realizations above a self-employed individual’s expected income increase the tax liability by more than income realizations of similar magnitudes, but below the mean, decrease the tax liability. In the econometric model, proxies are constructed to represent the term (7) for each individual, based on the predicted changes in tax liabilities associated with self-employment income shocks above and below the expected income, referring to these as taxconvexU and taxconvexD, respectively. The expected signs of the regression coe¢ cients are negative for taxconvexU and positive for taxconvexD. The term (1 ) ln y iS =y iE in (6) can be interpreted as the percentage change in expected after-tax income that results from switching occupations from employment to self-employment. In the econometric model this variable is called NetIncdif. Hence, we would expect the estimated coe¢ cient on NetIncdif to be positive. We do not observe the individual’s degree of risk aversion, which is de12 In our data (over the whole sample) the coe¢ cient of variation is 0.946 for selfemployment and 0.562 for paid employment. 8 termined by bi ; the coe¢ cient of variation of i’s income distribution in occupation j; and the non-pecuniary aspects embodied in the hedonic indexes and di . The unobservable characteristics will be proxied with observable variables believed to a¤ect occupational choice. Finally, note that self-employment and paid employment determine an individual’s labor earnings, but the total tax liability depends also on investment income, alimony payments, and so on. In other words, total income is yji = eij + ai ; (8) where eij is labor earnings of i in occupation j, and ai is i’s non-labor income. We take ai as exogenous when calculating an individual’s tax liability but we predict eij with econometric equations. 3 Data and Econometric Issues The empirical work uses Statistics Canada’s Survey of labor and Income Dynamics (SLID) public use Microdata. SLID is a longitudinal study designed to capture changes in the economic well-being of individuals and families over time and the determinants of labor market income. In a given year, the sample is composed of two panels. A panel is surveyed for six consecutive years and a new panel is introduced every three years. Thus two panels are always overlapping. Each panel includes roughly 15,000 households. Households are interviewed up to twelve times over a six-year period. Every January interviewers collect information regarding respondents’labor market experiences during the previous calendar year. In May, information on income is collected from the same households.13 Interviews are conducted over the telephone with the assistance of a computer to minimize errors. 13 These interviews are conducted in May after households have completed their tax returns. Households have the option of granting Statistics Canada permission to use their T1 tax form information rather than spend the time detailing this information to Statistics Canada. Over 80% of households choose this option. Scripts were read from the computer screen and data was entered into the database. The computer noti…ed the interviewer of discrepancies so that they could be remedied before the interview was completed, reducing measurement error. 9 The sample frame for the survey covers all individuals in Canada.14 The data used in this research covers the …ve year period 1999-2005 for a total of 157,310 observations of which 9,805 are de…ned as self-employed. The key variable of interest in our analysis is employment status. We de…ne persons to be self-employed if they report their major source of earnings to be from self-employed work and they also report no farm income. Person are de…ned to be employed if they report their major source of earnings to be from wages and salary and they also report no farm income. Thus, we construct a binary dependent variable equal to one if a person is self-employed and equal to zero when a person is employed as de…ned above.15 The SLID Person File is used to download all variables associated with the individual and this is then linked to the SLID Economic Family File to collect household information. The variables obtained from SLID and used in the empirical work are de…ned in Table 1. The …rst column in Table 1 lists the variables as named in SLID and the second column de…nes the variables for use in this study. The main earnings variable (earng42) measures salaries, wages and self-employed income. Two variables that characterize the …nancial status of individuals and are assumed to impact the selfemployment choice are investment income (inva27) and taxable capital gains (capgn27).16 We specify a number of control variables describing individual characteristics believed to a¤ect labor market outcomes such as age, gender, education, health, marital status, parenthood, and immigration status. To account for labor market conditions we control for unemployment spells (wksuem28) and industry of employment (white collar, blue collar, and service). Finally, we control for geographic di¤erences by including a set of provincial dummy variables and where appropriate yearly dummy variables to account for possible distributional shifts each year. 14 Residents of the Yukon, the Northwest Territories and Nunavut, as well as, residents of institutions and persons living on native reservations, which constitute less than 3% of the population, are not included. 15 For the data used here, the yearly percentage self-employed in Canada is: 6.18, 5.66, 5.17, 6.55, 7.08, 5.95 and 6.42 for the years 1999-20005, respectively. 16 Blanch‡ower and Oswald, A. (1998) …nd evidence that …nancial constraints are an important deterrent to self-employment. 10 Table 2 shows summary statistics by employment classi…cation for the combined seven years. The data show an interesting statistical picture of the self-employed versus employed. On average the self-employed have lower earnings and a greater variation in earnings compared to the employed. This is consistent with the general notion that self-employed work is more risky. As well, the self-employed show both higher average levels of investment income and capital gains, are slightly less educated, moderately older and more likely to be married and have children. We need to estimate the earnings functions for employed and self-employed workers. In general, the expected earnings equation can be written as, ln earnings j = Z j + "j (9) for j = employed (E) and self-employed (S). Z is a vector of variables common to both equations and de…nes those variables thought to in‡uence earnings. A problem with a least squares estimation of equation (9) is the possibility of omitted variable bias caused by self-selection into employed/selfemployed classi…cations. We follow the standard Heckman procedure to correct for omitted variable bias by estimating, for each year, a reduced form probit speci…ed over the choice between employed and self-employed. From these equations the inverse Mills ratios are calculated for the self-employed and employed individuals and included in the appropriate earnings equation.17 The corrected earnings function is …rst estimated using data only for employed individuals and then used to generate predicted employed earnings for each individual in the data set, both self-employed and employed. This operation is repeated by estimating the earnings equation using only self-employed data and then predicting expected self-employed earnings for each individual in the data set. What we obtain from this exercise is the predicted earnings for an individual if that person had chosen self-employment or alternatively paid employment. Next the predicted earnings for self17 Under the assumption of normality of the error structure the inverse Mills variable is the ratio of the normal density (') to the cumulative normal density ( ). For the self-employed the inverse Mills ratio is de…ned as '( )= ( ) and for employed labor it is de…ned as '( )= (1 ( )) (Lee, 1978). 11 employment and paid employment for each individual, together with their …nancial and personal characteristics, are used in the CTaCS simulator to generate tax liabilities from each occupational choice. Finally, the predicted income (i.e. predicted earnings plus the non-labor income reported in the data) and tax liabilities from each occupation are used to calculate the logarithmic net-of-tax income di¤erence (NetIncdif) between self-employment and paid employment. We are particularly interested in measuring the impact of tax progressivity— i.e. the convexity of the tax system— on the decision for self-employment. We measure the responses of individuals to the portion of the tax schedule applicable to income greater than their expected self-employment income, and to the portion of the tax schedule applicable to incomes below their expected income from self-employment. The …rst approach to constructing such variables follows one of the methods used by Gentry and Hubbard (2000) (hereafter identi…ed as GH), whereby we double the predicted self-employment earnings and use the CTaCS simulator to generate “upside” tax liabilities. We use this to construct upside tax convexity as the log ratio of upside tax liabilities to the tax liabilities occuring when selfemployment earnings are at their predicted level, de…ning this variable as (taxconvexUGH ). The measure of “downside” tax convexity is calculated as the log ratio of tax liabilities with predicted self-employment income to tax liabilities with half the predicted self-employment income, de…ning this variable as (taxconvexDGH ).18 The second approach to tax convexity relies on the data to provide a more empirically justi…ed measure of tax response (hereafter identi…ed as PR). We do this by recovering the non-robust predicted residuals from the self-employed earnings function applied to the complete sample (both employed and self-employed individuals). In other words, for each individual we have the predicted residual conditional on self-employed income. This 18 In addition to this ad hoc approach to calculating tax convexity, Gentry and Hubbard (2000) also use a statistically-based procedure, whereby the income shocks are based on the instability of self-employment earnings observed in the data. They …nd similar results in each case. 12 error is doubled to represent two standard deviations above the mean and added to individual predicted self-employed earnings and the corresponding tax liabilities are computed with CTaCS. We then calculate the log ratio of tax liabilities at the “upside” error-adjusted self-employed earnings to the tax liabilities at predicted self-employed earnings, de…ning this variable taxconvexUPR . The corresponding “downside”shocks to self-employed earnings are calculated by subtracting two standard deviations of the predicted error from individual predicted self-employed earnings. Tax liabilities are then generated with the CTaCS simulator and we calculate the log ratio of tax liabilities with actual self-employed earnings to the tax liabilities from downside error-adjusted self-employed earnings, de…ning this variable taxconvexDPR .19 Using these generated variables the structural probit equation for occupational choice (C) is written as, C = Z +X + I NetIncdif + TU taxconvexU i + TD taxconvexD i + +"C (10) where i = GH or P R. The vector Z is as de…ned earlier. The vector X includes investment income, taxable capital gains, and the number of children in the household. "C is assumed to be iid and asymptotically normally distributed. Additional dummy variables are added to equation (10) to account for provincial and yearly changes. The provincial dummies control for differences in economic development across provinces. The yearly dummies account for distributional changes across years and for the stock market decline in the technology sector in 2001. 19 We also tested a speci…cation of tax convexity in which we do not take the logarithm of the tax liabilities ratio and we found qualitatively similar results to the logarithmic form. We prefer the latter formulation because it has a natural interpretation as the percentage change in tax liability as a result of an income shock. 13 4 Empirical Results This section presents the results obtained from estimating the three econometric models described above: the reduced form probit and the self-employed and employed earnings functions for each year, and the structural probit for the combined seven years of data.20 The results of the maximum likelihood estimation for the reduced form probit for 1999 to 2005 are reported in Table 3. Keep in mind that the purpose of the reduced form probit is to allow measurement of the inverse Mills ratios that allow correction for the omitted variable problem in the earnings equations for self-employed and employed. Consequently, to justify the reduced form equations it is necessary that they be econometrically identi…ed and the estimation procedure must produce consistent estimates. For identi…cation we maintain the assumption of normality of the population error terms. We default to the large number of observations used in estimation and the central limit theorem to support this assumption. This assumption is consistent with standard work in this area. However, normality is not enough to support identi…cation without at least one identifying variable in the reduced form estimation relative to the earnings equation. The reduced form equations have three identifying variables: inva27 (investment income), capgn27 (capital gains) and kids (number of children in the household). These variables proxy for risk in the self-employed decision but are assumed not to have a determining impact on earnings. The investment variable is statistically important at less than the 10% level in six of the seven years, the capital gains variable is statistically unimportant in all years but 1999,21 and the number of kids variable is signi…cant at standard p-values in all but one year (2001).22 For parameter consistency we argue that all right-hand-side variables in 20 The strategy for econometric model building, speci…cation and validation was based on data for 1999. The models were then applied directly to each additional year. 21 Although statistically insigni…cant the capital gains variable is not dropped from the model, beecause it serves a purpose as a control variable to reduce the variance. 22 Rees and Shah (1986) make a similar argument for identi…cation using a ‘child’variable for identi…cation. However, this variable is statistically insigni…cant in their estimation and brings into question the identi…cation of their reduced form equation. 14 the reduced form probits maintain the restriction that the expected value of the error term conditional on right-hand-side variables is zero. In other words, the right-hand-side variables can be treated as exogenous variables in estimation. As our regressors represent individual characteristics, past choices in occupation and geographical location, this assumption appears to be reasonable. Therefore, we conclude that in each year the probit equation is well speci…ed, identi…ed and econometrically reasonable, and we proceed to calculate the inverse Mills ratios for each earnings equation for each year. The maximum likelihood robust estimates for the self-employed earnings equation are reported in Table 4 and for the employed earnings equation in Table 5.23 The base reference individual is a non-immigrant, white collar, single female with no kids, in good health, and resident in Ontario. Comparing Tables 4 and 5 we observe a better …t to the data for the employed earnings equation compared to the self-employed earnings equation. For each year the estimated employed earnings equation shows almost all coef…cients with p-values of 1% or less. On the other hand, the self-employed equations report p-values for 15 out of 77 coe¢ cients greater than 10%. This likely re‡ects the risk involved in this type of activity, the heterogeneity of the self-employed and the number of observations used in estimating the di¤erent earnings equations; approximately 1,300 observations per year for self-employed compared to approximately 22,000 per year for employed. Nevertheless, over 80% of the coe¢ cients in the self-employed earnings equations are statistically accurate at less than the 10% level and, what is more, a null hypothesis that all coe¢ cients except the intercept term are zero can be easily rejected. In support of the estimated results it is also worth noting that except for one occasion there are no sign changes in coe¢ cients across the di¤erent years for both self-employed and employed equations. The exception is the age-squared variable in 2005 for the self-employed. The estimated earnings coe¢ cients for the employed tell a consistent 23 The error structure in the earnings equations are corrected for possible heteroscedasticity but as described by Amemiya (1984) this does not account for possible heteroscedasticity introduced by the fact that the inverse Mills ratios are generated variables. However, this problem does not cause inconsistency in the parameter estimates and consistency is required here. 15 story with the literature. Earnings increase with years of schooling and with the male and married indicators, are less for immigrants and (although not reported) Ontario o¤ers the highest earnings levels relative to the other provinces. Note that blue collar and service workers show negative earnings relative to white collar workers. The results for the self-employed are similar except for two interesting cases. It is true that the male/female ratio among the self employed is higher than for the employed but in terms of earnings, males show no dominance in earning potential relative to females. (The male dummy is statistically signi…cant in only three of the seven years at less than the 5% level.) This contrasts sharply with the earnings equation for the employed; here male dominance in earning potential is strikingly clear for each year in the data. The second case is the married indicator, which is strongly positively correlated with earnings in the employed equation for all years. On the other hand, for the self-employed the married indicators are negatively correlated with earnings for each year in the data set and statistically signi…cant at the 5% level in six of the seven years. It is also interesting to compare the inverse Mills ratio for each equation. The results show a very strong positive bias for those individuals who have chosen either self-employment or employed labor.24 Importantly, a Wald test of independence between the reduced form probit and the wage equation is soundly rejected for each year for each set of equations. We interpret these results to indicate that self-selection is important in employment classi…cation and that the correction for omitted variable is necessary for consistent results in the earnings equations. The next challenge is to model and estimate the structural probit equation for employment classi…cation. We estimate two forms of the structural probit. The …rst is based on the GH tax convexity speci…cation and the second is based on the PR tax convexity speci…cation. The estimated earnings equations are used to predict the earnings for each individual for each employment classi…cation and then combined with personal and …nancial information, and entered in the CTaCS simulator to generate the individual tax implications of each scenario. From the predicted earnings and corre24 Note the inverse Mills ratio for the self-employed is negative; see footnote 17. 16 sponding tax liabilities for each individual the net income di¤erence from employment classi…cation is calculated and the log transform of this ratio is de…ned as NetIncdif. We expect a positive coe¢ cient for this variable in the structural probit indicating larger predicted net income di¤erences in self-employment increase the probability of self-employment. The upside tax convexity variable is expected to be negative in the structural probit and indicates that as tax progressivity in the upper tax brackets increases the probability of self-employment decreases. The downside convex variable is expected to have a positive coe¢ cient in the structural probit re‡ecting lower taxes paid as a percentage of lower earnings, as a result of tax progressivity in the lower tax brackets. Provincial and yearly dummies are also added to the structural equation. The results of the structural model are reported in Table 6. Columns 2 and 3 report estimated coe¢ cients and p-values, respectively for the structural probit using the GH tax convexity formulation and columns 4 and 5 report the corresponding estimated coe¢ cients and p-values, respectively for the structural probit using the PR tax convexity formulation. Over, 150,000 observations are used in estimation for each structural equation. The reference person is a single non-immigrant female with no kids in good health working in the service industry in Ontario in 1999. For both consistency and e¢ ciency in estimating the parameter values, the maximum likelihood estimator is augmented by bootstrapping the standard errors. One thousand replications are used in the bootstrap results. The bootstrap is necessary because of the generated nature of the NetIncdif and tax convexity variables. For both the GH and PR equations, we observe a strong positive relationship of the probability of self-employment with investment income (inva27), age (ecage26), and the indicator variables health, male, married, immigrant, and blue collar (blucol) and service occupations. In contrast, the number of weeks unemployed (wksuem28) is negatively correlated with the probability of self-employment. Interestingly, years of education (yrschl18) o¤ers no strong incentive for self-employment in the GH equation but is negatively correlated in the PR equation. Children in the household (kids) has an important positive impact in the GH equation but no impact in the PR 17 equation. For the provincial dummies, only British Columbia (bc) shows a statistically positive coe¢ cient relative to Ontario and indicates that bc has a larger representation of the population self-employed. In fact, summary statistics show British Columbia with the highest rate of self-employment participation with 7.04% compared to Ontario with 5.45%. The 2001 year dummy clearly picks up the substantial decline in the probability of self-employment with stock market problems in that year. Perhaps the most interesting results are for the net income and tax convexity variables reported near the bottom of Table 6. We observe for both the GH and PR speci…cations that the net income and tax convexity variables are consistent with sign expectations. The coe¢ cients for NetIncdif and taxconvexD are positive with small p-values.25 The taxconvexU coe¢ cients are negative and again with very small p-values. It is interesting to note that for either the GH or PR speci…cation the magnitude of the coe¢ cient on the upside tax convexity variable is considerably large than on the downside. These results suggest overall that tax progressivity in‡uences the occupational choice between self-employment and paid employment in ways consistent with the theory. In further empirical work described below, we report only the PR speci…cation of the structural probit. Measures of robustness in the structural regression results are provided in Tables 7 and 8. In Table 7, the structural probit equation based on the PR speci…cation is taken to the yearly data and the table reports coe¢ cient results only for the net income and tax convexity variables. Only the taxconvexU coe¢ cient maintains the expected sign for each year of the data although it is statistically zero in year 2003. In addition, the NetIncdif and taxconvexD coe¢ cients measure large p-values in a number of years. Note particularly year 2003 which reports no statistical signi…cance across all three coe¢ cients even though the data set contains over 25,000 observations 25 The …nding that net income is an important determinant of the selfemployment/employment choice stands in sharp contrast to the statistical results of Parker (2003), who concludes from examining U.K. data, that the choice of self-employment is unrelated to pecuniary considerations. 18 for that year. Table 8 reports the same net income and tax coe¢ cients except for this table we run the model on yearly cumulative data totals. Note the size of the observations listed at the bottom of each row. This table shows that all coe¢ cients over cumulative totals have the expected signs. Now all coe¢ cients in each cumulative period reported in the table have reasonable p-values except NetIncdif in cumulative years 1999-00, 1999-01 and 1999-02. Based on the empirical results, we argue that the full structural probit model is econometrically reasonable and we proceed to evaluate probability simulations for di¤erent hypothetical policy scenarios. 5 Policy Application Finally, we use the model to calculate the change in the probability of selfemployment, resulting from the personal income tax reforms of the federal government in Canada announced in 2000 and implemented in 2001, and we compare this to a counterfactual case of no tax reform.26 Speci…cally, we compare the probability of self-employment under the 1999 and 2001 tax schedules. The application serves, …rst, to give a sense of the magnitudes of the self-employment probabilities implied by the estimated coe¢ cients of the model, and, second, to show how changes in the di¤erent tax brackets a¤ect self-employment via Netincdif and the upside and downside tax convexity variables. The analysis yields fresh insights on how to design tax policy to encourage self-employment. Table 9 describes the federal marginal personal income tax schedules in 1999 and 2001. In addition, tax brackets became fully indexed in 2001 (after no changes in the tax brackets during the previous …ve years) and the federal surtax was eliminated. 26 We ignore changes in the corporation income tax to focus on personal income taxes. The proportion of incorporated self-employed individuals in our data is small. 19 Table 9: Federal Personal Income Tax Schedules in 1999 and 2001 1999 2001 Tax bracket Tax rate Tax bracket Tax rate y 17% y 16% $29; 590 < y < $59; 180 26% $30; 755 < y < $61; 510 22% $59; 180 29% $61; 510 26% 29% y y $29; 590 y < $100; 000 $100; 000 $30; 755 y < $100; 000 $100; 000 29% It will be useful to decompose the move from the 1999 tax system to the 2001 tax system into three steps, indicated in Table 10. In “Tax Scenario 1” the only change is the tax rate in the …rst bracket, reduced from 17 percent in 1999 to 16 percent in 2001, holding the other parameters of the tax system constant at the 1999 levels. “Tax Scenario 2” changes only the second tax bracket, from a tax rate of 26 percent in 1999 to 22 percent in 2001, again holding the other tax parameters at their 1999 levels. In “Tax Scenario 3” only the third marginal tax rate is changed, from 29 percent to 26 percent on income between about $60,000 to $100,000, the other tax parameters remaining at the 1999 levels. This decomposition of the tax reform enables us to better understand the e¤ects arising from changes in dii¤erent parts of the tax schedule. Table 10: Scenarios o¤ Federal Personal Income Tax Schedules Tax bracket Scenario 1 Scenario 2 Scenario 3 y 16% 17% 17% $29; 590 < y < $59; 180 26% 22% 26% $59; 180 29% 29% 26% 29% 29% 29% y $29; 590 y < $100; 000 $100; 000 We simulate the average probability of self-employment for two broad classes of individuals (with sample sizes of about 1,000). The …rst is workers who are male, white collar, non-immigrant, in good health, married, and with all other variables set at the 2001 levels in Ontario; the average predicted self-employment income for the group is $63,639. The second group has the same characteristics as the …rst, except that the occupations are blue collar; their average predicted income for this group is $48,803. Table 11 re20 ports the simulated values of the average probability of self-employment for the white collar group in 1999, 2001, and for each of the three Tax Scenarios. According to the simulated model, the 2001 tax reforms would increase self-employment rates among the designated white collar group by 1.1 percent. Most of the increase stems from Tax Scenario 2, where the second marginal tax rate is changed by four percentage points. Table 12 shows the simulated values for the blue collar group, where the average probability of self-employment rises by 0.6 percent. In each case the e¤ect of the tax reform on self-employment rates is very modest. This relatively small impact of the tax reform on self-employment is in part due to the fact that the changes in the marginal income tax rates are not particularly large, amounting to less than $1,000 for most individuals; it would be surprising if such changes in tax liabilities would have a large e¤ect on occupational choices. However, part of the reason for the modest change can be ascertained from examining closely the impact of the tax reforms on the NetIncdif, taxconvexU and taxconvexD variables in the model. Hence, Table 13 provides a simulation of the probit equation for a single individual, who is male, blue collar, non-immigrant, healthy, married, and has all other characteristics at 2001 levels in Ontario, His whose predicted self-employment income is $58,861 and his residual from the income equation, used in the calculation of taxconvexU, is set at approximately the average for indnividuals with his characteristics, i.e. $10,000. The table shows the values of the key tax variables in 1999, 2001, and in the case of Tax Scenario 3; the table also shows the marginal coe¢ cients for the three tax variables. The marginal coe¢ cient multiplied by the change in the associated tax variable gives the contribution of the tax variable on the self-employment probability. Looking …rst at Tax Scenario 3 we see that the cut in the third marginal tax rate from 29 percent to 26 percent reduces upside tax convexity, as expected, which gives a boost to self-employment. However, we also see that the gap between after-tax self-employment income and after-tax employment income widens— i.e. NetIncdi¤ becomes more negative— which mitigates against self-employment. This is a consequence of the fact that predicted (and actual) self-employment income is systematically less than predicted employment income for most individuals. 21 Thus middle income tax cuts tend to discourage self-employment insofar as wage earners usually bene…t by more than self-employed workers do. Overall, for the individual simulated in Table 13, the 2001 tax reform reduced his likelihood of self-employment from 8.533 percen tin 1999 to 8.492 percent in 2001. Of course, the tax reforms could enhance entrepreneurship in other ways besides occupational choice, such as increased investment in capital, or more hours spent working. What lessons can be learned from the simulations of the model? Tax reforms designed to increase average rates of self-employment are unlikely to have much impact, given that the tax schedule creates counter-balancing incentives for self-employment via the three tax variables, NetIncdi¤, taxconvexU and taxconvexD. The reform with the most impact on average self-employment rates is a cut in the marginal tax rate faced by the average self-employed income, while not cutting the marginal rate faced by wage workers, whose incomes tend to be somewhat higher than self-employed incomes. This explains the relative success of Tax Scenario 2 in promoting self-employment. More generally, however, tax reforms may be better aimed at targeting the low frequency, but potentially highly innovative, upper income earners, via cuts in the top marginal tax rate, rather than attempting to raise the overall rate of self-employment. 6 Conclusions This paper makes two basic contributions. First, we develop a theoretical utility choice model that de…nes the causal variables of choice, including tax progressivity, in the decision to become self-employed. Second, the theoretical model guides the speci…cation of a structural probit model of the choice of self-employment. The structural model requires generated variables de…ning earnings and tax liabilities for each individual, in each employment classi…cation. We do this using earnings equations for both the self-employed and employed. These equations are corrected for sample selection bias and used to predict earnings for each individual if they had chosen either employment or self-employment. Tax implications of the predicted earnings are 22 calculated using the CTaCS tax simulator. From this information we derive the net-of-tax income di¤erences from the employment classi…cations and tax convexity variables measuring tax progressivity. These generated variables are included in the structural probit model of self-employment. The results show that expectations of higher net-of-tax income di¤erences between self-employment and employment increase the probability of choosing self-employment. Our major …nding is that tax progressivity matters in the self-employment choice via the tax convexity variables. In particular, we …nd strong statistical support for the conclusion that the “penalty on success” via progressive marginal tax rates in the upper tax brackets is a deterrent to self-employment, while the “social insurance” e¤ect of progressivity in the lower brackets is an inducement to self-employment. The usefulness of a structural model for predicting changes in self-employment rates for hypothetical tax reforms is illustrated with an example of a major tax reform implemented in Canada. The simulation suggests that tax reforms are unlikely to increase self-employment rates much, unless they manage to lower the marginal tax rate faced by self-employed workers, while not reducing the marginal tax rate faced by most wage earners, given that paid employment is on average more lucrative than self-employment for any given person. This e¤ect corresponds roughly to the reduction in Canada in 2001 of the tax rate applicable to incomes between about $30,000 and $60,000 from 26 percent to 22 percent. A useful extension to the model would be to allow for endogenous hours of work in paid- and self-employment, which is information that is available in the SLID data. In this way, taxation can impact self-employment not only via the extensive margin, but also through the intensive margin. 23 References [1] Aitchison and J.A.C. Brown (1966). The Lognormal Distribution with Special Reference to its Use in Economics (Cambridge: Cambridge University Press). [2] Benabou, R. (2000) Unequal societies: income distribution and the social contract. American Economic Review 90, 96 - 129. [3] Bernhardt, I. (1994) Comparative advantages in self-employment and paid work. Canadian Journal of Economics, 27, 273 - 289. [4] Blanch‡ower, D.G. and Oswald, A. (1998) What makes a young entrepreneur? Journal of labor Economics, 16, 26 - 60. [5] Blau, D. (1987) A Time-Series Analysis of Self-Employment in the United States. Journal of Political Economy, 95, 445 - 467. [6] Bruce, D. (2000) E¤ects of the United States tax system on transition into self-employment. Labour Economics, 7, 545 - 574. [7] Bruce, D. and M. Moshin (2006) Tax Policy and Entrepreneurship: New Time Series Evidence. Small Business Economics, 26, 409 - 425. [8] Clemens, C. and M. Heinemann (2006) On the E¤ects of Redistribution on Growth and Entrepreneurial Risk-taking. Journal of Economics, Springer, vol. 88, 131-158. [9] Cullen, J. and R. Gordon (2007) Taxes and Entrepreneurial RiskTaking: Theory and Evidence for the US. Journal of Public Economics, 91, 1479 - 1505. [10] De Wit, G. and van Winden, F.A.A.M. (1989) An empirical analysis of self-employment in the Netherlands. Small Business Economics, 1, 263 - 272. [11] Domar, E. and R. Musgrave (1944) E¤ects of Proportional Taxes on Risk-Taking. Quarterly Journal of Economics, 58, 388 - 422. 24 [12] Evans, D.S. and Leighton, L.S. (1989) Some empirical aspects of entrepreneurship. American Economic Review, 79, 519 - 535. [13] Fossen, F.M. (2007) Risky Earnings, Taxation and Entrepreneurial Choice - A Microeconometric Model for Germany. Mimeo. [14] Garcia-Penalosa, C. and J.-F. Wen (2008) Redistribution and entrepreneurship with Schumpeterian Growth. Journal of Economic Growth 13, 57-80. [15] Gentry, W. M. and R. G. Hubbard (2000) Tax Policy and Entrepreneurial Entry. American Economic Review Papers and Proceedings, 90, 283 - 287. [16] Gentry, W.M. and R.G. Hubbard (2004) ‘Success Taxes’, Entrepreneurial Entry, and Innovation. NBER Working Paper No. W10551. [17] Hansson, A. (2006) Income Taxes and the Probability to Become SelfEmployed: The Case of Sweden. Mimeo. [18] Heckman, J. J. (1979) Sample Selection Bias as Speci…cation Error. Econometrica, 47, 153 - 161. [19] Kamhi, N. and D. Leung (2005) Recent Developments in SelfEmployment in Canada. Bank of Canada Working Paper 2005-8. [20] Kanbur, S.M. (1981) Risk Taking and Taxation: An Alternative Perspective. Journal of Public Economics 15: 163–84. [21] Le, A.T. (1999) Empirical studies of self-employment. Journal of Economic Surveys, 13, 381 - 416. [22] Lee, Lung-Fei (1978) Unionism and Wage Rates in a Simultaneous Equations Model with Qualitative and Limited Dependent Variables. International Economic Review, 19, 415–33. [23] Long, J.E. (1982) The income tax and self-employment. National Tax Journal, 35, 31 - 42. 25 [24] Milligan, K. (2007) Canadian Tax and Credit Simulator. Database, software, and documentation, Version 2008 - 1. [25] Moore, R.L. (1983) Self-employment and the incidence of the payroll tax. National Tax Journal, 36, 491 - 501. [26] Moore, C.S. and Mueller R.E. (2002) The transition from paid to selfemployment in Canada: the importance of push factors. Applied Economics, 34, 791 - 801. [27] Musgrave, R. and T. Thin (1948) Income tax progression 1929-48. Journal of Political Economy 56, 498 - 514. [28] Parker, S. (2003) Does Tax Evasion A¤ect Occupational Choice? Oxford Bulletin of Economics and Statistics 65: 379–394. [29] Rees, H. and Shah, A. (1986) An empirical analysis of self-employment in the UK. Journal of Applied Econometrics, 1, 95 - 108. [30] Robson, M.T. and C. Wren (1999) Marginal and Average Tax Rates and the Incentive for Self-Employment. Southern Economic Journal 65, 757 - 773. [31] Schuetze, H. J. (2000) Taxes, economic conditions and recent trends in male self-employment: a Canada-US comparison. Labor Economics, 7, 507 - 544. [32] Stabile, M. (2004) Payroll Taxes and the Decision to be Self-Employed. International Tax and Public Finance, 11, 31 - 53. 26 Table 1: Variable Definitions Variable Self-employed earng42 inva27 capgn27 yrschl18 ecage26 age2 health male married kids immigrant wksuem28 blucol whtcol service newfoundland pei novascotia newbrunswick quebec ontario manitoba saskatchewan alberta bc Definition Takes value 1 if major source of income from self-employed earnings and no farm earnings. Takes value 0 if major source of income from wages and salaries and no farm earnings. Salaries, wages and self employed earnings Investment income Capital gains Years of schooling Age Age squared Takes value 1 if disabled 0 otherwise Takes value 1 if male 0 otherwise Takes value 1 if married 0 otherwise Number of children in household Takes value 1 if immigrant 0 otherwise Weeks unemployed Blue collar employee White collar employee Service employee Newfoundland Prince Edward Island Nova Scotia New Brunswick Quebec Ontario Manitoba Saskatchewan Alberta British Columbia Table 2: Summary Statistics: 1999-2005 earng42 inva27 capgn27 yrschl18 ecage26 Health Male Married Kids immigrant wksuem28 whtcol Blucol Service Obs. Self-Employed Mean Std. Dev. 32875.43 42570.13 1562.67 7112.22 696.99 6722.84 13.59 3.36 44.06 10.22 0.16 0.36 0.63 0.48 0.79 0.41 0.96 1.10 0.08 0.28 0.39 3.01 0.45 0.49 0.31 0.46 0.23 0.42 9,805 Employed Mean 38043.48 962.65 432.35 13.74 39.78 0.14 0.54 0.68 0.92 0.06 1.13 0.51 0.27 0.21 147,505 Std. Dev. 26288.21 5192.81 5342.68 2.97 11.29 0.34 0.49 0.47 1.05 0.24 4.76 0.49 0.44 0.41 Table 3: Maximum Likelihood Estimates of Reduced Form Probit by Year a) 1999 2000 2001 2002 2003 2004 2005 0.005 (0.002)b) 0.003 (0.103) -0.005 (0.347) 0.488 (0.000) -0.039 (0.001) -0.029 (0.000) 0.015 (0.715) 0.173 (0.000) 0.150 (0.000) 0.038 (0.000) -0.022 (0.661) 0.067 (0.068) 0.196 (0.000) -3.059 (0.000) 0.004 (0.067) 0.002 (0.208) -0.002 (0.641) 0.516 (0.000) -0.039 (0.001) -0.015 (0.001) -0.015 (0.718) 0.212 (0.000) 0.122 ()0.001 0.039 (0.006) -0.002 (0.977) 0.032 (0.404) 0.201 (0.000) -3.23 (0.000) 0.004 (0.051) 0.000 (0.949) -0.001 (0.791) 0.579 (0.000) -0.049 (0.000) -0.01 (0.001) 0.005 (0.901) 0.214 (0.000) 0.139 (0.000) 0.013 (0.387) -0.002 (0.971) 0.077 (0.057) 0.265 (0.000) -3.398 (0.000) 0.008 (0.000) -0.000 (0.946 ) 0.004 (0.469) 0.422 (0.000) -0.033 (0.003) -0.030 (0.000) 0.003 (0.940) 0.201 (0.000) 0.154 (0.000) 0.038 (0.006) 0.102 (0.040) 0.067 (0.067) 0.222 (0.000) -3.015 (0.000) 0.005 (0.000) 0.002 (0.319) 0.009 (0.027) 0.427 (0.000) -0.032 (0.001) -0.012 (0.001) -0.009 (0.755) 0.166 (0.000) 0.138 (0.000) 0.030 (0.011) 0.068 (0.117) 0.116 (0.000) 0.201 (0.000) -3.039 (0.000) 0.003 (0.225) 0.002 (0.199) 0.000 (0.973) 0.282 (0.003) -0.018 (0.109) -0.024 (0.000) -0.041 (0.300) 0.123 (0.000) 0.125 (0.001) 0.027 (0.079) 0.169 (0.002) 0.103 (0.008) 0.117 (0.003) -2.596 (0.000) 0.004 (0.038) 0.000 (0.936) 0.008 (0.119) 0.163 (0.060) -0.002 (0.837) -0.017 (0.001) 0.033 (0.340) 0.162 (0.000) 0.113 (0.001) 0.032 (0.029) 0.089 (0.078) 0.123 (0.001) 0.119 (0.002) -2.549 (0.000) SE c) 1,479 1,267 1,118 1,466 1,989 1,203 E 22,472 21,136 20,506 20,923 26,087 19,008 a) Provincial dummy variables are included in each equation but not reported. b) p-value in parentheses with robust standard errors. c) SE is Self-employed and E is employed observations 1,415 20,587 inva27 capgn27 yrschl18 ecage26 age2 wksuem28 health male married kids immigrant blucol service constant Obs. Table 4: Maximum Likelihood Estimates of Heckman Wage Equation by Year a): Self-Employed 1999 2000 2001 2002 2003 2004 2005 0.068 (0.000)b) -0.245 (0.298) 0.011 (0.692) 0.039 (0.001) -0.259 (0.008) 0.245 (0.001) -0.292 (0.001) -0.077 (0.512) -0.398 (0.000) -0.815 (0.000) 6.781 (0.000) 0.079 (0.000) -0.691 (0.003) 0.052 (0.056) 0.025 (0.006) -0.179 (0.094) 0.049 (0.536) -0.251 (0.008) -0.146 (0.28) -0.343 (0.000) -0.898 (0.000) 8.099 (0.000) 0.075 (0.000) -0.642 (0.015) 0.053 (0.080) 0.029 (0.012) -0.189 (0.077) 0.123 (0.163) -0.127 (0.156) -0.239 (0.055) -0.392 (0.000) -0.918 (0.000) 7.387 (0.000) 0.063 (0.000) -0.908 (0.000) 0.081 (0.002) 0.041 (0.001) -0.160 (0.056) 0.021 (0.786) -0.158 (0.057) -0.316 (0.004) -0.444 (0.000) -0.942 (0.000) 8.406 (0.000) 0.065 (0.000) -0.657 (0.000) 0.057 (0.007) 0.009 (0.259) -0.095 (0.202) 0.119 (0.064) -0.146 (0.030) -0.320 (0.001) -0.519 (0.000) -0.986 (0.000) 7.362 (0.000) 0.051 (0.000) -0.235 (0.271) 0.011 (0.659) 0.037 (0.001) -0.125 (0.210) 0.199 (0.012) -0.187 (0.029) -0.537 (0.000) -0.399 (0.000) -0.751 (0.000) 6.698 (0.000) 0.046 (0.000) -0.025 (0.900) -0.014 (0.537) 0.017 (0.102) -0.238 (0.003) 0.158 (0.029) -0.227 (0.003) -0.549 (0.000) -0.396 (0.000) -0.612 (0.000) 6.536 (0.000) 0.000 0.000 0.000 0.000 0.000 0.000 1,479 1,267 1,118 1,466 1,989 1,203 -2.192 -2.584 -3.331 -2.022 -2.664 -3.507 (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) rho testd) 401.05 520.35 252.76 372.72 404.18 242.05 (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) iterationse) 8 7 12 5 9 7 a) Provincial dummy variables are included in each equation but not reported. b) p-values in parentheses with robust standard errors. c) p-value on a Wald test that all coefficients except the constant are equal to zero d) Wald test of independence between the reduced form probit and wage equation. e) Number of iterations until convergence. 0.000 1,412 -1.875 (0.010) 288.46 (0.000) 5 yrschl18 ecage26 age2 wksuem28 health male married immigrant blucol service constant null testc) Obs. Mills Table 5: Maximum Likelihood Estimates of Heckman Wage Equation by Year a): Employed 1999 2000 2001 2002 2003 2004 2005 0.044 (0.000)b) 0.982 (0.000) -0.103 (0.000) -0.031 (0.000) -0.073 (0.000) 0.422 (0.000) 0.068 (0.000) -0.087 (0.000) -0.130 (0.000) -0.449 (0.000) 0.570 (0.000) 0.047 (0.000) 1.009 (0.000) -0.106 (0.000) -0.028 (0.000) -0.068 (0.000) 0.427 (0.000) 0.067 (0.000) -0.059 (0.000) -0.125 (0.000) -0.466 (0.000) 0.498 (0.000) 0.049 (0.000) 0.968 (0.000) -0.102 (0.000) -0.034 (0.000) -0.086 (0.000) 0.414 (0.000) 0.074 (0.000) -0.062 (0.001) -0.119 (0.000) -0.455 (0.000) 0.592 (0.000) 0.047 (0.000) 0.948 (0.000) -0.099 (0.000) -0.036 (0.000) -0.056 (0.000) 0.427 (0.000) 0.0542 (0.000) -0.074 (0.000) -0.149 (0.000) -0.458 (0.000) 0.701 (0.000) 0.049 (0.000) 1.006 (0.000) -0.105 (0.000) -0.030 (0.000) -0.079 (0.000) 0.413 (0.000) 0.052 (0.000) -0.128 (0.000) -0.159 (0.000) -0.474 (0.000) 0.576 (0.000) 0.051 (0.000) 1.025 (0.000) -0.108 (0.000) -0.035 (0.000) -0.094 (0.000) 0.404 (0.000) 0.057 (0.000) -0.111 (0.000) -0.110 (0.000) -0.452 (0.000) 0.523 (0.000) 0.053 (0.000) 0.931 (0.000) -0.097 (0.000) -0.032 (0.000) -0.075 (0.000) 0.399 (0.000) 0.048 (0.000) -0.09 (0.000) -0.128 (0.000) -0.469 (0.000) 0.725 (0.000) 0.000 0.000 0.000 0.000 0.000 0.000 22,472 21,136 20,506 20,923 26,087 19,008 1.410 1.642 1.813 1.274 1.333 1.349 (0.000) (0.000) (0.000) (0.000) (0.000) (0.001) rho testd) 42.75 44.25 30.39 33.34 41.19 32.80 (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) 6 9 6 6 6 6 iterationse) a) Provincial dummy variables are included in each equation but not reported. b) p-values in parentheses with robust standard errors. c) p-value on a Wald test that all coefficients except the constant are equal to zero. d) Wald test of independence between the reduced form probit and wage equation. e) Number of iterations until convergence. 0.000 20,587 2.237 (0.000) 42.89 (0.000) 7 yrschl18 ecage26 age2 wksuem28 health male married immigrant blucol service constant null testc) Obs. Mills Table 6 Maximum Likelihood Estimation a) of Structural Probit: 1999-2005 Variable Coefficient GHb) p-value Coefficient PR c) inva27 4.15E-06 0.000 2.11E-06 capgn27 1.10E-06 0.143 1.37E-07 yrschl18 0.001 0.683 -0.013 ecage26 0.049 0.000 0.053 age2 -0.039 0.000 -0.04 health 0.041 0.006 0.116 male 0.177 0.000 0.098 married 0.146 0.000 0.161 kids 0.025 0.000 0.003 immigrant 0.069 0.001 0.125 wksuem28 -0.022 0.000 -0.027 blucol 0.091 0.000 0.148 service 0.198 0.000 0.315 newfoundland -0.144 0.000 -0.168 pei 0.028 0.378 -0.005 novascotia -0.069 0.003 -0.064 newbrunswick -0.085 0.001 -0.027 quebec -0.042 0.007 -0.078 manitoba -0.028 0.220 -0.024 saskatchewan -0.001 0.993 -0.038 alberta -0.031 0.108 -0.041 bc 0.086 0.000 0.107 D00 -0.041 0.040 -0.056 D01 -0.085 0.000 -0.111 D02 0.019 0.336 -0.028 D03 0.059 0.001 0.001 D04 -0.020 0.331 -0.042 D05 0.003 0.896 -0.056 NetIncdif 0.049 0.039 0.043 taxconvexUi d) -0.184 0.000 -1.029 taxconvexDi e) 0.043 0.000 0.076 Constant -2.969 0.000 -2.662 Observations 150,797 151,572 a) Bootstrapped standard errors (1,000 replications) b) Gentry and Hubbard (2001) (GH) procedure for tax convexity c) Predicted residual (PR) procedure for tax convexity d) taxconvexUi for i either GH (column 2) or PR (column 4) e) taxconvexDi for i either GH (column 2) or PR (column 4) p-value 0.011 0.864 0.000 0.000 0.000 0.000 0.000 0.000 0.583 0.000 0.000 0.000 0.000 0.000 0.879 0.007 0.267 0.000 0.276 0.088 0.035 0.000 0.006 0.000 0.161 0.947 0.041 0.005 0.038 0.000 0.008 0.000 Table 7: Robustness Check for Structural Probit Net Income and Tax Convex Variables by Year 1999 2000 2001 2002 2003 2004 2005 0.375 (0.002)a) -1.732 (0.000) 0.128 (0.024) 0.379 (0.006) -1.546 (0.000) -0.126 (0.094) 0.654 (0.000) -1.775 (0.000) 0.116 (0.107) 0.468 (0.001) -1.997 (0.000) 0.170 (0.014) 0.048 (0.200) -0.0004 (0.996) 0.014 (0.679) -0.059 (0.701) -1.408 (0.048) -0.088 (0.718) -0.005 (0.972) -1.193 (0.000) -0.006 (0.961) Obs. 21,753 21,263 20,446 a) p-values based on robust Standard Errors 21,460 25,407 19,203 21,265 NetIncdif taxconvexUPR taxconvexDPR Table 8: Robustness Check for Structural Probit Net Income and Tax Convex Variables Cumulative Data 1999 1999-00 1999-01 1999-02 1999-03 1999-04 1999-05 0.375 (0.002)a) -1.732 (0.000) 0.128 (0.024) 0.045 (0.434) -1.560 (0.000) 0.129 (0.005) 0.072 (0.166) -1.597 (0.000) 0.127 (0.001) 0.012 (0.774) -1.65 (0.000) 0.138 (0.000) 0.104 (0.000) -0.961 (0.000) 0.086 (0.000) 0.084 (0.000) -1.009 (0.000) 0.074 (0.015) 0.043 (0.045) -1.029 (0.000) 0.076 (0.011) Obs. 21,753 43,016 63,462 p-values based on robust Standard Errors 84,922 110,329 129,532 150,797 NetIncdif taxconvexUPR taxconvexDPR a) Table 11: Average Probability of Self-Employment for White Collar 1999 2001 Tax Scenario 1 Tax Scenario 2 Tax Scenario 3 7.061 7.138 7.055 7.159 7.062 Table 12: Average Probability of Self-Employment for Blue Collar 1999 2001 Tax Scenario 1 Tax Scenario 2 Tax Scenario 3 7.421 7.465 7.445 7.460 7.421
© Copyright 2025 Paperzz