The effect of financial goals and incentives on labor. An experimental test in an online market∗† David B. Johnson and Justin Weinhardt PRELIMINARY DRAFT October 2, 2014 Abstract Empirical studies investigating work motivation over time find people with fluctuating wages work more on days when their wage rate is lower compared to when wages are higher. The authors of these studies theorize individuals use daily income goals and stop working once they reach their goal. This study involves assignment and manipulation of financial goals in an online labor market that is nearly frictionless. Workers can quit at any time and can start jobs posted by competing employers almost instantly. Results, with pooled data, indicate financial goals do not lead to workers stopping work once they reached their goals and there is no significant wage related crowding out. However, when we separate the sample by western and non-western workers, we find production by western workers is greatest in treatments with goals and low incentives. This effect is absent in non-western workers. JEL classification: Keywords: Experiment; Online; Goal ∗ David Blake Johnson: University of Calgary, Department of Economics, 2500 University Dr NW, Suite 554 , Social Sciences, Calgary, Alberta, Canada T2N 1N4. [email protected]. Justin Weinhardt: University of Calgary, Department of Human Resources and Organizational Dynamics, Scurfield Hall 442, Calgary, Alberta, Canada T2N 1N4. [email protected] † We are grateful to valuable comments provided by Tim Salmon, Alexi Thompson, Gary Fournier, and Rob Oxoby. All mistakes are our own. 1 Introduction Several field studies investigate the influence of fluctuating wages on labor. Many of these studies find productivity does not increase with wages (e.g., Laisney et al., 1996). Instead, individuals work longer when the wage rate is low (Camerer et al., 1997; Chou, 2002; Lynn, 2002; Fehr and Goette, 2007).1 This set of findings is inconsistent with traditional theories of labor supply suggesting individuals work longer when their wage rate is high (e.g., Lucas Jr and Rapping, 1969) or, in other words, when there is a greater “extrinsic” work motivation. On the other hand, goals introduced by employers, experimenters, and individuals (Austin and Vancouver, 1996; Wu et al., 2008; GómezMiñambres et al., 2012; Sun and Weinhardt, 2014) enhance production and even in cases where the goals have no impact on the workers earnings.2 Such responses are thought to relate to “intrinsic” motivation and, when taken together, these findings suggest a “crowding out” effect of high wages. In economics and psychology, the crowding out theory posits tangible benefits and costs (e.g., monetary rewards and punishments or any motivation that is from outside the individual) can reduce an individual’s intrinsic motivation (e.g., task enjoyment or any motivation from within the individual). Several models explaining the observed crowding out phenomenon have been introduced with many piggybacking off of Kahneman and Tversky (1979).3 For instance Falk and Knell (2004) present a model where individuals set goals based upon the performance of their peers. Doing so enhances individual utility through self-enhancement and self-improvement.4 Falk and Knell (2004) go further and present survey evidence confirming the validity of their model. Wu et al. (2008) extend this line of research by presenting an elegant theory explaining the underlying mechanisms behind the performance gains. Put simply, gains relative to the goal (i.e., the reference point) enhance utility while losses (i.e., deviations from the goal) are costly. Our work most directly extends that of Gómez-Miñambres et al. (2012) and Goerg and Kube (2013). Gómez-Miñambres et al. (2012) builds upon Wu et al. (2008) by introducing exogenously assigned goals (e.g., goals introduced by an employer). Such a model has similar properties to that of Wu et al. (2008) which lead to goals increasing productivity. Gómez-Miñambres et al. (2012) provide experimental evidence in support of their model. In treatments with goals that have no direct impact on subjects’ monetary earnings, productivity significantly increases. Moreover, and quite surprising, Gómez-Miñambres et al. (2012) find no evidence of extrinsic crowding out, as the wage rate increases production further. This is not a trivial result and is in stark contrast to the “crowding out” hypothesis. Goerg and Kube (2013) furthers this line of research with a real effort experiment finding the use of personal goals increases worker production. This occurs if the goals are self-assigned or if selected by a principal.5 Moreover, much like previous work, these goals are effective even in instances where they are not tied to monetary earnings. Although similar to both Gómez-Miñambres et al. (2012) and Goerg and Kube (2013) our work furthers current understanding of goals. Instead of an environment where subjects know they are in an experiment, we take advantage of a venue in which it is left ambiguous as to whether or not subjects are in an experiment. Moreover, we use a task purposefully selected to be natural for the venue. We find little evidence of a wage effect seen in Gómez-Miñambres et al. (2012). Subjects 1 And to some extent in Lynn (2002). In this case, the author observes a negative correlation between tipping and turnover in low volume restaurants but not in high volume restaurants. 2 For a review see Locke and Latham (2002). 3 A general discussion of models is outside the scope of the current work but interested parties should see Kőszegi and Rabin (2006), Koch and Nafziger (2011), Gómez-Miñambres (2012) and Hsiaw (2013). 4 Self-enhancement increases the individual’s utility by making them feel better about themselves. On the other hand, self-improvement increases utility by giving the individual something to strive for which increases their utility by way of increasing their performance. 5 There is a caveat here in that the size of the goal matters quite a bit as easily obtained goals lead to the worker being less productive. 1 in treatments with a high wage do not significantly increase their productivity. Further, we find only weak evidence supporting a goal hypothesis and this effect is only present with workers from developed western nations. Our work does not contradict previous work but rather illustrates a need for a more nuanced view of goals. Specifically, workers in our experiment have the opportunity to switch employers or quit to attend to a personal task. Moreover, workers may do so at any time. In other words, there is little cost to quit. This is quite novel and most relevant in workplaces where employees act more like their own bosses. In such cases, because of the lack of a concrete work schedule and/or a “boss”, it is therefore less surprising that goals have a relatively weak effect. Moreover, a larger take away of the present work is how the importance of behavioral aspects change as the labor market approaches a neoclassical frictionless labor market with homogenous workers. Ex-post this might seem quite obvious; more complicated markets contain greater frictions which can alter behavior. Remove these frictions and behavior neatly follows many implications of neoclassical models. 2 Present Study Ideally, to test the effects of varying wage rates on work hours, one needs a job where wages vary across some temporal period (e.g., days) but relatively constant within a period. Further, individuals have a choice regarding how long they work and these variables (i.e., rate of pay and amount of time worked) must be observable. One job where the above conditions are met is New York City cab drivers (Camerer et al., 1997). Specifically, cab drivers wages stay relatively constant throughout the day, but fluctuate between days, presumably because of changes in the weather. Moreover, most NYC cab drivers are able to determine how long they work and the cab companies track this information. Camerer et al. (1997) find across three samples that wages and time worked are negatively correlated. Follow-up studies by Chou (2002) and Fehr and Goette (2007) discover similar effects for taxi drivers in Singapore and bike messengers in Zurich Switzerland. To account for their empirical results, as well as those seen in later studies, Camerer et al. (1997) posits individuals work in reference to an income goal and that, once reached, causes individuals to stop working for the day. Essentially, because individuals reach their goal relatively quickly on high wage days they work less. Yet, before accepting the goal-based account as a universal, a rigorous test with a more independent work environment and across cultures seems prudent. To investigate the effect of time-varying wage rates on labor supply in a market with fewer frictions and across cultures, we designed an online experiment allowing us to manipulate goals and wages while monitoring worker productivity. Specifically, to test the goal mechanism directly we assign a monetary goal to a random set of participants (Locke and Latham, 1990). If wages correlate negatively with productivity for individuals in the monetary goal conditions, it suggests the influential nature of monetary goals (Helson, 1964; Kahneman and Tversky, 1979). To contrast the goal condition, we include a control condition where individuals are told to do-your-best. We select this control because research demonstrates specific goals lead to better performance than vague do-yourbest goals (Locke and Latham, 1990). An additional advantage of the present study is that we are able to run the same experiment across dramatically different populations. Previous work is not designed to investigate how identical incentive schemes can have different effects across populations. In world where the outsourcing of interchangeable labor is becoming increasingly the norm, such an investigation is a worthy extension. 2 3 Methods The experiment takes place on Amazon Mechanical Turk (AMT). AMT is an online labor market where requesters pay workers to complete human intelligence tasks (HITs). Total compensation HITs depend on a flat fee (generally between 5 and 50 cents), and a bonus individually assigned by the requester.6 Following the critiques presented by Levitt and List (2007) workers are not told they are in an experiment until after they complete it.7 Workers are required to correctly answer English comprehension questions prior to the start of the experiment. Workers failing one or more of the English comprehension questions are not eligible to continue.8 The answers to these questions are randomly generated and serve as the treatment assignment mechanism. This random assignment allows all treatments to be posted at the same time, preventing autocorrelation in the treatments. The English comprehension questions are interspersed within the initial survey.9 After completing the survey, workers complete 5 practice tasks and begin the experiment. 3.1 Task and Treatments The task we use is particularly well suited for the market as it is a task that is difficult for a computer to do and would reasonably be needed for a firm trying to keep digital copies of “strange” documents. Workers transcribe pieces of an instruction manual from a 1996 Oldsmobile Cutlass (essentially a series of CAPTCHAs). Each of these pieces is a jpeg file so workers cannot copy and paste in the answer. After five transcriptions, paying 2 cents per transcription, workers are paid a piece rate of two or five American cents for each transcription they complete. There is no ambiguity in the currency; workers know they are being paid in American currency because they are paid through Paypal. Additionally, competing HITs advertise wages in American currency (always to our knowledge in cents). Connection speed should not matter. The HIT was coded in HTML which is essentially static.10 Workers complete as many transcriptions as they want (with the maximum being 100) and can quit anytime. Workers are only required to complete the first five transcriptions. The maximum number of transcriptions is capped at 100 mostly for ease. Subjects have computers of varying speed and we wanted to make sure load times would be reasonable even on mediocre machines with weak connectivity.11 Figure 1: Example Task As discussed above, before seeing the treatment screen, workers complete five practice transcriptions paying 2 cents each. The amount paid for practice questions is the same for all treatments. In the treatment screen, workers view 1 of 2 texts assigning them an unenforced goal. In“Do Your Best” treatments (DYB), which serve as a control, workers view the text “Working for us your goal is to 6 For an in-depth discussion of the bonus mechanism see Cooper and Johnson (2013). course workers have the option of requesting their data to not be used; this is truthfully respected. 8 If this happens, workers are shown a screen that requests them to “return” the HIT. A returned HIT can be completed by a different worker and any worker who dropped an HIT is no longer eligible to participate. Other than being no longer eligible to participate, returning a HIT does not harm the worker. 9 A common complaint of AMT is that workers randomly enter text in hopes of bilking the requesters. Any worker doing so, in our study, would likely answer the questions incorrectly and be kicked out of the experiment. 10 2 workers did lose connection but no others reported problems. These workers are paid based upon the number of transcriptions they said they completed but are omitted from the analysis. 11 We tested this with an older laptop and wifi connection which we gradually moved away from the wireless internet source. 7 Of 3 just do your best and do as many sentences as you can” while in GOAL treatments workers are told “Working for us your goal is to make $2”. Because we vary the piece rate we end up with 4 treatments (DYB - 2 Cents, DYB - 5 Cents, GOAL - 2 Cents, and GOAL - 5 Cents). Because our treatment relies on English comprehension we split the sample by national language but also present pooled results. Summary statistics for each of the treatments and split by Western English Speaking (WES) can be seen in Table 1. These treatments at a cursory glance maybe problematic as they require turkers to be aware of the value of a dollar is. Normally this is a valid criticism but in the case of AMT it is unwarranted. AMT is well established and has a long history of operating with the US dollar. Most, if not all, workers will be aware of this as well as the buying power of the dollar in their home country. Table 1: Treatments ALL WES Non - WES DYB - 2 Cents 55 24 31 GOAL - 2 Cents 53 26 27 DYB - 5 Cents 64 25 39 GOAL - 5 Cents 62 32 30 ALL 234 107 127 The stakes of the experiment are not trivial - especially by AMT standards. With the bonus, workers in the five cents treatments could make up to 4 dollars and 95 cents (or around a half day’s work in India) in under an hour.12 Workers in the 2 cent treatments could top out at lower amount ($2.20) but this amount is still generous by AMT standards. All currency is in United States denominations. This is not probably does not create any confusion as AMT is an established labor market and the working currency has been US dollars (or cents) since its inception. 4 Results Over ninety percent of workers are in the USA (45 %) or India (46 %). The remainder hail from a variety of nations but primarily developed western ones. As such, it comes as no surprise that our workers are much more heterogeneous than usual laboratory subjects. The gender of workers is evenly split; 49 % of workers indicated they are female. We gather information regarding the age of workers, education and income. The average age of workers is 34 years. The modal income and education of workers is between $12,500 and $25,000 per year and a bachelors degree.13 Average production (the number of transcriptions completed) is in Table 2. The distribution of production by treatment and by Western English Speaking nations14 (WES) can be found in the appendix (Figures 2, 3, and 4). The differentiation between WES and non-WES workers is essentially differentiating between high and low ability workers. Workers from WES nations spend about 15 seconds less per transcription on HIT (t = −1.91) .15 Moreover, workers from non-WES 12 National Sample Survey Organisation. (2011). Employment and Unemployment in India, 2009-10, NSS Sixty Sixth Round. Report No. 537. New Delhi: Department of Statistics, Government of India. Retrieved February 1, 2013, from http://mospi.nic.in/Mospi New/upload/NSS Report No 537.pdf (Login Required). 13 We present modes instead of averages here because workers are asked to select an appropriate category rather than enter a number. Of course averages are available upon request. 14 Australia, Canada and the United States of America. 15 This is rough approximation as it includes time spent on the survey. 4 Table 2: Production by Treatment and English ALL WES Non-WES DYB - 2 Cents 60.60 (4.34) 58.92 (6.69) 61.90 (5.78) GOAL - 2 Cents 62.81 (4.91) 70.00 (6.79) 55.89 (6.94) DYB - 5 Cents 58.89 (4.18) 64.16 (6.68) 55.51 (5.36) GOAL - 5 Cents 57.70 (3.97) 55.88 (5.72) 59.58 (5.57) ALL 59.86 (2.15) 61.93 (3.21) 58.13 (2.9) Standard deviations in parentheses. nations spend over five and a half minutes more on the HIT than workers from WES nations (−2.81). Initially, with pooled data we find no significant treatment effects. However when we separate by WES, we find Goal - 2 Cents production of workers in WES countries to be greater than similar workers in DYB - 2 Cents (p = 0.126) and Goal - 5 Cents (p = 0.057). Identical methods with non-WES nations’ residents result in no significant differences. In Table 3 we estimate worker production and whether or not they completed all the possible transcriptions as a function of the treatment, income, and whether or not they live in a WES nation. Models 1 through 3 are tobits with the dependent variable being the number of transcriptions completed; models 4 through 6 are probits with the dependent variable being equal to 1 if the worker completed all 100 transcriptions.16 We find no significant treatment effects in the regressions with pooled data (models 1 and 4) but workers from a WES nation are more productive and are more likely to complete the all 100 transcriptions. Workers reporting higher incomes complete fewer transcriptions but this primarily being driven by workers from WES nations; when we remove them from the sample the coefficient estimate on income switches sign and is not significantly different from zero. Returning now to productivity we find a single modest treatment effect occurring in Goal - 2 cents and only when we separate the sample by WES. Workers in non-WES nations exhibit no change in productivity across all treatments. This result is echoed in the probit models (4, 5, and 6) where only the workers in WES nations have higher likelihood of completing all 100 transcriptions.17 16 To save space, we do not present marginal effects here. However, they can be found in the appendix. Additionally, alternative models (e.g., simple regression and ordered probits with a bin size of ten) and “kitchen sink” models are also tested. Results did not change significantly. 17 Increasing the sample size may be worthwhile endeavor here and may result in significant differences. However we argue these differences, except in the case of the Goal -2 cents in WES workers, would not be economically significant - especially after controlling for income and nation of origin. Since only one treatment is large in magnitude and significant, increasing the sample size would almost certainly only shrink standard errors down to the point where we are reporting significant “zeros.” 5 Table 3: Empirical Results Model 1 Model 2 Model 3 Model 4 Model 5 Model 6 Tobit Tobit Tobit Probit Probit Probit DYB - 2 Cents 4.12 (8.45) 5.17 (12.74) 2.99 (11.15) 0.10 (0.25) 0.13 (0.37) 0.06 (0.35) DYB - 5 Cents 2.71 (8.00) 13.66 (12.65) -5.02 (10.46) 0.01 (0.25) 0.23 (0.36) -0.19 (0.34) GOAL - 2 Cents 8.90 (9.42) 22.56† (14.55) -3.46 (12.08) 0.40† (0.25) 0.7** (0.35) 0.08 (0.36) Income -2.73* (1.59) -3.58* (1.86) 0.88 (3.14) -0.06 (0.05) -0.06 (0.05) -0.03 (0.11) WES 10.97† (6.99) Constant 66.84*** (7.55) 75.83*** (11.78) 64.53*** (11.08) -0.64*** (0.23) -0.43 (0.32) -0.56* (0.34) Obs Log L Subject Pool Lower Limit Upper Limit 233 -910.071 107 -393.520 126 -513.942 233 -138.524 107 -65.429 126 -71.995 ALL WES Non-WES ALL WES Non-WES 2 70 2 37 0 33 NA NA NA NA NA NA 0.34* (0.20) Standard Errors in parentheses. ***: p < .01, **: p < .05, *: p < .1 and †: p < .15. Lower and Upper Limit are the number of workers who completed the 5 and 100 transcriptions. 6 5 Discussion and Conclusions We find no evidence of an increase in productivity in any of the 5 cent piece rate treatments. This result, while somewhat surprising, is consistent with much of the previous literature (Gneezy and List, 2006; Kube et al., 2006; Hennig-Schmidt et al., 2010) and in contrast to Gómez-Miñambres et al. (2012). Regarding our treatment, we find limited evidence in support of a negative correlation between time spent working and wage/piece rate. The treatment effect is only significant if the sample is separated and each sub-sample estimated independently. This results in an odd but non-trivial outcome where there are significant treatment and income effects in one sample that are absent in the other. Because income is insignificant in non-WES nations, income differences across the populations do not explain the results (poor people in non-WES nations are as productive as comparatively richer people in the same block of nations). Within the realm of WES nations, our results present additional evidence in support of the papers mentioned in the introduction. Positive wage shocks in labor markets where workers set their own working hours, reduces worker productivity. How this extends to labor markets where workers have set hours and/or wages based upon their time at work, we are somewhat agnostic to. However, we posit the following: within the lab, there exists a norm to not allow subjects to quit the experiment. While natural to most work environments it is far from universal. For instance, Gómez-Miñambres et al. (2012) allow subjects to goof off on the internet, and this option has much relevance in the world outside the lab, but it does not replicate environments where workers have almost literally any outside option they wish. What this means, is that goals can be quite effective in workplaces where workers have set hours and high job search costs. Relax both of these traits and ex-post it becomes less surprising that goals have little impact on productivity. This is especially interesting because it demonstrates that as labor markets move in the direction of a neoclassical firm (with homogeneous labor and no search costs ), the behavioral aspects become less important. It follows that as the environments become more complicated and move away from a neoclassical firm, the behavioral aspects becoming increasingly important. The lack of significance in the treatment effect in pooled and non-WES sample also presents a limitation researchers should consider before running an experiment on AMT. Read and Loewenstein (1995) suggest and present evidence of a diversification bias. An interesting but problematic twist of this bias is also present in AMT: workers have multiple HITs they are able to complete at any given time. Consequently, workers may select to both maximize their monetary earnings and also vary the portfolio of the HITs they complete. Such a bias would be more pronounced in populations less skilled in English. Presumably, workers in these populations glean less from instructions and learn by doing rather than the instructions and if they are not good at the task, they quit. In these cases, their stopping point would depend on a time per HIT threshold which would be independent of our treatment. This explains why workers from non-WES nations are not impacted by the treatments; they simply are not good enough at the task for the goals to matter. However, such a bias requires the AMT labor market to also be more in the direction of “real” labor market as it has a search feature which absent in the lab. This introduces another interpretation of our results: workers in WES nations are treating AMT as a real labor market where they can quit their job at anytime. This would be consistent with the observation that workers from WES nations are significantly richer (t= -7.35) and are more likely to be working on AMT for primary or secondary source of income (t=-2.37). This presents evidence to the contrary that turkers from WES nations are on AMT for more recreational purposes. 7 References Austin, James T, and Jeffrey B Vancouver (1996) ‘Goal constructs in psychology: Structure, process, and content.’ Psychological bulletin 120(3), 338 Camerer, Colin, Linda Babcock, George Loewenstein, and Richard Thaler (1997) ‘Labor supply of new york city cabdrivers: One day at a time.’ The Quarterly Journal of Economics 112(2), 407–441 Chou, Yuan K (2002) ‘Testing alternative models of labour supply: Evidence from taxi drivers in singapore.’ The Singapore Economic Review 47(01), 17–47 Cooper, David J, and David B Johnson (2013) ‘Ambiguity in performance pay: An online experiment.’ Available at SSRN 2268633 Falk, Armin, and Markus Knell (2004) ‘Choosing the joneses: Endogenous goals and reference standards.’ The Scandinavian Journal of Economics 106(3), 417–435 Fehr, Ernst, and Lorenz Goette (2007) ‘Do workers work more if wages are high? evidence from a randomized field experiment.’ The American Economic Review pp. 298–317 Gneezy, Uri, and John A List (2006) ‘Putting behavioral economics to work: Testing for gift exchange in labor markets using field experiments.’ Econometrica 74(5), 1365–1384 Goerg, Sebastian J, and Sebastian Kube (2013) ‘Goals (th) at work.’ Max Planck Institute for Research on Collective Goods Gómez-Miñambres, Joaquı́n (2012) ‘Motivation through goal setting.’ Journal of Economic Psychology 33(6), 1223–1239 Gómez-Miñambres, Joaquı́n, Brice Corgnet, and Roberto Hernán González (2012) ‘Goal setting and monetary incentives: When large stakes are not enough.’ Technical Report Helson, Harry (1964) Adaptation-level theory. (Harper & Row) Hennig-Schmidt, Heike, Abdolkarim Sadrieh, and Bettina Rockenbach (2010) ‘In search of workers’ real effort reciprocitya field and a laboratory experiment.’ Journal of the European Economic Association 8(4), 817–837 Hsiaw, Alice (2013) ‘Goal-setting and self-control.’ Journal of Economic Theory 148(2), 601–626 Kahneman, Daniel, and Amos Tversky (1979) ‘Prospect theory: An analysis of decision under risk.’ Econometrica: Journal of the Econometric Society pp. 263–291 Koch, Alexander K, and Julia Nafziger (2011) ‘Self-regulation through goal setting*.’ The Scandinavian Journal of Economics 113(1), 212–227 Kőszegi, Botond, and Matthew Rabin (2006) ‘A model of reference-dependent preferences.’ The Quarterly Journal of Economics pp. 1133–1165 Kube, Sebastian, Michel André Maréchal, and Clemens Puppe (2006) ‘Putting reciprocity to workpositive versus negative responses in the field.’ University of St. Gallen Economics Discussion Paper Laisney, Francois, Winfried Pohlmeier, and Matthias Staat (1996) Estimation of labour supply functions using panel data: A survey (Springer) Levitt, Steven D, and John A List (2007) ‘What do laboratory experiments measuring social preferences reveal about the real world?’ The journal of economic perspectives pp. 153–174 8 Locke, Edwin A, and Gary P Latham (1990) A theory of goal setting & task performance. (PrenticeHall, Inc) (2002) ‘Building a practically useful theory of goal setting and task motivation: A 35-year odyssey.’ American psychologist 57(9), 705 Lucas Jr, Robert E, and Leonard A Rapping (1969) ‘Real wages, employment, and inflation.’ The Journal of Political Economy 77(5), 721 Lynn, Michael (2002) ‘Turnover’s relationships with sales, tips and service across restaurants in a chain.’ International Journal of Hospitality Management 21(4), 443–447 Read, Daniel, and George Loewenstein (1995) ‘Diversification bias: Explaining the discrepancy in variety seeking between combined and separated choices.’ Journal of Experimental Psychology: Applied 1(1), 34 Sun, S., Vancouver J.B., and J.M. Weinhardt (2014) ‘Goal choices and planning: Distinct expectancy and value effects in two goal processes.’ Organizational Behavior and Human Decision Processes Wu, George, Chip Heath, and Richard Larrick (2008) ‘A prospect theory model of goal behavior.’ Technical Report, mimeo 9 A Figures Figure 2: Productivity of Workers Figure 3: Productivity of Non - WES Workers 10 Figure 4: Productivity of WES Workers B Marginal Effects Table 4: Marginal Effects from Table 3 Model 1 Tobit Model 2 Tobit Model 3 Tobit Model 4 Probit Model 5 Probit Model 6 Probit DYB - 2 Cents 4.12 (8.63) 5.17 (13.19) 2.99 (11.3) 0.04 (0.09) 0.05 (0.14) 0.02 (0.11) DYB - 5 Cents 2.71 (8.32) 13.66 (13.11) -5.02 (10.7) 0.00 (0.08) 0.09 (0.14) -0.06 (0.1) Goal - 2 Cents 8.9 (8.88) 22.56* (13.62) -3.46 (11.64) 0.14 (0.09) 0.27** (0.13) 0.03 (0.12) Income -2.73* (1.6) -3.58* (1.93) 0.88 (3.28) -0.02 (0.02) -0.02 (0.02) -0.01 (0.03) WES 10.97 (6.89) OBS Subject Pool Lower Limit Upper Limit 0.12* (0.07) 233 107 126 233 107 126 ALL WES Non-WES ALL WES Non-WES 2 70 2 37 0 33 NA NA NA NA NA NA Standard Errors in parentheses. ***: p < .01, **: p < .05, *: p < .1 and †: p < .15. Lower and Upper Limit are the number of workers who completed the 5 and 100 transcriptions. 11 C C.1 Online Appendix Instructions Phase 1: Before we begin we would like to gather some information regarding our workers (you). Please complete this short survey. When you are finished, please click the NEXT button. What is your gender? What is your age? What country do you currently live in? Which of the following best describes your highest achieved education level? Over the weekend, Bob watched two football games. In the box below below, type the number of football games that Bob watched over the weekend. Be sure to use a numeric character! What is the total income of your household? Why do you complete tasks in Mechanical Turk? Please check any of the following that applies: Next year, Jack and Jill are planning on visiting Disneyland. Jill has been to Disneyland many times while Jack has never been. In the box below below, type how often Jack has been to Disneyland. Be sure to use a numeric character! We are crowdsourcing the transcription of an instruction manual of the 1996 Oldsmobile cutlass. You will be given a series of photocopies depicting short texts from the manual. Your job is to type these short bits of text into the provided text box. We will spot check your work to make sure it is of sufficient quality. You will be paid 2 cents for each of these practice transcriptions. After which, you will start the actual task. Please click the NEXT button to begin the practice messages. YOU MUST COMPLETE THESE PRACTICE MESSAGES TO BE PAID. You have now completed the practice questions. Thank you for completing the practice assignments. You will again be given sentences that were scanned from a book. We need you to type out each sentence exactly. You will be paid {2 cents, 5 cents} for every sentence that you type. $2 - Working for us your goal is to make $ 2 dollars. DYB - Working for us your goal is to just do your best and do as many sentences as you can. In order to be sure you are reading the instructions, please input the number X in the input box below. Please note that you can quit at any time by clicking the quit button. Once/if you quit, you will be given some brief instructions and then be told to submit the HIT. Your performance will be recorded 12 to pay you and with your consent may be used for research in the future. The following study investigated the influence financial goals and financial incentives had on motivation to work and the decision to stop working. The reason we did not inform you in the beginning that this was a study was because we wanted you to treat the work task as a real Amazon Mechanical Turk job. Therefore, we are able to understand how people decide to continue working and stop working in a real economic marketplace. Previous research shows that when people have a smaller financial incentive but a difficult goal, they will work longer. We are trying to understand why this effect occurs. Survey Description and Consent Form: This study is conducted by a team of researchers at the Haskayne School of Business, the University of Calgary. The principal investigator is Dr. Justin Weinhardt, Assistant Professor at the Haskayne School of Business. This research protocol is approved by the Conjoint Faculties Research Ethics Board (CHREB). The purpose of this research is to investigate and better understand the factors affecting decision making under different conditions. Your participation in this study is voluntary and you may refuse to participate altogether or may choose to withdraw from the study at any time. In the experiment, you were asked type out various sentences. You were paid based on your performance. You will now be asked additional questions, consisting of general background, personality and demographic data. The collected data will be kept on a password protected computer drive, stored in a secured location and accessible only by the researchers for research purposes, including the publication of scientific papers. Participation is completely voluntary, anonymous and confidential. No one except the principal investigator and the research team will be allowed to see any of the answers to the questions. No personal identifying data will be collected in this study. There are no names collected and attached to the responses. Only group information will be summarized for any presentation or publication of results. In case you withdraw from this study, the data collected to the point of withdrawal will be deleted. There were no foreseeable risks, harms, or inconveniences associated with your participation in this study. The only cost on your part is the time you will spend for participating in this survey. Your responses may be used for research purposes, but only with your consent. Regardless of whether or not you want your responses used for research purposes, you will be paid. If you agree with the terms of this study, please press USE MY DATA FOR RESEARCH. If you do not agree for your data to be used in a research project, please click DO NOT USE DATA FOR RESEARCH. In the following survey, you will be presented with a number of different decision problems and fill out a scale regarding personality and mental focus. If you have questions about this study, you can contact Dr. Justin Weinhardt, the principal investigator, at [email protected]. If you have concerns about your rights as a research participant, you may contact the University of Calgary Ethics Resource Officer through email: [email protected] or telephone: 403-210-9863. USE MY DATA FOR RESEARCH DO NOT USE MY DATA FOR RESEARCH 13
© Copyright 2026 Paperzz