Down to Ten: Estimating the Effect of a Red Card in Soccer Author(s): G. Ridder, J. S. Cramer and P. Hopstaken Source: Journal of the American Statistical Association, Vol. 89, No. 427 (Sep., 1994), pp. 11241127 Published by: American Statistical Association Stable URL: http://www.jstor.org/stable/2290942 . Accessed: 01/04/2014 02:11 Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp . JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. . American Statistical Association is collaborating with JSTOR to digitize, preserve and extend access to Journal of the American Statistical Association. http://www.jstor.org This content downloaded from 128.135.12.127 on Tue, 1 Apr 2014 02:11:56 AM All use subject to JSTOR Terms and Conditions Down to Ten: Estimatingthe Effectof a Red Card in Soccer G. RIDDER,J. S. CRAMER,and P. HOPSTAKEN* We investigate the effectof the expulsionof a playeron the outcomeof a soccermatchby means of a probabilitymodel forthe score.We proposeestimators oftheexpulsioneffect thatare independentoftherelativestrength oftheteams.We use theestimates theexpulsioneffect on theoutcomeofa match. to illustrate KEY WORDS: Conditionallikelihood;Poissonprocess;Soccer; Unobservedheterogeneity. 1. INTRODUCTION and 90 minutes.Recordedtime is measuredfromthe beginningof the matchand fromits resumptionafterthe inProfessionalsoccer(knownoutsidethe United Statesas terval,however.As a result,theremay be some minutes football)is popularall overtheworld;in Europe and South whenthereis no playat all, whereasthe45thand 90thminAmericait is the dominantspectatorsport.Because soccer utes may lastlongerthana fullminute;but thisis a minor is a low scoringgame,the ruleshave been oftenrevisedso distortion. as to raisethenumberofgoalsscoredbyeithersideand thus Let increasetheplay'sappeal.Since 1990,playerscan be expelled ri = minutein whicha playeris expelledfromteam 2, forthe restof a matchforillegaldefensiveactions,such as goal by Nij = totalnumberof goals scoredin matchi by teamj, repeatedflagrantfoulsand preventingan adverse showing him Kij = numberofgoals scoredbefore-ri, illegalmeans.The refereeexpelstheplayerby Mij = numberofgoals scoredafterTi, a redcard. of teamj in matchi at the theeffect ofsuchan expulsion Xij(t) = scoringrateor intensity In thisarticlewe investigate tth minute of play, holds widely on the outcome of a match.Popular opinion = multiplicative effect on Xij(t)ofexpulsionofplayer oftheredcard,butas far different viewson theeffectiveness from team 2, and to not been submitted empirical as we knowthequestionhas of teamj in matchi as compared ,yij= relativestrength research.We proposea model forthe effectof the redcard with the overall in of the the teams strengths thatallowsforinitialdifferences averagescoringrate,X(t). in the match. for the during and variation scoringintensity We make thefollowingthreeassumptions: Poisson we proposea time-inhomogeneous Morespecifically, 1. The two teams score accordingto two independent forthescoreofeitherside. effect modelwitha match-specific Poisson processes.As a consequence,the numberof goals a card by conof the red effect We estimatethedifferential scored team 1 is stochastically ofthenumber by independent inis estimator that ditionalmaximumlikelihood(CML) of 2. scored team the time intervalsbegoals by Moreover, effects. This estimator was dependentof the match-specific tween are The subsequent goals stochastically independent. and GrilHall, in by Hausman, introduced econometrics are not the thus intensities constant scoring during match; of Andersen (1973). liches(1984), buildingon ideas In Section2 we specifythemodel,in Section3 we discuss thePoisson processesare nonhomogeneous. 2. The ratioofthescoringintensities ofthetwofullteams and in Section4 we givetheresults.We consider estimation, = is a each constant for game; that is, in Xij(t) yijX(t)formatches some implicationsoftheestimates Section5. of 11against11players, withX(t) theaveragescoringintensity 2. A MODEL FOR THESCORE at thetthminuteofplayoffullsidesof 11 against11. IN A SOCCER MATCH 3. Afterthe red card,fort > Ti, team 2 has 10 players, and thescoringintensities are OjyijX(t),j = 1, 2. i denotes First,we introducesome notation.The subscript In Assumption1 we describethe score in a matchas a a match,and j = 1, 2 denotesthetwo sides in thatmatch; a teamin a matchis thusidentified bytwosubscriptsij. We randomphenomenonthatis onlypartlypredictable.It deof the restrict attentionto matcheswitha redcard,and we always pends on the playingtime,on the relativestrength take it thatthe red card is givenagainstthe second side,j teams,and on the effectof the red card. As we show,the = 2. Time is measuredin minutesfrom0 to 90, whichis scoringintensity increaseswiththetimeplayed.Ifwe do not oftheredcardwillbe overstated, the officialdurationof a match.In soccerthe clock is not allowforthis,thentheeffect but the refereecan allow because we confoundit withthe timeeffect.Of coursethe stoppedwhenplay is interrupted, affected oftheteams. and secondhalfs,after45 scoreis strongly bytherelativestrength forlosttimeat theend ofthefirst The incidenceof red cards may be relatedto the relative so thata comparisonof red card games to unin* G. Ridderis Professor, Free University, strength, Departmentof Econometrics, In adgamesgivesa biased estimateoftheeffect. Amsterdam,The Netherlands.J. S. Crameris Professor,Departmentof terrupted Economics,and P. Hopstakenis SeniorResearchFellow,Foundationfor EconomicResearch,University of Amsterdam, The Netherlands.The authorsthankGusta Renes forhelpfulcomments, Tony Lancasterforspotting an embarrassing errorin a previousversion,and theeditorand tworeferees forcommentsthathave improvedthearticleconsiderably. ? 1994 AmericanStatisticalAssociation Journalof the AmericanStatisticalAssociation September 1994,Vol. 89, No. 427, Statisticsin Sports 1124 This content downloaded from 128.135.12.127 on Tue, 1 Apr 2014 02:11:56 AM All use subject to JSTOR Terms and Conditions Ridder, Cramer, and Hopstaken: The Effectof a Red Card in Soccer 1125 Table 1. Goals Scored in the 1991dition,the timingof red cards may also be relatedto the 1992 Season by 15-MinuteIntervals relativestrength oftheteams,and againthisbiasestheeffect. The thirdfactoris the effectof the red card,whichby AsTimeinterval Numberofgoals (min) sumption3 is measuredby 01 and 62. It is notouraim to predicttheoutcomeofsoccermatches, 0-15 128 whichrequiresan estimateof yij.Our estimateoftheeffect 16-30 140 31-45 147 oftheredcard is independentof yij,whichis ofgreathelp, 46-60 169 a good estimateof yj is difficult as experience becausefinding 61-75 170 shows. 76-90 198 The Poisson assumptionand its implicationsformAssumption1. It is notdifficult to relaxthePoissonassumption at thecostofa morecomplexstatistical model,butour limscoredbythesame teambeforeand aftertheredcard.More itednumberof observationswillnot supportthis. precisely,we considerthe fractionof the goals scoredafter the red card,whichwe denoteby yij.It is intuitively 3. STATISTICALANALYSIS clear thatthisfraction is independentofthetime-constant match3.1 Estimation of the Average Scoring Intensity specificeffect. Under Assumptions1 to 3 (withP denotingthe Poisson In Table 1 goals scored in 340 fullmatchesin the two professional soccerdivisionsin theNetherlandsin the 1991- distribution), 1992 season are classifiedby 15-minuteintervalsof play. This showsthatthe rateof scoringincreasesmonotonically Kij-P(y j fX(t)dt) and MijP(QP fjX(j (t)dt). I over the match,as has also been observedin Englandby Morris(1981). (3) If we assume thatthe averagescoringintensity increases linearlyduringthematch,thentheexpectednumberofgoals In the sequel we denote scoredby a teamj in matchi in intervals is = A(t) dt and Bi = A(t) dt. (4) s = 1, . ,6, (1) E(Nij,) = yij(15a + 112.50(2s -1)) Ai so thatthe averagenumberof goals scoredby one team in intervals is E(Ns) = 15a + 112.53(2s- 1) s = 1,..., 6, (2) definesa scale for yij; wherewe take y = 1. This implicitly forexample,if yiy= 2, thenteamj has a scoringintensity in matchi thatis two timestheaverage. The averagenumberofgoals per minutein timeinterval s equalstheentryin Table 1dividedby680,twicethenumber ofcontests.Estimatesofa and : are theneasilyobtainedby ordinaryleast squares (OLS) regression.With X(t) as the fora 90-minutegame,we find(R2 = .95; scoringintensity standarderrorsin parentheses) f f The conditionaldistribution ofMi, givenNij, is MijINij - B(Nij, gij(0)), (5) whereB denotesthebinomialdistribution and g,1(O)= A +BB (6) The conditionaldistribution is degenerateifNij = 0, andy1j is definedonly ifNij 2 1. In the CML procedurewe omit observationswithNij = 0. The estimatorof the red card effect is notbiasedbythisrestriction, as we shallsee presently. In theconditionaldistribution (5) and in theconditional effectsyiYcancel. Up to an likelihood,the match-specific a = 1.050(.024) and ,=.00776(.00072). additive constantthat does not depend on Oj, the logNote thatthe reportedstandarderrorsare consistentin the likelihoodis Inclusionofa quadraticterm presenceofheteroscedasticity. nj did notimprovethefit.In thesequel we ignorethesampling + (Ni - Mij)log(l -giJ(Oj)), logLJ= z M0jlog(g,1(61)) thecomputation varianceoftheseestimates.This simplifies i=l as theyare ofvariancesand is an acceptableapproximation, (7) small.The estimatesimplythatthescoringintensity increases duringa 90-minutegame from1.05 in the firstminuteto withnl, n2denotingthe numberof observationson teams 1.75 in thefinalminute. thatdo notand do receivea redcard.Because we condition on thetotalscores,Nij,we can treatthemas nonstochastic 3.2 A Conditional Maximum Likelihood Estimator constants.Hence omittingobservationswitha giventotal Because the incidenceof red cardsis probablyrelatedto score-in particular,observationswithNij = 0-does not the relativestrengthyij,a comparisonof red card matches affecttheCML estimator. The likelihoodequationis withothermatchesmay give a biased estimateof the red card effect.For thatreason,we proposean estimatorthat does not depend on the -ybJ's or on theirdistribution. This = E N,1yij. (8) E N,jg,j(OaCMLJ) estimatoris based on a comparisonofthe numberof goals i=l i=l This content downloaded from 128.135.12.127 on Tue, 1 Apr 2014 02:11:56 AM All use subject to JSTOR Terms and Conditions Journal of the American Statistical Association, September 1994 1126 This is a momentequation,equatinga weightedaverageof the yij's to a weightedaverage of theirexpectations.The weightsare thetotalscoresNij, whichby conditioningcan be treatedas knownconstants. In derivingthepropertiesoftheCML estimator, we note in thelogit thatthebinomialparametergij(0j)can be written is globallyconcavein log(0), form.Hence thelog-likelihood so thatthe CML estimatorforOjis uniquelydefined.The asymptoticvarianceof theCML estimatorcan be obtained in theusual way. 3.3 Table 3. Probabilitiesofthe Outcomeofa Match witha 15-MinuteExclusionStartingat r Minuteofstart ofpenaltyr Pr(teamof 11 wins) .42 .42 .43 .43 .43 .44 0 15 30 45 60 75 Pr(draw) Pr(teamof 10 wins) .24 .24 .24 .24 .24 .24 .34 .34 .33 .33 .33 .32 OLS Estimation Withan additionalassumption,we can estimatetheeffect and 0CML2 = .95 (.20). OCML1 = 1.88 (.29) From (3), ofthe redcard by linearregression. Accordingto the CML estimates,the scoringintensityincreasesby 88% forthe team with 11 players;thiseffectis Kij = _jAi+ (7ij - -j)Ai + (Kij - E(KiiLIij)) The scoringintensityforthe team statistically significant. = 7jAi + vij with 10 players(team 2) hardlychanges;the effectis not from1. significantly different and The OLS estimatorgivesratherdifferent results(withthe in = + + standard errors consistent the of heteroscedaspresence Mij _j6jBi (7ij 1j)0jBi (Mij E(MijLI0i)) ticity): = VjOjBi + V2ij (9) In (9) we allowtheaveragerelativestrength in redcardgames to differ fromthe overallaverage1. 1 and 2 indicatethe of the teamsbeforea playerof team 2 is averagestrengths and are independent, expelled.The disturbances vIijand V21J an additionalassumptionis requiredforconsistent estimates, viz. cov(yij, A ) = cov(yij, B ) = 0. A sufficient condition forthisis thatTi and -iyarestochastically Under independent. thisassumption,we can estimateQbytheratiooftheregresin (9). sion coefficients 4. ESTIMATIONRESULTS We applyCML estimationto data on 140 redcardgames in the seasons 1989-1990, 1990-1991, and 1991-1992 in bothdivisionsof the Dutch professionalfootballleague. In 13 of thesematches,two or moreredcardsweregiven.Beofbeingone playerup or down, cause we estimatetheeffect the partafterthe second expulsionis omitted.In onlytwo matcheswere a red card and a penaltykick givenjointly. Because forthe CML estimatorwe mustomitobservations numberof wherea team has not scoredat all, the effective observationsis 112 forteams with 11 playersand 93 for results(stanteamswith10players.We obtainthefollowing dard errorsin parentheses): Table2. Probabilities oftheOutcomeoftheMatch oftheRed Card byMinute of Minute redcard r Pr(teamof11 wins) 0 15 30 45 60 75 90 .65 .62 .58 .54 .49 .44 .375 6OLS1 = 1.43 (.03) 6OLS2 = 1.14 (.03). The estimatedincreasein thescoringintensity forteam 1 is much smallerthan forthe CML estimator(but highlysignificant).More surprisingly, theOLS estimatorshowsa staincreasein thescoringintensity forteam tistically significant 2. Hence usingbetween-game information givesratherdifferentestimatesthat moreoverare hard to interpret. The oftheOLS estimatorshowthatteams first-stage regressions thatreceivethered card have the same scoringintensity as theaverage( 2 = 1.03(.09)), buttheopposingteamis much stronger(^Y1= 1.33 (.09)). Hence the red card usuallyis givento thealreadyweakerteam. By stratifying our sample,we can investigate whetherthe estimates arerobustagainstchangesin thespecification. First, we testwhethertheredcard effect dependson the venue of play.Thiscaptures,amongotherthings, thehomeadvantage, and theestimateshouldbe invariantto thisdistinction. The LR statisticis .53 forthe team with 11 playersand .66 for theteamwith10 players;hencewe can notrejectinvariance. The estimatesare 61,home = 2.00 (.36), 61,away = 1.56 (.46), and 02,home = .73 (.28), 62,away = 1.07 (.27). The estimates are also invariantto stratification on the total score in a match.In thesequel we use theCML estimatesto illustrate of theredcard on the outcomeof a match. theeffect OF THEESTIMATES 5. IMPLICATIONS We can use the resultsto illustratethe effectof the red card on a soccermatch.In Table 2 we givetheprobabilities Pr(draw) Pr(teamof10 wins) ofthethree possibleoutcomesofthematchbetweenequally teams as a functionof 1-. The last row of the table .17 strong .18 .18 .20 of a drawbetweentwo teamsof showsthatthe probability .20 .22 average strength is .25. This is an indicationof the role of .25 .21 chance in the outcome of a soccer match.The roleofchance .28 .23 .24 .32 was also stressedby Osmond(1993) . A redcardearlyin the .375 .25 matchincreasesteam 1l'sprobability ofvictorysubstantially. This content downloaded from 128.135.12.127 on Tue, 1 Apr 2014 02:11:56 AM All use subject to JSTOR Terms and Conditions Ridder, Cramer, and Hopstaken: The Effectof a Red Card in Soccer 1127 Table 4. Expected Numberof Goals in Match by Minuteof Red Card Minuteofred card T Expected numberofgoals 0 15 30 45 60 75 90 3.95 3.80 3.63 3.45 3.25 3.03 2.80 APPENDIX:DATAUSED IN ANALYSIS The symbols are introduced in Section 2. Team 2's probability ofvictory decreasesevenmore,whereas thechangein theprobability of a drawis relatively small. Witha redcard,a playeris expelledfortheremainderof thematch.In indoorsoccerand ice hockey,a playercan be excludedfora certainperiod.In Table 3 we showtheeffect of a 15-minutetime penaltyforequally strongteams. Althoughthe effect dependson thetimeat whichthepenalty is imposed,thisdependenceis ratherweak. As notedin Section1,a motivationforthemorefrequent use oftheredcardis to increasethenumberofgoals scored in a match.Table 4 showsthatit has thedesiredeffect. We also considerthe dilemmaof a defenderwho facesa playerwho threatensto break throughthe defense.If the opposingplayerhas a clearwayto thegoal,trippingup the playerresultsin a red card forthe defender.If the player goes past thelastdefender,he willscorewitha highprobability.In our calculationwe assumethattheobjectiveofthe oflosingthematch. defenderis to minimizetheprobability Thereis a unique momentin a contestat whichtheoptimal actionofthedefenderchanges.Afterthatmoment,it is optimal to tripup the opposingplayer.These times,which depend on the probabilitythatthe attackerwill score and on therelativestrength of thedefender'steam (withtheattacker'steam of averagestrength, -y= 1), are reportedin incentiveto resort Table 5. The weakerside has a stronger to illegaldefense.This is consistentwithour observation thatthered card is usuallygivento theweakerside. It may also induce a correlationbetweenX and y,and such a correlationbiases theOLS estimates. K] K2 Ml M2 1 0 1 0 0 0 0 0 0 1 0 0 0 0 0 1 1 0 0 0 1 3 0 1 0 0 0 0 1 0 0 1 0 2 1 0 0 1 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1 1 0 0 0 0 0 1 0 1 1 0 0 0 0 0 0 5 0 3 1 1 0 0 3 2 5 3 1 1 2 1 2 1 3 1 0 3 2 2 1 1 0 0 1 1 0 4 2 0 2 2 1 0 3 0 2 0 0 2 3 0 2 0 0 1 0 2 1 0 0 0 2 2 0 2 1 4 0 0 0 0 0 0 1 1 0 0 0 0 1 2 0 0 3 0 1 1 0 1 1 1 0 0 2 0 0 0 0 1 0 2 0 0 1 0 2 0 0 0 0 2 0 0 1 0 1 0 0 1 1 0 0 0 2 T K] K2 Ml 10 11 15 17 20 25 25 26 30 32 33 33 33 33 35 36 36 37 39 39 40 44 44 44 44 44 44 45 45 46 47 47 48 50 50 52 52 52 52 53 55 55 55 55 56 56 56 1 1 2 1 0 1 1 1 2 0 2 0 2 0 0 0 1 0 0 0 1 0 1 3 0 1 1 1 0 1 1 0 1 1 3 2 2 1 1 1 0 0 3 0 1 1 1 1 2 0 0 2 1 1 0 1 0 0 1 0 0 2 1 0 1 1 1 0 1 2 0 0 1 1 0 0 0 1 3 1 1 1 0 1 1 2 0 3 1 2 0 1 0 3 0 0 0 3 1 2 2 0 2 1 1 0 1 0 0 2 1 1 2 2 1 0 0 0 0 0 0 1 0 1 0 0 1 0 0 0 1 1 1 0 0 0 0 0 0 0 0 M2 r 1 0 1 0 1 1 0 0 0 0 0 0 1 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 1 0 58 58 58 60 60 61 61 62 62 65 65 65 65 66 67 0 6 1 0 1 1 1 1 2 0 4 1 1 2 1 0 0 1 1 1 0 3 1 2 2 0 1 1 1 2 1 1 0 0 0 0 0 0 0 0 0 0 0 1 0 68 68 68 70 70 70 70 70 70 71 71 73 73 74 74 74 75 77 78 78 79 80 80 80 80 80 80 81 82 82 82 82 2 2 2 2 2 0 3 2 2 2 4 2 2 2 1 2 1 0 3 0 2 0 0 1 2 0 1 1 2 2 0 1 1 1 0 0 1 1 0 2 1 0 1 0 1 0 1 1 1 3 0 0 1 0 0 0 1 1 1 0 0 0 0 0 0 1 0 0 0 1 0 1 2 2 1 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 1 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 2 0 0 2 0 0 1 1 1 0 2 0 1 2 1 0 1 0 0 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 K] K2 Ml M2 X 82 83 83 83 84 84 84 85 85 85 85 85 85 86 86 86 86 87 87 87 88 88 88 88 88 88 88 88 88 88 89 89 89 28* 30* 36* 36* 40* 43* 45* 60* 63* 64* 69* 71* 78* * Matches withtwo or more red cards. Second red card in 77, 85, 78, 73, 60, 89, 68, 67, 65, 88, 76, 77, 82. Table 5. Time(Minuteof Game) AfterWhicha DefenderShould Stop ofScore and Relative a Breaking-Away Playerby Probability Strengthofthe Defender'sTeam ofscore Probability Relativestrength of teams, y .3 .6 1 .5 1 2 70 71 72 42 48 52 0 16 30 [ReceivedJanuary1993. RevisedMarch 1994.] REFERENCES Andersen,E. B. (1973), ConditionalInferenceand ModelsforMeasuring, Copenhagen:MentalHygiejniskForlag. Hausman,J.A., Hall, B. H., and Grilliches, Z. (1984), "Econometric Models forCount Data Withan Applicationto thePatents-R&D Relationship," Econometrica,52, 909-938. Morris,D. (1981), The Soccer Tribe,London: JonathanCabe. Osmond,C. (1993), "Random Premiership?," RSS News,November,5. This content downloaded from 128.135.12.127 on Tue, 1 Apr 2014 02:11:56 AM All use subject to JSTOR Terms and Conditions
© Copyright 2026 Paperzz