Research Paper No. 1943 Selection Criteria for Roll Call Votes Keith Krehbiel Jonathan Woon September 2005 RESEARCH PAPER SERIES Selection Criteria for Roll Call Votes Keith Krehbiel and Jonathan Woon Stanford University and Carnegie-Mellon University Prepared for the meetings of the American Political Science Association Washington, D.C. September, 2005 Abstract On grounds of inclusion of undesirable votes (type I errors) and exclusion of desirable votes (type II errors), we question the convention of selecting only finalpassage votes for roll call analysis. We propose an alternative selection method based on the estimated salience and strategic significance of roll calls and argue that this method reduces type I and type II errors. We demonstrate that selection of roll calls based on alternative criteria has a major bearing on the asymmetry of partisan roll rates and, we conjecture that its application will also be substantively significant in other modes of inquiry. Selection Criteria for Roll Call Votes Keith Krehbiel and Jonathan Woon Politicians play multiple games in multiple arenas before multiple audiences. Among the many manifestations of this political complexity, roll call voting is one of the most thoroughly studied. But a given roll call cast by a given legislator on a given day may be of major, minor, or no consequence, depending upon individual and collective circumstances. As a general proposition, the political (and hence scholarly) relevance of such behavior is proportional to the salience and the strategic significance of the voting situation. Salience is of particular importance from the individual’s perspective. A reelection-seeking legislator is keenly concerned about the consequences of his roll call when media, interest groups, or constituents are watching—and less so when they are not. Such is the essence of position-taking. Strategic significance, in comparison, is an attribute of the inside game of collective choice and its consequences for policy outcomes. Is the agenda structured optimally for a minimal winning coalition to capture benefits from a minority? At what stage of the legislative process does this happen? Which votes are pivotal in determining the crucial contested content of policy outcomes as opposed to simply date-stamping the end of a piece of legislative business? The challenge of systematically identifying salient and strategically significant roll calls can be illustrated with a recent example. On July 21, 2005, the US House of Representatives chose to consider, debate, amend, and pass HR 3199, Reauthorization of the Patriot Act. In the course of its deliberations, the House recorded 14 roll calls, all of whose consideration was provided for by H.Res. 369, the rule providing for consideration of the bill. The two roll call votes on the rule itself were straight party-line votes: 224-197 on ordering the previous question, and 224-196 on agreeing 1 to the resolution. Thereafter, most of the amendment activity was successful and lopsided, some of its seemingly trivial as well. For example, roll call vote 407, which passed 418-7, changed the term “domestic terrorism” to “federal crime of terrorism.” Not everything was trivial, however. In particular, a motion to recommit with instructions that would impose sunset provisions on the bill was offered by a majority party member (Rep. Rich Boucher of Virginia). It not only attracted unanimous Democratic support and enough Republicans and abstentions to fail by an unusually small margin, 209-218, but also attracted considerable press attention, such as a reference in the Wall Street Journal to Republicans’ “heavy-handed tactics.”1 Likewise, an amendment offered by Democratic Representative Sheila Jackson-Lee was adopted in a reasonably close vote of 233-192, with unanimous minority party support (200-0), substantial majority-party defections (45), but opposition of a majority of the majority party. In other words, the majority party was “rolled.” These close calls vanished, however, when it came time to vote on final passage. The amended bill was adopted 257-171. We cite this example neither to express nor imply any substantive conclusions. Indeed, our lack of special expertise and inside information about the reauthorization of the Patriot Act is, in many respects, the point of the example. Analysts of roll call votes in large-N studies rarely have special expertise or inside information about their hundreds or thousands of observations. Yet, in this case, and undoubtedly many others like it, the convention in recent years is to include only the relatively lopsided and often uninformative vote on final passage, to emphasize (in examples like this one) that the roll call is an instance of the majority party rolling the minority, and to ignore the 1 “House Majority Limits Changes in Patriot Act,” Wall Street Journal, Friday, July 22, 2005, page A5. 2 earlier and closer votes that earned national coverage and in which the minority party rolls the majority party. Although researchers have become increasingly innovative in exploiting roll call votes for theory-testing purposes, alongside innovations come increasing concerns about which roll calls to use for which purposes. Whether by accident or design, and for better or for worse, the emerging convention is to focus on final passage votes.2 Thorson (1998) singles out such votes for being distinctively helpful for testing hypotheses about unified versus divided government. Jenkins, Crespin, and Carson (2003), Thorson and Nitzschke (2000), and Van Houweling (2001) argue that final passage votes are highly significant, and that they manifest systematically different forces than earlier votes (see, however, Roberts and Smith 2003). Cox and McCubbins’s (2004) extensive study focuses overwhelmingly on final passage votes, while Wiseman and Wright (2005) and Krehbiel, Meirowitz, and Woon (2005) conform with the convention and consider only votes on final passage (see, however, Krehbiel 2005). The explanation for this academic equivalent of pack ball3 is not entirely clear, but there appear to be two forces at work: a push effect and a pull effect. The push factor is the absence of a theory that has clear implications about how to interpret pre-final-passage roll call voting. For instance, while models of agenda setting (Cox and McCubbins 2004) or pivotal politics (Brady and Volden 1998, Krehbiel 1996) provide necessary and sufficient conditions for changing a status quo policy, they are silent about the procedural 2 Definitions vary, but a common if not modal definition is Rohde’s (http://www.msu.edu/~pipc/pipcdata.htm), which includes votes on adoption of a bill, conference report, a resolution, a joint or concurrent resolution, or passage of any of the aforementioned under suspension of the rules (in the House). 3 Pack ball—sometimes called kids’ soccer—is a game in which unquestioning players unconsciously and oxymoronically preserve a radius-minimizing spread as they occasionally move to, but mostly park at, some random place on the field. 3 choices that initiate the lawmaking process and about the proposal and destination-significant amendment behavior that transpires between procedural and policy choice stages. This fact seems to make empirical researchers skittish about drawing inferences based on roll calls that occur in the early and middle stages of the legislative process. The pull factor is that, in the case of final-passage votes, the nature of the collective choice is simple, relatively well-defined, and binary. Roberts and Smith put it succinctly: “Final passage votes, conference report votes, and veto override votes come very close to pitting a policy alternative against the status quo” (2003, 309). The corresponding attraction of final passage votes may inhere in the presumption that such votes, and only such votes, are clean instances of legislators’ making a well-defined choice between an equilibrium proposal p* and a status quo policy q.4 Whatever the sources of the convention, the selection criterion of final passage is error-prone in each of two ways, summarized in table 1. First, a high proportion of votes are included that should not be, either on conceptual grounds (i.e., they are not really final passage votes) or, more seriously, on grounds that they are not informative with respect to specific research objectives. These are parallels of type I errors in statistics, or false positives in many other contexts. The specifics supporting the claim that a high proportion of final passage votes are false positives stem from each of the three words that make up the term. Final passage votes are often not final, because when a chamber first passes a bill, there is rarely an assurance that it will not revisit the content of the bill later in the process, say, in amendments between the chambers, after a negotiated 4 An added layer of theoretical defense is available, though not commonly cited. In a complete information setting with a finite binary agenda, sophisticated voting (i.e., non-truthful revelation of preferences) may occur at any stage except one: the final vote (McKelvey and Niemi 1987). It can be questioned whether most congressional roll call situations conform with these conditions (Groseclose and Krehbiel 1993). 4 conference committee settlement, or after a presidential veto. A given piece of legislation can therefore be represented in a dataset several times with ostensible votes on final passage.5 Final passage votes also occasionally fail, in which case they are arguably mislabeled and of questionable theory-testing value for theories intended to predict outcomes. The reason, of course, is that in the presumed final matchup between the status quo q and the ostensible equilibrium policy p*, the latter is undeserving of its *. In other words, if the proposed policy does not pass, it evidently is not an equilibrium policy.6 Finally, final passage votes that happen to meet both the final and the passage criteria—the above arguments to the contrary notwithstanding—can also be diluted in value for a number of procedural and behavioral reasons. As a literal matter, the last congressional action on many bills that becomes law is often an act of unanimous consent or a voice vote rather than a recorded roll call vote. While such exclusion is more of a type II problem (see below) than a type I problem, it serves to underscore a closely related problem of a falsepositive sort. Specifically, as in the case of underlying preferences expressed in voice votes, it is well-known that coalition sizes regularly exceed those predicted by formal models. This fact suggests, and anecdotes further support the notion, that late-stage expressions via votes are often more acts of capitulation, rationalization, or idiosyncratic position-taking, than they are compelling indications that the supporting legislators strictly prefer policy state p* to policy state q. In 5 To be somewhat more precise, we regard this as more of a problem in logical consistency and operationalization than a problem of false positives. That is, if one embraces the theoretical argument that only final passage votes can be relied upon as instances of truthful preference revelation (see previous footnote), then the fact that final passage votes as operationalized are often not the final say of legislators is problematic. Acknowledging that it is a subjective matter, we tend not to find the theoretical premise to be compelling (see Krehbiel and Rivers 1990). 6 Bicameralism regularly causes this problem. For example, in “Daschle’s Dead Zone,” the Wall Street Journal (July 22, A12) identifies 11 major pieces of legislation that obtained majority [ostensibly final] votes in the House of Representatives over three-month period in early 2003. Each bill, however, was subject to a Senate filibuster, and none of the bills obtained cloture. Therefore, none of them passed. 5 summary, indiscriminant inclusion of final passage votes almost surely attenuates or biases the inferences that could otherwise be made from late-stage roll calls. [Table 1] The second source of error-proneness under the final-passage criterion is exclusion of significant and informative votes, i.e., errors of a false-negative type. The prevalence and severity of type II errors is to some extent research-specific. Roberts and Smith, for example, reason that “if party structures final passage or other forms of voting, party also should structure amendment voting” (2003: 309). If so, then the exclusion under the final-passage criterion of all votes on amendments guarantees that there will be a high incidence of type II errors. The baby is, in effect, thrown out with the bath water. Viewed in this light, the error-proneness of the final-passage selection criterion is disturbing. Another selection strategy that merits brief attention is to go to the other extreme and include all votes in roll call studies (see, for example, the voluminous body of work by Keith Poole and various coauthors). This approach is an improvement in one respect and a setback in another. Summarized in the second row of table 1, the improvement, of course, is that there are no type II errors (exclusion of useful votes) simply because almost no votes are excluded.7 The setback is that the very high number of idiosyncratic, trivial, and uninformative votes is likely to bias the findings or, at the very least, inject a great deal of noise into the analysis. This aim of this paper is to reduce the error-proneness of existing vote selection conventions by screening roll calls systematically on the basis of a richer set of roll call attributes than is 7 The exception is that unanimous (and near-unanimous) votes are excluded because the provide no (little) information about differences in preferences. 6 conventional. Our premise is that, other things equal, closeness of roll calls is indicative of the strategic significance of roll calls, and attention by media sources, interest groups, and Washington insiders is indicative of the salience of roll calls. We estimate salience and strategic significance using factor analysis and demonstrate the consequences of the resulting selection criteria. Data and Methods The unit of observation in our dataset is a House or a Senate roll call vote. The dataset includes every roll call in the 102nd through 107th Congresses with the exception of approximately ten phantom observations for which a roll call number was assigned in spite of the fact that no votes were actually cast.8 The number of observations is 10,789, with 6,679 supplied by the House of Representatives and 4,110 by the Senate. Data were obtained from vote descriptions from Congressional Quarterly’s website (www.cq.com). CQ’s vote descriptions are much more detailed and reliable than those of the ICPSR and somewhat more detailed than those available from the Library of Congress (www.thomas.gov). We wrote and ran Visual Basic and Perl programs to extract various quantifiable characteristics from the vote descriptions. Text searches in Microsoft Excel and queries in Microsoft Access were used to identify and code various vote attributes. As summarized in the introduction, we are striving for measurement of two somewhat amorphous and possibly related concepts that bear on the desirability of roll calls for analysis: the salience and the strategic significance of voting situations. A single dummy variable indicating where, in the sequence a bill’s history, a roll call occurred may have a bearing on these underlying factors, but it not likely to be the sole defining characteristic of any such standard of desirability. So, for instance, while late-stage or so-called final-passage votes are often important, mid-stage 8 Most commonly, the ephemeral roll calls were quorum calls rescinded before recorded votes were cast. 7 votes on amendments or early-stage votes on cloture or rules are surely not uniformly unimportant. Therefore, they sometimes attract attention of interest groups, the media, or constituents. They are often (though not necessarily) of substantive significance and are regularly (though not always) pivotal in the outcome, even though a near-unanimous majority eventually forms on the final passage vote. Factor analysis is an acceptable method for the combination of objectives, opportunities, and constraints we face. On the positive side, we have a set of vote-attribute variables that a priori are indicators of underlying unmeasured concepts (factors). On the negative side, we have neither an explicit theory nor a corroborated empirical hunch about how the indicators relate to the factors. In between, we are confident in assuming that, over a suitably large number of roll calls, close votes are considerably more likely to be of strategic and substantive significance than near-unanimous votes. Likewise, media coverage, interest-group attention, and/or presidential involvement, for example, are surely reflections of salience of roll calls and possible indicators of strategic substantive, too. Although we are generally comfortable with these conjectures, we are not sufficiently knowledgeable about any such relationship to devise and defend a specific weighting scheme in which we impose the relative degree to which these relationship hold. Exploratory or confirmatory factor analysis is therefore a way to accommodate these strengths and weaknesses. Factor analysis takes a set of (measured) exogenous variables and estimates their effects on a smaller set of common (unmeasured) factors. Formally, it finds q common factors that linearly reconstruct p original variables: y ij = z i1b1 j + z i 2 b2 j + ...z iq bqj + eij , 8 where yij is the value of the ith observation on the jth variable, zik is the ith observation on the kth common factor, bkj is the set of coefficients (called factor loadings), and eij is a residual, i.e., the jth variable’s unique factor (Spearman, 1904). We use several indicators of salience or attention that roll calls attract. Two of these are newspaper-based. Searches of text of the New York Times and the Wall Street Journal were conducted using an online search engine, Factiva. The search required that the word “vote” appear in the headline or lead paragraph.9 All articles meeting these criteria were downloaded and then parsed using a Perl script that identifies indicators of vote tallies.10 Articles were then screened and sorted into two types. False hits consist of those that either did not refer to votes after all, that referred to a vote in the other chamber, or that referred to votes taken more than one week prior to the publication date of the article and, therefore, failed to constitute current coverage. Confirmed hits consist of articles that can be linked specifically to a recent, specific roll call in the House or the Senate. The resulting variables NYT articles and WSJ articles are, therefore, counts of the number of articles in the respective newspapers. The variable CQ key vote is a similar indicator of salience albeit of a distinctively more inside-the-beltway sort. It is simply a dummy variable equaling 1 if the Congressional Quarterly Almanac identified the roll call as one of its approximately 30 “key votes” for the chamberCongress. 9 In addition to raising the salience standard by requiring that the press gives coverage to the roll call as opposed to the congressional action more broadly, this requirement substantially reduces the number of false positive hits on actual congressional roll calls. 10 For example, “#-#” or “# to #” where “#” denotes an integer in [0,435]. 9 The variable AAP vote does the same for the Almanac of American Politics for about half as many votes, while ACU vote codes the 553 roll calls that the American Conservative Union selected on which to base its annual ratings. A final, indicator of insider significance and/or salience of roll calls is President position: a dummy variable equal to 1 if the president took a stance specific to the roll call.11 The more challenging measurement task is to find indicators that allow for the possibility of, and that capture the strategic significance of, votes earlier in the legislative process than final passage and thereby address the false-negative problem, yet that do not over-select earlier, unimportant votes, thereby exacerbating false-positive errors. To do this, we use two measures: mid-stage votes and closeness of vote. We single out mid-stage votes by process of elimination. First omitted are roll calls that have no direct bearing on the passage of a specific law. These nonlegislative votes in the House are overwhelmingly are votes to approve the Journal; in the Senate, they are split between motions to instruct the sergeant-at-arms to request the attendance of absent senators and confirmations of executive appointees. Next-omitted are early-stage votes, which are often bill-specific but which precede actual consideration of the legislation in question. In the Senate, these are votes are votes to invoke cloture; in the House, they are votes pertaining to a resolution (rule) that specifies terms under which a bill subsequently is considered. The final omission is late-stage votes, which is our counterpart to what the literature calls votes on final passage. The remaining votes take on values of 1 in the dummy variable. It is important to clarify that the method of factor analysis does not 11 It bears emphasis that this is roll-call specific. That is, if the president supported passage on, say, omnibus reconciliation, the dummy variable is not 1 for all amendment activity on the budget package. Even so, the coverage of this variable is relatively large: the president took a position on almost 12 percent of the roll calls. 10 dictate that this variable be systematically related to a common factor (i.e., to covary with other indicator variables) but rather merely provides an opportunity to assess whether it seems to have something in common with other indicators (including, possibly, all of the salience indicators). Closeness of vote is the final variable in the analysis. It is intended to reflect the competitiveness of the legislative process at critical junctures of bills’ histories. From an individual legislator’s perspective, we wish to tap the perceived likelihood that a legislator could, by switching her vote, be pivotal on the given roll call. We do this as a function of the aggregate realized vote outcome as follows. Let y and n, respectively, be the number of yea and nay voters on a given roll call. Let q be the proportion of votes required for passage of the motion in question, e.g., q = .667 for a veto override or suspension of the rules, q = 3/5 for invoking cloture12 or passing a motion on the House Corrections Calendar, and q = (n+1)/2 for all simple majority motions. Finally, let v be the minimum y such that y ≥ q , that is, the size of a minimum winning coalition given the type y+n of roll call and parliamentary situation in which it arises. Then a measure of closeness that has useful comparability properties both across roll calls with different passage thresholds and across chambers is: ⎛ 1 ⎞⎛ | y − v | ⎞ ⎟⎟⎜⎜ ⎟⎟ . 1 − ⎜⎜ ⎝ 1 − q ⎠⎝ y + n ⎠ When a minimum winning coalition forms and the vote is, by definition, as close as possible (subject to a possibly-needed but trivial even/odd assumption), the right-most term is (almost) 0 in 12 Technically, q is strictly 60 votes even if fewer than 100 senators vote. We account for this properly in the implementation. For purposes of introducing and understanding the logic of the measure, assume that every senator votes. 11 which case the measure takes on its maximum value of (almost) 1. In contrast, when a maximallyoversized, unanimous coalition is formed, the difference between actual and needed votes, |y -v|, is such that the resulting measure is 0.13 Findings The factor analysis is based on the eight variables describe above. Although, by default, it generates loadings for four factors, the drop off in eigenvalues (measures of variation accounted for) between the second and third factors is precipitous, indicating that the third and fourth factors are essentially random noise. Table 2, therefore, presents the factor loadings only for factors 1 and 2. Somewhat to our surprise in light of the inherently coarse nature of most of the righthand-side variables, a sensible pattern emerges. All six of the salience-intended indicators load well on the first factor (but not the second), while the closeness of vote and mid-stage vote measures represent significant common variation in the second factor (while sharing relatively little variance with factor 1). [Table 2] 13 Due to the scale factor, 1 / (1 - q), the measure is normalized between 0 and 1 for y > v and, for simple-majority votes (q = .5), normalized also for y < v. For different types of supermajority votes (q’ > .5), however, y = 0 will produce different negative values. This is not problematic for two reasons. First the measure is still well-behaved in the sense that, for a given number of voters, the farther is y from the winning threshold, the lower is the roll call’s closeness value. There is nothing special or problematic about the possibility that roll calls requiring a supermajority may take on negative values. Second, only 24 of 10,789 votes produce negative closeness values. One example is vote 511 in the first session of the 106th Congress in the House: a motion to suspend the rules and pass the bill to adopt a Clinton-endorsed measure to increase taxes and user fees by $19.2 billion, to nearly triple the tobacco excise tax to $8 billion, to increase food inspection fees on poultry, livestock, and eggs by $504 million and forest service fees by $20 million, to increase a number of fees related to transportation by about $3 billion. The motion was rejected 0-419 and earned a record low closeness measure of -1. 12 The empirical finding appears to be very robust. In addition to the iterated principal components results presented in the table, we also estimated loadings using more common principle-component factor analysis, and garden-variety (non-iterated) principal components. The differences in these methods are statistically arcane and, in our case at least, of no practical consequence. Regardless of the type of estimation, and regardless of the subsequent method of rotation—varimax (orthogonal), oblique (promax), or none—a clean, two-factor pattern of estimates, such as that shown in table 2, results. We are therefore quite confident that the statistical pattern represents a genuine empirical regularity. Even so, there is an element of interpretive art that goes with the supposed science. The interpretation we give to factor 1 is relatively straightforward and, we expect, non-controversial. Each of the six factor loadings that are clearly nonzero is positive and pertains to coverage of, or special attention given to, a roll call. It therefore seems defensible to label this factor salience. The interpretation of factor 2 is more challenging but not impossible. Notice, first, that none of the designated salience indicators bears much relationship to the second factor; their loadings are all near zero.14 Therefore, votes with high factor scores on this dimension will not be systematically salient and will presumably capture more of the inside-the-beltway component of strategic significance than the more public electoral-connection game present in factor 1. Notice, second, that the two variables with large positive loadings, while substantively quite different from one another, can be related plausibly in a legislative context. The uncovered factor is consistent with the intuition expressed in the introduction that, perhaps often, strategically critical votes (as indicated by closeness here) occur in the middle of legislation’s multi-stage journey through the 14 Because the results presented are not rotated, the two factors are orthogonal by construction. However, even when rotated without an orthogonality constraint (promax rotation), the correlation between factors 1 and 2 is -.0055. 13 process, even though they may tend figuratively to fly under the radar of broader public scrutiny. Likewise, the second factor also fuels suspicions that focusing too much on final-passage or latestage votes—which tend not to be as close15—probably ignores a lot of outcome-consequential legislative voting behavior. Via this interpretation, it seems defensible to label the second factor strategic significance of the vote. This labeling, however, comes with the caveat that salient votes as defined by factor 1 are not necessarily insignificant. With these findings, it is a straightforward matter to define alternative selection mechanisms for roll calls which potentially reduce the incidence of both type I and type II errors. We simply use the factor loadings with the data to produce factor scores F1 and F2 as linear functions of the data, just as one would compute predicted values of a dependent variable from, say, coefficient estimates in a linear regression equation.16 This process yields two continuous-variable factor scores, F1 = salience and F2 = strategic significance. Finally, factor scores can be dichotomized at various arbitrary points and used as selection criteria instead of the relatively crude final-passage criterion.17 Consequences Acknowledging the possibility that this process may be much ado about nothing, we wish finally to investigate whether different roll call selection methods lead to different substantive inferences. Our application consists of reconsidering roll rates, i.e., the proportion of times a majority of one party is on the losing side of a winning coalition. The objective is not to reiterate 15 The average closeness of mid-stage votes (.64) is twice that of late-stage votes (.32). 16 Typically, factor analysis software rescales the loadings so that the resulting factor scores are normalized with mean zero and variance each approximately equal to 1 . 17 More sophisticated applications could weight observations by the factors. We shall not pursue this possibility here. 14 or revisit the ongoing dialogue about whether such measures represent what their proponents purport (Cox and McCubbins 2004, Krehbiel 2005). Rather, we focus simply on the consequences of applying roll rate measures to different subsets of roll call votes. Table 3 confirms that the consequences can be substantial. Column 1 summarizes the finding in the seminal work on roll rates, which is based on final passage votes in the House from the 45105th Congresses. Among these 3,134 roll calls, the minority party was rolled somewhat more than one-fourth of the time, while the majority party was rolled fewer than two times in 100—a striking finding that serves as the basis for strong claims about majority party dominance. [Table 3] Because in the present study, we use less data, more recent data, and data from the Senate as well as the House, comparability with the Cox-McCubbins benchmark rates might be a concern. Column 2 shows that this is not a problem. Employing our close-cousin criterion for final-passage votes—late-stage votes—the measured rates remain in the same ballpark. If anything, our late- stage criterion supports an even stronger claim than those in the earlier works, since the column-2 ratio of minority to majority party rolls exceeds 20:1.18 The consequences of the alternative selection criteria are shown first in column 3, which calculates roll rates and the roll ratio using our entire dataset. The proportion of minority party rolls goes up slightly, the proportion of majority party rolls increases five-fold, and the roll ratio, correspondingly, goes down by a factor of four. While this change from the final-passage 18 An Appendix [in progress] explores some discrepancies between columns 1 and 2 by comparing reports of rolls in the chamber (House) and congresses (102-105) in which out data overlaps with Cox and McCubbins’s. 15 benchmark is large, the indiscriminate inclusiveness of this criterion undoubtedly admits literally thousands of type II errors. The more important findings are those criteria in columns 4 and 5 that use the factor-based selection criteria. To make the Ns in the set of selected votes approximately equal to those in the baseline columns, we selected the top quintile of our derived measures of salience and strategic significance and then calculated roll rates for these two sets of approximately 2200 votes. The consequences are nearly identical using either criterion, even though they are, by construction, uncorrelated and therefore reflect different notions of importance in roll calls. Specifically, the minority roll rate rises again —now to at least 1 in 3. The majority roll rate, meanwhile, increases much more dramatically to roughly 10 times what it was in the set consisting of only late-stage votes in column 2. These effects combined reduce the roll ratio to 3.7 for each criterion. Finally, column 6 imposes a more stringent filter, requiring that the roll calls be in the top quintiles of both of the new criteria. The result is as expected based on the current analysis but probably not as expected based only on prior analyses that uses the final-passage criterion. The majority party now is rolled roughly 1 in 6 votes, and the roll ratio dips below 2:1. The comparison between columns 1 and 6 is dramatic. Selection criteria for roll call votes appear to matter greatly. Summary We question the convention of heavy reliance on final passage votes for roll call analysis on grounds that (1) often, such votes are unimportant, and (2) many non-final-passage votes are important. Defining importance and operationalizing a corresponding selection mechanism, however, is challenging in the absence of a theory of roll call importance. We addressed this challenge with what might be described as a method that lets the data speak for itself, making 16 minimal and theory-neutral assumptions about likely covariates of salience and strategic significance of roll calls and using factor analysis to uncover the specifics about these relationships. Our tentative conclusion is that, relative to the final-passage criterion, employment of our salience and strategic-significance criteria dramatically reduces type I errors by omitting finalpassage votes that are not salient and are of little strategic importance. Type II errors (false negatives) are also diminished by allowing many otherwise-excluded high-scoring pre-finalpassage votes into the selection pool. Still unknown, however, is the degree to which our method’s inclusion of close mid-stage votes opens the door to another kind of type I errors, such as the inclusion of close but frivolous amendments. We doubt whether this is a regular occurrence in the salience scores inasmuch as such roll calls tend not to be covered by major newspapers, selected by interest groups, designated as “key votes,” etc. However, if such roll calls are close, they probably pass through the current strategic significance filter. We plan to assess the extent of this problem in future work and to finetune the factor analysis by adding other indicator variables that discriminate more finely between roll calls. 17 References Brady, David and Craig Volden. 1998. Revolving Gridlock. Westview Press. Cox, Gary and Mathew McCubbins. 2004. Setting the Agenda. Manuscript: UCSD. Groseclose, Tim and Keith Krehbiel. 1993. The Pervasiveness of Sophisticated Sincerity (with Timothy Groseclose). In Political Economy: Institutions, Information, Competition and Representation. W. Barnett, M. Hinich, H. Rosenthal, and N.Schofield, eds., New York: Cambridge University Press, (1993) 247-78. Jenkins, Jeffrey, Michael Crespin, and Jamie Carson. 2003. Parties as Procedural Coalitions in Congress: Evidence from a Natural Experiment. Manuscript: Northwestern. Krehbiel, Keith. 1996. Institutional and Partisan Sources of Gridlock: A Theory of Divided and Unified Government. Journal of Theoretical Politics, 8: 7-40. Krehbiel, Keith. 2005. Macro Politics and Micro Models: Cartels and Pivots Reconsidered. In The Macro Politics of Congress, eds E. Scott Adler and John Lapinski. Princeton: Princeton University Press. Krehbiel, Keith, Adam Meirowitz, and Jonathan Woon. 2005. Testing Theories of Lawmaking (with). In Social Choice and Strategic Decisions: Essays in Honor of Jeffrey S. Banks, eds. David Austen-Smith and John Duggan, Berlin: Springer (pp. 249-268, 2005). Krehbiel, Keith, and Douglas Rivers. 1990. Sophisticated Voting in Congress: A Reconsideration. Journal of Politics, 52: 548-78. Krehbiel, Keith. 2005. Partisan Roll Rates in a Nonpartisan Legislature. Journal of Law, Economics and Organization forthcoming. 18 McKelvey, Richard and Richard Niemi. 1978. A Multistage Game Representation of Sophisticated Voting for Binary Procedures. Journal of Economic Theory. 18 (June): 1-22. Poole, Keith and Howard Rosenthal. 1997. Congress: A Political-Economic History of Roll Call Voting. New York: Oxford University Press. Roberts, Jason and Steven Smith. 2003. Procedural Contexts, Party Strategy, and Conditional Party Voting in the U.S. House of Representatives. American Journal of Political Science. Spearman, Charles. 1904. General Intelligence Objectively Determined and Measured. American Journal of Psychology. 15, 201-293. Thorson, Gregory and Tasina Nitzschke. 2000. Politics and Policy in the 103rd and 104th Congresses: Evaluating the Effects of Divided Government in the Postreform Era. Congress and the Presidency. Thorson, Gregory R., Divided Government and the Passage of Partisan Legislation, 1947-1990 Political Research Quarterly 51 (3): 751-64. Van Houweling, Robert. 2001. Legislators’ personal policy preferences and partisan legislative organization: special rules in the post-Reform House, Manuscript: Berkeley. Wiseman, Alan and John R. Wright. 2005. The Legislative Median and Partisan Policy Manuscript: Ohio State. 19 Table 1. Error-proneness in roll call selection Type I errors Inclusion of undesirable votes False positives Type II errors Exclusion of desirable votes False negatives High High Very High 0 Selection Criterion Final passage votes All votes Table 2. Factor analysis Variables Factor Loadings 1 2 NYT articles WSJ articles CQ key vote President position ACU vote AAP vote Closeness of vote Mid-stage vote 0.476 0.298 0.484 0.383 0.412 0.405 0.221 -0.135 -0.086 -0.071 -0.034 -0.045 0.108 0.008 0.591 0.611 *Factors are orthogonal and unrotated. Eigenvalues for factors 1-4: 1.097, .749, .136, .030. Factors 3 and 4 omitted. 20 Table 3. Roll rates under different vote-selection criteria 1 Rolls 2 Passage 3 Late-stage All 4 5 6 Salient Strategically Significant Both Minority 0.259 0.219 0.256 0.334 0.385 0.305 Majority 0.016 0.010 0.049 0.091 0.104 0.176 16.6 21.9 5.2 3.7 3.7 1.7 3,134 2,932 10,789 2,159 2,159 364 Ratio N *Column 1 data are as reported in Cox and McCubbins (2004, 175) and on the House of Representatives from the 45-105th Congresses. All other columns summarize both House and Senate data from the 102-107th Congresses. 21
© Copyright 2026 Paperzz