The Hunt for Party Discipline In CongREss#

 

Nolan McCarty*, Keith T. Poole**, and Howard Rosenthal***

2 October 2000

 

Abstract

This paper analyzes party discipline in the House of Representatives between 1947 and 1998.  The effects of party pressures can be represented in a spatial model by allowing each party to have its own cutting line on roll call votes.  Adding a second cutting line makes, at best, a marginal improvement over the standard, single-line model.  Analysis of legislators who switch parties shows, however, that party discipline is manifest in the location of the legislator's ideal point.  In contrast to our approach, we find that the Snyder-Groseclose method of estimating the influence of party discipline is biased toward exaggerating party effects.

 

 

 


THe HUNT FOR PARTY DISCIPLINE IN CONGRESS

Introduction

The past several years have seen renewed scholarly investigation of how political parties and their leaders influence legislative institutions and behavior (see Aldrich 1995, Cox and McCubbins 1993, Rohde 1991, and Sinclair 1995).  Much of this contemporary research investigates how parties solve the collective action problems that are inherent in the legislative and electoral processes.  Cox and McCubbins, who conceptualize political parties as “cartels” that direct legislative activity to enhance the collective electoral fortunes of their members, provide a typical variant on this theme, but by no means the only one.  The primary function of a cartel is to build a collective reputation for its members to run under.  They argue, however, that without strong leadership members have individual incentives to engage in legislative activities (such as pork) that diminish the collective reputation.

This focus on the problems of collective action has generated much interest in the cohesiveness of parties as floor coalitions.[1]  The principal prediction is that a party produces more cohesive coalitions of its members than would be possible if the members were to act on the basis of their individual preferences.  Rohde (1991) uses evidence of the increase in party cohesion since 1975 to demonstrate an increasing role of party in the post-reform House.  Aldrich, Berger, and Rohde (1999) also use party voting as the main dependent variable to test the predictions of “conditional party government,” and Cox and McCubbins (1993) use member support on leadership votes to test for the role of leaders in creating voting coalitions.  Furthermore, some scholars, including Rohde (1991), see a reassertion of party strength behind the increased cohesiveness and polarization of congressional parties since the mid-1970s (for alternative explanations, see Poole and Rosenthal 1984; McCarty, Poole, and Rosenthal 1997; and King 1998).

As Krehbiel (1993, 1998) points out, however, these empirical studies of party voting suffer from the problem that the patterns of behavior that have been uncovered are consistent with both theories of strong, influential parties and non-partisan models where member preferences are sorted along party lines.  This dilemma is exacerbated by the problem of measuring legislative preferences.  Ideally, one would like some exogenous measure of these preferences to test party theories.  Voting behavior under the null hypothesis of no party influence could then be compared with actual voting behavior.  The problem is that our usual measures of legislative preferences are derived from the voting behavior itself.

In this paper, we attempt to untie this Gordian knot.  We begin by reviewing the evidence from work that analyzes congressional roll call votes under the maintained hypothesis of sincere spatial voting.  We discuss how the evidence from this work in fact suggests the presence of some party discipline.  We next develop the party discipline model of Snyder and Groseclose (2000).  We argue that their estimation method both seriously biases the estimate of ideal points for ideological moderates and overestimates the extent of party discipline.  In the text, we provide a compelling theoretical illustration of the bias.  We detail our case further in Appendix B.

To assess party discipline (a.k.a. pressure) properly, we propose an alternative approach. The basic idea is very simple.  We start with Krehbiel’s (1993, 1998) proposal that the spatial model of purely preference-based voting is the appropriate benchmark for evaluation of models that incorporate party effects.  In one-dimension, the spatial model asserts that, on each roll call, “Yea” and “Nay” voters are separated by a cutpoint on the liberal-conservative continuum.  Now assume that the Republicans apply “pressure” to their membership.  This will cause some moderate Republicans to the left of the “sincere” cutpoint to vote with the conservative wing of the party.  Republicans will have a cutpoint to the left of the sincere cutpoint.  Similarly, if the Democrats apply pressure, the cutpoint for Democrats will be to the right of the sincere cutpoint.  That is, when one or both parties apply pressure, the voting patterns should look as if there were separate cutpoints for each party, with the Democrat cutpoint being to the right of the Republican cutpoint.  Consequently, if pressure is important, we should find a better fit to the data when we estimate two cutpoints than when we estimate a single cutpoint.

To keep the analysis as simple as possible, we use non-parametric optimal classification analysis where legislator ideal points and roll call cutpoints are jointly rank ordered to maximize predictive success on roll call votes (Poole, 2000).  By classifying the voting of each party independently and then comparing the results to classifying both parties together, we can evaluate the maximum possible improvement in correct classification attributable to party discipline. An advantage of the cutpoint approach is that it does not require any assumptions about which specific roll calls are subject to party pressure.  A particular advantage of the non-parametric approach is that it assumes only that the amount of pressure applied to individual members does not change the order of their induced ideal points.  It does not require making parametric assumptions about how pressure varies with the ideal point of the individual member, such as equal pressure being applied to all.  On the basis of our optimal classification analysis, we conclude that allowing for party discipline affords only a very marginal improvement over the sincere spatial model, particularly in recent Congresses.

Where, then, is party discipline?  We argue that the main influence of party discipline is not on the votes on specific roll calls but on the choice of ideal point made by the representative.  The smoking gun is provided by the great changes in ideological position demonstrated by those few legislators who have switched parties.  Wayne Morse and Strom Thurmond are two well-known examples in the post-war Senate.  The Democrats who defected to the Republicans after the 1994 election made equally dramatic shifts.  Our finding that parties shape ideal points ends our hunt for party discipline in roll call voting.

Independent Voting on the Floor: The Evidence from the Spatial Model

The well-known standard spatial model provides a benchmark approach to independent floor voting.  Poole and Rosenthal (1991, 1997) demonstrated that the spatial model is quite successful in accounting for floor decisions.  With two dimensions, one can correctly predict roughly 85% of the individual decisions -- even on close roll calls -- for the period 1789-1985.  McCarty, Poole, and Rosenthal (1997, p. 7) report additional results for the period 1947-1995.[2]  In recent Congresses, a one-dimensional model classifies nearly 90% of the individual decisions (see figure 5).

The spatial estimates present a strong suggestion that party influence underpins much of this remarkable classification success:[3]

n       In Congresses where voting is largely one-dimensional, party-line votes are along the main dimension.  The distribution of ideal points is strongly bimodal.  The two parties appear as two very distinct “clouds”.  The clouds, particularly in recent years, barely overlap. [As an illustration, see McCarty, Poole, and Rosenthal (1997, p. 11).]  The presence of a “channel” between the clouds suggests that party affiliation may discipline the roll call voting behavior of members.  The main dimension of political conflict clearly appears to reflect partisan conflict.  Parties perhaps also influence their members’ votes on specific roll calls.

n       In Congresses where voting is two-dimensional, there are also two distinct clouds separated by a channel.  Party-line votes are no longer on the main dimension, but a blend of the first and second dimensions.   [See Poole and Rosenthal, 1991, p. 233 or 1997, p. 44 for an example.]  An interpretation of such plots is that ideal points projected onto roughly a 45° line represent the ideological (liberal-conservative) dimension.  The orthogonal projection, roughly at –45°, represents a party loyalty or valence dimension.  Most votes occur along the main, 0° dimension.  On these votes, the legislator’s decision depends both on ideology and on party loyalty.

              Although this evidence shows that the structure of voting coalitions in Congress coincides strongly with party affiliation, it does not prove that party per se has any influence on voting behavior.  Party-line voting is, of course, consistent with both strong party models and ideological models where preferences are sorted by parties.  In the sections that follow, we review a recent attempt to separate partisan effects from preferences and propose a method of our own.

 

The Snyder-Groseclose Model of Party Discipline

One of the inherent problems in identifying the effects of party is that we observe only behavior, which is presumably a mix of individual preferences and party influence.  This problem is particularly acute with congressional voting data.  If party discipline is exercised on floor votes, the ideal points estimated on the assumption of independent spatial voting might be very biased estimates of legislator preferences.  If party influences these estimates, it is inappropriate to use them as controls for preferences when testing for a party effect.  Snyder and Groseclose (2000) noted this potential for bias.  They proposed both a method for first estimating unbiased ideal points and then for using the unbiased ideal points to estimate the effect of party discipline.[4]

The basics of the one-dimensional Snyder-Groseclose model are as follows:

n       On roll call j, a legislator i, if a Republican, has induced ideal point xij = xi

n       On roll call j, a legislator i, if a Democrat, has induced ideal point xij = xi + gj

In other words, the true ideal points of the Democrats, the xi, are displaced by the amount of party pressure given by gj.[5]  It turns out that only the relative amount of party pressure matters in the model, so the ideal points of the Republicans can just be given by their true values.  For the difference in pressure to be consistent with discipline, we would expect that pressure must move Democrats in a liberal direction relative to Republicans.  Thus, pressure works to increase the separation of the parties.  If preferences are scales such that left is liberal and right is conservative, then we would expect g to have a negative sign  

Snyder and Groseclose argue that, because there would be little need to apply party discipline on votes not expected to be close, ideological position-taking could occur on lopsided votes.  These votes, for example, those with margins over 65-35, could be used to estimate the true ideal points.  On these votes, the gj would be zero.  The true ideal points could then be used to estimate the gj on close votes, say those with margins less than 65-35.

In brief, their procedure is:

Stage 1.  Use votes with margins greater than 65-35 to estimate the ideal points, xi.

Stage 2.  On the remaining votes, for each roll call j, estimate the following OLS (ordinary least squares) model:

                                                                                                   (1)

where Di = 1, if legislator i is a Democrat, = 0 if Republican; and Yij = 1, if i votes Yea, = 0 if i votes Nay.  In the Appendix A, we show that if the underlying spatial utility is quadratic, then

.                                                                      (2)

As we noted above, the party pressure model predicts a negative estimate for g when preferences are scaled with Democrats on the left (as we assume they are).  Consequently, the two estimated  b’s should be of opposite sign.

 

Why the Method Overestimates Party Discipline

This method is likely to generate the inference that party pressure is substantial even when all voting is preference based.  Consider, for example, a six-member legislature with the party affiliations and spatial preferences given in figure 1.  If all voting in this legislature is spatial without error, there are only 12 possible voting configurations, which are given in figure 2.

Stage 1 of the Snyder-Groseclose method estimates a preference score using only voting patterns 1-10.  But, since voters 3 and 4 cast identical votes in each of these patterns, any scaling procedure will estimate voters 3 and 4 as having the same position.  Thus stage 1 provides biased estimates of the preferences of moderates.  There is not enough information in the lopsided votes to discriminate “left” moderates from “right” moderates.  The preference ordering that maximizes the classification of votes is shown in figure 3.

In stage 2, the preferences in figure 3 and party affiliation are used to explain vote patterns 11 and 12.  The votes of legislators 1, 2, 5, and 6 are correctly classified on the basis of the preference estimates, but the votes of legislators 3 and 4 cannot be.  However, since 3 and 4 are members of different parties, adding party to the model increases its explanatory power even though voting is purely preference driven.

Our example extends naturally to larger legislatures.  In general with perfect spatial voting, a first stage based only on lopsided votes will produce identical preference estimates of all members in the interval between the 35th and 65th percentiles.  The second stage will therefore produce a spurious party effect so long as party and ideology are correlated within this interval.   Given the assumptions of no voting error and no overlap of preferences between the parties, this example is somewhat special.  However, in Appendix B, we present Monte Carlo evidence that shows how this result extends to large legislatures where, as in the Snyder and Groseclose approach, there is some error in voting and the distributions of preferences of each party overlaps.  We now present an alternative procedure that maintains the essential features of their model of discipline.

A Non-Parametric Model

All specifications of a spatial model of voting have two critical elements: ideal points for the legislators and cutpoints (or separating hyperplanes) for the roll calls.  The Snyder-Groseclose model, with a discipline parameter to each roll call, is isomorphic with one where each party has its own cutting line.  (See Appendix A.) That is, moving the ideal points for all Democrats to the left by a magnitude gj  is equivalent to moving the cutpoint for Democrats to the right by the same amount.   Party discipline generally involves getting moderates to vote with extremists.[6]  Consequently, if there is party discipline, the cutpoint for the Democrats should be to the right of the cutpoint for Republicans.[7]

Consider a one-dimensional spatial configuration.  If the cutpoint is constrained to be the same for both parties, this produces the standard spatial model.  For example, in figure 4, with a common cutpoint, there are three classification errors, legislators 3, 11, and 15.  When each party can have its own cutpoint, this produces a model that allows for party discipline.  Moderate Democrats to the right of some Republicans can vote with the majority of their party.  Moderate Republicans to the left of some Democrats can vote with the majority of their party.  The best cutpoint for the Republicans in figure 4 remains the common cutpoint.  Legislator 15 is the only R classification error.  But the best cutpoint for the Democrats is to the right of the common cutpoint.  The D cutpoint leaves only legislator 3 as a classification error for this party.  Rather than estimate either the one-cutpoint model or the two-point model via a metric technique, such as Poole and Rosenthal's (1991) NOMINATE or Heckman and Snyder's (1997) method, one can simply find the joint rank order of legislators and cutpoints that minimizes classification error.  Poole (2000) presents an efficient algorithm that very closely approximates the global maximum in correct classification.[8]  Note that this method, in contrast to equation (1), does not require a uniform adjustment in the ideal points of all members of a party.  Only moderates would need to be disciplined.  All that is required is a displacement of the cutpoint.

We now turn to our empirical analysis.  This analysis involves both the testing of implications of our methodological critique of Snyder and Groseclose as well as testing the implications of their model of party discipline utilizing our two cutpoint model.  We begin by presenting evidence on three methodological predictions.  In each case, these predictions are  consistent with the mismeasurement of preferences in the Snyder-Groseclose framework under the hypothesis of purely spatial voting.  In only one of the cases is the prediction also consistent with their theoretical model.  Therefore, verification of these relationships illustrates the inability of Snyder-Groseclose to distinguish party pressure from mismeasurement of preferences.  The three methodological predictions are as follows:

M1.           Estimate the rank order of ideal points by one-dimensional optimal classification first using all roll call votes and then using only lopsided votes.  The correlation of the all votes rank orders and the lopsided votes rank orders will be greater for extremists (the first and last thirds of the all votes distribution) than for moderates (the middle third).  While this prediction is consistent with the Snyder-Groseclose assertion that party pressure primarily affects moderates, it also follows from our claim that, if there is preference-based voting, ideal points of moderates will be inaccurately recovered if only lopsided votes are used to estimate ideal points.

M2.           Similarly, when the rank order is estimated first on all roll call votes and second on only close votes, the correlation of the all votes ranks and the close ranks will be greater for moderates than for extremists.  The motivation for this prediction is similar to that of the first.  If there is preference-based voting, the ideal points of extremists will be inaccurately recovered if only close votes are used to estimate ideal points.  This prediction is inconsistent with Snyder and Groseclose as they implicitly assume that extremists will have similar preference estimates on pressured votes as they do on unpressured votes.

M3.           The close-all correlations for moderates will be high if there is preference-based voting, lower if there is party discipline.  The reason is that, if there is discipline only on close votes as claimed by Snyder and Groseclose, all votes estimates will mix preference-based lopsided votes and disciplined close votes.  The close votes estimates will have more distortion of the true ideal points.

After these tests concerning the effects of Snyder and Groseclose’s procedure on ideal point estimates, we turn to testing hypotheses from the party discipline model.  In all cases, the null model of preference-based voting predicts no difference.

H1.            Classification should be substantially higher with a two-point model than with a one-point model.  Note that classification cannot be lower with the two-point model.

H2.            The improvements in classification should be greater on close votes.  Since Snyder-Groseclose predict that rational parties will whip close votes, the incremental predictive power of the two-cutpoint model should be accordingly higher on those votes.

H3.            The rank order of the legislators should disclose more separation of the parties in the one-point model than in the two-point model.  The reason is that the one-point model ignores party discipline.  Moving Democrats to the left and Republicans to the right should pick up some of the effects of party pressure.  In contrast, in the two-point model, each legislator's ideal point can take on its true rank order position, because the cutpoints can pick up the effects of party discipline.

H4.            The separation of the cutpoints should be greater on close votes.  The identifying assumption of the Snyder-Groseclose model is that party pressures are more likely on close votes.  Therefore, under their assumptions, the distance between the Democratic and the Republican cutpoint should be greatest on those votes.

H5.            The estimated cutpoint for the Democrats should be to the right of the estimated cutpoint for the Republicans.

A Caveat

Some instances of party pressure may be masked.  Consider a legislature with no party overlap.  All Democrats are to the left of all Republicans.  Suppose that, were there no pressure, a Republican Party bill would be rejected by a majority composed of all Democrats and some moderate Republican defectors.  If the Republicans then apply “pressure” to the defectors, resulting in a party-line vote, the vote will still appear to be a vote consistent with preference-based voting.  Thus when ideal points are estimated correctly, the true explanatory power of party may be masked.  In fact, when there is no overlap in the distribution of party ideal points and there is errorless spatial voting, it is impossible to identify party pressure effects.

This masking of party pressure is inherent to spatial analysis.  It would confound a correct Snyder-Groseclose analysis as well as our optimal classification method.  Albeit important, the question we can ask is limited to “Can allowing for party discipline improve on the classification of a purely preference-based model?”[9]

With our optimal classification method, it is possible to calculate an upper bound for the amount that party pressure can increase vote classification, roughly as a function of the overlap between the two parties.  This upper-bound represents the classification on a strict party line vote of a two-cutpoint model (perfect classification) minus the classification of a strict party line vote using a single cutpoint.[10]  When there is no overlap between the parties, a single cutpoint correctly classifies a party line vote so as noted above their can be no classification gain for the two cutpoint model.  However, the greater the party overlap, the worse a one-cutpoint model does in explaining a strict party-line vote.  Thus, the maximum classification gain increases in the overlap.  If we use the configurations of preferences that emerge from optimal one-cutpoint classification to measure overlap, the maximum classification gain from party-pressure consistent cutpoints (i.e. D>R) ranges from 0 in the 80th House (where there is zero overlap) to 16% in the 92nd House.  The average upper bound over all of the congresses we analyze is 5%.  However, it is important to remember that these upper bounds are simply for roll calls consistent with party pressure (i.e. Democratic cutpoint to the right of the Republican cutpoint).  Perfect classification is the upper bound if we allow other cutpoint configurations (e.g the Republican cutpoint on the right).  Secondly, as we discuss below, optimal classification with a single cutpoint will underestimate party overlap which would lead to the underestimation of these upper-bounds.

 

Tests Using the Non-Parametric Model

One Dimensional Classification

We begin with the three predictions concerning correlations of ideal points.  In order to show that the pattern of estimates we expected would arise in actual data, we first performed optimal one-point classification using all the roll call votes in each House from the 80th through the 105th.  If the basic spatial model is correct, this procedure should produce a rank order of legislator ideal points that is very close to the true order.  Next, we did optimal classification using only lopsided votes, those with greater than 65-35 margins.  Finally, we did optimal classification using only close votes, those with margins of 65-35 or less.

We then computed Spearman rank order correlations between the lopsided vote rank orders and the all votes rank orders for left-wingers, the one-third of the legislators furthest to the left in the all votes classification; moderates, the middle one-third; and right-wingers, the one-third furthest to the right.  We would expect these correlations to be high for the left-wingers and right-wingers but low for the moderates because the lopsided votes provide little information about the ideal points of moderates (M1).  Conversely, when correlations are made between close  votes rank orders and all votes rank orders, we expect the correlations to be high for moderates but low for left-wingers and right-wingers.[11]

 

Insert Table 1 about here

 

 

The hypothesized patterns occur, as shown in Table 1.  Indeed, for the lopsided-all comparison, in every post-war House but one, the middle one-third correlation is lower than both that for the left-wingers and that for the right-wingers.[12]  Table 1 indicates that the middle correlation is particularly low in the period preceding the passage of the major civil rights bills of the 1960s.  In this period, there was an important second dimension (Poole and Rosenthal, 1997) that confounds the recovery of moderate positions on the first dimension.  When the second dimension vanishes, even the middle correlations are reasonably high because the “errors” in voting provide some information about moderates.  That is, for example, a relatively liberal moderate is still less likely to vote with the right-wingers than is a relatively conservative moderate, even on lopsided votes.  Nonetheless, as predicted by M2, correlations for moderates are lower than for extremists.[13]

As predicted, these results reverse for the close-all comparison.  The moderates always produce a correlation above 0.9.  The left-winger and right-winger correlations are always below 0.9, usually much below, and in one case, the correlation is negative.

The close-all correlations for moderates are strikingly high, predicted by preference-based voting but not by voting subject to party discipline (M3).  If the party discipline effect were important, we would expect lower rank order correlations, particularly for Houses before 1980, when there was still considerable overlap in the ideal point distributions of the two parties.

Classification with Two Cutpoints

In this section, we assess the ability of a party discipline model to improve on a preference-based model.  Our criterion is percentage of votes correctly classified.

To find the highest classification possible for a party discipline model, there is a simple solution: just classify each party separately.  This allows the cutpoint on each roll call to adjust to pressures internal to the party.  Because the cutpoints can adjust, one will find the true intra-party rank order of the ideal points.  The classification from this model can be compared to a single cutpoint model.

The results of this exercise appear in figure 5, which shows results for one-, two-, and six-dimensional models.  We used a six-dimensional model to parallel the high dimensionality used by Snyder and Groseclose in their empirical work.  With multiple dimensions, the cutpoint is replaced with a separating hyper-plane.

In one dimension, it is apparent that a two-party model adds little, particularly in recent Congresses.  The improvement in the earlier Houses is at the level that results when a two-dimensional model with one cutting line is used.  In one dim