6.16.2008

A Refinement to the Adjustment, Part II

The principal criticism of the Trend Adjustment that I introduced on Saturday is that it assumed that the trend was uniform across all states. Even if we can demonstrate that Barack Obama has gained, say, 3 points in his polling on average, and even if that average was taken across a fairly robust group of state and national polls, it might not hold that the bounce would be felt the same in Utah as it might be in Massachusetts.

I agree entirely with this criticism in theory. I would also argue that it is probably better to assume a uniform trend than no trend at all. The polling has become dense enough (particuarly if we include national polls) that we're getting a pretty fair mix of state and national polls in any given week. It is unlikely that Obama could improve his position in say 10 out of 12 state polls, and 5 out of 6 national polls, without his also being likely to have improved his position in other states that weren't polled during this period.

Nevertheless, it would clearly be best if we could have our cake and eat it too: adjust for the most recent trends (in a somewhat cautious way) without having to take some of the state-by-state specificity out of our model. I think I've developed a reasonable way to accomplsih that.

The basic way that we developed the trend estimator was to express each polling result as a combination of two dummy variables, one representing the state/pollster combination (e.g. "Quinnipac-Florida" or "Zogby-Delaware") and the other the week in which the poll was conducted. Each poll in our database can thus can be expressed in the form of a regression equation:



...'Margin' represents the polling result (Obama's total less McCain's), whereas the squiggly little 'e' you see is a term denoting the residual error/uncertainty. Technically speaking, there are coefficient terms on the two dummy variables, though over the long run, these coefficients will by definition equal one. Likewise, the error term will definitionally equal zero over the long run. However, just because the coefficients equal one on average does not mean that they do so in every single case. Another way to express our regression would be to embed the uncertainty term in the time-trend dummy, as follows:



In this equation, m represents a multiplier on the weekly trend variable. It is trivial to solve for m.



In a state which is more impacted by a time-defendant trend, m will be greater than one. In a state that is less impacted by the trend, it will be less than one.

Once we have a derived an m for each poll in our database, we can then regress it against a series of demographic variables in the state where the poll was conducted to see whether there is any pattern to the residuals. Since our particular concern is with recent trends, we weight recent polls much more heavily when conducting this analysis. (A couple of technical notes: we discard any cases in which the pollster has polled the state just once, as m will always be one in these cases. Also, we discard cases where the weekly dummy is a very small number -- anything less than one, in fact -- as this can produce very large, highly erratic values of m).

The demographic regression that I perform on m includes relatively few variables. This is because there aren't all that many useful data points to work with -- we need very recent polls, and for those polls to have been conducted in a state that the pollster surveyed previously -- so there is more risk of overfitting the model. The particular variables we include are a state's partisan ID index, its Kerry vote share in 2004, its black population, its Hispanic population, its average per capita income, its percentage of senior citizens, and its percentage of evangelicals. With the exception of the Kerry and 'partisan' variables, which are too fundamental to the model to be excluded, these variables have the virtue of not being strongly intercorrelated with one another.

As it turns out, there are some patterns in where Obama's bounce is showing up. It is coming in states where Democrats have a strong party identification advantage (no surprise), and seems to be especially strong in states where many voters are registered as Democrats, but where John Kerry did not perform well in 2004. This particularly describes states like West Virginia and Arkansas, where Obama's numbers have improved significantly, and where (assuredly not coincidentally) Hillary Clinton also performed well. The other observable trend is that Obama's bounce has been larger in states where there are not a lot of African-American voters, simply because there are few marginal gains for him to make among that demographic. It will probably always be the case in this election that states with lots of African-American voters will be less responsive to trends in the polling numbers.

This demographic regression allows us to estimate a unique value of m for each state. I cap the values of m at 0.0 and 2.0, respectively. The average value of m will not necessarily be 1.0, as it could be the case that particular kinds of states are especially predisposed to a bounce, and those states have also been polled more frequently (in fact, this does appear to have been the case to a small degree over the past couple weeks). The present m values for some representative states are as follows:

Kentucky       1.98
Arkansas 1.93
Massachusetts 1.76
Oklahoma 1.66
New York 1.37
Michigan 1.05
North Carolina 1.01
California 0.97
Pennsylvania 0.93
Florida 0.71
Nevada 0.70
Ohio 0.54
Arizona 0.29
Utah 0.00

In adjusting our polling numbers, we take the trend from our LOESS estimator and multiply it by m. For example, say that our LOESS curve estimates that Barack Obama is polling 3 points stronger now on average than he was three weeks ago. If we take a 3-week old poll from Kentucky, we will adjust it upward (toward Obama) by (3 x 1.98) = 5.94 points. In California, we will adjust it by (3 x 0.97) = 2.91 points. And in Arizona, we would adjust it by only 0.87 points.

Taking into account the sensitivity of individual states to time trends produces a slightly less impressive result for Obama than we had been figuring on over the weekend, as his bounce seems to be most profound in states where he was already well ahead (like Massachusetts), or where he is probably too far behind to catch up (like Oklahoma). Still, we have seen at least some bounce for Obama across a large and relatively diverse array of states, and can expect to see that trend manifested in other states where new polls will come out unless his bounce begins to recede nationally.

35 comments

Anonymous said...

You are a genius.

I think you should stop doing CNN appearances, though, unless the commentators promise to spend an hour actually understanding what you are talking about.

David said...

I realize this isn't really an issue, but the term "cap" bothers me.

I know you don't have much data, but I think it's a workable way to get around the fact that this model is non-linear. Have you tried other specifications?

Anonymous said...

This is why I love this site.

Anonymous said...

Hi Poblano,

1. How do you determine m in the states for which you don't have trend data?

2. Do you still use the national numbers in any way?

Dan

Icy said...

Perfect.

That was exactly what i was hoping for when i complained about your "all with the same brush"-approach.

I am eating the cake right now.

Regards,
Icyclemort

aaron said...

I find these changes to be particularly elegant corrections to the criticisms levied against the original changes, taking into account the two most common complaints that (1) changes were modeled as a uniform additive effect across the country and (2) that it ignored the unique demographics of each state.
I greatly appreciate your time and patience in trying to make this site as good as it possibly can be
The only additional things I would like to see when you have time are an update to the FAQ's with the description of the changes in the methodologies and the current m values for each states for us who are interested in how are own states are likely reacting to the national trends (though I'm not sure where to fit that cleanly on the page)

TheThirdMan said...

There is an error in the third equation, where you solve for m. m=(margin-Pollster:State)/Weekly.

no sure if it has a major effect or not.

aaron said...

Since m is a relative response, this is obviously just a typo in the right up and who can blame Nate since he posted this description at nearly 3 in the morning

Jaime said...

Excellent, I was also one to complaint about not taking into account the underlying demographics, local races, etc. I think this is a great compromise now! Cheers!

Modeler said...

Hi Nate,

This is certainly better, but I think you can improve your model even more. The current method of estimating m and using LOESS smoothing introduces a few somewhat arbitrary factors, and the results still heavily depend on recent polling. It's good to see that you are addressing demographics though.

The model expressed in the first equation in your post is the better of the two. An even better model would be as follows:

Margin = Pollster:State + Sum(Demographic Variables) + e

where e is error term with 0 mean with a variance equal to

(sample variance)
+ (pollster-introduced variance)
+ (time-dependent variance)

and the time-dependent variance is proportional to the time elapsed since the poll was taken (this is an approximation, but a justifiable one.)

This is essentially your original demographic model with additional terms for time-dependent variance and a Pollster:State term. The time-dependent variance decreases the weight of the poll over time (it should be weighted as 1/variance), which is why you see the square-root-of-time behavior you have observed.

You may fear about overfitting due to the large number of Pollster:State combinations. You can reduce this somewhat by using separate Pollster and State terms. Personally, I prefer the separate terms because they make clear:

1) If there is accross-the-board bias for a given pollster
2) If there are some states for which the demographic model does unusually poorly.

However, even if you included separate terms, you still risk overfitting. The solution to this is to use regularization, or equivalently ridge regression. This is conceptually equivalent to putting a prior distribution over the expected degree to which the polls are influenced by the Pollster:State terms outside your demographic model. In theory, you should be doing this anyway, but it becomes more important the more variables you include in your model. Using a single paramter on all the Pollster:State terms (or one parameter for Pollster and another for State) make sense because your prior expectation about the magnitude of all of these terms should be roughly the same.

If you use a regularization parameters on the Pollster:State terms, you can include these terms directly in your regression and you will no longer have to worry about LOESS smoothing or m. Your regression model will completely capture biases due to Pollster:State combinations, and you could essentially just report the latest output of your regression model as the prediction.

You will still have to determine the regularization parameter, but this is typically done using cross-validation which is much less arbitrary than what you are doing now. It is completely deterministic, and the equations for solving this sort of model are trivial linear equations.

I've sent you some emails about your model in the past; if you'd like to know more details about how to implement this (and I promise: it's easy), let me know.

homunq said...

You are a badass, this goes way beyond what I expected from your correction.

However, I would still ask for just a hair more conservative numbers. Basically, you are still using the whole of every apparent bounce, just dividing it up more intelligently over the states. Yes, your regression smoothing is taking some of the statistical noise out of the bounces; that is good. But there is a contradictory point here. Say you have a nation made up of two equal-sized states, and you have godlike (zero-error) knowledge that the national bounce this week is 10%, but you are not exactly sure where to put this bounce. The uncertainty should always cut in favor of no change, even when it is uncertainty about the location and not the size of the bounce; the average of your two adjustments will end up being a bit less than 10%. What I am suggesting is that you multiply your measured state/nation correlations by a regression-to-mean factor, to be conservative. I would expect this factor to be in the range of .6-.9, possibly towards the higher end of the range.

Before you made this latest adjustment, my gut estimate of the right conservative factor was .5 - that is, I thought that the "we know more than we think" change was a 100% overshoot, very clever but with no actual improvement. Now, I think you're only overshooting by a small amount, so the change on the whole is a great improvement, but you could still do better by being a hair more conservative.

And anyway, I agree that you are not obligated to stick to just a polling average, but you should always try to have some factor of caution when you go beyond them. Otherwise, you can really end up going down the garden path, your numbers can start to come unstuck from reality.

Steven said...

Question: What happens when the National Tracking Polls begin to show a dampening in Bounce, as they have the last 2 days with Gallup's recent and Rasmussen's recent? As national polls begin showing a tighter race, will that effect the eventual outcomes?

I read the post twice, and honestly, you can catagorize me in the class of readers who trust your intelligence yet don't understand or comprehend how you arrive at your reasoning... not everyone is a stat gengius (I design pillows for a living so pls give me a break).

Modeler said...

An addition to my above post:

If you use separate terms for pollsters and states, your prediction could exclusively use the state terms and ignore the "dummy" pollster error, since we are trying to predict actual results rather than the next poll from a given pollster. Right now your method, as I understand it, essentially predicts what a November poll in this state conducted by one of the more recent state pollsters would look like. Ideally it should predict what the election results will be, correcting for pollster bias.

In short: use dummy pollster-dependent variables to fit the regression, but don't use these variables when making predictions. Use state-dependent variables both for fitting and prediction.

For example, imagine we know that Insider Advantage tends to underestimate Obama's margin. If the last poll for a given state were done by Insider Advantage, your current estimate would use this poll (plus a time-dependent correction) to predict actual election results. The alternative method would also include a time-dependent correction (in terms of the most recent demographic variables), and it would automatically adjust for pollster bias.

Steven:

The short answer is yes. As national polls show a tightening race, that should be reflected to varying extents across the board. Nate, correct me if I'm wrong.

Another Mike said...

Nate. love the time trend adjustment. But, please consider adding a row to each individual state polling showing (1) the 538 Regression before time adjustment, (2) the polling/regression average before adjustment (i.e., this number would be what you were showing before this major change), (3) the time adjusted 538 Regression, and (4) the time adjustment to the state's polls. This would allow your readers to see what the individual states look like under the prior methodology, which some prefer but without requiring all the work of producing double graphics. It would also be fascinating to see to what extent individual states are being adjusted for the time trend. Since you're already doing these interim steps to get your final results, it shouldn't require much additional work. And, it would give your readers valuable additional information in seeing how you get to your projections.

Another Mike said...

Should have written "rows" instead of "a row" and, obviously, these rows would be added after the individual polls for each state and before the projection row.

Alex said...

This isn't a methodological question but simply a presentation question:

I know you're updating the way your site present information, but if you're going to be adjusting previous polls up or down as bounces occur, can you include that data in the state/poll chart on the right? A new column that shows exactly how much "bounce" your new method is inserting into each state will make the new results much easier to quickly grasp.

Thanks,
Alex

Patrick said...

I think it's almost perfect now, but perhaps a bit too "strong."

Anyway, are you planning on updating the polling tables on the right some time? So much has changed since they were first put up, and I have no idea what they mean. Are those the actual polls, or the "corrected" polls? And it would also be able to see some of how much the national polling is affecting a given state.

Charles Pluckhahn said...

After looking at this more extensively, I can see that I was on the wrong track when I tried that analogy to bond duration. Your methodology has always been "time-leveraged" -- no more now than before.

What you're trying to do, it would seem, is generate data for states that lack recent polling. You are doing this by looking at a combination of other state polls and national polls, taking into account the trend of changes by using the LOESS technique, and adjusting everything by what you call the demographics of each state.

It also seems as if, even when a particular state is polled frequently, you're including the results of other state polls and national data. But I'm not sure of that; how do you handle a state that's polled quite frequently as opposed to one that's polled infrequently?

As before, you are extrapolating trends out to November, which necessarily introduces some degree of leverage on the movements of recent polling results. Your "hybrid" LOESS curve attempts to take a middle ground between "sensitive" and "conservative" approaches.

Did I miss anything?

homunq said...

Shorter me, above: your average m (weighted by 2004 turnout) should be less than 1 by some real amount. If I am right, that will happen naturally.

So what is your average m? It would be interesting to see the version weighted by (2004 turnout * 2004-2008 pop growth) and also the version weighted by electoral votes. The first would tell us how much of the national bounce you can safely assign to the states; the second would tell us how much could show up in the actual results.

Methow Ken said...

Nate,

Your "Refinement to Adjustment I and II'' were excellent and very interesting. I agree that Obama's bounce is not limited just to the states that are heavily (D), and likely extends to varying degrees to most if not quite all states; even to my old home state of ND.

Also expect that in some states Obama's bounce will at least to some extent be permanent, as the anger of the most ardent Clintonistas recedes with time.

And using a state-by-state variable LOESS curve smoothing parameter does seem like it provides the most accurate and best possible projection. A substantiated and objective projection of the result at the time of the General Election is in fact the most useful info this site provides; no other website I've found comes close.

It does jump out that the current RCP ''No toss up states'' split is 272-266 in favor of Obama; while your latest EC projection is 300.3-237.7. From my perspective another comparison of ''RCP current versus 538 November projected'' numbers in 4-6 weeks or so will be telling, after the immediate post-(D)-nomination flurry has faded.

All that being said:
I stand by my statement that Obama has no chance to win ND; it will not be close (although I'm not claiming it will be a total blow-out for McCain in ND either; just a solid win).

SIDEBAR: On this and other blogs I find it interesting and completely in-character; that those from the far-left tout even casual and long-ago association of McCain with people who have had problems as fair game forever, but anybody who brings up even long-term out-of-bounds associates of Obama is automatically racist (Wright, Ayers, and Rezko were clearly people Obama did much more than just bump into once or twice). While it's certainly true that a candidate does not necessarily share all the views of those he has assoicated with for many years, who s/he hangs around with on a regular basis is a perfectly valid factor that voters should and do consider. And people I know in the mountain and prairie west are not going to be in the least bit intimidated by having lefties call them racist, just because they won't vote for the most liberal member of the Senate who just happens to be black.
I'm one of those in the minority who wishes McCain would and could convince Condi Rice to be his VP; that would be a great combination for a campain theme of ''Adult supervision''.

Anonymous said...

In the previous thread, Nate wrote "Before I continue, I want to make clear what the goal of this project is. I want to provide you, at any given moment in time, with the best possible projection of what's going to happen in the November election."


So, ideally the 538 Super Tracker projection curve should be a straight line; should smoothen with more data.

Since 538 is going to update the November projections with the latest polls (state and national,) it would be great if we have an archive of each projection; win probability for individual states, electoral vote projection... and not just the popular vote which I believe is reflected by the Super Tracker curve.

Anonymous said...

I understand, the Super Tracker curve is not based on the popular vote "November" projections of 538.

I think a 538 Super Projection curve (with the new methodology) would be a really good idea.

This graph(s) should plot the 538 projections(electoral vote, win probability, popular vote etc) with the time of updates.

This graph(S) if it remains smooth should be a good indicator of the strength of the new methodology.

Daliant said...

Excellent! I am happy to see how you have taken some of the criticism in the comments and have made some very good and reasonable modifications to your system.

I'd also like to make one more point on the "one size fits all" criticism that recently came to me. I think, especially as we get closer to the election, the majority of recent polls are going to be coming from "battleground states" where the candidates and their parties are pouring tons of money, organization, appearances, and advertising into. So with the old way you would be getting trends based on where the candidates are spending all their resources and using those trends to extrapolate what is happening in states where virtually no resources are being spent (my intuition is that any bounces that occur in these non-battleground states will be smaller). Just another reason I'm happy to see your modifications - and probably another reason to keep a close eye on the state-by-state trends as the election moves along, to look for any differences between battleground and non-battleground trends.

Richard said...

To further Daliant's point: have you investigated the statistical validity of including some sort of variable in your regression of m for the amount of attention (money spent, personal appearances, etc.) each state is given? Like Daliant, I have a suspicion that there there will be a correlation between candidates' focus on a state and its share of any bounce.

Of course the logistical problem of obtaining such data may be prohibitive. Still, it is something to keep in mind

homunq said...

One more rehash:

When I said that your m's should (and probably do) average to less than 1, I actually meant that they should average to less than the m for the United States as a whole. If recent polling focuses on a non-representative set of states, the m for the US could fall significantly below 1; the average m should fall even further, or you are overcounting the volatility.

homunq said...

aack. I keep being too trigger-happy with the 'publish' button.

I mean that the average state m *from the regression* should be bigger than the *empirical* US m. Obviously if you use the regression-based US m, and all coefficients are linear, it will be precisely equal to the population average of the state m's, assuming all your independent regression variables are per-capita.

Robby said...

This strikes me as a reasonable compromise between the "national polls are baaaad" people (e.g. me) and the "more data, please!" constituency.

I like where this is headed...

von etc said...

I, too, am here everyday.

As I understand it, the 538 regression regresses all the poll results from the this election cycle (essentially 2008) against the set of 16 independent variables. It is then used to make estimates of the polling margin in a given state, based on the values of the 16 variables in that state.

My question is whether you can use the demographic variables to make better projections about likely voting patterns in November, based on election outcomes prior to 2008.

Your analysis of the '88, '92, '96, '00, '04 polling results -- with the substantial changes between June/July and November -- suggest that there will be significant changes to the polling results over the next several months, and thus the coefficients on the demographic variables may also change over time.

But is there not some more long-term use we can make of the demographic data, rather than having the coefficients on those variables vary as new 2008 polls come in? Are there relationships between demographics and polling trends from the prior election cycles? Do some demographics have more stable patterns in voting for a certain party over several election cycles?
This would seem to offer you the potential for November projections that might be different from addressing the question of "what would happen if the election were held today?".

Modeler said...

Von,

What you describe can be accomplished through the use of prior distributions / regularization. Maybe we'll see it in the 2.0 version of the model.

Colin said...

I'm not a statistics guy, but i'm concerned about the logic of saying that a bounce in one state necessarily holds true in the rest of the states... Yes, it works for national bumps like conventions and "unity" bumps, but if someone is focusing all their work in one state, an improvement in that state isn't going to infer an improvement nationwide.

For Example, if this was like the 2004 election, with the entire race coming down to 4 or 5 states, and the republicans are spending all their money in Ohio to try and move that state's polls, let's say they are successful. They improve in Ohio by 4 points in a month. Does that mean that they're going to improve in any other state by that much? no.

Right now, we haven't gotten into focusing on battleground states (and I hope we don't at all this election), so all the bumps we're getting now are national bumps based on national events. So, assuming a bump in NC will (in some way) be also felt in SC is reasonable. At some time in the future, this won't be the case any more, as some states get worked hard and some get ignored (or at least short shrift).

I'm wondering, is there a national component to your predicted bump number that would help differentiate between a nationwide improvement and just a localized bump?

jagorev said...

Holy crap, this is like the smartest blog ever. I think I'm regretting not taking more stat/econometrics classes in college.

Modeler said...

Colin,

Nate's most recent change makes sure that he considers the demographics of a state when determining to what extent a national trend is relfected in the state. The point you raise re: the effects of campaigning is a valid one which, as far as I know, is not currently captured by the demographic model. However, Nate should be able to capture this in his model by using variables such as "Average minutes of TV advertising seen per voter" or "Average number of times a voter has seen the candidate in person." He has used similar variables in the past for primaries, and I think eventually he might want to introduce such variables for the general election.

von etc. said...

to Colin

in addition to what Modeler said, recall that in Nate's method, any one poll result (say, in Ohio) is but one small ingredient to the final "trend" calculation. Nate's method "averages" numerous polls across multiple states. If a candidate experiences higher poll results in one state (say, Ohio) because of localized campaiging, but that result doesn't show up in other states polled in the same week, then the effect of that result in Ohio will be negligible on what is ultimately translated into Nate's projections in other states. Only if polling results are consistent across several states (e.g., like the ~5% bump for obama in several states in recent polls) will those results be translated to other non-polled states.
In other words, if localized campaigning only shows up in one state's polls, Nate's model won't likely apply it anywhere else.

信次 said...

情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣用品,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,情趣,

酒店上班請找艾葳 said...

艾葳酒店經紀公司提供專業的酒店經紀, 酒店上班小姐,八大行業,酒店兼職,傳播妹,或者想要到打工兼差打工,兼差,或者八大行業,酒店兼職,想去酒店上班, 日式酒店,制服酒店,ktv酒店,禮服店,整天穿得水水漂漂的,還是想去制服店上班小姐,水水們如果想要擁有打工工作、晚上兼差工作兼差打工假日兼職兼職工作酒店兼差兼差打工兼差日領工作晚上兼差工作酒店工作酒店上班酒店打工兼職兼差兼差工作酒店上班等,想了解酒店相關工作特種行業內容,想兼職工作日領假日兼職兼差打工、或晚班兼職想擁有快速賺錢又有保障的工作嗎???又可以現領請找專業又有保障的艾葳酒店經紀公司!

艾葳酒店經紀是合法的公司工作環境高雅時尚,無業績壓力,無脫秀無喝酒壓力,高層次會員制客源,工作輕鬆,可日領現領
一般的酒店經紀只會在水水們第一次上班和領薪水時出現而已,對水水們的上班安全一點保障都沒有!艾葳酒店經紀公司的水水們上班時全程媽咪作陪,不需擔心!只提供最優質的酒店上班,酒店上班,酒店打工環境、上班條件給水水們。心動嗎!? 趕快來填寫你的酒店上班履歷表

水水們妳有缺現領、有兼職缺錢卡奴的煩腦嗎?想到日本留學缺錢嗎?妳是傳播妹??想要擁有高時薪又輕鬆的夜間兼職工作,打工機會和,假日打工,假日兼職賺錢的機會嗎??想實現夢想卻又缺錢沒錢嗎!??
艾葳酒店台北酒店經紀招兵買馬!!徵專業的酒店打工,想要去酒店的水水,想要短期日領,酒店日領,禮服酒店,制服店,酒店經紀,ktv酒店,便服店,酒店工作,禮服店,酒店小姐,酒店經紀人,
等相關服務 幫您快速的實現您的夢想~!!