6.16.2009

If He Did It

In the first round of voting in 2005, the three conservative candidates for Iran's presidency -- Mahmoud Ahmadinejad, Ali Larijani and Mohammad Baqer Qalibaf, got a collective 41 percent of the vote. Last Friday, ostensibly, Ahmadinejad got 63 percent of the vote. Where exactly did those extra votes come from?

One way to address this question is by means of multiple regression analysis. We can take the vote shares for the seven candidates in 2005, and compare in each of Iran's 30 provinces to the share of the vote received by each of the four candidates this year. I will weight this regression by the square root of the number of votes in each province, to give more emphasis to those with larger vote totals.



The way to read this table is from left to right. If we look at the second row down, for example -- those votes which went to Ali Larijani in 2005 -- we estimate that 83 percent of his voting block went to Ahmadinejad, 16 percent went to Mousavi, and 1 percent went to Karrobui. Those results are not surprising; Larijani was a conservative, and so is Ahmadinejad.

Likewise, Ahmadinejad apparently picked up most of the vote from Mohammad Baqer Qalibaf, the conservative former mayor of Tehran. He also kept most of his own vote. Mousavi, meanwhile, picked up most of the vote from Mohsen Mehralizadeh and Mostafa Moeen, two reformist candidates. Those results aren't surprising either.

Let me warn you, before proceeding further, that the margins for error on this type of analysis are quite high. It should not be considered definitive. We would need to look at city-level data to come up with more robust estimates.

But with that disclaimer, at least one result is rather surprising. Namely, Ahmadinejad appeared to pick up most of the vote from Mehdi Karroubi, who is routinely described as the most liberal of the candidates. This is in spite of the fact that Karroubi himself was on the ballot this year; he appeared to retain only about 5 percent of his own vote.

Renard already detected this pattern, so I am not really telling you anything new. But it seems to me to be the key to explaining the Iranian election -- whether it was legitimate or whether it was rigged. Ahmadinejad won all of the provinces that Karroubi won in 2005, and his cumulative share of the vote in these 11 provinces was 66 percent, exceeding his overall total. In the province where Karroubi did best in 2005, his home province of Lorestan, Ahmadinejad got some 71 percent of the vote.

Now, most of those provinces where Karrobui did well in 2005 are rural, and it's possible that the rural tilt toward conservative candidates was greater in 2009 than in 2005. As I mentioned earlier, Ahmadinejad actually didn't do that badly in Iran's most urban province, Tehran, in 2005, but he performed relatively poorly there last week. If Ahmadinejad won the election, he did it by winning over these rural Karrobui voters. And if he stole it, those were the votes he stole or intimidated.

A secondary factor in Ahmadinejad's purported success was his ability to capture most of the vote from Akbar Hashemi Rafsanjani, a moderate (by Iranian standards) whom Ahmadinejad defeated in the 2005 run-off. Rafsanjani won three provinces in the first round in 2005: Kerman, Zanjan and Gilan. Ahmadinejad ostensibly won all three, grabbing 78 percent of the vote in Kerman, 77 pertcent Zanjan and 68 percent in Gilan. Kerman, indeed -- Rafsanjani's best province in 2005 -- was Ahmadinejad's best in 2009.

This is arguably a bit easier to digest than Ahmadinejad's success among Karroubi voters. Rafsanjani was Preisdent of Iran between 1989 and 1997 and was the closest thing the voters had in 2005 to an incumbent; Ahmadinejad, of course, was the incumbent this year. However, Ahmadinejad and Rafsanjani have quite a rivalry, with Ahmadinejad calling Rafsanjani a "puppet-master" and Rafsanjani calling Ahmadinejad a liar on the eve of the election.

But this is Mahmoud Ahmadinejad's story, and we're guessing that he's sticking to it.

60 comments

Navid said...

Nate there is on fundamental problem in your analysis. Qalibuf was a young technocratish guys which is considered moderate or even mildly reformer guy.

julia said...

Second! Ath Cliath Abu!

Patrick said...

Keep the analyses coming -- best place to go for news about the "heart" of the issue: if the election was rigged, and, if so, how. Thanks Nate.

Kenny Powers said...

40 million hand balots counted in 3 hours flat. wanna give a probability on that ?!?!

JesseLivermore said...

This was not about "reformers vs. conservatives." Moussavi was Prime Minister of Iran during the 80's, and presided over 1000s of political executions. This is about the Old Guard of the Revolution (Moussavi and Rafsanjani) against a newer movement that wants to establish a military dictatorship.

http://www.wiserthanthecrowd.com/2009/06/its-getting-even-harder-to-find-out.html

Energy Evolutionist said...

The last 30 years have been repression under the guise of a government. The vote was clearly rigged, but that is beside the point. The entire government needs to be overthrown and the constitution rewritten to ensure true freedom that can not be denied whenever someone who claims to be the representative of G-d on earth decides that what he wants to do.

Matthew said...

Rezai is considered quite conservative, not a moderate. There was some hope that he would splinter off the anti-Ahmadenijad conservatives but that seems to have not happened.

Zach said...

What happened with the runoff in 2005? To win against Rafsanjani, Ahmadinejad would have had to have captured a part of the liberal vote. If he captured it from Karrobui in '05, then the results in this round would appear to be more legitimate, or at least, more consistant.

Amanda said...

Why is it that, according to your analysis, Ahmadinejad only held on to 72% of the voters that voted for him in 2005, yet got 83% and 82% of voters who had previously voted for other candidates? Unless both Larijani and Qalibaf are significantly more conservative than Ahmadinejad, that result seems counterintuitive to me.

bruce b. said...

In the runoff in 2005, Ahmad. got 63.4%; Rasfanjani got 36.6%.

there are two glaring problems with your analysis. Overall, it seems like you are bending over backwards to try to show some statistical "smoking gun" leading towards theft.

1) why are you comparing the FIRST ROUND in 2005 with the FIRST ROUND here? Isn't it obvious that this election had more of the characteristic of a two-person run off?

2) There were 38.728 million total votes in this election. That is 10-12 million more than last time.

Both of those factors are far more important than your obscure and hard to explain "multiple regression."

busiestday said...

Has anyone applied Benford's Rule to the results?

This rarely comes up in the US since the problem is rarely that people just made numbers up.

But the IRS uses it on tax returns

Simply put, for non-fabricated numbers, the lower digits actually appear more often.

1 = 30.10%
2 = 17.61%
3 = 12.49%
4 = 9.69%
5 = 7.92%
6 = 6.69%
7 = 5.80%
8 = 5.12%
9 = 4.58%

If people are making up numbers off the top of their head that "look good", they tend to put more 7, 8, 9 in them than nature would.

I would have already done this to the province-by-province results, but Nate only gave us an image, not the raw data :-(

zinfan94 said...

Nate, I think you should take another look at this analysis with one other fact included. Your analysis assumes a fairly static electorate, which I think is reasonable, EXCEPT for the large number (10M+) of new voters. In 2005, many moderates and liberals did not vote. Only 28 million votes were cast.

In 2009, the official vote count is over 38 million (the other "leaked count" supposedly from Mousavi camp is 42 million). In either case, there were at least 10 million new voters, and most analysts would expect these votes to be more overweight to the challenger, than the incumbent.

But to be conservative, assume the new votes were evenly distributed (say split 50/50)... then re-do the analysis to see how much of Karroubi's and Rafsanjani's vote would have had to swing to Ahmadinejad.

The numbers simply don't add up!

Further, if you adjust the minority candidates to take at least 10% or 20% of the vote, which is much more likely compared to previous first round elections, the numbers get even more ridiculous. It seems almost impossible that Ahmadinejad could collect over 50% in the first round.

hosertohoosier said...

It isn't clear to me that conservative vs. reformist is the only important cleavage in Iranian politics. In fact it looks like you are imposing a distinction that is primarily of importance to Americans, not necessarily to Iranians (and we tend to label folks "reformist" based more on their international policy than their domestic policy).

The results of the run-off in 2005 demonstrate this decisively. If the cleavage in Iran was decisive cleavage conservative vs. reformist, then Rafsanjani would have won (unless that was rigged too).

I would guess that voting is based largely on regional patronage networks, not an ideological split (at least for the majority of voters).

pokerjo said...

Benford's law occurs because a lot of non-fabricated real-world data follows a power law distribution. I don't see why vote tallies would follow such a distribution.

Erik Nilsson said...

@Kenny Powers "40 million hand balots counted in 3 hours flat. wanna give a probability on that ?!?!"

That's fast, but it's not as preposterous as it sounds to an American. The ballots, AFAIK, had only one question on them, as opposed to the many questions on the typical American ballot. So you can count by sorting into piles. If you want to be meticulous about it, it takes some time. I worked on an election with about 80 million hand-counted paper ballots. I recollect we got the bulk of the counting done in about a day.

Also, sloppy isn't the same as fraudulent. In a big national election, if you're not trying to be perfect, you can go surprisingly fast and still end up with a result that is very, very close to what a more careful count would give you.

So, it's conceivable to me that a well organized effort pursuing speed to the modest detriment of accuracy could in fact count 40 million ballots in 3 hours.

However, my question is, was it possible to collect that number of ballots in ballot boxes from polling places to counting places (in rural areas in particular), and report up central results that fast? (Perhaps this is what Kenny meant.) I suspect that this is completely impossible in Iran. Without knowing a lot about how the Iranian election was organized logistically, I can't say for sure, but Iran includes some fairly remote, rugged areas. Unless counting was all in polling places and reported by satellite phone,
it's hard to think of a way to do it that fast. Even with polling-place counting, it sounds tough.

So the speed of reported results is a potentially fruitful way of trying to understand more about what really happened, but as with most election questions, you have to be sure everybody interprets the question in a similar way.

Mighty said...

@busiestday:

I believe the full numbers were posted in an earlier post. Check here:


http://www.fivethirtyeight.com/2009/06/iran-does-have-some-fishy-numbers.html

@bruce b.

Multiple regression is neither obscure nor hard to explain. Check out Wikipedia at least
http://en.wikipedia.org/wiki/Regression_analysis before you try to rag on someone who obviously knows more about regressions than you.

Mark said...

The Iranian ballots don't need a recount so much as they need a count.

Shap said...

If he did it? Kind of like if O.J. killed Nicole?

Shaun Martin said...

Just wanted to say that I loved the title, Nate.

bruce b. said...

@mighty: let's not make it personal, please. Read what I said. the two factors I pointed out are (second round returns in 2005 as baseline; taking extra votes into account) were ignored by Nate. Second, if the model he used is not obscure (and in this case, it IS), then explain the math of the model to me. Then maybe we can analyze it together.

I, for one, don't believe Nate's word to be a fatwa on this or any other point. Sure, he knows more about statistics than I do. But it is clear that he wasn't applying "Occam's razor" (the cleanest simplest argument) in this case.

zinfan94 said...

I posted on one of the earlier threads, about the Iranian vote counting process. Each polling place is counted and tabulated onto a Form 22, which is then signed by the counters and observers. Following this, the signed Form 22s (along with the ballot boxes) are physically delivered to the province HQ, where the information is accumulated onto a Form 28.

I am not sure how the Form 28 information is collected, but it appears this information is transmitted to the Interior Department, which generates the final vote count and breakdown.

In previous elections, this process took all night, then the preliminary results were released, followed by the Supreme Leader accepting the vote three days after the election.

Along with cutting off communications, arresting other candidate personnel, and deploying security personnel onto the streets, this process was supposedly completed in less than four hours, in this election.

Hard to believe.

bruce b. said...

I mean, can anyone seriously argue that Nate's "how he did it" was the ONLY way? Or the simplest way to account for Ahmadinejad's 63%? Rubbish. Just take the 2nd round results from 2005... and also look at where the "extra" 10 million votes were.

Erik Nilsson said...

Thanks, zinfan94. Missed your earlier comment.

the Lady Gooner said...
This post has been removed by the author.
the Lady Gooner said...

Nate, I have a question about the percentage of the population that is being reported as voting. I have heard that 85% voted... but is that the percent of eligible voters or the total percent of the population. If it is the latter, that also seems highly suspicious given the skewed demographics of Iran's populatio

Juris said...

@Nate: Nice allusion to O. J. Simpson.

Tom said...

Kenny Powers at 5:58 PM said: 40 million hand balots counted in 3 hours flat. wanna give a probability on that ?!?!

One of two ways: If you have a million people counting the votes, it won't take long at all. Alternatively, if no one is actually counting the votes, it also doesn't take long.

Nate, anyone know if they hired hundreds of thousands of poll workers? I'm guessing no.

Aaron said...

Nate, I'm glad you included the disclaimer, but I'm not sure that the estimates are even worth doing given how much bias the Ecological Fallacy (http://en.wikipedia.org/wiki/Ecological_fallacy) can induce. Or, at the very least, you should not cast the results as individual-level behavior; i.e., keep the unit of analysis at the Province level.

juvanya said...

A few things:
Ahmadinejad is a radical. Conservatives support the revolution structure, but loosened, especially in economics. But they also don't support vitriolic foreign policy

Rezaee is a conservative.
Mousavi is a moderate/reformist.
Kerroubi is a reformist.

Rafsanjani is a conservative-moderate.

juvanya said...

@bruce b.
This can't be compared to the 2005 runoff. First, that was a runoff with naturally depressed turnout. Second, it had two candidates instead of four. Third, Karroubi and Rezaee are not Naders.

Ian said...

There is a smoking gun that the declared results in the recent Iranian election are not remotely credible.

Ignore the national results. Ignore Ahmadinejad and Mousavi. Concentrate on Mehdi Karroubi and on Lorestan.

Here are the recently announced results for Lorestan in the 2009 Iranian first-round election – the one we just had. I got this copy from Nate Silver’s site at http://www.fivethirtyeight.com

Ahmadinejad 677 829
Mousavi 219 846
Rezaee 14 290
Karroubi 44 036

OK, so we have Karroubi getting 5% of the vote in Lorestan. Lets see how he did 4 years ago …

http://www.electoralgeography.com/new/en/countries/i/iran/2005-president-elections-iran.html

LORESTAN
===============================================================
First round Second round
Candidate Votes % Votes %—————————————————————————————————-
Mahmoud Ahmadinejad 69,710 08.8
Mehdi Karroubi 440,247 55.5
Ali Larijani 31,169 03.9
Mohsen Mehralizadeh 6,865 00.9
Mostafa Moin 53,747 06.8
Mohammad Baqer Qalibaf 70,225 08.9
Akbar Hashemi Rafsanjani 121,130 15.3

By the way, Karroubi is from Lorestan, has his own political party, his own newspaper, was Speaker of the Parliament, came a strong third overall in the 2005 elections and may therefore fairly be assumed to have some sort of effective political machine in Lorestan, his home province.

Never the less, by the official results, in his own province, he goes from 440 247 votes in 2005 down to 44 036 votes in 2009.

It is simply not within the bonds of possibility that a regional Opposition candidate goes from 55% to 5% in his own province in just four years.

Ian Whitchurch

busiestday said...

@Mighty...I only ever saw *pictures* of results from Nate.

@pokerjo...I think Benford's law applies..votes start counting at 0, have no natural "sticking points" (like items costing $24.95), and no pre-set lower bounds (like a minimum check size of $100 or needing 10000 people to be a city) or upper bounds, so Benford should apply just like it does to all kinds of counting (street addresses, electric bills, etc).

If I download the town-level data from the Iranian Ministry of Information (found here http://www.moi.ir/Portal/File/ShowFile.aspx?ID=0793459f-18c3-4077-81ef-b6ead48a5065 )

And not reading Farsi, I back into the column titles because I know the official vote total are

62.6% Ahmadinejad (column C)
33.8% Mousavi (Column F)
01.7% Rezai (Column D)
00.9% Karroubi (Column E)

With F G and H being Total, Other and Total-Other (Top4).


So I can put titles on my columns and count first digits as my Benford's law analysis (http://www.auditnet.org/articles/JFA-V-1-17-34.pdf) on the first digit of each cell.

Every column totals 366 because there are 366 election districts

# Ah Re Ka Mo To Ot T4 AllObs
1 95 96 125 130 80 111 79 716
2 85 67 57 60 74 54 75 472
3 54 51 44 47 51 47 48 342
4 30 37 29 39 46 28 48 257
5 32 30 24 23 38 29 36 212
6 23 25 16 18 20 32 20 154
7 17 24 41 17 26 24 27 176
8 20 18 13 13 21 18 20 123
9 10 18 17 19 10 23 13 110
0 - - - - - - - 0
366 366 366 366 366 366 366 2562

For 366 rows (2562 total observations)

First digits as a fraction of all first digits by column:

# Ahmad Rezai Karro Mousa Total Other Top4 All Benford
1 0.260 0.262 0.342 0.355 0.219 0.303 0.216 0.279 0.301
2 0.232 0.183 0.156 0.164 0.202 0.148 0.205 0.184 0.176
3 0.148 0.139 0.120 0.128 0.139 0.128 0.131 0.133 0.124
4 0.082 0.101 0.079 0.107 0.126 0.077 0.131 0.100 0.097
5 0.087 0.082 0.066 0.063 0.104 0.079 0.098 0.083 0.079
6 0.063 0.068 0.044 0.049 0.055 0.087 0.055 0.060 0.067
7 0.046 0.066 0.112 0.046 0.071 0.066 0.074 0.069 0.058
8 0.055 0.049 0.036 0.036 0.057 0.049 0.055 0.048 0.051
9 0.027 0.049 0.046 0.052 0.027 0.063 0.036 0.043 0.046

Now there's going to be random variation, and 366 observations is small over which to even things out (you'd prefer precint-level, just to get more observations)

So, yes, I'm punting on the significance tests. Can someone else do them?

I'd start with Mousavi because he has twice as many numbers beginning with "7" as he should (11.2% instead of 5.8%)

busiestday said...

I mean Korroubi as 2x the number of 7s he should

hosertohoosier said...

Regarding the hand-counting in 3 hours thing, a few points. Firstly, you silly Americans, there are lots of countries that count paper ballots (Canada, for instance), where the result of elections are known within a few hours. You don't need to count every vote in order to call an election (from a media standpoint).

Iran may also have had advance polls, such most of the results had been counted already by election day.

As for the comment "but counting paper ballots would require hundreds of thousands of workers", err... yes, you are right. That is how you do it. Again, using my Canadian comparison, we had 170,000 ballot counters in 2006. That was for 15 million votes (I don't know how long they worked, but they got paid $185 a pop).

Iran's elections had 40 million votes, so assuming they worked at Canadian speeds, they probably needed about 450,000 vote counters (probably paid less than $185).

Kenneth Ranson said...

Nate an interesting analysis. I believe however that the mechanics were slightly different.

I think that the reformist voters you have identified, those who previously supported Karroubi and Rafsanjani, did in fact change their votes, but not to Ahmadinejad. I believe that seeing that Mousavi had a legitimate chance of getting elected these voters moved to him precipitating a groundswell and an actual margin of victory for Mousavi of close to the 60% he claims.

Faced with this fact, Ahmadinejad then stole the election by the transparent but effective device of simply moving some 20% of the total vote from Mousavi's column to his own. When Ahmadinejad did this there was probably some preference for changing the results in rural provinces, that were harder to monitor, rather than in Tehran. This would account for the massive, and entirely unbelievable shift of Karroubi support to Mousavi.


Faced with this fact, Ahmadinejad then stole the election by the transparent but effective device of simply moving some 20% of the total vote from Mousavi's column to his own. When Ahmadinejad did this there was probably some preference in changing the results in rural provinces that were harder to monitor rather than in Tehran. This would account for what the massive, and entirely unbelievable shift of Karroubi support to Mousavi.

juvanya said...

@hosertohoosier:
That's because they have reliable opinion polls and small areas to go with. There is simply no way to count 45 million something ballots that fast in a country like Iran. They can not afford to hire that many people to make it that fast.

Also, in every previous election, it took days to get results.

Erik Nilsson said...

@hosertohoosier: "There is simply no way to count 45 million something ballots that fast in a country like Iran. They can not afford to hire that many people to make it that fast." I find statements like the frustrating because I agree with the point the author is trying to make, even though their statements are actually false. There most certainly are methods that *might* work to count 45 million ballots in three hours in Iran, and many of those methods are things the fairly wealthy country of Iran could afford to do. Iran almost certainly hired enough people to do it, because they needed that many poll workers in the first place, and one fast method is to count in the polling place as soon as polls close.

I doubt Iran did implement a plan to really count the vote in three hours, and I doubt they would have succeeded in producing announced semi-official results in that time had they tried. I therefore doubt Iran can demonstrate that the vote count is credible, and the onus should be on the Iranian government to demonstrate that they did as they claimed. But if you base your argument on the false idea that such things are impossible, you are easily defeated, because then all you are asking Iran to do is show that there is a method they could have used to get a count in three hours. It is, at the outside margin *possible* to count 45 million ballots and more or less report the totals in 3 hours, so you aren't asking for enough.

@juvanya; "Also, in every previous election, it took days to get results." That's a valid point. What actions other than fraud can the Iranian government claim to have taken that speeded up the count. If none, then why were previous counts slower?

(If the Iranian power structure weathers this storm, I am confident they will never again make the mistake of reporting results so quickly.)

bruce b. said...

Regarding the low votes figures for the two minor candidates, Rezaee and Karroubi, which are being cited repeatedly as "proof" of cheating: the Ballen/Dougherty "Terror Free Tomorrow" Western poll PREDICTED this almost exactly... they would each be in the 1% - 2% range. So q.e.d, you can't CITE this as proof of cheating. Maybe they DID in fact have votes removed from them... but there is a logical, simpler explanation.

Politically, the explanation is that the vast majority of the reformist vote gravitated to Mousavi. Why is this so hard to understand? It happens all the time in the US with 3rd parties.

hosertohoosier said...

Yes, Iran is poorer than Canada. But tabulating elections with paper ballots is a labour-intensive activity, and cheap labour is abundant in Iran. So even if they have less money with which to hire poll workers than Canada, those workers are much cheaper. With unemployment at 12.5% officially (and surely higher unofficially), there is a much larger reservoir of poll workers to draw on.

As for Canada having an advantage in its smallness... err... Canada is the world's largest country, and has one of the lowest population densities.

Secondly, Iran did not count all of the votes in 3 hours. According to IRNA 66% of the ballots had been counted by early Saturday. That story is dated June 12 7:10 EST, or 4 am June 13th in Iran.

http://www.reuters.com/article/topNews/idUSEVA14340720090612?feedType=RSS&feedName=topNews

Most of Iran's polls closed at 9:30 am EST, or 6:30 PM Iranian time. Where does that 3 hour number come from? In a few cases where there were long lines, a 6 hour extension was granted. However, in the majority of cases, there were 9 hours to count, not 3.

Source: http://www.npr.org/templates/story/story.php?storyId=105287388&ft=1&f=1004

By the end of Saturday, 80% of the vote had been counted.

http://english.aljazeera.net/news/middleeast/2009/06/2009612195749149733.html

Are these numbers "too fast"? You claim this is not typical for Iranian elections. I think you are pulling this claim out of your rear end.

In the 2005 Iranian election, the polls closed at 22:22 June 17th in Ireland (see below for source), but that was partly because of a 4 hour extension. So polls closed in Iran at 2 am on June 18th, though officially at 10 pm on June 17th.

http://www.rte.ie/news/2005/0617/iran.html

60% of ballots were counted at some point on Saturday, when this article was released. Of course a result took longer to announce because it was a multi-candidate race, and a small difference in votes would have a large impact on the outcome (since only the top two guys go to the runoff).

http://www.washingtonpost.com/wp-dyn/content/article/2005/06/17/AR2005061701344.html

According to the Seattle Times, 66% of the vote was counted in 2005 by June 18th 12:47 am Seattle time, or 12:47 PM Iran time (10 hours after polls closed, and 14 hours after they closed in most of the country).

So yes, the vote count was a little slower in 2005. However, the difference was not radical and could easily be explained by different allocation of resources, and the added difficulties of a multi-candidate race.

The 3 hour figure (which should be 3 and a half hours) assumes that vote counting didn't start right away, but forgets that there was a 6 hour extension. It also ignores that only 66% of the vote was counted in this time, not all.

If you want an example of fast, Canada usually has close to 100% of votes counted by 3 am.

heyalchang said...

What do you think about stratfor's analysis that western focus on sources which spoke english and lived in the urban centers is driving a misunderstanding of internal iran dynamics?

http://www.stratfor.com/weekly/20090615_western_misconceptions_meet_iranian_reality

Bob X said...
This post has been removed by the author.
Bob X said...

@busiestday: I calculated the chi-square for the Korroubi column (after making a horrible algorithmic error, hence my deleted post) at 28.1, and at df 8 there is only a .005 chance of chi-square greater than 21.955; even applying Benferroni's rule of thumb (there are four columns, and we are picking out the worst-looking, so there is a 2% chance that the worst of four comes out with chi-square > 21.955) we have over 98% confidence that the Null Hypothesis is false (Null Hypothesis meaning the preconditions for Benford's Law, numbers resulting from a random count starting at zero with no preferred stopping points).

Joan said...

Leave aside that several commenters here seem so deeply mired in their own self-geekitude, they're missing the barely-concealed sarcasm in Nate's post, has it not occurred to anyone that the [not an ayotollah] Khameni picked the number he did had nothing whatsoever to do with an attempt to steal the election fairy away from Iranian voters, but instead was deeply personal?

While the aparatchick Ahmadinejad was still in school and working waaaHAAAY-behind-the-lines of the Iran-Iraq war, all of NAA K, Mousavi, Rafsanjani, Khatami and every single other one of the various semi-ayotollahs who has actually had his name mentioned in the dispatches of this uprising, were all in the upper reaches of the Khomeni Kabinet. It seems clear that Mousavi is far from a 'natural' reformer; he worked very well as P.M. through the toughest time in post-revolution Iran, and with NAA K as President, with reports being consistent that NAA K hated how unrelentingly disputatious Mousavi was - which implies that since Mousavi could not have survived so long there, indeed THRIVED, without the support of Ayatollah Khomeni, he of all the surviving founders is Mister Most Likely To Be Considered Chief Rival to NAA K - and was and is among if not the biggest sophist of the bunch.

Again: the reports are that NAA K utterly despises Mousavi, and so despises the idea of having to deal with him, plus very likely fears his standing with the mullah mafia.

So: it makes sense that the 63% 'award' was very likely, MOST likely a transparent slap in Mousavi's face.

And no unarguable facility with multiple regressions analysis is going to compete with someone pulling a number out of his ass to shove in his opponent's face.

(To those who would counter that there's no way kindly looking Mousavi would ever sacrifice the lives of kids to win a personal grudge match, consider just the reported incidents of literally hundreds of thousands of citizens including kids - PARTICULARLY kids - sacrificed by Iranian Republican government on the watches of Khomeni and Khameni - including mass murders of helpless prisoners of the mullahtocracy and long-term "chain murder" binges.]

As ugly as are the street assaults, the real battle is taking place in Qom -- and no matter who wins, the post-uprising murders will go on for weeks if not months.

Bob X said...

@busiestday: on the other hand, the Moussavi column has a chi-square of 10.0, meaningless (about a 25% chance of a chi-square that large).

Fangz said...

bruce b does have one point, beyond the rhetoric and the failure to read the previous posts on the site. It would be interesting to look at a regression analysis of the 2005 first round -> second round results.

Opus 132 said...
This post has been removed by the author.
Opus 132 said...

Regarding the low votes figures for the two minor candidates, Rezaee and Karroubi, which are being cited repeatedly as "proof" of cheating: the Ballen/Dougherty "Terror Free Tomorrow" Western poll PREDICTED this almost exactly... they would each be in the 1% - 2% range. So q.e.d, you can't CITE this as proof of cheating. Maybe they DID in fact have votes removed from them... but there is a logical, simpler explanation.

Well,how about this for an explanation of the reported numbers:

Instead of the "pulling numbers out of his ass" as Joan phrases it,the designer of this fraudulent vote used the totals of an existing poll. That's right,why not use the figures from that American poll that showed that Ahmadinejad would win big,make up supporting detail,and eliminate counting the actual ballots.The resultant phony numbers would look legitimate thanks to the work of those blessed Americans Ballen and Doherty.

Voila,the (almost) perfect crime!

Fangz said...

Actually, are the second round provincial results for the 2005 elections available anywhere? The only results table I can find seem to only have provisional national vote totals.

John said...

@Opus 132

"Regarding the low votes figures for the two minor candidates, Rezaee and Karroubi, which are being cited repeatedly as "proof" of cheating..."

The TFT data looked at the nationwide vote and doesn't seem to capture the veyry high votes Mousavi and Karroubi would have gotten in their hometown/province/ethnic enclaves. The poll simply doesn't take into account the density of votes in places like Lorestan.

What's the regional/provincial spread on those polled by TFT?

Joseph said...

Is it really possible to reduce Iranian politics to this left-right continuum? I'm not saying it isn't but I'm skeptical that politics in such a different system and culture would necessarily be so easily reducable to such a Western formulation of politics.

Joan said...

Opus 132 at 5:04 a.m.

I would not suggest it's impossible those who devised the reported numbers resorted to a convenient premise in case they needed defending. Your suggestion of the Ballen/Doherty study serving that purpose seems plausible; indeed, at seems to fit the notion of 'steal from one column - add to another' Nate is posting about here.

Moreover, since you and I agree, and all posting comments here agree with Nate's Iran Position 2.0, that, whatever the rationale(s) behind the reported numbers, it's wildly improbable they accurate report the faithfully recorded of the votes of several tens of millions of diverse Iranians, then the exercise we're engaged in here, led by Nate, is whether the thinking behind the numbers chosen to be reported are rooted in something more solid than the effluent to which, I take it, you interpreted I was referring.

Even effluent comes to its condition from some other. Indeed, were one to credit the confabulators with pure novelty, one would also expect of anyone capable of such imagination to come up with a product that's also more plausible at some level. MY point is that, since the device itself (the invention, the product, the fiction) is so implausible, it's MORE worthwhile to consider the motivations for reporting such a transparent fiction, than it is to root through the scat of the invention in pursuit of its origins.

But I have to concede that the choice as to which pursuit is more worthwhile is a matter of personal preference. I'm comfortable with your hypothesis until its credibly shot down; but it still doesn't go to motivation for the fiction, and I'm saying we have quite a bit of evidence, including the dimensions of the implausibility, to enable us to hypothesize as well about motivation. I assume you'd not be offended by my notion that there's possible value in attempting to extend the findings of mathematical exercise in an effort to derive meaning.

mclever said...

Joseph: "Is it really possible to reduce Iranian politics to this left-right continuum? I'm not saying it isn't but I'm skeptical that politics in such a different system and culture would necessarily be so easily reducable to such a Western formulation of politics.

I would argue that it's not really possible to reduce Western politics to a simple left-right continuum. There are at least four dimensions in play in Western politics. (social vs. fiscal, personal vs. private freedom, authoritarian vs. libertarian, liberal vs. conservative, etc.)

That said, the "conservative" vs. "reform" paradigm is how the Iranians appear to be classifying themselves. Therefore, for the sake of this analysis, we can probably accept that paradigm as having moderate legitimacy with how the voters are viewing the candidates.

Of course, to our western perspective, their "reform" candidates are still very conservative...

jdk said...

Benford's Law is really the only analysis that makes any sense in this circumstance. (Others have already posted. As to "significance", the way to handle this is to see if any of the data falls outside 3 std of the expected percentage (hence a sign that it makes sense to look for a special cause (cf. Deming and Shewhart).

p-bar = expected percentage
N = number of obvervations (i.e., precints or voting districts)

Upper and Lower Control Limits =

p-bar +/- 3 times
[
[p-bar times (p-bar minus 1)]
divide by N
]


If there was fraud and the frauders were not sophisticated, the fraud will out itself in the numbers.

Bob X said...

jdk: been there, done that. The number of 7's in the Karroubi column using busiestday's data is 4.4 standard deviations from the expected, for well under a .0001 probability; now that probability should be multiplied by 36 since we have four candidates and nine first-digit-totals for each, and we have picked out the worst-looking one (that is, the probability that the worst of 36 comes out as a 1 in 10,000 chance is 36 in 10,000), but still we have over 99% confidence these numbers are bogus.
Above, I computed the chi-square which is a more appropriate approach to a table of data like this; the figure for the Karroubi column, 28.1, is way off the charts, negligible probability of this happening by chance. The other columns are not so bad. My impression of the way these vote totals were obtained is: the votes were switched between Ahmedinejad and Moussavi in most precincts where Moussavi was ahead; votes for the minor candidates were just reassigned to Ahmedinejad and low numbers for the actual candidate invented out of whole cloth.
But as has been expressed above, figuring out the how is not so terribly important. We do need to satisfy ourselves that yes, these numbers are nakedly fraudulent.

wv: fomanti, could this foment an anti-regime movement?

Bob X said...

OK, fresh cup of coffee, I finished the chi-squares:

Ahmedinejad: 15.3
Razaee: 3.2
Korroubi: 28.1
Moussavi: 10.0

In a 9-row column, expected value (50/50 chance for over/under) for chi-square is 8 (the "degrees of freedom" number).
Moussavi's number is perfectly normal (p >25% of being that large).
Razaee's number is low but not freakishly so (p between 5% and 10% of being that low; low chi-squares mean data that is "too good", deliberately tailored to match the Benford percentages, but I doubt the Iranians have that kind of sophistication).
Ahmedinejad's number is on the borderline of statistical significance (5% chance of chi-square > 15.5, so his number has a p-value just over that .05 threshold of significance).
Karroubi's number is just horribly improbable.

Shaahin said...

These results look even stranger once you take into account the fact that in 2005 many people boycotted the election, while in 2009 about 85% voted, many of which were those who boycotted the previous election. And as you might know, the boycotters are those who think that reformists are not liberal enough, we need more radical changes. So, as you can guess, when boycotters decide to vote, they definitely won't vote for Ahmadinejad.

Besided, 85% turnout means people are pissed off with the current president.

Bradford said...

EARTH TO NATE!!!! This is your worst analysis ever!

How many times do people have to tell you that the 2005 elections were boycotted by a large swath of moderates and thus not comparable to 2009...

How many times do people have to tell you the votes boxes were stuffed in 2005 and thus not comparable to any year prior or since...

How many times do people have to tell you that you like the base "common sense" knowledge to do a numbers comparison on the ground in Iran..

Nate, I love you man, but STFU on Iran or find a clue!

Erik Swanson said...

Yes! This makes perfect sense! It's quite reasonable to wonder why Karroubi's votes went to the incumbent...until you consider ECONOMIC factors. NOBODY is talking about the fact that Mousavi is a total right-wing privatizer. He's basically the libertarian party candidate, if you will. Karroubi, on the other hand, is more of a traditional left-liberal.

Put it this way: if Dennis Kucinich ran for president in 2004 as an independent and got 10% of the vote, who do you think would get his votes in 2008? Obama, right? But Iran's politics aren't like that.

Imagine if you will that GW Bush had been an economic liberal...he passed single payer health care and expanded the welfare state while being hard right on foreign policy. Now imagine that the dems had nominated someone to the right of Obama on economics...like say, they convinced McCain to run as a democrat...so, Bush the right-wing socialist vs. McCain the middle-of the road reformer with right-wing economic policy. Wouldn't the Kucinich voters vote for this Bush?

This is a horribly tortured analogy. The fact is that the Iranian election is another fake "velvet revolution" to install an economic liberal. And NOBODY is reporting it. Be the one to stand up for truth!

diesel said...

Hey Nate,

Have you seen this photo essay (w/audio commentary) from the New Yorker covering one of Aahmadinejad's visits to Tabriz (Mousavi's "home town") in April.
It's about 2 minutes long. Well worth the time to give a bit of color to the rural discussion.

http://www.newyorker.com/online/multimedia/2009/04/13/090413_audioslideshow_ahmadinejad

It's only one data point, but it reminds me how little real information we're working with.

Cheers,
Diesel

freefun0616 said...

酒店經紀人,
菲梵酒店經紀,
酒店經紀,
禮服酒店上班,
酒店小姐兼職,
便服酒店經紀,
酒店打工經紀,
制服酒店工作,
專業酒店經紀,
合法酒店經紀,
酒店暑假打工,
酒店寒假打工,
酒店經紀人,
菲梵酒店經紀,
酒店經紀,
禮服酒店上班,
酒店經紀人,
菲梵酒店經紀,
酒店經紀,
禮服酒店上班,
酒店小姐兼職,
便服酒店工作,
酒店打工經紀,
制服酒店經紀,
專業酒店經紀,
合法酒店經紀,
酒店暑假打工,
酒店寒假打工,
酒店經紀人,
菲梵酒店經紀,
酒店經紀,
禮服酒店上班,
酒店小姐兼職,
便服酒店工作,
酒店打工經紀,
制服酒店經紀,
酒店經紀,

,