Poll of Polls: Examining Margin of Error & Combining Polls

In summary, the CNN Poll of Polls includes the USA Today/Suffolk poll; the ABC News Poll conducted October 21-24; the CNN/ORC poll conducted October 20-23; the Quinnipiac University poll conducted October 17-18; and the Fox News poll conducted October 22-25.
  • #1
jobyts
227
64
If each individual poll has a margin of error, when you combine all the polls, why do they say poll of polls has no margin of error? Shouldn't the margin of error get multiplied when you combine many polls?
 
Physics news on Phys.org
  • #2
jobyts said:
why do they say

Who is they?
 
  • #3
jobyts said:
If each individual poll has a margin of error, when you combine all the polls, why do they say poll of polls has no margin of error? Shouldn't the margin of error get multiplied when you combine many polls?
Because "they" refers in this case to people who don't know what they are talking about.
 
  • #4
Pretty much all US media says that. As an example,
http://edition.cnn.com/2016/10/26/politics/cnn-poll-of-polls-october/

The CNN Poll of Polls, which does not have a margin of error, includes the USA Today/Suffolk poll; the ABC News Poll conducted October 21-24; the CNN/ORC poll conducted October 20-23; the Quinnipiac University poll conducted October 17-18; and the Fox News poll conducted October 22-25.

EDIT: Not sure if it is all media or just cnn.
 
Last edited:
  • #5
jobyts said:
Pretty much all US media says that.
That doesn't make it right.
 
  • Like
Likes 1oldman2
  • #7
phinds said:
That doesn't make it right.

Care to elaborate a bit more? Are you implying the margin of error should get multiplied and cnn is incorrectly and ignorantly stating that there is no margin of error.

The 3 outcome I can think of are
1. margin of error gets multiplied on each inclusion of individual polls.
2. margin of error does not change if you consider the sum of all the groups just as one larger group.
3. margin of error reduces with multiple groups, since the question bias is reduced, the dates of each individual poll is different (so a specific date sampling error is reduced), sample space is larger etc.

Statistical evidence would be the key to know how well they engineered their sampling criteria.
 
  • #8
I agree with Student100: what they seem to be saying is that they are not estimating the margin of error, not that they are estimating it and that it is zero.
 
  • Like
Likes russ_watters
  • #9
I interpreted the OP's original statement "why do they say poll of polls has no margin of error" as meaning there was no error. If they're just saying they can't figure out what it is, that's a different story.
 
  • #10
In meta-analysis papers in Biology you will often see something to the effect 'no margin of error determined'. Since a poll of polls (seems a bit illogical to me) is really analyzing multiple resultsets of polling data sets that seems a fair statement to me. You cannot lump data from many disparate methodologies into one amorphous blob and make sense of it.

However @phinds hidden assumption is correct - science reporting in the general media stinks, IMO. People come away with weird ideas. As if this were a revelation...
 
  • #11
phinds said:
I interpreted the OP's original statement "why do they say poll of polls has no margin of error" as meaning there was no error.

So did I, which is why I wanted him to tell us where he got this from.
 
  • #12
jim mcnamara said:
You cannot lump data from many disparate methodologies into one amorphous blob and make sense of it.

I wonder if that's true. Chebyshev's inequality gives some upper limit to the variance (with very weak assumptions). Perhaps some analogous model could be used for a case where you mix data from different sources.

Could be the upper limit would be so huge it is not worth calculating though.
 
  • #13
Borek said:
I wonder if that's true. Chebyshev's inequality gives some upper limit to the variance (with very weak assumptions). Perhaps some analogous model could be used for a case where you mix data from different sources.

Could be the upper limit would be so huge it is not worth calculating though.
How does Chebyshev help at all??
 
  • #14
"Poll of polls?" Isn't that on a par with combining the forecast tracks for hurricanes?
 
  • #15
Bystander said:
"Poll of polls?" Isn't that on a par with combining the forecast tracks for hurricanes?
Yes, and I would expect the result to have a narrower error margin, even if it is tough to quantify.
 
  • #16
micromass said:
How does Chebyshev help at all??

I am not saying it helps directly. I just wonder if there is no similar inequality that would say something about the upper limit of errors for the poll of polls.
 
  • #17
It is possible to combine the results of individual polls to obtain a meta-poll. It's also possible to calculate a margin of error for that meta-poll, and that margin of error will have a expected, general trend to be [itex] \frac{1}{\sqrt{N}} [/itex] of that of an individual poll, where [itex] N [/itex] is the number of individual polls (this assumes that the individual polls are polling the same thing [i.e., apples-to-apples] and that their individual margins of error are comparable).

As a simplified example, suppose we have [itex] N [/itex] polls for comparison. For the sake of simplicity, suppose all polls have identical margins of error, which is proportional to the the poll's standard deviation which I'll call [itex] \sigma [/itex]. (This is essentially saying that each poll is an equally valid predictor, even though each poll might give a unique prediction.)

We'll treat each poll as a random variable [itex] x_n [/itex] with a mean [itex] \mu_n [/itex] and a standard deviation [itex] \sigma_n [/itex], where [itex] \sigma_n = \sigma [/itex]: the same value for all polls.

Summing the results of all the polls into the random variable [itex] y [/itex],

[itex] y = x_1 + x_2 + x_3 + \dots + x_{N-1} + x_N [/itex]

gives the mean,

[itex] \mu_y = \mu_1 + \mu_2 + \mu_3 + \dots + \mu_{N-1} + \mu_N [/itex]

and variance

[itex] \sigma^2_y = \sigma^2_1 + \sigma^2_2 + \sigma^2_3 + \dots + \sigma^2_{N-1} + \sigma^2_N [/itex]

but since the individual variances are all the same in this simple example, we can say,

[itex] \sigma^2_y = N \sigma^2. [/itex]

and standard deviation

[itex] \sigma_y = \sqrt{N} \sigma. [/itex]

But in the end, we're not really interested in the sum but rather the average. So we scale the mean of the sum and the standard deviation of the sum by [itex] N [/itex].

[itex] \mu_{ave} = \frac{\mu_y}{N} [/itex]

[itex] \sigma_{ave} = \frac{\sigma_{ave}}{N} = \frac{\sqrt{N} \sigma}{N} = \frac{\sigma}{\sqrt{N}} [/itex]

And that last one is the kicker. It shows that when you combine multiple random variables, the average tends to reduce the "noise" by an amount [itex] \frac{1}{\sqrt{N}} [/itex].

----

The simple example above has a glaring limitation that it assumes that all the individual polls are created equal. In the real world that is not the case. Still, statisticians have mathematical tools to weigh the individual polls before combining, but that is getting out of the scope of this thread.

My point was just to say that it is possible to combine the results of polls into a meta-poll and still obtain statically significant results. The claim in the original post that the meta-poll has "no margin of error" is not true.

(This of course assumes that the individual polls are comparing apples-to-apples. It doesn't make any sense to combine a poll for the 2016, US presidential election with another poll regarding favorite ice-cream flavors, for example.)
 
Last edited:
  • Like
Likes Borek
  • #18
collinsmark said:
It is possible to combine the results of individual polls to obtain a meta-poll. It's also possible to calculate a margin of error for that meta-poll, and that margin of error will have a expected, general trend to be 1√N1N \frac{1}{\sqrt{N}} of that of an individual poll,
IIRC, only in the case that the polls are i) statistically independent, ii) the errors Gaussian. A big IF.
 
  • #19
Seems like the easiest way is to take the raw data from each sample, apply your own methodology, and recalculate error. Your sample size is larger than anyone poll, but still realitivity small compared to the entire voting population. You could then look at one polls methodology, apply it to the sample, and your MOE should then be improved compared to that poll.
 
  • #20
Also, what do you do if you get one poll that measures 40 +/- 1% and the other that measures 60 +/- 1%? I don't think 50 +/- 0.7% is the right answer.
 
  • Like
Likes Bystander
  • #21
mheslep said:
IIRC, only in the case that the polls are i) statistically independent, ii) the errors Gaussian. A big IF.
The errors do not have to be Gaussian. They can be of any probability distribution that you can think up (assuming the standard deviations are finite). The sum will approach being Gaussian, but that is not a requirement of the individual trials. That's part of the beauty of the Central Limit Theorem. :woot:

Of course we're assuming a certain level of statistical independence in the individual polls -- such as we can assume that a given individual poll is not going to base its results on that of another individual poll. Each individual poll is conducted individually. That's not an unrealistic assumption.
 
  • Like
Likes mheslep
  • #22
Vanadium 50 said:
Also, what do you do if you get one poll that measures 40 +/- 1% and the other that measures 60 +/- 1%? I don't think 50 +/- 0.7% is the right answer.
Is it not? Assuming that we're careful on how percentage and percentage points are used here, I think your method of combining is correct.

What I mean by that is assuming that the "40" and "60" are in units of percent, and the margin of error is +/- 1 percentage point then I think the method of combination is okay. Otherwise just get rid of the percent sign. Combine 40 units +/- 1 unit with 60 units +/- 1 unit to make 50 units +/- 0.7 units.

If something seems wrong here it is because you know right away that one or both of the original data points (original polls, if you will) represents a statistical fluke. Assuming the methodology that went into determining the margin of error to be +/- 1 unit is correct, it is very, very improbable that the initial data points will be different by a whopping 20 units. It's not necessarily impossible, but very improbable.

Initial data points like that would make me question the validity of the original 40 unit and 60 unit values, and the claim that their margin of error is +/- 1 unit. It's a hint that there might be something wrong there. Is it possible that these values are actually correct and are merely the result of random chance? Yes, it is possible, yet very improbable.
 

FAQ: Poll of Polls: Examining Margin of Error & Combining Polls

What is a "Poll of Polls"?

A "Poll of Polls" is a method used to combine the results of multiple polls in order to create a more accurate estimate of public opinion. This is done by taking the average of the results from each individual poll and accounting for the margin of error in each poll.

What is margin of error?

Margin of error is a measure of the accuracy of a poll. It represents the amount of sampling error in the results, meaning the potential difference between the poll results and the true opinion of the entire population. It is typically expressed as a percentage and is affected by the sample size and the level of confidence chosen for the poll.

Why is it important to consider margin of error when interpreting poll results?

Margin of error is important because it helps to gauge the reliability and accuracy of the poll results. A larger margin of error means there is a greater chance that the results are not an accurate representation of the entire population's opinion. It also helps to account for any potential biases or limitations in the sampling method used for the poll.

How are multiple polls combined in a "Poll of Polls"?

In a "Poll of Polls," multiple polls are combined by taking the average of the results from each individual poll. This helps to reduce the impact of any outliers or biases in a single poll and produces a more accurate estimate of public opinion. Additionally, the margin of error for each poll is taken into account to create a more precise result.

What are the limitations of using a "Poll of Polls"?

While "Poll of Polls" can provide a more accurate estimate of public opinion, it also has its limitations. It assumes that all of the individual polls are unbiased and equally reliable, which may not always be the case. Additionally, it does not take into account any changes in public opinion over time, as it is based on a collection of polls at a specific point in time.

Similar threads

3
Replies
72
Views
1K
Replies
102
Views
9K
Replies
9
Views
2K
Replies
10
Views
2K
Replies
26
Views
3K
Replies
15
Views
2K
Replies
7
Views
2K
Replies
10
Views
2K
Replies
9
Views
2K
Back
Top