Uncertainty of histogram bins from MC-simulation

In summary: N. The third is from the Poisson Distribution: Poisson (N, mean=μ, variance=σ) where μ is the mean and σ is the variance.
  • #1
minits
13
0
TL;DR Summary
How to determine the uncertainty of histogram bins when the entries result from a MC-simulation
Hello everyone,

I calculated the matrix element of a parton level process and determined the total cross section via a MC-simulation. Then I wanted to look at some differential distributions like the differential cross section with respect to the energy of one of the particles in the final state. In order to do this I calculate the differential cross section for 10.000 phase-space points and then do a case analysis in which bin to add the respective result. Now I am stuck with assigning an uncertainty to each bin. How does this work? The differential cross sections do not carry any uncertainty since they are calculated analytically. One approach I often read is the use of the poissonian distibution but I do not think it is the proper way since I feel like I am not really doing a counting experiment. Thanks for any advice!
 
Physics news on Phys.org
  • #2
Can you show us the histogram? If you want to protect your data, feel free to apply a blind to the axes. I'd just like a sense for how many points fall in each bin before I say something stupid.

I know next to nothing about the physics of what you're studying; however, I'm confused how you could be generating a histogram from MC results and not be doing a counting experiment? Can you explain why you think you're not doing counting? Where I'm going with this is that all histograms are counting. The Poissonian distribution tells you the estimated uncertainty on the number of MC results in each bin (in the appropriate limit of many MC results).
 
  • #3
Here is the histogram you asked for. I am just not firm with the application of the poisson distribution since everyone says it kind of always applies in counting experiments but I still don't know why that is.
A second thing that I am unsure about is if the error from the MC ist poissonian I should take sqrt(N) as the errors for each bin but I have 10.000 events and in every event another number flows into one of the bins. So if I have 100 events flowing into one bin and the value of these events sum up to two the error won't be 10. So probably I should interpret the specific values as a weight which comlicates things.
 

Attachments

  • hist.PNG
    hist.PNG
    3.5 KB · Views: 191
Last edited:
  • #4
Ok, seeing that I feel more confident saying Poisson is what you want.

I could give you my spiel about why the Poisson distribution is appropriate, but there are probably 1000's of folks online that have already given good explanations and put more time in than I would have. This video might be a good place to start.

The technical reason for why the poisson distribution is appropriate is that it is the limit of the binomial distribution as the number of trials goes to infinity and the mean number of successes is kept finite. Whenever you skim a few outcomes from a much larger pot of outcomes (raindrops on a roof, electrons over a threshold, photon shot noise, or points in a histogram bin), the Poisson distribution is what you need. If what you're asking is how you assign error bars to your histogram plot, take the square root of the number of points in each bin (that's the signature feature of the PD).
 
  • #5
You are counting phase space points. If they are selected randomly then Poisson-like uncertainties should be right (your events have weights, so it's not a simple sqrt(N)). If they come from some grid then the uncertainty will be smaller. If computing power allows you can repeat the analysis with different grid patterns and compare the results to estimate the uncertainty.

Just from looking at your plot, bin-by-bin variations are probably very small.
 
  • Like
Likes Twigg
  • #6
  • #7
Thanks for your replies! I actually took the info for the uncertainties from the webseite you linked. However I did not understand the following equations:
var(w_i * 1 event) = w_i^2 * var(1 event) = w_i^2
Do you know why the these three expressions are equal? In the first relation why why is it the square of the weight? And after the second equal sign why is the variance of 1 event 1 and not 0? They also talk about
"poissonian fluctuation of the number of events" but where is this used explicitly?
 
  • #8
I'm on my phone so forgive the lack of LaTeX.

The first equality is error propagation. Var[a*x] = a^2 Var[x]. The second is because for poissonian Var[N events] = N.
 

FAQ: Uncertainty of histogram bins from MC-simulation

What is the purpose of using MC-simulation to determine the uncertainty of histogram bins?

MC-simulation allows us to generate a large number of random samples based on a given probability distribution. By analyzing these samples, we can estimate the uncertainty of the histogram bins and better understand the behavior of the underlying system.

How does MC-simulation work to determine the uncertainty of histogram bins?

In MC-simulation, we use random numbers to generate a large number of samples that follow the same probability distribution as the system we are studying. These samples are then used to calculate the mean, standard deviation, and other statistical measures that can help us determine the uncertainty of the histogram bins.

What are some advantages of using MC-simulation for determining the uncertainty of histogram bins?

One advantage is that it allows us to study complex systems that cannot be easily modeled mathematically. Additionally, MC-simulation is flexible and can be adapted to different types of systems and probability distributions. It also provides a more accurate estimate of uncertainty compared to traditional methods.

Are there any limitations to using MC-simulation for determining uncertainty of histogram bins?

One limitation is that MC-simulation requires a large number of samples to be generated in order to obtain accurate results. This can be computationally expensive and time-consuming. Additionally, the results may be affected by the quality of the random number generator used.

How can the uncertainty of histogram bins from MC-simulation be used in scientific research?

The uncertainty of histogram bins can provide important insights into the behavior and characteristics of a system. It can be used to validate or improve existing models, as well as guide future experiments and research. Additionally, it can help researchers make more informed decisions and draw more accurate conclusions from their data.

Similar threads

Replies
9
Views
2K
Replies
21
Views
3K
Replies
3
Views
7K
Replies
0
Views
1K
Replies
3
Views
2K
Replies
10
Views
4K
Back
Top