- #1
imsolost
- 18
- 1
Hello all,
I am a bit lost with a problem and my reasoning and would like to hear your thoughts about it.
Problem is the following :
For a set of data yi, I need to find the best value for Am given :
"i" is an indice {1, 2, 3, ... up to something like 20}. "m" stands for "mass" (Am is a physical quantity named "mass activity").
Anyway, so I thought about using common least-square method...
The data yi is not a set of "exact" values though... It comes from a report from a laboratory and they give me a set of values, each with an associated standard deviation uncertainty. For some yi, value seems to be accurately measured since the standard deviation is small, while sometimes, for other values, it can be bit bigger.
So I thought about using weighted least-square method...
As i understand weighted LS, I write :
I force the condition :
And thus Am should look like :
where Wi=1/sigma²_i are the weights. So far, so good.
Now, here is the tricky part... Ki also is "uncertain" : Ki comes from an big expression containing 3 uncertain parameters : λ1, λ2, λ3. These λ1, λ2, λ3 are calculated from other datas with non-linear expressions, and I had to propagate the uncertainties on these datas using Monte-Carlo methods. So what I have is something like 1000 "sets" of slightly different λ1, λ2, λ3. I use these 1000 sets and my big Ki expression to generate a set of 1000 Ki.
So, the question becomes :
1) Should I use my WLS expression to calculate Am for each of the 1000 Ki, which would give me 1000 slightly different Am (and then i can happily get the mean, standard deviation or whatever on these 1000 Am values and I'm done).
2) Or should i use a regular LS expression (i.e. without weights) and handle the uncertainty on the yi with Monte-Carlo. So i mean getting something like 1000 random normally-distributed sampling around the yi's and given the associated standard deviation uncertainty from the laboratory. Then with 1000 set of yi, my previous 1000 set of Ki and the common LS formula, i would get 1000 slightly different Am (and then i can happily get the mean, standard deviation or whatever on these 1000 Am values and I'm done)
3) Or should I do both 1) AND 2), i.e. doing "2)" but using the WLS instead of the LS formula ?
At the time of writing this, I'm doing "3)" but i wonder if this is not a bit stupid. Am I accounting for these uncertainties twice and over-estimating the uncertainty on Am ?
Sorry if the post is a bit long but i really tried to explain the thing as detailed as possible. Also sorry if the question is a bit dumb/obvious but I think i thought about this problem a bit too much and i feel like my brain is totally biased / unable to think about this stuff anymore :D
Anyway, big thank you for your time !
I am a bit lost with a problem and my reasoning and would like to hear your thoughts about it.
Problem is the following :
For a set of data yi, I need to find the best value for Am given :
"i" is an indice {1, 2, 3, ... up to something like 20}. "m" stands for "mass" (Am is a physical quantity named "mass activity").
Anyway, so I thought about using common least-square method...
The data yi is not a set of "exact" values though... It comes from a report from a laboratory and they give me a set of values, each with an associated standard deviation uncertainty. For some yi, value seems to be accurately measured since the standard deviation is small, while sometimes, for other values, it can be bit bigger.
So I thought about using weighted least-square method...
As i understand weighted LS, I write :
I force the condition :
And thus Am should look like :
where Wi=1/sigma²_i are the weights. So far, so good.
Now, here is the tricky part... Ki also is "uncertain" : Ki comes from an big expression containing 3 uncertain parameters : λ1, λ2, λ3. These λ1, λ2, λ3 are calculated from other datas with non-linear expressions, and I had to propagate the uncertainties on these datas using Monte-Carlo methods. So what I have is something like 1000 "sets" of slightly different λ1, λ2, λ3. I use these 1000 sets and my big Ki expression to generate a set of 1000 Ki.
So, the question becomes :
1) Should I use my WLS expression to calculate Am for each of the 1000 Ki, which would give me 1000 slightly different Am (and then i can happily get the mean, standard deviation or whatever on these 1000 Am values and I'm done).
2) Or should i use a regular LS expression (i.e. without weights) and handle the uncertainty on the yi with Monte-Carlo. So i mean getting something like 1000 random normally-distributed sampling around the yi's and given the associated standard deviation uncertainty from the laboratory. Then with 1000 set of yi, my previous 1000 set of Ki and the common LS formula, i would get 1000 slightly different Am (and then i can happily get the mean, standard deviation or whatever on these 1000 Am values and I'm done)
3) Or should I do both 1) AND 2), i.e. doing "2)" but using the WLS instead of the LS formula ?
At the time of writing this, I'm doing "3)" but i wonder if this is not a bit stupid. Am I accounting for these uncertainties twice and over-estimating the uncertainty on Am ?
Sorry if the post is a bit long but i really tried to explain the thing as detailed as possible. Also sorry if the question is a bit dumb/obvious but I think i thought about this problem a bit too much and i feel like my brain is totally biased / unable to think about this stuff anymore :D
Anyway, big thank you for your time !
Last edited: