- #1
botbml
- 1
- 0
Hi all,
I'm a biologist with a mathematical and statistical question. I have a dataset of measurements of the maximal cross-sectional width of icosahedral bodies inside bacteria. Since they are icosahedral, the cross-sectional width (if precisely through the middle of the object) is a measure of the diameter of the circumscribed sphere of the icosahedron, from which I should be able to calculate dimensional data. The problem is that, of course, I essentially have a dataset of values whose mean will be less than the average circumscribed sphere diameter!
Consider, as an example, determining the average volume of non-uniform (but similarly sized) balls in a box where the only known data is from a set of cross-sectional measurements of a sample of the balls. The measurements will be a range of diameters as the cross-section will sometimes be through the middle, sometimes near the top and sometimes near the bottom of the ball, and other values in between. The maximum measured diameter will be representative of the actual mean diameter of all the balls, but it will be one of very few measurements as outliers of a normal distribution. In addition, given that the diameter of balls in the box vary, the large size outliers may also represent a small cross-sectional measurement of a very large ball! How can I calculate a value close to the mean diameter of all balls in the box given that the value I need to use is poorly represented in the dataset?? Any help appreciated. Any further questions, please post.
Thanks and cheers,
botbml
An example dataset is attached. If anyone has any clues.
I'm a biologist with a mathematical and statistical question. I have a dataset of measurements of the maximal cross-sectional width of icosahedral bodies inside bacteria. Since they are icosahedral, the cross-sectional width (if precisely through the middle of the object) is a measure of the diameter of the circumscribed sphere of the icosahedron, from which I should be able to calculate dimensional data. The problem is that, of course, I essentially have a dataset of values whose mean will be less than the average circumscribed sphere diameter!
Consider, as an example, determining the average volume of non-uniform (but similarly sized) balls in a box where the only known data is from a set of cross-sectional measurements of a sample of the balls. The measurements will be a range of diameters as the cross-section will sometimes be through the middle, sometimes near the top and sometimes near the bottom of the ball, and other values in between. The maximum measured diameter will be representative of the actual mean diameter of all the balls, but it will be one of very few measurements as outliers of a normal distribution. In addition, given that the diameter of balls in the box vary, the large size outliers may also represent a small cross-sectional measurement of a very large ball! How can I calculate a value close to the mean diameter of all balls in the box given that the value I need to use is poorly represented in the dataset?? Any help appreciated. Any further questions, please post.
Thanks and cheers,
botbml
An example dataset is attached. If anyone has any clues.