- #1
4quila
- 3
- 0
Homework Statement
In his paper "regression towards mediocrity in hereditary stature" Francis Galton collects data on stature of children (928 obs.) & parents (205 obs.) now as part of my project i have been asked to recreate the original data set. However taking children the data set runs from 64.5,65.5,...,72.5 inches with corresponding numbers of obs. Fair enough. However we have 14 observations labelled "below" and 4 labelled "above" and the problem statement is;
"For objects labelled above or below you must assume some particular values. Please state these explicitly in a table and justify with one sentence."
Homework Equations
The Attempt at a Solution
Clearly if any of these above or below terms are large outliers they will have an impact on the regression so i want to avoid that. But i am struggling to think of a value for these variables to take? Do i assign them values so as to ensure the mean for example is unaffected? Or give them values so they form a nice looking histogram?
I guess in short i am asking if there is a common practice for this kind of thing? Or if it is just an arbritary assignment as long as i can justify it plausibly?
All help much appreciated.