- #36
Stephen Tashi said:Ok, apparently "interaction terms" is a standard terminologyi the social sciences for "products of variables". It looks like the stuff on this slide show: http://www.google.com/url?sa=t&rct=...HhnYEo&usg=AFQjCNExqJvcPRmlDyn1Gpg9zt_okXJ2DA
Stephen Tashi said:Suppose we have a model that predicts L as a sum of unknown constants [itex] C [/itex] times known functions [itex] f_i [/itex] of independent variables [itex] x_i [/itex].
Stephen Tashi said:If we have enough data then we can use linear regression to fit this model to the data even if the functions [itex] f_i(x_1,x_2,..x_n) [/itex] are non-linear because we can compute the values of the functions on each vector of observed data and treat each [itex] f_i [/itex] as an independent variable ( even though it depends on the [itex] x_i ) [/itex]).
Do you think the person who wrote the company model thought along these lines?
mdhastings said:Thanks MrAnchovy
No I'm saying the artificial weather station made from 6 stations weather goes into the initial equation - the Meteorology weather forecast replaces that data in the prediction function for the load forecast. What would your methodology be to find the weights?
Going back to the weighting then, I am not a maths thinker but it seems to me that
[itex] L = \sum_{i=1}^{ N_f} C f_i(x_1,x_2,..x_n) [/itex] should be modifiable. That is lose some of the terms [itex] f_i(x_1,x_2,..x_n) [/itex] and set the restriction on the weather coefficients to be positive (how?).
I am programming in R but this can be a tricky languageStephen Tashi said:From a mathematical point of view, there are many things that can be done, but from a practical point of view the question is whether you can do them. Are you a skilled and experienced programmer?
Stephen Tashi said:This problem is obviously one that requires a programmer and it would help if that person was a competent mathematician. You've received several suggestions that a mathematican would understand and that a programmer could implement.
I don't have a good picture of the "office politics" side of this scenario. Logic would say that if a company relies heavily on a program and it needs to be fixed or replaced, they would hire an expert to do the work - perhaps a consultant (- and not me since I'm happily retired). Of course, I know that Logic isn't the primarly consideration in management.
mdhastings said:In the modified equation I was talking about in my last post how would I set it up to ensure positive coefficients on the weather variables
I can help here [the above is not right]... recall that the interaction terms are products - so [itex] wt1.\sin(\omega t). dow1. \sin(\lambda t)[/itex] is an 4 way interaction term. wt1 would be a temp (say 10C), the dow1 dummy term is 1 for Monday (otherwise 0) and the 2 sine terms are sequence from -1 to 1 for a day's 48 intervals (say -1) and a year the same but across 17520 intervals (say 0.0154) - so the data for this interaction term is their product: 10*-1*1* 0.0154 = -0.154. This is the value for say the 8:30am interval for some day (Monday) of the year. The 9:00am interval will have all terms slightly changed (except the dow1 with Monday). The next interaction term may include a dow2 (Tuesday) and hence the product then is zeroed.Stephen Tashi said:I don't see any simple way to set up the linear regression to solve for a new set of weights on the weather stations (even one that need not sum to one) because the regression includes "interaction" terms. For example, if a variable like "temperature" appears inside a sin(...) function
Stephen Tashi said:If there is no obvious connection with the current set of weights and the demographics of the city, why do you think changing the weights will better represent the demographics? Are you focusing on the weights of the weather measurements merely because they are the only undocumented constants in the program?
mdhastings said:I can help here [the above is not right]... recall that the interaction terms are products - so [itex] wt1.\sin(\omega t). dow1. \sin(\lambda t)[/itex] is an 4 way interaction term. wt1 would be a temp (say 10C), the dow1 dummy term is 1 for Monday (otherwise 0) and the 2 sine terms are sequence from -1 to 1 for a day's 48 intervals (say -1) and a year the same but across 17520 intervals (say 0.0154) - so the data for this interaction term is their product: 10*-1*1* 0.0154 = -0.154. This is the value for say the 8:30am interval for some day (Monday) of the year. The 9:00am interval will have all terms slightly changed (except the dow1 with Monday). The next interaction term may include a dow2 (Tuesday) and hence the product then is zeroed.
No in the sense that the weather stations are not matched to areas of population and unfortunately this again is because of closed market and the necessary regulations.
I don't know how changing the weights of weather measurements can represent a growth in the population.The weights to the stations were set many years ago to represent the demographics - since the city has grown then the weights should be changed.
Thanks Stephen, The way the program is set up shows some interactions with a weather variable (either temp or dew point). The main focus of the program is on matching the days load shape. For that, most are not weather interactions.Stephen Tashi said:Are you saying that all terms of the model are linear in the weather variables? No function like sin(...) has an argument that depends indirectly on the weather variables? There are no terms involving the product of two weather variables?
Stephen Tashi said:I don't understand how these two statements jibe.
I don't know how changing the weights of weather measurements can represent a growth in the population.