In statistics, linear regression is a linear approach to modelling the relationship between a scalar response and one or more explanatory variables (also known as dependent and independent variables). The case of one explanatory variable is called simple linear regression; for more than one, the process is called multiple linear regression. This term is distinct from multivariate linear regression, where multiple correlated dependent variables are predicted, rather than a single scalar variable.In linear regression, the relationships are modeled using linear predictor functions whose unknown model parameters are estimated from the data. Such models are called linear models. Most commonly, the conditional mean of the response given the values of the explanatory variables (or predictors) is assumed to be an affine function of those values; less commonly, the conditional median or some other quantile is used. Like all forms of regression analysis, linear regression focuses on the conditional probability distribution of the response given the values of the predictors, rather than on the joint probability distribution of all of these variables, which is the domain of multivariate analysis.
Linear regression was the first type of regression analysis to be studied rigorously, and to be used extensively in practical applications. This is because models which depend linearly on their unknown parameters are easier to fit than models which are non-linearly related to their parameters and because the statistical properties of the resulting estimators are easier to determine.
Linear regression has many practical uses. Most applications fall into one of the following two broad categories:
If the goal is prediction, forecasting, or error reduction, linear regression can be used to fit a predictive model to an observed data set of values of the response and explanatory variables. After developing such a model, if additional values of the explanatory variables are collected without an accompanying response value, the fitted model can be used to make a prediction of the response.
If the goal is to explain variation in the response variable that can be attributed to variation in the explanatory variables, linear regression analysis can be applied to quantify the strength of the relationship between the response and the explanatory variables, and in particular to determine whether some explanatory variables may have no linear relationship with the response at all, or to identify which subsets of explanatory variables may contain redundant information about the response.Linear regression models are often fitted using the least squares approach, but they may also be fitted in other ways, such as by minimizing the "lack of fit" in some other norm (as with least absolute deviations regression), or by minimizing a penalized version of the least squares cost function as in ridge regression (L2-norm penalty) and lasso (L1-norm penalty). Conversely, the least squares approach can be used to fit models that are not linear models. Thus, although the terms "least squares" and "linear model" are closely linked, they are not synonymous.
Find the equation of the regression line for the given data. then construct A SCATTER PLOT of the data and draw the regression line. (each pair of variables has a significant correlation.) then use the regression equation to predict the value of y for each of the given x- values, if meaningful...
I really like the derivations here
http://en.wikipedia.org/wiki/Proofs_involving_ordinary_least_squares
Could some one recommend a good book for them. I'm tired of googling these equations every time I want to use them.
Thanks!
Homework Statement
We want to determine the coefficients of a polynomial of the form:
##p(x)=c_{1}x^2 +c_{2}x+c_{3}##The polynomial ##p(x)## must satisfy the constraint ##p(1)=1##.
We would also like ##p(x)## to satisfy the following 4 constraints:
##p(−1)=5##
##p(0)=−1##
##p(2)=6##...
This is for an experimental physics homework,I am using the latest version of MS Excel. I have a set of data, I perform linear regression on them and it gives me a line y=ax + b.
Given that both a and b have physical significance I would like to know how could I know the uncertainty...
I am using linear regression to predict 'y' based on 8 variables.
With my example, most the Betas that I got are negative. So, y, the value to predict, is negative.
To my data, y is a time in seconds, so I think it shouldn't be negative.
I my example in python, and I want to know if y...
Homework Statement
I'm looking through some example problems that my professor posted and this bit doesn't make sense
How do you come up with the values underlined?
Homework Equations
The Attempt at a Solution
Upon researching it, I find that you should use α/2 for both...
Sorry if I'm in the wrong subforum.
This is a rather simple and straightforward question, I hope.
I'm doing a measurement that requires me to do a linear regression on data points to get a value of the slope. The slope is the value of the actual property that I am measuring.
Assuming...
In a linear regression with 1 independent variable, if X is always the same (let's call I am unlucky), but Y present different values for the same X, I still can find the coefficient of the straight line equation?
Hi,
I am trying to understand how I find the error in linear regression, and what to do with it. I am using linear regression to predict the time of execution based on the size of the input and the number of tasks used in the computer to get the result.
1 - In a linear regression, I calculate...
I am trying to understand how linear regression and R-squared differ.
1 - Can anyone give me an example of use of linear regression and R-squared?
2 - They have some relation between them? E.g., they are useful for each other?
3 - What are the dangers when analysing the linear regression...
Homework Statement
Under the simple linear regression model Y= A + Bx + e, where A is the intercept (a known concept), B is the slope parameter (unknown) and e is a random error term satisfying the normality assumption.
If (X1,Y1)...(Xn,Yn) are the n data points observed, find the least squares...
Homework Statement
Why does excel give me this:
http://postimg.org/image/68b9z1lqt/
And various online calulators (for example http://www.alcula.com/calculators/statistics/linear-regression/), and my own calulations give me this:
http://postimg.org/image/kpljm4awx/
Homework...
Homework Statement
I have to design an experiment with 3 factors. One factor has to be quantitative with at least 3 levels. One Qualitative with at least 3 levels. And the last one can be either quant/qual with at least 2 levels.
My question is in regards to coding the variables. For example...
Homework Statement
Hi,
I need to create an experiment for my regression class and I would appreciate some ideas which would allow me to generate the data with minimal resources (preferably something on the computer where I can get data instantly)
The main criteria for the experiment...
I have posted this question before but I don't think I was clear on what i was trying to do exactly. I am trying to simulate a set of muon detecting drift tubes in 2d space. I have 2 sets of detector tubes (shown as black circles in the image), a particle trajectory goes through all tubes...
Hi,
I am trying to simulate muon paths through drift tubes and I have ran into a problem performing a linear regression. I have generated simulated muon trajectories in 2 dimensions and they passes through my simulated drift tubes represented as black circles with a '+' in the center. As the...
I wasn't sure where to put this question. Can anyone tell me what method LineFit uses to perform linear regression with error in both coordinates? Thank you.
I read about "Linear regression" and I want to make sure that what I read is right
Just tell if these equations are right-
Slope of line of regression for y on x is given by
m=\frac{E(XY)-E(X)E(Y)}{E(X^{2})-[E(X)]^{2}}
\\ m=\frac{Cov(XY)}{Var(X)}
\\ m=\frac{ρσ_{x}σ_{y}}{σ_{x}^{2}}
\\...
A text says that if you calculate the linear regression of data points and you get the equation y=mx+b with an r2 value, the error in the slope is given by:
δm/m=2(1-r)
No explanation was given. Could someone please explain this formula? Thanks!
βHomework Statement
Data y1,y2...yn are modeled as observations of random variables Y1,..Yn given by
Yi = α + β(xi-xbar) + σεi
Where α , β and σ are unknown parameters x1,x2...xn are known constants and xbar is
(1/n)Ʃxi and εi's are independent random variables each with the...
Homework Statement
Hello, I have done a laboratorial experiment (electron diffraction) and I've been doing the analysis of the obtained data.
I have plotted the data obtained experimentally, and the slope of the obtained linear regression should give a certain value.
What's happening...
I have couple questions about this and I was hoping someone with some stats knowledge could clarify.
First, when people report numbers such as 10 plus or minus 5, what does the 5 mean? Is it the standard deviation or the confidence interval or the variance? What is the relationship between...
Homework Statement
"You are asked to do an experiment where you will need to use a rotating blade to measure the wind speed. You measure the number of rotations of the blade at 10 different wind speeds, 10 times each and will make a linear fit to determine the wind speed as a function of...
Homework Statement
Suppose that data (x1,y1),(x2,y2),.?.,(xn,yn) is modeled with xi being non random and Yi being observed values of random variables Y1,Y2,...Yn which are given by
Yi = a + b(xi-xbar) + σεi
Where a, b, σ are unknown parameters and εi are independent random variables each...
I'm currently looking at a linear regression handout from Uni and there are two methods to calculate the equation. The Normal one is to find a and b for y=a+bx, the equations for a and b are given in the handout but I'll assume you're familiar with them. The simplified one is using
y = Bx +...
Hi all, I would like to understand the theory for determining outliers in the following scenario.
Let's say I am to fit a linear model to the data of house size v. sale price for a particular location.
And let's say I have a fairly good linear relationship, as house size increases, so does...
Hi everyone,
This is not a homework question. I just want to understand an aspect of linear regression better. The book "Applied Linear Models" by Kutchner et al, states that a linear regression model is of the form
Y_i = B_0 + B_1 X_i + \epsilon_i
where
Y_i is the value of the...
I've been told that their exists no perfect mathematical method of obtaining a line of best fit from a population of data.
This doesn't make a whole lot of sense to me, so I have made an attempt at doing such (see google docs link)...
Dear all,
let's say I want to know the elasticity constant of a spring (k), so I measure several times different values for the force applied to the spring, F, and the displacement of the spring, x.
So, for N measures, I have xi and Fi and their uncertainties.
Now, I'm really not an expert of...
Hi,
I've asked this question on another forum, but no response until now. Maybe I will have a little bit of luck here. So .. I have a problem. I have a set of 8 parameter and I use this parameters in order to compute a measure (I vary each parameter with a step of 50%). I would like to know...
Hi y'all, wondering if you could help me with this. I have a data set with a linear relationship between the independent and dependent variables. Both the depended and independent variables have error due to measurement and this error is not constant.
For example,
{x1, x2, x3, x4, x5}...
I've been trying to figure out how to do a linear regression on data with asymmetric x and y error bars (different for each data point). Any help would be much appreciated.
Homework Statement
[PLAIN]http://img822.imageshack.us/img822/4421/statsii.jpg
The Attempt at a Solution
Done parts (a) and (b). How do I do parts (c) and (d)?
Is the simple linear regression model just Y_i=\beta_0+\beta_1 X_i + \varepsilon_i where \varepsilon_i \stackrel...
I'm working through John Taylor's An Introduction to Error Analysis and so far this is the only problem I haven't been able to solve. I was hoping someone could lend me some insight.
The problem asks you to use error propagation to verify that the uncertainties in A and B for a line of the...
In the least square linear regression, say we have y=Xb+e (y,b,e are vector and X is matrix, y is observations, b is coefficient, e is error term)
so we need to minimize e'e=(y-Xb)'(y-Xb)=y'y-y'Xb-b'X'y+b'X'Xb we can take the derivative of e'e to b, and we can get the answer is 0-2X'y+2X'Xb...
I want to do multiple linear regression, but one of the requirements is the residuals to be normally distributed, and I can check that with QQplots but then the QQ plot shows it is about 95% of data fit into the normal line, but 5% is way off!
can I still proceed ?*or do I have to find a way...
Hello all;
I'm doing work for my job, and I've forgotten my statistics =(.
I first want to know if what I'm trying to do is possible.
I want to create a linear regression of the form Y = a * x1 + b * x2 + c.
http://imgur.com/Q4vGP"
As you can see, there is space that is grayed...
Homework Statement
Where can I find Y(subscript: 1) on the TI-83 plus calculator? I was working on Linear Regressions in the book and I came across a question that requires me to input Y1 on the screen but I can't find the 'y'.. how can I find it?
Homework Equations
The Attempt at...
Hi guys,
I have data of 20 peoples height, weight, calorie intake and skinfold thickness. I have carried out a regression of calorie on height, on weight and on height and weight. I have done the same thing for skinfold thickness. I then used R to work out the summary of results. each...
Hello,
In simple linear regression (or even in multiple linear regression) how does one prove that the coefficient of determination, given by
R^2 = \frac{SS_{Reg}}{SS_{Total}} = 1-\frac{SS_{Res}}{SS_{Total}}= 1-\frac{\sum_{i=1}^{n}(y_i-\hat{y}_i)^2}{\sum_{i=1}^{n}(y_i-\overline{y})^2}
is...
I am trying to figure out how to combine uncertainty (in x and y) into the standard error of the best fit line from the linear regression for that dataset.
I am plotting units of concentration (x) versus del t/height (y) to get a value for the flux (which is the slope)
I understand how...
Hi:
This is my first post and I'm not sure if this is the right forum. Please redirect if necessary.
I'm new to nonlinear regression, but from what I've read I realize that making "good" initial guesses for the model parameters is very important, otherwise a "best fit" may not result...
Hi Guys,
Can anyone recommend a code, preferably in fortran 90/77, or possibly in C, which provides a weighted linear regression of a 3 column file?
Natski
Homework Statement
Evaluate the partial effect of age of a firm on growth.(evaluated at the means)
Homework Equations
Growth=\beta0+\beta1age+\beta2age^2+\beta3size*age+\beta4plant*age
The Attempt at a Solution
We're supposed to do something like this...
In my textbook, the following results are proved in the context of SIMPLE linear regression:
∑e_i = 0
∑(e_i)(Y_i hat)= 0
I tried to modify the proofs to mutliple linear regression, but I am unable to do so, so I am puzzled...
Are these results also true in MULTIPLE linear regression...
In MULTIPLE linear regression, is it still true that the regression sum of squares is equal to
∑ (Y_i hat -Y bar)^2 ?
My textbook defines regression SS in the chapters for simple linear regression as ∑ (Y_i hat -Y bar)^2, and then in the chapters for multiple linear regression, the...
"Suppose that in a MULTIPLE linear regression analysis, it is of interest to compare a model with 3 independent variables to a model with the same response varaible and these same 3 independent variables plus 2 additional independent variables.
As more predictors are added to the model, the...