Regression
if x values are missing and x and y are related we are going to use x on y and if y values are given the Regression can be defined as a statistical method used to determine the strength and character of the relationship between one dependent variable (usually denoted by Y) and one or more independent variables (usually denoted by X). Regression analysis is a mathematical measure of the average relationship between two or more variables in terms of the original units of the data.
Equation of Regression Line
The regression line is a straight line that best fits the data points in a scatter plot. It is represented by the equation:
- Line of regression of on :
- Line of regression of on : 3) $$ x = a + by 4) $$x-\bar{x}=b_{xy}(y-\bar{y})$$
We can calculate the value of using
Properties of Regression Line
Average of the equation of regression line is the point . The regression line passes through this point.
- The regression line passes through the point .
- and
- Signatures of both and have to be the same as r.=
- The sign of r can is the sign of and . If r is positive, both and are positive, and if r is negative, both and are negative.
- is the solution to the equation
Least Squares Method
The least squares method is a standard approach in regression analysis to approximate the solution of overdetermined systems by minimizing the sum of the squares of the errors made in the results of every single equation.
To find the coefficients (a) and (b), we use the following equations:
These equations are derived from the condition that the sum of the squared differences between the observed and predicted values is minimized.
Multiple Regression
Lets say there is a regression line that goes by where is the dependent variable and and are the independent variables. The coefficients , , and are the regression coefficients.
We can summate on both the sides
References
Information
- date: 2025.03.26
- time: 16:57