In statistics, regression
analysis includes many techniques for modeling and analyzing several
variables, when the focus is on the relationship between a dependent
variable and one or more independent
variables. More specifically, regression analysis helps one
understand how the typical value of the dependent variable changes when any one
of the independent variables is varied, while the other independent variables
are held fixed. Most commonly, regression analysis estimates the conditional expectation of the
dependent variable given the independent variables — that is, the average value of
the dependent variable when the independent variables are held fixed. Less
commonly, the focus is on a quantile,
or other location
parameter of the conditional distribution of the dependent
variable given the independent variables. In all cases, the estimation target
is a function of
the independent variables called theregression function. In regression
analysis, it is also of interest to characterize the variation of the dependent
variable around the regression function, which can be described by aprobability distribution.
Regression
analysis is widely used for prediction and forecasting,
where its use has substantial overlap with the field of machine learning.
Regression analysis is also used to understand which among the independent
variables are related to the dependent variable, and to explore the forms of
these relationships. In restricted circumstances, regression analysis can be
used to infer causal
relationships between the independent and dependent variables.
A large body
of techniques for carrying out regression analysis has been developed. Familiar
methods such as linear
regression and ordinary
least squares regression are parametric,
in that the regression function is defined in terms of a finite number of
unknown parameters that
are estimated from the data. Nonparametric regression refers to
techniques that allow the regression function to lie in a specified set of functions,
which may be infinite-dimensional.
The
performance of regression analysis methods in practice depends on the form of
the data generating process, and how it relates
to the regression approach being used. Since the true form of the
data-generating process is in general not known, regression analysis often
depends to some extent on making assumptions about this process. These
assumptions are sometimes (but not always) testable if a large amount of data
is available. Regression models for prediction are often useful even when the
assumptions are moderately violated, although they may not perform optimally.
However, in many applications, especially with small effects or
questions of causality based
on observational
data, regression methods give misleading results.
1 komentar:
based on the article above regression analysis includes many techniques for modeling and analyzing several variables, when the focus is on the relationship between a dependent variable and one or more independent variables. More specifically, regression analysis helps one understand how the typical value of the dependent variable changes when any one of the independent variables is varied, while the other independent variables are held fixed. Regression analysis is widely used for prediction and forecasting, where its use has substantial overlap with the field of machine learning. Regression analysis is also used to understand which among the independent variables are related to the dependent variable, and to explore the forms of these relationships. The performance of regression analysis methods in practice depends on the form of the data generating process, and how it relates to the regression approach being used.
Posting Komentar