Multiple correlation: Difference between revisions

From Glossary of Meteorology
No edit summary
m (Rewrite with Template:Term and clean up)
 
(One intermediate revision by the same user not shown)
Line 1: Line 1:
 
{{Term
 
|Display title=multiple correlation
{{TermHeader}}
|Definitions={{Definition
{{TermSearch}}
|Num=1
 
|Meaning=The [[correlation]] between a [[random variable]] and its [[regression function]].
<div class="termentry">
|Explanation=If ''Y'' denotes the regression function of a random variable ([[variate]]) ''y'' with respect to certain  other variates ''x''<sub>1</sub>, ''x''<sub>2</sub> . . ., ''x''<sub>''n''</sub> then the [[coefficient of multiple correlation]] between ''y'' and the ''x'''s  is defined as the coefficient of simple, [[linear correlation]] between ''y'' and ''Y''. However, the constants  of the regression function automatically adjust for algebraic sign, with the result that the [[coefficient of correlation|coefficient  of correlation]] between ''y'' and ''Y'' cannot be negative; in fact, its value is precisely equal to the ratio  of their two [[standard deviations]], that is, &#x003c3;(''Y'')/&#x003c3;(''y''). Therefore, the coefficient of multiple correlation  ranges from 0 to 1, and the square of the coefficient of multiple correlation is equal to  the [[relative reduction]] (or [[percent reduction]]), that is, the ratio of [[explained variance]] to [[total variance|total  variance]]. Since, in practice, the true regression function ''Y'' is seldom known, it is ordinarily necessary  to hypothesize its mathematical form and determine the constants by [[least squares]], thus obtaining  the approximation ''Y''&prime;. In that case, the conventional estimate of the multiple correlation is the  [[sample]] value of the simple linear correlation (symbol ''R'') between ''y'' and ''Y''&prime;, although a better  estimate is obtained by incorporating a correction for [[degrees of freedom]]. Such a corrected value  ''R''&prime; is given as follows:  <blockquote>[[File:ams2001glos-Me35.gif|link=|center|ams2001glos-Me35]]</blockquote> where ''N'' denotes the sample size and ''n'' + 1 equals the total number of constants (including the  [[absolute]] term) determined from the data. In case (''N'' - 1) ''R''<sup>2</sup> &lt; ''n'', the value of ''R''&prime; is taken as  zero. <br/>''See'' [[regression]].
  <div class="term">
}}
== multiple correlation ==
}}
  </div>
 
<div class="definition"><div class="short_definition">The [[correlation]] between a [[random variable]] and its [[regression function]].</div><br/> <div class="paragraph">If ''Y'' denotes the regression function of a random variable ([[variate]]) ''y'' with respect to certain  other variates ''x''<sub>1</sub>, ''x''<sub>2</sub> . . ., ''x''<sub>''n''</sub> then the [[coefficient of multiple correlation]] between ''y'' and the ''x'''s  is defined as the coefficient of simple, [[linear correlation]] between ''y'' and ''Y''. However, the constants  of the regression function automatically adjust for algebraic sign, with the result that the [[coefficient  of correlation]] between ''y'' and ''Y'' cannot be negative; in fact, its value is precisely equal to the ratio  of their two [[standard deviations]], that is, &#x003c3;(''Y'')/&#x003c3;(''y''). Therefore, the coefficient of multiple correlation  ranges from 0 to 1, and the square of the coefficient of multiple correlation is equal to  the [[relative reduction]] (or [[percent reduction]]), that is, the ratio of [[explained variance]] to [[total  variance]]. Since, in practice, the true regression function ''Y'' is seldom known, it is ordinarily necessary  to hypothesize its mathematical form and determine the constants by [[least squares]], thus obtaining  the approximation ''Y''&prime;. In that case, the conventional estimate of the multiple correlation is the  [[sample]] value of the simple linear correlation (symbol ''R'') between ''y'' and ''Y''&prime;, although a better  estimate is obtained by incorporating a correction for [[degrees of freedom]]. Such a corrected value  ''R''&prime; is given as follows:  <div class="display-formula"><blockquote>[[File:ams2001glos-Me35.gif|link=|center|ams2001glos-Me35]]</blockquote></div> where ''N'' denotes the sample size and ''n'' + 1 equals the total number of constants (including the  [[absolute]] term) determined from the data. In case (''N'' - 1) ''R''<sup>2</sup> &lt; ''n'', the value of ''R''&prime; is taken as  zero. <br/>''See'' [[regression]].</div><br/> </div>
</div>
 
{{TermIndex}}
{{TermFooter}}
 
[[Category:Terms_M]]

Latest revision as of 12:24, 29 March 2024

The correlation between a random variable and its regression function.
If Y denotes the regression function of a random variable (variate) y with respect to certain other variates x1, x2 . . ., xn then the coefficient of multiple correlation between y and the x's is defined as the coefficient of simple, linear correlation between y and Y. However, the constants of the regression function automatically adjust for algebraic sign, with the result that the coefficient of correlation between y and Y cannot be negative; in fact, its value is precisely equal to the ratio of their two standard deviations, that is, σ(Y)/σ(y). Therefore, the coefficient of multiple correlation ranges from 0 to 1, and the square of the coefficient of multiple correlation is equal to the relative reduction (or percent reduction), that is, the ratio of explained variance to total variance. Since, in practice, the true regression function Y is seldom known, it is ordinarily necessary to hypothesize its mathematical form and determine the constants by least squares, thus obtaining the approximation Y′. In that case, the conventional estimate of the multiple correlation is the sample value of the simple linear correlation (symbol R) between y and Y′, although a better estimate is obtained by incorporating a correction for degrees of freedom. Such a corrected value R′ is given as follows:
ams2001glos-Me35
where N denotes the sample size and n + 1 equals the total number of constants (including the absolute term) determined from the data. In case (N - 1) R2 < n, the value of R′ is taken as zero.
See regression.
Copyright 2024 American Meteorological Society (AMS). For permission to reuse any portion of this work, please contact permissions@ametsoc.org. Any use of material in this work that is determined to be “fair use” under Section 107 of the U.S. Copyright Act (17 U.S. Code § 107) or that satisfies the conditions specified in Section 108 of the U.S.Copyright Act (17 USC § 108) does not require AMS’s permission. Republication, systematic reproduction, posting in electronic form, such as on a website or in a searchable database, or other uses of this material, except as exempted by the above statement, require written permission or a license from AMS. Additional details are provided in the AMS Copyright Policy statement.