What is Correlation?

Correlation is a statistical measure of the strength and direction of the relationship between two variables. There are different types of correlation coefficients, such as the Pearson coefficient (which measures linear association) and the Spearman coefficient (a rank-based measure that captures monotonic relationships, including non-linear ones). They capture different kinds of statistical dependence, but neither establishes causation. Pearson’s coefficient, often called simply the correlation coefficient, is the covariance of the two variables divided by the product of their standard deviations (the square roots of their variances). It is closely tied to least-squares line fitting: its square is the fraction of the variance explained by the best-fit line. The coefficients range in value from -1 (perfect inverse correlation) to 1 (perfect direct correlation), with zero indicating no linear (Pearson) or monotonic (Spearman) correlation.
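
As a rough illustration of the two coefficients described above, the following Python sketch computes both from scratch using only the standard library. The function names (pearson, ranks, spearman) and the sample data are made up for this example and are not taken from any particular library. Spearman is computed here as Pearson applied to the ranks of the values, with tied values sharing their average rank.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson coefficient: covariance normalized by the square root of the variances."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length, at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of products of deviations; the 1/n factors cancel in the ratio.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return cov / sqrt(var_x * var_y)


def ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman coefficient: Pearson coefficient of the ranks."""
    return pearson(ranks(xs), ranks(ys))


# Hypothetical data: y grows with x, but along a curve (y = x cubed).
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]

print("Pearson: ", pearson(x, y))   # below 1, since the points are not on a line
print("Spearman:", spearman(x, y))  # 1, since y always increases with x
```

In this sample the ordering of the values agrees perfectly, so the rank-based coefficient reaches 1 while the linear coefficient stays below it.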


Why is Correlation Important?

Correlation is a first step in showing how closely connected the values of different variables might be. The common dictum “correlation does not imply causation” simply means that a measure of correlation is not sufficient proof of causation. Whether a correlation is useful for making predictions also depends on having enough data points and on understanding the relationship between the variables.
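
Two small cases may help show why data volume and an understanding of the relationship both matter. The Python sketch below is illustrative only: the pearson helper and the data values are assumptions made for this example. It shows that any two distinct points yield a "perfect" coefficient, and that a variable fully determined by another can still have a Pearson coefficient of zero when the relationship is not linear.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson coefficient: covariance normalized by the square root of the variances."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Case 1: too few data points. A line passes through any two points,
# so the coefficient is -1 or 1 regardless of any real relationship.
two_points = pearson([1, 2], [5, 3])

# Case 2: y is completely determined by x (y = x squared), yet the
# linear coefficient is zero because the relationship is symmetric, not linear.
xs = [-2, -1, 0, 1, 2]
ys = [x * x for x in xs]
curved = pearson(xs, ys)

print("Two points:      ", two_points)
print("y = x squared:   ", curved)
```

Neither result says much on its own: the first reflects the sample size rather than the variables, and the second misses a dependence that a plot of the data would make obvious.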


