Pearson Correlation Coefficient and Linear Regression Dissertation

Exclusively available on IvyPanda Available only on IvyPanda

Calculating the Pearson Product-Moment Correlation Coefficient

The product-moment correlation coefficient allows for the calculation of two linearly dependent variables. In this case, the variables will be represented by x and y. The example to apply is a case of a pharmacy with an owner interested in knowing the time taken in the pharmacy (x) and money spent (y) by every tenth client in minutes and dollars, respectively. The assumption is that those who take a long time will spend more (positive correlation) and vice versa (Kenney & Keeping, 2011). To solve this puzzle, it is possible to compute the product-moment correlation coefficient denoted by r. The above case can be calculated as illustrated below.

We will write a custom essay on your topic a custom Dissertation on Pearson Correlation Coefficient and Linear Regression
808 writers online

Step 1

Step 1

Step 2: Eliminate all the incomplete pairs. In doing this, observations with known values of x and y are included, even if the values are zero as illustrated in figure 2 below.

Eliminate all the incomplete pairs. In doing this, observations with known values of x and y are included, even if the values are zero as illustrated in figure 2 below.

Step 3: Step three entails summarizing the resulting data into distinct values for computation by using the following signs.

n: Summation of pairs of data

1 hour!
The minimum time our certified writers need to deliver a 100% original paper

Σ(x2): Summation of x values squared

Σx: Summation of the x values

Σ(x*y): Summation of each value of x which is multiplied by the y value that corresponds to the x values

Σy: Summation of the y values

Σ(y2): Summation of the y values squared

The above x and y values give the following figures when computed.

Step 4: Compute the ssxy, ssxx, and ssyy with the above values as summarized below.

Remember! This is just a sample
You can get your custom paper by one of our expert writers

Step 4: Compute the ssxy, ssxx, and ssyy with the above values as summarized below.

ssxy=Σxy-(ΣxΣy÷n)=283-(12*93/5)=59.8

ssxx=Σx2-(ΣxΣx÷n)=40-(12*12/5)=11.2

ssyy=Σy2-(ΣyΣy÷n)=2089-(93*93/5)=359.2

Step 5: The resulting values should then be inserted into the initial equation for the Pearson coefficient as illustrated below.

r=ssxy/(ssxx*ssyy)**0.5=59.8/(11.2*359.2)**0.5=0.9428

Step 5: Interpreting the results

Option 1: When the value is close to 1, it indicates that there is a strong and positive correlation (Howell, 2016).

We will write
a custom essay
specifically for you
Get your first paper with
15% OFF

Option 2: When the value is very close to zero, it indicates no correlation.

Option 3: When the value is close to -1, it indicates that there is a strong and negative correlation.

In this case, since the value is 0.9428, which is very close to 1, there is a strong positive correlation between time and money spent in the pharmacy.

Calculating Simple Linear Regression

A simple linear regression explains the link between variables with the use of a straight line (Mugenda & Mugenda, 2013). For instance, consider data collected from a health center on different tests where each yield is associated with the temperature reaction as summarized in the figure below.

Calculating Simple Linear Regression

The above data can be entered in Microsoft Excel and a simple scatter plot may be derived as indicated below. The variables of yield and temperature values are represented by yi and xi respectively.

Scatter plot

From the above scatter plot, there is no single line that can touch all the points. This is an indication that there is no linear relationship between yield and temperature for the tests. However, the scatter plot seems to suggest that a straight line might be drawn to touch specific points within the table. From a statistical perspective, the relationship between the x and y variables can be summarized in the equation below.

Formula

This means that the Y is assumed to be following a linear relation s summarized in the equation below.

Formula

The assumption in deriving a simple linear regression is that the values of Y are the summation of the E(Y) (mean value) and random error as summarized below.

Formula

References

Howell, D. (2016). Fundamental statistics for the behavioral sciences. New York, NY: Cengage Learning.

Kenney, J. F., & Keeping, E. S. (2011). Linear regression and correlation. Princeton, NJ: Van Nostrand.

Mugenda, C., & Mugenda, O. (2013). Applied statistics in research. Capetown, SA: CapeHouse Publishers.

Print
Need an custom research paper on Pearson Correlation Coefficient and Linear Regression written from scratch by a professional specifically for you?
808 writers online
Cite This paper
Select a referencing style:

Reference

IvyPanda. (2022, July 21). Pearson Correlation Coefficient and Linear Regression. https://ivypanda.com/essays/pearson-correlation-coefficient-and-linear-regression/

Work Cited

"Pearson Correlation Coefficient and Linear Regression." IvyPanda, 21 July 2022, ivypanda.com/essays/pearson-correlation-coefficient-and-linear-regression/.

References

IvyPanda. (2022) 'Pearson Correlation Coefficient and Linear Regression'. 21 July.

References

IvyPanda. 2022. "Pearson Correlation Coefficient and Linear Regression." July 21, 2022. https://ivypanda.com/essays/pearson-correlation-coefficient-and-linear-regression/.

1. IvyPanda. "Pearson Correlation Coefficient and Linear Regression." July 21, 2022. https://ivypanda.com/essays/pearson-correlation-coefficient-and-linear-regression/.


Bibliography


IvyPanda. "Pearson Correlation Coefficient and Linear Regression." July 21, 2022. https://ivypanda.com/essays/pearson-correlation-coefficient-and-linear-regression/.

Powered by CiteTotal, free referencing tool
If you are the copyright owner of this paper and no longer wish to have your work published on IvyPanda. Request the removal
More related papers
Cite
Print
1 / 1