The Regression Analysis: Econometrics Issues Report

Exclusively available on Available only on IvyPanda® Made by Human No AI

Regression analysis is a statistical tool that is used to develop and approximate linear relationships among various variables. Regression analysis formulates an association between several variables. When coming up with the model, it is necessary to separate between dependent and independent variables. Regression models are used to predict trends of future variables (Arnold 2011). The paper carries out simple regression analysis to establish the relationship between salaries and the number of years the employees have worked in the organization. Further, scatter plots will be drawn to establish the relationship between the variables. Finally, confidence intervals will be calculated to determine the range within the average values. The analysis will be based on a sample of 135 observations.

Descriptive statistics

Table 1.0 presented below shows a summary of the descriptive statistics.

SALARIESYEARS_COMPYEARS_SENIORS
Mean90.6185222.992597.8
Median75.2236
Maximum529.94537
Minimum1020
Std. Dev.62.572211.674376.40988
Skewness3.034276-0.1173781.518331
Kurtosis19.855051.876145.992716
Jarque Bera1805.1767.414711102.2494
Probability00.0245420
Observations135135135

From the table presented above, the mean and median for salaries are 90.62 and 75.2 respectively. The maximum and minimum values of salaries yield a range of 519.9. The standard deviation, a measure of dispersion, for salaries is 62.57. This implies that the mean can deviate by up to 62.57. The value of skewness (3.03), Kurtosis (19.86) and Jarque Berra (1805.176) indicates that the observations are not normally distributed. In a normal distribution, the value of skewness and Kurtosis should be zero. The mean and median for the number of years they have been senior officers are 22.99 and 23 respectively. The maximum and minimum values yield a range of 45 years. The standard deviation is 11.67. The value of skewness (-0.11), Kurtosis (1.88) and Jarque Berra (7.41) indicates that the observations are not normally distributed. Finally, the mean and median for the number of years they have worked for the company are 7.8 and 6 respectively. The maximum and minimum values yield a range of 37 years. The standard deviation is 6.41. The value of skewness (1.52), Kurtosis (5.99) and Jarque Berra (102.25) indicates that the observations are not normally distributed (Verbeek 2008).

Scatter plots

A scatter diagram is a graph that plots two variables on a Cartesian plane. Scatter diagram tries to establish if there exists a linear relationship between the two variables plotted. This can be observed by looking at the trend of the scatter plots. The independent variable is plotted on the x – axis while the dependent variable is on the y – axis. In this case, salary will be plotted on the y – axis while the number of years will be plotted on the x – axis (Greene 2003). Different graphs will be plotted for each explanatory variable.

Scatter plot for salary level against the number of years they have worked for the company.
Figure 1: Scatter plot for salary level against the number of years they have worked for the company.

In the graph above, it can be noted that as the number of years the employees have worked in the company increases, there is no significant change in salary. This implies that there seem to be no sign of the relationship between salary and the number of years the employees have worked in the company.

Scatter plot for salary level against the number of years they have been senior officers.
Figure 2: Scatter plot for salary level against the number of years they have been senior officers.

In the graph, it can be observed that the salary tends to increase as the number of years the employees have worked as seniors increases. The plots show an indication of a positive relationship between the two variables.

Simple regression

Simple regression develops a relationship between two variables. The dependent variable is salary while the independent variable is number of years. The regression line will take the form

Simple regression

when the ordinary least squares method is used (Asteriou & Hall 2011). The regression line can be simplified as shown below.

Simplified regression equation Y = b0 + b1X

Y = Salaries

X = the number of years they have worked for the company

The theoretical expectations are b0 can take any value and b1 > 0.

Regression results

Table 1.1 shows the regression results for salary level against the number of years they have worked for the company.

Dependent Variable: SALARY
Method: Least Squares
Date: 01/28/14 Time: 15:41
Sample: 1 135
Included observations: 135
VariableCoefficientStd. Errort-StatisticProb.
C84.7893311.961907.0882810.0000
YEARS_COMP0.2535250.4642330.5461160.5859
R-squared0.002237Mean dependent var90.61852
Adjusted R-squared-0.005265S.D. dependent var62.57220
S.E. of regression62.73669Akaike info criterion11.13047
Sum squared resid523473.7Schwarz criterion11.17351
Log likelihood-749.3070F-statistic0.298242
Durbin-Watson stat2.318641Prob (F-statistic)0.585901

The regression equation can be written as Y = 84.79 + 0.25X. The value of R-squared is 0.0022 while t-statistics is 0.5461 (it is less than t-tabulated at 95% confidence interval – 1.932). The results do not show a strong regression line.

Table 1.2 shows the regression results for salary level against the number of years they have been senior officers.

Dependent Variable: SALARY
Method: Least Squares
Date: 01/28/14 Time: 15:37
Sample: 1 135
Included observations: 135
VariableCoefficientStd. Errort-StatisticProb.
C77.398228.4033649.2103850.0000
YEARS_SENIOR1.6949110.8336022.0332370.0440
R-squared0.030146Mean dependent var90.61852
Adjusted R-squared0.022854S.D. dependent var62.57220
S.E. of regression61.85306Akaike info criterion11.10210
Sum squared resid508831.5Schwarz criterion11.14515
Log likelihood-747.3920F-statistic4.134052
Durbin-Watson stat2.255756Prob (F-statistic)0.044018

The regression equation can be written as Y = 77.40 + 1.69X. The value of R-squared is 0.0301 while t-statistics is 2.0332 (it is greater than t-tabulated at 95% confidence interval – 1.932). The results show that the variable is a significant determinant of salary.

Discussion

The regression results for the relationship between salary and the number of years they have been senior officers is more robust than the relationship between salary and the number of years they have worked in the company because it has a higher value of R-value and years_senior is a significant explanatory variable.

Confidence intervals

The confidence interval is calculated based on the values of the sample collected from a given population. It gives the range within which the population parameter is expected to lie. Further, the population parameter is not known (Baltagi 2011). Therefore, the confidence interval does not give the exact value of the population parameter. The intervals will be calculated at 95% confidence level.

Table 1.3 shows the confidence interval for the number of years they have worked for the company.

Coefficient Confidence Intervals
Date: 01/28/14 Time: 16:09
Sample: 1 135
Included observations: 135
95% CI
VariableCoefficientLowHigh
C77.3982260.7766994.01974
YEARS_SENIOR1.6949110.0460783.343743

The population parameter for years_senior will lie between 0.0461 and 3.3437.

Table 1.4 shows the confidence interval for the number of years they have been senior officers.

Coefficient Confidence Intervals
Date: 01/28/14 Time: 16:10
Sample: 1 135
Included observations: 135
95% CI
VariableCoefficientLowHigh
C84.7893361.12915108.4495
YEARS_COMP0.253525-0.6647091.171759

The population parameter for years_comp will lie between -0.6647 and 1.1718.

References

Arnold, R 2011, Economics, Cengage Learning, USA.

Asteriou, D & Hall, S 2011, Applied econometrics, Palgrave Macmillian, London.

Baltagi, G 2011, Econometrics, Springer, New York.

Greene, H 2003, Econometric analysis, Prentice–Hall, London.

Verbeek, M 2008, A guide to modern econometrics, John Wiley & Sons, England.

More related papers Related Essay Examples
Cite This paper
You're welcome to use this sample in your assignment. Be sure to cite it correctly

Reference

IvyPanda. (2022, April 3). The Regression Analysis: Econometrics Issues. https://ivypanda.com/essays/the-regression-analysis-econometrics-issues/

Work Cited

"The Regression Analysis: Econometrics Issues." IvyPanda, 3 Apr. 2022, ivypanda.com/essays/the-regression-analysis-econometrics-issues/.

References

IvyPanda. (2022) 'The Regression Analysis: Econometrics Issues'. 3 April.

References

IvyPanda. 2022. "The Regression Analysis: Econometrics Issues." April 3, 2022. https://ivypanda.com/essays/the-regression-analysis-econometrics-issues/.

1. IvyPanda. "The Regression Analysis: Econometrics Issues." April 3, 2022. https://ivypanda.com/essays/the-regression-analysis-econometrics-issues/.


Bibliography


IvyPanda. "The Regression Analysis: Econometrics Issues." April 3, 2022. https://ivypanda.com/essays/the-regression-analysis-econometrics-issues/.

If, for any reason, you believe that this content should not be published on our website, please request its removal.
Updated:
This academic paper example has been carefully picked, checked and refined by our editorial team.
No AI was involved: only quilified experts contributed.
You are free to use it for the following purposes:
  • To find inspiration for your paper and overcome writer’s block
  • As a source of information (ensure proper referencing)
  • As a template for you assignment
1 / 1