Gun Control in the US: Empirical Analysis Research Paper

Exclusively available on IvyPanda Available only on IvyPanda

Updated: Feb 17th, 2024

Table of Contents

Empirical Analysis
Regression Analysis
T-Test
Interpretation of R squared and adjusted r squared
F-test
Correlation matrix
Multicollinearity
VIF’s
Solution to multicollinearity

Empirical Analysis

Introduction

This analysis involves investigation the factors that affect the existence of gun related crimes. The data is collected for 49 states in the United States to help in this investigation. The data collected include the total number of fire arms in the US, the number of people living in poverty, number of people consuming alcohol, population between 18-24 years, and unemployment rate. The number of fire arms is the dependent variable while all the others are the independent variables. We seek to investigate if the independent variables have any significant effect on the dependent variable. The variables relate by the following model:

We will write a custom essay on your topic

808 writers online

Learn More

Cr_t= f (AC_t,P1824_t,PR_t,UE_t,)

The model can be expanded to give the following equation

Cr_t= β₀ + β₁AC_t + β₂P1824_t + β₃PR_t + β₄UE_t + €_t

This can also be represented by the following equation:

Y = β₀ + β₁X₁ + β₂X₂ + β₃X₃ + β₄X₄ + β₅X₅ + €_t

Where Y/ Cr_tis crime rate in year t, X₁/ AC_t

1 hour!

The minimum time our certified writers need to deliver a 100% original paper

Learn More

is alcohol consumption in year t, X₂/ P1824_t

is population between the age of 18 and 24 in the year t, X₃/ PR_t

is Poverty rate in the year t, X₄/ UE_t

is Unemployment rate for the year t, β₅X₅ is Brandy score/gun control and €_t is the error term.

Regression Analysis

The analysis will be done using regressions analysis which will be done with the help of Eviews statistical software. The Eviews output were obtained as follows:

Dependent Variable: Y
Method: Least Squares
Date: 11/14/12 Time: 15:53
Sample: 1 49
Included observations: 49
Variable	Coefficient	Std. Error	t-Statistic	Prob.
C	-4.436524	13.14918	-0.337399	0.7375
X1	2.38E-05	4.70E-05	0.507321	0.6145
X2	-9.18E-07	3.54E-05	-0.025964	0.9794
X3	0.000149	3.90E-05	3.810283	0.0004
X4	0.092357	0.169786	0.543962	0.5893
X5	0.158743	0.554915	0.286068	0.7762
R-squared	0.935355	Mean dependent var		176.2245
Adjusted R-squared	0.927838	S.D. dependent var		230.1436
S.E. of regression	61.82360	Akaike info criterion		11.20073
Sum squared resid	164352.8	Schwarz criterion		11.43238
Log likelihood	-268.4178	F-statistic		124.4334
Durbin-Watson stat	2.461739	Prob(F-statistic)		0.000000

The estimated regression equation is represented as follows:

Y = -4.436524 + 2.38E-05 X₁ – 9.18E-07 X₂ + 0.000149 X₃ + 0.092357 X₄ + 0.158743 X₅

Remember! This is just a sample

You can get your custom paper by one of our expert writers

Get custom essay

S. E. = 13.14918 4.70E-05 3.54E-05 3.90E-05 0.169786 0.554915

t- Statistic =-0.337399 0.507321 -0.025964 3.810283 0.543962 0.286068

T-Test

To test whether each of the independent variables is an important determinant, we use t-test. The t-test for the sample of 49 elements will be done at n – k = 49 – 5 = 44 degrees of freedom. We test at 95% confidence level. The value of α = 5% which is the significance level. For this test, the critical value of t at df = 44 and α = 5% is 2.0154. The decision criterion for t-test is that if t- Statistic is greater than t-critical, we reject the null hypothesis. The hypothesis being tested is as follows

The null hypothesis is H0: β = 0, meaning that the independent variable is not an important determinant of the dependent variable

The alternative hypothesis is H1: β ≠ 0, meaning that the independent variable is an important determinant of the dependent variable.

We test every independent variable at a time

For X₁, the t- Statistic < t-critical. In this case, we do not reject the null hypothesis. The conclusion is that X₁ representing alcohol consumption is not an important determinant of Y (Crime rate).

For X₂, t- Statistic < t-critical. We, therefore, do not reject the null hypothesis. This shows that X₂ (population between the age of 18 and 24) is not an important determinant of Y (Crime rate).

Get your first paper with

15% OFF

Learn More

For X₃, t- Statistic > t-critical. The null hypothesis is thus rejected. We reject the alternative hypothesis and conclude that X₃ (Poverty rate) is an important determinant of Y (Crime rate).

For X4, t- Statistic < t-critical. For this case, we do not reject the null hypothesis based on the decision criterion for t-test. We therefore reject the alternative hypothesis. This means that X4 (unemployment rate) is not an important determinant of Y (crime rate).

For X5, t- Statistic < t-critical. This means that we do not reject the null hypothesis. We conclude that X5 is not an important determinant of Y.

Interpretation of R squared and adjusted r squared

The value of R², the coefficient of determination, is 93. 5355%. This means that the 93. 5355% of the variation in the dependent variable is explained jointly by the independent variables included in the regression. The R² is said to have some problems that may lead to exaggerated results. The value of R²increases with increase in the number of the independent variables even if they are not important. This means the results may be exaggerated and misleading. To solve this problem, adjusted R² is used. In our case, R² is 92.7838%. This means that 92.7838% of variation in Y is jointly explained by X₁, X₂, X₃, X₄, and X₅.

F-test

This is a test of overall significance of the independent variables. The test aims at determining whether the variables are jointly insignificant. The null hypothesis tested he is as follows: H0: β₀ = β₁ = β₂ = β₃ = β₄ = 0. The alternative hypothesis, therefore, will be

H1: β₀ ≠ β₁ ≠ β₂ ≠ β₃ ≠ β₄ ≠ 0.

It is computed using the following formula:

F = (between – group variability) / (within – group variability )

In our case, F- statistic = 124.4334 which is the one computed as per the formula above. To validate the test, we obtain F- critical from the F- table at K-1 and N-K degrees of freedom. K is the number of samples in the system while N is the sample size. K-1 = 6 – 1 = 5. N – K = 49 – 6 = 43. The value of F in this case at α = 0.05 is 2.4322. The decision criterion is that if obtained F is greater than the critical F value, we reject the null hypothesis. In our case, F- statistic = 158.8357 and F- critical = 2.4322. Therefore, F- statistic > F- critical and we reject the null hypothesis. The conclusion in this case is that all independent variables jointly have a significant impact on the dependent variable. That is, β₀ ≠ β₁ ≠ β₂ ≠ β₃ ≠ β₄ ≠ 0. The variables are not jointly insignificant.

Correlation matrix

Correlation is a statistical measure of relationships between two random variables. A correlation matrix is used to determine correlation coefficients where there are several variables in the model. For our case, the correlation matrix is stated as follows:

	Y	X₁	X₂	X₃	X₄	X₅
Y	1.000000	0.920427	0.921526	0.963398	0.947228	0.447362
X₁	0.920427	1.000000	0.869810	0.923163	0.976301	0.567789
X₂	0.921526	0.869810	1.000000	0.957522	0.924903	0.389256
X₃	0.963398	0.923163	0.957522	1.000000	0.959688	0.411871
X₄	0.947228	0.976301	0.924903	0.959688	1.000000	0.527473
X₅	0.447362	0.567789	0.389256	0.411871	0.527473	1.000000

The correlation coefficients show that there are high degree relationships between variables. All independent variables are highly correlated with the independent variables. The correlation coefficients are over 0.9 meaning there is high correlation. This can be seen from the first column in correlation matrix above. The independent variables are also highly correlated with one another. For instance the correlation between X1 and X2 is 0.869810, X1 and X3 is 0.923163, X1 and X4 is 0.976301, X1 and X5 is 0.567789, X2 and X3 is 0.957522, X2 and X4 is 0.924903, X2 and X5 is 0.389256, X3 and X4 is 0.959688, X3 and X5 is 0.411871, and X4 and X5. Perfect correlation occurs when the correlation coefficient is equal to 1. This means that the independent variables are highly correlated because the correlation coefficients between them are close to 1, apart from those related to X5. This shows there is a problem of Multicollinearity that must be dealt with. This will be discussed in the next section.

Multicollinearity

This problem arises when there is a violation of the assumption of Ordinary Least Squares method of estimation. The assumption being violated is that there is no high correlation between independent variables that are used in the regression model. In our case above, we have seen that there is high correlation between the independent variables X₂, X₂, X₃, X_4,and X₅. This means that Multicollinearity exists. In reality, this problem always exists but what matters most is the degree or magnitude. It should be minimized as much as possible. This problem arises because of improper use of dummy variable, using a variable in the model that is computed from other variables, including the same or almost the same variable twice, or just cases where variables are really and truly highly correlated. Our data suggests presence of Multicollinearity. Firstly, there are four independent variables but only one of the t-ratios of the coefficient is statistically significant. The irony is that the overall F-statistic is significant. Secondly, the t-ratios are too small and the value of R² is high. There is also high correlation between the independent variables. To substantiate further the issue of Multicollinearity, we compute tolerance of the independent variables which helps us to calculate the Variance Inflation Factor, normally abbreviated as VIF. This concept is discussed in the section below.

VIF’s

The VIF shows the effect of Multicollinearity on the variance of the estimates in a model. It is computed by finding the reciprocal of the tolerance of the independent variables. Tolerance is compute as follows

Tolerance = 1- r² Where r² is the correlation between any two variables in the model. This is a good measure of Multicollinearity. A tolerance close to one means that multicollinearity is not a threat. If close to zero, multicollinearity is big. VIF = 1/Tolerance = 1/ (1- r²). There are computed in the table below:

From the correlation matrix below, we shall compute the tolerance and VIF.

	Y	X₁	X₂	X₃	X₄	X₅
Y	1.000000	0.920427	0.921526	0.963398	0.947228	0.447362
X₁	0.920427	1.000000	0.869810	0.923163	0.976301	0.567789
X₂	0.921526	0.869810	1.000000	0.957522	0.924903	0.389256
X₃	0.963398	0.923163	0.957522	1.000000	0.959688	0.411871
X₄	0.947228	0.976301	0.924903	0.959688	1.000000	0.527473
X₅	0.447362	0.567789	0.389256	0.411871	0.527473	1.000000

	X1		X2		X3		X4		X5
	Tolerance	VIF	Tolerance	VIF	Tolerance	VIF	Tolerance	VIF	Tolerance	VIF
X1			0.13019	7.681081	0.076837	13.01456	0.023699	42.19587	0.432211	2.313685
X2					0.042478	23.5416	0.075097	13.31611	0.610744	1.637347
X3							0.040312	24.80651	0.588129	1.700307
X4									0.472527	2.116281

From the above table, the values of tolerance are close to zero, meaning that there is high multicollinearity. We may also compute the VIF value for all the variables jointly as follows:

In this case, R² is the coefficient of determination. Our R²= 92.7838%. VIF = 1/ (1 – 0. 927838) = 13.858. The rule of thumb is that VIF > 5 means that multicollinearity exists and is of high degree. For the individual variables, it is clear that there is multicollinearity because all the values of VIF are greater than 5 apart from all correlations withX₅.

Solution to multicollinearity

Existence of multicollinearity leaves the OLS estimates still unbiased and BLUE (Best Linear Unbiased Estimators). However, when it is high, the values of standard errors tend to be too small. This results to very small values of t-statistic. The danger in this case is that due to small t-ratios, the null hypothesis might never be rejected. It means the coefficients of the independent variables will have to be large enough for the null hypothesis to be rejected. There are a number of ways of solving multicollinearity but for this case we choose to remove some of the variables that are related. The variables that need to be removed are the one that is theoretically not sensible. Theoretically, the number of people living in poverty is believed to be a major determinant of crime rates. This has the same effect as the issue of unemployment in a country. When the level of unemployment is high, the number of crimes also increases. The gun control (X4) plays a role in reducing gun crimes. The population between 18-24 years does not necessarily mean that there are gun crimes. The same case with those consuming alcohol, they may not necessarily affect crime rates. We then remove two variables X2 and X1. We then have to run regression again and test the significance of the remaining variables. We, thus, regress Y against X3, X4, and X5. The Eviews output is as follows:

Dependent Variable: Y
Method: Least Squares
Date: 11/14/12 Time: 18:19
Sample: 1 49
Included observations: 49
Variable	Coefficient	Std. Error	t-Statistic	Prob.
C	-1.898519	12.07316	-0.157251	0.8758
X3	0.000146	3.01E-05	4.838890	0.0000
X4	0.159076	0.099653	1.596308	0.1174
X5	0.225918	0.530858	0.425572	0.6724
R-squared	0.934901	Mean dependent var		176.2245
Adjusted R-squared	0.930561	S.D. dependent var		230.1436
S.E. of regression	60.64598	Akaike info criterion		11.12609
Sum squared resid	165507.1	Schwarz criterion		11.28053
Log likelihood	-268.5892	F-statistic		215.4167
Durbin-Watson stat	2.497802	Prob(F-statistic)		0.000000

Y = β₀ + β₃X₃+ β₄X₄ + β₅X₅

Y = -1.898519 + 0.000146X₃ + 0.159076X₄ + 0.225918 X₄

S.E. = 12.07316 3.01E-05 0.099653 0.530858

T-Statistic =-0.157251 4.838890 1.596308 0.425572

The t-critical at 46 degrees of freedom and α = 0.05 is 2.0129. Based on this statistic, the t-statistic for X₃ and X₄ is greater than the critical value. We then reject the null hypothesis. Then we conclude that X₃ and X₄ are important determinants of Y. The t-statistic for X5 is less than t-critical. In this case, we do not reject the null hypothesis. Then we conclude that X5 is not an important determinant of Y. The F-test is done at α = 0.05 and 4-1 = 3 and 49 = 4 = 45 degrees of freedom. F-critical = 2.8115. F-statistic = 215.4167. F-statistic is greater than F-critical and thus we reject the null hypothesis. We conclude that X₂, X₃ and X₄ are jointly important determinants of Y. The value of adjusted R²is 93.0373 %, meaning that 93.0373% of variation in Y is jointly determined by X₂, X₃, and X₄. We thus conclude that poverty and unemployment rates are significant determinants of crime rate. Brandy score is not a significant determinant of crime rate.

Need an custom research paper on Gun Control in the US: Empirical Analysis written from scratch by a professional specifically for you?

808 writers online

Get Writing Help

Cite This paper

Select a referencing style:

Reference

IvyPanda. (2024, February 17). Gun Control in the US: Empirical Analysis. https://ivypanda.com/essays/gun-control-in-the-us-empirical-analysis/

Work Cited

"Gun Control in the US: Empirical Analysis." IvyPanda, 17 Feb. 2024, ivypanda.com/essays/gun-control-in-the-us-empirical-analysis/.

References

IvyPanda. (2024) 'Gun Control in the US: Empirical Analysis'. 17 February.

References

IvyPanda. 2024. "Gun Control in the US: Empirical Analysis." February 17, 2024. https://ivypanda.com/essays/gun-control-in-the-us-empirical-analysis/.

1. IvyPanda. "Gun Control in the US: Empirical Analysis." February 17, 2024. https://ivypanda.com/essays/gun-control-in-the-us-empirical-analysis/.

Bibliography

IvyPanda. "Gun Control in the US: Empirical Analysis." February 17, 2024. https://ivypanda.com/essays/gun-control-in-the-us-empirical-analysis/.

More related papers

Reading time	8 min
Type	Research Paper
Pages	8
Words	2310
Subjects	Sciences Statistics
Topics	Gun Control
Language	🇺🇸 English