Introduction
Evaluation and comparison of weather across various states are significant. It aids in making important decisions such as choosing where to live and selecting a city to locate a business among others. The paper seeks to carry out an analysis to ascertain if there is a difference in weather patterns across three cities in the United States of America. The cities selected are New York City, Los Angeles, and Chicago. The three dependent variables that will be compared are low temperature, high temperature, and precipitation levels. Data will be collected over the same period across the cities. The analysis will focus on establishing whether the differences in values of the three variables across the cities are statistically significant. The tests that will be used are ANOVA and t-test. Also, the descriptive statistics of the variables will be compared.
Hypothesis and Research Questions
The paper seeks to carry out tests to ascertain if the weather patterns differ across the three states. The research questions and the hypotheses for the tests are presented below.
First
Research Question
Are there any differences in the daily low temperatures across the three cities?
Hypothesis
Null hypothesis: There is no difference in the daily low temperatures across the three cities.
H0: µltNY = µltLA = µltCH
Alternative hypothesis: There is a difference in the daily low temperatures across the three cities.
H1: µltNY ≠ µltLA ≠ µltCH
Second
Research Question
Are there any differences in the daily high temperatures across the three cities?
Hypothesis
Null hypothesis: There is no difference in the daily high temperatures across the three cities.
H0: µhtNY = µhtLA = µhtCH
Alternative hypothesis: There is a difference in the daily high temperatures across the three cities.
H1: µhtNY ≠ µhtLA ≠ µhtCH
Third
Research Question
Are there any differences in the daily precipitation across the three cities?
Hypothesis
Null hypothesis: There is no difference in the daily precipitation across the three cities
H0: µpNY = µpLA = µpCH
Alternative hypothesis: There is a difference in the daily precipitation across the three cities.
H1: µpNY ≠ µpLA ≠ µpCH
Fourth
Research Question
Are there any differences in low temperature, high temperature, and precipitation levels across the three cities?
Hypothesis
Null hypothesis: There is no difference in mean low temperature, high temperature, and precipitation levels across the three cities.
Alternative hypothesis: There is a difference in mean low temperature, high temperature, and precipitation levels across the three cities.
Data Collection
The data that will be used in the analysis is collected from the AccuWeather.com website (“United States Weather” par. 1). It is a website that contains historical data of various variables such as temperature, snow, and precipitation of various cities in the United States. The data that will be collected from the website is the daily low temperature for January (measured in Fahrenheit), daily high temperature for July (measured in Fahrenheit), and daily precipitation for July (measured in inches). Further, the data for the three variables were collected for the three cities (Baltagi 108). Besides, the data covered the same period across the three cities. The data collected for the analysis are presented in Table 1 below.
Table 1: Data
Analysis
Statistical Tests
The first three hypotheses will be tested using ANOVA. This can be attributed to the fact that ANOVA is the most suitable technique for the testing hypothesis that entails comparing the mean of more than one group. One-way ANOVA will be used because there is only one independent variable under each research question. The fourth hypothesis will be tested using an independent t-test. This can be attributed to the fact that the independent t-test is suitable for comparing the means between unrelated groups on the same variable. All the tests are carried out at the 95% confidence level (Verbeek 237). The results of all the tests are presented in the Tables below.
Variables
The variables that will be tested are daily low temperature, high temperature, and precipitation. These are the dependent variables. It is worth mentioning that the data for the three variables will be collected over the period across the three cities. The three cities represent the groups that will be used when carrying out the test. The test will ascertain if there is a difference in three variables across the three cities.
Results
Descriptive statistics
Despite being in the United States of America, the three cities had different levels of temperature and precipitation. From the results of descriptive statistics, it can be observed that Los Angeles (47.87°) had the highest value of average low temperature while Chicago (18.16°) had the lowest mean value of low temperature. Further, it can be observed that Chicago (0.374) had the highest variation in low temperature while Los Angeles (0.341) had the least variation in temperature. For the second variable, high temperature, it can be observed that Los Angeles (83.23°) had the lowest mean value of high temperature while Chicago (84.19°) had the highest mean value of high temperature (Mankiw 341).
In the case of standard deviation, New York (0.301) had the lowest value while Los Angeles (0.805) had the highest value. Further analysis shows that there was a slight difference between the mean high temperatures across the three cities. Under precipitation, it can be noted that New York (0.1284in) had the highest mean value while Los Angeles (0.0123in) had the lowest value of mean precipitation. A similar trend was observed in the case of standard deviation. The descriptive statistics show that the values of the three variables were different across the three cities. The results do not give much information on whether the differences are statistically significant (Baltagi 108). The significance of these differences will be evaluated in the subsequent section.
ANOVA
As mentioned above, ANOVA will be used to test the significance of the differences that are observed in the three dependent variables across the cities. For the first hypothesis, the results show that the value of F-calculated is 5.616E4. Further, the p-value (0.000) is less than the value of alpha (0.05). Thus, the null hypothesis for the first test will be rejected at the 95% confidence level. It can be concluded that the difference in the value of daily low temperatures across New York, Los Angeles, and Chicago is statistically significant. The results of the second hypothesis show that the value of F-calculated is 23.753. Further, the p-value (0.000) is less than the value of alpha (0.05). Thus, the null hypothesis of the second test will be rejected at the 95% confidence level (Bade and Parkin 246).
It can be concluded that the difference in the value of daily high temperatures across the three cities is statistically significant. For the third hypothesis, the value of F-calculated is 1.872. Also, the p-value (0.160) is greater than the value of alpha (0.50). The null hypothesis will not be rejected at the 95% confidence level. This indicates that the difference in precipitation levels across the three cities is not statistically significant. The results of ANOVA show that the difference in temperature across the three states is statistically significant while the difference in precipitation level is not statistically significant (Bade and Parkin 208).
T-test
The results for the fourth test focus on the equality of means. The value of t-calculated for low temperature is -233.966, 4.392 for high temperature, and 1.750 for precipitation. The value of t-critical is 1.96. Further, the results show that the p-values are 0.000 for low temperature, 0.000 for high temperature, and 0.085 for precipitation. The null hypothesis will be rejected for the low and high temperatures at the 95% confidence level. This shows that the temperature differences are significant across the cities. Further, the null hypothesis will not be rejected for the level of precipitation at the 95% level of confidence. This indicates that the difference in precipitation level is not statistically significant. Further, the test of equality of variances shows that the difference in the variance of low temperature is not statistically significant (Greene 98). Also, the differences in variance for high temperature and precipitation were statistically significant across the three groups.
Discussion
Since the three cities are in one nation, it is expected that the differences that are observed in weather patterns are not statistically significant. However, the results above show that the difference in temperature across the three cities is statistically significant. However, the difference observed in the mean level of precipitation is not statistically significant. Therefore, evaluation of the significance of variables and differences of values across various groups is important because it aids in making sound decisions.
References
Bade, Robin, and Michael Parkin. Essential Foundations of Economics, USA: Pearson Education, 2013. Print.
Baltagi, Badi. Econometrics, New York: Springer Publishers, 2011. Print.
Greene, William. Econometric Analysis, Harlow: Prentice–Hall, 2003. Print.
Mankiw, Gregory. Principles of economics, USA: South-Western Cengage Learning, 2011. Print.
United States Weather 2015. Web.
Verbeek, Marno. A Guide to Modern Econometrics, England: John Wiley & Sons, 2008. Print.