Updated:

Principal Component Analysis: Anxiety in Students Dissertation

Exclusively available on Available only on IvyPanda® Made by Human No AI

Statistical Test

The data analysis intends to employ SPSS in performing principal component analysis (PCA). Particularly, PCA aims to reduce the number of variables into principal components that explain most of the variation in SPSS anxiety among students. According to Field (2013), PCA eases exploration of data by establishing patterns, linear relationships, and the influence of each variable. In essence, PCA eliminates redundant variables and promotes dimensionality of variables (Jackson, 2015).

For robust PCA, data ought to meet the assumptions of the multiplicity of continuous variables, the linearity of data, sample adequacy, and the absence of significant outliers. Elliott and Woodward (2015) elucidate that PCA applies orthogonal transformation in converting raw variables that highly correlate to principal components with low correlation coefficients. Thus, PCA enables the determination of principal components with major influence on the variation of data.

The Basis of Data

The data emanated from the questionnaire developed and utilized in collecting data regarding SPSS anxiety among 2571 students. The questionnaire comprises 23 closed-ended questions with a five-point Likert scale answers indicating the strength of agreement from 1 (strongly disagree) through strongly agree (5) (Field, 2013). Since students experience anxiety in the course of learning SPSS, the questionnaire aims to measure and ascertain the extent of SPSS anxiety. Through the interview of students with and without anxiety, the study formulated 23 questions with viability to measure SPSS anxiety.

The examination of data indicates that they meet the assumptions of PCA because the questions are 23, variables have linear relationships, data do not have significant outliers, and the sample size is more than 300 (N = 2571). The use of the questionnaire would indicate the extent of anxiety in each student learning SPSS. Moreover, the questionnaire would reveal forms of SPSS anxiety existing among students. In essence, the study aims to use PCA in revealing principal variables that explain most of SPSS anxiety among students.

Research Questions

  • What are the principal components that account for the variation in SPSS anxiety among students?
  • What are the dominant themes in the extracted components?

Hypotheses

  • H0: The 23 questions in the questionnaire are not statistically significant variables in explaining the variation in SPSS anxiety among students.
  • H1: The 23 questions in the questionnaire are statistically significant variables in explaining the variation in SPSS anxiety among students.

Statistical Outcomes

Descriptive Statistics

The exploration of the data using descriptive statistics indicates that there are no missing data because 2571 students answered all the Likert items in the questionnaire. Denis (2016) asserts that descriptive statistics form the basis of data analysis for they reveal patterns and trends of data. Descriptive statistics also show that students neither agree nor disagree with most Likert statements (18) because their mean scores ranged from 2.23 to 2.89 as demonstrated in Table 1. Moreover, students agree with the remaining four Likert statements for their mean scores ranged from 3.16 to 3.62.

Table 1.

Descriptive Statistics
MeanStd. DeviationAnalysis N
Statistics makes me cry2.37.8282571
My friends will think I’m stupid for not being able to cope with SPSS1.62.8512571
Standard deviations excite me2.591.0752571
I dream that Pearson is attacking me with correlation coefficients2.79.9492571
I don’t understand statistics2.72.9652571
I have little experience of computers2.231.1222571
All computers hate me2.921.1022571
I have never been good at mathematics2.24.8732571
My friends are better at statistics than me2.851.2632571
Computers are useful only for playing games2.28.8772571
I did badly at mathematics at school2.26.8812571
People try to tell you that SPSS makes statistics easier to understand but it doesn’t3.16.9162571
I worry that I will cause irreparable damage because of my incompetence with computers2.45.9492571
Computers have minds of their own and deliberately go wrong whenever I use them2.88.9992571
Computers are out to get me2.771.0092571
I weep openly at the mention of central tendency2.88.9162571
I slip into a coma whenever I see an equation2.47.8842571
SPSS always crashes when I try to use it2.571.0532571
Everybody looks at me when I use SPSS2.291.1012571
I can’t sleep for thoughts of eigenvectors3.621.0362571
I wake up under my duvet thinking that I am trapped under a normal distribution3.17.9852571
My friends are better at SPSS than I am2.891.0412571
If I’m good at statistics my friends will think I’m a nerd3.431.0442571

Assumption Tests

The study employed the correlation in testing the assumption of multicollinearity. The scrutiny of the correlation matrix shows that the 23 Likert items have both negative and positive relationships. The correlation analysis indicates that the relationships of the Likert items range from moderate to very weak positive and negative relationships. Further scrutiny of significance coefficients shows that all Likert items have statistically significant relationships with one or more Likert items (p < 0.001). According to Field (2013), Likert items with correlation coefficients greater than 0.9 exhibit multicollinearity. Therefore, as correlation coefficients are less than 0.6, it implies that no variables exhibit multicollinearity, and thus, no elimination of Likert items.

Regarding the assumption of the adequacy of sample size, KMO statistic (0.93) indicates that the sample size is adequate for principal component analysis (Table 2). Since KMO statistic is considerably greater than the threshold of 0.5, Field (2013) describes the adequacy of sample size as marvelous. Anti-image correlation matrices indicates that individual OKM values of anti-image covariance and anti-image correlation are greater than 0.5, which means all Likert items are viable for PCA. As illustrated in Table 2, Bartlett’s test of sphericity is statistically significant (p = 0.000), and thus, it rejects the null hypothesis that the correlation matrix is similar to the identity matrix.

Table 2.

KMO and Bartlett’s Test
Kaiser-Meyer-Olkin Measure of Sampling Adequacy..930
Bartlett’s Test of SphericityApprox. Chi-Square19334.492
df253
Sig..000

Factor Extraction

From Table 3, it is apparent that four factors out of 23 factors have eigenvalues greater than one while the remaining 19 factors have eigenvalues less than one

Table 3.

Total Variance Explained
ComponentInitial EigenvaluesExtraction Sums of Squared LoadingsRotation Sums of Squared Loadings
Total% of VarianceCumulative %Total% of VarianceCumulative %Total% of VarianceCumulative %
17.29031.69631.6967.29031.69631.6963.73016.21916.219
21.7397.56039.2561.7397.56039.2563.34014.52330.742
31.3175.72544.9811.3175.72544.9812.55311.09941.841
41.2275.33650.3171.2275.33650.3171.9508.47650.317
5.9884.29554.612
6.8953.89358.504
7.8063.50262.007
8.7833.40465.410
9.7513.26568.676
10.7173.11771.793
11.6842.97274.765
12.6702.91177.676
13.6122.66180.337
14.5782.51282.849
15.5492.38885.236
16.5232.27587.511
17.5082.21089.721
18.4561.98291.704
19.4241.84393.546
20.4081.77395.319
21.3791.65096.969
22.3641.58398.552
23.3331.448100.000
Extraction Method: Principal Component Analysis.

Before extraction, factors 1, 2, 3, and 4 had eigenvalues of 7.290, 1.739, 1.317, and 1.227, which accounted for 31.696%, 7.560%, 5.725%, and 5.336% of variance in that order. After extraction, factors 1, 2, 3, and 4 had the same eigenvalues that explained the same proportion of variance respectively. However, after extraction, factors 1, 2, 3, and 4 had eigenvalues of 3.730, 3.340, 2.553, and 1.950, which accounted for 16.219%, 14.523%, 11.099%, and 8.476% of variance correspondingly. Overall, the four factors cumulatively accounted for 50.317% of the variation in SPSS anxiety among students.

The table in Appendix A shows that shared variance of factors ranged from 0.343 to 0.739. The mean of common variance is 0.5032 (11.573/23), which is less than Kaiser’s threshold of 0.7. Component matrix shows the distribution of factors’ loadings into the four components selected. According to Field (2013), factors with loadings greater than 0.4 are significant while factors with loadings less than 0.4 are not significant.

The scree plot demonstrates that the four components explain most of the variation in eigenvalues. According to McCormick, Salcedo, Peck, Wheeler, and Verlen (2017), the elbow of the scree plot provides a threshold of components in which further additions have no significant impact on the variation of data. Thus, the scree plot confirms the extraction of the four components in PCA.

 Scree plot showing the distribution of eigenvalues among component numbers.
Figure 1: Scree plot showing the distribution of eigenvalues among component numbers.

The analysis of loadings in the rotated component matrix shows themes of anxiety that are in each component. Pallant (2016) holds that the threshold of suppressing loading coefficients determines the emergence of these themes. The type of anxiety that loaded onto the first component relates to the fear of computers while the form of anxiety that loaded onto the second component relates to the fear of statistics. The same number of questions (8) loaded onto the second and third components. The questions that loaded onto the third component relate to the fear of mathematics whereas the questions that loaded onto the fourth component relate to the fear of social perception.

Interpretation

PCA was performed to determine which among the 23 Likert items considerably explain SPSS anxiety among students. The analysis showed the data met the assumptions of sample adequacy (OKM = 0.93), the absence of multicollinearity (r < 0.6), and significance of Bartlett’s test of sphericity. Factor extraction established four components with eigenvalues greater than one. Moreover, the four eigenvalues collectively accounted for 50.317% of the variation in SPSS anxiety among students. The scree plot confirms the extraction of the four components because the major inflection point occurred at the fourth factor.

The analysis of how each question loaded onto different components revealed varied themes in the questions. Eight questions that loaded onto the first component relate to the fear of computers while another eight questions that loaded onto the second component relate to the fear of statistics. Whereas three questions that loaded onto the third component relate to the fear of mathematics, the remaining five questions that loaded onto the fourth component relate to the fear of social perception.

Thus, in answering the research question, PCA shows that fears of computers, statistics, mathematics, and social perception are principal factors in the questionnaire that influence the occurrence of anxiety among students. Moreover, PCA rejects the null hypothesis for it demonstrates that the 23 questions in the questionnaire are statistically significant variables in explaining the variation in SPSS anxiety among students.

References

Denis, D. (2016). Applied univariate, bivariate, and multivariate statistics. Hoboken, NJ:Wiley.

Elliott, A. C., & Woodward, W. A. (2015). IBM SPSS by example: A practical guide to statistical data. Thousand Oaks, CA: SAGE Publications.

Field, A. (2013). Discovering statistics using IBM SPSS statistics (4th ed.). Los Angeles, CA: SAGE Publications.

Jackson, S. J. (2015). Research methods and statistics: A critical thinking approach (5th ed.). Belmont, CA: Cengage Learning.

McCormick, K., Salcedo, J., Peck, J., Wheeler, A., & Verlen, J. (2017). SPSS statistics for data analysis and visualization. Indianapolis, IN: Wiley.

Pallant, J. (2016). SPSS survival manual: A step by step guide to data analysis using IBM SPSS. Sydney, Australia: Allen & Unwin.

Appendices

Appendix A: Communalities

Communalities
InitialExtraction
Q_011.000.435
Q_021.000.414
Q_031.000.530
Q_041.000.469
Q_051.000.343
Q_061.000.654
Q_071.000.545
Q_081.000.739
Q_091.000.484
Q_101.000.335
Q_111.000.690
Q_121.000.513
Q_131.000.536
Q_141.000.488
Q_151.000.378
Q_161.000.487
Q_171.000.683
Q_181.000.597
Q_191.000.343
Q_201.000.484
Q_211.000.550
Q_221.000.464
Q_231.000.412
Extraction Method: Principal Component Analysis.
More related papers Related Essay Examples
Cite This paper
You're welcome to use this sample in your assignment. Be sure to cite it correctly

Reference

IvyPanda. (2020, December 29). Principal Component Analysis: Anxiety in Students. https://ivypanda.com/essays/principal-component-analysis/

Work Cited

"Principal Component Analysis: Anxiety in Students." IvyPanda, 29 Dec. 2020, ivypanda.com/essays/principal-component-analysis/.

References

IvyPanda. (2020) 'Principal Component Analysis: Anxiety in Students'. 29 December.

References

IvyPanda. 2020. "Principal Component Analysis: Anxiety in Students." December 29, 2020. https://ivypanda.com/essays/principal-component-analysis/.

1. IvyPanda. "Principal Component Analysis: Anxiety in Students." December 29, 2020. https://ivypanda.com/essays/principal-component-analysis/.


Bibliography


IvyPanda. "Principal Component Analysis: Anxiety in Students." December 29, 2020. https://ivypanda.com/essays/principal-component-analysis/.

If, for any reason, you believe that this content should not be published on our website, please request its removal.
Updated:
This academic paper example has been carefully picked, checked and refined by our editorial team.
No AI was involved: only quilified experts contributed.
You are free to use it for the following purposes:
  • To find inspiration for your paper and overcome writer’s block
  • As a source of information (ensure proper referencing)
  • As a template for you assignment
1 / 1