SubjectsSubjects(version: 964)
Course, academic year 2024/2025
   Login via CAS
Introduction to Data Analysis - JSM406
Title: Introduction to Data Analysis
Guaranteed by: Department of Sociology (23-KS)
Faculty: Faculty of Social Sciences
Actual: from 2024
Semester: both
E-Credits: 8
Hours per week, examination: 1/1, Ex [HT]
Capacity: winter:unknown / 25 (10)
summer:unknown / unknown (10)
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
State of the course: taught
Language: English
Teaching methods: full-time
Note: course can be enrolled in outside the study plan
enabled for web enrollment
priority enrollment if the course is part of the study plan
you can enroll for the course in winter and in summer semester
Guarantor: PhDr. Ing. Petr Soukup, Ph.D.
Mgr. Ivan Petrúšek, Ph.D.
Teacher(s): Mgr. Tereza Klegr
Mgr. Ivan Petrúšek, Ph.D.
PhDr. Ing. Petr Soukup, Ph.D.
Class: Courses for incoming students
Is pre-requisite for: JSM503
Is interchangeable with: JSM513
Annotation
The course will introduce students to the basic data analysis methods used in quantitative social science research. As this is an introductory course, no previous knowledge of statistics is required. Students will learn and practice basic statistical methods by analyzing sociological survey data in a licenced software called IBM SPSS Statistics (each registered student will be provided a licence from the Faculty of Social Sciences). After taking this course, students should be able to prepare a data set, perform common data management tasks and analyze quantitative data using basic statistical techniques. This introductory data analysis course is recommended to students of Erasmus+ and other foreign exchange programs.
Last update: Petrúšek Ivan, Mgr., Ph.D. (18.01.2025)
Aim of the course

The main objective of this course is to introduce the key statistical theory and teach practical skills in quantitative data analysis. Students will learn the IBM SPSS Statistics software environment by editing and analyzing an established questionnaire survey dataset. Hence, the students will learn the basics of secondary data analysis (i.e. basic data management tasks such as creating new variables or subsetting the dataset based on specified conditions, computing descriptive statistics, preparing elementary data visualizations, and making inferences from sample data to a target population). This course will prepare students to employ the essential quantitative methods in their research projects and attend follow-up intermediate statistics courses.

Last update: Petrúšek Ivan, Mgr., Ph.D. (18.01.2025)
Literature

Required reading:

Field, A. (2013). Discovering Statistics Using IBM SPSS Statistics (4th Edition). London: Sage.

(detailed reading assignment from the course textbook will be specified after each class)

Recommended reading:

Agresti, A. (2018). Statistical Methods for the Social Sciences (5th Edition). Pearson.

Wheelan, Ch. (2013). Naked Statistics: Stripping the Dread from the Data. W. W. Norton.

Last update: Petrúšek Ivan, Mgr., Ph.D. (18.01.2025)
Teaching methods
The classes are a combination of lectures and seminars. The first part (approximately 40 minutes) is a lecture during which the tutor introduces key concepts in statistical theory and quantitative data analysis methods (see the syllabus below). The second part (approximately 40 minutes) is a seminar where students apply the methods introduced during the lecture in the data analysis software (IBM SPSS Statistics).
Last update: Petrúšek Ivan, Mgr., Ph.D. (18.01.2025)
Requirements to the exam

Grading will be based on homework assignments (6 assignments, each worth 5 points) and a final in-class exam (worth 70 points). Students may earn up to 100 total points.

Grading:

  • 91 - 100 points = grade A
  • 81 - 90 points = grade B
  • 71 - 80 points = grade C
  • 61 - 70 points = grade D
  • 51 - 60 points = grade E
  • 0 - 50 points = not passed (grade F)

NOTE: Total points earned will be rounded to the whole number (e.g., the overall result of 50.5 points is rounded to 51 points, which corresponds to the grade E).

Last update: Petrúšek Ivan, Mgr., Ph.D. (18.01.2025)
Syllabus

Course Schedule:

Week 1: Course overview. Introduction to the IBM SPSS Statistics software environment.
Week 2: Descriptive vs inferential statistics. Levels of measurement.
Week 3: Introduction to probability and probability distributions.
Week 4: Sampling variation. Central limit theorem. Confidence intervals (for the mean).
Week 5: Statistical hypotheses testing framework. One-sample t-test.
Week 6: Independent-samples t-test. Paired-samples t-test.
Week 7: Exploring assumptions of parametric tests. Assumption of normality.
Week 8: Analysis of variance (within-group and between-group variability, F-test, post-hoc tests).
Week 9: Correlation analysis (Covariance, Pearson and Spearman correlation coefficients, Scatterplot).
Week 10: Linear regression (method of least squares, simple/multiple regression).
Week 11: Analysis of categorical data I (confidence interval for a proportion, introduction to crosstabs).
Week 12: Analysis of categorical data II (chi-square test of independence, contingency coefficients, residuals).
Week 13: Review session.

Last update: Petrúšek Ivan, Mgr., Ph.D. (18.01.2025)
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html