PředmětyPředměty(verze: 953)
Předmět, akademický rok 2023/2024
   Přihlásit přes CAS
Introduction to Data Analysis - JSM406
Anglický název: Introduction to Data Analysis
Zajišťuje: Katedra sociologie (23-KS)
Fakulta: Fakulta sociálních věd
Platnost: od 2023
Semestr: oba
E-Kredity: 8
Rozsah, examinace: 1/1, Zk [HT]
Počet míst: zimní:neurčen / 25 (15)
letní:neurčen / neurčen (15)
Minimální obsazenost: neomezen
4EU+: ne
Virtuální mobilita / počet míst pro virtuální mobilitu: ne
Stav předmětu: vyučován
Jazyk výuky: angličtina
Způsob výuky: prezenční
Způsob výuky: prezenční
Poznámka: předmět je možno zapsat mimo plán
povolen pro zápis po webu
při zápisu přednost, je-li ve stud. plánu
předmět lze zapsat v ZS i LS
Garant: PhDr. Ing. Petr Soukup, Ph.D.
Mgr. Ivan Petrúšek, Ph.D.
Vyučující: Mgr. Ivan Petrúšek, Ph.D.
PhDr. Ing. Petr Soukup, Ph.D.
Mgr. Tereza Svobodová
Třída: Courses for incoming students
Je prerekvizitou pro: JSM503
Je záměnnost pro: JSM513
Anotace - angličtina
The course will introduce students to the basic data analysis methods used in quantitative social science research. As this is an introductory course, no previous knowledge of statistics is required. Students will learn and practice basic statistical methods by analyzing sociological survey data in a licenced software called IBM SPSS (each registered student will be provided a licence from the Faculty). After taking this course, students should be able to prepare a data set, perform common data management tasks and analyze quantitative data using basic statistical techniques. This introductory data analysis course is recommended to students of Erasmus+ and other foreign exchange programs.
Poslední úprava: Petrúšek Ivan, Mgr., Ph.D. (30.01.2024)
Cíl předmětu - angličtina

The main objective of this course is to introduce the key statistical theory and teach practical skills in quantitative data analysis. Students will learn the IBM SPSS software environment by editing and analyzing an established questionnaire survey dataset. Hence, the students will learn the basics of secondary data analysis (i.e. basic data management tasks such as creating new variables or subsetting the dataset based on specified conditions, computing descriptive statistics, preparing elementary data visualizations, and making inferences from sample data). This course will prepare students to employ the essential quantitative methods in their research projects and attend follow-up intermediate statistics courses.

Poslední úprava: Petrúšek Ivan, Mgr., Ph.D. (30.01.2024)
Literatura - angličtina

Required reading:

Field, A. (2013). Discovering Statistics Using IBM SPSS Statistics. Fourth edition. London: Sage.

(detailed reading assignment from the course textbook will be specified after each class)

Recommended reading:

Agresti, A. (2018). Statistical Methods for the Social Sciences (5th Edition). Pearson.

Wheelan, Ch. (2013). Naked Statistics: Stripping the Dread from the Data. W. W. Norton.

Poslední úprava: Petrúšek Ivan, Mgr., Ph.D. (30.01.2024)
Metody výuky - angličtina
The classes are a combination of lectures and seminars. The first part (approximately 40 minutes) is a lecture during which the tutor introduces key concepts in statistical theory and quantitative data analysis methods (see syllabus below). The second part (approx. 40 minutes) is a seminar where students apply the methods introduced during the lecture in the data analysis software (IBM SPSS).
Poslední úprava: Petrúšek Ivan, Mgr., Ph.D. (30.01.2024)
Požadavky ke zkoušce - angličtina

Grading will be based on homework assignments (6 mandatory assignments, each worth 5 points) and a final in-class exam (worth 70 points). Students may earn up to 100 total points.


  • 91 - 100 points = grade A
  • 81 - 90 points = grade B
  • 71 - 80 points = grade C
  • 61 - 70 points = grade D
  • 51 - 60 points = grade E
  • 0 - 50 points = not passed (grade F)

NOTE: Total points earned will be rounded to the whole number (e.g., the overall result of 50.5 points is rounded to 51 points, which corresponds to the grade E).

Poslední úprava: Petrúšek Ivan, Mgr., Ph.D. (30.01.2024)
Sylabus - angličtina

Course Schedule 

Week 1: Course overview. Introduction to the software environment.
Week 2: Descriptive vs inferential statistics. Levels of measurement.
Week 3: Introduction to probability and probability distributions.
Week 4: Sampling variation. Central limit theorem. Confidence intervals (for the mean).
Week 5: Statistical hypotheses testing framework. One-sample t-test.
Week 6: Independent-samples t-test. Paired-samples t-test.
Week 7: Exploring assumptions of parametric tests. Assumption of normality.
Week 8: Analysis of variance (within- and between-group variability, F-test, post-hoc tests).
Week 9: Correlation analysis (Covariance, Pearson and Spearman correlation coefficients, Scatterplot).
Week 10: Linear regression (method of least squares, simple/multiple regression).
Week 11: Analysis of categorical data I (confidence interval for a proportion, introduction to crosstabs).
Week 12: Analysis of categorical data II (chi-square test of independence, contingency coefficients, residuals).
Week 13: Review session.

Poslední úprava: Petrúšek Ivan, Mgr., Ph.D. (30.01.2024)
Univerzita Karlova | Informační systém UK