DAT110 Introduction to Data Analysis and -Visualisation
Showing course contents for the educational year 2021 - 2022 .
Course responsible: Kristin Tøndel
ECTS credits: 10
Faculty: Faculty of Science and Technology
Teaching language: NO
Teaching exam periods:
This course starts in Spring parallel. This course has teaching/evaluation in Spring parallel.
Course frequency: Annually
First time: Study year 2019-2020
Students taking a master in Data Science.
- Datatypes and loading of data from various file formats.
- Visualization and explorative analysis for identification of structure and trends (histograms, scatterplots, box-plots etc.).
- Some simple statistics (mean, median, variance etc.).
- Correlation and covariance (of single variables and matrix data).
- Elementary normal distribution theory, normalizing transformations and testing of normality.
- Geometric distributions, binomial distributions, Poisson-distributions.
- Inference (parametrical and non-parametrical for investigation of one and two samples) and simple analysis of variance.
- Least squares modelling (linear and polynomial fit).
- Logistic regression (classification with two groups).
- Smoothing of time-dependent data.
- Cluster analysis (k-means, hierarchical clustering etc.).
Skills and insight into basic data analysis techniques based on modern data collection. The student will learn to choose appropriate methods for
1) Explorative data analysis (plotting/visualisation and descriptive statistics),
4) Modelling and prediction with continous and categorical responses (regression and classification) and validation of predictive models.
5) Introduction to cluster analysis and smoothing of time series data.
Lectures, and group work with manual- and computer lab exercises (assistant teachers and course responsible will be present at the groups). 4 hours mandatory attendance per week. Student-active learning is used.
Guidance during tutoring sessions.
Information about relevant literature will be given before the start of the course at the course home page.
INF120 Programming and Data Processing (or equivalent).
Mandatory exercises (hand-ins) and 2 double lectures mandatory attendance per week. Requirements for approval of mandatory activity will be announced at the beginning of the term.
Written examination: 3.5 hours. A-F.
Lectures 78 hours, exercises 26 hours, preparation for presentations 96 hours, colloquia and self study 50 hours.
Special requirements in Science
Reduction of credits:
5 credits with STAT100
10 credits with MATH-INF110
Type of course:
Lectures: 2 hours per week. Lectures/exercises: 2 hours per week. Exercises: 2 hours per week.
The external and internal examiner jointly prepare the exam questions and the correction manual. The external examiner reviews the internal examiner's examination results by correcting a random sample of candidate's exams as a calibration according to the Department's guidelines for examination markings.
Examination details: Written exam: Letter grades