STIN100 Biological Data Analysis
Showing course contents for the educational year starting in 2019 .
Course responsible: Jon Olav Vik, Torgeir Rhodén Hvidsten
Teachers: Hilde Vinje, Jon Olav Vik, Simen Rød Sandve
ECTS credits: 10
Faculty: Faculty of Chemistry, Biotechnology and Food Science
Teaching language: NO
Limits of class size:
Teaching exam periods:
This course starts in the Fall semester. This course has teaching/evaluation in the Fall semester.
Course frequency: Annually
First time: 2018H
Biology generates increasingly larger amounts of data that cannot be analyzed manually. To extract new insight from big data sets, modern biologists need to acquire knowledge and skills in computer programming and basic data analysis. This course provides basic skills in the programming language R and introduces the student to common methods for visualization and analysis of multi-dimensional biological data.The course is organized around supervised student groups analysing relevant data sets.
KNOWLEDGE: The students will acquire
- broad knowledge in handling, visualizing and analysing multidimensional biological data.
- familiarity with how some of the most important biological data sets are generated and how this data should be preprocessed to correct for systematic errors.
- a conceptual framework for mapping data to graphical elements.
- a repertoire of programming techniques and concepts that are required to perform the analyses in the course.
SKILLS: Students will be able to
- explain principles behind basic methods for data visualization and analysis.
- write programs that perform basic data processing tasks (subsetting, transformation and groupwise summaries) and employ simple visualization and data analysis methods.
- generate reproducible, executable reports that weave together expository text, program code and output.
- propose biological interpretations of analysis results.
COMPETENCES: Students will be well prepared to
- explore datasets they encounter in later term papers, theses and working life.
- perform reproducible research where data processing is fully documented through executable reports.
- compose data graphics using element appropriate to the data types and the biological structure in the data.
- pose follow-up questions to data analyses for discussion with domain experts.
- learn new methods and software packages with the help of documentation, code examples and web resources.
Some lectures, but emphasis will be put on practical group exercises using computers.
Teaching materials will be available in Canvas.
Lecture notes, exercises and handouts, and the website "R for data science" (https://r4ds.had.co.nzhttps://r4ds.had.co.nz), especially chapters 3 (Data visualisation), 9 (Introduction to data wrangling), 12 (Tidy data), 18 (Pipes), 27 (R markdown).
There will be compulsory assignments for the students to hand in and have approved before being evaluated.
Group work: Students write a paper based on analysis of a relevant data set (counts 100%). Pass/Fail.
Lectures: 12 hours. Exercises: 52 hours. Individual study: 236 hours.
MATRS - General admission requirements, and R1 or (S1+S2) or similar mathematical skills
Type of course:
Lectures: 2 hours per week (6 weeks). Exercises: 4 hours per week.
Students must bring their own laptop with Windows, Linux or macOS.
An external examiner must approve the evaluation arrangements for the course.
Examination details: Continous exam: Bestått / Ikke bestått