STIN100 Biological Data Analysis

Credits (ECTS):10

Course responsible:Torgeir Rhoden Hvidsten

Campus / Online:Taught campus Ås

Teaching language:Norsk

Limits of class size:800

Course frequency:Annually

Nominal workload:Plenary sessions: 54 hours. Exercise sessions: 52 hours. Self study: 144 hours.

Teaching and exam period:This course starts in the Fall semester. This course has teaching/evaluation in the Fall semester.

About this course

Biology has become a data-rich science with datasets that can no longer be analyzed manually. To extract knowledge from data, biologists need knowledge and skills in programming and data analysis that enable them to explore, visualize and interpret data. This must be done reproducibly, so that it is clear how the data has been processed and easy to modify the analyses if desired.

This course provides basic skills in the programming language R and introduces the student to common methods for visualization and analysis of multi-dimensional biological data. The course is organized around supervised student groups analysing relevant data sets.

In a time when trust in scientific knowledge is no longer obvious, yet challenges of sustainability require informed decisions, the understanding of data and verifiable production of knowledge are essential. STIN100 helps ensure that future employers and decision makers can rely on the knowledge basis prepared by our graduates.

Learning outcome

KNOWLEDGE: The students will acquire

  • broad knowledge in handling, visualizing and analysing multidimensional biological data.
  • familiarity with how some of the most important biological data sets are generated and how this data should be preprocessed to correct for systematic errors.
  • a conceptual framework for mapping data to graphical elements.
  • a repertoire of programming techniques and concepts that are required to perform the analyses in the course.

SKILLS: Students will be able to

  • explain principles behind basic methods for data visualization and analysis.
  • write programs that perform basic data processing tasks (subsetting, transformation and groupwise summaries) and employ simple visualization and data analysis methods.
  • generate reproducible, executable reports that weave together expository text, program code and output.
  • propose biological interpretations of analysis results.
  • efficiently search documentation and internet resources to realize analyzes.
  • simplify data sets for prototyping and debugging of analyzes.

COMPETENCES: Students will be well prepared to

  • explore datasets they encounter in later term papers, theses and working life.
  • perform reproducible research where data processing is fully documented through executable reports.
  • compose data graphics using element appropriate to the data types and the biological structure in the data.
  • pose follow-up questions to data analyses for discussion with domain experts.
  • learn new methods and software packages with the help of documentation, code examples and web resources.
  • Learning activities

    Two double sessions with plenary activities per week: a mix of lectures and active learning with teaching assistants available (4 hours). Two double sessions of computer-based exercises per week: students work independently, alone or in pairs, with teaching assistants available (4 hours).

    The course’s learning philosophy is: Active learning, where you program yourself and interpret what the data tells you. Problem-based learning, centered around research questions relevant to NMBU. Collaborative learning, through pair programming. Student-driven learning, where you choose what to practice more.

  • Teaching support

    Canvas contains weekly plans with links to learning materials, exercises, and other assignments.

    Teaching assistants are available for questions during plenary sessions and exercise session. Otherwise, questions about data analysis and programming are answered in the Discussions section on Canvas. Learning to ask effective questions through reproducible examples is a key skill you will develop during the course.

    Assignments receive feedback from teaching assistants (in some cases, you will receive automatic feedback).

  • Recommended prerequisites

    Do I need to know a lot of math? Biology? Computer Science?

    You do not need extensive knowledge of mathematics, biology or compiuter science to take this course. However, you must be familiar with your file system, keyboard, web browser, and computer! For some, this requires a lot of effort, especially in the first week.

  • Assessment method

    The assessment is pass/fail, based on a series of approved assignments throughout the semester. If an assignment is not approved, you will receive specific guidance and one additional attempt.

    Approved assignments are only valid within the current semester.



    Portfolio Karakterregel: Passed / Not Passed
  • Examiner scheme
    An external examiner must approve the evaluation arrangements for the course.
  • Notes
    Students must have their own laptop running Windows, Linux, or macOS capable of running RStudio (se oppdaterte systemkrav). Chromebooks do not meet the system requirements for the software used in this course.
  • Teaching hours

    2 double sessions of plenary activities per week.

    2 double sessions of practice exercises per week.

  • Admission requirements
    MATRS - General admission requirements, and R1 or (S1+S2) or similar mathematical skills