Course, academic year 2023/2024
Genome-oriented applications in plant evolution - MB120C54
English title: Genome-oriented applications in plant evolution
Czech title: Genomické metody pro studium evoluce rostlin
Guaranteed by: Department of Botany (31-120)
Faculty: Faculty of Science
Valid: from 2022
Semester: winter
E-Credits: 3
Examination process: winter s.:
Hours per week, examination: winter s.: 0/2, C [HT]
Capacity: 15
Minimum number of students: unlimited
4EU+: no
Virtual mobility / capacity for virtual mobility: no
State of the course: taught
Language of instruction: English
Note: enabled for web enrollment
Guarantor: Levi Jeremiah Yant, Dr. rer. nat.
Teacher(s): Levi Jeremiah Yant, Dr. rer. nat.
Annotation - English
Last update: Mgr. Michal Štefánek (17.01.2022)
Annotation:
This is a practical, plant genomics-based computational course taking students from beginning command line scripting and basic HPC cluster computing to pipeline reproducibility and container management, using real data from current research.

Participants will develop basic command line skills in Unix to manipulate data types commonly used in genomic studies. They will learn how to run scripts written by others as well as write and develop their own. The module will cover Unix, HPC cluster computing, reproducibility, and basic Python scripting, and will provide students with the tools to develop their own code and pipelines. Participants will work within interactive coding environments to maintain reproducibility in code and practice. By the end of the module, students will be equipped with the practical skills to write simple code to address diverse biological problems.

Learning Aims:
This module aims to introduce fundamental concepts of programming and data manipulation to students without prior experience of the command line. Participants will learn how to write useful, simple code in an interactive environment. Sessions will introduce fundamental programming concepts and approaches. Participants will then have sessions of directed learning to practice and develop the skills required to complete a series of programming challenges.

Learning Outcomes:
• Competence in the Unix environment, command line operations and scripting.
• Knowledge of information sources and guidance for solving common problems in computational biology.
• The ability to install and run scripts within an appropriate environment and install necessary co-requisites.
• Knowledge of how to use packages to solve common problems in programming.
• Understanding how to use environments (e.g. Anaconda-based) and package managers for code portability.
• Knowledge of typical biological data file formats and ability to write new files or edit existing files in place.
• The ability to write simple scripts, employ the concepts of class-oriented programming and develop simple applications to reproducibly solve common problems.
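
As a rough illustration of the kind of short script these outcomes refer to, the sketch below reads sequences in FASTA format (a common biological file format) and reports the GC content of each. It is not taken from the course materials: the class `FastaRecord`, the function `read_fasta`, and the toy sequences are hypothetical names and data chosen only for this example.

```python
from dataclasses import dataclass
from io import StringIO
from typing import Iterator, List, Optional, TextIO


@dataclass
class FastaRecord:
    """One named sequence from a FASTA file (hypothetical helper class)."""
    name: str
    sequence: str

    def gc_content(self) -> float:
        """Return the fraction of G and C bases, ignoring case."""
        if not self.sequence:
            return 0.0
        seq = self.sequence.upper()
        return (seq.count("G") + seq.count("C")) / len(seq)


def read_fasta(handle: TextIO) -> Iterator[FastaRecord]:
    """Yield records from an open FASTA handle, joining wrapped lines."""
    name: Optional[str] = None
    chunks: List[str] = []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                yield FastaRecord(name, "".join(chunks))
            header = line[1:].split()
            name = header[0] if header else ""
            chunks = []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        yield FastaRecord(name, "".join(chunks))


# Made-up two-sequence input for illustration; not course data.
EXAMPLE = ">seq1 toy example\nATGC\nGGCC\n>seq2\nATAT\n"

records = list(read_fasta(StringIO(EXAMPLE)))
for rec in records:
    print(rec.name, len(rec.sequence), round(rec.gc_content(), 2))
```

To run the same code on a real file, the `StringIO` object could be replaced by a handle from `open("sequences.fasta")`, where the file name is again only a placeholder.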

The core of the work is therefore hands-on practical experience with the analysis of empirical data, supervised by Professor Yant. The format approximates real work on a student's own data generated during independent research (e.g. during a Master's or PhD project). Sample datasets will be provided; analysis of a student's own data in the project work is possible and welcome but not required. Previous experience with scripting is welcome but not required.

Every student selects a topic and runs a project: they process a provided sample dataset or their own dataset using the presented tools and share their results with the others in a short presentation. Credit will be given for the presentations and for simple programming challenges.

This course will be held in English.
Literature - English
Last update: Mgr. Michal Štefánek (17.01.2022)

Literature

Bradnam K & Korf I (2012) UNIX and Perl to the Rescue!: A Field Guide for the Life Sciences (and Other Data-rich Pursuits). 1st edition. Cambridge University Press.

Saetre GP & Ravinet M (2019) Evolutionary Genetics. 1st edition. Oxford University Press.

For course materials, please see https://botany.natur.cuni.cz/TO_BE_GENERATED

Exam requirements - English
Last update: Mgr. Michal Štefánek (17.01.2022)

Requirements for the exam

- individual and group-based practical challenges (in class and take-home)

Syllabus - English
Last update: Mgr. Michal Štefánek (17.01.2022)

Syllabus:

* Preliminary schedule:

Week 1 Introduction to Unix

Week 3 Common Data Types & Tools Part 1: common genomic data types and tools.

Week 5 Common Data Types & Tools Part 2: HPC computing, package managers, containers, and pipelines

Week 7 Python Basics

Week 9 In-class Presentations and assessment

Week 11 Code and pipeline reproducibility

(Sessions run as 2-3 hour blocks every two weeks for better immersion.)

 