First steps with Python in Life Sciences

Last update: Jan 08, 2023

Overview

First steps with Python in Life Sciences

This course material is part of the "First Steps with Python in Life Science" three-day course of SIB-training and is addressed to beginners wanting to become familiar with the Python syntax, environment, and the most common commands.

This course material provides an introduction to python and jupyter notebooks (a web based notebook system for creating and sharing computational documents) in an interactive manner.

prerequisite installation

You can find tips and instructions to ensure you have installed all the required software before starting the course.

course material organization

The course revolves around a sery of jupyter notebooks which take you on your first steps in you python journey.

Each jupyter notebook interleaves theory and examples of codes. We heartily recommend you execute and play around with these bits of code as you follow along : in programming, perhaps even more than anywhere else, practice makes perfect.

Additionally, each notebook is associated with a number of exercises (often in a separate notebook) of varying difficulty, with associated corrections.

If you are attending this course with a teacher (or if you are just curious), you can take a look at our schedule. In short, lessons 00 to 04 deals with generalistic aspect of the python language, while notebooks 05 or 08 present some of the most common modules used in data analysis and/or life sciences.

The notebooks/ folder contains each lesson:

00_jupyter_setup
01_python_basics
02_python_structures
03_reading_writing_files
04_modules
05_module_pandas : handle tabular data data-frames with pandas
06_module_matplotlib : create nice graphics and plots with matplotlib
07_module_biopython : do all kind of bioinformatics with [biopython]](https://biopython.org/)
08_module_numpy_and_scipy : fast numerical computations with numpy + a bit of statistics with scipy.stats

Exercise notebooks:

The data used in the practicals can be found in the data notebooks/data folder, and solutions codes can be found in the notebooks/solutions/ folder (NB: micro-exercises do not have a correction).

You might also like...

Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

Comments

Module 2-create your own functions - text columns

Your tutorials are fantastic! minor format issues: the multiple column format in some pages (ex: module 2 in python training) collapse the text and making it unreadable. Hope to see it fixed to complete the tutorial! thank you.

opened by catalicu 1

Releases(October2022)

October2022(Oct 12, 2022)

course material for the October 2022 edition of the SIB course "First Steps with Python in Life Sciences"
Source code(tar.gz)
Source code(zip)
May2022(May 12, 2022)

Release for the May2022 edition of the course in Basel
Source code(tar.gz)
Source code(zip)

First steps with Python in Life Sciences

Related tags

Overview

First steps with Python in Life Sciences

prerequisite installation

course material organization

You might also like...

Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Theano

Statsmodels: statistical modeling and econometrics in Python

A computer algebra system written in pure Python

ForecastGA is a Python tool to forecast Google Analytics data using several popular time series models.

Multiple Pairwise Comparisons (Post Hoc) Tests in Python

Hidden Markov Models in Python, with scikit-learn like API

Deep universal probabilistic programming with Python and PyTorch

Fast, flexible and easy to use probabilistic modelling in Python.

Comments

Module 2-create your own functions - text columns

Releases(October2022)

October2022(Oct 12, 2022)

May2022(May 12, 2022)

Owner

SIB Swiss Institute of Bioinformatics

Using Data Science with Machine Learning techniques (ETL pipeline and ML pipeline) to classify received messages after disasters.

The Spark Challenge Student Check-In/Out Tracking Script

Snakemake workflow for converting FASTQ files to self-contained CRAM files with maximum lossless compression.

Binance Kline Data With Python

Useful tool for inserting DataFrames into the Excel sheet.

PyClustering is a Python, C++ data mining library.

Modular analysis tools for neurophysiology data

Universal data analysis tools for atmospheric sciences

Helper tools to construct probability distributions built from expert elicited data for use in monte carlo simulations.

PLStream: A Framework for Fast Polarity Labelling of Massive Data Streams

Full ELT process on GCP environment.

Basis Set Format Converter

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

This repo is dedicated to the data extraction and manipulation of the World Bank's database called STEP.

The official repository for ROOT: analyzing, storing and visualizing big data, scientifically

Supply a wrapper ``StockDataFrame`` based on the ``pandas.DataFrame`` with inline stock statistics/indicators support.

A forecasting system dedicated to smart city data

International Space Station data with Python research 🌎

Open source platform for Data Science Management automation

An implementation of the largeVis algorithm for visualizing large, high-dimensional datasets, for R