First steps with Python in Life Sciences

Last update: Jan 08, 2023

Overview

First steps with Python in Life Sciences

This course material is part of the "First Steps with Python in Life Science" three-day course of SIB-training and is addressed to beginners wanting to become familiar with the Python syntax, environment, and the most common commands.

This course material provides an introduction to python and jupyter notebooks (a web based notebook system for creating and sharing computational documents) in an interactive manner.

prerequisite installation

You can find tips and instructions to ensure you have installed all the required software before starting the course.

course material organization

The course revolves around a sery of jupyter notebooks which take you on your first steps in you python journey.

Each jupyter notebook interleaves theory and examples of codes. We heartily recommend you execute and play around with these bits of code as you follow along : in programming, perhaps even more than anywhere else, practice makes perfect.

Additionally, each notebook is associated with a number of exercises (often in a separate notebook) of varying difficulty, with associated corrections.

If you are attending this course with a teacher (or if you are just curious), you can take a look at our schedule. In short, lessons 00 to 04 deals with generalistic aspect of the python language, while notebooks 05 or 08 present some of the most common modules used in data analysis and/or life sciences.

The notebooks/ folder contains each lesson:

00_jupyter_setup
01_python_basics
02_python_structures
03_reading_writing_files
04_modules
05_module_pandas : handle tabular data data-frames with pandas
06_module_matplotlib : create nice graphics and plots with matplotlib
07_module_biopython : do all kind of bioinformatics with [biopython]](https://biopython.org/)
08_module_numpy_and_scipy : fast numerical computations with numpy + a bit of statistics with scipy.stats

Exercise notebooks:

The data used in the practicals can be found in the data notebooks/data folder, and solutions codes can be found in the notebooks/solutions/ folder (NB: micro-exercises do not have a correction).

You might also like...

Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

Comments

Module 2-create your own functions - text columns

Your tutorials are fantastic! minor format issues: the multiple column format in some pages (ex: module 2 in python training) collapse the text and making it unreadable. Hope to see it fixed to complete the tutorial! thank you.

opened by catalicu 1

Releases(October2022)

October2022(Oct 12, 2022)

course material for the October 2022 edition of the SIB course "First Steps with Python in Life Sciences"
Source code(tar.gz)
Source code(zip)
May2022(May 12, 2022)

Release for the May2022 edition of the course in Basel
Source code(tar.gz)
Source code(zip)

First steps with Python in Life Sciences

Related tags

Overview

First steps with Python in Life Sciences

prerequisite installation

course material organization

You might also like...

Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Theano

Statsmodels: statistical modeling and econometrics in Python

A computer algebra system written in pure Python

ForecastGA is a Python tool to forecast Google Analytics data using several popular time series models.

Multiple Pairwise Comparisons (Post Hoc) Tests in Python

Hidden Markov Models in Python, with scikit-learn like API

Deep universal probabilistic programming with Python and PyTorch

Fast, flexible and easy to use probabilistic modelling in Python.

Comments

Module 2-create your own functions - text columns

Releases(October2022)

October2022(Oct 12, 2022)

May2022(May 12, 2022)

Owner

SIB Swiss Institute of Bioinformatics

This is a repo documenting the best practices in PySpark.

Building house price data pipelines with Apache Beam and Spark on GCP

Analytical view of olist e-commerce in Brazil

t-SNE and hierarchical clustering are popular methods of exploratory data analysis, particularly in biology.

PySpark bindings for H3, a hierarchical hexagonal geospatial indexing system

Automatic earthquake catalog building workflow: EQTransformer + Siamese EQTransformer + PickNet + REAL + HypoInverse

NFCDS Workshop Beginners Guide Bioinformatics Data Analysis

COVID-19 deaths statistics around the world

PCAfold is an open-source Python library for generating, analyzing and improving low-dimensional manifolds obtained via Principal Component Analysis (PCA).

Employee Turnover Analysis

Methylation/modified base calling separated from basecalling.

A columnar data container that can be compressed.

Monitor the stability of a pandas or spark dataframe ⚙︎

Transform-Invariant Non-Negative Matrix Factorization

Churn prediction with PySpark

Full automated data pipeline using docker images

A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).

Used for data processing in machine learning, and help us to construct ML model more easily from scratch

This repository contains some analysis of possible nerdle answers

Find exposed data in Azure with this public blob scanner