Python has in recent years become one of the most popular programming languages, and it has found many applications in both business and scientific research. Unlike many other programming languages used in scientific research, Python is not developed specifically for statistical analysis. Instead, it is a general-purpose programming language.
This one-week intensive seminar will focus on getting you started with using Python for data analysis. Data analysis is an important task for both businesses and researchers. However, the data that we need to analyze is often organized in a way that is unsuitable for analysis. The course will focus on how to use Python to convert raw data into tidy data sets that we can use for data analysis. You will also learn how to use Python to summarize and communicate the information in tidy data sets through basic data analysis and visualization.
The course will start with a general introduction to Python. The rest of the course will focus on how to use Python for cleaning and analyzing data. The course is intended for students without any prior knowledge of Python, and for students with some prior knowledge of Python and who wish to learn how to use Python for data analysis.
The course consists of three modules:
- Getting started - introduction to Python and Jupyter Notebook
- Importing and cleaning data
- Analyzing and visualizing data
After successful completion of the course, you will be able to perform data analysis in Python. The course will also give you the foundation that you need for continuing to learn Python and how you can use it to solve a large variety of problems encountered in your academic and professional life.