Course Info
Main advantage: quicker Data Processing with Python in your company; ability to work with large data that you cannot process in Excel
Duration: 5 days
More information:
- Duration: 7 hours each day brutto (i.e. including breaks) + consultations after every course day
- Format: workshop (70% workshop / 30% lecture)
- Venue: online or client’s office or other place chosen by the client, in Europe
- Enrollment: in-house on-site course for a group of people within one company
- Group size: max 10 delegates
- Course language: English or Polish or both during the same training
Course Syllabus
- Python Builtin Data Structures
- Lists
- Tuples
- Nested Structures
- Dictionaries
- List Comprehension
- Other Essential Python Concepts
- Python Functions
- Positional and Keyword Arguments
- Default Parameter Values
- Pandas Fundamentals
- Datasets
- Import and Export from CSV
- Import and Export from Databases and Fixed Width Format Files
- Essential Transformations (Transpose, Selecting Single or Multiple Rows and Columns, Accessing Index)
- Fundamental Data Statistics
- Quick Data Visualization
- Modifying DataFrame (Adding and Deleting Columns)
- Processing Missing Values
- Aggregations (Sum, Count, Mean, Std etc.)
- Filtering with Masks
- Filtering by Timestamps on Time Serieses
- Filtering on Index
- Advanced Filtering Conditions
- Split-Apply-Combine Pattern (aka Groupby)
- Accessing Data from REST APIs
- Wide vs Long Format
- HTTP Protocol
- REST Fundamentals
- Requests and Response Structure
- Postman for Experimentation
- requests Library
- Advanced Case Study on Real and Complex API (7 Endpoints)
- How to Deal with Pagination?
- Data Wrangling
- Renaming Columns
- Type Conversion
- Categorical Columns
- Sorting Data
- Sorting by Index
- Setting and Reseting Index
- Reindexing
- Vectorised Conditionals
- Appending Data
- Pivoting Data (Converting from Long to Wide Format)
- Unstacking Data (Advanced Pivoting)
- Melting Data (Converting from Wide to Long Format)
- Stacking Data (Advanced Melting)
- Advanced Data Processing
- Where Clause for Easier Filtering
- Joins
- Inner vs Left vs Right vs Outer Joins
- Joins on Index
- Z-Score for Easier Outlier Detection
- Ranking
- Percentage Change
- Binning
- Quantile Binning
- Clipping
- Thresholds
- Aggregations
- Applying an Operation on All Columns
- Rolling Window
- Rolling Window on Time Serieses
- Expanding Window
- Cumulative Sum
- Advanced Groupby
- Groupby by Timestamp on Time Serieses
- Groupby on Multiple Columns
- Aggregating Using Different Functions
- Pivot Tables
- Cross Tabs
- Cross Tabs with Margins
- Time Series Specific Operations
- Time-Based Selection
- Aggregation on Time Serieses
- Shifting and Lagging
- Differenced Data
- Resampling
- Working with OHLC (Open-High-Low-Close) Data
- Upsampling
- Merging Datasets with Different Timestamps
- Data Visualisation with matplotlib
- Matplotlib Library
- Line and Point Plots
- Visualising Data from Rolling Windows
- Plot Components
- Plotting Multiple Serieses
- Matplotlib Integration with Pandas
- Subplots
- Hexbins
- Correlation Matrix
- Histograms
- Kernel Density Estimation
- Cumulative Distribution Function
- Advanced Data Visualisation
- Boxplots
- Groupby on a Single Plot
- Bar Charts
- Bar Charts with Groups of Columns
- Stacked Bar Charts
- Normalised Stacked Bar Charts
- Scatter Matrixes
- Lag Plots
- Autocorrelation Plots
- BigData & Pandas
- dask Library
- dask as a Task Scheduler
- Local Computational Cluster
- dask.DataFrame
- Alternatives
Benefits for the Sponsor
As the course sponsor or HR you get:
- Analysis of the needs and my help to choose or design a great course during a phone call with the sponsor, HR, team leader or/and course delegates. On top of that, we ask delegates on the very first day what their needs are, to make even better usage of the course time.
- Course customisation to your needs.
- Guarantee that the course is conducted by an expert that worked for Google.
- Course evaluation as an electronic form at the end of the last course day. The evaluation results are sent to interested people (most of often they’re course sponsor and HR).
- Simple communication – you can contact the trainer directly by phone or email.
- Easy buying procedure – one call or email is enough to get offer and to book a date. I don’t do overbooking. The course is confirmed once you send the Purchase Order.
- Friendly business partner – as a rule, I treat all my clients like friends. I don’t build walls, I’m not pretending to be a huge training company and I write in first person.
Clients very often decide to order other training (including dedicated courses) after observing positive results of this course.
Benefits for Delegates
Delegates will benefit because of:
- Seven hours course every day (including breaks)
- Consultations after every course day.
- Support after the course, via email and phone.
- Setup instruction before the course to save time at the beginning of the course. I’m happy to help you via email, phone or Skype, zoom.us etc. in case of any questions or issues.
- Course materials consisting of code snippets, comments, exercises and solutions. The entire courseware is a single web page which make it very easy to lookup something there. Courseware is available online during and after the training. Delegates can download it to use it offline. Courseware can be updated during the course in real time, so that we can include comments or entire new sections suggested by delegates.
- Environment ready to use after the course – we don’t use virtual machines. Instead, we install everything on delegates machines, so that they can reuse the same setup after the course.
- Recording of the training (in case of online training)
References
Below you can find some references.

Very inspiring training. I really appreciate the way Chris managed to walk us through the complex world of machine learning using Python. Good course materials updated real time. Highly recommend.
Krzysztof Gębal
Finance Director at DNB Bank Polska S.A.

Well prepared training and reasonably passed knowledge, thanks to which we develop better services.
Arkadiusz Baraniecki
Infrastructure Team Manager at allegro.pl

Chris recently taught a four day class on Machine Learning with Python four our team. The class was very good with the right balance of theory and practice. I cannot think of a better way to give a four day class about such an extensive topic.
Nicolas Leveroni
Head of Krakow Product Control Analytics at HSBC
You can read more references here.