fbpx
World Business StrategiesServing the Global Financial Community since 2000

Python Primers: Python for Data Science and Artificial Intelligence

Python Primers: 

Python for Data Science and Artificial Intelligence

Date: Tuesday 13th April 2021

Live and Online: 09.00 – 17.00


Advanced Python Techniques 

Date: Thursday 15th April 2021

Live and Online: 09.00 – 17.00


Python for Data Science and Artificial Intelligence

Tuesday 13th April 2020: 09.00 – 17.00

Overview

Python is the de factolingua franca of data science, machine learning, and artificial intelligence. Familiarity with Python is a must for modern data scientists.

The MLI Python Primers are designed to take you from the very foundations to state-of-the-art use of modern Python libraries.

You will learn the fundamentals of the Python programming language, play with Jupyter notebooks, proceed to advanced Python language features, learn to use distributed task queues (Celery), learn to work with data using NumPy, SciPy, Matplotlib, and Pandas, examine state-of-the-art machine learning libraries (Scikit-Learn, Keras, TensorFlow, and Theano), and complete a realistic, real-life data science lab.


Syllabus:

  • The fundamentals of the Python programming language and Jupyter notebooks
    • Jupyter notebooks
    • The Python syntax
    • Data types, duck typing
    • Data structures: lists, sets, and dictionaries
    • Data types
  • Advanced Python features; distributed tasks queues with Celery
    • List comprehensions
    • Lambdas
    • Objects
    • The Global Interpreter Lock (GIL)
    • Multithreading and multiprocessing
    • Distributed task queues with Celery
  • Python libraries for working with data: NumPy, SciPy, Matplotlib, and Pandas
    • Multidimensional arrays in NumPy
    • Linear algebra and optimisation with SciPy
    • Data visualisation in Matplotlib
    • Time series data
    • Dealing with Pandas DataFrames
  • Machine Learning with Scikit-Learn; Deep Learning with Keras, TensorFlow, and Theano
    • Overview of machine learning
    • Introduction to Scikit-Learn
    • Keras and TensorFlow
    • Introduction to Theano

Advanced Python Techniques

Thursday 15th April 2020: 09.00 – 17.00

09.00 – 12.30: Advanced Python Features and Putting them to use in Practice

  • Algorithmics and graph theory
  • Prime numbers
  • Cryptography
  • Blockchain
  • Distributed Computing with Python

13.30 – 17.00: High Performance Python

 Outline syllabus 

 

·         Profiling 

·         The use of NumPy and SciPy over pure python 

·         The importance of optimised NumPy and SciPy 

·         The use of Cython and ctypes to integrate compiled code 

·         Just In Time Compilation using Numba  

·         Distributed computing frameworks  

·         Numerical precision and speed 

·         Using specialised instead of generalised algorithms 

·         NAG Library for Python  

 

Abstract 

Python is a superb prototyping language that allows us to develop high quality data analyses and simulations in a relatively short amount of time.  The cost we pay for this is performance.  Python is essentially an interpreted and single threaded language which puts severe limitations on its speed.  In this session we will learn a range of techniques that will allow us to discover which parts of our Python code are slow and what we can do to speed things up. 

Paul Bilokon:

Founder, CEO, Thalesians & Senior Quantitative Consultant, BNP Paribas

Paul Bilokon: Founder, CEO, Thalesians & Senior Quantitative Consultant, BNP Paribas

Dr. Paul Bilokon is CEO and Founder of Thalesians Ltd and an expert in electronic and algorithmic trading across multiple asset classes, having helped build such businesses at Deutsche Bank and Citigroup. Before focussing on electronic trading, Paul worked on derivatives and has served in quantitative roles at Nomura, Lehman Brothers, and Morgan Stanley. Paul has been educated at Christ Church College, Oxford, and Imperial College. Apart from mathematical and computational finance, his academic interests include machine learning and mathematical logic.

Jonathan Boyle:

Software Engineer, NAG

Jonathan Boyle: Software Engineer, NAG

Jonathan is a scientist and research software engineer with over 15 years’ experience of high-performance computing. He has contributed to various software projects written in C, C++, Fortran and Python. Most recently, Jonathan has been working at NAG as a software engineer on the EU funded POP project. This work offers services to improve the performance of parallel software, written in a range of languages (including Python), designed to run on HPC hardware, including the world’s largest supercomputers.

MLI Online Resource Library

Prior to the start of the MLI students are given access to our online resource in preparation for the certificate.

MLI Online Primer Resource:

  • Maths Primer Refresher Material: For each topic (Linear Algebra, Optimization, Probability & Statistics), there will be a specific quiz to test initial background knowledge. We recommend that you first attempt answering the quiz exercises without looking at any material.
  • Python Primer for Data Science. Presented by Nikolaos Aletras, Lecturer at The University of Sheffield
  • Recorded Primer: Python for Data Science and Artificial Intelligence. Presented by Paul Bilokon: Founder, CEO, Thalesians & Senior Quantitative Consultant, BNP Paribas
  • Recorded Primer: Advanced Python Techniques. Presented by Paul Bilokon: Founder, CEO, Thalesians & Senior Quantitative Consultant, BNP Paribas

Additional MLI Learning Resource:

  • Big Data, High-Frequency Data, and Machine Learning with kdb+/q Workshop. Presented by Paul Bilokon: Founder, CEO, Thalesians & Senior Quantitative Consultant, BNP Paribas.
  • Big Data and High-Frequency Data with kdb+/q Workshop. Presented by Paul Bilokon: Founder, CEO, Thalesians & Senior Quantitative Consultant, BNP Paribas

 

Level 1: Machine Learning Institute Certificate in Finance 

Level 1: Machine Learning Institute Certificate in Finance


Dates:

  • Level 1 Starts: Tuesday 20th April 2021

ASSIGNMENTS:

Throughout the programme, candidates work on hands-on assignments designed to illustrate the algorithms studied and to experience first hand the practical challenges involved in the design and successful implementation of machine learning models. The data sets and problems are selected to be representative of the applications encountered in finance.


Module 1 – Supervised Learning:

In this module, the concepts related to algorithmically learning from data are introduced. The candidates are given an early taste of a supervised machine learning application before going through the fundamental building blocks starting from linear regression and classification models to kernels and the theory underpinning support vector machines and then to the powerful techniques of ensemble learning.

Module 1 Faculty:

  • Adriano Soares Koshiyama: Research Fellow in Computer Science, University College London

The module includes a combination of theoretical and hands-on assignments:

Module 1 Welcome to The MLI. Supervised Learning Theory: Learning from Data and Linear Models 20-Apr-21
Module 1 Supervised Learning Practical: Learning from Data and Linear Models 27-Apr-21
Module 1 Supervised Learning Theory: Ensemble Models 4-May-21
Module 1 Supervised Learning Practical: Ensemble Models 11-May-21
Module 1 Supervised Learning Theory: Kernel Methods 18-May-21
Module 1 Supervised Learning Practical: Kernel Methods 25-May-21

End of Module 1 Assignment.


Module 2 – Unsupervised Learning:

An important and challenging type of machine learning problems in finance is learning in the absence of ‘supervision’, or without labelled examples.

In this module, we first introduce the theoretical framework of hidden variable models. This family of models is then used to explore the two important areas of dimensionality reduction and
clustering algorithms.

Module 2 Faculty:

  • Paul Bilokon: Founder, CEO, Thalesians & Senior Quantitative Consultant, BNP Paribas
  • Ivan Zhdankin: Associate, Quantitative Analyst, JP Morgan Chase & Co

There are theoretical and applied assignments with financial data sets.

Module 2 Unsupervised Learning Introduction & Dimensionality Reduction 1-Jun-21
Module 2 Unsupervised Learning Practical Lab Session 8-Jun-21
Module 2 Unsupervised Learning Clustering Algorithms 15-Jun-21
Module 2 Unsupervised Learning Applications & Practical Lab Session 22-Jun-21

End of Module 2 Assignment.


Module 3 – Practitioners Approach to ML:

This module focuses on the practical challenges faced when deploying machine learning models within a real life context.

Each session in this module covers a specific practical problem and provides the candidates with guidance and insight about the way to approach the various steps within the model development cycle, from data collection and examination to model testing and validation and results interpretation and communication.

Module 3 Faculty:

  • Paul Bilokon: Founder, CEO, Thalesians & Senior Quantitative Consultant, BNP Paribas
  • Ivan Zhdankin: Associate, Quantitative Analyst, JP Morgan Chase & Co
  • Phil Tooley: HPC Software Engineer, NAG (Numerical Algorithms Group)
Module 3 Practitioner’s Approach Reproducibility and Deployment of Data Science Workflows 29-Jun-21
Module 3 Practitioner’s Approach Feature engineering / Model tuning 6-Jun-21
Module 3 Practitioner’s Approach Introduction to Natural Language Processing and Practical Lab Session 13-Jun-21
Module 3 Practitioner’s Approach Using Natural Language Processing to Predict Bond Prices 20-Jun-21

End of Module 3 Assignment.

Level II: Machine Learning Institute Certificate in Finance

Level 2: Machine Learning Institute Certificate in Finance


Dates:

  • Level 2 Starts: Tuesday 27th June 2021

Module 4 – Neural Networks:

Neural Network models are an important building block to many of the latest impressive machine learning applications on an industrial scale.

This module aims to develop a solid understanding of the algorithms and importantly, an appreciation for the main challenges faced in training them. The module starts with the perceptron model, introduces the key technique of backpropagation before exploring the various regularisation and optimisation routines. More advanced concepts are then covered in relation to the next module on Deep Learning.

Although we cover the theoretical foundations of Neural Networks, the emphasis of the assignments will be on hands-on lab work where the candidates are given the opportunity to experiment with the techniques studied on financial and non-financial data sets.

Module 4 Faculty:

  • Terry Benzschawel: Founder and Principal, Benzschawel Scientific, LLC
  • Alexei Kondratyev: Managing Director, Head of Data Analytics, Standard Chartered Bank
Module 4 Neural Networks Perceptron Model 27-Jul-21
Module 4 Neural Networks Backpropagation 03-Aug-21
Module 4 Neural Networks Regularisation and Optimisation 31-Aug-21
Module 4 Neural Networks Network Architectures 7-Sept-21

End of Module 4 Assignment.


Module 5 – Deep Learning:

Deep Learning has been the driving engine behind many of the recent impressive improvements in the state of the art performance in large scale industrial machine learning applications.

This module can be viewed as a natural follow-up from the previous module on Neural Networks. First, the background and motivations for transitioning from traditional networks to deeper architectures are explored. Then the module covers the deep feedforward architecture, regularisation for deep nets, advanced optimisation strategies and the CNN Architecture.

The assignments of this module will be highly practical with ample opportunity to experiment on financial and non-financial data sets and become familiar with the latest open-source deep learning frameworks and tools.

Module 5 Faculty:

  • Blanka Horvath: Assistant Professor, Financial Mathematics King’s College London
  • Ivan Zhdankin: Associate, Quantitative Analyst, JP Morgan Chase & Co
  • Terry Benzschawel: Founder and Principal, Benzschawel Scientific, LLC
Module 5 Deep Learning Motivation and Examples 14-Sept-21
Module 5 Deep Learning Reinforcement Learning: introduction 21-Sept-21
Module 5 Deep Learning Reinforcement Learning: implementation 28-Sept-21
Module 5 Deep Learning Deep Learning Volatility / Practical Lab Session 5-Oct-21

End of Module 5 Assignment.


Module 6 – Time Series:

Time series data is an invaluable source of information used for future strategy and planning operations everywhere from finance to education and healthcare. You will be walked through the core steps of building, training, and deploying your time series forecasting models. You’ll build a theoretical foundation as you cover the essential aspects of time series representations, modeling, and forecasting before diving into the classical methods for forecasting time series data.

Module 6 Faculty:

  • Francesca Lazzeri: Machine Learning Scientist, Microsoft
Module 6 Time Series Financial Time Series Data 12-Oct-21
Module 6 Time Series Time Series Analysis 19-Oct-21
Module 6 Time Series Practical Lab Session 26-Oct-21

End of Module 6 Quizzes.

Examination & Final Project

Final Examination: 

Examination preparation lecture: Tuesday 2nd November 2021

Examination date: Tuesday 16th November 2021

  • Candidates will sit a formal examination on a computer. The exam is taken online by students globally.

Final Project:

DATE: Friday 17th December 2021

At the end of the programme, candidates apply the theoretical and practical skills acquired to a real world application within the financial services industry.

The assessment will take into account the quality and the originality of the work as well as the clarity of its presentation.