(Fall’23): CSCI-5930 Machine Learning

Graduate course, SCIENCE-2001 (Tue only), 2023

Welcome to Machine learning course. What is machine learning? It is concerned with the question of how to write computer programs that automatically improve with experience. Over the very recent years, the field has expanded so much in every direction of our daily lives that we mostly are unaware of its existence. But, as a concerned citizen of the world, we are going to know the nuts and bolts of machine learning.

The course is based on fundamental knowledge of computer science principles and techniques, probability and statistics, calculus, and the theory and application of linear algebra. The course provides a broad introduction to pattern recognition from given data and how it can relate to machine learning. Topics include supervised learning, unsupervised learning, semi-supervised learning, neural network, and reinforcement learning algorithms. The course will also discuss recent applications of these machine learning concepts in solving real-world problems.

Course objectives

By the end of the course you are expected to gain the following skills:

  1. Develop an understanding on how to extract patterns from data.
  2. Develop an understanding on a wide variety of machine learning algorithms – how the algorithms work, and their practical usages.
  3. Understand different types of optimization techniques, which are heavily utilized in many of the learning algorithms.
  4. Capable of discussing pros and cons of the learning algorithms.
  5. Be able to implement the covered algorithms in class by themselves using Python programming language.
  6. Apply the algorithms to solve real world problems.
  7. Understand and apply the principles of responsible AI system design: fairness, inclusiveness, transparency, reliability and safety, privacy and security, accountability, limits of capabilities.

Prerequisites

  1. The graduate standing.

Schedule

Week 1 [8/21/2023 – 8/25/2023]

  1. Introduction to Machine learning [ PDF ] [ Video-Recording, 19:46]
  2. I am wondering you want to become a data scientists [ PDF ] [Video-Recording, 17:00]
  3. Challenges with Machine learning system design [ PDF ] [Video-Recording, 26:09]
  4. Stories of Machine Learning [ PDF ][ Video-Recording, 58:06]
  5. Nuts and Bolts of Machine Learning (part 1): The types of learning [ PDF ][ Video-Recording, 34:31]
  6. Nuts and Bolts of Machine Learning (part 2): The task and the datasets [ PDF ] [ Video-Recording, 32:37 ]
  7. Nuts and Bolts of Machine Learning (part 3): How to tell the machine learning model is good? [ PDF ] [ Video-Recording, 52:21 ]
  8. Nuts and Bolts of Machine Learning (part 4): Exploratory Data analysis – a case study: House price prediction [ zip ] [ Video-Recording, 23:02 ]

Week 2 [8/28/2023 – 9/1/2023]

  1. Regression, and Simple linear regression [ PDF, slides 1-24 ] [ Video-Recording, 45:24 ]
  2. Multiple Linear regression [ PDF, slides 25-32 ] [ Video-Recording, 27:18]
  3. Comments on calculating gradient of linear regression loss function [ PDF, slides 51 and onward ] [ Video-Recording, 16:18]
  4. Gradient Descend Algorithm [ PDF, slides 33-49 ] [ Video-Recording, 31:06 ]

Weeks 3 [9/4/2023 – 9/8/2023]

  1. Evaluating Regression models [ PDF, slide 50 ] [ Video-Recording, 13:05 ]
  2. Introduction to classification, and kNN classifier [ PDF] [ Video-Recording, 53:06 ]
  3. Another look at evaluating linear regression models [Slide PDF] [ Video-Recording, 28:41]
  4. Bias-variance tradeoffs, Overfitting, Underfitting [Slide PDF] [ Video-Recording, 26:50 ] [ Another supporting slide]
  5. Dataset splits: training, testing, validation, cross-validation [Slide PDF] [ Video-Recording, 16:23 ]
  6. Label encoding, One-hot encoding [Slide PDF] [Video_Recording, 10:57]
  7. Animation of Gradient Descent code Zip

Weeks 4 [9/11/2023 – 9/15/2023]

  1. Logistic regression for binary classification [Slide PDF] [ Video-Recording, 54:59]
  2. Binary to multi-class classification [Slide PDF][ Video-Recording, 52:24]
  3. Multi-label classification [Slide PDF][ Video-Recording, 50:31]

Weeks 5 [9/18/2023 – 9/22/2023]

  1. Introduction to clustering [Slide PDF][ Video-Recording, 1:03:08]
  2. k-means clustering algorithm [Slide PDF][ Video-Recording, 1:05:18]
  3. Hierarchical clustering algorithm [Slide PDF] [ Video-Recording, 1:15:03 ]

Weeks 6 [9/25/2023 – 9/29/2023]

  1. Principal Component Analysis (PCA) for Dimensionality Reduction [Link 1] [Notebook] [Video-Recording, 50:44]
  2. Naive Bayes Classifier : The Bayes’ Theorem [ PDF ] [ Video-Recording, 14:39 ]
  3. Naive Bayes Classifier : A simple math problem [ PDF] [ Video-Recording, 7:25]
  4. Naive Bayes Classifier : Idea behind the classification algorithm [PDF] [ Video-Recording, 11:47]
  5. Definition of Naive Bayes Classifier [ PDF ] [ Video-Recording, 10:01]
  6. Naive Bayes Classifier in action [PDF] [ Video-Recording, 22:01 ]
  7. Naive Bayes Classifier : few implementation facts [ PDF] [ Video-Recording, 5:42 ]

Week 7 [10/2/2023 – 10/6/2023]

  1. Ensemble Learning [ PDF ] [ Video-Recording, 52:39 ]
  2. Ridge Regression [ PDF ] [ Video-Recording, 1:07:23 ]
  3. LASSO Regression [ PDF ] [ Video-Recording, 37:05]
  4. Support Vector Machines :: [ PDF ]

Week 8 [10/9/2023 – 10/13/2023]

  1. Diverse Counterfactual Explanations [ Paper, Slides ] [Talk, 8:10]

Weeks 9 [10/17/2023 – 10/21/2023]

  1. Reinforcement Learning applications (a higher-level overview)
  2. Introduction to Reinforcement Learning with OpenAI-Gym [ Notebook/Slide ] [ Github-Repo ] [ Video-Recording, 37:11 ]
  3. Working with OpenAI-Gym environments [Notebook/Slides: ALE/Breakout, Blackjack, CarRacing, ALE/Pong, ALE/Riverraid ][ Github-Repo ] [ Video-Recording, 16:00 ]
  4. Making of an Intelligent CartPole agent [Notebook/Slides: Random CartPole Agent, Q-learning CartPole Agent][ Github-Repo ] [ Video-Recording, 48:27]
  5. Non-gym environment and Reinforcement learning from scratch [Notebook/Slides: Goal-vs-Hole-v0, Goal-vs-Hole-v1][ Github-Repo ] [ Video-Recording, 35:09 ]
  6. Multi-bandit problem [Notebook+slides: ashiskb@github link] [A supporting video lecture by Connor Shorten] [Reference study: SB-2]
  7. Markov Decision Process (MDP), Dynamic Programming for action selection intro [ Slides] [Video-Lecture by Dr. B] [Reference study: SB-3, 4]

Week 10 [10/23/2023 – 10/27/2023]

  1. Introduction to Artificial neural networks (ANN) and deep learning [ PDF ] [Video Recording, 24:19]
  2. Artificial Neural Networks – the basics [ PDF ], [ Jupyter notebook ]

Week 11, 12 and 13 [10/30/2023 – 11/17/2023]

  1. Compute framework for Artificial Neural Network + Deep Learning [ PDF ] [Video Recording, 01:00:03]
  2. Tensorflow 2.0 [ PDF ] [Video Recording, 32:56]
  3. Introduction to Tensors [Jupyter Notebook] [Video Recording, 01:02:38]
  4. Backpropagation with Tensorflow 2.0 [Jupyter Notebook] [Video Recording, 01:08:55]
  5. Introduction to PyTorch [This is a 2 part video lecture:]
  6. Convolution Neural Network
    1. The intuition [ Notebooks ][Video Recording, 30:52]
    2. Convolution Operation, 1x1 conv, implementation aspects [Notebooks] [Video Recording, 01:27:10]
    3. The pooling layer [Notebook] [Video Recording, 11:11]
    4. Flattened layer and fully connected layer [Notebooks] [Video Recording, 14:08]
    5. Few examples [Notebooks] [Video Recording, 14:16]
    6. Epilog + comment on weight sharing property [Notebooks] [Video Recording, 09:02]
  7. Recurrent Neural Networks
    1. Introduction [Notebooks] [Video Recording, 38:20]
    2. Basic language modeling with Recurrent Neural Networks [Notebooks] [Video Recording, 29:15]
    3. Vanilla architecture of RNN [Notebooks] [Video Recording, 58:49]
    4. Long short term memory (LSTM) architecture of RNN [Notebooks] [Video Recording, 44:17]
  8. Transformers [ link to required study materials]
  9. Natural Language Processing (Part 1)