CSCI-5800: Machine Learning
Cross-listed course (ugrad+grad), Lawrence Street Center - 840, 2018
Welcome to Machine learning course. What is machine learning? It is concerned with the question of how to write computer programs that automatically improve with experience. The field has expanded so much in every direction of our daily lives that we mostly are unaware of its presence. Interestingly, there may be two perspectives to this oblivion. First one is that the machine learning applications have greatly improved themselves over time and became part of your lifestyle. However, a completely opposite perspective is that we became part of it, and unknowingly feeding input to the program as if we are in a computer simulation? In this class, you will learn about machine learning, its application in diverse domains, get to implement them yourself and raise awareness of its presence and associated implications in our day-to-day lives. Once again welcome aboard.
The course is based on fundamental knowledge of computer science principles and techniques, probability and statistics, calculus, and the theory and application of linear algebra. The course provides a broad introduction to pattern recognition from given data and how it can relate to machine learning. Topics include: supervised learning, unsupervised learning, semi-supervised learning, neural network / deep learning, and reinforcement learning algorithms. The course will also discuss recent applications of these machine learning concepts, such as in robotics, data mining, autonomous navigation, speech recognition, bioinformatics, meteorology, and so on.
Course objectives
By the end of the course you are expected to gain the following skills:
- Develop an understanding on how to extract patterns from data.
- Develop an understanding on a wide variety of machine learning algorithms – how the algorithms work, and their practical usages.
- Understand different types of optimization techniques which are heavily utilized in many of the learning algorithms.
- Capable of discussing pros and cons of the learning algorithms.
- Be able to implement the covered algorithms in class by themselves using any of the programming language of their choice from the set {Matlab, Python, R}.
- Apply the algorithms to solve real world problems.
Prerequisites
For undergraduates:
- MATH-3195 (Linear algebra and differential equations)
- CSCI-3412 (Algorithms).
For graduate students:
- The
graduate standing
.
Topics covered
- Introduction to Machine learning
- Linear regression
- Overfitting and complexity; training; validation; testing
- Logistic regression
- Classification – Bayesian classifier
- Classification – kNN
- Binary vs. Multi-class classification
- Classification - Decision tree; Ensemble methods: Bagging, random forests, boosting
- Classification – LDA, QDA
- Clustering - kMeans
- Clustering – hierarchical clustering
- Dimensionality reduction: Fisher’s LDA
- Dimensionality reduction: PCA, SVD
- Dimensionality reduction: Kernel PCA
- Classification – Linear SVM
- Classification – Kernel SVM;
- Kernel methods
- Support Vector regression
- Neural networks
- Neural networks / Deep learning (CNN)
- Recommender systems: Nonnegative Matrix Factorization (NMF)
- Recommender systems: IMC, RIMC, collaborative filtering
- Reinforcement learning
- Time series data analysis; Hidden Markov Model
- Markov Random Field vs Conditional Random Field
- Semi-supervised learning
- How to scale your algorithms for the big data analytics?
- Spectral clustering