Udacity's Machine Learning Nanodegree project files and lecture notes.
This repository contains project files and lecture notes for Udacity's Machine Learning Engineer Nanodegree program which I started working on in March 2018.
Lecture note reference
Model evaluation and validation
Topics covered in this section:
Model Evaluation
Confusion matrix, F1 score, F-beta score, ROC curve
Model selection
Types of errors, various types of cross validation, learning curves, grid search
Dynamic programming
Iterative policy evaluation, estimation of action values, policy improvement, policy iteration, truncated policy iteration, value iteration
Monte Carlo methods
Predicting state values, estimating action-values, incremental mean, policy evaluation, policy improvement, exploration-exploitation dilemma, GLIE MC control algorithm, constant-alpha GLIE MC control algorithm
Temporal difference learning
TD(0) prediction, action value estimation, solving the control problem,
Sarsamax (Q-learning), expected Sarsa
Deep reinforcement learning
Discrete and continuous spaces, discretization, coarse coding, tile coding, function approximation, kernel functions, coarse coding
Deep Q-Learning
NNs as value functions, Monte Carlo learning, TD learning, Q-learning, Sarsa vs. Q-learning, experience replay, fixed Q-targets, different types of DQNs
Policy-based methods
Policy function approximation, stochastic policy search, policy gradients, Monte Carlo policy gradients, constrained policy gradients
请发表评论