Reinforcement Learning In A Simple Card Game
Monte Carlo and backward-view Sarsa model-free control, with function approximation using linear combinations of features, to solve the card game Tuppence Hape...
My implementation of softmax regression and shallow neural networks, applied to the CIFAR-10 dataset for object recognition.
Slides from my presentation on feed-forward neural networks.
My implementation of the multi-armed bandit problem outlined in Sutton and Barto’s Reinforcement Learning: An Introduction.