Anuket Project

Algo-Selector

Created by Sridhar Rao, last modified by Kanak Raj on Jul 30, 2021

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Volunteers

Name	ML Category
Jahanvi	Supervised
Akanksha	Unsupervised
Kanak Raj	Reinforced

Supervised

Algorithms

Name	Comments on Applicability	Reference

Un-supervised

Algorithms

Name	Comments on Applicability	Reference

Reinforcement Learning

Active Learning
No labeled data
Can afford to make mistakes?
Is it possible to use a simulated environment for the task?
Lots of time
Think about the variables that can define the state of the environment.

State Variables and Quantify them
The agent has access to these variables at every time step
Concrete Reward Function and Compute Reward after action
Define Policy Function

Algorithms

Name	Comments on Applicability	Reference
Q Learning

No labels