Farshid is creating Deep Reinforcement Learning (DRL) applications

0 patrons, $0 per creation

Reinforcement Learning Methods
  • Dynamic Programming (DP) [89]
  • Monte Carlo [94]
  • Temporal-Difference (TD) Learning [88]
  • Q-Learning (Off-policy TD algorithm) [89]
  • Sarsa (On-policy TD algorithm) [94]
  • R-Learning (learning of relative values) [93]
  • Function Approximation methods (Least-Squares Temporal Difference, Least-Squares Policy Iteration) [96]
  • Policy Search / Policy Gradient [99]
  • Hierarchical RL [99]
  • Deep Learning + Reinforcement Learning [2014]
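
The list above labels Q-Learning as off-policy and Sarsa as on-policy. A minimal tabular sketch of the two update rules may make that difference concrete. The function names, the list-of-lists Q-table, and the default values of alpha and gamma are assumptions chosen for illustration, not taken from this page.

```python
# Illustrative sketch only: names, table layout, and defaults are assumptions.

def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy TD update: bootstrap from the best action in s_next,
    regardless of which action the agent actually takes next."""
    target = r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (target - Q[s][a])
    return Q[s][a]


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy TD update: bootstrap from the action a_next that the
    agent's current policy actually selected in s_next."""
    target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (target - Q[s][a])
    return Q[s][a]


def fresh_table():
    # Two states, two actions; state 1 already has some value estimates.
    return [[0.0, 0.0], [1.0, 2.0]]


# Same transition (s=0, a=0, r=1.0, s_next=1) under both rules.
q_value = q_learning_update(fresh_table(), 0, 0, 1.0, 1)
sarsa_value = sarsa_update(fresh_table(), 0, 0, 1.0, 1, a_next=0)
print(q_value, sarsa_value)
```

With the same transition, the Q-Learning target uses the larger value in the next state (2.0), while the Sarsa target uses the value of the action actually chosen (1.0 here), so the two estimates differ whenever the next action is not the greedy one.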
Tiers
Deep Reinforcement Learning
$1 or more per creation
