LAAI – Medium

LAAI

MODEL-ENSEMBLE TRUST-REGION POLICY OPTIMIZATION

ICLR 2018 paper from Thanard Kurutach, Ignashi Clavera, Yan Duan, Aviv Tamar, Pieter Abbeel

Jan 18, 2021

Jan 18, 2021

DDQN, Prioritized Replay, and Dueling DQN

DDQN — Double Deep Q-network, (Hasselt et al, AAAI 2016)

Apr 30, 2020

DDQN, Prioritized Replay, and Dueling DQN

Apr 30, 2020

Double Q-Learning and Value overestimation in Q-Learning

The problem is named maximization bias problem.

Apr 30, 2020

Double Q-Learning and Value overestimation in Q-Learning

Apr 30, 2020

Journey to an 2020 summer internship

In my PhD career, the first year for prelim, and the second year for Qual. In my third year EE PhD career, having an internship might be…

Apr 26, 2020

Journey to an 2020 summer internship

Apr 26, 2020

EWC:Elastic Weight Consolidation

Paper: “Overcoming catastrophic forgetting in neural networks”.

Apr 25, 2020

EWC:Elastic Weight Consolidation

Apr 25, 2020

Learning to Compare: Relation Network for Few-Shot Learning

這篇文章稱之為Relation Network，主要要解決few-shot learning中similairty function的問題，其架構可以自己最佳化出最好的similarity function．類似的文章可以參考Prototypical Network

Apr 24, 2020

Learning to Compare: Relation Network for Few-Shot Learning

Apr 24, 2020

Prototypical Networks for Few-shot Learning

主題：

Apr 24, 2020

Prototypical Networks for Few-shot Learning

Apr 24, 2020

AWS: Jupyter notebook

登入 (Log in AWS with Jupyter and Tensorboard port liked) You can create a shell script to execute the command below. Please replace…

Apr 24, 2020

Apr 24, 2020

Literature Review: Implementation matters in deep policy gradients: a case study on PPO and TRPO

這是一篇ICLR 2019 Oral paper，來自於MIT Logan Engstrom．

Apr 22, 2020

Literature Review: Implementation matters in deep policy gradients: a case study on PPO and TRPO

Apr 22, 2020

Useful Tips in Medium Editing

No words. Just give you frequent useful tips as reference.

Apr 22, 2020

Apr 22, 2020

LAAI

LAAI

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams