LAAI – Medium

LAAI

MODEL-ENSEMBLE TRUST-REGION POLICY OPTIMIZATION

ICLR 2018 paper from Thanard Kurutach, Ignashi Clavera, Yan Duan, Aviv Tamar, Pieter Abbeel

1 min readJan 18, 2021

--

--

LAAI

DDQN, Prioritized Replay, and Dueling DQN

DDQN — Double Deep Q-network, (Hasselt et al, AAAI 2016)

4 min readApr 30, 2020

--

DDQN, Prioritized Replay, and Dueling DQN

--

LAAI

Double Q-Learning and Value overestimation in Q-Learning

The problem is named maximization bias problem.

4 min readApr 30, 2020

--

Double Q-Learning and Value overestimation in Q-Learning

--

LAAI

Journey to an 2020 summer internship

In my PhD career, the first year for prelim, and the second year for Qual. In my third year EE PhD career, having an internship might be…

6 min readApr 26, 2020

--

Journey to an 2020 summer internship

--

LAAI

EWC:Elastic Weight Consolidation

Paper: “Overcoming catastrophic forgetting in neural networks”.

4 min readApr 25, 2020

--

EWC:Elastic Weight Consolidation

--

LAAI

Learning to Compare: Relation Network for Few-Shot Learning

這篇文章稱之為Relation Network，主要要解決few-shot learning中similairty function的問題，其架構可以自己最佳化出最好的similarity function．類似的文章可以參考Prototypical Network

2 min readApr 24, 2020

--

Learning to Compare: Relation Network for Few-Shot Learning

--

LAAI

Prototypical Networks for Few-shot Learning

主題：

2 min readApr 24, 2020

--

Prototypical Networks for Few-shot Learning

--

LAAI

AWS: Jupyter notebook

登入 (Log in AWS with Jupyter and Tensorboard port liked) You can create a shell script to execute the command below. Please replace…

1 min readApr 24, 2020

--

--

LAAI

Literature Review: Implementation matters in deep policy gradients: a case study on PPO and TRPO

這是一篇ICLR 2019 Oral paper，來自於MIT Logan Engstrom．

4 min readApr 22, 2020

--

Literature Review: Implementation matters in deep policy gradients: a case study on PPO and TRPO

--

LAAI

Useful Tips in Medium Editing

No words. Just give you frequent useful tips as reference.

1 min readApr 22, 2020

--

--

LAAI

LAAI

Following

Help
Status
About
Careers
Blog
Privacy
Terms
Text to speech
Teams