LAAI · MODEL-ENSEMBLE TRUST-REGION POLICY OPTIMIZATION · ICLR 2018 paper from Thanard Kurutach, Ignasi Clavera, Yan Duan, Aviv Tamar, Pieter Abbeel · 1 min read · Jan 18, 2021
LAAI · DDQN, Prioritized Replay, and Dueling DQN · DDQN: Double Deep Q-Network (van Hasselt et al., AAAI 2016) · 4 min read · Apr 30, 2020
LAAI · Double Q-Learning and Value Overestimation in Q-Learning · The problem is known as the maximization bias problem. · 4 min read · Apr 30, 2020
LAAI · Journey to a 2020 summer internship · In my PhD career, the first year was for the prelim and the second year for the qual. In the third year of my EE PhD, having an internship might be… · 6 min read · Apr 26, 2020
LAAI · EWC: Elastic Weight Consolidation · Paper: “Overcoming catastrophic forgetting in neural networks”. · 4 min read · Apr 25, 2020
LAAI · Learning to Compare: Relation Network for Few-Shot Learning · This paper, known as Relation Network, mainly addresses the problem of the similarity function in few-shot learning: its architecture can learn the best similarity function on its own. For a related paper, see Prototypical Network. · 2 min read · Apr 24, 2020
LAAI · AWS: Jupyter notebook login · (Log in to AWS with Jupyter and TensorBoard ports linked) You can create a shell script to execute the command below. Please replace… · 1 min read · Apr 24, 2020
LAAI · Literature Review: Implementation matters in deep policy gradients: a case study on PPO and TRPO · This is an ICLR 2019 oral paper from Logan Engstrom of MIT. · 4 min read · Apr 22, 2020
LAAI · Useful Tips in Medium Editing · No long explanations, just frequently used tips for reference. · 1 min read · Apr 22, 2020