Open in app

Sign in

Write

Sign in

LAAI
LAAI

3 Followers

Home

About

LAAI

LAAI

MODEL-ENSEMBLE TRUST-REGION POLICY OPTIMIZATION

ICLR 2018 paper from Thanard Kurutach, Ignashi Clavera, Yan Duan, Aviv Tamar, Pieter Abbeel

Jan 18, 2021
Jan 18, 2021
LAAI

LAAI

DDQN, Prioritized Replay, and Dueling DQN

DDQN — Double Deep Q-network, (Hasselt et al, AAAI 2016)

Apr 30, 2020
DDQN, Prioritized Replay, and Dueling DQN
DDQN, Prioritized Replay, and Dueling DQN
Apr 30, 2020
LAAI

LAAI

Double Q-Learning and Value overestimation in Q-Learning

The problem is named maximization bias problem.

Apr 30, 2020
Double Q-Learning and Value overestimation in Q-Learning
Double Q-Learning and Value overestimation in Q-Learning
Apr 30, 2020
LAAI

LAAI

Journey to an 2020 summer internship

In my PhD career, the first year for prelim, and the second year for Qual. In my third year EE PhD career, having an internship might be…

Apr 26, 2020
Journey to an 2020 summer internship
Journey to an 2020 summer internship
Apr 26, 2020
LAAI

LAAI

EWC:Elastic Weight Consolidation

Paper: “Overcoming catastrophic forgetting in neural networks”.

Apr 25, 2020
EWC:Elastic Weight Consolidation
EWC:Elastic Weight Consolidation
Apr 25, 2020
LAAI

LAAI

Learning to Compare: Relation Network for Few-Shot Learning

這篇文章稱之為Relation Network,主要要解決few-shot learning中similairty function的問題,其架構可以自己最佳化出最好的similarity function.類似的文章可以參考Prototypical Network

Apr 24, 2020
Learning to Compare: Relation Network for Few-Shot Learning
Learning to Compare: Relation Network for Few-Shot Learning
Apr 24, 2020
LAAI

LAAI

Prototypical Networks for Few-shot Learning

主題:

Apr 24, 2020
Prototypical Networks for Few-shot Learning
Prototypical Networks for Few-shot Learning
Apr 24, 2020
LAAI

LAAI

AWS: Jupyter notebook

登入 (Log in AWS with Jupyter and Tensorboard port liked)  You can create a shell script to execute the command below. Please replace…

Apr 24, 2020
Apr 24, 2020
LAAI

LAAI

Literature Review: Implementation matters in deep policy gradients: a case study on PPO and TRPO

這是一篇ICLR 2019 Oral paper,來自於MIT Logan Engstrom.

Apr 22, 2020
Literature Review: Implementation matters in deep policy gradients: a case study on PPO and TRPO
Literature Review: Implementation matters in deep policy gradients: a case study on PPO and TRPO
Apr 22, 2020
LAAI

LAAI

Useful Tips in Medium Editing

No words. Just give you frequent useful tips as reference.

Apr 22, 2020
Apr 22, 2020
LAAI

LAAI

3 Followers
Following
  • The Medium Blog

    The Medium Blog

  • Bryan Johnson

    Bryan Johnson

  • Alex Wickstrom

    Alex Wickstrom

  • Scott Hwai

    Scott Hwai

  • CP Lu, PhD

    CP Lu, PhD

See all (8)

Help

Status

About

Careers

Press

Blog

Privacy

Terms

Text to speech

Teams