Posts tagged 'Reinforcement Learning'

[Paper] Recurrent Attention Model Paper Review - 2

Previous post in this series: https://hi-lu.tistory.com/entry/Paper-Recurrent-Attention-Model-%EB%85%BC%EB%AC%B8-%EB%A6%AC%EB%B7%B0-1?category=992577 [Paper] Recurrent Attention Model Paper Review - 1 [SAI club presentation] Papers related to RAM (Recurrent Attention Model). This was originally a single post, but it grew a bit long as I wrote it on this blog, so I am splitting it into two posts. Recently, HAR (Human Activity Recognition), especially vis.. hi-lu.tistory.com Let's continue with a review of the second paper. In this post, the mother paper of the paper from the previous post..

2021.09.05

[Paper] Review of OpenAI's Emergent Tool Use From Multi-Agent Autocurricula

Review of Emergent Tool Use From Multi-Agent Autocurricula. Emergent tool use from multi-agent autocurricula (https://arxiv.org/abs/1909.07528) is the OpenAI paper that got me hooked on multi-agent learning. In it, hiders and seekers learn to play hide-and-seek. There is a video on YouTube: https://www.youtube.com/watch?v=kopoLzvh5jY This hide-and-seek game has six emergent phases in total. As the complexity of the environment increases, the agents learn abilities such as human-like tool use. Introduction 'Human-releva..

2021.09.05

[Paper] RL Recommender System Paper Review 2

Let's explore papers on recommender systems that use reinforcement learning, No. 2. Review part 1: https://hi-lu.tistory.com/entry/PaperRL-Recommender-System-%EB%85%BC%EB%AC%B8-%EB%A6%AC%EB%B7%B0-1 [Paper] RL Recommender System Paper Review 1 Migrating my paper reviews, part 1. Let's look at recommender systems that use reinforcement learning. 1. Deep Learning based Recommender System : A Survey and New Perspectives (https://arxiv.org/abs/1707.07435) Deep Learning based Reco.. hi-lu.tistory.com The previous review explored four papers in total. This time..

2021.09.05

[Paper] RL Recommender System Paper Review 1

Migrating my paper reviews, part 1. Let's look at recommender systems that use reinforcement learning. 1. Deep Learning based Recommender System : A Survey and New Perspectives (https://arxiv.org/abs/1707.07435) Deep Learning based Recommender System: A Survey and New Perspectives With the ever-growing volume of online information, recommender systems have been an effective strategy to overcome such information overload. The utility of recommender systems can..

2021.09.05

lu's Surviving as a Machine Learning Developer
