Talk:End-to-end reinforcement learning

This redirect does not require a rating on Wikipedia's content assessment scale.
It is of interest to the following WikiProjects:

Articles for creation

	This redirect was reviewed by member(s) of WikiProject Articles for creation. The project works to allow users to contribute quality articles and media files to the encyclopedia and track their progress as they are developed. To participate, please visit the project page for more information.
	This redirect was accepted from this draft on 4 April 2017 by reviewer SwisterTwister (talk · contribs).

Engineering

This redirect is within the scope of WikiProject Engineering, a collaborative effort to improve the coverage of engineering on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.

Science

This redirect is within the scope of WikiProject Science, a collaborative effort to improve the coverage of science on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.

Computer science

This redirect is within the scope of WikiProject Computer science, a collaborative effort to improve the coverage of computer science-related articles on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.

This redirect has been automatically rated by a bot or other tool because one or more other projects use this class. Please ensure the assessment is correct before removing the |auto= parameter.

This article was nominated for merging with deep reinforcement learning at 19:59, 24 November 2020 (UTC). The result of the discussion (permanent link) was to merge end-to-end reinforcement learning into the deep reinforcement learning page.

Merge with article on deep reinforcement learning?

The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.

The result of this discussion was to merge this article into deep RL. Anair13 (talk) 22:35, 31 October 2021 (UTC)

The term "end-to-end reinforcement learning" is just another way to refer to "deep reinforcement learning", but deep RL is the more formal term. This article is actually better at giving examples of deep RL, but the deep RL page is more informative and descriptive. I think these articles should be merged. Anair13 (talk) 19:59, 24 November 2020 (UTC)

Went ahead and merged. Anair13 (talk) 02:30, 1 December 2020 (UTC)

Can I ask why this was restored? Especially without discussion, after I had started a discussion on it? I don't think there is any formal distinction between deep reinforcement learning and "end-to-end reinforcement learning", and I would challenge someone to find a citation saying that there is, both to justify keeping this page and to make the distinction clear on it. They both just vaguely mean reinforcement learning with function approximation to handle raw inputs. Moreover, this page is unbalanced towards the work of one author, Katsunari Shibata (perhaps a violation of Wikipedia:Neutral_point_of_view), to the exclusion of much more famous and foundational work. Anair13 (talk) 17:00, 27 October 2021 (UTC)

I apologize for restoring this page without discussion. Deep reinforcement learning refers to the use of a deep neural network. In most cases, a convolutional neural network is used with raw images as input. On the other hand, "end-to-end reinforcement learning" means that the process being learned must run from one end (usually sensors) to the other (usually actuators). Therefore, the two are similar, but not exactly the same. However, as you said, this page is not balanced, and that should be fixed. Therefore, for now, I am in favor of merging this page into deep reinforcement learning. Pioneerest (talk) 20:08, 28 October 2021 (UTC)

OK, thanks for the reply. If we are on the same page, shall we go ahead and merge it then? I had already merged the contents of this page into deep reinforcement learning, including the history parts and a note about end-to-end reinforcement learning. Is there anything else you want to add to the deep RL page? Anair13 (talk) 17:38, 30 October 2021 (UTC)

Yes, I would like to ask you to merge this page into deep reinforcement learning. I don't have anything more to add to the deep RL page at the moment. Thank you. Pioneerest (talk) 23:03, 30 October 2021 (UTC)

OK, will do! Anair13 (talk) 22:35, 31 October 2021 (UTC)

The discussion above is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.
