Výsledky vyhledávání - "Lagoudakis Michael"

Autor: Lagoudakis Michael, LaBean, T. H.

Summarization: DNA self-assembly has been proposed as a way to cope with huge combinatorial NP-HARD problems, such as satisability. However, the algorithmic designs for DNA self-assembly proposed so far are highly dependent on the instance to be solv

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=od______4037::7ee3ed2411316778276197332b9dd7af
http://purl.tuc.gr/dl/dias/F6262134-523A-4CA3-87F8-6A2364848C8F

Zobrazit plný text záznamu

Reinforcement learning as classification: leveraging modern classifiers

Autor: Lagoudakis Michael, Parr, R.

Summarization: The basic tools of machine learning appear in the inner loop of most reinforcement learning algorithms, typically in the form of Monte Carlo methods or function approximation techniques. To a large extent, however, current reinforcemen

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=od______4037::5e72ca786502c3c386f9a2df1413bc7d
http://purl.tuc.gr/dl/dias/78C8B833-D841-436A-82B4-676C1B860269

Zobrazit plný text záznamu

The standard platform league

Autor: Lagoudakis Michael, Chown, E.

Μη διαθέσιμη περίληψη Not available summarization Παρουσιάστηκε στο: 18th RoboCup International Symposium

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=od______4037::da5067f2837fa510792bb095b4cbc483
http://purl.tuc.gr/dl/dias/15CD485B-9FD5-4438-BA2F-0E8D5A238D89

Zobrazit plný text záznamu

Model–free least–squares policy iteration

Autor: Lagoudakis Michael, Parr, R.

Summarization: We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely o policy. We are motivated by the least squares temporal dierence le

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=od______4037::d30f20519f82746ffa95ba001a9df7ca
http://purl.tuc.gr/dl/dias/CDADBEEF-15F4-44B5-89B2-295FEC71FDAE

Zobrazit plný text záznamu

Universal access to mobile computing devices through speech input

Autor: Manaris, Bill, Lagoudakis Michael, MacGyvers Valanne

Summarization: This paper presents results on a user interface model for providing universal access to mobile computing devices. The model uses a continuous speech understanding engine to provide access to a virtual keyboard and mouse through speech

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=od______4037::b018416293762d2e68b8b29f810d5d11
http://purl.tuc.gr/dl/dias/6028C335-384A-4EC0-BC17-98F69362B3B9

Zobrazit plný text záznamu

On the locality of action domination in sequential decision making

Autor: Rachelson, Emmanuel, Lagoudakis Michael

Summarization: In the field of sequential decision making and reinforcement learning, it has been observed that good policies for most problems exhibit a significant amount of structure. In practice, this implies that when a learning agent discovers

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=od______4037::ac6961b1a173199408aaf4f0b61b177d
http://www.researchgate.net/profile/Emmanuel_Rachelson/publication/221186156_On_the_locality_of_action_domination_in_sequential_decision_making/links/0fcfd5051c4eaad94f000000.pdf

Zobrazit plný text záznamu

Semi-autonomous robotic platform for automobile experiments

Autor: Katzourakis, D., Papaefstathiou Ioannis, Lagoudakis Michael

Μη διαθέσιμη περίληψη Not available summarization Παρουσιάστηκε στο: 1st Hellenic Robotics Conference

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=od______4037::d675a1a2ec2ab348bdddb035779af703
http://purl.tuc.gr/dl/dias/438DE77B-2536-4B5E-8F47-E5288E7E504F

Zobrazit plný text záznamu

Learning in zero–sum team Markov games using factored value functions

Autor: Lagoudakis Michael, Parr, R.

Summarization: We present a new method for learning good strategies in zero-sum Markov games in which each side is composed of multiple agents collaborating against an opposing team of agents. Our method requires full observability and communication

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=od______4037::402e323e2f463bf23c0c6ef0cafebcd2
http://purl.tuc.gr/dl/dias/FBB6EA9E-B181-4D39-8F6C-4DDB3B0278DA

Zobrazit plný text záznamu

Least-squares policy iteration

Autor: Lagoudakis Michael, Parr Ronald

Δημοσίευση σε επιστημονικό περιοδικό Summarization: We propose a new approach to reinforcement learning for control problems which combines value-function approximation with linear architectures and approximate policy

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=od______4037::67d4be44b24f89568b104182fed8ce1f
http://purl.tuc.gr/dl/dias/23F40241-5991-47D2-A68C-3B6EB59A4567

Zobrazit plný text záznamu

Learning to select branching rules in the DPLL procedure for satisfiability

Autor: Lagoudakis Michael, Littman, M.

Summarization: The DPLL procedure is the most popular complete satisfiability (SAT) solver. While its worst case complexity is exponential, the actual running time is greatly affected by the ordering of branch variables during the search. Several bra

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=od______4037::ac32d9f4d1d5e6c8c4327f0f33999e07
http://purl.tuc.gr/dl/dias/85854029-9F0F-4C83-9CBA-F91FC42A21D0

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání