Zobrazeno 1 - 10
of 93
pro vyhledávání: '"Lagoudakis Michael"'
Autor:
Lagoudakis Michael, LaBean, T. H.
Summarization: DNA self-assembly has been proposed as a way to cope with huge combinatorial NP-HARD problems, such as satisability. However, the algorithmic designs for DNA self-assembly proposed so far are highly dependent on the instance to be solv
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______4037::7ee3ed2411316778276197332b9dd7af
http://purl.tuc.gr/dl/dias/F6262134-523A-4CA3-87F8-6A2364848C8F
http://purl.tuc.gr/dl/dias/F6262134-523A-4CA3-87F8-6A2364848C8F
Autor:
Lagoudakis Michael, Parr, R.
Summarization: The basic tools of machine learning appear in the inner loop of most reinforcement learning algorithms, typically in the form of Monte Carlo methods or function approximation techniques. To a large extent, however, current reinforcemen
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______4037::5e72ca786502c3c386f9a2df1413bc7d
http://purl.tuc.gr/dl/dias/78C8B833-D841-436A-82B4-676C1B860269
http://purl.tuc.gr/dl/dias/78C8B833-D841-436A-82B4-676C1B860269
Autor:
Lagoudakis Michael, Chown, E.
Μη διαθέσιμη περίληψη Not available summarization Παρουσιάστηκε στο: 18th RoboCup International Symposium
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______4037::da5067f2837fa510792bb095b4cbc483
http://purl.tuc.gr/dl/dias/15CD485B-9FD5-4438-BA2F-0E8D5A238D89
http://purl.tuc.gr/dl/dias/15CD485B-9FD5-4438-BA2F-0E8D5A238D89
Autor:
Lagoudakis Michael, Parr, R.
Summarization: We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely o policy. We are motivated by the least squares temporal dierence le
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______4037::d30f20519f82746ffa95ba001a9df7ca
http://purl.tuc.gr/dl/dias/CDADBEEF-15F4-44B5-89B2-295FEC71FDAE
http://purl.tuc.gr/dl/dias/CDADBEEF-15F4-44B5-89B2-295FEC71FDAE
Summarization: This paper presents results on a user interface model for providing universal access to mobile computing devices. The model uses a continuous speech understanding engine to provide access to a virtual keyboard and mouse through speech
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______4037::b018416293762d2e68b8b29f810d5d11
http://purl.tuc.gr/dl/dias/6028C335-384A-4EC0-BC17-98F69362B3B9
http://purl.tuc.gr/dl/dias/6028C335-384A-4EC0-BC17-98F69362B3B9
Summarization: In the field of sequential decision making and reinforcement learning, it has been observed that good policies for most problems exhibit a significant amount of structure. In practice, this implies that when a learning agent discovers
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______4037::ac6961b1a173199408aaf4f0b61b177d
http://www.researchgate.net/profile/Emmanuel_Rachelson/publication/221186156_On_the_locality_of_action_domination_in_sequential_decision_making/links/0fcfd5051c4eaad94f000000.pdf
http://www.researchgate.net/profile/Emmanuel_Rachelson/publication/221186156_On_the_locality_of_action_domination_in_sequential_decision_making/links/0fcfd5051c4eaad94f000000.pdf
Μη διαθέσιμη περίληψη Not available summarization Παρουσιάστηκε στο: 1st Hellenic Robotics Conference
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______4037::d675a1a2ec2ab348bdddb035779af703
http://purl.tuc.gr/dl/dias/438DE77B-2536-4B5E-8F47-E5288E7E504F
http://purl.tuc.gr/dl/dias/438DE77B-2536-4B5E-8F47-E5288E7E504F
Autor:
Lagoudakis Michael, Parr, R.
Summarization: We present a new method for learning good strategies in zero-sum Markov games in which each side is composed of multiple agents collaborating against an opposing team of agents. Our method requires full observability and communication
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______4037::402e323e2f463bf23c0c6ef0cafebcd2
http://purl.tuc.gr/dl/dias/FBB6EA9E-B181-4D39-8F6C-4DDB3B0278DA
http://purl.tuc.gr/dl/dias/FBB6EA9E-B181-4D39-8F6C-4DDB3B0278DA
Autor:
Lagoudakis Michael, Parr Ronald
Δημοσίευση σε επιστημονικό περιοδικό Summarization: We propose a new approach to reinforcement learning for control problems which combines value-function approximation with linear architectures and approximate policy
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______4037::67d4be44b24f89568b104182fed8ce1f
http://purl.tuc.gr/dl/dias/23F40241-5991-47D2-A68C-3B6EB59A4567
http://purl.tuc.gr/dl/dias/23F40241-5991-47D2-A68C-3B6EB59A4567
Autor:
Lagoudakis Michael, Littman, M.
Summarization: The DPLL procedure is the most popular complete satisfiability (SAT) solver. While its worst case complexity is exponential, the actual running time is greatly affected by the ordering of branch variables during the search. Several bra
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______4037::ac32d9f4d1d5e6c8c4327f0f33999e07
http://purl.tuc.gr/dl/dias/85854029-9F0F-4C83-9CBA-F91FC42A21D0
http://purl.tuc.gr/dl/dias/85854029-9F0F-4C83-9CBA-F91FC42A21D0