Csaba Szepesvári:
Publications of Author
- Csaba Szepesvári
Shortest Path Discovery Problems: A Framework, Algorithms and Experimental Results. [Citation Graph (0, 0)][DBLP] AAAI, 2004, pp:550-555 [Conf]
- Levente Kocsis, Csaba Szepesvári, Mark H. M. Winands
RSPSA: Enhanced Parameter Optimization in Games. [Citation Graph (0, 0)][DBLP] ACG, 2006, pp:39-56 [Conf]
- András Antos, Csaba Szepesvári, Rémi Munos
Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path. [Citation Graph (0, 0)][DBLP] COLT, 2006, pp:574-588 [Conf]
- Csaba Szepesvári, András Kocsor, Kornél Kovács
Kernel Machine Based Feature Extraction Algorithms for Regression Problems. [Citation Graph (0, 0)][DBLP] ECAI, 2004, pp:1091-1092 [Conf]
- Péter Torma, Csaba Szepesvári
Enhancing Particle Filters Using Local Likelihood Sampling. [Citation Graph (0, 0)][DBLP] ECCV (1), 2004, pp:16-27 [Conf]
- Levente Kocsis, Csaba Szepesvári
Bandit Based Monte-Carlo Planning. [Citation Graph (0, 0)][DBLP] ECML, 2006, pp:282-293 [Conf]
- András Kocsor, Kornél Kovács, Csaba Szepesvári
Margin Maximizing Discriminant Analysis. [Citation Graph (0, 0)][DBLP] ECML, 2004, pp:227-238 [Conf]
- Csaba Szepesvári
Learning and Exploitation Do Not Conflict Under Minimax Optimality. [Citation Graph (0, 0)][DBLP] ECML, 1997, pp:242-249 [Conf]
- Zsolt Kalmár, Csaba Szepesvári, András Lörincz
Module Based Reinforcement Learning: An Application to a Real Robot. [Citation Graph (0, 0)][DBLP] EWLR, 1997, pp:29-45 [Conf]
- Csaba Szepesvári, András Lörincz
Inverse Dynamics Controllers for Robust Control: Consequences for Neurocontrollers. [Citation Graph (0, 0)][DBLP] ICANN, 1996, pp:791-796 [Conf]
- Zoltán Szamonek, Csaba Szepesvári
X-mHMM: An Efficient Algorithm for Training Mixtures of HMMs When the Number of Mixtures Is Unknown. [Citation Graph (0, 0)][DBLP] ICDM, 2005, pp:434-441 [Conf]
- Zoltán Gábor, Zsolt Kalmár, Csaba Szepesvári
Multi-criteria Reinforcement Learning. [Citation Graph (0, 0)][DBLP] ICML, 1998, pp:197-205 [Conf]
- Michael L. Littman, Csaba Szepesvári
A Generalized Reinforcement-Learning Model: Convergence and Applications. [Citation Graph (0, 0)][DBLP] ICML, 1996, pp:310-318 [Conf]
- Csaba Szepesvári, Rémi Munos
Finite time bounds for sampling based fitted value iteration. [Citation Graph (0, 0)][DBLP] ICML, 2005, pp:880-887 [Conf]
- Csaba Szepesvári, William D. Smart
Interpolation-based Q-learning. [Citation Graph (0, 0)][DBLP] ICML, 2004, pp:- [Conf]
- András György, Levente Kocsis, Ivett Szabó, Csaba Szepesvári
Continuous Time Associative Bandit Problems. [Citation Graph (0, 0)][DBLP] IJCAI, 2007, pp:830-835 [Conf]
- István Bíró, Zoltán Szamonek, Csaba Szepesvári
Sequence Prediction Exploiting Similarity Information. [Citation Graph (0, 0)][DBLP] IJCAI, 2007, pp:1576-1581 [Conf]
- Csaba Szepesvári
The Asymptotic Convergence-Rate of Q-learning. [Citation Graph (0, 0)][DBLP] NIPS, 1997, pp:- [Conf]
- György Balogh, Ervin Dobler, Tamás Gröbler, Béla Smodics, Csaba Szepesvári
FlexVoice: A Parametric Approach to High-Quality Speech Synthesis. [Citation Graph (0, 0)][DBLP] TSD, 2000, pp:189-194 [Conf]
- Zsolt Kalmár, Csaba Szepesvári, András Lörincz
Modular Reinforcement Learning: A Case Study in a Robot Domain. [Citation Graph (0, 0)][DBLP] Acta Cybern., 2000, v:14, n:3, pp:507-522 [Journal]
- Csaba Szepesvári
Non-Markovian Policies in Sequential Decision Problems. [Citation Graph (0, 0)][DBLP] Acta Cybern., 1998, v:13, n:3, pp:305-318 [Journal]
- Csaba Szepesvári
Efficient approximate planning in continuous space Markovian Decision Problems. [Citation Graph (0, 0)][DBLP] AI Commun., 2001, v:14, n:3, pp:163-176 [Journal]
- Zsolt Kalmár, Csaba Szepesvári, András Lörincz
Module-Based Reinforcement Learning: Experiments with a Real Robot. [Citation Graph (0, 0)][DBLP] Auton. Robots, 1998, v:5, n:3-4, pp:273-295 [Journal]
- Tibor Fomin, Tamás Rozgonyi, Csaba Szepesvári, András Lörincz
Self-Organizing Multi-Resolution Grid for Motion Planning and Control. [Citation Graph (0, 0)][DBLP] Int. J. Neural Syst., 1996, v:7, n:6, pp:757-0 [Journal]
- András Lörincz, György Hévízi, Csaba Szepesvári
Ockham's Razor Modeling of the Matrisome Channels of the Basal Ganglia Thalamocortical Loops. [Citation Graph (0, 0)][DBLP] Int. J. Neural Syst., 2001, v:11, n:2, pp:125-143 [Journal]
- Csaba Szepesvári, András Lörincz
Approximate geometry representations and sensory fusion. [Citation Graph (0, 0)][DBLP] Neurocomputing, 1996, v:12, n:2-3, pp:267-287 [Journal]
- Zsolt Kalmár, Csaba Szepesvári, András Lörincz
Module-Based Reinforcement Learning: Experiments with a Real Robot. [Citation Graph (0, 0)][DBLP] Machine Learning, 1998, v:31, n:1-3, pp:55-85 [Journal]
- Levente Kocsis, Csaba Szepesvári
Universal parameter optimisation in games based on SPSA. [Citation Graph (0, 0)][DBLP] Machine Learning, 2006, v:63, n:3, pp:249-286 [Journal]
- Satinder P. Singh, Tommi Jaakkola, Michael L. Littman, Csaba Szepesvári
Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms. [Citation Graph (0, 0)][DBLP] Machine Learning, 2000, v:38, n:3, pp:287-308 [Journal]
- János Murvai, Kristian Vlahovicek, Endre Barta, Csaba Szepesvári, Cristina Acatrinei, Sándor Pongor
The SBASE protein domain library, release 6.0: a collection of annotated protein sequence segments. [Citation Graph (0, 0)][DBLP] Nucleic Acids Research, 1999, v:27, n:1, pp:257-259 [Journal]
- Csaba Szepesvári, Michael L. Littman
A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms. [Citation Graph (0, 0)][DBLP] Neural Computation, 1999, v:11, n:8, pp:2017-2060 [Journal]
- Zsolt Kalmár, Zsolt Marczell, Csaba Szepesvári, András Lörincz
Parallel and robust skeletonization built on self-organizing elements. [Citation Graph (0, 0)][DBLP] Neural Networks, 1999, v:12, n:1, pp:163-173 [Journal]
- Csaba Szepesvári, Szabolcs Cimmer, András Lörincz
Neurocontroller using dynamic state feedback for compensatory control. [Citation Graph (0, 0)][DBLP] Neural Networks, 1997, v:10, n:9, pp:1691-1708 [Journal]
- Jean-Yves Audibert, Rémi Munos, Csaba Szepesvári
Tuning Bandit Algorithms in Stochastic Environments. [Citation Graph (0, 0)][DBLP] ALT, 2007, pp:150-165 [Conf]
- Peter Auer, Ronald Ortner, Csaba Szepesvári
Improved Rates for the Stochastic Continuum-Armed Bandit Problem. [Citation Graph (0, 0)][DBLP] COLT, 2007, pp:454-468 [Conf]
- Amir Massoud Farahmand, Csaba Szepesvári, Jean-Yves Audibert
Manifold-adaptive dimension estimation. [Citation Graph (0, 0)][DBLP] ICML, 2007, pp:265-272 [Conf]
Toward a Classification of Finite Partial-Monitoring Games. [Citation Graph (, )][DBLP]
Active Learning in Multi-armed Bandits. [Citation Graph (, )][DBLP]
Active Learning of Group-Structured Environments. [Citation Graph (, )][DBLP]
Empirical Bernstein stopping. [Citation Graph (, )][DBLP]
Fast gradient-descent methods for temporal-difference learning with linear function approximation. [Citation Graph (, )][DBLP]
Workshop summary: On-line learning with limited feedback. [Citation Graph (, )][DBLP]
Learning when to stop thinking and do something! [Citation Graph (, )][DBLP]
Learning to segment from a few well-selected training images. [Citation Graph (, )][DBLP]
Model-based reinforcement learning with nearly tight exploration complexity bounds. [Citation Graph (, )][DBLP]
Toward Off-Policy Learning Control with Function Approximation. [Citation Graph (, )][DBLP]
Budgeted Distribution Learning of Belief Net Parameters. [Citation Graph (, )][DBLP]
Model-based and model-free reinforcement learning for visual servoing. [Citation Graph (, )][DBLP]
Fitted Q-iteration in continuous action-space MDPs. [Citation Graph (, )][DBLP]
Regularized Policy Iteration. [Citation Graph (, )][DBLP]
Online Optimization in X-Armed Bandits. [Citation Graph (, )][DBLP]
A Convergent O(n) Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation. [Citation Graph (, )][DBLP]
Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstraction. [Citation Graph (, )][DBLP]
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping. [Citation Graph (, )][DBLP]
Regularized Fitted Q-Iteration: Application to Planning. [Citation Graph (, )][DBLP]
LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS. [Citation Graph (, )][DBLP]
X-Armed Bandits [Citation Graph (, )][DBLP]
Estimation of Rényi Entropy and Mutual Information Based on Generalized Nearest-Neighbor Graphs [Citation Graph (, )][DBLP]