Nothing Special   »   [go: up one dir, main page]

skip to main content
Skip header Section
Learning automata: an introductionJanuary 1989
Publisher:
  • Prentice-Hall, Inc.
  • Division of Simon and Schuster One Lake Street Upper Saddle River, NJ
  • United States
ISBN:978-0-13-485558-5
Published:02 January 1989
Pages:
476
Skip Bibliometrics Section
Reflects downloads up to 12 Feb 2025Bibliometrics
Abstract

No abstract available.

Cited By

  1. Gheisari S and ShokrZadeh H (2024). LATA: learning automata-based task assignment on heterogeneous cloud computing platform, The Journal of Supercomputing, 80:16, (24106-24137), Online publication date: 1-Nov-2024.
  2. Gąsior J Enhancing Lifetime Coverage in Wireless Sensor Networks: A Learning Automata Approach Computational Science – ICCS 2024, (255-262)
  3. Seredyński F, Szaban M, Skaruz J, Świtalski P and Seredyński M Solving Coverage Problem by Self-organizing Wireless Sensor Networks: (,h)-Learning Automata Collective Behavior Approach Computational Science – ICCS 2024, (408-422)
  4. Gąsior J Learning Automata Strategies for Prolonging Lifetime of Wireless Sensor Networks Advances in Practical Applications of Agents, Multi-Agent Systems, and Digital Twins: The PAAMS Collection, (109-120)
  5. Jiao L, Zhang X, Granmo O and Abeyrathna K (2023). On the Convergence of Tsetlin Machines for the XOR Operator, IEEE Transactions on Pattern Analysis and Machine Intelligence, 45:5, (6072-6085), Online publication date: 1-May-2023.
  6. Zhang X, Jiao L, Granmo O and Goodwin M (2022). On the Convergence of Tsetlin Machines for the IDENTITY- and NOT Operators, IEEE Transactions on Pattern Analysis and Machine Intelligence, 44:10_Part_1, (6345-6359), Online publication date: 1-Oct-2022.
  7. ACM
    Ni Y, Issa M, Abraham D, Imani M, Yin X and Imani M HDPG Proceedings of the 59th ACM/IEEE Design Automation Conference, (1141-1146)
  8. Karthikeyan A, Prakasam P, Karthik S, Ajayan J and Sai Gokul S (2021). Automata Theory-based Energy Efficient Area Algorithm for an Optimal Solution in Wireless Sensor Networks, Wireless Personal Communications: An International Journal, 120:2, (1125-1143), Online publication date: 1-Sep-2021.
  9. Jardine P and Givigi S (2021). Improving Control Performance of Unmanned Aerial Vehicles through Shared Experience, Journal of Intelligent and Robotic Systems, 102:3, Online publication date: 1-Jul-2021.
  10. Chamkoori A, Katebi S and Alazab M (2021). Robustness of the Storage in Cloud Data Centers Based on Simple Swarm Optimization Algorithm, Security and Communication Networks, 2021, Online publication date: 1-Jan-2021.
  11. Yazidi A, Bouhmala N and Goodwin M (2020). A team of pursuit learning automata for solving deterministic optimization problems, Applied Intelligence, 50:9, (2916-2931), Online publication date: 1-Sep-2020.
  12. Kostas J, Nota C and Thomas P Asynchronous coagent networks Proceedings of the 37th International Conference on Machine Learning, (5426-5435)
  13. Beigy H and Meybodi M (2019). An iterative stochastic algorithm based on distributed learning automata for finding the stochastic shortest path in stochastic graphs, The Journal of Supercomputing, 76:7, (5540-5562), Online publication date: 1-Jul-2020.
  14. Goodwin M and Yazidi A (2020). Distributed learning automata-based scheme for classification using novel pursuit scheme, Applied Intelligence, 50:7, (2222-2238), Online publication date: 1-Jul-2020.
  15. Rahmani P and Haj Seyyed Javadi H (2019). Topology Control in MANETs Using the Bayesian Pursuit Algorithm, Wireless Personal Communications: An International Journal, 106:3, (1089-1116), Online publication date: 1-Jun-2019.
  16. Barto A (2019). Reinforcement Learning, AI Magazine, 40:1, (3-15), Online publication date: 1-Mar-2019.
  17. Jardine P, Kogan M, Givigi S and Yousefi S (2018). Adaptive predictive control of a differential drive robot tuned with reinforcement learning, International Journal of Adaptive Control and Signal Processing, 33:2, (410-423), Online publication date: 3-Feb-2019.
  18. Chasparis G and Rossbory M (2019). Efficient Dynamic Pinning of Parallelized Applications by Distributed Reinforcement Learning, International Journal of Parallel Programming, 47:1, (24-38), Online publication date: 1-Feb-2019.
  19. Khani M, Ahmadi A and Hajary H (2019). Distributed task allocation in multi-agent environments using cellular learning automata, Soft Computing - A Fusion of Foundations, Methodologies and Applications, 23:4, (1199-1218), Online publication date: 1-Feb-2019.
  20. Mostafaei H, Menth M and Obaidat M A Learning Automaton-Based Controller Placement Algorithm for Software-Defined Networks 2018 IEEE Global Communications Conference (GLOBECOM), (1-6)
  21. ACM
    Sahoo K, Tiwary M, Sahoo S, Nambiar R, Sahoo B and Dash R A Learning Automata-based DDoS Attack Defense Mechanism in Software Defined Networks Proceedings of the 24th Annual International Conference on Mobile Computing and Networking, (795-797)
  22. ACM
    Bennouri H, Yazidi A and Berqia A A pursuit learning solution to underwater communications with limited mobility agents Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems, (112-117)
  23. Rezapoor Mirsaleh M and Meybodi M (2018). Assignment of cells to switches in cellular mobile network, Applied Intelligence, 48:10, (3231-3247), Online publication date: 1-Oct-2018.
  24. Barros Dos Santos S, Givigi S, Nascimento C, Fernandes J, Buonocore L and De Almeida Neto A (2018). Iterative Decentralized Planning for Collective Construction Tasks with Quadrotors, Journal of Intelligent and Robotic Systems, 90:1-2, (217-234), Online publication date: 1-May-2018.
  25. Fakhrmoosavy S, Setayeshi S and Sharifi A (2018). A modified brain emotional learning model for earthquake magnitude and fear prediction, Engineering with Computers, 34:2, (261-276), Online publication date: 1-Apr-2018.
  26. Smith T (2018). Prediction of Infinite Words with Automata, Theory of Computing Systems, 62:3, (653-681), Online publication date: 1-Apr-2018.
  27. Daliri Khomami M, Rezvanian A, Bagherpour N and Meybodi M (2018). Minimum positive influence dominating set and its application in influence maximization, Applied Intelligence, 48:3, (570-593), Online publication date: 1-Mar-2018.
  28. Saghiri A and Meybodi M (2018). An adaptive super-peer selection algorithm considering peers capacity utilizing asynchronous dynamic cellular learning automata, Applied Intelligence, 48:2, (271-299), Online publication date: 1-Feb-2018.
  29. Saghiri A and Meybodi M (2018). Open asynchronous dynamic cellular learning automata and its application to allocation hub location problem, Knowledge-Based Systems, 139:C, (149-169), Online publication date: 1-Jan-2018.
  30. Toffolo T, Christiaens J, Van Malderen S, Wauters T and Vanden Berghe G (2018). Stochastic local search with learning automaton for the swap-body vehicle routing problem, Computers and Operations Research, 89:C, (68-81), Online publication date: 1-Jan-2018.
  31. Yazidi A and Herrera-Viedma E (2017). A new methodology for identifying unreliable sensors in data fusion, Knowledge-Based Systems, 136:C, (85-96), Online publication date: 15-Nov-2017.
  32. Phadikar S, Ghosh P, Bardhan M and Chowdhury N (2017). IDS Using Reinforcement Learning Automata for Preserving Security in Cloud Environment, International Journal of Information System Modeling and Design, 8:4, (21-37), Online publication date: 1-Oct-2017.
  33. Farhoudi Z, Setayeshi S and Rabiee A (2017). Using learning automata in brain emotional learning for speech emotion recognition, International Journal of Speech Technology, 20:3, (553-562), Online publication date: 1-Sep-2017.
  34. Saghiri A and Meybodi M (2017). A closed asynchronous dynamic model of cellular learning automata and its application to peer-to-peer networks, Genetic Programming and Evolvable Machines, 18:3, (313-349), Online publication date: 1-Sep-2017.
  35. Fathipour Deiman S, Saghiri A and Meybodi M (2017). A Delay Aware Super-Peer Selection Algorithm for Gradient Topology Utilizing Learning Automata, Wireless Personal Communications: An International Journal, 95:3, (2611-2624), Online publication date: 1-Aug-2017.
  36. Zhang X, Oommen B and Granmo O (2017). The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ź-optimality, Pattern Analysis & Applications, 20:3, (797-808), Online publication date: 1-Aug-2017.
  37. Rezvanian A and Meybodi M (2017). Sampling algorithms for stochastic graphs, Knowledge-Based Systems, 127:C, (126-144), Online publication date: 1-Jul-2017.
  38. Cota L, Guimarães F, de Oliveira F and Souza M An Adaptive Large Neighborhood Search with Learning Automata for the Unrelated Parallel Machine Scheduling Problem 2017 IEEE Congress on Evolutionary Computation (CEC), (185-192)
  39. Jamali S and Jafarzadeh P (2017). An intelligent intrusion detection system by using hierarchically structured learning automata, Neural Computing and Applications, 28:5, (1001-1008), Online publication date: 1-May-2017.
  40. Vahidipour S, Meybodi M and Esnaashari M (2017). Adaptive Petri net based on irregular cellular learning automata with an application to vertex coloring problem, Applied Intelligence, 46:2, (272-284), Online publication date: 1-Mar-2017.
  41. Mostafaei H, Montieri A, Persico V and Pescap A (2017). A sleep scheduling approach based on learning automata for WSN partialcoverage, Journal of Network and Computer Applications, 80:C, (67-78), Online publication date: 15-Feb-2017.
  42. Yazidi A, Oommen B, Horn G and Granmo O (2016). Stochastic discretized learning-based weak estimation, Pattern Recognition, 60:C, (430-443), Online publication date: 1-Dec-2016.
  43. Rezapoor Mirsaleh M and Reza Meybodi M (2016). A Michigan memetic algorithm for solving the community detection problem in complex network, Neurocomputing, 214:C, (535-545), Online publication date: 19-Nov-2016.
  44. Rezvanian A and Meybodi M (2016). Stochastic graph as a model for social networks, Computers in Human Behavior, 64:C, (621-640), Online publication date: 1-Nov-2016.
  45. Yazidi A and Sandnes F Data fusion without knowledge of the ground truth using Tseltin-like Automata 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC), (004501-004506)
  46. Wang W, Kwasinski A, Niyato D and Han Z (2016). A Survey on Applications of Model-Free Strategy Learning in Cognitive Wireless Networks, IEEE Communications Surveys & Tutorials, 18:3, (1717-1757), Online publication date: 1-Jul-2016.
  47. Saghiri A and Meybodi M (2016). An approach for designing cognitive engines in cognitive peer-to-peer networks, Journal of Network and Computer Applications, 70:C, (17-40), Online publication date: 1-Jul-2016.
  48. Akbar R, Safaei F and Modallalkar S (2016). A novel power efficient adaptive RED-based flow control mechanism for networks-on-chip, Computers and Electrical Engineering, 51:C, (121-138), Online publication date: 1-Apr-2016.
  49. Goodwin M and Yazidi A (2016). A pattern recognition approach for peak prediction of electrical consumption, Integrated Computer-Aided Engineering, 23:2, (101-113), Online publication date: 1-Jan-2016.
  50. ACM
    Mostafaei H, Chowdhury M, Islam R and Gholizadeh H Connected P-Percent Coverage in Wireless Sensor Networks based on Degree Constraint Dominating Set Approach Proceedings of the 18th ACM International Conference on Modeling, Analysis and Simulation of Wireless and Mobile Systems, (157-160)
  51. Asemani M and Esnaashari M (2015). Learning automata based energy efficient data aggregation in wireless sensor networks, Wireless Networks, 21:6, (2035-2053), Online publication date: 1-Aug-2015.
  52. Chasparis G, Shamma J and Rantzer A (2015). Nonconvergence to saddle boundary points under perturbed reinforcement learning, International Journal of Game Theory, 44:3, (667-699), Online publication date: 1-Aug-2015.
  53. Ghasemi M, Abdolahi M, Bag-Mohammadi M and Bohlooli A (2015). Adaptive multi-flow opportunistic routing using learning automata, Ad Hoc Networks, 25:PB, (472-479), Online publication date: 1-Feb-2015.
  54. Mostafaei H (2015). Stochastic barrier coverage in wireless sensor networks based on distributed learning automata, Computer Communications, 55:C, (51-61), Online publication date: 1-Jan-2015.
  55. Mollakhalili Meybodi M and Meybodi M (2014). Extended distributed learning automata, Applied Intelligence, 41:3, (923-940), Online publication date: 1-Oct-2014.
  56. Zhang X, Granmo O, Oommen B and Jiao L (2014). A formal proof of the ε-optimality of absorbing continuous pursuit algorithms using the theory of regular functions, Applied Intelligence, 41:3, (974-985), Online publication date: 1-Oct-2014.
  57. Mostafaei H and Meybodi M (2014). An Energy Efficient Barrier Coverage Algorithm for Wireless Sensor Networks, Wireless Personal Communications: An International Journal, 77:3, (2099-2115), Online publication date: 1-Aug-2014.
  58. Jiao L, Zhang X, Granmo O and Oommen B A Bayesian Learning Automata-Based Distributed Channel Selection Scheme for Cognitive Radio Networks Proceedings, Part II, of the 27th International Conference on Modern Advances in Applied Intelligence - Volume 8482, (48-57)
  59. Zhang X, Oommen B, Granmo O and Jiao L Using the Theory of Regular Functions to Formally Prove the ε-Optimality of Discretized Pursuit Learning Algorithms Proceedings, Part I, of the 27th International Conference on Modern Advances in Applied Intelligence - Volume 8481, (379-388)
  60. ACM
    Peleteiro A, Burguillo J, Arcos J and Rodriguez-Aguilar J (2014). Fostering Cooperation through Dynamic Coalition Formation and Partner Switching, ACM Transactions on Autonomous and Adaptive Systems, 9:1, (1-31), Online publication date: 1-Mar-2014.
  61. ACM
    Horn G A vision for a stochastic reasoner for autonomic cloud deployment Proceedings of the Second Nordic Symposium on Cloud Computing & Internet Technologies, (46-53)
  62. Koupaei J and Abdechiri M (2013). Sensor deployment for fault diagnosis using a new discrete optimization algorithm, Applied Soft Computing, 13:5, (2896-2905), Online publication date: 1-May-2013.
  63. Kehagias A and Kartsiotis G (2013). On the use of fuzzy logic and learning automata optimization to resolve the Liar and related paradoxes, Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology, 24:1, (111-120), Online publication date: 1-Jan-2013.
  64. Zhang X, Granmo O and Oommen B Discretized bayesian pursuit --- a new scheme for reinforcement learning Proceedings of the 25th international conference on Industrial Engineering and Other Applications of Applied Intelligent Systems: advanced research in applied artificial intelligence, (784-793)
  65. Yazidi A, Granmo O, Oommen B and Goodwin M A hierarchical learning scheme for solving the stochastic point location problem Proceedings of the 25th international conference on Industrial Engineering and Other Applications of Applied Intelligent Systems: advanced research in applied artificial intelligence, (774-783)
  66. de Lope J, Maravall D and Quiñonez Y Decentralized multi-tasks distribution in heterogeneous robot teams by means of ant colony optimization and learning automata Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part I, (103-114)
  67. Wauters T, Verbeeck K, Causmaecker P and Berghe G Fast Permutation Learning Revised Selected Papers of the 6th International Conference on Learning and Intelligent Optimization - Volume 7219, (292-306)
  68. Gosavi A and Purohit M Stochastic policy search for variance-penalized semi-Markov control Proceedings of the Winter Simulation Conference, (2865-2876)
  69. Tilak O and Mukhopadhyay S (2011). Partially decentralized reinforcement learning in finite, multi-agent Markov decision processes, AI Communications, 24:4, (293-309), Online publication date: 1-Dec-2011.
  70. Quiñonez Y, Maravall D and de Lope J Stochastic learning automata for self-coordination in heterogeneous multi-tasks selection in multi-robot systems Proceedings of the 10th Mexican international conference on Advances in Artificial Intelligence - Volume Part I, (443-453)
  71. Nicopolitidis P, Christidis K, Papadimitriou G, Sarigiannidis P and Pomportsis A (2011). Performance evaluation of acoustic underwater data broadcasting exploiting the bandwidth-distance relationship, Mobile Information Systems, 7:4, (285-298), Online publication date: 1-Oct-2011.
  72. Haugland V, Kjølleberg M, Larsen S and Granmo O A two-armed bandit collective for examplar based mining of frequent itemsets with applications to intrusion detection Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part I, (72-81)
  73. ACM
    Rachuri K, Mascolo C, Musolesi M and Rentfrow P SociableSense Proceedings of the 17th annual international conference on Mobile computing and networking, (73-84)
  74. Akbari Torkestani J and Meybodi M (2011). A cellular learning automata-based algorithm for solving the vertex coloring problem, Expert Systems with Applications: An International Journal, 38:8, (9237-9247), Online publication date: 1-Aug-2011.
  75. ACM
    Zhan J, Oommen B and Crisostomo J (2011). Anomaly Detection in Dynamic Systems Using Weak Estimators, ACM Transactions on Internet Technology, 11:1, (1-16), Online publication date: 1-Jul-2011.
  76. Granmo O and Glimsdal S A two-armed bandit based scheme for accelerated decentralized learning Proceedings of the 24th international conference on Industrial engineering and other applications of applied intelligent systems conference on Modern approaches in applied intelligence - Volume Part II, (532-541)
  77. Zhang X, Granmo O and Oommen B The Bayesian pursuit algorithm Proceedings of the 24th international conference on Industrial engineering and other applications of applied intelligent systems conference on Modern approaches in applied intelligence - Volume Part II, (522-531)
  78. Farzaneh N and Yaghmaee M Joint active queue management and congestion control protocol for healthcare applications in wireless body sensor networks Proceedings of the 9th international conference on Toward useful services for elderly and people with disabilities: smart homes and health telematics, (88-95)
  79. Akbari Torkestani J and Meybodi M (2011). A Learning Automata-Based Cognitive Radio for Clustered Wireless Ad-Hoc Networks, Journal of Network and Systems Management, 19:2, (278-297), Online publication date: 1-Jun-2011.
  80. Verkhogliad P and Oommen B Using artificial intelligence techniques for strategy generation in the commons game Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part I, (43-50)
  81. Mériaux F, Lasaulce S and Kieffer M More about base station location games Proceedings of the 5th International ICST Conference on Performance Evaluation Methodologies and Tools, (372-380)
  82. Catteeuw D and Manderick B Heterogeneous populations of learning agents in the minority game Proceedings of the 11th international conference on Adaptive and Learning Agents, (100-113)
  83. Simian D and Stoica F Generic reinforcement schemes and their optimization Proceedings of the 5th European conference on European computing conference, (332-337)
  84. Kakali V, Sarigiannidis P, Papadimitriou G and Pomportsis A (2011). A Novel Adaptive Framework for Wireless Push Systems Based on Distributed Learning Automata, Wireless Personal Communications: An International Journal, 57:4, (591-606), Online publication date: 1-Apr-2011.
  85. Klos T, Van Ahee G and Tuyls K Evolutionary dynamics of regret minimization Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II, (82-96)
  86. Klos T, van Ahee G and Tuyls K Evolutionary dynamics of regret minimization Proceedings of the 2010th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II, (82-96)
  87. Yazidi A, Granmo O, Lin M, Wen X, Oommen B, Gerdes M and Reichert F Learning automaton based on-line discovery and tracking of spatio-temporal event patterns Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence, (327-338)
  88. Calitoiu D and Milici D Modeling with non-cooperative agents Proceedings of the 2010 Summer Computer Simulation Conference, (102-109)
  89. Yazidi A, Granmo O and Oommen B A learning automata based solution to service selection in stochastic environments Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part III, (209-218)
  90. Granmo O and Berg S Solving non-stationary bandit problems by random sampling from sibling Kalman filters Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part III, (199-208)
  91. Simian D and Stoica F A new nonlinear reinforcement scheme for stochastic learning automata Proceedings of the 12th WSEAS international conference on Automatic control, modelling & simulation, (450-454)
  92. Vrancx P, Verbeeck K and Nowé A (2010). Analyzing the dynamics of stigmergetic interactions through pheromone games, Theoretical Computer Science, 411:21, (2116-2126), Online publication date: 1-May-2010.
  93. Oommen B and Hashem M (2010). Modeling a student's behavior in a tutorial-like system using learning automata, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 40:2, (481-492), Online publication date: 1-Apr-2010.
  94. Akbari Torkestani J and Meybodi M (2010). Clustering the wireless Ad Hoc networks, Journal of Parallel and Distributed Computing, 70:4, (394-405), Online publication date: 1-Apr-2010.
  95. Akbari Torkestani J and Meybodi M (2010). Mobility-based multicast routing algorithm for wireless mobile Ad-hoc networks, Computer Communications, 33:6, (721-735), Online publication date: 1-Apr-2010.
  96. Ghader H, KeyKhosravi D and HosseinAliPour A DAG scheduling on heterogeneous distributed systems using learning automata Proceedings of the Second international conference on Intelligent information and database systems: Part II, (247-257)
  97. Forsati R and Meybodi M (2010). Effective page recommendation algorithms based on distributed learning automata and weighted association rules, Expert Systems with Applications: An International Journal, 37:2, (1316-1330), Online publication date: 1-Mar-2010.
  98. Horn G and Oommen B (2010). Solving multiconstraint assignment problems using learning automata, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 40:1, (6-18), Online publication date: 1-Feb-2010.
  99. Oommen B and Hashem M (2010). Modeling a student-classroom interaction in a tutorial-like system using learning automata, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 40:1, (29-42), Online publication date: 1-Feb-2010.
  100. Misra S, Oommen B, Yanamandra S and Obaidat M (2010). Random early detection for congestion avoidance in wired networks, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 40:1, (66-76), Online publication date: 1-Feb-2010.
  101. Sastry P, Nagendra G and Manwani N (2010). A team of continuous-action learning automata for noise-tolerant learning of half-spaces, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 40:1, (19-28), Online publication date: 1-Feb-2010.
  102. Misra S, Krishna P and Abraham K Energy efficient learning solution for intrusion detection in wireless sensor networks Proceedings of the 2nd international conference on COMmunication systems and NETworks, (423-428)
  103. Oommen B and Hashem M (2010). Modeling a domain in a tutorial-like system using learning automata, Acta Cybernetica, 19:3, (635-653), Online publication date: 1-Jan-2010.
  104. ACM
    Anari Z, Meybodi M and Anari B Web page ranking based on fuzzy and learning automata Proceedings of the International Conference on Management of Emergent Digital EcoSystems, (162-166)
  105. Fayyoumi E and Oommen B (2009). Achieving microaggregation for secure statistical databases using fixed-structure partitioning-based learning automata, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 39:5, (1192-1205), Online publication date: 1-Oct-2009.
  106. Hennes D, Tuyls K and Rauterberg M State-coupled replicator dynamics Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2, (789-796)
  107. Burguillo-Rial J A memetic framework for describing and simulating spatial prisoner's dilemma with coalition formation Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1, (441-448)
  108. Misra S, Tiwari V and Obaidat M (2009). LACAS, IEEE Journal on Selected Areas in Communications, 27:4, (466-479), Online publication date: 1-May-2009.
  109. Šter B and Dobnikar A Scalability of learning impact on complex parameters in recurrent neural networks Proceedings of the 9th international conference on Adaptive and natural computing algorithms, (273-282)
  110. Lotrič U and Dobnikar A Parallel implementations of recurrent neural network learning Proceedings of the 9th international conference on Adaptive and natural computing algorithms, (99-108)
  111. Šter B and Dobnikar A Scalability of Learning Impact on Complex Parameters in Recurrent Neural Networks Proceedings of the 2009 conference on Adaptive and Natural Computing Algorithms - Volume 5495, (273-282)
  112. Lotrič U and Dobnikar A Parallel Implementations of Recurrent Neural Network Learning Proceedings of the 2009 conference on Adaptive and Natural Computing Algorithms - Volume 5495, (99-108)
  113. Pediaditaki S and Marina M A learning-based channel allocation protocol for multi-radio wireless mesh networks Proceedings of the 28th IEEE international conference on Computer Communications Workshops, (302-303)
  114. Gosavi A (2009). Reinforcement Learning, INFORMS Journal on Computing, 21:2, (178-192), Online publication date: 1-Apr-2009.
  115. Martins J, Dente J, Pires A and Vilela Mendes R (2009). From controlled dynamical systems to context-dependent grammars, Engineering Applications of Artificial Intelligence, 22:2, (192-200), Online publication date: 1-Mar-2009.
  116. Ramana B, Pavan K and Murthy C A novel learning based solution for efficient data transport in heterogeneous wireless networks Proceedings of the 15th international conference on High performance computing, (402-414)
  117. Hennes D, Tuyls K and Rauterberg M Formalizing Multi-state Learning Dynamics Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02, (266-272)
  118. Soundararajan G and Amza C Towards end-to-end quality of service Proceedings of the 9th ACM/IFIP/USENIX International Conference on Middleware, (287-305)
  119. ACM
    Abin A, Fotouhi M and Kasaei S Skin segmentation based on cellular learning automata Proceedings of the 6th International Conference on Advances in Mobile Computing and Multimedia, (254-259)
  120. Stoica F and Simian D Automatic control based on wasp behavioral model and stochastic learning automata Proceedings of the 10th WSEAS international conference on Mathematical methods, computational techniques and intelligent systems, (289-294)
  121. Peeters M, Könönen V, Verbeeck K, Segbroeck S and Nowé A Coordinated Exploration in Conflicting Multi-stage Games Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II, (391-402)
  122. Peeters M, Könönen V, Verbeeck K and Nowé A A Learning Automata Approach to Multi-agent Policy Gradient Learning Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II, (379-390)
  123. Stoica F and Popa E A new evolutionary reinforcement scheme for stochastic learning automata Proceedings of the 12th WSEAS international conference on Computers, (268-273)
  124. Kwon I, Kim C, Jun J and Lee J (2008). Case-based myopic reinforcement learning for satisfying target service level in supply chain, Expert Systems with Applications: An International Journal, 35:1-2, (389-397), Online publication date: 1-Jul-2008.
  125. Granmo O and Oommen B A Hierarchy of Twofold Resource Allocation Automata Supporting Optimal Web Polling Proceedings of the 21st international conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: New Frontiers in Applied Artificial Intelligence, (347-358)
  126. de Jong S, Tuyls K and Verbeeck K Artificial agents learning human fairness Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2, (863-870)
  127. Oommen B and Calitoiu D Modeling and simulating a disease outbreak by learning a contagion parameter-based model Proceedings of the 2008 Spring simulation multiconference, (547-555)
  128. Bouhmala N and Granmo O Solving graph coloring problems using learning automata Proceedings of the 8th European conference on Evolutionary computation in combinatorial optimization, (277-288)
  129. Granmo O and Oommen B On using a hierarchy of twofold resource allocation automata to solve stochastic nonlinear resource allocation problems Proceedings of the 20th Australian joint conference on Advances in artificial intelligence, (36-47)
  130. Lin T and Giles C (2007). Group-Linking Method, IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, E90-A:12, (2916-2929), Online publication date: 1-Dec-2007.
  131. Herik H, Hennes D, Kaisers M, Tuyls K and Verbeeck K Multi-agent Learning Dynamics Proceedings of the 11th international workshop on Cooperative Information Agents XI, (36-56)
  132. Vrancx P, Verbeeck K and Nowé A Optimal convergence in multi-agent MDPs Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III, (107-114)
  133. Baba N and Mogami Y A consideration on the learning performances of the hierarchical structure learning automata (HSLA) operating in the general nonstationary multiteacher environment Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III, (82-90)
  134. Stoica F and Popa E Application of stochastic learning automata to intelligent vehicle control Proceedings of the 11th WSEAS International Conference on Computers, (241-246)
  135. Oommen B, Misra S and Granmo O (2007). Routing Bandwidth-Guaranteed Paths in MPLS Traffic Engineering, IEEE Transactions on Computers, 56:7, (959-976), Online publication date: 1-Jul-2007.
  136. Oommen B, Kim S, Samuel M and Granmo O Stochastic point location in non-stationary environments and its applications Proceedings of the 20th international conference on Industrial, engineering, and other applications of applied intelligent systems, (845-854)
  137. Hashem K and Oommen B On using learning automata to model a student's behavior in a tutorial-like system Proceedings of the 20th international conference on Industrial, engineering, and other applications of applied intelligent systems, (813-822)
  138. Heidari F, Mannor S and Mason L Reinforcement learning-based load shared sequential routing Proceedings of the 6th international IFIP-TC6 conference on Ad Hoc and sensor networks, wireless networks, next generation internet, (832-843)
  139. Xing Y, Mathur C, Haleem M, Chandramouli R and Subbalakshmi K (2007). Dynamic Spectrum Access with QoS and Interference Temperature Constraints, IEEE Transactions on Mobile Computing, 6:4, (423-433), Online publication date: 1-Apr-2007.
  140. Musunoori S and Horn G Co-ordination in Intelligent Ant-Based Application Service Mapping in Grid Environments Proceedings of the 2007 IEEE Swarm Intelligence Symposium, (303-309)
  141. Oommen B, Granmo O and Pedersen A Using Stochastic AI Techniques to Achieve Unbounded Resolution in Finite Player Goore Games and its Applications Proceedings of the 2007 IEEE Symposium on Computational Intelligence and Games, (161-167)
  142. Nicopolitidis P, Papadimitriou G, Obaidat M and Pomportsis A (2007). A new high rate adaptive wireless data dissemination scheme, Computer Communications, 30:5, (957-964), Online publication date: 1-Mar-2007.
  143. Oommen B, Granmo O and Pedersen A Empirical verification of a strategy for unbounded resolution in finite player goore games Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence, (1252-1258)
  144. Nicopolitidis P, Papadimitriou G, Obaidat M and Pomportsis A (2006). Performance optimization of an adaptive wireless push system in environments with locality of demand, Computer Communications, 29:13-14, (2542-2549), Online publication date: 1-Aug-2006.
  145. Tuffin B and Maillé P How many parallel TCP sessions to open Proceedings of the 5th international conference on Internet Charging and QoS Technologies: performability has its Price, (2-12)
  146. Misra S and Oommen B (2006). An Efficient Dynamic Algorithm for Maintaining All-Pairs Shortest Paths in Stochastic Networks, IEEE Transactions on Computers, 55:6, (686-702), Online publication date: 1-Jun-2006.
  147. Amer A and Oommen B Lists on lists Proceedings of the 5th international conference on Experimental Algorithms, (109-120)
  148. Vrancx P, Verbeeck K and Nowé A Analyzing stigmergetic algorithms through automata games Proceedings of the 1st international conference on Knowledge discovery and emergent complexity in bioinformatics, (145-156)
  149. Westra R, Tuyls K, Saeys Y and Nowé A Knowledge discovery and emergent complexity in bioinformatics Proceedings of the 1st international conference on Knowledge discovery and emergent complexity in bioinformatics, (1-9)
  150. Oommen B and Rueda L (2006). Stochastic learning-based weak estimation of multinomial random variables and its applications to pattern recognition in non-stationary environments, Pattern Recognition, 39:3, (328-341), Online publication date: 1-Mar-2006.
  151. Lukac K, Lukac Z and Tkalic M Benefits of wavelength conversion in optical WDM mesh networks with dynamic routing Proceedings of the 24th IASTED international conference on Parallel and distributed computing and networks, (117-122)
  152. Ramana B and Murthy C Learning-TCP Proceedings of the 12th international conference on High Performance Computing, (454-464)
  153. Mogami Y New learning algorithm for hierarchical structure learning automata operating in p-model stationary random environment Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I, (115-120)
  154. Ozden M A new optimization heuristic for continuous and integer decisions with constraints in simulation Proceedings of the 37th conference on Winter simulation, (853-856)
  155. Oommen B and Rueda L On utilizing stochastic learning weak estimators for training and classification of patterns with non-stationary distributions Proceedings of the 28th annual German conference on Advances in Artificial Intelligence, (107-120)
  156. ACM
    Verbeeck K, Nowé A and Tuyls K Coordinated exploration in multi-agent reinforcement learning Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems, (1105-1106)
  157. Nowé A, Verbeeck K and Peeters M Learning automata as a basis for multi agent reinforcement learning Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems, (71-85)
  158. Ghanbari S and Meybodi M Learning automata based algorithms for mapping of a class of independent tasks over highly heterogeneous grids Proceedings of the 2005 European conference on Advances in Grid Computing, (681-690)
  159. Vrancx P, Verbeeck K and Nowé A Networks of learning automata and limiting games Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning, (224-238)
  160. Peeters M, Verbeeck K and Nowé A Solving multi-stage games with hierarchical learning automata that bootstrap Proceedings of the 5th , 6th and 7th European conference on Adaptive and learning agents and multi-agent systems: adaptation and multi-agent learning, (169-187)
  161. Rueda L and Oommen B On families of new adaptive compression algorithms suitable for time-varying source data Proceedings of the Third international conference on Advances in Information Systems, (234-244)
  162. Chandramouli R, Subbalakshmi K and Ranganathan N (2004). Stochastic channel-adaptive rate control for wireless video transmission, Pattern Recognition Letters, 25:7, (793-806), Online publication date: 1-May-2004.
  163. Fung W and Liu Y (2003). Adaptive categorization of ART networks in robot behavior learning using game-theoretic formulation, Neural Networks, 16:10, (1403-1420), Online publication date: 1-Dec-2003.
  164. Obaidat M, Papadimitriou G and Pomportsis A (2003). Efficient fast learning automata, Information Sciences: an International Journal, 157:1-2, (121-133), Online publication date: 1-Dec-2003.
  165. Narendra K From feedback control to complexity management Switching and Learning in Feedback Systems, (1-30)
  166. Mostafa J, Mukhopadhyay S and Palakal M (2003). Simulation Studies of Different Dimensions of Users' Interests and their Impact on User Modeling and Information Filtering, Information Retrieval, 6:2, (199-223), Online publication date: 1-Apr-2003.
  167. Sastry P, Magesh M and Unnikrishnan K (2002). Two timescale analysis of the Alopex algorithm for optimization, Neural Computation, 14:11, (2729-2750), Online publication date: 1-Nov-2002.
  168. Papadimitriou G, Pomportsis A, Kiritsi S and Talahoupi E (2002). Absorbing stochastic estimator learning automata for S-model stationary environments, Information Sciences: an International Journal, 147:1-4, (193-199), Online publication date: 1-Oct-2002.
  169. ACM
    Palakal M, Mukhopadhyay S and Mostafa J An intelligent biological information management system Proceedings of the 2002 ACM symposium on Applied computing, (159-163)
  170. Hadjiefthymiades S, Papayiannis S and Merakos L TCP Performance Enhancement in Wireless/Mobile Communications Proceedings of the 26th Annual IEEE Conference on Local Computer Networks
  171. Likas A (2001). Reinforcement Learning Using the Stochastic Fuzzy Min–Max Neural Network, Neural Processing Letters, 13:3, (213-220), Online publication date: 9-Jul-2001.
  172. ACM
    Hadjiefthymiades S and Merakos L Using proxy cache relocation to accelerate Web browsing in wireless/mobile communications Proceedings of the 10th international conference on World Wide Web, (26-35)
  173. Tian F, Li C and Wang D (2001). Evolving information filtering for personalized information service, Journal of Computer Science and Technology, 16:2, (168-175), Online publication date: 1-Mar-2001.
  174. Shi L and O´lafsson S (2000). Nested Partitions Method for Stochastic Optimization, Methodology and Computing in Applied Probability, 2:3, (271-291), Online publication date: 1-Sep-2000.
  175. Peshkin L, Kim K, Meuleau N and Kaelbling L Learning to cooperate via policy search Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence, (489-496)
  176. Oommen B and Roberts T (2000). Continuous Learning Automata Solutions to the Capacity Assignment Problem, IEEE Transactions on Computers, 49:6, (608-620), Online publication date: 1-Jun-2000.
  177. Sarkar S and Soundararajan P (2000). Supervised Learning of Large Perceptual Organization, IEEE Transactions on Pattern Analysis and Machine Intelligence, 22:5, (504-525), Online publication date: 1-May-2000.
  178. ACM
    Hadjiefthymiades S and Merakos L (1999). ESW4, ACM SIGCOMM Computer Communication Review, 29:5, (24-35), Online publication date: 5-Oct-1999.
  179. Yang C A selectionist theory of language acquisition Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, (429-435)
  180. Venkataramana R and Ranganathan N Multiple Cost Optimization for Task Assignment in Heterogeneous Computing Systems Using Learning Automata Proceedings of the Eighth Heterogeneous Computing Workshop
  181. Dorigo M, Di Caro G and Gambardella L (1999). Ant algorithms for discrete optimization, Artificial Life, 5:2, (137-172), Online publication date: 1-Apr-1999.
  182. ACM
    Venkataramana R and Ranganathan N A learning automata based framework for task assignment in heterogeneous computing systems Proceedings of the 1999 ACM symposium on Applied computing, (541-547)
  183. Baird L and Moore A Gradient descent for general reinforcement learning Proceedings of the 12th International Conference on Neural Information Processing Systems, (968-974)
  184. Crites R and Barto A (1998). Elevator Group Control Using Multiple Reinforcement Learning Agents, Machine Language, 33:2-3, (235-262), Online publication date: 1-Dec-1998.
  185. ACM
    Billard E and Lakshmivarahan S Simulation of period-doubling behavior in distributed learning automata Proceedings of the 1998 ACM symposium on Applied Computing, (690-695)
  186. ACM
    Lam W, Mukhopadhyay S, Mostafa J and Palakal M Detection of shifts in user interests for personalized information filtering Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval, (317-325)
  187. Lima P and Saridis G (1996). Learning Optimal Robotic Tasks, IEEE Expert: Intelligent Systems and Their Applications, 11:2, (38-45), Online publication date: 1-Apr-1996.
  188. Oommen B and de St. Croix E (1996). Graph Partitioning Using Learning Automata, IEEE Transactions on Computers, 45:2, (195-208), Online publication date: 1-Feb-1996.
  189. Papadimitriou G (1994). Hierarchical Discretized Pursuit Nonlinear Learning Automata with Rapid Convergence and High Accuracy, IEEE Transactions on Knowledge and Data Engineering, 6:4, (654-659), Online publication date: 1-Aug-1994.
  190. Billard E Learning in multi-level stochastic games with delayed information Proceedings of the Tenth international conference on Uncertainty in artificial intelligence, (86-93)
  191. Unnikrishnan K and Venugopal K (1994). Alopex, Neural Computation, 6:3, (469-490), Online publication date: 1-May-1994.
  192. Oommen B and Zgierski J (1993). Breaking Substitution Cyphers Using Stochastic Automata, IEEE Transactions on Pattern Analysis and Machine Intelligence, 15:2, (185-192), Online publication date: 1-Feb-1993.
  193. ACM
    Oommen B, Valiveti R and Zgierski J A fast learning automaton solution to the keyboard optimization problem Proceedings of the 3rd international conference on Industrial and engineering applications of artificial intelligence and expert systems - Volume 2, (981-990)
Contributors
  • Yale University
  • Indian Institute of Science

Reviews

Ernst L. Leiss

The capability and accessibility of modern digital computers, one of the most powerful scientific tools ever invented, have resulted in computer simulations assuming the role of traditional experiments in many areas of scientific investigation. This is particularly true of complex interconnected information systems. Such systems are generally both hierarchical and distributed in structure and contain a large number of decision makers operating in the presence of great uncertainty. Simulations of these systems, under widely different conditions using heuristic assumptions, provide a qualitative understanding of their performance. This in turn has led to a search for new theoretical principles, paradigms, and methods to explain failure and extend successful results. The learning automaton presented in this book is one such paradigm. The rationale underlying the learning approach is based on the desire for adaptive decision making in highly uncertain stochastic environments. The learning automaton achieves this, and in addition, can be implemented in a distributive manner, a critical feature for dealing with large complex systems. Perhaps most important, unlike heuristic techniques, the automaton approach has been developed in a systematic and analytically tractable fashion. Work on learning automata was started in the 1960s in the Soviet Union by M. L. Tsetlin and his co-workers. However, the term “automaton theory” used by Tsetlin had little in common with the research followed by western automaton theorists and was much closer in spirit to the models of mathematical learning theory and learning machines such as the perceptron. In fact, some of the mathematical models studied intensively by American psychologists in the 1950s could be regarded as the forerunners of the developments in this field. In the 1970s it was realized that similar problems had been independently and extensively investigated in many other fields, including statistics and operations research, but the common basis of these parallel developments was masked by different terminologies. Although the basic questions are quite similar in the different areas, the viewpoint is conditioned to a great extent by the practitioners. During the past two decades research workers in the systems field in different countries have consistently used the term “learning automaton” to describe systems that improve their performance in random environments. It is this class of systems that is treated in this book. …[Its] principal theme is how a sequential decision maker with a finite number of choices in the action set would choose an action at every instant, based on the responses of a random environment (from the Preface). Narendra and Thathachar provide a welcome comprehensive treatment of how automata are used in learning by pulling together results from mathematical systems theory, psychology, and automata theory. Chapter 1 gives a brief survey of some learning strategies, with chapter 2 introducing the basic model of a learning automaton. Chapters 3 and 4 examine fixed- and variable-structure automata and their variants in more detail. The important notion of convergence is explored and analyzed in chapter 5. Chapter 6 presents a major generalization of the previously discussed models by admitting as a response to an action any real value within a finite interval. Chapter 7 introduces time; specifically, it discusses what can and cannot be achieved if the characteristics of the learning environment vary over time. The final two chapters discuss interconnected automata and games, as well as applications of learning automata, such as network routing. Three brief appendices (on Markov chains, Martingales, and distance diminishing operators) survey some formal notions fundamental to the material of the book.

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations