Abstract
One of the most important issues in Adaptive and Intelligent Educational Systems (AIES) is defining effective pedagogical policies for tutoring students according to their needs. This paper proposes using Reinforcement Learning (RL) in the pedagogical module of an educational system so that the system automatically learns which pedagogical policy is best for teaching students. One of the main characteristics of this approach is its ability to improve the pedagogical policy based only on experience acquired with other students who have similar learning characteristics. We study the learning performance of the educational system through three issues: first, the convergence of learning towards accurate pedagogical policies; second, the role of exploration/exploitation strategies in the application of RL to AIES; and finally, a method for reducing the training phase of the AIES.
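To make the idea of learning a pedagogical policy from experience more concrete, the following is a minimal sketch, not the authors' system. It assumes tabular Q-learning with an epsilon-greedy exploration/exploitation strategy; the abstract does not state which RL algorithm, state representation, or reward the paper uses. The simulated student, the two tutoring actions, the reward of 1 on completing the material, and all names and parameter values (`simulate_student`, `p_good`, `p_bad`, `train`) are hypothetical.

```python
import random


def epsilon_greedy(q_row, epsilon, rng):
    """Explore with probability epsilon, otherwise exploit the best-valued action."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    best = max(q_row)
    # Break ties at random so that no action is favoured before learning starts.
    return rng.choice([a for a, v in enumerate(q_row) if v == best])


def q_update(q, s, a, r, s_next, alpha, gamma, terminal):
    """One tabular Q-learning backup: move Q(s, a) towards the bootstrapped target."""
    target = r if terminal else r + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])


def simulate_student(s, a, p_good, p_bad, rng):
    """Hypothetical student: action 0 advances the knowledge level with
    probability p_good, any other action with probability p_bad."""
    p = p_good if a == 0 else p_bad
    return s + 1 if rng.random() < p else s


def train(n_states=4, n_actions=2, episodes=500, alpha=0.5, gamma=0.9,
          epsilon=0.1, p_good=0.9, p_bad=0.2, seed=0):
    """Learn a Q-table over knowledge levels 0..n_states-1 (assumed toy model).

    Reaching level n_states ends the episode with reward 1; every other
    step gives reward 0.
    """
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        while s < n_states:
            a = epsilon_greedy(q[s], epsilon, rng)
            s_next = simulate_student(s, a, p_good, p_bad, rng)
            terminal = s_next == n_states
            r = 1.0 if terminal else 0.0
            q_update(q, s, a, r, s_next, alpha, gamma, terminal)
            s = s_next
    return q


if __name__ == "__main__":
    q_table = train()
    # Greedy policy: the tutoring action with the highest learned value per level.
    policy = [max(range(len(row)), key=row.__getitem__) for row in q_table]
    print(policy)
```

In this sketch, each episode plays the role of one tutoring session, so the Q-table accumulates experience across simulated students. Raising `epsilon` makes the tutor try more alternative actions at the cost of more sub-optimal teaching during training, which is the trade-off the abstract refers to.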
Cite this article
Iglesias, A., Martínez, P., Aler, R. et al. Learning teaching strategies in an Adaptive and Intelligent Educational System through Reinforcement Learning. Appl Intell 31, 89–106 (2009). https://doi.org/10.1007/s10489-008-0115-1