Overview
- This book covers the theory and algorithms of deep learning and it provides detailed discussions of the relationships of neural networks with traditional machine learning algorithms.
- The mathematical aspects are concretely presented without losing accessibility.
- The book is written in a textbook style, and it includes exercises, a solution manual, and instructor slides. The depth and breadth of coverage are unique to the book.
- Includes supplementary material: sn.pub/extras
- Request lecturer material: sn.pub/lecturer-material
Access this book
Tax calculation will be finalised at checkout
Other ways to access
About this book
This book covers both classical and modern models in deep learning. The primary focus is on the theory and algorithms of deep learning. The theory and algorithms of neural networks are particularly important for understanding important concepts, so that one can understand the important design concepts of neural architectures in different applications. Why do neural networks work? When do they work better than off-the-shelf machine-learning models? When is depth useful? Why is training neural networks so hard? What are the pitfalls? The book is also rich in discussing different applications in order to give the practitioner a flavor of how neural architectures are designed for different types of problems. Applications associated with many different areas like recommender systems, machine translation, image captioning, image classification, reinforcement-learning based gaming, and text analytics are covered. The chapters of this book span three categories:
The basics of neural networks: Many traditional machine learning models can be understood as special cases of neural networks. An emphasis is placed in the first two chapters on understanding the relationship between traditional machine learning and neural networks. Support vector machines, linear/logistic regression, singular value decomposition, matrix factorization, and recommender systems are shown to be special cases of neural networks. These methods are studied together with recent feature engineering methods like word2vec.
Fundamentals of neural networks: A detailed discussion of training and regularization is provided in Chapters 3 and 4. Chapters 5 and 6 present radial-basis function (RBF) networks and restricted Boltzmann machines.
Advanced topics in neural networks: Chapters 7 and 8 discuss recurrent neural networks and convolutional neural networks. Several advanced topics like deep reinforcement learning, neural Turing machines, Kohonen self-organizing maps, and generative adversarial networks are introduced in Chapters 9 and 10.
The book is written for graduate students, researchers, and practitioners. Numerous exercises are available along with a solution manual to aid in classroom teaching. Where possible, an application-centric view is highlighted in order to provide an understanding of the practical uses of each class of techniques.
Similar content being viewed by others
Keywords
- Deep Learning
- Machine Learning
- Radial Basis Function Networks
- Restricted Boltzmann Machines
- Recurrent Neural Networks
- Convolutional Neural Networks
- Neural networks
- perceptron
- deep reinforcement learning
- word2vec
- autoencoder
- logistic regression
- dropout
- pretraining
- backpropagation
- conjugate gradient-descent
- Adam
- RMSProp
- Kohonean self-organizaing map
- generative adversarial networks
Table of contents (10 chapters)
Reviews
Authors and Affiliations
About the author
Bibliographic Information
Book Title: Neural Networks and Deep Learning
Book Subtitle: A Textbook
Authors: Charu C. Aggarwal
DOI: https://doi.org/10.1007/978-3-319-94463-0
Publisher: Springer Cham
eBook Packages: Computer Science, Computer Science (R0)
Copyright Information: Springer International Publishing AG, part of Springer Nature 2018
eBook ISBN: 978-3-319-94463-0Published: 25 August 2018
Edition Number: 1
Number of Pages: XXIII, 497
Number of Illustrations: 128 b/w illustrations, 11 illustrations in colour
Topics: Artificial Intelligence, Information Systems and Communication Service, Processor Architectures