Michael M. Resch · Wolfgang Bez · Erich Focht · Nisarg Patel · Hiroaki Kobayashi (Editors)

Sustained Simulation Performance 2016
Proceedings of the Joint Workshop on Sustained Simulation Performance,
University of Stuttgart (HLRS) and Tohoku University, 2016

Editors

Michael M. Resch
High Performance Computing Center (HLRS)
University of Stuttgart
Stuttgart, Germany

Nisarg Patel
High Performance Computing Center (HLRS)
University of Stuttgart
Stuttgart, Germany

Erich Focht
NEC High Performance Computing Europe GmbH
Stuttgart, Germany
Figure on Front Cover: Domain decomposition of a hierarchical Cartesian mesh. A Hilbert curve is used
to partition the grid at a relatively coarse refinement level. Due to the depth-first ordering of the cells, this
leads to complete subtrees being distributed among the available MPI ranks, improving the parallel
performance of coupled multiphysics simulations
Figure on Back Cover: Hierarchical Cartesian mesh with local refinement towards the lower boundary.
Among neighbouring cells, the level difference is at most one, leading to a size ratio of 2:1 (2D) or 4:1
(3D) between the cells
Mathematics Subject Classification (2010): 68Wxx, 68W10, 68Mxx, 68U20, 76-XX, 86A10, 70FXX,
92Cxx, 65-XX
Preface
We would like to thank all the contributors and organizers of this book and the Sustained Simulation Performance project. We especially thank Prof. Hiroaki Kobayashi for the close collaboration over the past years and look forward to intensifying our cooperation in the future.
Vl. V. Voevodin
Abstract Each new computing platform required software developers to analyze the
algorithms over and over, each time having to answer the same two questions. Does
the algorithm possess the necessary properties to meet the architectural requirements?
How can the algorithm be converted so that the necessary properties can be easily
reflected in parallel programs? Changes in computer architecture do not change
algorithms, but this analysis had to be performed again and again when a program was
ported from one generation of computers to another, largely repeating the work that
had been done previously. Is it possible to do the analysis “once and for all,” describing
all of the key properties of an algorithm so that all of the necessary information can
be gleaned from this description any time a new architecture appears? As simple
as the question sounds, answering it raises a series of other non-trivial questions.
Moreover, creating a complete description of an algorithm is not a single challenge but a large series of challenges, some of which are discussed in this paper.
1 Introduction
Parallel computing system architectures have gone through at least six generations
over the past 40 years, each requiring its own algorithm properties and a special
program writing style. In each case, it was important not only to find suitable features
for the algorithms, but also to express them properly in the code, using special
programming technologies. In fact, each new generation of computing architecture
required a review of the entire software pool.
The generation of vector pipeline computers got off to a rapid start in the mid-seventies with the launch of the Cray-1 supercomputer. Machines of this class were
based on pipeline processing of data vectors, supported by vector functional units and
vector instructions in machine code. Full vectorization was the most efficient program
implementation, which implied complete replacement of any innermost loops in the
program body with vector instructions. Hence the requirements for algorithms and programs were dictated by this need for full vectorization.
Each new computing platform required software developers to analyze the algorithms
over and over, each time having to answer the same two questions. Does the algorithm
possess the necessary properties to meet the architectural requirements? How can
the algorithm be converted so that the necessary properties can be easily reflected in
parallel programs? Changes in computer architecture do not change algorithms, but
this analysis had to be performed again and again when a program was ported from
one generation of computers to another, largely repeating the work that had been
done previously.
This begs a natural question: is it possible to do the analysis “once and for all,”
describing all of the key properties of an algorithm so that all of the necessary infor-
mation can be gleaned from this description any time a new architecture appears? As
simple as the question sounds, answering it raises a series of other questions. What
does it mean “to perform analysis” and what exactly needs to be studied? What kind
of “key” properties need to be found in algorithms to ensure their efficient imple-
mentation in the future? What form can (or should) the analysis results take? What
makes a description of algorithm properties “complete?” How does one guarantee
that a description is complete and that all of the relevant information for any computer
architecture is included?
The questions are indeed numerous and non-trivial. Obviously, a complete
description needs to reflect many ideas: computational kernels, determinacy, infor-
mation graphs, communication profiles, a mathematical description of the algorithm,
performance, efficiency, computational intensity, the parallelism resource, serial
complexity, parallel complexity… [3]. All of these concepts, and many others, are used to describe an algorithm's properties from different perspectives, and all of them are needed in practice in various situations.
To immediately introduce some order to these diverse concepts, one can begin
by breaking up an algorithm’s description into two parts. The first part is dedicated
to the algorithm’s theoretical properties, and the second part describes its particular
implementation features. This division allows the machine-independent properties of
algorithms to be separated from the numerous issues arising in practice. Both parts of
the description are equally important: the first one describes the algorithm’s theoreti-
cal potential, and the second one demonstrates the practical use of that potential. The
first part of the description explains the mathematical formulation of the algorithm,
its computational kernel, input and output data, information structure, parallelism
resources and properties, determinacy and computational balance of the algorithm,
etc. The second part contains information on an algorithm’s implementation: locality,
performance, efficiency, scalability, communication profile, implementation features
on various architectures, and so on.
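As a rough illustration (not part of the original text or of AlgoWiki itself; all field names below are hypothetical), such a two-part description could be captured in a simple data structure separating machine-independent properties from implementation features:

from dataclasses import dataclass, field
from typing import Dict, List

@dataclass
class TheoreticalPart:
    # Machine-independent properties of the algorithm.
    mathematical_description: str
    computational_kernel: str
    input_data: str
    output_data: str
    serial_complexity: str
    parallel_complexity: str
    parallelism_resource: str
    determinacy: bool

@dataclass
class ImplementationPart:
    # Properties of one particular implementation.
    locality: str
    scalability: str
    communication_profile: str
    performance_by_architecture: Dict[str, str] = field(default_factory=dict)

@dataclass
class AlgorithmDescription:
    name: str
    theory: TheoreticalPart
    implementations: List[ImplementationPart] = field(default_factory=list)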
Many of the ideas described above are very well known. However, as you start
describing the properties of real algorithms, you realize that creating a complete
description of an algorithm is not a challenge, it is a large series of challenges!
Unexpected problems arise at each step, and a seemingly simple action becomes a
stumbling block. Let’s look at the information structure of an algorithm mentioned
above. It is an exceptionally useful term that contains a lot of information about the
algorithm. An information graph is a convenient representation of an algorithm’s
information structure. In many cases, looking at the information graph is enough to understand the algorithm's parallel implementation strategy. Figure 1a, b show the information
structure for typical computational kernels in many algorithms, Fig. 1c shows the
information structure of a Cholesky decomposition algorithm.
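To make the notion concrete, the following small sketch (an illustrative construction, not taken from the paper) builds the information graph of a simple prefix-sum loop; every operation depends on the result of the previous one, so the graph is a serial chain:

def prefix_sum_information_graph(n):
    # Information graph of the loop
    #     for i in 1 .. n-1:  s[i] = s[i-1] + a[i]
    # Vertices are the individual operations, arcs are data dependencies.
    vertices = ["op_%d" % i for i in range(1, n)]
    arcs = [("op_%d" % (i - 1), "op_%d" % i) for i in range(2, n)]
    return vertices, arcs

vertices, arcs = prefix_sum_information_graph(6)
print(vertices)  # ['op_1', 'op_2', 'op_3', 'op_4', 'op_5']
print(arcs)      # a serial chain: this loop carries no parallelism resource
# The graph's size depends on the external parameter n, which is why a
# description normally shows one small, "typical" graph only.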
An information graph can be simple for many examples. However, in general,
the task of presenting an information graph is not a trivial exercise. To begin with,
a graph can potentially be infinite, as the number of vertices and arcs is determined
by the values of external input variables which can be very large. In this situation it
helps to look at likenesses: graphs for different values of external variables look very
“similar” to one another, so it is almost always enough to present one small graph,
stating that the graphs for other values will look “exactly the same.” Not everything
is so simple in practice, however, and one should be very careful here.
Next, an information graph is potentially a multi-dimensional object. The most
natural coordinate system for placing vertices and arcs in an information graph relies on the indices of the nested loops in which the algorithm's operations are performed.
Fig. 3 Sequence of steps in the parallel execution of an algorithm based on a canonical parallel layer form
Can one judge the locality of a future program using just the information about its algorithm? On the one hand, there are no data
structures in algorithms—they only appear in programs; so talking about locality for
algorithms is not exactly right. On the other hand, it is the algorithm that determines
the structure and properties of a program to be coded, including its locality. Many
have probably heard the expression “the algorithm’s locality” or “this algorithm
has better locality than the other.” How appropriate are these statements, given that
algorithms do not contain data structures?
Determinacy is an important practical aspect of algorithms and programs, but how
can one describe all of the potential sources which violate this property? A serious
cause of indeterminacy in parallel programs is related to changes in the order of
executing associative operations. A typical example is the use of global operations
in Message Passing Interface (MPI) by a number of parallel processes, e.g., when
summing the elements of a distributed array. The MPI runtime system chooses the
order of execution on its own, assuming compliance with the associative law, which
results in various round-off errors and ultimately in different results when executing
the same application. This is a serious issue often encountered in massively parallel computing systems, as it causes the results of parallel program execution to be non-reproducible. If the analysis of an algorithm's structure shows that the resulting parallel
application cannot work without global operations, this property must be included in
the algorithm description. To analyze this problem properly, a communication profile
should be built for the parallel program, pointing out the structure and interaction
method between parallel processes. A clear definition of the communication profile
has not been produced to date, so it is premature to consider in-depth analysis in this
area.
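The round-off effect described above can be illustrated even without MPI: summing the same floating-point values in a different order, as a runtime system performing a global reduction is free to do, can produce slightly different results. A minimal, self-contained sketch:

import random

random.seed(0)
# Values of widely varying magnitude make round-off differences visible.
values = [random.uniform(-1.0, 1.0) * 10.0 ** random.randint(-8, 8)
          for _ in range(100_000)]

ordered_sum = sum(values)        # one fixed summation order
shuffled = list(values)
random.shuffle(shuffled)         # another order, as a reduction tree might use
shuffled_sum = sum(shuffled)

print(ordered_sum, shuffled_sum)
print("difference:", ordered_sum - shuffled_sum)   # typically non-zero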
Indeed, there are many open questions, and the list can go on. The main question
that still remains unanswered is “What does it mean to create a complete description
of an algorithm?” What must be included in this description, so that we can glean all
of the necessary information from it every time a new computing platform appears?
The task seems simple at first sight: an algorithm is just a sequence of mathematical
formulas, often short and simple, which should easily be analyzed. But at the same
time, no one can guarantee the completeness of such a description.
The properties of the algorithms and programs discussed in this work became
the foundation for the AlgoWiki project [1]. The project’s main goal is to provide a
description for fundamental algorithm properties which will enable a more compre-
hensive understanding of their theoretical potential and their implementation features
in various classes of parallel computing systems. The project is expected to result in
the development of an open online encyclopedia based on wiki technologies which
will be open to contributions by the entire academic and educational community.
The first version of the encyclopedia is available at http://AlgoWiki-Project.org/en,
where users can describe both their own pedagogical experience and their knowledge
of specific parallel algorithms.
4 Conclusion
All of the issues discussed in this work are highly important for training future spe-
cialists [4–6]. Right from the beginning of the education process, focus should be
placed on algorithm structure since it determines both the implementation quality
and the potential for efficiently executing programs in a parallel environment. The
algorithm structure and its close relationship to parallel computing system architec-
ture are central ideas in parallel computing, which are included in many courses for
Bachelor’s and Master’s degree programs at the Faculty of Computational Mathe-
matics and Cybernetics at Lomonosov Moscow State University, as well as in the
lectures and practical courses offered by the annual MSU Summer Supercomputing
Academy [7]. We are also trying to expand this concept to the Supercomputing Con-
sortium of Russian Universities [8] in order to develop a comprehensive supercom-
puter education system, rather than offering occasional training aimed at rectifying
the situation.
Acknowledgements This project is being conducted at Moscow State University with financial
support from the Russian Science Foundation, Agreement No 14-11-00190.
References
1. Antonov, A., Voevodin, V., Dongarra, J.: AlgoWiki: an open encyclopedia of parallel algorithmic
features. Supercomput. Front. Innov. 2(1), 4–18 (2015)
2. Dongarra, J., Beckman, P., Moore, T., Aerts, P., Aloisio, G., Andre, J.C., Barkai, D., Berthou,
J.Y., Boku, T., Braunschweig, B., et al.: The international exascale software project roadmap.
Int. J. High Perform. Comput. Appl. 25(1), 3–60 (2011)
3. Voevodin, V.V., Voevodin, Vl.V.: Parallel Computing. BHV-Petersburg, St. Petersburg (2002). (in Russian)
4. Computing Curricula Computer Science. http://ai.stanford.edu/users/sahami/CS2013 (2013)
5. Future Directions in CSE Education and Research, Workshop Sponsored by the Society for Industrial and Applied Mathematics (SIAM) and the European Exascale Software Initiative (EESI-2). http://wiki.siam.org/siag-cse/images/siag-cse/f/ff/CSE-report-draft-Mar2015.pdf (2015)
6. NSF/IEEE-TCPP Curriculum Initiative on Parallel and Distributed Computing. http://www.cs.gsu.edu/~tcpp/curriculum/
7. Summer Supercomputing Academy. http://academy.hpc-russia.ru/
8. Supercomputing Education in Russia, Supercomputing Consortium of the Russian Universities.
http://hpc.msu.ru/files/HPC-Education-in-Russia.pdf (2012)
High Performance Computing and High Performance Data Analytics—What is the Missing Link?
Abstract Within this book chapter, technologies for data mining, data processing and data interpretation are introduced, evaluated and compared. In particular, traditional High Performance Computing and the newly emerging fields of High Performance Data Analytics and Cognitive Computing are put into context in order to understand their strengths and weaknesses. The technologies are not evaluated in isolation; the missing links between them are also identified and described.
1 Introduction
At this point in time, there are various technologies on the market that target data analysis, data processing, data interpretation and data mining. So far, it has not been clear whether all of those technologies are direct competitors or can be seen as complementary. This book chapter therefore analyses the technologies carefully, introduces them and compares their respective approaches. More concretely, traditional High Performance Computing, the newly emerging field of High Performance Data Analytics as well as Cognitive Computing are evaluated. In addition, the interactions between these technological fields are visualized.
The book chapter is organized as follows: Sect. 2 provides the High Performance Computing context, Sect. 3 introduces High Performance Data Analytics, and Sect. 4 compares the approaches and describes the missing links. Finally, Sect. 5 concludes this book chapter.
2 High Performance Computing
Within this section of the book chapter, a generic view on High Performance Computing (HPC) and its evolution over time is given. Although the purpose of such HPC systems has in principle stayed the same, the available performance, the customer base as well as the computational and application models have changed in the last decade. In summary, various application areas such as computational fluid dynamics, climate or physics simulations are currently considered HPC-relevant; they are executed on innovative systems that may be equipped with vector central processing units, with commonly used x86 processors or even with accelerators.
Within the last decade, there has been a huge evolution with regard to HPC systems. Reaching from vector machines to the widely adopted x86 architecture and modern accelerators, the hardware in particular evolved quickly. In the meantime, HPC systems with more than 1,000,000 cores are no longer a utopia (see the Top500 list, http://www.top500.org), so that besides the efficiency of the systems, the models and applications can also benefit from the huge amount of computational performance provided.
But not just the hardware evolved; the customer base changed as well: industrial applications from the automotive world, academic applications dealing with, for instance, climate simulation, as well as applications from small and medium-sized
enterprises from various kinds of areas are targeting High Performance Computing systems. However, with the evolved systems and the immense performance, the execution models also become more complicated. On the one hand, there are still traditional applications that require a huge amount of resources for a single run; on the other hand, parametric studies with less constant performance requirements, but generating a huge amount of results, are common on state-of-the-art HPC systems.
Nevertheless, classical HPC-driving applications are still usual in the High Performance Computing area, but due to the changing application and execution models, general-purpose systems are becoming more prominent, as large compute-intensive applications typically produce a huge amount of results. So there is currently a trade-off between providing generic systems that are flexible enough to cope with different kinds of workloads, and systems that are built solely to provide one single key performance type.
3 High Performance Data Analytics
In contrast to Sect. 2, this section focuses on High Performance Data Analytics (HPDA), a newly emerging field in the High Performance Computing sector. High Performance Data Analytics targets the efficient analysis of various kinds of data, reaching from structured up to unstructured as well as streaming data, which cannot be analysed any more on standard workstations or Clouds due to their volume, their variety or their velocity.
As already highlighted in the previous sections, HPC and HPDA approaches require different technologies in terms of hardware and software. Therefore, these requirements are discussed in this sub-section in order to bridge the gap between both technologies.
In terms of hardware, data-intensive workloads require different key performance indicators than standard HPC applications. The differences between both approaches are highlighted below:
• Processors
In traditional HPC systems, the focus is on fast processors with fast memory pipelines. For HPDA systems, the amount of Floating Point Operations Per Second is still important; however, the performance of the system is determined by the storage system.
• Memory
The more memory is available for data analytics, the better for the overall application execution, since most of the data and results can be kept in memory instead of checkpointing them to the storage backend. For HPC systems, the same statement holds, although much smaller memory systems are targeted than in the HPDA area.
• Networks
Whenever data needs to be transferred, fast interconnects come into play. Both HPC and HPDA systems therefore require fast, latency-oriented networks in order to transfer the data efficiently.
• Storage
Typical HPC systems provide a central storage system from which all the required data is read and written. An approach like this is not possible for HPDA, since data accessibility is the key performance indicator for the whole application. Therefore, data analytics systems provide fast local disks that can be used to hold and cache the data in order to optimize the application execution.
As can be seen, the main differences between HPC and HPDA systems lie in the areas of processors and storage, since fast number-crunching processors are required for HPC only. In contrast, very fast input/output systems with large capacity are mandatory for efficient data processing.
The software requirements go along with the hardware requirements. In contrast to traditional HPC applications, which require programming models and paradigms such as message passing or shared-memory parallelism, data analytics applications rely on in-memory processing and programming languages such as Java, Python or Scala. The most important frameworks for data analytics are currently the Apache tools Spark, Hadoop, Storm and Flink, as well as some smaller projects such as the Disco Project, DataTorrent or BashReduce.
Most of those frameworks build on the MapReduce algorithm, which was introduced by Google. The MapReduce algorithm consists of three phases (map, shuffle and reduce), whereas the map and the reduce parts are directly specified by the user in order to allow parallel processing of data on many machines. Using this concept enables processing different kinds of data, reaching from structured data including files and databases up to unstructured and real-time data such as online data composed of several data structures.
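As an illustration of the map/shuffle/reduce structure described above, the following minimal, framework-free word-count sketch mimics the three phases on a single machine; frameworks such as Hadoop or Spark distribute the same phases across many machines:

from collections import defaultdict

documents = ["to be or not to be", "to thine own self be true"]

# Map phase: a user-defined function emits (key, value) pairs.
mapped = [(word, 1) for doc in documents for word in doc.split()]

# Shuffle phase: group all values that belong to the same key.
groups = defaultdict(list)
for key, value in mapped:
    groups[key].append(value)

# Reduce phase: a user-defined function aggregates each group.
counts = {key: sum(values) for key, values in groups.items()}
print(counts)   # e.g. {'to': 3, 'be': 3, 'or': 1, ...}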
In order to substantiate the statements of the previous sections and sub-sections, the information shall be complemented with a practical example from the Global Systems Science community, which represents an emerging field in the HPC sector. Within the EC-funded CoeGSS project, a set of applications is targeted that requires particular workflows to retrieve the results. In particular, the workflow foresees HPDA, huge-scale HPC, small-scale HPC and visualization to generate synthetic populations, execute the resulting agent-based models and, finally, visualize the results [1]. For clarification, the workflow and its targeted technologies are depicted in Fig. 1.
Thus, those kinds of applications demonstrate that there is a need to support other methods and techniques than the ones classical HPC applications demand. As a consequence, staying competitive in terms of hardware and software reaches a new level of complexity.
The resulting outputs, especially in terms of data variety and data size, become hard to handle for a human in the loop.
We see a tendency in so-called "business-ready solutions" to stress the support of the human in the loop by applying technological fields such as machine learning, artificial intelligence or cognitive computing. For the remainder of this paper we will stick to the term cognitive computing as a placeholder for the above-mentioned disciplines, which can be described as the variety of scientific disciplines of Artificial Intelligence and Signal Processing. A similar view has been presented by James Kobielus, Big Data Evangelist, in a 2013 blog entry on Cognitive Computing: Relevant at all Speeds, Scales and Scopes of Thought, where he defines cognitive computing as
the ability of automated systems to handle the conscious, critical, logical, attentive, reasoning
mode of thought that humans engage in when they, say, play Jeopardy or try to master some
academic discipline.
The principles of cognitive computing are not new, and nearly everyone who is in the Information Technology business has heard of this topic at some point in time. Thus it is also not surprising that its base assumptions and ideas were reported as early as the middle of the 19th century, when Boole published his book on "The Laws of Thought" [2]. This was just a conceptual approach, however, and the first programmable computer by Zuse appeared only almost a century later. As already mentioned before, during the evolution of these principles, the domain of cognitive methodologies and that of artificial intelligence went either side by side or showed clear overlaps. A variety of theories and implementation approaches were taken, the most prominent ones so far probably being IBM's Watson [3] and the recently presented AlphaGo [4].
4.2 Benefits
Figure 2 shows how High Performance Computing, High Performance Data Analytics and Cognitive Techniques can complement each other. High Performance Computing (HPC) delivers the processing power needed for those kinds of applications that require massively parallel execution. At the same time, these kinds of applications partly produce enormous amounts of data, which may be too big to be analysed manually, even with current support tools at hand. Thus the discipline of High Performance Data Analytics can be used to analyse and handle these data sets (and those from other sources) in a sufficient way. Cognitive techniques can provide support to both disciplines, helping to interpret and present the results in the best possible way.
In general, the expected benefits from applying these concepts are manifold. Support is improved for those fields where big amounts of data are collected, handled and interpreted; examples are:
• Enhanced analysis of the business potential of new offerings or new activities. This can reach from the virtual testing of new opportunities, e.g. in drug design, to combined virtual and real-world simulations such as finding new geographic locations for drilling
• Support of staff (e.g. engineers) in decision processes by providing them with a selection of potential paths to follow
• Improved operations, by understanding performed operations and their parameters, so that processes can be optimized either in real time or after longer-duration analysis
Taking this complementarity into account, the workflow as described in Fig. 1 can be extended to the one presented in Fig. 3.
Fig. 2 Cognitive Techniques complementing the global picture of HPC and HPDA
Within this document, we also want to have a short look at those technologies which may act as a baseline to realize an integration of cognitive concepts into a traditional HPC/HPDA-based workflow (e.g. the one presented in Fig. 3).
In the case of Watson, a variety of APIs is available for selected developers and business users, as well as the Watson Analytics solution (http://www.predictiveanalyticstoday.com/ibm-watson-analytics-beta-open-business/). Furthermore, there is a variety of Open Source alternatives available, which shall be discussed at a high level in the following overview:
DARPA DeepDive
DeepDive [5, 6] is a free version of a Watson-like system. It was developed within the frame of the US Defense Advanced Research Projects Agency (DARPA) and, in contrast to Watson, has the aim to extract structured data from unstructured data sources. DeepDive uses machine learning technologies to train itself and especially targets those users with moderate to no machine learning expertise.
UIMA
Apache Unstructured Information Management Architecture (UIMA, http://uima.apache.org/) supports the analysis of large sets of unstructured information. It is an implementation of the OASIS Unstructured Information Management standard (https://www.oasis-open.org/committees/download.php/28492/uima-spec-wd-05.pdf).
OpenCog
OpenCog [7] is a project targeting artificial intelligence and delivering an open source framework. One output of OpenCog is the cognitive architecture OpenCog Prime [8] for robot and virtual embodied cognition.
5 Conclusions
The previous sections have pointed out that High Performance Computing and High Performance Data Analytics can be seen as rather complementary approaches than as direct competitors. Even though there are activities to provide a common software stack which may run on both HPC- and HPDA-specific hardware, there is only a subset of concrete problems in the problem space that can be addressed efficiently in such a manner. Mainly, this is a result of the partially quite different hardware setup of the respective technological environments.
Now, assuming that HPC and HPDA deliver high performance, we also have to face the fact that the size and amount of the data sets processed and, in turn, resulting from this processing enter a dimension which makes satisfactory manual processing by a human in the loop (e.g. an engineer) nearly impossible. Thus we see that even if one issue (e.g. data analytics) is solved with those appliances, another issue pops up, which is the understanding and handling of information.
For that purpose we have introduced cognitive technologies, which can act as a sort of "helper" technology to simplify the life of the end user and enable improved use of simulation results. This technology, even if it still appears to be in its infancy, can support the (human) end user and provide decision baselines allowing improved processing of information. We have shown that a variety of implementations already exist; the next steps need to assess how far they can cover the requirements of selected use cases.
References
1. Wolf, S., Paolotti, D., Tizzoni, M., Edwards, M., Fuerst, S., Geiges, A., Ireland, A., Schuetze,
F., Steudle, G.: D4.1 - First report on pilot requirements. http://coegss.eu/wp-content/uploads/2016/03/CoeGSS_D4_1.pdf
2. Boole, G.: Investigation of the Laws of Thought on Which are Founded the Mathematical
Theories of Logic and Probabilities (1853)
3. Ferrucci, D.A., Brown, E.W., Chu-Carroll, J., Fan, J., Gondek, D., Kalyanpur, A., Lally, A., Mur-
dock, J.W., Nyberg, E., Prager, J.M., Schlaefer, N., Welty, C.A.: Building Watson: an overview
of the DeepQA project. AI Mag. 31(3), 59–79 (2010)
4. Silver, D., Hassabis, D.: AlphaGo: mastering the ancient game of Go with Machine Learning, Blogpost. https://research.googleblog.com/2016/01/alphago-mastering-ancient-game-of-go.html (2016)
5. Niu, F., Zhang, C., Re, C., Shavlik, J.W.: DeepDive: web-scale knowledge-base construction using statistical learning and inference. In: VLDS 2012, CEUR Workshop Proceedings, vol. 884, pp. 25–28 (2012)
6. Zhang, C.: DeepDive: a data management system for automatic knowledge base construction,
Ph.D. Dissertation, University of Wisconsin-Madison (2015)
7. Hart, D., Goertzel, B.: OpenCog: a software framework for integrative artificial general intelli-
gence. In: Wang, P., Goertzel, B., Franklin, S. (eds.) ’AGI’, pp. 468–472. IOS Press (2008)
8. Goertzel, B.: OpenCog Prime: a cognitive synergy based architecture for artificial general intelligence
9. Hurwitz, J.S., Kaufman, M., Bowles, A.: Cognitive Computing and Big Data Analytics. Wiley,
Indianapolis (2015)
A Use Case of a Code Transformation Rule Generator for Data Layout Optimization
1 Introduction
When data are stored in a memory space, the layout of the data often needs to be optimized so as to make better use of the memory hierarchy and architectural features. Today, such data layout optimization is critically important to achieve high performance on a modern high-performance computing (HPC) system, because the system performance is very sensitive to memory access patterns. Memory access can easily become a performance bottleneck of an HPC application.
The data layout of an application can be optimized by changing data structures
used in the code. One problem is that a human-friendly, easily-understandable data
representation is often different from a computer-friendly data layout. This means
that, if the data layout of a code is completely optimized for computers, the code may no longer be human-friendly.
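A common instance of this tension is the choice between an array of structures (often the human-friendly representation) and a structure of arrays (often the computer-friendly layout with contiguous, unit-stride accesses). The following sketch (an illustrative example using NumPy, not taken from the article) contrasts the two layouts:

import numpy as np

n = 1_000_000

# Human-friendly "array of structures": one record per particle.
aos = np.zeros(n, dtype=[("x", "f8"), ("y", "f8"), ("z", "f8"), ("mass", "f8")])

# Computer-friendly "structure of arrays": one contiguous array per field.
x = np.zeros(n)
y = np.zeros(n)
z = np.zeros(n)
mass = np.zeros(n)

# Summing only the x coordinates strides over every 32-byte record in the
# AoS layout, but reads a single contiguous array in the SoA layout.
aos_x_sum = aos["x"].sum()
soa_x_sum = x.sum()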
We have been developing a code transformation framework, Xevolver, so that
users can define their own rules to transform an application code [1, 2]. In this
article, such a user-defined code transformation rule is adopted to separate the data
representation in an application code from the actual data layout in a memory space.
Instead of simply modifying a code for data layout optimization, the original code
is usually maintained in a human-friendly way and then mechanically transformed
just before the compilation so as to make the transformed code computer-friendly.
One important question is how to describe code transformation rules. A con-
ventional way of developing such a code translator is to use compiler tools, such as
ROSE [3]. Actually, at the lowest abstraction level, Xevolver allows users to describe
a code transformation rule as an AST transformation rule. Since AST transforma-
tion is exactly what compilers internally do, compiler experts can implement various
code transformation rules by using the framework. However, standard programmers
who optimize HPC application codes are not necessarily familiar with such compiler
technologies. Therefore, we are also developing several high-level tools to describe
the rules more easily.
Xevtgen [4] is one of the high-level tools that help users define custom code transformation rules. This article shows a use case of Xevtgen for data layout optimization, and discusses how it can help users define their own transformations.
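Xevtgen and Xevolver operate on Fortran/C codes; purely as a language-neutral illustration of the general idea of a user-defined, rule-based AST transformation, the following sketch uses Python's ast module (Python 3.9+) to rewrite accesses of the hypothetical form p[i].x (array-of-structures style) into x[i] (structure-of-arrays style), so that the maintained source can stay human-friendly while the transformed code becomes computer-friendly:

import ast

class AosToSoa(ast.NodeTransformer):
    # Illustrative rule: rewrite p[i].x into x[i] for the listed fields.
    FIELDS = {"x", "y", "z"}

    def visit_Attribute(self, node):
        self.generic_visit(node)
        if (isinstance(node.value, ast.Subscript)
                and isinstance(node.value.value, ast.Name)
                and node.value.value.id == "p"
                and node.attr in self.FIELDS):
            # Reuse the original index expression, e.g. p[i].x -> x[i].
            return ast.copy_location(
                ast.Subscript(value=ast.Name(id=node.attr, ctx=ast.Load()),
                              slice=node.value.slice,
                              ctx=node.ctx),
                node)
        return node

source = "total = p[i].x + p[j].y"
tree = ast.fix_missing_locations(AosToSoa().visit(ast.parse(source)))
print(ast.unparse(tree))   # total = x[i] + y[j]

The intent mirrors what the article describes: the programmer specifies the rule once, and the original code is transformed mechanically just before compilation.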