Matrix Mathematics Chemical Problems: Applications
Matrix Mathematics Chemical Problems: Applications
Matrix Mathematics Chemical Problems: Applications
Refractive Index Curve. Small weighing bottles of about 30- calibration points from the line was within the limits of accuracy
ml. capacity, with female-joint tops, were found convenient for (0.05 O C. ) of t h e temperature-measuring equipment. Subse-
weighing out the liquids in the determination of t h e refractive quent voltage readings were corrected b y adding t h e AE for
index curve. T h e fact t h a t the grinding on the bottle proper each When the boiling point of the carefully puSified
was on t h e outside prevented accidental wetting of t h e ground toluene was measured, it was 110.59",as compared t o a National
portion as t h e liquids were poured in from small graduated Bureau of Standards value of 110.62'. This serves as a further
cylinders. The smallest quantity of a n y one liquid weighed in indication of the accuracy of the temperature measurements.
these determinations was 1 gram, and because the balance was
accurate to 0.0002 gram, t h e combined error for any sample could Literature cited
not be over 0.04%.
Pressure-Regulating System. The barostat used was that Bromiley, E. C., and Quiggle, D., IND.ENG.CHEX.,25, 1136
designed by Willingham and Rossini ( 2 0 ) a t t h e National Bureau (1933).
of Standards. It was thoroughly cleaned, then heated under Brown, J., and Ewald, A . H., Australian J . Sci. Research, 3A,
high vacuum and filled with clean, redistilled mercury, a vacuum 306-23 (1950); 4A, 198-212 (1951).
being obtained b y carefully boiling out dissolved air under vac- Cornell, L. W., and Montonna, R. E., IKD.ENQ.CHEM.,25,
uum IThile the filling was in progress. The barostat was installed 1331-35 (1933).
in an air thermostat, the temperature of which was controlled t o Fowler, R. T., J. SOC.Chem. Ind. (London), 68, 131-2 (1949);
u-ithin 3" C. b y a Fenwall thermoswitch, which was surrounded Ind. Chemist, 24, 717, 824 (1948).
by the circulating air stream. T h e temperature of the mercury Garner, F. H., Ibid., 25, 238 (1948).
lagged behind t h a t of the air, and consequently did not change Gillespie, D. T. C., IND.ENG.CHEM.,ANAL.ED.,18, 575 (1940).
over as a i d e a range. 4 dibutyl phthalate manometer regis- Jones, C. A., Schoenborn, E. >I., and Colburn, A. P., IND.
tered the difference between the system pressure and t h a t of t h e ENG.CHEM.,35, 666-72 (1943).
atmosphere. B y reading this manometer and the barometer, Kortiim, G., Moegling, D., and Woerner, F., Chem. Eng. Tech.,
it was possible t o calculate the pressure within the apparatus. 22, 453-7 (1950).
Kitrogen was supplied t o the barostat through a diaphragm Natl. Bur. Standards, Circ. C461, 39, 43, 123, 126 (1947).
valve, from a standard nitrogen cylinder. When the mercury Othmer, D. F., IND.ENQ.CHEM.,20,743-6 (1928); Anat. Chem.,
column reached the upper contact of the barostat, a circuit was 20, 763 (1948).
completed which activated a relay t h a t opened a solenoid valve Regnault, Mem. mad. inst. France, 26, 727 (1862).
and allowed escape of nitrogen through a needle valve. All runs Rose, Arthur, Williams, E. T., Sanders, W. W., Heiny, R. L.,
were made a t 760 mm. of mercury T h e prevailing pressure in and Ryan, J. F., IND.ENG.CHEW,45, 1568-72 (1963).
State College is about 730 mm. The pressure variation due t o Rose-Innes and Young, Phil. Mag., (V) 47, 353 (1899).
the variation of the temperature of the mercury was about 0.1 Saddington, A. W., and Krase, N. W., J . Am. Chem. Soc., 56,
mm. This far exceeds pressure variation due t o the opening and 353-61 (1934).
closing of the solenoid valve. It was easy to adjust t h e dia- Sage, B. H., and Lacey, W. N., Trans. Am. Inst. Mining Met.
phragm valve on the nitrogen cylinder and the final outlet needle Engrs., 136, 13G-57 (1940).
valve, so that the pressure variation could not be detected on the Sieg, L., Chem. Eng. Tech., 22, 322-6 (1950).
dibutyl phthalate manometer. Steinhauser, H. H., and White, R. R., IND.ENG.CHEY.,41,
Temperature Measurement. The temperature-measuring sys- 2912-20 (1949).
tem consisted of a calibrated single-junction copper-constantan Swietoslawski. W., J . Chem. Educ., 5, 469 (1928).
thermocouple, a Rubicon galvanometer with a sensitivity of Williams, E. T., Ph.D. thesis, Pennsylvania State College,
1.1 pv. per mm., and a Leeds & Northrup Type K-2 potenti- University Park, Pa., 1952.
ometer, which could be read to 0.1 mv. Willingham, C. B., and Rossini, F. D., API Research Project
The calibration was accomplished b y reading the electromotive 6, December 1945.
force with carefully purified n-heptane, and with pure water Zawideki, J., 2. physik. Chem., 35, 129 (1900).
boiling in the apparatus. The difference between the observed
voltage and the voltage of a standard sample was taken a t each R E C E I V Efor
D review April 21, 1953. ACCEPTED February 2 , 1955.
point and plotted against the observed voltage. A straight Presented before the Division of Industrial end Engineering Chemistry a t
line was drawn from the zero point ( A E = 0, EOb4 = 0) the 123rd Meeting of the AMERICAN CHEMICAL SOCIETY,Los .4ngeles,
which passed between the plotted points. The distance of t h e Calif,
it seems t h a t the aeronautical and electrical engineers were the where t h e ai's are scalars, is, of course, another matrix, P(A).
first among t h e engineers t o have employed matrices extensively Then
i n their researches. At present, of course, this interest is wide-
spread among applied mathematicians, as revealed b y the ap-
propriate literature.
The present article will, i t is hoped, serve two purposes-to
present t h e essential features of matrix algebra, and thus make is said t o b e a polynomial of A of t h e n t h degree. This, of course,
them known to a larger circle of chemical engineers; and t o is a n extension of the definition of polynomials for scalars. Note
illustrate the usefulness of this new approach by means of some that A", A"-',. . ., A2, A, Zare all of the same order.
rather interesting examples. N o general theorems will be proved, Power Series in Matrices. A natural extension of the above
except in some special cases, and constant reference t o the text- definition of a polynomial of a matrix t o include infinite series
books on the subject will be made. Some of the recommended in matrices can now be put forth. Let there be a sequence, Ao,
textbooks in English are those of Perlis (22), Ferrar (6),and AI, A2,. . ., of matrices t h a t are all of t h e same order, and let
the first chapter of Courant and Hilbert ( 4 ) , which are primarily P m
devoted t o the basic principles; those of Michal (18),Ferrar S, = Ai. Then the infinite series, Ai, is said t o be con-
0 0
(?'), and especially Dwyer (6) and Frazer, Duncan, and Collar
vergent if every element in S, converges t o a bounded limit as p
(9),which contain material of considerable practical interest ;
approaches infinity.
and t h a t of MacDuffee (16), in which most of the finer points
Of particular interest in applications are the infinite power
are examined. m
(211 (212 (213 . . @;) It can easily be seen from the above definition t h a t
(221 (222 (223 . .
A = [ u c ~=] . e*e-A =
(Un1 an2 an3 . 1 ann and
A square matrix is of order n if it has n rows and n columns. eAeB -. eA+B
for an arbitrary A of the same order as I . If A is a square matrix of order n,the determinantal equation
T h e matrix, 0,all the elements of which are equal to zero, is
/ X I - AI = 0 (5)
called the null matrix, since
is called the characteristic (or secular) equation of A. X is a
A 0 = OA = 0 scalar, and b y expanding the above determinant it can be re-
again for a n arbitrary A. duced t o a polynominal in X o€ degree n. Thus,
++
The inverse matrix, A-1, of a matrix A has the property t h a t 1XI- A l ~ P ( X ) ~ X " + p i X " - ' + p z X n - 2 + ,..
AA-1 = A-'A = I pn-1X pa 0 (6)
where the p ' s are scalar coefficients. The algebraic equation,
A necessary and sufficient condition for the existence of A-' is ( k = 1,2,. . which are called t h e
Equation 6, has n roots,
t h a t the determinant of A, denoted b y 1 A I or det A, be different
characteristic, or latent, roots of the square matrix, A. The
from zero. I n t h a t case
Cagley-Hamilton theorem states t h a t P( A ) is the null matrix,
or, in other words, t h a t
P(A) E A" + PIA"..' + pzAn-2 + . . . + pn-jA + p n I = 0 (7)
A matrix, A, for which I A 1 = 0, is said to be singular. If A' =
where t h e p l ' s are identical with the corresponding coefficients
A, A is said t o be symmetric. If A' = A-I, A is said to be or-
of the characteristic equation. ilnother formulation of the
thogonal.
The matrix, B = C-'AC, where C is any nonsingular matrix, above theorem is t h a t any square matrix, A, satisfies its own
characteristic equation.
is said t o be obtained from A by a similarity transformation.
Sylvester Formula. The Sylvester formula, a n exceedingly
Matrix Polynomials. The finite sum
useful expression, can readily be derived from the Cayley-Hamil-
wA" + oriA"-l + . . . 4- or,-~A + ornI ton theorem and the Lagrange interpolation formula. It states
that, if the characteristic roots of a square matrix, A, of order in the vapor leaving the nth plate, and z n ( i j , the mole fraction of
n are all distinct, and F( A ) is any matrix polynomial, or any con- the ith component in the liquid leaving the n t h plate, is given by
vergent power series, in A , Ydi) = Kn(i)zn(i) (12)
where the function, K n ( i ) has
, to be estimated a priori. The state
of the rectification column can then be described b y the following
equation, which can be obtained b y combining a material balance
around a n y plate in t h e tower with Equation 12:
Thus
Ln-izn-~(i)- [Ln + V n K n ( i ) l ~ n (+i )
(9) Vn+1K,+1(i)sn+.(i)= 0 (13)
for 0 5 n iN , b u t n # f, where f denotes the feed tray. I n
Equation 13, L, and V, denote the total moles of liquid and
from which it follows t h a t e A , is defined b y the infinite series,
vapor, respectively, leaving plate n. If z~(i) denotes the mole
exists for all square matrices, A, since ex exists for all scalars,
fraction of the i t h component in the feed, when n = f,
A. A similar argument can be used also for any power series in
A, and it follows t h a t a power series in A , F ( A j , converges only Lj-l~j-,(i) - [ L , + V / K / ( i ) l ~ t (+i )
as long as all the characteristic roots of A lie within the radius Vj+iKj+i(ihj+i(i) + Z F ( ~ )= 0 (14)
of convergence of F( A). Equations 13 and 14 can next be combined into a single matrix
The usefulness of the Sylvester theorem cannot be over-
equation b y defining t h e vectors
estimated. Fractional powers of a matrix can now be defined
without difficulty, and it can be stated that, under suitable con-
ditions, a n y analytic function of a square matrix, A , of order n
can be expressed as a polynominal in A of degree n - 1. -4s
a matter of fact, many functions of a matrix A can be formu-
lated satisfactorily only by means of the Sylvester formula.
When multiplicities in the latent roots occur,
the Sylvester formula has to be modified. Thus
if, for example, a set of roots, X1, b, . . . , A,, are ~ ( iE)
0 0 LZ
equal, Sylvester's formula can be stated in its con-
fluent form ( 9 )
where D is the nusnber of moles of overhead product. The matrix
equation, which takes the place of Equations 13 and 14, is
where &(A) = ( A -A, + 1) (X - A, + 5). . .(X - A,) and the sum A(i)x(ij + z(ij = 0 (15)
is taken over all the distinct roots of A . It is worth remembering and its solution is
t h a t the following identity exists
~ ( i= )-A-'(i)Z(ij (16)
adj ( A i l - A ) = (-l)n-l ?r (Ail - A) (11)
i #j I n practice it is best t o obtain zo(i)from Equation 16 and then
to calculate z,(i) for n > 0 b y iterating Equation 13. Example
Differentiation and Integration of Matrices. If u ( t ) is a square I11 will demonstrate the method of evaluating the determinant
matrix or a vector, the elements of which are fractions of a param- and the appropriate cofactor of a matrix of same form as A .
eter, t, the derivative of u ( t ) with respect to t is defined as the Suppose that, as a result of the preceding calculations, the
matrix obtained by differentiating each element of u(t). estimate in the function, K n ( i ) ,has t o be revised slightly so t h a t
the new matrix thus obtained is
thus
A i = A + B
where A is identical with t h a t defined previously and B is an-
other square matrix, the elements of which are much smaller in
similarly magnitude than t h e corresponding elements of A .
(A +
B ) - l = A-I(Z +
BA-l)-' =
A-'[I - BA-' +
(BA-')Z - . . . j
or, approximately,
I n addition, there exists a definition for the derivative of a
matrix with respect t o a matrix, but this concept will not be ~ i ( i=) - A i - ' ~ ( i ) ( I - A-'Bjx(i) (17)
used in this discussion. where ~ ( iis )given by Equation 16. This method appears at-
tractive, in t h a t once a good approximation to the answer has
Example 1. Inversion of a matrix by iteration been arrived at-that is, once A and A-I have been determined-
an iteration can be set up b y varying Bin Equation 17.
It often happens t h a t the investigator is able to discover a n
approximate solution t o a given problem, and then is faced with
Example II. Analytical solution for
t h e task of devising a simple iterative procedure which will equations of multicomponent rectifscation
converge and which, moreover, will improve the answer rapidly.
Such a n example is presented in the design of fractionating Some years ago Underwood ($6) and Murdoch (60),among
columns, if use is made of the Geddes and Thiele method (25). others, arrived a t an analytic solution for the equations of multi-
It is assumed t h a t the column has a total of N plates, including component rectification. Their solution can be derived b y the
the reboiler, and a condenser, the plate 0, and that, moreover, use of matrix algebra in a n elegant way.
t h e relation between yn(i), the mole fraction of the i t h component If constant molal overflow is assumed, constant relative
i=l
i= 1
and the roots of A are identical with those of
where R is the reflux ratio, x o ( i ) denotes the composition of the
overhead product, m the total number of components in the mix-
ture and p ( i ) 2 [ a ( i ) ]-I, where a ( i ) is the relative volatility of
t h e i t h component. Equation 18 can now be linearized b y a
change in the dependent variable-that is, by defining a new It is, of course, very easy to establish t h a t the roots of Equation
function, X n ( i ) ,as 26 are all positive and distinct (20). Moreover, it can be shown
that
j= 1
where
and
Equation 28 can be shown t o be identical to the formulas derived
by Underwood (26) and Murdoch ( 2 0 ) . It is also apparent that,
in general, if n is small, Equation 24 is t h e more convenient form
of the solution, while if n is large, Equation 27 is preferable.
component in the liquid and the vapor stream leaving the n t h which is but a product of c square matrices. Similarly,
plate; L,(t) and V,(t) are, respectively, the moles of liquid and
vapor leaving the nth plate; while h, is the liquid holdup on the
nth plate. In constructing the above model, any time lags that
[M;(A)]-I = &1 [I- A(t,-,) :] (38)
as2 -aar
0
a23
0
0a4
a :) (39)
A2 = -CLBZAI- a l u ~ z i=
1
- [LiLz + G1Lz + GIG21 >0 where AlA(A0) = A(Ao + dA) - A@,), etc.
hihz
The coefficients, b,j, can be computed once and for all, as follows:
where G, = V,K,
and, in general,
where
A=(-;
a + B -a
a-6+ p
0
o rfo
-01 -01
0
0
i ) Let the chemical reaction
(
and
where Akr 1 < k 6 N, are the characteristic roots of A . (kl + ;) -k:
x A(X) A'A(X) A2A(X) A'A(h) A4A(X) A5A(X) ASA(h) The more general problem, for which the con-
0 0.29430 -0.27446 0.23418 -0.15676 -0.01452 1.73248 11.25001 centrations in all t h e tanks are not all equal t o
0.5 0,01984 -0.04028 0.07742 -0.17128 1.71796 12.98249 zero a t t = 0, can easily be reduced t o the one under
1.0 -0.02044 0.03714 -0.09386 1.54668 14.70045
1.0 0,01670 -0,05672 1.45282 16.24713 consideration b y changing t h e dependent varia-
2.0 -0.04002 1.39610 17.69995
2.5 1.35608 19,09605 bles.
3.0 20.45213 For t h e second tank reactor
where Xk, 1 k 6 63, are the characteristic roots of B . Of course, wise process can be described by the matrix Equation 36, with
Equation 59 holds no matter what t h e order of the matrix, B; the matrix, A ( t ) , given by
. Equation
- 39. There is one difference,
hence any number of consecutive reactions may be considered however, in t h a t the elements of A ( t ) are not numbers any longer
As an example, consider the following: but are themselves square matrices of order m. Thus, the ele-
ments, A l j , of A are
e = 1, kl = 0.20, k: = 0.05, k z = 0.10, k: = 0.05
C i ( 0 ) = 1, C2(0) = ($0) = 0
Then
1.20 -0.05
B = (-0.20 1.15 -0.05
0 -0.10 1.05
O )
T h e characteristic equation for B is
/XI - BI Xa - 3.4000X' + 3.8325X - 1.4325 = 0 \.
which has a s roots
XI = 1.2866, XZ = 1.1133, and X 3 = 1.0000
and, therefore, from Equation 60,
and
1 - (1.2866t)i
n - 1 ___ ](-~ : ~ ~ ~ $ )
e-1.2868t
c
0 i! 0.4027
+ (0.8982)" { 1-
n-1 (1.1133t);} ( ::::::)
e-l.1133<
C 0 T - 1.0186 A ( t ) is a partitioned matrix.
+
{ I + e-t 2 '.
0
0.0769
:}(0.3080)
0.6159
From this point the treatment parallels exactly the one given
in the general solution to Equation 32, and, if A ( t ) is independent
of t , the answer is given by Equations 41 and 42. However, the
This one formula gives t h e concentration in the n t h reactor matrix, A , is now of order m ( N +
I ) , something which adds,
of all three components. Additional reactions would merely naturally, t o the complexity of the problem. I n addition, since
increase the order of the vectors. A now does not possess the simplifying features of Equation 39,
and since the evaluation of det A would, as a consequence, pre-
sent some rather formidable computational difficulties, the
Example V. More complicated
method of Hicks, described in t,he previous example, for reducing
systems-partitioned matrices
the characteristic equation of A t o a polynomial, cannot, in gen-
I n examining the transient behavior of stagewise processes, eral, be recommended.
the simplifying assumption has been made t h a t the characteristics B s a matter of fact, the problem of how t o calculate the
of such phenomena could be described b y m independent equa- latent roots of a matrix is a very important one, and has therefore
tions, Equation 32, where A ( t ) is a square matrix of the form of received some attention. There are two main directions along
Equation 39, and where m denotes the total number of compo- which one can proceed. One can attempt to reduce the charac-
nents in the system. However, if the changes t h a t occur in the teristic equation t o a polynomial or t r y to obtain the roots of a
process, following a change in the operating variables, are not matrix by some iterative procedure.
small, Equation 30 does not usually remain valid, and a better Probably the more promising method for reducing the char-
approximation is obtained if i t is assumed t h a t acteristic equation t o a polynomial is t h a t developed b y Frame
m. ( 5 ) . It has the disadvantage, however, like all the methods of a
similar nature, t h a t the accuracy t o which all the calculations
y ( i , n, 1 ) = K ( i , k , n ) z ( k , n, t )
k=l have t o be performed is usually very extensive. One could also
refer t o the article by Fettis ( 8 ) and t o the very interesting and
It is not difficult t o see, then, t h a t the state of the system under complete paper b y Wayland (68). On the other hand, the itera-
consideration, the distillation column in particular, will not be tive methods ( 2 , 6, 9, 19, 1 4 ) for calculating characteristic roots
described any longer by m, ( N +
1) dimensional vectors, xi, invariably result in rather good estimates for the larger roots
but by a single m(N +
1)dimensional vector, x-that is, a column with progressively poorer estimates for the smaller roots.
matrix with m(N + 1) elements. T h e investigation of such a However, from examination of Equations 41 and 42 i t is ap-
system, however, becomes simpler if use is made of partitioned parent t h a t the Sylvester formula becomes useful for large values
matrices. of the time, t , and that, consequently, one is primarily interested
Again, let i be the component-designating symbol, 1 6 i m, 6 in the smaller roots of A . This particular point is open t o debate,
and n be the plate designating symbol, 0 6 n 6 N and, as before, but it appears that, if a sufficiently powerful computer is available,
consider, for the sake of illustration, the transient features of it is best t o use, in general, Equation 41 in preference t o Equation
rectification columns. Define the vectors 42, and then t o estimate the smallest root, Amin., of A from the
behavior of x ( 1 ) for large values of t , since as t + m ,
x(t) - x( m ) - eXmlnJc
where c is a column vector.