LINEAR OPERATORS BANACH CENTER PUBLICATIONS, VOLUME 38 INSTITUTE OF MATHEMATICS POLISH ACADEMY OF SCIENCES WARSZAWA 1997 LOG-MAJORIZATIONS AND NORM INEQUALITIES FOR EXPONENTIAL OPERATORS FUMIO HIAI Department of Mathematics, Ibaraki University Mito, Ibaraki 310, Japan E-mail: [email protected] Abstract. Concise but self-contained reviews are given on theories of majorization and symmetrically normed ideals, including the proofs of the Lidskii–Wielandt and the Gelfand– Naimark theorems. Based on these reviews, we discuss logarithmic majorizations and norm inequalities of Golden–Thompson type and its complementary type for exponential operators on a Hilbert space. Furthermore, we obtain norm convergences for the exponential product formula as well as for that involving operator means. 1. Introduction. Since the notion of majorization was introduced by Hardy, Littlewood, and Pólya, it has been discussed by many mathematicians in various circumstances with various applications. First let us recall the notion of (weak) majorization in the simplest case of real vectors. For real vectors a = (a1 , . . . , an ) and b = (b1 , . . . , bn ), the weak Pk Pk majorization a ≺w b means that i=1 a[i] ≤ i=1 b[i] holds for 1 ≤ k ≤ n, where (a[1] , . . . , a[n] ) is the decreasing rearrangement of a. The majorization a ≺ b means that Pn Pn a ≺w b and i=1 ai = i=1 bi . When a and b are nonnegative, the multiplicative or Q logarithmic (weak) majorization can be also defined by taking product in place of sum P in the above, which we referred to in [6] as the log-majorization. Several (weak) majorizations are known for the eigenvalues and the singular values of matrices and compact operators, as was fully clarified in Marshall and Olkin’s monograph [62] and also in [4, 61]. These majorizations give rise to powerful devices in deriving various norm inequalities (in particular, perturbation norm inequalities) as well as trace or determinant inequalities for matrices or operators (see e.g. [15]). Among other things, the Lidskii–Wielandt majorization theorem is especially famous and important. A crucial reason why the (weak) majorization is useful in operator norm inequalities is the following fact: For bounded Hilbert space operators A and B, the weak majorization µ(A) ≺w 1991 Mathematics Subject Classification: Primary 47A30, 47B10; Secondary 47A63, 15A42. Research supported by Grant-in-Aid for Scientific Research 06221207. The paper is in final form and no version of it will be published elsewhere. [119] 120 F. HIAI µ(B) holds if and only if kAk ≤ kBk for any symmetric (or unitarily invariant) norm k · k, where µ(A) = (µ1 (A), µ2 (A), . . .) are the (generalized) singular values of A with multiplicities. General theory of symmetric norms and symmetrically normed ideals was extensively developed in Gohberg and Krein’s monumental monograph [32] (also [77, 79]). The majorization technique sometimes plays an important role in the study of symmetrically normed ideals. For instance, the Hölder type inequality for symmetric norms is a simple consequence of Horn’s majorization of multiplicative type. The celebrated Golden–Thompson trace inequality, independently proved by Golden [33], Symanzik [83], and Thompson [84], is tr eH+K ≤ tr eH eK for self-adjoint operators (particularly for Hermitian matrices) H and K. So far, there have been many extended or related inequalities around the Golden–Thompson inequality. For example, when H and K are Hermitian matrices, this inequality was extended in [58, 85] to the weak majorization µ(eH+K ) ≺w µ(eH/2 eK eH/2 ) or equivalently keH+K k ≤ keH/2 eK eH/2 k for any unitarily invariant norm, and in [81] to tr eH+K ≤ tr(eH/n eK/n )n for all n ∈ N. Also, the Araki–Lieb–Thirring inequality [12] (also [88]) is regarded as a strengthened Golden–Thompson inequality and is reformulated in terms of log-majorization (see [6, 40]). On the other hand, a complementary counterpart of the Golden–Thompson trace inequality was discovered in [44] in the course of study on lower and upper bounds for the relative entropy, and was strengthened in [6] to the form of log-majorization by using the technique of antisymmetric tensor powers. Restricted to the matrix case, the above log-majorizations of Golden–Thompson type and its complementary type yield the following norm inequalities: If H and K are Hermitian matrices and 0 < α < 1, then k(erH/(1−α) #α erK/α )1/r k ≤ keH+K k ≤ k(erH/2 erK erH/2 )1/r k, r > 0, for any unitarily invariant normk·k, where #α denotes the α-power mean, i.e. the operator mean corresponding to the operator monotone functionxα . Moreover, the above left-hand (resp. right-hand) side increases (resp. decreases) to keH+K k as r ↓ 0. The Golden–Thompson trace inequality was originally motivated by quantum statistical mechanics. When a Hamitonian K is given as a self-adjoint operator (assumed here to be lower-bounded) on a Hilbert space, the partition function tr e−βK and the free energy log tr e−βK where β is an inverse temperature constant are basically important from the quantum statistical mechanical viewpoint. When K receives a lower-bounded perturbation by H, physicists sometimes approximate tr e−β(H+K) by tr(e−βH/n e−βK/n )n via the Trotter product formula. Although the convergence tr(e−βH/n e−βK/n )n → tr e−β(H+K) might have been strongly believed by physicists, there was no rigorous proof up to [40]. Indeed, it was more strongly proved in [40] that if H and K are lower-bounded self-adjoint operators such that e−K is of trace class and H + K is essentially self-adjoint, then the following trace norm convergence holds: lim k(e−rH/2 e−rK e−rH/2 )1/r − e−(H+K) k1 = 0. r↓0 (Another k · k1 -convergence under a rather strong assumption was given in [65]. Other recent developments on the Trotter–Kato product formula in the operator norm and trace norm are found in [37, 46, 47, 75].) LOG-MAJORIZATIONS AND NORM INEQUALITIES 121 The present paper enjoys both aspects of a review paper and of a research paper. It is organized in five sections which are divided into several subsections. Our main aim is to present log-majorizations and norm inequalities for infinite-dimensional exponential operators. For this sake, in Sections 1 and 2 we concisely review the majorization theory and theory of symmetrically normed ideals. Although several distinguished monographs, as cited above, are available, we intend to make the exposition completely self-contained, so that Sections 1 and 2 may independently serve as a concise text on these subjects. The main part of Section 1 is the proofs of the Lidskii–Wielandt and the Gelfand–Naimark theorems for (generalized) singular values of matrices (operators). Our proofs are rather new and based on the real interpolation method (or the K-functional method). In Section 2 we stress the majorization technique in the theory of symmetrically normed ideals. Section 3 is taken from [40] and is not new, but we sometimes give more detailed accounts for the convenience of the reader. We investigate log-majorizations and norm inequalities of Golden–Thompson type for exponential operators. For instance, it is shown that if H and K are lower-bounded self-adjoint operators, then ke−(H +̂K) k ≤ k(e−rH/2 e−rK e−rH/2 )1/r k, r > 0, for any symmetric norm k · k, where H +̂K is the form sum of H and K. Preliminaries on antisymmetric tensor powers and the Trotter–Kato exponential product formula are included, which are quite beneficial in proving our results. Also in Subsection 3.5 we discuss the trace norm convergence of exponential product formula together with some technical preliminaries in the framework of von Neumann algebras. Section 4 is considered as a complementary counterpart of Section 3. Extending the matrix case [6, 44] (also [71]), we investigate log-majorizations and norm inequalities involving operator means (in particular, the α-power mean), which are opposite to those in Section 3 and considered as complementary Golden–Thompson type. If H is bounded self-adjoint and K is lower-bounded self-adjoint, then the following is proved for any symmetric norm k · k and 0 < α < 1: k(e−rH/(1−α) #α e−rK/α )1/r k ≤ ke−(H+K) k, r > 0. The most important ingredient in the extension from the matrix case to the infinitedimensional case is an exponential product formula for operator means established in Subsection 4.3. Finally in Section 5 we obtain further log-majorization results, for example, the log-majorization equivalent to the Furuta inequality [28], generalized logmajorizations of Horn’s type and of Golden–Thompson type, etc. Some determinant inequalities are also included. Most results of Section 4 and many of Section 5 are new. Although we confine ourselves to the setting of Hilbert space operators (in other words, the setup of B(H)) in this paper, it should be mentioned that many subjects treated here extend to the von Neumann algebra setup. In fact, based on the noncommutative integration theory ([20, 66, 78]), we can discuss the majorization theory in semifinite von Neumann algebras (see e.g. [21, 38, 39, 41, 42, 43, 48, 49, 64]) by using the notion of generalized s-numbers introduced in [25, 26] for measurable operators. Noncommutative Banach function spaces (i.e. generalized symmetrically normed ideals) associated with semifinite von Neumann algebras have been discussed in [21, 22, 53, 89, 90] for instance. 122 F. HIAI In particular, theory of noncommutative Lp -spaces over arbitrary von Neumann algebras was developed (e.g. the Haagerup Lp -spaces [35]). Kosaki [55] extended the Araki–Lieb– Thirring inequality (i.e. a log-majorization result in Subsection 3.2) to the von Neumann algebra case. Furthermore, many authors have worked on the Golden–Thompson inequality in von Neumann algebras in several ways ([8, 25, 45, 70, 76]). However, at present, the von Neumann algebra versions for the norm convergence in Subsection 3.5 and for the study of complementary Golden–Thompson type in Section 4 are not yet investigated. When we want to extend the study of Section 4 to the von Neumann algebra setup, the antisymmetric tensor technique is no longer available, so that we would have to exploit a new method. The contents of the paper are as follows: 1. Majorization and log-majorization 1.1. Majorization for vectors 1.2. Generalized singular values 1.3. Majorization for matrices: Lidskii–Wielandt and Gelfand–Naimark theorems 1.4. Majorization for operators 2. Symmetric norms and symmetrically normed ideals 2.1. Symmetric gauge functions and symmetric norms 2.2. Symmetrically normed ideals 2.3. Further properties of symmetric norms 2.4. Ando’s extension of Birman–Koplienko–Solomyak majorization result 3. Inequalities of Golden–Thompson type 3.1. Antisymmetric tensor powers 3.2. Araki’s log-majorization result 3.3. Trotter–Kato exponential product formula 3.4. Log-majorization and norm inequalities of Golden–Thompson type 3.5. Trace norm convergence of exponential product formula 4. Inequalities of complementary Golden–Thompson type 4.1. Preliminaries on operator means 4.2. Log-majorization for power operator means 4.3. Exponential product formula for operator means 4.4. Norm inequalities of complementary Golden–Thompson type 4.5. Norm convergence of exponential product formula for operator means 5. Miscellaneous results 5.1. Interplay between log-majorization and Furuta inequality 5.2. Other log-majorizations 5.3. Determinant inequalities 1. Majorization and log-majorization. The purpose of this section is to give a concise but self-contained review on the majorization theory for (generalized) singular values of matrices and operators. Complete expositions on the subject are found in [4, 62]. Also see [5] for recent developments. 1.1. Majorization for vectors. Let us start with the majorization for real vectors, which was introduced by Hardy, Littlewood, and Pólya. For two vectors a = (a1 , . . . , an ) 123 LOG-MAJORIZATIONS AND NORM INEQUALITIES and b = (b1 , . . . , bn ) in Rn , the weak majorization a ≺w b means that k X (1.1) i=1 a[i] ≤ k X b[i] , i=1 1 ≤ k ≤ n, where (a[1] , . . . , a[n] ) is the decreasing rearrangement of a, i.e. a[1] ≥ . . . ≥ a[n] are the components of a in decreasing order. The majorization a ≺ b means that a ≺w b and equality holds for k = n in (1.1). The following characterizations of majorization and weak majorization are fundamental. Proposition 1.1. The following conditions for a, b ∈ Rn are equivalent : (i) a ≺ b; Pn Pn (ii) i=1 |ai − r| ≤ i=1 |bi − r| for all r ∈ R; Pn Pn (iii) i=1 f (ai ) ≤ i=1 f (bi ) for any convex function f on an interval containing all ai , b i ; (iv) a is a convex combination of coordinate permutations of b; (v) a = Db for some doubly stochastic n × n matrix D, i.e. D = [dij ] with dij ≥ 0, Pn Pn i=1 dij = 1 for 1 ≤ j ≤ n. j=1 dij = 1 for 1 ≤ i ≤ n, and P r o o f. (i)⇒(iv). We show that there exist a finite number of matrices D1 , . . . , DN of the form λI + (1 − λ)Π where 0 ≤ λ ≤ 1 and Π is a permutation matrix interchanging two coordinates only such that a = DN . . . D1 b. Then (iv) follows because DN . . . D1 becomes a convex combination of permutation matrices. We may assume that a1 ≥ . . . ≥ an and b1 ≥ . . . ≥ bn . Suppose a 6= b and choose the largest j such that aj < bj . Then there exists k with k > j such that ak > bk . Choose the smallest such k. Let 1 − λ1 = min{bj − aj , ak − bk }/(bj − bk ) and Π1 be the permutation matrix interchanging the jth and kth coordinates. Then 0 < λ1 < 1 because bj > aj ≥ ak > bk . Define D1 = λ1 I + (1 − λ1 )Π1 and b(1) = D1 b. Now it is easy to check that a ≺ b(1) ≺ b and (1) (1) b1 ≥ . . . ≥ bn . Moreover the jth or kth coordinates of a and b(1) are equal. When a 6= b(1) , we can apply the above argument to a and b(1) . Repeating finite times we reach the conclusion. (iv)⇒(v) is trivial from the fact that any convex combination of permutation matrices is doubly stochastic. (v)⇒(ii). For every r ∈ R we get n X i=1 |ai − r| = n n n n X X X X |bj − r|. dij |bj − r| = dij (bj − r) ≤ i=1 j=1 i,j=1 j=1 (ii)⇒(i). Taking large r and small r in the inequality of (ii) we have Noting that |x| + x = 2x+ for x ∈ R where x+ = max{x, 0}, we get (1.2) n n X X (bi − r)+ , (ai − r)+ ≤ i=1 i=1 Pn i=1 ai = Pn i=1 bi . r ∈ R. Now prove that (1.2) implies a ≺w b. When b[k] ≥ r ≥ b[k+1] , Pk i=1 a[i] ≤ Pk i−1 b[i] follows 124 F. HIAI because n X i=1 (ai − r)+ ≥ k X i=1 (a[i] − r)+ ≥ k X i=1 a[i] − kr, k n X X b[i] − kr. (bi − r)+ = i=1 i=1 PN P (iv)⇒(iii). Suppose that ai = k=1 λk bπk (i) , 1 ≤ i ≤ n, where λk > 0, N k=1 λk = 1, and πk are permutations on {1, . . . , n}. Then the convexity of f implies that n X i=1 f (ai ) ≤ n X N X λk f (bπk (i) ) = i=1 k=1 n X f (bi ). i=1 (iii)⇒(v) is trivial because f (x) = |x − r| is convex. Note that (v)⇒(iv) is seen directly from the well-known theorem of Birkhoff [17] saying that every doubly stochastic matrix is a convex combination of permutation matrices. Proposition 1.2. The following conditions (i)–(iv) for a, b ∈ Rn are equivalent : (i) a ≺w b; (ii) there exists c ∈ Rn such that a ≤ c ≺ b, where a ≤ c means that ai ≤ ci , 1 ≤ i ≤ n; Pn Pn (iii) i=1 (ai − r)+ ≤ i=1 (bi − r)+ for all r ∈ R; Pn Pn (iv) i=1 f (ai ) ≤ i=1 f (bi ) for any increasing convex function f on an interval containing all ai , bi . Moreover , if a, b ≥ 0, then the above conditions are equivalent to the following: (v) a = Sb for some doubly substochastic n × n matrix S, i.e. S = [sij ] with sij ≥ 0, Pn j=1 sij ≤ 1 for 1 ≤ i ≤ n, and i=1 sij ≤ 1 for 1 ≤ j ≤ n. Pn P r o o f. (i)⇒(ii). By induction on n. We may assume that a1 ≥ . . . ≥ an and b1 ≥ Pk Pk . . . ≥ bn . Let α = min1≤k≤n ( i=1 bi − i=1 ai ) and define ã = (a1 +α, a2 , . . . , an ). Then Pk Pk a ≤ ã ≺w b and i=1 ãi = i=1 bi for some 1 ≤ k ≤ n. When k = n, a ≤ ã ≺ b. When k < n, we get (ã1 , . . . , ãk ) ≺ (b1 , . . . , bk ) and (ãk+1 , . . . , ãn ) ≺w (bk+1 , . . . , bn ). Hence the induction assumption implies that (ãk+1 , . . . , ãn ) ≤ (ck+1 , . . . , cn ) ≺ (bk+1 , . . . , bn ) for some (ck+1 , . . . , cn ) ∈ Rn−k . Then a ≤ (ã1 , . . . , ãk , ck+1 , . . . , cn ) ≺ b is immediate from ãk ≥ bk ≥ bk+1 ≥ ck+1 . (ii)⇒(iv). Let a ≤ c ≺ b. If f is increasing and convex on an interval [α, β] containing ai , bi , then ci ∈ [α, β] and n X i=1 f (ai ) ≤ n X i=1 f (ci ) ≤ n X f (bi ) i=1 by Proposition 1.1. (iv)⇒(iii) is trivial and (iii)⇒(i) was already shown in the proof (ii)⇒(i) of Proposition 1.1. Now assume a, b ≥ 0 and prove (ii) ⇔ (v). If a ≤ c ≺ b, then we have, by Proposition 1.1, c = Db for some doubly stochastic matrix D and ai = αi ci for some 0 ≤ αi ≤ 1. LOG-MAJORIZATIONS AND NORM INEQUALITIES 125 So a = Diag(α1 , . . . , αn )Db and Diag(α1 , . . . , αn )D is a doubly substochastic matrix. Conversely if a = Sb for a doubly substochastic matrix S, then a doubly stochastic matrix D exists so that S ≤ D entrywise and hence a ≤ Db ≺ b. Let a, b ∈ Rn and a, b ≥ 0. We define the weak log-majorization a ≺w(log) b when (1.3) k Y i=1 a[i] ≤ k Y b[i] , 1 ≤ k ≤ n, i=1 and the log-majorization a ≺(log) b when a ≺w(log) b and equality holds for k = n in (1.3). It is obvious that if a and b are strictly positive, then a ≺(log) b (resp. a ≺w(log) b) if and only if log a ≺ log b (resp. log a ≺w log b), where log a = (log a1 , . . . , log an ). The notions of weak majorization and weak log-majorization are similarly defined for positive bounded infinite sequences. But, to avoid discussing the decreasing rearrangement of an infinite sequence (this is considered as a special case of generalized singular values of a bounded operator introduced later in this section), we here confine ourselves to infinite sequences a = (a1 , a2 , . . .) and b = (b1 , b2 , . . .) such that a1 ≥ a2 ≥ . . . ≥ 0 and P P b1 ≥ b2 ≥ . . . ≥ 0. For such a, b we define a ≺w b and a ≺w(log) b when ki=1 ai ≤ ki=1 bi Qk Qk and i=1 ai ≤ i=1 bi , respectively, for all k ∈ N. Proposition 1.3. Let a, b ∈ Rn with a, b ≥ 0 and suppose a ≺w(log) b. If f is a continuous increasing function on [0, ∞) such that f (ex ) is convex , then f (a) ≺w f (b). In particular , a ≺w(log) b implies a ≺w b. Moreover , the same assertions hold also for infinite sequences a, b with a1 ≥ a2 ≥ . . . ≥ 0 and b1 ≥ b2 ≥ . . . ≥ 0, whenever f (0) ≥ 0 is additionally assumed. P r o o f. First assume that a, b ∈ Rn are strictly positive and a ≺w(log) b, so that log a ≺w log b. Since g◦h is convex when g and h are convex with g increasing, the function (f (ex ) − r)+ is increasing and convex for any r ∈ R. Hence we get, by Proposition 1.2, n X i=1 (f (ai ) − r)+ ≤ n X i=1 (f (bi ) − r)+ , which implies f (a) ≺w f (b) by Proposition 1.2 again. When a, b ≥ 0 and a ≺w(log) b, we can choose a(m) , b(m) > 0 such that a(m) ≺w(log) b(m) , a(m) → a, and b(m) → b. Since f (a(m) ) ≺w f (b(m) ) and f is continuous, we obtain f (a) ≺w f (b). The case of infinite sequences is immediate from the above case. In fact, a ≺w(log) b implies that (a1 , . . . , an ) ≺w(log) (b1 , . . . , bn ) for every n ∈ N. Hence (f (a1 ), . . . , f (an )) ≺w (f (b1 ), . . . , f (bn )), n ∈ N, so that f (a) ≺w f (b). 1.2. Generalized singular values. In the sequel of this section, we discuss the majorization theory for singular values of matrices and operators. Our goal is to prove the Lidskii–Wielandt and the Gelfand–Naimark theorems for generalized singular values of bounded operators. Indeed, these theorems were proved by using the real interpolation method in the setting of von Neumann algebras in [41, 64] (also [21]). In Subsection 1.3 we first prove the theorems for matrices by using this new method. After that, in Subsection 1.4 we extend them from matrices to operators in a rather simple way. 126 F. HIAI For any n × n matrix A let µ(A) = (µ1 (A), . . . , µn (A)) be the vector of singular values of A in decreasing order, i.e. µ1 (A) ≥ . . . ≥ µn (A) are the eigenvalues of |A| = (A∗ A)1/2 with multiplicities. When A is Hermitian, the vector of eigenvalues of A in decreasing order is denoted by λ(A) = (λ1 (A), . . . , λn (A)). The notion of singular values is generalized to infinite-dimensional operators. Let H be a Hilbert space (always assumed to be separable) and B(H) the algebra of all bounded operators on H. For any A ∈ B(H) we define the generalized singular values µ1 (A) ≥ µ2 (A) ≥ . . . of A by T∞ µn (A) = inf{λ ≥ 0 : rank(I − E|A| (λ)) < n}, n ∈ N, where |A| = 0 λdE|A| (λ) is the spectral decomposition of |A| so that I − E|A| (λ) is the spectral projection of |A| corresponding to the interval (λ, ∞). The above definition of µn (A) is a special case of the generalized s-numbers of measurable operators in the setting of von Neumann algebras introduced in [25, 26] (also [69]). If A is a compact operator (in particular, a matrix), then µn (A) are the usual singular values of A in decreasing order with multiplicities. Let µ∞ (A) = limn→∞ µn (A). Then it is easy to see that µ∞ (A) = kAke , the essential norm of A; namely µ∞ (A) is equal to the largest α ∈ R such that E|A| (α+ε)−E|A| (α−ε) is of infinite rank for every ε > 0. Note that µ∞ (A) = 0 if and only if A is compact. Furthermore, if µn (A) > µ∞ (A), then µn (A) is an eigenvalue of |A| with finite multiplicity. The basic properties of µn (A) are summarized as follows. See [26] for the proof in the von Neumann algebra setting. Proposition 1.4. Let A, B, X, Y ∈ B(H) and n, m ∈ N. (1) Mini-max expression: (1.4) µn (A) = inf{kA(I − P )k∞ : P is a projection , rank P = n − 1}, where k ·k∞ denotes the operator norm and a projection means always an orthogonal one. Furthermore, if A ≥ 0 then (1.5) µn (A) = inf{ sup hAξ, ξi : M is a subspace of H, dim M = n − 1}. ξ∈M⊥ , kξk=1 (2) Approximation number expression: (1.6) µn (A) = inf{kA − Xk∞ : X ∈ B(H), rank X < n}. (3) µ1 (A) = kAk∞ . (4) µn (αA) = |α|µn (A) for α ∈ C. (5) µn (A) = µn (A∗ ). (6) If 0 ≤ A ≤ B then µn (A) ≤ µn (B). (7) µn (XAY ) ≤ kXk∞ kY k∞ µn (A). (8) µn+m−1 (A + B) ≤ µn (A) + µm (B). (9) µn+m−1 (AB) ≤ µn (A)µm (B). (10) |µn (A) − µn (B)| ≤ kA − Bk∞ . (11) µn (f (A)) = f (µn (A)) if A ≥ 0 and f is a continuous increasing function on [0, ∞) with f (0) ≥ 0. LOG-MAJORIZATIONS AND NORM INEQUALITIES 127 P r o o f. (1) Let αn be the right-hand side of (1.4). First note that this does not change when rank P = n − 1 in (1.4) is replaced by rank P < n. So, if rank(I − E|A| (λ)) < n, then αn ≤ kAE|A| (λ)k∞ = k |A|E|A| (λ)k∞ ≤ λ. Hence αn ≤ µn (A). Conversely, for any ε > 0 choose a projection P with rank P = n − 1 such that kA(I − P )k∞ < αn + ε. Suppose rank(I − E|A| (αn + ε)) ≥ n. Then there exists ξ ∈ H with kξk = 1 such that (I − E|A| (αn + ε))ξ = ξ but P ξ = 0. This implies that αn + ε ≤ k |A|ξk = kA(I − P )ξk < αn + ε, a contradiction. Hence rank(I − E|A| (αn + ε)) < n and µn (A) ≤ αn + ε, implying µn (A) ≤ αn . When A ≥ 0, since EA1/2 (λ) = EA (λ2 ), we get µn (A) = µn (A1/2 )2 . So (1.5) follows from (1.4), because the right-hand side of (1.5) is written as inf{kA1/2 (I − P )k2∞ : P is a projection, rank P = n − 1}. (2) Let βn be the right-hand side of (1.6). If rank(I − E|A| (λ)) < n, then rank(A(I − E|A| (λ))) < n and βn ≤ kAE|A| (λ)k∞ ≤ λ. Hence βn ≤ µn (A). Conversely, if rank X < n, then the support projection P of |X| has rank < n. Since X(I − P ) = 0, we get by (1.4) µn (A) ≤ kA(I − P )k∞ = k(A − X)(I − P )k∞ ≤ kA − Xk∞ , implying µn (A) ≤ βn . (3) is (1.4) for n = 1. (4) and (5) follow from (1.4) and (1.6), respectively. (6) is a consequence of (1.5). It is immediate from (1.4) that µn (XA) ≤ kXk∞ µn (A). Also µn (AY ) = µn (Y ∗ A∗ ) ≤ kY k∞ µn (A) by (5). Hence (7) holds. Next we show (8)–(10). By (1.6), for every ε > 0, there exist X, Y ∈ B(H) with rank X < n, rank Y < m such that kA − Xk∞ < µn (A) + ε and kB − Y k∞ < µm (B) + ε. Since rank(X + Y ) < n + m − 1, we have µn+m−1 (A + B) ≤ k(A + B) − (X + Y )k∞ < µn (A) + µm (B) + 2ε, implying (8). For Z = XB + (A − X)Y we get rank Z ≤ rank X + rank Y < n + m − 1, kAB − Zk∞ = k(A − X)(B − Y )k∞ < (µn (A) + ε)(µm (B) + ε). These imply (9). Letting m = 1 and replacing B by B − A in (8) we get µn (B) ≤ µn (A) + kB − Ak∞ , which shows (10). (11) Assume f (0) = 0 and let fε (t) = f (t)+εt for ε > 0. Then µn (fε (A)) = fε (µn (A)) is easily checked from Efε (A) (λ) = EA (fε−1 (λ)). Since by (10) |µn (fε (A)) − µn (f (A))| ≤ kfε (A) − f (A)k∞ → 0 as ε ↓ 0, we have µn (f (A)) = limε↓0 fε (µn (A)) = f (µn (A)). When α = f (0) > 0, let g(t) = f (t) − α. Then the assertion follows as µn (f (A)) = µn (g(A) + αI) = µn (g(A)) + α = g(µn (A)) + α = f (µn (A)). 128 F. HIAI 1.3. Majorization for matrices: Lidskii–Wielandt and Gelfand–Naimark theorems. The following majorization results are the celebrated Lidskii–Wielandt theorem for singular values of matrices as well as for eigenvalues of Hermitian matrices. Theorem 1.5. For any n × n matrices A and B, |µ(A) − µ(B)| ≺w µ(A − B), that is, k X j=1 |µij (A) − µij (B)| ≤ k X j=1 µj (A − B) for every 1 ≤ i1 < i2 < . . . < ik ≤ n. Theorem 1.6. For any Hermitian n × n matrices A and B, λ(A) − λ(B) ≺ λ(A − B), or equivalently (λi (A) + λn−i+1 (B)) ≺ λ(A + B). The following results due to Ky Fan are consequences of the above theorems. The direct proofs of these weakened majorizations are much easier than those of Theorems 1.5 and 1.6. Corollary 1.7. (1) For any matrices A and B, µ(A + B) ≺w µ(A) + µ(B). (2) For any Hermitian matrices A and B, λ(A + B) ≺ λ(A) + λ(B). The proof of Theorem 1.5 is based on the real interpolation method. We need several lemmas. In the following, tr X denotes the usual trace of an n × n matrix X and kXk1 Pn the trace norm of X, i.e. kXk1 = tr |X| = j=1 µj (X). Lemma 1.8. If A and B are Hermitian, then n X |λj (A) − λj (B)| ≤ kA − Bk1 . j=1 P r o o f. Take the Jordan decomposition A − B = (A − B)+ − (A − B)− and let C = A + (A − B)− = B + (A − B)+ . Then we have kA − Bk1 = tr(A − B)+ + tr(A − B)− = 2tr C − tr A − tr B n X {2λj (C) − λj (A) − λj (B)}. = j=1 Since A ≤ C and B ≤ C, it follows from Proposition 1.4(6) that λj (A) ≤ λj (C) and λj (B) ≤ λj (C). Hence |λj (A) − λj (B)| ≤ 2λj (C) − λj (A) − λj (B), which gives the conclusion. LOG-MAJORIZATIONS AND NORM INEQUALITIES 129 The following simple observation is due to Wielandt (see [61]). ∗ Lemma 1.9. For any n × n matrix X let X̂ = X0 X0 , a 2n × 2n Hermitian matrix. Then µj (X), 1 ≤ j ≤ n, λj (X̂) = −µ2n−j+1 (X), n + 1 ≤ j ≤ 2n. P r o o f. We get |X̂| = |X| 0 , 0 |X ∗ | 1 (|X̂| + X̂) = 2 1 X̂− = (|X̂| − X̂) = 2 X̂+ = 1 |X| X ∗ , 2 X |X ∗ | 1 |X| −X ∗ . 2 −X |X ∗ | Since λ(|X|) = λ(|X ∗ |) = µ(X), we have λ2j−1 (|X̂|) = λ2j (|X̂|) = µj (X), 1 ≤ j ≤ n. But since 1 1 0 X̂+ 0 0 −1 0 = X̂− , −1 it follows that λj (X̂+ ) = λj (X̂− ) = λ2j (|X̂|), 1 ≤ j ≤ n. These give the assertion. Lemma 1.10. For any A and B, n X j=1 |µj (A) − µj (B)| ≤ kA − Bk1 . ∗ and B̂ = B0 B0 . Then |A − B| 0 k − B̂k1 = tr = 2kA − Bk1 . 0 |(A − B)∗ | P r o o f. Let  = A∗ A 0 0 By Lemmas 1.8 and 1.9 we have k − B̂k1 ≥ 2n X j=1 |λj (Â) − λj (B̂)| = 2 n X j=1 |µj (A) − µj (B)|. The right-hand side of (1.7) below is known as the K-functional in the real interpolation theory. Lemma 1.11. For any A and 1 ≤ k ≤ n, (1.7) k X j=1 µj (A) = min{kXk1 + kkY k∞ : A = X + Y }. 130 F. HIAI P r o o f. For any decomposition A = X + Y , since µj (A) ≤ µj (X) + kY k∞ by Proposition 1.4(10), we get k X j=1 µj (A) ≤ k X j=1 µj (X) + kkY k∞ ≤ kXk1 + kkY k∞ . For the converse, take the polar decomposition A = V |A| and the spectral decompositon Pn |A| = j=1 µj (A)Pj with mutually orthogonal projections Pj of rank 1. Define X=V k X j=1 {µj (A) − µk (A)}Pj , n k o n X X µj (A)Pj . Pj + Y = V µk (A) j=1 j=k+1 Then X + Y = A and k X kXk1 = j=1 µj (A) − kµk (A), so that kXk1 + kkY k∞ = k X kY k∞ = µk (A), µj (A). j=1 P r o o f o f T h e o r e m 1.5. Fix k and choose, by Lemma 1.11, X and Y such that A − B = X + Y and k X µj (A − B) = kXk1 + kkY k∞ . j=1 Let aj = µj (X + B) − µj (B) and bj = µj (A) − µj (X + B) for 1 ≤ j ≤ n. Since aj + bj = µj (A) − µj (B), considering the diagonal matrices we have by Lemma 1.11, k X j=1 |µ(A) − µ(B)|[j] = k X j=1 µj (Diag(µ1 (A) − µ1 (B), . . . , µn (A) − µn (B))) ≤ kDiag(a1 , . . . , an )k1 + kkDiag(b1 , . . . , bn )k∞ n X |aj | + k max |bj |. = j=1 But Lemma 1.10 gives n X j=1 |aj | = n X j=1 1≤j≤n |µj (X + B) − µj (B)| ≤ kXk1, while by Proposition 1.4(10) max |bj | = max |µj (A) − µj (X + B)| ≤ kA − (X + B)k∞ = kY k∞ . 1≤j≤n 1≤j≤n Therefore k X j=1 as desired. |µ(A) − µ(B)|[j] ≤ kXk1 + kkY k∞ = k X j=1 µj (A − B), LOG-MAJORIZATIONS AND NORM INEQUALITIES 131 P r o o f o f T h e o r e m 1.6. Let A and B be Hermitian. Since n n X X λj (A − B), {λj (A) − λj (B)} = tr(A − B) = j=1 j=1 it suffices to show that λ(A) − λ(B) ≺w λ(A − B). Choose a, b ∈ R such that A + aI ≥ B + bI ≥ 0. Then Theorem 1.5 gives λ(A + aI) − λ(B + bI) ≺w λ(A − B + (a − b)I). So the assertion follows from λ(A + aI) = λ(A) + a and analogous equalities for λ(B + bI) and λ(A − B + (a − b)I). Another important majorization for singular values of matrices is the Gelfand–Naimark theorem as follows. Theorem 1.12. For any n × n matrices A and B, (1.8) (µi (A)µn−i+1 (B)) ≺(log) µ(AB), or equivalently (1.9) k Y j=1 µij (AB) ≤ k Y j=1 {µj (A)µij (B)} for all 1 ≤ i1 < i2 < . . . < ik ≤ n. An immediate corollary of this theorem is the majorization result due to Horn. Corollary 1.13. For any matrices A and B, µ(AB) ≺(log) µ(A)µ(B), where µ(A)µ(B) = (µi (A)µi (B)). The following proof of Theorem 1.12 is a modification of [64] where (1.9) was proved in the von Neumann algebra setting. Lemma 1.14. Assume that A is invertible. Then for any 1 ≤ k ≤ n, there exist X and Y such that X ≥ I, A = Y X, and k Y j=1 µj (A) = det X · kY kk∞ , where det X denotes the determinant of X. Pn P r o o f. Let A = V |A| be the polar decomposition and |A| = j=1 µj (A)Pj the spectral decomposition. Define k n n k o n X X X X µj (A) X= Pj + µj (A)Pj . Pj + Pj , Y = V µk (A) µ (A) j=1 k j=1 j=k+1 j=k+1 Then we have X ≥ I and Y X = A. Moreover, det X · kY kk∞ = k k Y Y µj (A) µj (A). · µk (A)k = µ (A) j=1 k j=1 132 F. HIAI P r o o f o f T h e o r e m 1.12. To prove (1.8) and (1.9), we may assume by Proposition 1.14(10) that A and B are invertible. Note that n Y {µi (A)µn−i+1 (B)} = | det A · det B| = i=1 n Y µi (AB). i=1 Hence (1.8) is equivalent to (1.9) if we replace A and B by AB and B −1 , respectively, in (1.8). Now let 1 ≤ i1 < . . . < ik ≤ n and X, Y be as in Lemma 1.14. Since µi (AB) = µi (Y XB) ≤ kY k∞ µi (XB), 1 ≤ i ≤ n, we get k Y j=1 µij (AB) ≤ kY kk∞ · k Y µij (XB). j=1 On the other hand, since by Proposition 1.4, µi (XB) = µi (B ∗ X 2 B)1/2 ≥ µi (B ∗ B)1/2 = µi (B), 1 ≤ i ≤ n, we get k n Y µij (XB) Y µi (XB) det |XB| ≤ = = det X. (B) µ µ (B) det |B| i i j j=1 i=1 Therefore k Y j=1 µij (AB) ≤ det X · kY kk∞ · k Y µij (B) = k Y {µj (A)µij (B)}, j=1 j=1 showing (1.9). R e m a r k 1.15. The above proof is a multiplicative or logarithmic counterpart of the real interpolation method in the proof of Theorem 1.5. As shown in [64], for 1 ≤ k ≤ n we have k Y µj (A) = min{k log |X| k1 + k log kY k∞ : A = Y X} (1.10) log j=1 whenever µk (A) > 0. In fact, if A = Y X then log k Y j=1 µj (A) ≤ k X j=1 log{kY k∞ µj (X)} ≤ k X j=1 λj (log |X|) + k log kY k∞ ≤ k log |X| k1 + k log kY k∞ . This together with the proof of Lemma 1.14 shows (1.10). 1.4. Majorization for operators. In this subsection let us extend Theorems 1.5 and 1.12 to generalized singular values µ(A) = (µ1 (A), µ2 (A), . . .) of infinite-dimensional operators. The next lemma is useful for this sake. Lemma 1.16. Let A ∈ B(H), n ∈ N, and ε > 0. Then there exists a projection P of finite rank such that µi (P AP ) ≥ (µi (A) − ε)+ , 1 ≤ i ≤ n. 133 LOG-MAJORIZATIONS AND NORM INEQUALITIES Furthermore, there exists an orthonormal set {ξ1 , . . . , ξn } such that k |A|ξi − µi (A)ξi k ≤ ε, 1 ≤ i ≤ n. P r o o f. When µm (A) > 0 = µm+1 (A) for some 1 ≤ m < n, we may consider (µ1 (A), . . . , µm (A)) instead of (µ1 (A), . . . , µn (A)). So it suffices to assume that µn (A) > ε > 0. Let α = µ∞ (A) (= limn→∞ µn (A)). Suppose that µn (A) = α and µk−1 (A) > α = µk (A) = . . . = µn (A) for some 1 ≤ k ≤ n. Then all µi (A), 1 ≤ i < k (if any), are eigenvalues of |A| and E|A| (α) − E|A| (α − ε) is of infinite rank. So we can choose an orthonormal set {ξ1 , . . . , ξn } such that |A|ξi = µi (A)ξi for 1 ≤ i < k and ξi , k ≤ i ≤ n, are in the range of E|A| (α) − E|A| (α − ε). Let P0 and P be the projections onto the subspaces spanned by {ξ1 , . . . , ξn } and {ξ1 , . . . , ξn , Aξ1 , . . . , Aξn }, respectively. For 1 ≤ i ≤ n we get µi (P AP ) ≥ µi ((P AP )P0 ) = µi (AP0 ) = µi (P0 |A|2 P0 )1/2 ≥ α − ε. For 1 ≤ i < k, since µi (P0 |A|2 P0 )1/2 = µi (A), we get µi (P AP ) = µi (A). Hence the required condition is fulfilled. When µn (A) > α, all µi (A), 1 ≤ i ≤ n, are eigenvalues of |A| and the proof is done as above. Also the second assertion follows from the above proof. Theorem 1.17. If A, B ∈ B(H), then k X (1.11) j=1 |µij (A) − µij (B)| ≤ k X j=1 µj (A − B) for all 1 ≤ i1 < i2 < . . . < ik . In particular , (1.12) µ(A + B) ≺w µ(A) + µ(B), A, B ∈ B(H). P r o o f. Let 1 ≤ i1 < . . . < ik = n. For any ε > 0 there exists, by Lemma 1.16, projections P, Q of finite rank such that µi (P AP ) ≥ (µi (A) − ε)+ and µi (QBQ) ≥ (µi (B) − ε)+ for 1 ≤ i ≤ n. Let E = P ∨ Q, a projection of finite rank. Then for 1 ≤ i ≤ n, µi (A) ≥ µi (EAE) ≥ µi (P AP ) ≥ (µi (A) − ε)+ and µi (B) ≥ µi (EBE) ≥ (µi (B) − ε)+ . Applying Theorem 1.5 to EAE, EBE ∈ B(EH) (considered as matrices), we have k X j=1 |µij (EAE) − µij (EBE)| ≤ k X j=1 µj (E(A − B)E) ≤ k X j=1 µj (A − B). Letting ε ↓ 0 we obtain (1.11), which implies (1.12) by letting ij = j and replacing A by A + B. Theorem 1.18. If A, B ∈ B(H), then (1.13) k Y j=1 µij (AB) ≤ k Y j=1 {µj (A)µij (B)} for all 1 ≤ i1 < i2 < . . . < ik . In particular , (1.14) µ(AB) ≺w(log) µ(A)µ(B), A, B ∈ B(H). 134 F. HIAI P r o o f. Let 1 ≤ i1 < . . . < ik = n and ε > 0. By Lemma 1.16 there exists a projection P of finite rank such that µi (P ABP ) ≥ (µi (AB) − ε)+ for 1 ≤ i ≤ n. Let E be the projection onto the subspace spanned by P H ∪ BP H, which is of finite rank. Then P ABP = (P AE)(EBP ). Applying (1.9) to P AE, EBP ∈ B(EH), we have k X j=1 µij (P ABP ) ≤ k Y j=1 {µj (P AE)µij (EBP )} ≤ Y {µj (A)µij (B)}, which shows (1.13) as ε ↓ 0. 2. Symmetric norms and symmetrically normed ideals. This section is a selfcontained review on symmetric norms and symmetrically normed ideals. Our exposition is somewhat restricted to the material which will be necessary in the subsequent sections. See [32, 79] (also [77]) for full theory on the subject. 2.1. Symmetric gauge functions and symmetric norms. Let sfin denote the linear space of all infinite sequences of real numbers having only finitely many nonzero terms. A norm Φ on sfin is called to be symmetric if Φ satisfies (2.1) Φ(a1 , a2 , . . .) = Φ(ε1 aπ(1) , ε2 aπ(2) , . . .) for any permutation π on N and εi = ±1. This condition is equivalently written as Φ(a1 , a2 , . . .) = Φ(a∗1 , a∗2 , . . .) where (a∗1 , a∗2 , . . .) is the decreasing rearrangement of (|a1 |, |a2 |, . . .). A symmetric norm on sfin is called a symmetric gauge function. Lemma 2.1. Let Φ be a symmetric gauge function. (1) If (ai ), (bi ) ∈ sfin and |ai | ≤ |bi | for i ∈ N, then Φ(a1 , a2 , . . .) ≤ Φ(b1 , b2 , . . .). (2) Under the normalization Φ(1, 0, 0, . . .) = 1, X |ai |, sup |ai | ≤ Φ(a1 , a2 , . . .) ≤ i i (ai ) ∈ sfin , that is, the ℓ∞ -norm (resp. ℓ1 -norm) is the least (resp. greatest ) symmetric gauge function. P r o o f. (1) In view of (2.1) we may show that for 0 ≤ α ≤ 1, Φ(αa1 , a2 , a3 , . . .) ≤ Φ(a1 , a2 , a3 , . . .). This is seen as follows: Φ(αa1 , a2 , a3 , . . .) 1+α 1−α 1+α 1−α 1+α 1−α =Φ a1 + (−a1 ), a2 + a2 , a3 + a3 , . . . 2 2 2 2 2 2 1−α 1+α Φ(a1 , a2 , a3 , . . .) + Φ(−a1 , a2 , a3 , . . .) = Φ(a1 , a2 , a3 , . . .). ≤ 2 2 (2) Since by (2.1) and (1), |ai | = Φ(ai , 0, 0, . . .) ≤ Φ(a1 , . . . , ai , . . .), LOG-MAJORIZATIONS AND NORM INEQUALITIES 135 the first inequality holds. The second follows from Φ(a1 , . . . , an , 0, 0, . . .) ≤ n X Φ(ai , 0, 0, . . .) = n X i=1 i=1 |ai |. Lemma 2.2. If (a1 , . . . , an ), (b1 , . . . , bn ) ∈ Rn and (|a1 |, . . . , |an |) ≺w (|b1 |, . . . , |bn |), then Φ(a1 , . . . , an , 0, 0, . . .) ≤ Φ(b1 , . . . , bn , 0, 0, . . .). P r o o f. By Proposition 1.2 there exists (c1 , . . . , cn ) ∈ Rn such that (|a1 |, . . . , |an |) ≤ (c1 , . . . , cn ) ≺ (|b1 |, . . . , |bn |). Proposition 1.1 says that (c1 , . . . , cn ) is a convex combination of coordinate permutations of (|b1 |, . . . , |bn |). This implies by Lemma 2.1(1) and (2.1) that Φ(a1 , . . . , an , 0, 0, . . .) ≤ Φ(c1 , . . . , cn , 0, 0, . . .) ≤ Φ(b1 , . . . , bn , . . .). Let C(H) be the algebra of all compact operators on a separable Hilbert space H. We denote by Cfin (H) the set of all operators of finite rank on H. A norm k · k on Cfin (H) is called to be unitarily invariant if kU AV k = kAk for every A ∈ Cfin (H) and any unitaries U, V on H. A unitarily invariant norm on Cfin (H) is also called a symmetric norm. The following fundamental theorem is due to von Neumann [67] (also [63, 77]). Theorem 2.3. There is a bijective correspondence between symmetric gauge functions Φ and unitarily invariant norms k · k on Cfin (H) which is determined by the formula (2.2) kAk = Φ(µ1 (A), µ2 (A), . . .), A ∈ Cfin (H). Furthermore, a norm k · k on Cfin (H) is unitarily invariant if and only if (2.3) kXAY k ≤ kXk∞ kY k∞ kAk for every A ∈ Cfin (H) and X, Y ∈ B(H). P r o o f. Suppose that Φ is a symmetric gauge function. Define k · k on Cfin (H) by the formula (2.2). Let A, B ∈ Cfin (H). Since µ(A + B), µ(A) + µ(B) ∈ sfin and µ(A + B) ≺w µ(A) + µ(B) by (1.12) (or Corollary 1.7(1)), we get, by Lemma 2.2, kA + Bk ≤ Φ(µ1 (A) + µ1 (B), µ2 (A) + µ2 (B), . . .) ≤ Φ(µ1 (A), µ2 (A), . . .) + Φ(µ1 (B), µ2 (B), . . .) = kAk + kBk. Also it is clear that kAk = 0 if and only if µ1 (A) = 0 or A = 0. For α ∈ C we get by Proposition 1.4(4) kαAk = Φ(|α|µ1 (A), |α|µ2 (A), . . .) = |α| kAk. Hence k · k is a norm on Cfin (H), which is unitarily invariant because µn (U AV ) = µn (A) for all unitaries U, V . 136 F. HIAI Conversely, suppose that k · k is a unitarily invariant norm on Cfin (H). Choose an orthonormal basis {ϕi } of H and define Φ : sfin → R by X (ai ) ∈ sfin , ai ϕi ⊗ ϕi , Φ(a1 , a2 , . . .) = i where ϕ ⊗ ψ denotes the Schatten form, i.e. (ϕ ⊗ ψ)ξ = hξ, ψiϕ for ϕ, ψ, ξ ∈ H. Then it is immediate that Φ is a norm on sfin . For any permutation π on N and εi = ±1, we can define unitaries U, V on H by U ϕπ(i) = εi ϕi and V ϕπ(i) = ϕi for i ∈ N, so that X X aπ(i) U ϕπ(i) ⊗ V ϕπ(i) aπ(i) ϕπ(i) ⊗ ϕπ(i) V ∗ = Φ(a1 , a2 , . . .) = U i X i εi aπ(i) ϕi ⊗ ϕi = Φ(ε1 aπ(i) , ε2 aπ(2) , . . .). = i P Hence Φ is a symmetric gauge function. Any A ∈ Cfin (H) is written as A = ni=1 µi (A)ξi ⊗ ηi for some orthonormal sets {ξ1 , . . . , ξn } and {η1 , . . . , ηn }. Take unitaries U, V such that U ϕi = ξi , V ϕi = ηi for 1 ≤ i ≤ n. Then we get n n X X µi (A)ϕi ⊗ ϕi V ∗ = kAk, µi (A)ϕi ⊗ ϕi = U Φ(µ1 (A), µ2 (A), . . .) = i=1 i=1 so that (2.2) holds. Therefore the first assertion is proved. It is clear that a norm k · k on Cfin (H) is unitarily invariant if it satisfies (2.3). Conversely, if k · k is a unitarily invariant norm with the corresponding gauge function Φ, then by Proposition 1.4(7), kXAY k ≤ Φ(kXk∞ kY k∞ µ1 (A), kXk∞ kY k∞ µ2 (A), . . .) = kXk∞ kY k∞ kAk for A ∈ Cfin (H) and X, Y ∈ B(H). 2.2. Symmetrically normed ideals. Let Φ be a symmetric gauge function. When a = (a1 , a2 , . . .) is a bounded real sequence, we define Φ(a) = sup Φ(a1 , . . . , an , 0, 0, . . .) ∈ [0, ∞]. n Let sΦ be the set of all bounded real sequences a with Φ(a) < ∞. Moreover, extending k · k on Cfin (H), we define for any A ∈ B(H), (2.4) kAk = sup Φ(µ1 (A), . . . , µn (A), 0, 0, . . .) ∈ [0, ∞]. n Let CΦ (H) denote the set of all A ∈ B(H) with kAk < ∞, i.e. µ(A) = (µ1 (A), µ2 (A), . . .) ∈ sΦ . In this way, a symmetric norm k · k on Cfin (H) can extend to all operators in B(H) permitting ∞. Then we have: Proposition 2.4. Let A, B, X, Y ∈ B(H) and k · k be a symmetric norm. (1) (2) (3) (4) kAk = kA∗ k. kXAY k ≤ kXk∞ kY k∞ kAk. If µ(A) ≺w µ(B) (in particular , if |A| ≤ |B|), then kAk ≤ kBk. If µ(A) ≺w µ(B) and B ∈ C(H), then A ∈ C(H). 137 LOG-MAJORIZATIONS AND NORM INEQUALITIES (5) Under the normalization Φ(1, 0, 0, . . .) = 1 (or kP k = 1 for a projection of rank one), kAk∞ ≤ kAk ≤ kAk1 . P r o o f. (1) and (2) immediately follow from definition (2.4), Lemma 2.1(1), and the corresponding properties of Proposition 1.4. If µ(A) ≺w µ(B) then Lemma 2.2 gives Φ(µ1 (A), . . . , µn (A), 0, 0, . . .) ≤ Φ(µ1 (B), . . . , µn (B), 0, 0, . . .), n ∈ N, and so kAk ≤ kBk. Hence (3) holds. (4) is seen from the fact that A ∈ C(H) if and Pn only if µn (A) → 0, which is equivalent to n−1 i=1 µi (A) → 0. (5) follows from Lemma 2.1(2). Theorem 2.5. Let Φ be a symmetric gauge function. (1) CΦ (H) is a Banach space in the norm (2.4) and is a two-sided ideal of B(H). (2) If Φ is inequivalent to the ℓ∞ -norm, then CΦ (H) ⊂ C(H). P r o o f. (1) Let A, B ∈ CΦ (H). For every n ∈ N, since (1.12) gives (µ1 (A + B), . . . , µn (A + B)) ≺w (µ1 (A) + µ1 (B), . . . , µn (A) + µn (B)), we have by Lemma 2.2, Φ(µ1 (A + B), . . . , µn (A + B), 0, 0, . . .) ≤ Φ(µ1 (A), . . . , µn (A), 0, 0, . . .) + Φ(µ1 (B), . . . , µn (B), 0, 0, . . .) ≤ kAk + kBk. This implies that A + B ∈ CΦ (H) and kA + Bk ≤ kAk + kBk. Now it is easily verified that CΦ (H) is a linear space and k · k is a norm on CΦ (H). To prove the completeness, let {Ak } be a Cauchy sequence in CΦ (H). Since {Ak } is k · k∞ -Cauchy by Proposition 2.4(5), there exists A ∈ B(H) such that kAk − Ak∞ → 0. For every n ∈ N, since Φ(µ1 (Ak − Am ), . . . , µn (Ak − Am ), 0, 0, . . .) ≤ kAk − Am k and µi (Ak − Am ) → µi (Ak − A) as m → ∞ thanks to Proposition 1.4(10), we get Φ(µ1 (Ak − A), . . . , µn (Ak − A), 0, 0, . . .) ≤ lim kAk − Am k. m→∞ Therefore kAk − Ak ≤ limm kAk − Am k, which implies that A ∈ CΦ (H) and kAk − Ak → 0 as k → ∞. It is immediate from Proposition 2.4(2) that CΦ (H) is a two-sided ideal of B(H). (2) Suppose that CΦ (H) contains a non-compact operator A. Then we have α = inf n µn (A) > 0 and αΦ(1, 1, 1, . . .) = Φ(α, α, α, . . .) ≤ Φ(µ1 (A), µ2 (A), . . .) = kAk < ∞, so that Φ(a1 , a2 , . . .) ≤ α−1 kAk supn |an |. This together with Lemma 2.1(2) implies that Φ is equivalent to the ℓ∞ -norm. (0) The Banach space CΦ (H) as well as CΦ (H) defined below is called a symmetrically normed ideal associated with a symmetric gauge function Φ. It is seen as in Theorem 2.5 that sΦ is a Banach space in the norm Φ. In fact, if {ϕi } is an orthonormal basis P of H, then it is not difficult to show that Φ(a) = k i ai ϕi ⊗ ϕi k for any bounded real P sequence a = (ai ) and sΦ is isometrically imbedded in CΦ (H) by a ∈ sΦ 7→ i ai ϕi ⊗ ϕi (0) whose range is a closed (real) subspace of CΦ (H). Let sΦ be the closure of sfin in sΦ . 138 F. HIAI (0) (0) (0) We denote by CΦ (H) the set of all A ∈ B(H) such that µ(A) ∈ sΦ . When sΦ = sΦ or (0) CΦ (H) = CΦ (H), Φ is called regular. (0) Theorem 2.6. Let Φ, CΦ (H), and CΦ (H) be as above. (0) (0) (1) CΦ (H) is the closure of Cfin (H) in CΦ (H) and CΦ (H) ⊂ C(H). (0) (2) CΦ (H) is a Banach space in the norm k · k of CΦ (H) and is a two-sided ideal of B(H). (0) P r o o f. (1) It is clear that Cfin (H) ⊂ CΦ (H). It follows from Lemma 2.1(2) that if (0) a = (a1 , a2 , . . .) ∈ sΦ , then a belongs to the ℓ∞ -norm closure of sfin so that an → 0. P (0) (0) This shows that CΦ (H) ⊂ C(H). So each A ∈ CΦ (H) is written as A = i µi (A)ξi ⊗ ηi with orthonormal sequences {ξi } and {ηi }. Then ∞ n X X µi (A)ξi ⊗ ηi = Φ(µn+1 (A), µn+2 (A), . . .). µi (A)ξi ⊗ ηi = A − i=n+1 i=1 (0) But it is easily seen from µ(A) ∈ sΦ that lim Φ(µn+1 (A), µn+2 (A), . . .) = 0. n→∞ Hence A is in the closure of Cfin (H). Conversely, let A be in the closure of Cfin (H). For any ε > 0 there exists B ∈ Cfin (H) such that kA − Bk < ε. Since (1.11) says that for every n ∈ N, (|µ1 (A) − µ1 (B)|, . . . , |µn (A) − µn (B)|) ≺w (µ1 (A − B), . . . , µn (A − B)), it follows from Lemma 2.2 that Φ(µ1 (A) − µ1 (B), µ2 (A) − µ2 (B), . . .) ≤ kA − Bk < ε. (0) (0) Since µ(B) ∈ sfin , we have µ(A) ∈ sΦ and so A ∈ CΦ (H). (2) The first assertion follows from (1). The second is easily verified by using Proposition 2.4(2). For instance, let Φp be the ℓp -norm and k · kp the corresponding symmetric norm where 1 ≤ p ≤ ∞. When 1 ≤ p < ∞ we have CΦp (H) = Cp (H), the Schatten p-class. More generally, for 0 < p < ∞ we can define the p-class Cp (H) as the space of all A ∈ C(H) such that o1/p nX µi (A)p < ∞. kAkp = (tr |A|p )1/p = i But when 0 < p < 1, k · kp is not a norm but a quasi-norm. In particular, C1 (H) is the trace class and C2 (H) is the Hilbert-Schmidt class. When p = ∞ we have CΦ∞ (H) = B(H) (0) and CΦ∞ (H) = C(H). Note that kAkp ≥ kAkq and Cp (H) ⊂ Cq (H) if 0 < p < q ≤ ∞. Another important class of symmetric norms is the Ky Fan norms k · k(k) defined by kAk(k) = k X i=1 µi (A), k ∈ N. Obviously, all k · k(k) are equivalent to k · k∞ and k · k(1) = k · k∞ . See [32] for more delicate examples of symmetrically normed ideals such as Macaev ideals. LOG-MAJORIZATIONS AND NORM INEQUALITIES 139 Here we show the Hölder inequality to illustrate the usefulness of the majorization technique. Proposition 2.7. Let 0 < p, p1 , p2 ≤ ∞ and 1/p = 1/p1 + 1/p2 . Then kABkp ≤ kAkp1 kBkp2 , A, B ∈ B(H). Hence AB ∈ Cp (H) if A ∈ Cp1 (H) and B ∈ Cp2 (H). P r o o f. Suppose that 0 < p1 , p2 < ∞, because the result is obvious by Proposition 1.4(7) when p1 = ∞ or p2 = ∞. Since by (1.14) (µi (AB)p ) ≺w(log) (µi (A)p µi (B)p ), it follows from Proposition 1.3 that (µi (AB)p ) ≺w (µi (A)p µi (B)p ). Since (p1 /p)−1 + (p2 /p)−1 = 1, the usual Hölder inequality for vectors shows that for every n ∈ N, n n o1/p o1/p nX nX µi (A)p µi (B)p ≤ µi (AB)p i=1 i=1 ≤ n nX µi (A)p1 i=1 n o1/p1 nX µi (B)p2 i=1 o1/p2 ≤ kAkp1 kBkp2 . This yields the conclusion. Corresponding to each symmetric gauge function Φ, let us define Φ′ : sfin → R by o nX ai bi : a ∈ sfin , Φ(a) ≤ 1 . Φ′ (b1 , b2 , . . .) = sup i Then it is an easy task to check that Φ′ is again a symmmetric norm on sfin . The symmetric gauge function Φ′ is said to be conjugate to Φ. Note that Φ′′ = Φ. For example, when 1 ≤ p ≤ ∞ and 1/p + 1/q = 1, the ℓp -norm is conjugate to the ℓq -norm. The following generalized Hölder inequality can be shown as Proposition 2.7. Lemma 2.8. Let Φ, Φ1 , Φ2 be symmetric gauge functions with the corresponding norms k · k, ||| · |||1 , ||| · |||2 . If Φ(a1 b1 , a2 b2 , . . .) ≤ Φ1 (a)Φ2 (b), a, b ∈ sfin , then ′ kABk ≤ |||A|||1 |||B|||2 , A, B ∈ B(H). In particular , if k · k is the symmetric norm corresponding to Φ′ conjugate to Φ, then kABk1 ≤ kAk kBk′ for every A, B ∈ B(H). Hence AB ∈ C1 (H) if A ∈ CΦ (H) and B ∈ CΦ′ (H). P r o o f. By (1.14), Proposition 1.3, and Lemma 2.2, we have for every n ∈ N Φ(µ1 (AB), . . . , µn (AB), 0, 0, . . .) ≤ Φ(µ1 (A)µ1 (B), . . . , µn (A)µn (B), 0, 0, . . .) ≤ Φ1 (µ1 (A), . . . , µn (A), 0, 0, . . .)Φ2 (µ1 (B), . . . , µn (B), 0, 0, . . .) ≤ |||A|||1 |||B|||1 , 140 F. HIAI showing the first assertion. For the second part, note by definition of Φ′ that X |ai bi | ≤ Φ(a)Φ′ (b), a, b ∈ sfin . i Theorem 2.9. If Φ and Φ′ are conjugate symmetric gauge functions, then the dual (0) (0) Banach space CΦ (H)∗ of CΦ (H) is isometrically isomorphic to CΦ′ (H) by the duality (0) (A, B) 7→ tr(AB) for A ∈ CΦ (H) and B ∈ CΦ′ (H), where tr denotes the usual trace on C1 (H). P r o o f. For any B ∈ CΦ′ (H) we can define, by Lemma 2.8, a linear functional fB : → C by fB (A) = tr(AB). Since (0) CΦ |tr(AB)| ≤ kABk1 ≤ kAk kBk′ , (0) (0) A ∈ CΦ (H), (0) it follows that fB ∈ CΦ (H)∗ and kfB k ≤ kBk′ . Now let f ∈ CΦ (H)∗ and consider a sesqui-linear form (ξ, η) 7→ f (ξ ⊗ η), ξ, η ∈ H. Since |f (ξ ⊗ η)| ≤ kf k kξ ⊗ ηk = kf kΦ(1, 0, 0, . . .)kξk kηk, there exists B ∈ B(H) such that hBξ, ηi = f (ξ ⊗ η) for all ξ, η ∈ H. For each n ∈ N and ε > 0, by Lemma 1.16 there exists a projection P of finite rank such that µi (P BP ) ≥ (µ (B) − ε)+ for 1 ≤ i ≤ n and also m = rank P ≥ n. We write P BP as P BP = Pim i=1 µi (P BP )ξi ⊗ ηi with orthonormal bases {ξi } and {ηi } of P H. Since µi (P BP ) = hP BP ηi , ξi i = hBηi , ξi i = f (ηi ⊗ ξi ), if a ∈ sfin and Φ(a) ≤ 1, then n X ai µi (P BP ) = f i=1 n X i=1 n X ai ηi ⊗ ξi ai ηi ⊗ ξi ≤ kf k i=1 = kf kΦ(a1 , . . . , an , 0, 0, . . .) ≤ kf k. This shows that Φ′ (µ1 (P BP ), . . . , µn (P BP ), 0, 0, . . .) ≤ kf k. Letting ε ↓ 0 we get Φ′ (µ1 (B), . . . , µn (B), 0, 0, . . .) ≤ kf k, n ∈ N, so that B ∈ CΦ′ (H) and kBk′ ≤ kf k. Since f (A) = fB (A) for A ∈ Cfin (H) and Cfin (H) (0) is dense in CΦ (H), we have f = fB and hence kfB k = kBk′ . Thus B ∈ CΦ′ (H) 7→ fB ∈ (0) CΦ (H)∗ is a surjective isometry. ∼ B(H) and Cp (H)∗ ∼ As special cases we have C1 (H)∗ = = Cq (H) when 1 < p < ∞, 1/p + 1/q = 1. The above theorem shows that CΦ (H)∗ ∼ C = Φ′ (H) if Φ is regular and that ′ CΦ (H) is reflexive if and only if both Φ and Φ are regular. 2.3. Further properties of symmetric norms. In this subsection let us present, for later use, some further results concerning symmetric norms. The close relation between the (log-)majorization and the symmetric norm inequalities is summarized in the following proposition. LOG-MAJORIZATIONS AND NORM INEQUALITIES 141 Proposition 2.10. Consider the following conditions for A, B ∈ B(H). Then: (i)⇔(ii)⇒(iii)⇔(iv)⇔(v)⇔(vi). (i) µ(A) ≺w(log) µ(B); (ii) kf (|A|)k ≤ kf (|B|)k for every symmetric norm k · k and every continuous increasing function f on [0, ∞) such that f (0) ≥ 0 and f (ex ) is convex ; (iii) µ(A) ≺w µ(B); (iv) kAk(k) ≤ kBk(k) for every k ∈ N; (v) kAk ≤ kBk for every symmetric norm k · k; (vi) kf (|A|)k ≤ kf (|B|)k for every symmetric norm k · k and every increasing convex function f on [0, ∞) such that f (0) ≥ 0. P r o o f. (i)⇒(ii). Let f be as in (ii). By Propositions 1.3 and 1.4(11) we have (2.5) µ(f (|A|)) = f (µ(A)) ≺w f (µ(B)) = µ(f (|B|)). This implies by Proposition 2.4(3) that kf (|A|)k ≤ kf (|B|)k for any symmetric norm. (ii)⇒(i). Take k · k = k · k(k) , the Ky Fan norms, and f (x) = log(1 + ε−1 x) for ε > 0. Then f satisfies the condition in (ii). Since µi (f (|A|)) = f (µi (A)) = log(ε + µi (A)) − log ε, kf (|A|)k(k) ≤ kf (|B|)k(k) means that k k Y Y (ε + µi (B)). (ε + µi (A)) ≤ i=1 i=1 Qk Qk Letting ε ↓ 0 we get i=1 µi (A) ≤ i=1 µi (B) and hence (i) follows. (iii) ⇔ (iv) is trivial by definition of k·k(k) and (vi)⇒(v)⇒(iv) is clear. Finally assume (iii) and let f be as in (vi). Proposition 1.2 yields (2.5) again, so that (vi) follows. Hence (iii)⇒(vi) holds. The following is a noncommutative analogue of Fatou’s lemma. Proposition 2.11. Any symmetric norm k · k given by (2.4 ) is lower-semicontinuous in WOT (i.e. the weak operator topology) on B(H). P r o o f. Let Φ be the symmetric gauge function for k · k. Using (2.4), Lemma 1.16, and Theorem 2.9, we have for every A ∈ B(H), kAk = sup{kP AP k : P is a projection of finite rank} (0) = sup{|tr(XP AP )| : X ∈ CΦ′ (H), kXk′ ≤ 1, P is a projection of finite rank}. Hence the assertion follows because A 7→ tr(XP AP ) is continuous in WOT whenever P is of finite rank. Let B(H)+ denote the set of positive operators in B(H). The next proposition extending [79, Theorem 2.16] is a noncommutative variant of the dominated convergence theorem. Proposition 2.12. Let Aj , A ∈ B(H) and B ∈ B(H)+ . Let Φ be a symmetric gauge function with the corresponding norm k · k. Assume that |Aj | ≤ B and |A∗j | ≤ B for 142 F. HIAI (0) all j as well as |A| ≤ B and |A∗ | ≤ B. If Aj → A in WOT and B ∈ CΦ (H), then kAj − Ak → 0. For the proof we need: Lemma 2.13. Let Φ and k · k be as above. (1) If 1 < p < ∞ and 1/p + 1/q = 1, then Φ(a1 b1 , a2 b2 , . . .) ≤ Φ(|a1 |p , |a2 |p , . . .)1/p Φ(|b1 |q , |b2 |q , . . .)1/q , a, b ∈ sfin . (2) For every 1 < p < ∞ define (p) Φ(p) (a1 , a2 , . . .) = Φ(|a1 |p , |a2 |p , . . .)1/p , a ∈ sfin . Then Φ is a symmetric gauge function and the corresponding symmetric norm is k | · |p k1/p . P r o o f. (1) We may assume that a, b ≥ 0. From the Young inequality ai bi ≤ p−1 api + Lemma 2.1(1) gives 1 1 Φ(a1 b1 , a2 b2 , . . .) ≤ Φ(ap1 , ap2 , . . .) + Φ(bq1 , bq2 , . . .). p q q −1 bqi , Replacing a, b by αa, α−1 b with α > 0, we get α−q αp Φ(ap1 , ap2 , . . .) + Φ(bq1 , bq2 , . . .). Φ(a1 b1 , a2 b2 , . . .) ≤ p q Then the desired inequality follows by minimizing the above right-hand side in α > 0. (2) Let 1/p + 1/q = 1 and a, b ∈ sfin . We have Φ(|a1 + b1 |p , |a2 + b2 |p , . . .) ≤ Φ(|a1 | |a1 + b1 |p−1 , |a2 | |a2 + b2 |p−1 , . . .) + Φ(|b1 | |a1 + b1 |p−1 , |b2 | |a2 + b2 |p−1 , . . .) ≤ {Φ(|a1 |p , |a2 |p , . . .)1/p + Φ(|b1 |p , |b2 |p , . . .)1/p } × Φ(|a1 + b1 |p , |a2 + b2 |p , . . .)1/q thanks to (1). Hence Φ(p) (a + b) ≤ Φ(p) (a) + Φ(p) (b). The remaining properties of Φ(p) , being a symmetric gauge function as well as the last assertion are obvious. (0) P r o o f o f P r o p o s i t i o n 2.12. For any ε > 0, since B ∈ CΦ (H), there exists a projection P of finite rank such that kBP ⊥ k < ε where P ⊥ = I − P . In view of Lemma 2.13(2) let k · k(2) denote the symmetric norm corresponding to Φ(2) . Then we get by assumption k |Aj |1/2 P ⊥ k(2) = kP ⊥ |Aj |P ⊥ k1/2 ≤ kP ⊥ BP ⊥ k1/2 < ε1/2 and similarly k |A∗j |1/2 P ⊥ k(2) < ε1/2 . Since Φ(ab) ≤ Φ(2) (a)Φ(2) (b) for a, b ∈ sfin by Lemma 2.13(1), the generalized Hölder inequality (Lemma 2.8) gives kAj P ⊥ k = k |Aj |P ⊥ k ≤ k |Aj |1/2 k(2) k |Aj |1/2 P ⊥ k(2) ≤ kBk1/2 ε1/2 , and similarly kP ⊥ Aj P k ≤ kA∗j P ⊥ k ≤ kBk1/2 ε1/2 . The same inequalities hold also for A. Therefore kAj − Ak ≤ k(Aj − A)P ⊥ k + kP ⊥ (Aj − A)P k + kP (Aj − A)P k ≤ 4kBk1/2 ε1/2 + kP (Aj − A)P k. LOG-MAJORIZATIONS AND NORM INEQUALITIES 143 Since P is of finite rank and Aj → A in WOT, it immediately follows that kP (Aj − A)P k → 0. Since ε is arbitrary, we have kAj − Ak → 0. For instance, if Aj , B ∈ B(H)+ , Aj ≤ B ∈ C(H), and Aj → A ∈ B(H) in WOT (hence A ≤ B), then kAj − Ak∞ → 0. As is remarked in [79, p. 39], the condition |A∗j | ≤ B cannot be dropped in the above proposition. 2.4. Ando’s extension of Birman–Koplienko–Solomyak majorization result. The final result of this section is taken from [3]. (See [42, Appendix] by H. Kosaki for its extension to the von Neumann algebra case.) Before stating the theorem, let us recall the notion of operator monotone functions. A continuous real function f on [0, ∞) is said to be operator monotone if A ≤ B implies f (A) ≤ f (B) for any A, B ∈ B(H)+ . Such a function is characterized by the following integral representation: (2.6) f (x) = a + bx + \ xt ∞ 0 x+t dν(t), x ≥ 0, T∞ where a ∈ R, b ≥ 0, and ν is a positive measure on (0, ∞) with 0 t(1 + t)−1 dν(t) < ∞. This type of integral representation is a central result in the Löwner theory of operator monotone functions (see [1, 24, 36] for details). The operator monotonicity of xθ on [0, ∞) with 0 < θ < 1 is due to the Löwner–Heinz inequality. Also it is well known that log(1+x) is operator monotone. Theorem 2.14. Let A, B ∈ B(H)+ and k · k be any symmetric norm. (1) If f is an operator monotone function on [0, ∞) with f (0) ≥ 0, then (2.7) µ(f (A) − f (B)) ≺w µ(f (|A − B|)). (2) If g is a continuous strictly increasing function on [0, ∞) such that g(0) = 0 and limt→∞ g(t) = ∞ and whose inverse function is operator monotone, then θ µ(g(|A − B|)) ≺w µ(g(A) − g(B)). When f (x) = x with 0 < θ < 1, the weak majorization (2.7) above was formerly proved by Birman, Koplienko, and Solomyak [18], which gives the generalized Powers– Størmer inquality: If 0 < θ < 1 and θ ≤ p ≤ ∞, then (2.8) kAθ − B θ kp/θ ≤ kA − Bkθp , A, B ∈ B(H)+ . The case when θ = 1/2 and p = 1 is known as the Powers–Størmer inequality [72], whose extension to the von Neumann algebra setting was obtained in [9, 34]. We first prepare simple facts to prove the theorem. Lemma 2.15. Let X, Y ∈ B(H) be self-adjoint and X = X+ − X− , Y = Y+ − Y− be the Jordan decompositions. (1) If X ≤ Y then µi (X+ ) ≤ µi (Y+ ) for all i. (2) If µ(X+ ) ≺w µ(Y+ ) and µ(X− ) ≺w µ(Y− ), then µ(X) ≺w µ(Y ). P r o o f. (1) Let Q be the support projection of X+ . Since X+ = QXQ ≤ QY Q ≤ QY+ Q, we have µi (X+ ) ≤ µi (QY+ Q) ≤ µi (Y+ ) by Proposition 1.4. 144 F. HIAI (2) It is rather easy to see that µ(X) is the decreasing rearrangement of the combination of µ(X+ ) and µ(X− ). So for each k ∈ N we can choose 0 ≤ m ≤ k so that k X µi (X) = m X µi (X+ ) + i=1 i=1 i=1 k−m X µi (X− ). Hence k X i=1 µi (X) ≤ m X µi (Y+ ) + k−m X µi (Y− ) ≤ i=1 i=1 k X µi (Y ), i=1 as desired. P r o o f o f T h e o r e m 2.14. (1) First assume that A ≥ B ≥ 0 and let C = A−B ≥ 0. In view of Proposition 2.10 it suffices to prove that (2.9) kf (B + C) − f (B)k(k) ≤ kf (C)k(k) , k ∈ N. For t > 0 let t x =1− , x+t x+t which is increasing on [0, ∞) with ht (0) = 0. According to the integral representation (2.6) with a, b ≥ 0, we have ht (x) = µi (f (C)) = f (µi (C)) = a + bµi (C) + \ µi (C)t ∞ 0 µi (C) + t so that (2.10) kf (C)k(k) ≥ bkCk(k) + dν(t) = a + bµi (C) + \ ∞ tµi (ht (C))dν(t), 0 \ ∞ tkht (C)k(k) dν(t). 0 On the other hand, since f (B + C) = aI + b(B + C) + \ ∞ tht (B + C)dν(t) 0 as well as the analogous expression for f (B), we have f (B + C) − f (B) = bC + \ ∞ 0 t{ht (B + C) − ht (B)}dν(t), so that (2.11) kf (B + C) − f (B)k(k) ≤ bkCk(k) + \ ∞ 0 tkht (B + C) − ht (B)k(k) dν(t). By (2.10) and (2.11) it suffices for (2.9) to show that kht (B + C) − ht (B)k(k) ≤ kht (C)k(k) , t > 0, k ∈ N. Since ht (x) = h1 (x/t), the case t = 1 is enough because we may replace B and C by t−1 B and t−1 C, respectively. So what remains to prove is the following: (2.12) k(B + I)−1 − (B + C + I)−1 k(k) ≤ kI − (C + I)−1 k(k) , k ∈ N. LOG-MAJORIZATIONS AND NORM INEQUALITIES 145 Since (B + I)−1 − (B + C + I)−1 = (B + I)−1/2 h1 ((B + I)−1/2 C(B + I)−1/2 )(B + I)−1/2 and k(B + I)−1/2 k∞ ≤ 1, we obtain µi ((B + I)−1 − (B + C + I)−1 ) ≤ µi (h1 ((B + I)−1/2 C(B + I)−1/2 )) = h1 (µi ((B + I)−1/2 C(B + I)−1/2 )) ≤ h1 (µi (C)) = µi (I − (C + I)−1 ) by repeated use of Proposition 1.4(7). Therefore (2.12) is proved. Next let us prove the general case A, B ≥ 0. Since 0 ≤ A ≤ B + (A − B)+ , it follows that f (A) − f (B) ≤ f (B + (A − B)+ ) − f (B), which implies by Lemma 2.15(1) that k(f (A) − f (B))+ k(k) ≤ kf (B + (A − B)+ ) − f (B)k(k) . Applying the first case to B + (A − B)+ and B, we get kf (B + (A − B)+ ) − f (B)k(k) ≤ kf ((A − B)+ )k(k) . Therefore (2.13) µ((f (A) − f (B))+ ) ≺w µ(f ((A − B)+ )). Exchanging the role of A, B gives (2.14) µ((f (A) − f (B))− ) ≺w µ(f ((A − B)− )). Here we may assume f (0) = 0 because f can be replaced by f −f (0). Then it is immediate that f ((A − B)+ )f ((A − B)− ) = 0 and f ((A − B)+ ) + f ((A − B)− ) = f (|A − B|). So µ(f (A)−f (B)) ≺w µ(f (|A−B|)) follows from (2.13) and (2.14) thanks to Lemma 2.15(2). (2) Let f be the inverse of g. Since f satisfies the condition of (1), we have k X i=1 µi (f (A) − f (B)) ≤ k X i=1 f (µi (A − B)), k ∈ N. Here replace A and B by g(A) and g(B), respectively. Then k X i=1 µi (A − B) ≤ k X i=1 f (µi (g(A) − g(B))), k ∈ N, which means that µ(A − B) ≺w f (µ(g(A) − g(B))). As is well known, the operator monotonicity of f implies the concavity of f , so that g is convex. Hence µ(g(|A − B|)) = g(µ(A − B)) ≺w µ(g(A) − g(B)) by Proposition 1.2. Problem 2.16. Let A, B ∈ B(H)+ and f be an operator monotone function on [0, ∞) with f (0) = 0. Then it is natural to ask whether or not the following variant of (2.7) holds: µ(f (A + B)) ≺w µ(f (A) + f (B)). 146 F. HIAI But it seems that the method in proving (2.7) does not work in this case. Theorem 2.14(1) and (1.12) show that µ(f (A + B)) ≺w µ(f (A)) + µ(f (B)), while a stronger result is found in [4, Theorem 6.9]. 3. Inequalites of Golden–Thompson type. This section is mostly taken from [40]. We obtain log-majorization results and norm inequalities of Golden–Thompson type for exponential operators. Furthermore, we discuss the norm convergence of exponential product formula, which is the main result of [40]. But accounts more detailed than those in [40] are supplied concerning technical parts: Lemmas 3.1, 3.2, 3.8, and theory of state perturbation in von Neumann algebras. 3.1. Antisymmetric tensor powers. First let us establish a machinery of antisymmetric tensors, which is quite useful in deriving log-majorization results. Let H be a separable Hilbert space as before. For each n ∈ N let ⊗n H denote the n-fold tensor product of H with itself, which is the completed Hilbert space of the n-fold algebraic tensor product with respect to the inner product defined by hξ1 ⊗ . . . ⊗ ξn , η1 ⊗ . . . ⊗ ηn i = n Y i=1 hξi , ηi i. For ξ1 , . . . , ξn ∈ H define ξ1 ∧ . . . ∧ ξn ∈ ⊗n H by 1 X (sign π)ξπ(1) ⊗ . . . ⊗ ξπ(n) , (3.1) ξ1 ∧ . . . ∧ ξn = √ n! π where π runs over all permutations on {1, . . . , n} and sign π = ±1 according as π is even or odd. The closed subspace of ⊗n H spanned by {ξ1 ∧ . . . ∧ ξn : ξi ∈ H} is called the n-fold antisymmetric tensor product of H and denoted by Λn H. In fact, the linear extension of the map ξ1 ⊗ . . . ⊗ ξn 7→ √1n! ξ1 ∧ . . . ∧ ξn is the projection of ⊗n H onto Λn H. A straightforward computation from (3.1) shows that (3.2) hξ1 ∧ . . . ∧ ξn , η1 ∧ . . . ∧ ηn i = det[hξi , ηj i]1≤i,j≤n . Note that ξ1 ∧ . . . ∧ ξn 6= 0 if and only if {ξ1 , . . . , ξn } is linearly independent. Moreover, if {ϕi } is an orthonormal basis of H, then {ϕi1 ∧ . . . ∧ ϕin : i1 < . . . < in } is an orthonormal basis of Λn H. For every A ∈ B(H) the n-fold tensor product ⊗n A ∈ B(⊗n H) is given as (⊗n A)(ξ1 ⊗ . . . ⊗ ξn ) = Aξ1 ⊗ . . . ⊗ Aξn . Since Λn H is invariant for ⊗n A, the antisymmetric tensor power Λn A of A can be defined as Λn A = ⊗n A|Λn H ; in fact, (3.3) (Λn A)(ξ1 ∧ . . . ∧ ξn ) = Aξ1 ∧ . . . ∧ Aξn . When dim H = N < ∞, ΛN H = C, ΛN A = det A, and Λn H = {0} for n > N . In the matrix theory Λn A is usually called the nth compound of A. The following are elementary properties of antisymmetric tensor powers. 147 LOG-MAJORIZATIONS AND NORM INEQUALITIES Lemma 3.1. Let X, Xj , Y, A ∈ B(H) and n ∈ N. (1) Λn (X ∗ ) = (Λn X)∗ . (2) Λn (XY ) = (Λn X)(Λn Y ). (3) If kXj − Xk∞ → 0, then kΛn Xj − Λn Xk∞ → 0. (4) If Xj → X in SOT (resp. WOT ) and supj kXj k∞ < ∞, then Λn Xj → Λn X in SOT (resp. WOT ). (5) If A ≥ 0, then Λn A ≥ 0 and Λn (Ap ) = (Λn A)p for all p > 0. (6) Λn (|X|) = |Λn X|. P r o o f. (1) and (2) are the restrictions of the corresponding formulas ⊗n (X ∗ ) = (⊗ X)∗ and ⊗n (XY ) = (⊗n X)(⊗n Y ) to Λn H. For (3) and (4) it suffices to show the corresponding convergences for ⊗n , which are readily verified. If A ≥ 0 then Λn A = (Λn (A1/2 ))∗ (Λn (A1/2 )) ≥ 0 by (1) and (2). When p is rational, the second assertion of (5) is immediate from (2). Then (3) implies the case of general p > 0. Finally (6) follows from (1), (2), and (5). n A useful technique in the majorization theory for operators is supplied by the following lemma. For matrices this is a consequence of the so-called Binet–Cauchy theorem ([62, pp. 503–504]). Lemma 3.2. For every A ∈ B(H) and n ∈ N, n Y (3.4) i=1 µi (A) = µ1 (Λn A) (= kΛn Ak∞ ). P r o o f. We may assume by Lemma 3.1(6) that A ≥ 0. First suppose that A is compact. Then there exists an orthonormal basis {ϕi } of H such that Aϕi = µi (A)ϕi for all i. Since {ϕi1 ∧ . . . ∧ ϕin : i1 < . . . < in } is a complete set of eigenvectors of Λn A with n o nY µij (A) ϕi1 ∧ . . . ∧ ϕin , (Λn A)(ϕi1 ∧ . . . ∧ ϕin ) = j=1 we have (3.4). Next let A ≥ 0 be general. Let Pk be projections of finite rank such that Pk ↑ I. Since Pk APk → A in SOT, Lemma 3.1(4) implies that Λn (Pk APk ) → Λn A in SOT. Since k · k∞ is lower-semicontinuous in WOT, we have kΛn Ak∞ ≤ lim inf kΛn (Pk APk )k∞ = lim inf k→∞ k→∞ n Y i=1 µi (Pk APk ) ≤ n Y µi (A) i=1 by the first case and Proposition 1.4(7). On the other hand, by Lemma 1.16 we can choose (k) (k) (k) (k) sequences of orthonormal sets {ξ1 , . . . , ξn } such that limk kAξi − µi (A)ξi k = 0 for 1 ≤ i ≤ n. Hence it follows from (3.2) and (3.3) that (k) (k) (k) (k) kΛn Ak∞ ≥ h(Λn A)(ξ1 ∧. . .∧ξn(k) ), ξ1 ∧. . .∧ξn(k) i = det[hAξi , ξj i]1≤i,j≤n → n Y µi (A) i=1 as k → ∞, completing the proof. Before going into the main part of this section, let us prove the Weyl majorization theorem as warming-up practice in the antisymmetric tensor technique. 148 F. HIAI Theorem 3.3. Let A ∈ C(H) and λ1 (A), λ2 (A), . . . be the eigenvalues of A arranged as |λ1 (A)| ≥ |λ2 (A)| ≥ . . . with algebraic multiplicities counted. Then n Y i=1 |λi (A)| ≤ n Y µi (A), n ∈ N. i=1 P r o o f. If λ is an eigenvalue of A with algebraic multiplicity m, then there exists a set {η1 , . . . , ηm } of independent vectors such that Aηj − ληj ∈ span{η1 , . . . , ηj−1 }, 1 ≤ j ≤ m. Hence for each n ∈ N we can choose independent vectors ξ1 , . . . , ξn such that Aξi = λi (A)ξi + ζi with ζi ∈ span{ξ1 , . . . , ξi−1 } for 1 ≤ i ≤ n. Then it is readily checked that n o nY λi (A) ξ1 ∧ . . . ∧ ξn (Λn A)(ξ1 ∧ . . . ∧ ξn ) = Aξ1 ∧ . . . ∧ Aξn = i=1 and ξ1 ∧ . . . ∧ ξn 6= 0, so that Qn n i=1 λi (A) is an eigenvalue of Λ A. Hence Lemma 3.2 gives n n Y Y n µi (A). λi (A) ≤ kΛ Ak∞ = i=1 i=1 3.2. Araki’s log-majorization result. Extending a trace inequality of Lieb and Thirring [60], Araki [12] showed the following log-majorization result by essentially the same method as below. Also, the same result for matrices was shown in [88] in the same way. Note that this was further generalized in [55] to the von Neumann algebra case, while the antisymmetric tensor technique can be no longer used. A generalization in another direction will be given in Proposition 5.8. Theorem 3.4. For every A, B ∈ B(H)+ , µ((A1/2 BA1/2 )r ) ≺w(log) µ(Ar/2 B r Ar/2 ), (3.5) r ≥ 1, or equivalently (3.6) µ((Ap/2 B p Ap/2 )1/p ) ≺w(log) µ((Aq/2 B q Aq/2 )1/q ), 0 < p ≤ q. P r o o f. We can pass to the limit from A + εI and B + εI as ε ↓ 0 by Proposition 1.4(10). So we may assume that A and B are invertible. First let us show that (3.7) k(A1/2 BA1/2 )r k∞ ≤ kAr/2 B r Ar/2 k∞ , r ≥ 1. To do so, it suffices to show that Ar/2 B r Ar/2 ≤ I implies A1/2 BA1/2 ≤ I, equivalently B r ≤ A−r implies B ≤ A−1 . But this is just the Löwner–Heinz inequality. For every n ∈ N, since by Lemma 3.1, Λn ((A1/2 BA1/2 )r ) = ((Λn A)1/2 (Λn B)(Λn A)1/2 )r , Λn (Ar/2 B r Ar/2 ) = (Λn A)r/2 (Λn B)r (Λn A)r/2 , it follows from (3.7) with Λn A, Λn B instead of A, B that kΛn ((A1/2 BA1/2 )r )k∞ ≤ kΛn (Ar/2 B r Ar/2 )k∞ . LOG-MAJORIZATIONS AND NORM INEQUALITIES 149 This means thanks to Lemma 3.2 that n n Y Y µi (Ar/2 B r Ar/2 ). µi ((A1/2 BA1/2 )r ) ≤ i=1 i=1 Hence (3.5) is proved. If we replace A, B by Ap , B p and take r = q/p, then µ((Ap/2 B p Ap/2 )q/p ) ≺w(log) µ(Aq/2 B q Aq/2 ), which implies (3.6) by Proposition 1.4(11). Theorem 3.4 and Proposition 2.10 give: Corollary 3.5. Let A, B ∈ B(H)+ and k · k be any symmetric norm. If f is a continuous increasing function on [0, ∞) such that f (0) ≥ 0 and f (ex ) is convex , then kf ((A1/2 BA1/2 )r )k ≤ kf (Ar/2 B r Ar/2 )k, r ≥ 1. In particular , k(A1/2 BA1/2 )r k ≤ kAr/2 B r Ar/2 k, r ≥ 1. 3.3. Trotter–Kato exponential product formula. To obtain norm inequalities of Golden –Thompson type, let us recall the form sum of positive self-adjoint operators. Let H and K be positive self-adjoint (not necessarily bounded) operators on H. Let D0 = D(H 1/2 ) ∩ D(K 1/2 ), D(H 1/2 ) being the domain of H 1/2 , H0 the closure of D0 , and P0 the projection onto H0 . We denote by H +̂ K the form sum of H and K, which is the positive self-adjoint operator on H0 associated with the closed positive quadratic form ξ ∈ D0 7→ kH 1/2 ξk2 + kK 1/2 ξk2 so that k(H +̂K)1/2 ξk2 = kH 1/2 ξk2 + kK 1/2 ξk2 , ξ ∈ D0 . The form sum H +̂K can be defined also for lower-bounded self-adjoint H, K as follows: H +̂K = (H+ +̂K+ ) − P0 (H− + K− )P0 , where H = H+ − H− is the Jordan decomposition; or equivalently H +̂K = ((H + aI)+̂(K + bI)) − (a + b)P0 by taking a, b ∈ R such that H + aI, K + bI ≥ 0. Note that if H + K is essentially self-adjoint, then H +̂K coincides with the closure of H + K. The following (1) is the so-called Trotter–Kato product formula, which was shown in [51] in a more general form (see also [19]). The symmetric and continuous parameter version (2) was given in [40]. It is a simpler fact that if X, Y ∈ B(H) then limn→∞ (eX/n eY /n )n = eX+Y in the norm k · k∞ . But a quite general Trotter-like convergence for exponential products of bounded operators was established in [82]. Theorem 3.6. (1) If H and K are lower-bounded self-adjoint operators on H, then s-lim(e−tH/n e−tK/n )n = e−t(H +̂K) , n→∞ t > 0, the convergence being uniform in t ∈ [a, b] for any 0 < a < b. Here s-lim means the convergence in SOT and e−t(H +̂K) is written for e−t(H +̂K) P0 for brevity. 150 F. HIAI (2) With the same assumption and notation, s-lim(e−rtH/2 e−rtK e−rtH/2 )1/r = e−t(H +̂K) , r↓0 t > 0, the convergence being uniform in t ∈ [a, b] for any 0 < a < b. P r o o f. Let us only show (2), which is an easy consequence of (1). We may assume H, K ≥ 0 by taking H + aI, K + aI. For 0 < r < 1 write 1/r = n + s where n = n(r) ∈ N and s = s(r) ∈ [0, 1). Then (e−rtH/2 e−rtK e−rtH/2 )1/r = (e−rtH/2 e−rtK e−rtH/2 )s e−rtH/2 e−rtK (e−rtH e−rtK )n−1 e−rtH/2 . When r ↓ 0 (hence n → ∞), we have e−rtH/2 → I and e−rtK → I in SOT uniformly in t ∈ [0, b]. Since e−rtH/2 e−rtK e−rtH/2 ≤ (e−rtH/2 e−rtK e−rtH/2 )s ≤ I, we get s-lim(e−rtH/2 e−rtK e−rtH/2 )s = I (3.8) r↓0 uniformly in t ∈ [0, b]. Since r(n − 1)t → t, the uniform convergence of (1) implies that (3.9) s-lim{(e−rtH e−rtK )n−1 − e−r(n−1)t(H +̂K) } = 0 r↓0 uniformly in t ∈ [a, b]. Finally, it is immediate that (3.10) s-lim{e−r(n−1)t(H +̂K) − e−t(H +̂K) } = 0 r↓0 uniformly in t ∈ [0, b]. The above uniform convergences (3.8)–(3.10) altogether yield the conclusion. 3.4. Log-majorization and norm inequalities of Golden–Thompson type. The next theorem is the Golden–Thompson inequality strengthened to the form of log-majorization. Theorem 3.7. If H and K are lower-bounded self-adjoint operator on H, then µ(e−(H +̂K) ) ≺w(log) µ((e−rH/2 e−rK e−rH/2 )1/r ), r > 0. To prove the theorem, we give the following infinite-dimensional extension of Ky Fan’s multiplicative formula. n Y i=1 Lemma 3.8. For every A ∈ B(H) and n ∈ N, µi (A) = sup{det(P |A|P |P H ) : P is a projection of rank n} = sup{Re det(P U AP |P H ) : U is a unitary and P is a projection of rank n}. Qn Hence the function A 7→ i=1 µi (A) is lower-semicontinuous in WOT on B(H). P r o o f. If P is a projection of rank n and U is a unitary, then n n Y Y µi (A), µi (P |A|P ) ≤ det(P |A|P |P H ) = i=1 i=1 151 LOG-MAJORIZATIONS AND NORM INEQUALITIES Re det(P U AP |P H ) ≤ | det(P U AP |P H )| = n Y i=1 µi (P U AP ) ≤ n Y µi (A). i=1 To show the converse, we may assume that µn (A) > 0. By Lemma 1.16 (and its proof), (k) (k) we can choose sequences of orthonormal sets {ξ1 , . . . , ξn } in the range of |A| such (k) (k) that limk k |A|ξi − µi (A)ξi k = 0 for 1 ≤ i ≤ n. Let Pk be the projection onto (k) (k) the span of {ξ1 , . . . , ξn } and A = W |A| the polar decomposition. For each k, since (k) (k) (k) (k) {W ξ1 , . . . , W ξn } is orthonormal, there exists a unitary Uk such that Uk ξi = W ξi for 1 ≤ i ≤ n. Then (k) (k) det(Pk |A|Pk |Pk H ) = det[h|A|ξi , ξj i]i,j , (k) (k) (k) (k) (k) (k) det(Pk Uk∗ APk |Pk H ) = det[hAξi , Uk ξj i] = det[hAξi , W ξj i] = det[h|A|ξi , ξj i]. Q (k) (k) Since det[h|A|ξi , ξj i]ij → ni=1 µi (A) as k → ∞, we get the required formulas. Furthermore, the last assertion is immediate because A 7→ Re det(P U AP |P H ) is continuous in WOT whenever P is of finite rank. By the way, the following additive formula can be shown in a similar way: For every A ∈ B(H) and n ∈ N, n X i=1 µi (A) = sup{tr(P |A|P ) : P is a projection of rank n} = sup{Re tr(P U AP ) : U is a unitary and P is a projection of rank n} Pn and hence i=1 µi (A) is lower-semicontinuous in WOT. However, we do not know an explicit counter-example to µn (A) itself being lower-semicontinuous in WOT. Indeed, Lemmas 3.1(4) and 3.2 serve for the proof of Theorem 3.7. But the above multiplicative and additive formulas are worth pointing out by themselves. P r o o f o f T h e o r e m 3.7. For every n ∈ N we have n Y i=1 µi (e−(H +̂K) ) ≤ lim inf r↓0 n Y i=1 by Theorem 3.6(2) and Lemma 3.8. Since r ↓ 0 by (3.6), we conclude that n Y i=1 µi (e−(H +̂K) ) ≤ n Y µi ((e−rH/2 e−rK e−rH/2 )1/r ) Qn i=1 µi ((e−rH/2 e−rK e−rH/2 )1/r ) decreases as µi ((e−rH/2 e−rK e−rH/2 )1/r ), r > 0, i=1 as desired. Corollary 3.9. If H and K are lower-bounded self-adjoint operators on H and k · k is a symmetric norm, then ke−(H +̂K) k ≤ k(e−rH/2 e−rK e−rH/2 )1/r k, and the above right-hand side decreases as r ↓ 0. In particular , (3.11) r > 0, ke−(H +̂K) k ≤ ke−H/2 e−K e−H/2 k ≤ ke−H e−K k. 152 F. HIAI P r o o f. The first assertion is a consequence of Theorem 3.7, (3.6), and Proposition 2.10. The second inequality of (3.11) follows because by Proposition 1.4(5) ke−H e−K k = k |e−K e−H | k = k(e−H e−2K e−H )1/2 k. The specialization of (3.11) to the trace norm k·k1 is the celebrated Golden–Thompson trace inequality independently established in [33, 83, 84]. It was shown in [81] that tr eH+K ≤ tr(eH/n eK/n )n for every Hermitian matrices H, K and n ∈ N. For the matrix case, (3.11) was given in [58, 85]. Also (3.11) for the norm k · k∞ is known as Segal’s inequality ([73, p. 260]). A rather trivial consequence of (3.11) is that ke−(H +̂K) k ≤ ke−H k∞ ke−K k for any symmetric norm k · k. Hence, if Φ is a symmetric gauge function and e−K ∈ CΦ (H), then e−(H +̂K) ∈ CΦ (H) for every lower-bounded H. Concerning (quasi-)norms k · kp we have: Corollary 3.10. (1) If H, K are as above and 0 < p ≤ ∞, then ke−(H +̂K) kp ≤ k(e−rH/2 e−rK e−rH/2 )1/r kp , r > 0, and the above the right-hand side decreases as r ↓ 0. (2) When 0 < p, p1 , p2 ≤ ∞ and 1/p = 1/p1 + 1/p2 , if e−H ∈ Cp1 (H) and e−K ∈ Cp2 (H), then e−(H +̂K) ∈ Cp (H). P r o o f. (1) follows from Corollary 3.9 because for any 0 < p < ∞, 1/p k(e−rH/2 e−rK e−rH/2 )1/r kp = k(e−rH/2 e−rK e−rH/2 )p/r k1 1/p and ke−(H +̂K) kp = ke−p(H +̂K) k1 . (2) is immediate from (3.11) and the Hölder inequality (Proposition 2.7). The next example is given to show that k(e−rH/2 e−rK e−rH/2 )1/r k does not converge to ke−(H +̂K) k as r ↓ 0 in general. Example 3.11. Define 2 × 2 matrices P0 and Qn , n ∈ N, by p 1 − 1/n 1 0 p . P0 = , Qn = ξn ⊗ ξn with ξn = 0 0 1/n Then P0 and Qn are projections with P0 ∧ Qn = 0, so that P0⊥ + Q⊥ n is invertible. Let 0 < δ < 1 be arbitrary. For each n, since exp{(log ε)(P0⊥ + Q⊥ )} → 0 as ε ↓ 0, we can n choose 0 < εn < 1 such that k exp{(log εn )(P0⊥ + Q⊥ n )}k∞ ≤ δ. L∞ Putting Hn = (− log εn )P0⊥ and Kn = (− log εn )Q⊥ H = n , we define n=1 Hn and L∞ 2 L∞ K = n=1 Kn , which are positive self-adjoint operators on H = 1 C . Then e−rH/2 e−rK e−rH/2 = = ∞ M (e−rHn /2 e−rKn e−rHn /2 ) n=1 ∞ M r ⊥ r/2 {(P0 + εn P0⊥ )r/2 (Qn + εn Q⊥ }. n ) (P0 + εn P0 ) n=1 LOG-MAJORIZATIONS AND NORM INEQUALITIES 153 Since r ⊥ r/2 (P0 + εn P0⊥ )r/2 (Qn + εn Q⊥ n ) (P0 + εn P0 ) ≥ (P0 + εn P0⊥ )r/2 Qn (P0 + εn P0⊥ )r/2 = (P0 + εn P0⊥ )r/2 ξn ⊗ (P0 + εn P0⊥ )r/2 ξn , we have 1 εr ke−rH/2 e−rK e−rH/2 k∞ ≥ sup k(P0 + εn P0⊥ )r/2 ξn k2 = sup 1 − + n ≥ 1, n n n n so that k(e−rH/2 e−rK e−rH/2 )1/r k∞ ≥ 1 for all r > 0. On the other hand, e−(H +̂K) = ∞ M e−(Hn +Kn ) = n=1 and hence ke −(H +̂K) ∞ M exp{(log εn )(P0⊥ + Q⊥ n )} n=1 k∞ ≤ δ. 3.5. Trace norm convergence of exponential product formula. In this subsection let H and K be lower-bounded self-adjoint operators such that H +K is essentially self-adjoint. For simplicity we denote the closure of H + K by the same H + K. The following trace norm convergence of exponential product formula was established in [40]. Theorem 3.12. If e−K ∈ C1 (H), then (3.12) lim k(e−rH/2 e−rK e−rH/2 )1/r − e−(H+K) k1 = 0, (3.13) lim k(e−rK/2 e−rH e−rK/2 )1/r − e−(H+K) k1 = 0, (3.14) (3.15) r↓0 r↓0 lim k(e−H/n e−K/n )n+1 − e−(H+K) k1 = 0, n→∞ lim tr(e−H/n e−K/n )n = tr e−(H+K) . n→∞ First note that all the operators in the theorem belong to C1 (H). For instance, we have (e−rH/2 e−rK e−rH/2 )1/r ∈ C1 (H) for all r > 0, because with H ≥ aI, µk ((e−rH/2 e−rK e−rH/2 )1/r ) = µk (e−rK/2 e−rH e−rK/2 )1/r ≤ µk (e−ra e−rK )1/r = e−a µk (e−K ). The proof of the theorem in [40] is based on Araki’s Trotter product formula [8] involving the modular operator and Donald’s perturbation theory [23] for positive normal functionals on von Neumann algebras. So, for convenience, we first give a brief survey on these materials. Let M be a von Neumann algebra and (M, H, J, P) be a standard form ([9, 34]) of M. For example, a standard form of M = B(H) is given as (B(H), C2 (H), ∗ , C2 (H)+ ), where B(H) is represented on C2 (H) by left multiplication and C2 (H)+ is the set of positive + operators in C2 (H). Let M+ ∗ denote the positive part of the predual M∗ , and ϕ ∈ M∗ be faithful. Then ϕ is written as ϕ = h · Φ, Φi with the cyclic and separating vector Φ ∈ P. Let ∆Φ be the modular operator associated with Φ (or ϕ). See [80] for modular theory 154 F. HIAI of von Neumann algebras. For each h ∈ Msa where Msa is the self-adjoint part of M, Araki [7] defined the perturbed vector Φh by (3.16) h Φ = ∞ X \ t1 0 0 1/2 n (−1) n=0 dt1 \ \ tn−1 dt2 . . . t dtn ∆tΦn h∆Φn−1 −tn h . . . ∆tΦ1 −t2 hΦ, 0 ∆zΦ1 h∆zΦ2 h . . . ∆zΦn h where Φ is in the domain of for z = (z1 , . . . , zn ) ∈ Cn with Re z ∈ n {(s1 , . . . , sn ) ∈ R : s1 , . . . , sn ≥ 0, s1 + . . . + sn ≤ 1/2} and the sum in (3.16) absolutely converges. Then Φh is also a cyclic and separating vector in P. It is known [8] that Φ is in the domain of exp 12 (log ∆Φ − h) and log ∆Φ − h Φh = exp Φ. 2 Moreover, the perturbed functional ϕh is defined by ϕh = h · Φh , Φh i. (3.17) Note that Φh and ϕh here mean Φ−h and ϕ−h in the notation of [7, 8]. The following is a version of the Trotter product formula given in [8, Remarks 1, 2]. Proposition 3.13. Let Φ be as above. Then for every h ∈ Msa , 1/2n −h/2n n Φh = lim (∆Φ n→∞ e 1/2n n ) Φ = lim (e−h/2n ∆Φ n→∞ ) Φ strongly. The relative entropy for positive normal functionals on a von Neumann algebra was first introduced in [87] in the semifinite case and was extended in [10, 11] to the general case. For each ϕ, ψ ∈ M+ ∗ let Φ and Ψ be the vector representatives in P so that ϕ = h · Φ, Φi and ψ = h · Ψ, Ψi. Then Araki’s relative entropy S(ψ, ϕ) is defined by S(ψ, ϕ) = −h(log ∆Φ,Ψ )Ψ, Ψi if the support of ψ is dominated by that of ϕ, and S(ψ, ϕ) = ∞ otherwise, where ∆Φ,Ψ is the relative modular operator ([11]). See [86] for another definition and [54, 68] for properties of relative entropy. Next let h be a lower-bounded T∞ self-adjoint operator affiliated with M. So we have the spectral decomposition h = a λdeh (λ), where {eh (λ)}λ≥a is a spectral resolution in M T∞ with a ∈ R. For any ψ ∈ M+ ∗ , ψ(h) can be defined as ψ(h) = a λdψ(eh (λ)). For each ϕ ∈ M+ ∗ define c(ϕ, h) ∈ [−∞, ∞) by c(ϕ, h) = sup{−ψ(h) − S(ψ, ϕ) : ψ ∈ Σ∗ (M)}, where Σ∗ (M) denotes the set of normal states on M. According to [23, Theorem 3.1], if c(ϕ, h) > −∞, that is, ψ(h) + S(ψ, ϕ) < ∞ for some ψ ∈ Σ∗ (M), then there exists a unique ω ∈ Σ∗ (M) such that −ω(h) − S(ω, ϕ) = c(ϕ, h). This ω is denoted by [ϕh ], while it was denoted in [23] by ϕh . More generally, h can be an extended-valued lower-bounded operator (see [23]), but a self-adjoint h is enough for our purpose. h Note [70, Proposition 1] that when ϕ ∈ M+ ∗ is faithful and h ∈ Msa , [ϕ ] coincides h h h with (3.17) up to a normalization constant; more precisely [ϕ ] = ϕ /ϕ (I) and ϕh (I) = ec(ϕ,h) . So for any ϕ ∈ M+ ∗ and any lower-bounded self-adjoint operator h affiliated with LOG-MAJORIZATIONS AND NORM INEQUALITIES 155 M such that c(ϕ, h) > −∞, we can define the perturbed functional ϕh by ϕh = ec(ϕ,h) [ϕh ]. (3.18) Then h is called a relative Hamiltonian of ω = ϕh relative to ϕ. The following convergence result was proved in [23, Proposition 3.15], which plays a crucial role in the proof of Theorem 3.12. Proposition 3.14. Let hn and h be lower-bounded self-adjoint operators affiliated with + M such that hn ↑ h in the sense that ψ(hn ) ↑ ψ(h) for all ψ ∈ M+ ∗ . If ϕ ∈ M∗ satisfies h hn hn c(ϕ, h) > −∞, then c(ϕ, hn ) → c(ϕ, h), k[ϕ ] − [ϕ ]k → 0, and hence kϕ − ϕh k → 0. Now, under the above preparations, let us sketch the proof of Theorem 3.12. P r o o f o f T h e o r e m 3.12 (Sketch). The von Neumann algebra M = B(H) is represented by left multiplication on the standard Hilbert space C2 (H). Set Φ = e−K/2 ∈ C2 (H)+ , which is a cyclic and separating vector for M and defines a faithful ϕ ∈ M+ ∗ as ϕ = h · Φ, Φi = tr( · e−K ). We write ϕ = e−K under the usual identification B(H)∗ = C1 (H). Note that −it −itK ∆it XeitK , Φ X∆Φ = e (3.19) t ∈ R, X ∈ B(H), because both sides represent the modular automorphism group associated with Φ. We may assume that H, K ≥ 0. First assume that H is bounded. An argument of analytic continuation using (3.19) yields 1/2n n (e−H/2n ∆Φ (3.20) −H/2n −K/2n n ) Φ = (e−H/2n e−K/2n )n , −(H+K)/2 Since (e e ) → e show that ΦH = e−(H+K)/2 and n ∈ N. in SOT as n → ∞, Proposition 3.13 and (3.20) lim k(e−H/2n e−K/2n )n − e−(H+K)/2 k2 = 0, n→∞ which yields lim tr(e−H/4n e−K/2n e−H/4n )2n = tr e−(H+K) . n→∞ This together with Corollary 3.10(1) implies that (3.21) T∞ lim tr(e−rH/2 e−rK e−rH/2 )1/r = tr e−(H+K) . Tn r↓0 Next let H = a λdEH (λ) be lower-bounded and Hn = a λdEH (λ). Since c(ϕ, H) > −∞ follows from D(H) ∩ D(K) 6= {0}, we can define the perturbed functional ϕH by (3.18) as well as ϕHn . Then Proposition 3.14 shows that kϕHn − ϕH k → 0. We have ϕHn = e−(Hn +K) because ΦHn = e−(Hn +K)/2 for bounded Hn . Furthermore, (I + Hn + K)−1 ↓ (I + H + K)−1 and hence e−(Hn +K) → e−(H+K) in SOT (see the proofs of Lemmas 4.15 and 4.8(1) below). Therefore ϕH = e−(H+K) and lim ke−(Hn +K) − e−(H+K) k1 = 0. (3.22) n→∞ By (3.21) for bounded Hn and by (3.22) we get lim tr(e−rH/2 e−rK e−rH/2 )1/r ≤ lim tr(e−rHn /2 e−rK e−rHn /2 )1/r r↓0 r↓0 = tr e−(Hn +K) → tr e−(H+K) , 156 F. HIAI which implies (3.21) for lower-bounded H. Hence lim k(e−rH/2 e−rK e−rH/2 )1/2r k2 = ke−(H+K)/2 k2 . r↓0 Since it is easily seen from Theorem 3.6(2) that (e−rH/2 e−rK e−rH/2 )1/2r → e−(H+K)/2 weakly in C2 (H), we have lim k(e−rH/2 e−rK e−rH/2 )1/2r − e−(H+K)/2 k2 = 0, r↓0 which shows (3.12) by using the Hölder inequality. The proof of (3.13) is similar, and (3.14) is easily shown from (3.12) thanks to (3.23) (e−H/n e−K/n )n+1 = e−H/2n (e−H/2n e−K/n e−H/2n )n e−H/2n e−K/n . Finally (3.15) is immediate from (3.12). Corollary 3.15. If e−K ∈ Cp (H) where 0 < p < ∞, then lim k(e−rH/2 e−rK e−rH/2 )1/r − e−(H+K) kp = 0, r↓0 lim k(e−rK/2 e−rH e−rK/2 )1/r − e−(H+K) kp = 0, r↓0 lim k(e−H/n e−K/n )n+1 − e−(H+K) kp = 0. (3.24) n→∞ Furthermore, for every q > p, lim k(e−H/n e−K/n )n − e−(H+K) kq = 0. n→∞ P r o o f (Sketch). Choose k ∈ N such that 2k > 1/p. Applying (3.12) to pH, pK instead of H, K we have lim k(e−rH/2 e−rK e−rH/2 )p/r − e−p(H+K) k1 = 0, r↓0 so that by (2.8) with θ = 1/2k p k lim k(e−rH/2 e−rK e−rH/2 )1/2 r↓0 r k − e−(H+K)/2 k2k p = 0. Now, using the Hölder inequality repeatedly, we can obtain the first assertion. (A similar argument in more detail will be supplied in the proof of Corollary 4.23.) The second is similar and the third is shown by using (3.23). The last assertion is also shown by using (2.8) and the Hölder inequality (see [40] for details). Problems 3.16. (1) Is it possible to replace (e−H/n e−K/n )n+1 by (e−H/n e−K/n )n in (3.14) and (3.24)? (0) (2) In view of Corollary 3.15, we are tempted to conjecture that if e−K ∈ CΦ (H) then lim k(e−rH/2 e−rK e−rH/2 )1/r − e−(H+K) k = 0, r↓0 where Φ is a symmetric gauge function with the corresponding norm k · k. 4. Inequalites of complementary Golden–Thompson type. A log-majorization result in terms of power operator means was obtained in [6] in the matrix case, which gives rise to matrix norm inequalities considered as complementary to the Golden–Thompson LOG-MAJORIZATIONS AND NORM INEQUALITIES 157 ones. The aim of this section is to study log-majorizations and norm inequalities involving operator means for infinite-dimensional exponential operators. In the course of this study, we establish a Trotter-like exponential product formula for operator means, which may be interesting by itself. Most results in this section are new. 4.1. Preliminaries on operator means. First, for convenience, let us give a brief survey on operator means. An axiomatic approach for operator means was investigated by Kubo and Ando [57]. Assume that H is a separable and infinite-dimensional Hilbert space. A binary operation σ : B(H)+ × B(H)+ → B(H)+ is called an operator connection if it satisfies the following conditions (i)–(iii) for A, B, C, D ∈ B(H)+ : (i) A ≤ C and B ≤ D imply A σ B ≤ C σ D (joint monotonicity), (ii) C(A σ B)C ≤ (CAC) σ (CBC) (transformer inequality), (iii) An , Bn ∈ B(H)+ , An ↓ A, and Bn ↓ B imply An σ Bn ↓ A σ B (upper semicontinuity). An operator connection σ is called an operator mean if (iv) I σ I = I. The fundamental theorem of Kubo and Ando is summarized as follows: For each operator connection σ there exists a unique operator monotone function f ≥ 0 on [0, ∞) such that f (x)I = I σ (xI) for x ≥ 0. Then the map σ 7→ f is an affine order-isomorphism between the operator connections and the non-negative operator monotone functions on [0, ∞). The order preservation means that when σi 7→ fi for i = 1, 2, A σ1 B ≤ A σ2 B for all A, B ∈ B(H)+ if and only if f1 (x) ≤ f2 (x) for all x ≥ 0. The operator connection σ is defined via the corresponding function f by (4.1) A σ B = A1/2 f (A−1/2 BA−1/2 )A1/2 if A is invertible, and for general A, B −1/2 A σ B = s-lim A1/2 Bε Aε−1/2 )A1/2 ε f (Aε ε ε↓0 decreasingly, where Aε = A + εI and Bε = B + εI. Moreover σ is an operator mean if and only if f (1) = 1, which implies that A σ A = A for all A. The following are typical examples of operator means. (1) Arithmetic mean: A ∇ B = 12 (A + B). −1 −1 (2) Harmonic mean: A ! B = 2(A : B), where A : B = s-limε↓0 (A−1 is ε + Bε ) called the parallel sum of A, B. −1/2 1/2 1/2 −1/2 1/2 ) Aε . Bε Aε (3) Geometric mean: A # B = s-limε↓0 Aε (Aε The corresponding functions of ∇, ! , # are (1 + x)/2, 2x/(x + 1), x1/2 , respectively. For 0 ≤ α ≤ 1 let #α denote the α-power mean, which is the operator mean corresponding to the operator monotone function xα . Namely, for each A, B ∈ B(H)+ with A invertible, A #α B is defined by A #α B = A1/2 (A−1/2 BA−1/2 )α A1/2 . Here we adopt the usual convention B 0 = I for any B ∈ B(H)+ . Note that A #0 B = A, A #1 B = B, and A #1/2 B = A # B. 158 F. HIAI Using the integral representation (2.6) of f and transforming the measure ν to m = t(1 + t)−1 ν, we can represent A σ B as \ 1+t \ 1+t A σ B = aA + bB + {(tA) : B}dm(t) = {(tA) : B}dm(t), t t (0,∞) [0,∞] where a = m({0}) and b = m({∞}). In this way, it is seen that there exists an affine one-to-one correspondence between the operator means and the probability measures on [0, ∞]. The above representation shows that every operator connection σ satisfies (v) (A σ B) + (C σ D) ≤ (A + C) σ (B + D), (vi) T ∗ (A σ B)T ≤ (T ∗ AT ) σ (T ∗ BT ) for every T ∈ B(H) and the equality holds if T is invertible. Now let σ be an operator mean with the corresponding function f . The integral representation of f implies that f is automatically infinitely many times differentiable. So let α = f ′ (1). The concavity of f implies that 0 ≤ α ≤ 1 and f (x) ≤ (1 − α) + αx for x ≥ 0. Since f (x−1 )−1 is again operator monotone and hence concave, we have x ≤ f (x) ≤ (1 − α) + αx, x ≥ 0. (4.2) (1 − α)x + α The operator mean σ ′ determined by A σ ′ B = B σ A is called the transpose of σ. The corresponding function of σ ′ is xf (x−1 ) because I σ ′ (xI) = (xI) σ I = x(I σ (x−1 I)) = xf (x−1 )I, x > 0. In particular, if σ is symmetric, i.e. A σ B = B σ A for all A, B, or equivalently f (x) = xf (x−1 ), then α = 1/2 and so (4.2) says that the arithmetic mean A ∇ B is the greatest and the harmonic mean A ! B is the least among the symmetric operator means. If α = 0, then f (x) ≡ 1 and A σ B = A. Also if α = 1, then f (x) = x and A σ B = B. All the results in this section become trivial in these two extremal cases. So we may assume 0 < α < 1 in the sequel discussions. Although the next lemma will be used only for α-power mean #α , we give it for general σ. Lemma 4.1. Let σ be an operator mean and f the corresponding operator monotone function. Let A, B ∈ B(H)+ . (1) Assume that A is invertible. Then B 7→ A σ B is continuous in the norm k · k∞ on B(H)+ . (2) Assume that f (0) = 0 and B is compact. If An ↓ A then kAn σ B − A σ Bk∞ → 0. P r o o f. (1) is immediate from the expression (4.1) because of the k · k∞ -continuity of functional calculus. (2) Choose a > 0 such that An ≤ aI. Then we have An σ B ≤ (aI) σ B = af (a−1 B) and af (a−1 B) ∈ C(H) from f (0) = 0. Hence Proposition 2.12 shows the assertion because An σ B ↓ A σ B. 4.2. Log-majorization for power operator means. The log-majorization in the next theorem was given in [6] in the matrix case, which is considered as complementary to Theorem 3.4. 159 LOG-MAJORIZATIONS AND NORM INEQUALITIES Theorem 4.2. If A, B ∈ B(H)+ and A is either invertible or compact , then µ(Ar #α B r ) ≺w(log) µ((A #α B)r ), (4.3) r ≥ 1, or equivalently µ((Ap #α B p )1/p ) ≺w(log) µ((Aq #α B q )1/q ), (4.4) p ≥ q > 0. P r o o f. First assume that both A and B are invertible. The proof below is the same as that of [6, Theorem 2.1]. For each n ∈ N it is easily checked from Lemma 3.1 that Λn (Ar #α B r ) = (Λn A)r #α (Λn B)r , Λn ((A #α B)r ) = ((Λn A) #α (Λn B))r . So it suffices to show that kAr #α B r k∞ ≤ k(A#α B)r k∞ , (4.5) n n r ≥ 1, because (4.3) follows from Lemma 3.2 by taking Λ A, Λ B instead of A, B in (4.5). To show (4.5), we may prove that A #α B ≤ I implies Ar #α B r ≤ I. When 1 ≤ r ≤ 2, let us write r = 2 − ε with 0 ≤ ε ≤ 1. Let C = A−1/2 BA−1/2 . Suppose A #α B ≤ I. Then C α ≤ A−1 and A ≤ C −α , (4.6) so that thanks to 0 ≤ ε ≤ 1, A1−ε ≤ C −α(1−ε) . (4.7) Now we have ε ε ε ε Ar #α B r = A1− 2 {A−1+ 2 B · B −ε · BA−1+ 2 }α A1− 2 ε = A1− 2 {A− 1−ε 2 CA1/2 (A−1/2 C −1 A−1/2 )ε A1/2 CA− 1−ε 2 = A1/2 {A1−ε #α [C(A #ε C −1 )C]}A1/2 ε }α A1− 2 ≤ A1/2 {C −α(1−ε) #α [C(C −α #ε C −1 )C]}A1/2 by using (4.6), (4.7), and the joint monotonicity of power means. Since C −α(1−ε) #α [C(C −α #ε C −1 )C] = C −α(1−ε)(1−α) [C(C −α(1−ε) C −ε )C]α = C α , we get (4.8) Ar #α B r ≤ A1/2 C α A1/2 = A #α B ≤ I. Therefore (4.3) is proved when 1 ≤ r ≤ 2. When r > 2, write r = 2k s with k ∈ N and 1 ≤ s ≤ 2. Repeating the above argument we have k−1 µ(Ar #α B r ) ≺w(log) µ(A2 .. . s k−1 #α B 2 s 2 ) k ≺w(log) µ(As #α B s )2 ≺w(log) µ(A #α B)r . Second assume that A is invertible while B is general. Since by Lemma 4.1(1) Ar #α B r = lim Ar #α Bεr and (A #α B)r = lim(A #α Bε )r ε↓0 ε↓0 160 F. HIAI in the norm k·k∞ , (4.3) follows from the first case and Proposition 1.4(10). Third assume that A is compact. The second case shows that µ(Bεr #1−α Ar ) ≺w(log) µ((Bε #1−α A)r ), r ≥ 1, ε > 0. Since Lemma 4.1(2) implies that Bε #1−α A → B #1−α A and Bεr #1−α Ar → B r #1−α Ar in k · k∞ , we get µ(B r #1−α Ar ) ≺w(log) µ((B #1−α A)r ), r ≥ 1, which is nothing but (4.3). Finally (4.4) readily follows from (4.3) as in the last part of proof of Theorem 3.4. By Theorem 4.2 and Proposition 2.10 we have: Corollary 4.3. Let A, B ∈ B(H)+ and assume that A is either invertible or compact. Let k · k be any symmetric norm. If f is a continuous increasing function on [0, ∞) such that f (0) ≥ 0 and f (ex ) is convex , then kf (Ar #α B r )k ≤ kf ((A #α B)r )k, r ≥ 1. In particular , kAr #α B r k ≤ k(A #α B)r k, r ≥ 1. Problem 4.4. It is desirable to prove Theorem 4.2 (hence Corollary 4.3) without the assumption of A being invertible or compact. We can do so if the convergence kA #α Bk∞ = limε↓0 k(A + εI) #α Bk∞ holds true for general A, B ∈ B(H)+ . However, this is not the case as the following example shows. Example 4.5. Let us consider the infinite direct sum of 2 × 2 matrices. Let P0 , ξn , L∞ L∞ and Qn be as in Example 3.11. Define P = n=1 P0 and Q = n=1 Qn , so that P #Q = ∞ M (P0 # Qn ), n=1 ∞ M 1 0 (P + εI) # Q = ((P0 + εI2 ) # Qn ) where I2 = . 0 1 n=1 We have (P0 + εI2 ) # Qn = (P0 + εI2 )1/2 {(P0 + εI2 )−1/2 ξn ⊗ (P0 + εI2 )−1/2 ξn }1/2 (P0 + εI2 )1/2 = (P0 + εI2 )1/2 k(P0 + εI2 )−1/2 ξn k2 1/2 (P0 + εI2 )−1/2 ξn (P0 + εI2 )−1/2 ξn ⊗ (P0 + εI2 )1/2 k(P0 + εI2 )−1/2 ξn k k(P0 + εI2 )−1/2 ξn k ξn ξn = k(P0 + εI2 )−1/2 ξn k ⊗ k(P0 + εI2 )−1/2 ξn k k(P0 + εI2 )−1/2 ξn k = k(P0 + εI2 )−1/2 ξn k−1 ξn ⊗ ξn . 161 LOG-MAJORIZATIONS AND NORM INEQUALITIES Since q n−1 n(1+ε) (P0 + εI2 )−1/2 ξn = q we get k(P0 + εI2 ) # Qn k∞ = 1 nε , 1 n−1 + n(1 + ε) nε −1/2 . Letting ε ↓ 0 yields kP0 # Qn k∞ = 0 (this follows also from P0 ∧Qn = 0; see [57, Theorem 3.7]). Hence P # Q = 0. On the other hand, we have −1/2 1 n−1 = (1 + ε)1/2 . k(P + εI) # Qk∞ = sup k(P0 + εI2 ) # Qn k∞ = sup + nε n≥1 n≥1 n(1 + ε) Therefore kP # Qk∞ = 0 < 1 = lim k(P + εI) # Qk∞ . ε↓0 Furthermore, for each 0 < δ < 1 and n ∈ N, we can choose αn > 0 such that k(P0 + αn I2 ) # (Qn + αn I2 )k∞ ≤ δ. L∞ Define A = n=1 (P0 + αn I2 ) and B = n=1 (Qn + αn I2 ), which are strictly positive. Since P ≤ A and Q ≤ B, we have L∞ kA # Bk∞ ≤ δ < 1 ≤ lim k(A + εI) # Bk∞ . ε↓0 The next proposition extends [2, Theorem 1] where H, K are bounded and α = 1/2. Proposition 4.6. If H ∈ B(H) is self-adjoint and K is a lower-bounded self-adjoint operator on H, then the following conditions are equivalent : (i) (1 − α)H + αK ≥ 0; (ii) e−tH #α e−tK ≤ I for all t ≥ 0; (iii) t 7→ e−tH #α e−tK is a decreasing function from [0, ∞) to B(H)+ . P r o o f. (i)⇒(ii). Suppose that (1−α)H +αK ≥ 0, i.e. K ≥ −α−1 (1−α)H. To prove (ii), it suffices to show that lim k(e−rH #α e−rK )1/r k∞ ≤ 1, (4.9) r↓0 1/r ke−rH #α e−rK k∞ = k(e−rH #α e−rK )1/r k∞ increases as r ↓ α{(1 − α)kHk∞ }−1 , since I + rK ≥ I − α−1 (1 − α)rH ≥ 0 and because 0<r< is invertible, we get e−rK ≤ (I + rK)−1 ≤ 0 by (4.4). When I − α−1 (1 − α)rH −1 1−α rH I− α as well as e−rH ≤ (I + rH)−1 for small r > 0. Therefore −1 −α 1−α 1−α −(1−α) −rH −rK −1 I− rH rH = (I + rH) , e #α e ≤ (I + rH) #α I − α α 162 F. HIAI so that for small r > 0 k(e −rH #α e −rK 1/r ) Since −(1−α)/r lim(1 + rλ) r↓0 k∞ −α/r 1−α −(1−α)/r . I− ≤ (I + rH) rH α ∞ −α/r 1−α 1−α 1− rλ = (eλ )−(1−α) (e− α λ )−α = 1 α uniformly in λ ∈ [−kHk∞ , kHk∞ ], it follows that −α/r 1−α −(1−α)/r = 0, rH − I I − (I + rH) lim r↓0 α ∞ implying (4.9). (ii)⇒(iii). What we have to prove is that if A, B ∈ B(H)+ and A is invertible, then A #α B ≤ I implies Ar #α B r ≤ A #α B for every r ≥ 1. When both A and B are invertible, the result was shown in the proof of Theorem 4.2 (see (4.8)). When A is invertible but B is not, let Bε = B + εI for ε > 0. Since A #α (kA #α Bε k−1/α Bε ) = kA #α Bε k−1 ∞ ∞ (A #α Bε ) ≤ I, −1/α the first case applied to A and kA #α Bε k∞ r A #α Bεr Bε shows that for r ≥ 1 ≤ kA #α Bε kr−1 ∞ (A #α Bε ). Letting ε ↓ 0 we have by Lemma 4.1(1) Ar #α B r ≤ kA #α Bkr−1 ∞ (A #α B) ≤ A #α B, as desired. (iii)⇒(i). For every ε > 0 and t > 0, it follows from (4.2) that e−tH #α (e−tK + εI) = e−tH/2 {etH/2 (e−tK + εI)etH/2 }α e−tH/2 ≥ e−tH/2 {(1 − α)I + αe−tH/2 (e−tK + εI)−1 e−tH/2 }−1 e−tH/2 ≥ e−tH/2 {(1 − α)I + αe−tH/2 etK e−tH/2 }−1 e−tH/2 = {(1 − α)etH + αetK }−1 , which shows that e−tH #α e−tK ≥ {(1 − α)etH + αetK }−1 . Hence (iii) implies that (1− α)etH + αetK ≥ I for every t > 0. So for every n ∈ N we have (4.10) αn(eK/n − I) ≥ −(1 − α)n(eH/n − I). T∞ Let K = b λdEK (λ) be the spectral decomposition. For any ξ ∈ EK (c)H with b < c < ∞, (4.10) says that c \ α n(eλ/n − 1)dkEK (λ)ξk2 ≥ −(1 − α)hn(eH/n − I)ξ, ξi. b Tc The above left-hand side tends as n → ∞ to α b λdkEK (λ)ξk2 = αhKξ, ξi, while the right-hand side tends to −(1 − α)hHξ, ξi. Therefore h((1 − α)H + αK)ξ, ξi ≥ 0. This S implies (i) because c>b EK (c)H is a core of (1 − α)H + αK. LOG-MAJORIZATIONS AND NORM INEQUALITIES 163 R e m a r k 4.7. By using the Furuta inequality [28], a more general result for positive invertible operators was shown in [29] (also [27]) as follows: If A, B ∈ B(H)+ are invertible, the following conditions, among others, are equivalent: (I) log A ≥ log B; r (II) Ar ≥ (Ar/2 B p Ar/2 ) p+r for all p, r ≥ 0; t+2r (III) for any t ≥ 0, A−r (Ar B p Ar ) p+2r A−r is a decreasing function of both p ≥ t and 1−α r ≥ 0 to B(H)+ . If we set α = r/(p+r), then the above (II) is written as A−r #α B α r ≤ I for all r ≥ 0 and 0 ≤ α ≤ 1. Thus it is seen that (I) ⇔ (II) is nothing but (i) ⇔ (ii) of Proposition 4.6. To obtain the complementary counterpart of Theorem 3.7, let us introduce the additive version of antisymmetric tensor powers. For a lower-bounded self-adjoint operator H on H and n ∈ N, we define the lower-bounded self-adjoint operator Σ n H on Λn H by n X Σ nH = {(⊗m−1 I) ⊗ H ⊗ (⊗n−m I)} n . Λ H m=1 More precisely, for any A ∈ B(H), since it is easily checked that n X (4.11) (⊗m−1 I) ⊗ A ⊗ (⊗n−m I) m=1 commutes with the projection of ⊗n H onto Λn H, we can define Σ n A as the T restriction of T k ∞ n (4.11) to Λ H. Let Hk = a λdEH (λ) with the spectral decomposition H = a λdEH (λ). Then Σ n H is the limit of an increasing sequence {Σ n Hk }∞ k=1 of (mutually commuting) self-adjoint operators in B(Λn H). We prove (2) below as stated, while it will be used in the obvious case when H is bounded. Lemma 4.8. Let H and K be lower-bounded self-adjoint operators on H. For n ∈ N define Σ n H and Σ n K as above. n (1) Λn (e−H ) = e−Σ H . (2) Σ n (H +̂K) = (Σ n H)+̂(Σ n K). P r o o f. We may assume H, K ≥ 0 by taking H + aI, K + aI. (1) If H is bounded, then we have n Y n e−Σ H = {(⊗m−1 I) ⊗ e−H ⊗ (⊗n−m I)} n = ⊗n e−H |Λn H = Λn (e−H ). m=1 Tk Λ H For general H ≥ 0 let Hk = 0 λdEH (λ). Then I ≥ (I + Hk )−1 ↓ (I + H)−1 as k → ∞, which implies that e−Hk → e−H in SOT, because e−H is the functional calculus of (I + H)−1 for f (x) = exp(1 − x−1 ) on [0, 1]. Hence Λn (e−Hk ) → Λn (e−H ) in SOT by n n Lemma 3.1(4). Similarly e−Σ Hk → e−Σ H in SOT, showing the assertion. (2) Let P0 be the projection of H onto the closure H0 of D(H 1/2 ) ∩ D(K 1/2 ). Then Σ n (H +̂K) is a positive self-adjoint operator on Λn H0 (⊂ Λn H). Since Λn P0 is the n projection of Λn H onto Λn H0 and (1) says that e−Σ (H +̂K) = Λn (e−(H +̂K) ) on Λn H0 , we have n e−Σ (H +̂K) · Λn P0 = Λn (e−(H +̂K) ), 164 F. HIAI where e−(H +̂K) = e−(H +̂K) P0 by convention in Theorem 3.6. Hence using Theorem 3.6(1), Lemma 3.1, and (1), we have e−Σ n (H +̂K) · Λn P0 = s-lim Λn ((e−H/k e−K/k )k ) = s-lim(Λn (e−H )1/k Λn (e−K )1/k )k k→∞ = s-lim(e k→∞ k→∞ −(Σ n H)/k −(Σ n K)/k k ) = e−((Σ e n H)+̂(Σ n K)) . Indeed, this shows that (Σ n H)+̂(Σ n K) is a self-adjoint operator on Λn H0 and Σ n (H +̂K) = (Σ n H)+̂(Σ n K). Theorem 4.9. If H ∈ B(H) is self-adjoint and K is a lower-bounded self-adjoint operator on H, then (e−rH #α e−rK )1/r ≺w(log) e−((1−α)H+αK) , r > 0. P r o o f. Lemmas 3.1 and 4.8 imply that Λn ((e−rH #α e−rK )1/r ) = (Λn (e−rH ) #α Λn (e−rK ))1/r = (e−rΣ Λn (e−((1−α)H+αK) ) = e−((1−α)Σ n n H+αΣ K) n H #α e−rΣ n K 1/r ) , . So, as in the proof of Theorem 4.2, we may show that k(e−rH #α e−rK )1/r k∞ ≤ ke−((1−α)H+αK) k∞ . Thus it suffices to show that if e−((1−α)H+αK) ≤ I, i.e. (1 − α)H + αK ≥ 0, then e−rH #α e−rK ≤ I for all r > 0. But this is (i)⇒(ii) of Proposition 4.6. Problem 4.10. It is desirable to prove Theorem 4.9 when both H and K are lowerbounded. In view of Lemma 4.8, the problem consists in proving that (1 − α)H +̂αK ≥ 0 implies e−tH #α e−tK ≤ I for t ≥ 0. 4.3. Exponential product formula for operator means. The next theorem is a Trotterlike product formula for operator means. Theorem 4.11. Let σ be an operator mean with α = f ′ (1) for the corresponding operator monotone function f . If H ∈ B(H) is self-adjoint and K is a lower-bounded self-adjoint operator on H, then s-lim(e−rtH σ e−rtK )1/r = e−t((1−α)H+αK) , r↓0 t > 0, the convergence being uniform in t ∈ [a, b] for any 0 < a < b. We divide the proof of the theorem into several lemmas. Lemma 4.12. Let G(t), G1 (t), and G2 (t) be SOT-continuous functions from [0, ∞) to B(H) such that G(t), G1 (t), and G2 (t) mutually commute for any t ≥ 0 and such that for some a ≥ 0, 0 ≤ G1 (t) ≤ G(t) ≤ G2 (t) ≤ eat I, t ≥ 0. Let S be a self-adjoint operator on H and D a core of S. If Gi (t) − I lim ξ + Sξ ξ ∈ D, i = 1, 2, = 0, t↓0 t LOG-MAJORIZATIONS AND NORM INEQUALITIES then G(t) − I lim ξ + Sξ = 0, t↓0 t 165 ξ ∈ D. P r o o f. Replacing G(t), G1 (t), G2 (t) by those with multiple e−at and S by S + aI, it suffices to show the case a = 0. Let ξ ∈ D. Since by assumption I − G(t) I − G1 (t) I − G2 (t) ≤ ≤ , 0≤ t t t we get I − G2 (t) I − G(t) I − G1 (t) ξ, ξ ≤ ξ, ξ ≤ ξ, ξ . t t t This implies that ht−1 (I − G(t))ξ, ξi → hSξ, ξi as t ↓ 0. By polarization we have I − G(t) (4.12) ξ, η → hSξ, ηi, η ∈ D. t Thanks to the commuting assumption of G(t), G1 (t), G2 (t), we also get 2 2 2 I − G(t) I − G1 (t) I − G2 (t) ≤ ≤ , t t t so that I − G2 (t) I − G(t) I − G1 (t) ξ ≤ ξ ≤ ξ . t t t Hence kt−1 (I −G(t))ξk → kSξk and t−1 (I −G(t))ξ, t > 0, are bounded. These and (4.12) imply that t−1 (I − G(t))ξ → Sξ weakly. Therefore I − G(t) I − G(t) 2 I − G(t) ξ − Sξ = ξ − 2Re ξ, Sξ + kSξk2 t t t showing the conclusion. → kSξk2 − 2hSξ, Sξi + kSξk2 = 0, Let H, K be as in Theorem 4.11 and define F (t) = e−tH σ e−tK , t ≥ 0. Let L = (1 − α)H + αK and D = D(K), which is the domain of L. Then: Lemma 4.13. F (t) − I ξ + Lξ lim = 0, t↓0 t ξ ∈ D. P r o o f. Define A(t) = etH/2 e−tK etH/2 , so that F (t) = e−tH/2 f (A(t))e−tH/2 for t ≥ 0. Choose a > 0 such that H ≤ aI and K + aI ≥ 0. We have F (t) − I e−tH/2 − I f (A(t)) − I e−tH/2 − I ξ = e−tH/2 f (A(t)) ξ + e−tH/2 ξ+ ξ. t t t t Since A(t) → I in SOT and hence f (A(t)) → I in SOT, the first and third terms in the right-hand side of (4.13) strongly converge as t ↓ 0 to (−H/2)ξ, so that what remains to show is f (A(t)) − I ξ + α(K − H)ξ ξ ∈ D. (4.14) lim = 0, t↓0 t (4.13) 166 F. HIAI Define G(t) = f (A(t)), G1 (t) = A(t)((1 − α)A(t) + αI)−1 , and G2 (t) = (1 − α)I + αA(t), which mutually commute for each t ≥ 0. Since G2 (t) ≤ e2at I and (4.2) yields 0 ≤ G1 (t) ≤ G(t) ≤ G2 (t), we can apply Lemma 4.12. Let ξ ∈ D. It is easy to check that G2 (t) − I = α A(t) − I ξ + (K − H)ξ → 0. ξ + α(K − H)ξ t t Furthermore, we get G1 (t) − I ξ + α(K − H)ξ t −1 A(t) − ((1 − α)A(t) + αI) ≤ ((1 − α)A(t) + αI) ξ + α(K − H)ξ t + k{((1 − α)A(t) + αI)−1 − I}α(K − H)ξk A(t) − I + (1 − α)k(A(t) − I)(K − H)ξk → 0 ξ + (K − H)ξ ≤ t thanks to k((1 − α)A(t) + αI)−1 k∞ ≤ α−1 . Hence Lemma 4.12 implies (4.14). To prove the theorem, we may assume H, K ≥ 0 by taking H + aI, K + aI, so that 0 ≤ F (t) ≤ I for all t ≥ 0. Now let us proceed as in [19]. Fix any sequence {sn } such that 0 < sn → ∞. Let Ln = sn (I − F (s−1 n )) so that Ln ≥ 0. Then Lemma 4.13 says that k(L − Ln )ξk → 0 for all ξ ∈ D. Lemma 4.14. sn − e−Ln ) = 0. s-lim(F (s−1 n ) n→∞ P r o o f. For any ξ ∈ H we have sn kF (s−1 n ) ξ −e −Ln ∞ X skn −1 k −sn −1 sn F (sn ) ξ ξk = F (sn ) ξ − e k! k=0 ≤ e−sn ≤ e−sn Since r 0 ≤ I − F (s−1 n ) ≤ it follows that ∞ X sk n k=0 ∞ X k=0 k! k sn − F (s−1 k(F (s−1 n ) )ξk n ) skn |k−sn | k(I − F (s−1 )ξk. n ) k! r(I − F (s−1 n )), I − F (s−1 n ), kF (sn−1 )sn ξ − e−Ln ξk ≤ e−sn k(I − F (s−1 n ))ξk r ≥ 1, 0 ≤ r ≤ 1, ∞ X skn |k − sn | + k(I − F (s−1 n ))ξk. k! k=0 The Schwarz inequality gives 1/2 X 1/2 X ∞ ∞ ∞ X skn skn skn 2 sn |k − sn | ≤ (k − sn ) = esn /2 (sn esn )1/2 = s1/2 n e . k! k! k! k=0 k=0 k=0 LOG-MAJORIZATIONS AND NORM INEQUALITIES Hence for every ξ ∈ D, 167 1/2 −1 −Ln sn ξk ≤ (s1/2 kF (s−1 n + 1)k(I − F (sn ))ξk = n ) ξ−e sn + 1 kLn ξk → 0. sn sn and e−Ln are contractions. This shows the assertion because F (s−1 n ) The next step is much simpler than that in [19], because the functional calculus can be applied in our setting. Lemma 4.15. s-lim e−Ln = e−L . n→∞ P r o o f. First note that 0 ≤ (I + Ln )−1 ≤ I and 0 ≤ (I + L)−1 ≤ I. If η = (I + L)ξ with ξ ∈ D, then k(I + Ln )−1 η − (I + L)−1 ηk = k(I + Ln )−1 {(I + Ln )ξ + (L − Ln )ξ} − ξk = k(I + Ln )−1 (L − Ln )ξk ≤ k(L − Ln )ξk → 0. This implies that (I + Ln )−1 → (I + L)−1 in SOT. Now the result follows due to the functional calculus as in the proof of Lemma 4.8(1). P r o o f o f T h e o r e m 4.11 (Completion). Lemmas 4.14 and 4.15 show that sn F (s−1 → e−L in SOT if 0 < sn → ∞; namely n ) (4.15) s-lim F (rn )1/rn = e−L if 0 < rn → 0. n→∞ Replacing H, K by tH, tK with t > 0 yields s-lim(e−rtH σ e−rtK )1/r = e−tL , r↓0 t > 0. Finally it is an easy task to show the uniform convergence. Indeed, let 0 < a < b and suppose that the above convergence is not uniform in t ∈ [a, b]. Then there exist ξ ∈ H, ε > 0, rn ↓ 0, and tn ∈ [a, b] such that (4.16) kF (rn tn )1/rn ξ − e−tn L ξk ≥ ε, n ∈ N. Choosing a subsequence we may assume that tn → t. Then since λtn → λt uniformly in λ ∈ [0, 1], it follows from (4.15) that F (rn tn )1/rn = (F (rn tn )1/rn tn )tn → e−tL in SOT as well as ke−tn L − e−tL k∞ → 0, contradicting (4.16). It would be expected that Theorem 4.9 is a consequence of Theorems 4.2 and 4.11. Qn However this is not truly the case, because i=1 µi (A) is not continuous in SOT. It is worth noting the following specialization to the matrix case. Corollary 4.16. For arbitrary Hermitian matrices H and K, d tH e σ etK = (1 − α)H + αK, dt t=0 lim (etH σ etK )1/t = e(1−α)H+αK , t→0 d log(etH σ etK ) = (1 − α)H + αK. dt t=0 P r o o f. Theorem 4.11 gives the second, while the first follows from Lemma 4.13. For the third, take the logarithm of the second. 168 F. HIAI R e m a r k 4.17. Theorem 4.11 fails to hold when both H and K are lower-bounded. Indeed, Kosaki [56] gave an example of positive self-adjoint operators H, K such that D(H 1/2 ) ∩ D(K 1/2 ) is dense in H and hence e−(H +̂K)/2 is non-zero, but e−rH : e−rK = 0 for each r > 0. 4.4. Norm inequalities of complementary Golden–Thompson type. Theorems 4.9 and 4.11 show: Corollary 4.18. Let σ be an operator mean such that σ ≤ #α , i.e. f (x) ≤ xα for x ≥ 0 where f is the corresponding operator monotone function. If H and K are as in Theorem 4.11 and k · k is a symmetric norm, then k(e−rH σ e−rK )1/r k ≤ ke−((1−α)H+αK) k, (4.17) r > 0, and moreover (4.18) lim k(e−rH σ e−rK )1/r k = ke−((1−α)H+αK) k. r↓0 If σ = #α then the left-hand side of (4.17) increases to ke−((1−α)+αK) k as r ↓ 0. P r o o f. Since e−rH σ e−rK ≤ e−rH #α e−rK by assumption, Theorem 4.9 gives for every r > 0 k(e−rH σ e−rK )1/r k ≤ k(e−rH #α e−rK )1/r k ≤ ke−((1−α)H+αK) k. On the other hand, in view of Proposition 2.11, Theorem 4.11 implies that ke−((1−α)H+αK) k ≤ lim inf k(e−rH σ e−rK )1/r k. r↓0 Therefore the first part is shown. The second part is immediate from (4.4). In particular, if σ is a symmetric operator mean with σ ≤ # and H, K are as above, then (4.19) k(e−2rH σ e−2rK )1/r k ≤ ke−(H+K) k, r > 0, −(H+K) and the left-hand side converges to ke k as r ↓ 0. For instance, when σ = ! or σ = #, the left-hand side of (4.19) increases to the right as r ↓ 0. Norm inequalities such as (4.17) and (4.19) are considered as complementary to Golden–Thompson ones (see [6, 44, 71]). R e m a r k s 4.19. (1) Let H = 0, K = xI for x ∈ R, and k · k = k · k∞ . Then (4.17) menas that f (e−rx ) ≤ e−αrx for r > 0 and so σ ≤ #α . Thus the assumption σ ≤ #α is essential in Corollary 4.18. (2) Inequality (4.19) does not hold for a general symmetric mean. Indeed, if e−K ∈ Cp (H) where 0 < p < ∞ then, for any bounded H, e−(H+K) ∈ Cp (H) by (3.11), but k(e−2rH ∇ e−2rK )1/r kp = ∞ for all r > 0. When we are concerned with (quasi-)norms k · kp , the following holds: Proposition 4.20. Let σ be as in Corollary 4.18. Let H and K be lower-bounded self-adjoint operators such that H + K is essentially bounded. If e−K ∈ Cp (H) where 0 < p < ∞, then k(e−rH/(1−α) σ e−rK/α )1/r kp ≤ ke−(H+K) kp , r > 0. LOG-MAJORIZATIONS AND NORM INEQUALITIES 169 P r o o f. The case p = 1 is enough because we may replace H, K by pH, pK (see −K −(H+K) the proof of Corollary ∈ C1 (H) by Tn 3.10). So assume that e ∈ C1 (H). Hence e (3.11). Let Hn = a λdEH (λ). Then we proved the convergence (3.22). For each r > 0, since e−rH/(1−α) ≤ e−rHn /(1−α) , we get k(e−rH/(1−α) σ e−rK/α )1/r k1 ≤ k(e−rHn /(1−α) σ e−rK/α )1/r k1 ≤ ke−(Hn +K) k1 by (4.17). This and (3.22) yield the conclusion. 4.5. Norm convergence of exponential product formula for operator means. In this subsection let us consider norm convergence of (e−rH/(1−α) σ e−rK/α )1/r as r ↓ 0 when k · k is uniformly convex or k · k = k · kp for 0 < p < ∞. In Banach space theory, there are several important geometric notions of convexity or smoothness for norms (see e.g. [13]). For instance, a Banach space X is said to be uniformly convex if for each ε > 0 there exists δ > 0 such that, for any x, y ∈ X , k(x + y)/2k ≤ 1 − δ holds whenever kxk = kyk = 1 and kx − yk ≥ ε. A Hilbert space is a typical example of a uniformly convex Banach space. As is well known, a uniformly convex Banach space is reflexive. A useful property of a uniformly convex space X is that if {xj } ⊂ X weakly converges to x ∈ X and kxj k → kxk, then kxj − xk → 0. This property is typical in a Hilbert space, as was already used in the proof of Theorem 3.12. When 1 < p < ∞, the uniform convexity of Cp (H) is an immediate consequence of the Clarkson–McCarthy inequalities: kA + Bkpp + kA − Bkpp ≤ 2p−1 (kAkpp + kBkpp ), 2 ≤ p < ∞, kA + Bkqp + kA − Bkqp ≤ 2(kAkpp + kBkpp )q/p , 1 < p ≤ 2, where 1/p + 1/q = 1. (The proof in the most general setup is found in [26].) Now let Φ be a symmetric gauge function, Φ′ the conjugate one, and k·k the symmetric norm corresponding to Φ. Assume that CΦ (H) (or k · k) is uniformly convex. Then the reflexivity of CΦ (H) implies as remarked after Theorem 2.9 that both Φ and Φ′ are regular and so CΦ (H)∗ ∼ = CΦ′ (H). Lemma 4.21. Assume that CΦ (H) is uniformly convex. If {Aj } ⊂ CΦ (H), supj kAj k < ∞, and Aj → A in WOT , then Aj → A in w(CΦ (H), CΦ′ (H)), the weak topology. P r o o f. First the assumptions give A ∈ CΦ (H) by Proposition 2.11. Since Φ′ is regular as remarked above, Cfin (H) is dense in CΦ′ (H). By the WOT-convergence of {Aj } we get tr((Aj − A)B) → 0 for any B ∈ Cfin (H). In view of Theorem 2.9, this and the k · kboundedness imply the conclusion. Corollary 4.22. Let σ be as in Corollary 4.18. Assume that k·k is uniformly convex. If H and K are as in Theorem 4.11 and e−K ∈ CΦ (H), then lim k(e−rH/(1−α) σ e−rK/α )1/r − e−(H+K) k = 0. r↓0 P r o o f. By (3.11), (4.17), Theorem 4.11, and Lemma 4.21 altogether, we have as r ↓ 0, (e−rH/(1−α) σ e−rK/α )1/r → e−(H+K) in w(CΦ (H), CΦ′ (H)). Hence the result follows from the uniform convexity and (4.18). 170 F. HIAI Concerning k · kp , 0 < p < ∞, we have: Corollary 4.23. Let σ be as in Corollary 4.18. If H and K are as in Theorem 4.11 and e−K ∈ Cp (H) where 0 < p < ∞, then lim k(e−rH/(1−α) σ e−rK/α )1/r − e−(H+K) kp = 0. r↓0 P r o o f. The case 1 < p < ∞ is included in Corollary 4.22. Let 0 < p ≤ 1 and choose k ∈ N such that 2k > 1/p. Applying Corollary 4.22 to k · k2k p and 2−k H, 2−k K, we have k lim k(e−rH/(1−α) σ e−rK/α )1/2 (4.20) k r − e−(H+K)/2 k2k p = 0. r↓0 Noting that kX +Y kq ≤ 21/q (kXkq +kY kq ) for any q > 0, we get by the Hölder inequality k−1 k(e−rH/(1−α) σ e−rK/α )1/2 k−1 ≤ 21/2 p r k−1 − e−(H+K)/2 k {k(e−rH/(1−α) σ e−rK/α )1/2 k × [(e−rH/(1−α) σ e−rK/α )1/2 k k−1 p r k r + k[(e−rH/(1−α) σ e−rK/α )1/2 ≤ 21/2 k2k−1 p − e−(H+K)/2 ]k2k−1 p r k k − e−(H+K)/2 ]e−(H+K)/2 k2k−1 p } k k(e−rH/(1−α) σ e−rK/α )1/2 r k − e−(H+K)/2 k2k p k k × {k(e−rH/(1−α) σ e−rK/α )1/2 r k2k p + ke−(H+K)/2 k2k p }. This implies by (4.20) and Proposition 4.20 that k−1 lim k(e−rH/(1−α) σ e−rK/α )1/2 r↓0 r k−1 − e−(H+K)/2 k2k−1 p = 0. Repeating this argument yields the result. 5. Miscellaneous results. In this section we first discuss the interplay between log-majorizations and the Furuta inequalities [28, 30]. Several log-majorization results are obtained in Subsection 5.2. Furthermore, we give determinant inequalities as simple applications of the log-majorization results. 5.1. Interplay between log-majorization and Furuta inequlity. The Furuta inequality [28] says that if A ≥ B ≥ 0 in B(H)+ , then (B r/2 Ap B r/2 )α ≥ B α(p+r) , Aα(p+r) ≥ (Ar/2 B p Ar/2 )α , (5.1) whenever 0 ≤ α ≤ 1, p, r ≥ 0, and 1 + r ≥ α(p + r). As was clarified in [6], we can transform Furuta type operator inequalities into logmajorizations and vice versa. For instance, the Furuta inequality (5.1) is reformulated in terms of log-majorization as follows. Proposition 5.1. For every A, B ∈ B(H)+ , if 0 < α < 1, p ≥ 0, and 0 ≤ r ≤ min{α, αp}, then p−r µ(A 1−α #α (Ar/2α B p/α Ar/2α )) ≺w(log) µ((A1/2 BA1/2 )p ). Furthermore, the assumption r ≥ 0 is removed when A is invertible. LOG-MAJORIZATIONS AND NORM INEQUALITIES 171 P r o o f. We may assume that A is invertible. Let 0 < α < 1, q, s ≥ 0, and 1 + s ≥ α(q + s). Then it is seen from (5.1) that A1/2 BA1/2 ≤ I or A−1 ≥ B implies Aα(q+s) #α (A(α(q+s)−s)/2 B q A(α(q+s)−s)/2 ) = Aα(q+s)/2 (A−s/2 B q A−s/2 )α Aα(q+s)/2 ≤ I. Arranging the order of homogeneity, we have kAα(q+s) #α (A(α(q+s)−s)/2 B q A(α(q+s)−s)/2 )k∞ ≤ k(A1/2 BA1/2 )αq k∞ . As in the proofs of Theorems 3.4 and 4.2, the antisymmetric tensor technique yields µ(Aα(q+s) #α (A(α(q+s)−s)/2 B q A(α(q+s)−s)/2 )) ≺w(log) µ((A1/2 BA1/2 )αq ). Now putting p = αq and r = α(α(q+s)−s), we get the conclusion, where the assumptions q, s ≥ 0 and 1 + s ≥ α(q + s) are transformed into p ≥ 0 and r ≤ min{α, αp}. Corollary 5.2. If A, B ∈ B(H)+ and A is invertible, then for every 0 ≤ α ≤ 1, µ(Aα (A−1/2 BA−1/2 )α Aα ) ≺w(log) µ(Aα/2 B α Aα/2 ) ≺w(log) µ((A1/2 BA1/2 )α ). P r o o f. The case 0 < α < 1 is enough. The second log-majorization is nothing but (3.6). For the first, take p = 1 and r = 2α − 1 in Proposition 5.1. Then we have µ(A(A−1/2α B 1/α A−1/2α )α A) ≺w(log) µ(A1/2 BA1/2 ), which gives the result by replacing A, B by Aα , B α , respectively. Also, we can consider the converse direction from log-majorizations to operator inequalities. For instance, the following is a reformulation of the log-majorization (4.3). Proposition 5.3. If A ≥ B ≥ 0 and A is invertible, then (5.2) Ar ≥ {Ar/2 (A−1/2 B p A−1/2 )r Ar/2 }1/p , p, r ≥ 1. P r o o f. Let A, B ∈ B(H)+ with A invertible. Let 0 < α ≤ 1. Then (4.3) says that A ≥ (A−1/2 BA−1/2 )α implies A−r ≥ (A−r/2 B r A−r/2 )α for every r ≥ 1. Put p = 1/α and replace A−1 , (A−1/2 BA−1/2 )α by A, B, respectively. Then the result follows. −1 Recently Furuta [30] obtained a quite general operator inequality extending the original Furuta inequality as well as (5.2), which is stated in the following: Theorem 5.4. Assume that A ≥ B ≥ 0 and A is invertible. Then (5.3) 1−t+r A1−t+r ≥ {Ar/2 (A−t/2 B p A−t/2 )s Ar/2 } (p−t)s+r , whenever 0 ≤ t ≤ 1, p ≥ 1, rget, and s ≥ 1. Furthermore, for every 0 ≤ t ≤ 1 and p ≥ 1, 1−t+r (r, s) 7→ A−r/2 {Ar/2 (A−t/2 B p A−t/2 )s Ar/2 } (p−t)s+r A−r/2 is a decreasing function of both r ≥ t and s ≥ 1. Note that (5.1) is a consequence of (5.3) in case of t = 0 and s = 1. Also (5.2) is a special case of (5.3) when t = 1 and r = s ≥ 1. In this way, the operator inequality (5.3) interpolates (5.1) and (5.2) by the parameter 0 ≤ t ≤ 1. In [30] several log-majorizations extending (4.3) were produced from (5.3) in the same way as in the proof of Proposition 5.1. Among others, the following is obtained from (5.3) by taking t = 1 and p = 1/α. But for A compact, see the proof of Theorem 4.2. 172 F. HIAI Theorem 5.5. If A, B ∈ B(H)+ and A is either invertible or compact , then for every 0 ≤ α ≤ 1 and r, s ≥ 1 µ(Ar #αq/s B s ) ≺w(log) µ((A #α B)q ) where q = ((1 − α)r−1 + αs−1 )−1 . 5.2. Other log-majorizations. In this subsection let us show some other log-majorization results. The first two propositions were given in [6] in the matrix case, while the remaining are new. For any T ∈ B(H), the real part of T is Re T = (T + T ∗ )/2, and the exponential P∞ operator eT is defined as the power series expansion eT = n=0 T n /n!. The following log-majorization (1) for exponential operator is essentially due to [14]. Proposition 5.6. Let T ∈ B(H). (1) µ(eT ) ≺w(log) µ(eRe T ). (2) Assume that Re T ≥ 0. Then |eT | ≥ I (or log |eT | ≥ 0) and µ(log |eT |) ≺w µ(Re T ). Hence k log |eT | k ≤ kRe T k for any symmetric norm k · k. P r o o f. (1) For every X ∈ B(H) we get by (1.14) µ(X k ) ≺w(log) µ(X)k , Putting X = eT /k in the above yields ∗ k ∈ N. µ(eT ) ≺w(log) µ(|eT /k |k ), k ∈ N. But |eT /2k |2k = (eT /2k eT /2k )k converges in the norm k · k∞ to e(T k → ∞. Hence the assertion is shown by Proposition 1.4(10). (2) If Re T ≥ 0, then since ∗ ∗ ∗ +T )/2 = eRe T as ∗ |eT |−1 = (eT eT )−1/2 = (e−T e−T )1/2 = |e−T |, we get by (1), ∗ k |eT |−1 k∞ ≤ keRe(−T ) k∞ = ke−Re T k∞ ≤ 1, so that |eT | ≥ I. Then the required weak majorization is immediate from (1), because µn (log |eT |) = log µn (eT ) and µn (Re T ) = log µn (eRe T ). Proposition 5.7. Let A, B ∈ B(H)+ . (1) µ(Ap1 B q1 Ap2 B q2 . . . Apk B qk ) ≺w(log) µ(Ap1 +...+pk B q1 +...+qk ) if 0 ≤ p1 ≤ q1 ≤ p1 + p2 ≤ q1 + q2 ≤ . . . ≤ q1 + . . . + qk−1 ≤ p1 + . . . + pk = q1 + . . . + qk (in particular , if pj = qj ≥ 0 for all j). (2) µ(Ap B q Ar ) ≺w(log) µ(Ap+r B q ) for every p, q, r ≥ 0. (3) µ((AB)k A) ≺w(log) µ(A(k+1)/2 B k A(k+1)/2 ) for every k ∈ N. P r o o f. (1) We may assume that p1 + . . . + pk = q1 + . . . + qk = 1. It suffices as in the proof of Theorem 3.4 to show that when A, B are invertible, kABk∞ ≤ 1 implies kAp1 B q1 . . . Apk B qk k∞ ≤ 1. Suppose kABk∞ ≤ 1 or BA2 B ≤ I. Then A2 ≤ B −2 and B 2 ≤ A−2 . Since 0 ≤ q1 − p1 ≤ 1, we get B q1 A2p1 B q1 ≤ B q1 B −2p1 B q1 = B 2(q1 −p1 ) ≤ A2(p1 −q1 ) 173 LOG-MAJORIZATIONS AND NORM INEQUALITIES and hence Ap2 B q1 A2p1 B q1 Ap2 ≤ A2(p1 +p2 −q1 ) ≤ B 2(q1 −p1 −p2 ) . Repeating this argument yields B qk Apk . . . B q1 A2p1 B q1 . . . Apk B qk ≤ B 2(q1 +...+qk −p1 −...−pk ) = I, so that kAp1 B q1 . . . Apk B qk k∞ ≤ 1. (2) Put p1 = p, p2 = r, q1 = p + r, and q2 = 0 in (1). Then q q µ(Ap B q Ar ) = µ(Ap (B p+r )p+r Ar ) ≺w(log) µ(Ap+r (B p+r )p+r ) = µ(Ap+r B q ). (3) When k = 2m − 1 we get µ((AB)2m−1 A) = µ((AB)m−1 AB 1/2 )2 2m = µ({(A 2m−1 ) 2m−1 2m 2m B}m−1 (A 2m−1 ) 2m−1 2m B 1/2 )2 ≺w(log) µ(Am B (2m−1)/2 )2 = µ(Am B 2m−1 Am ). The log-majorization in the above follows from (1) with 2m − 1 p1 = . . . = pm = , q1 = . . . = qm−1 = 1, 2m The derivation is similar when k = 2m. qm = 1 . 2 Let H and K be lower-bounded self-adjoint operators on H. For t > 0 put S(t) = e−tH/2 e−tK e−tH/2 , which is the second order approximant of e−t(H +̂K) . Then {S(t/n)S((1 − 2t)/n)S(t/n)}n and {S(t/n)2 S((1 − 4t)/n)S(t/n)2 }n are sometimes used P as higher order approximants of e−(H +̂K) . When p1 , . . . , pk ≥ 0 and kj=1 pj = 1, Proposition 5.7(1) gives µ(S(p1 )S(p2 ) . . . S(pk )) ≺w(log) µ(e−H e−K ). A special case of the norm inequality proved by Kato [50] is the following: r kAr XB r k∞ ≤ kXk1−r ∞ kAXBk∞ , (5.4) 0 ≤ r ≤ 1, for every A, B ∈ B(H)+ and X ∈ B(H). For the proof of (5.4), it is enough to show that log kAr XB r k∞ is convex in r ∈ [0, 1]. This follows because for 0 ≤ r ≤ s ≤ 1, kA r+s 2 XB r+s 2 k2∞ = r(B r+s 2 X ∗ Ar+s XB r+s 2 ) = r(B r X ∗ Ar As XB s ) ≤ kAr XB r k∞ kAs XB s k∞ . Applying the antisymmetric tensor technique to (5.4), we have the following generalization of Theorem 3.4. Proposition 5.8. For every A, B ∈ B(H)+ and X ∈ B(H), (5.5) µ(Ar XB r ) ≺w(log) µ(X)1−r µ(AXB)r , 0 ≤ r ≤ 1, or equivalently µ(AXB)r ≺w(log) µ(X)r−1 µ(Ar XB r ), r ≥ 1. This yields the norm inequality proved by Kittaneh [52] as follows. Corollary 5.9. Let A, B ∈ B(H)+ , X ∈ B(H), and k · k be a symmetric norm. Then (5.6) kAr XB r k ≤ kXk1−r kAXBkr , 0 ≤ r ≤ 1. 174 F. HIAI P r o o f. Let Φ be the corresponding gauge function. For each 0 < r < 1, we have by Proposition 1.3 and by Lemmas 2.2 and 2.13(1) kAr XB r k ≤ Φ(µ(X)1−r µ(AXB)r ) ≤ Φ(µ(X))1−r Φ(µ(AXB))r = kXk1−r kAXBkr . We have also µ(Ar XB 1−r ) ≺w(log) µ(AX)r µ(XB)1−r , (5.7) 0 ≤ r ≤ 1, modifying (5.5) in the same way as in [52] where it is observed that (5.6) is equivalently reformulated as kAr XB 1−r k ≤ kAXkr kXBk1−r , 0 ≤ r ≤ 1. In fact, to check (5.7), we may assume that B is invertible. Then for 0 ≤ r ≤ 1 we get by (5.5) µ(Ar XB 1−r ) = µ(Ar (XB)B −r ) ≺w(log) µ(XB)1−r µ(A(XB)B −1 )r = µ(AX)r µ(XB)1−r . Furthermore, for any A, B, X ∈ B(H), it is immediate from polar decompositions that µ(AXB) = µ(|A| · X · |B ∗ |). Hence (5.7) with r = 1/2 yields µ(AXB) ≺w(log) µ(A∗ AX)1/2 µ(XBB ∗ )1/2 , (5.8) which is a generalization of Horn’s majorization (1.14). Let r > 0, 1 < p, q < ∞ with 1/p + 1/q = 1, and k · k be a symmetric norm. Then by (5.8) and Lemma 2.13(1) we have k |AXB|r k ≤ k |A∗ AX|rp/2 k1/p k |XBB ∗ |rq/2 k1/q . In particular, k |AB|r k ≤ k |A|rp k1/p k |B|rq k1/q , k |AXB|r k2 ≤ k |A∗ AX|r k · k |XBB ∗ |r k. The last inequality was given in [16]. As is well known, the Golden–Thompson trace inequality fails to hold for three matrices, while an attempt in the case of three matrices was made in [59]. In the rest of this subsection, we discuss log-majorizations of Golden–Thompson type for more than two operators which are commuting except one. Proposition 5.10. (1) If A1 , A2 ∈ B(H)+ and A1 A2 = A2 A1 , then for every B ∈ B(H), µ(BA1 A2 B ∗ ) = µ((A1 A2 )1/2 B ∗ B(A1 A2 )1/2 ) ≺w(log) µ(A1 B ∗ BA2 ). (2) If A1 , A2 , A3 ∈ B(H)+ and Ai Aj = Aj Ai , then for every B ∈ B(H), µ(BA1 A2 A3 B ∗ ) ≺w(log) µ(A21 B ∗ BA22 B ∗ BA23 )1/2 . P r o o f. (1) The equality is obvious because BA1 A2 B ∗ = |(A1 A2 )1/2 B ∗ |2 and (A1 A2 )1/2 B ∗ B(A1 A2 )1/2 = |B(A1 A2 )1/2 |2 . For the log-majorization, by the antisymmetric tensor technique, it suffices to show that |A1 B ∗ BA2 | ≤ I implies (A1 A2 )1/2 B ∗ B(A1 A2 )1/2 ≤ I. We may assume that A2 is in∗ 2 ∗ vertible. Then A2 B ∗ BA21 B ∗ BA2 ≤ I implies B ∗ BA21 B ∗ B ≤ A−2 2 and so A1 B BA1 B BA1 −1 2 −1 ∗ 1/2 ∗ 1/2 ≤ (A1 A2 ) . This gives A1 B BA1 ≤ A1 A2 and hence (A1 A2 ) B B(A1 A2 ) ≤ I. LOG-MAJORIZATIONS AND NORM INEQUALITIES 175 (2) By using (1) twice, we have µ(BA1 A2 A3 B ∗ ) ≺w(log) µ(A2 B ∗ BA1 A3 ) = µ(A1 A3 (A2 B ∗ B)∗ (A2 B ∗ B)A1 A3 )1/2 ≺w(log) µ(A21 B ∗ BA22 B ∗ BA23 )1/2 . Proposition 5.11. Let H1 , H2 , H3 , K be lower-bounded self-adjoint operators on H. Assume that H1 , H2 , H3 are mutually commuting (i.e. their spectral projections are all commuting). Then (5.9) (5.10) µ(e−((H1 +H2 )+̂K) ) ≺w(log) µ(e−H1 e−K e−H2 ), µ(e−((H1 +H2 +H3 )+̂2K) ) ≺w(log) µ(e−H1 e−K e−H2 e−K e−H3 ). Hence ke−(H1 +H2 +H3 )+̂2K) k ≤ ke−H1 e−K e−H2 e−K e−H3 k for any symmetric norm k · k. P r o o f. (5.9) is a special case of (5.10). The commuting assumption guarantees that H1 + H2 + H3 is defined as a lower-bounded self-adjoint operator and e−r(H1 +H2 +H3 ) = e−rH1 e−rH2 e−rH3 , r ≥ 0. By (3.6) and Proposition 5.10(2) we have for every 0 < r ≤ 1/2, µ((e−rK e−r(H1 +H2 +H3 ) e−rK )1/r ) ≺w(log) µ(e−K/2 e−(H1 +H2 +H3 )/2 e−K/2 )2 ≺w(log) µ(e−H1 e−K e−H2 e−K e−H3 ). Hence (5.10) follows by letting r ↓ 0 as in the proof of Theorem 3.7. Let A1 , . . . , A4 ∈ B(H)+ with Ai Aj = Aj Ai , and B ∈ B(H). Then, using Proposition 5.10(2) twice, we have µ(BA1 A2 A3 A4 B ∗ ) ≺w(log) µ(A4 B ∗ BA21 A22 A23 B ∗ BA4 )1/2 ≺w(log) µ(A41 B ∗ BA24 B ∗ BA42 B ∗ BA24 B ∗ BA43 )1/4 . This implies as Proposition 5.11 that if H1 , . . . , H4 , K are lower-bounded self-adjoint operators and H1 , . . . , H4 are mutually commuting, then µ(e−(H1 +...+H4 )+̂4K) ) ≺w(log) µ(e−H1 e−K e−H4 /2 e−K e−H2 e−K e−H4 /2 e−K e−H3 ). Problem 5.12. Do (5.9) and (5.10) extend to a greater number of operators? That is, if H1 , . . . , Hn , K are lower-bounded self-adjoint operators with H1 , . . . , Hn mutually commuting, then µ(e−((H1 +...+Hn )+̂(n−1)K) ) ≺w(log) µ(e−H1 e−K e−H2 . . . e−K e−Hn )? 5.3. Determinant inequalities. In the final subsection, some determinant inequalities of Golden–Thompson type are given. Let A ∈ C1 (H). Following [32], we define the determinant det(I + A) by ∞ Y det(I + A) = (1 + λk (A)), k=1 where λk (A) are the eigenvalues of A counted with algebraic multiplicities. The product of the above right-hand side converges because Theorem 3.3 and Proposition 1.3 imply that ∞ ∞ X X |λk (A)| ≤ µk (A) = kAk1 < ∞. k=1 k=1 176 F. HIAI Another equivalent definition is (5.11) det(I + A) = ∞ X tr(Λn A). n=0 Indeed, note that if A ∈ C1 (H) then kΛn Ak1 ≤ kAkn1 /n! for n ∈ N and det(I + zA) = ∞ X z n tr(Λn A), n=0 z ∈ C, is an entire analytic function. In particular, comparing the coefficients of z, we have Lidskii’s theorem: ∞ X tr A = λk (A). k=1 It is known that for every A, B ∈ C1 (H), (5.12) |det(I + A) − det(I + B)| ≤ kA − Bk1 exp(kAk1 + kBk1 + 1), which shows that A 7→ det(I +A) is k·k1 -continuous on C1 (H). See [79, §3], [74, §XIII.17], or [31, VII] for details of the facts on the determinant mentioned above. Lemma 5.13. Let a = (a1 , a2 , . . .) and b = (b1 , b2 , . . .) be such that a1 ≥ a2 ≥ . . . ≥ 0 Q∞ Q∞ and b1 ≥ b2 ≥ . . . ≥ 0. If a ≺w(log) b, then k=1 (1 + ak ) ≤ k=1 (1 + bk ). P r o o f. Since log(1 + x) is increasing on [0, ∞) and log(1 + ex ) is convex, Proposition Qk Qk 1.3 implies that (log(1 + ai )) ≺w (log(1 + bi )) and hence i=1 (1 + ai ) ≤ i=1 (1 + bi ) for every k ∈ N. This gives the result. Proposition 5.14. Let H and K be lower-bounded self-adjoint operators on H. Assume that e−K ∈ C1 (H). Then (5.13) det(I + e−(H +̂K) ) ≤ det(I + (e−rH/2 e−rK e−rH/2 )1/r ), r > 0, and the above right-hand side decreases as r ↓ 0. In particular , (5.14) det(I + e−(H +̂K) ) ≤ det(I + e−H/2 e−K e−H/2 ) = det(I + e−H e−K ). Moreover , if H +K is essentially self-adjoint , then the right-hand side of (5.13 ) decreases to det(I + e−(H+K) ) as r ↓ 0. P r o o f. Note as remarked after Theorem 3.12 that (e−rH/2 e−rK e−rH/2 )1/r ∈ C1 (H) for all r > 0. The first assertion is immediate from Theorems 3.4 and 3.7 due to Lemma 5.13. The equality in (5.14) is seen because e−H/2 e−K e−H/2 and e−H e−K have the same eigenvalues including algebraic multiplicities. (This is seen also via (5.11) and Lemma 3.1.) The last assertion is a consequence of (3.12) and (5.12). Proposition 5.15. Let σ be an operator mean such that σ ≤ #α where 0 ≤ α ≤ 1. Let H and K be lower-bounded self-adjoint operators. Assume that H + K is essentially self-adjoint and e−K ∈ C1 (H). Then det(I + (e−rH/(1−α) σ e−rK/α )1/r ) ≤ det(I + e−(H+K) ), r > 0. Moreover , if H is bounded , then the above left-hand side converges to det(I + e−(H+K) ) as r ↓ 0. LOG-MAJORIZATIONS AND NORM INEQUALITIES P r o o f. Let H = T∞ a λdEH (λ) and Hn = Tn a 177 λdEH (λ). For each r > 0 we get det(I + (e−rH/(1−α) σ e−rK/α )1/r ) ≤ det(I + (e−rHn /(1−α) #α e−rK/α )1/r ) ≤ det(I + e−(Hn +K) ) by Theorem 4.9 and Lemma 5.13. So the desired inequality follows because det(I + e−(Hn +K) ) → det(I + e−(H+K) ) due to (3.22) and (5.12). If H is bounded, then Theorem 4.11 and Lemma 3.8 show that n n Y Y µk (I + (e−rH/(1−α) σ e−rK/α )1/r ) µk (I + e−(H+K) ) ≤ lim inf r↓0 k=1 k=1 ≤ lim inf det(I + (e−rH/(1−α) σ e−rK/α )1/r ). r↓0 Letting n → ∞ we have the second assertion. The final result is restricted to the matrix case. Further determinant inequalities for matrices are found in [6]. Proposition 5.16. Let T be any n × n matrix. (1) det(I + |eT |) ≤ det(I + eRe T ). (2) det(log |eT |) ≥ det(Re T ) if Re T ≥ 0. P r o o f. (1) follows from Proposition 5.6(1) and Lemma 5.13. (2) For any matrix T , since det |eT | = |etr T | = etr(Re T ) = det eRe T , Proposition 5.6(1) is improved to µ(eT ) ≺(log) µ(eRe T ). Since |eT | ≥ I by Proposition 5.6(2), this means that µ(log |eT |) ≺ µ(Re T ). For any ε > 0, applying Proposition 1.1(iii) to f (x) = − log(x + ε) we have n n X X log{µi (Re T ) + ε}, log{µi (log |eT |) + ε} ≤ − − i=1 i=1 so that n Y i=1 {µi (log |eT |) + ε} ≥ n Y i=1 {µi (Re T ) + ε}. Letting ε ↓ 0 yields the required inequality. Problem 5.17. Is it possible to properly extend the determinant inequalities of Proposition 5.16 to operators? Concerning (1), both sides are equal to∞ for infinite-dimensional T ∈ B(H), so that we have to consider unbounded T . References [1] [2] [3] T. A n d o, Topics on Operator Inequalities, Lecture notes (mimeographed), Hokkaido Univ., Sapporo, 1978. —, On some operator inequalities, Math. Ann. 279 (1987), 157–159. —, Comparison of norms |||f (A)−f (B)||| and |||f (|A−B|)|||, Math. Z. 197 (1988), 403–409. 178 [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] [21] [22] [23] [24] [25] [26] [27] [28] F. HIAI T. A n d o, Majorization, doubly stochastic matrices and comparison of eigenvalues, Linear Algebra Appl. 118 (1989), 163–248. —, Majorizations and inequalities in matrix theory, ibid. 199 (1994), 17–67. T. A n d o and F. H i a i, Log majorization and complementary Golden–Thompson type inequalities, ibid. 197/198 (1994), 113–131. H. A r a k i, Relative Hamiltonian for faithful normal states of a von Neumann algebra, Publ. Res. Inst. Math. Sci. 9 (1973), 165–209. —, Golden–Thompson and Peierls-Bogolubov inequalities for a general von Neumann algebras, Comm. Math. Phys. 34 (1973), 167–178. —, Some properties of modular conjugation operator of von Neumann algebras and a non-commutative Radon-Nikodym theorem with a chain rule, Pacific J. Math. 50 (1974), 309–354. —, Relative entropy of states of von Neumann algebras, Publ. Res. Inst. Math. Sci. 11 (1976), 809–833. —, Relative entropy for states of von Neumann algebras II , ibid. 13 (1977), 173–192. —, On an inequality of Lieb and Thirring, Lett. Math. Phys. 19 (1990), 167–170. B. B e a u z a m y, Introduction to Banach Spaces and Their Geometry, North-Holland, Amsterdam, 1982. D. S. B e r n s t e i n, Inequalities for matrix exponentials, SIAM J. Matrix Anal. Appl. 9 (1988), 156–158. R. B h a t i a, Perturbation Bounds for Matrix Eigenvalues, Pitman Res. Notes in Math. Ser. 162, Longman, Harlow, 1987. R. B h a t i a and C. D a v i s, A Cauchy–Schwarz inequality for operators with applications, Linear Algebra Appl. 223/224 (1995), 119–129. G. B i r k h o f f, Tres observaciones sobre el algebra lineal , Univ. Nac. Tucuman Rev. Ser. A 5 (1946), 147–151. M. Sh. B i r m a n, L. S. K o p l i e n k o, and M. Z. S o l o m y a k, Estimates for the spectrum of the difference between fractional powers of two self-adjoint operators, Soviet Math. (Iz. VUZ) 19, no. 3 (1975), 1–6. P. R. C h e r n o f f, Note on product formulas for operator semigroups, J. Funct. Anal. 2 (1968), 238–242. J. D i x m i e r, Formes linéaires sur un anneau d’opérateurs, Bull. Soc. Math. France 81 (1953), 9–39. P. G. D o d d s, T. K. D o d d s, and B. d e P a g t e r, Non-commutative Banach function spaces, Math. Z. 201 (1989), 583–597. —, —, —, Non-commutative Köthe duality, Trans. Amer. Math. Soc. 339 (1993), 717–750. M. J. D o n a l d, Relative hamiltonians which are not bounded from above, J. Funct. Anal. 91 (1990), 143–173. W. D o n o g h u e, Monotone Matrix Functions and Analytic Continuation, Springer, New York, 1974. T. F a c k, Sur la notion de valeur caractéristique, J. Operator Theory 7 (1982), 307–333. T. F a c k and H. K o s a k i, Generalized s-numbers of τ -measurable operators, Pacific J. Math. 123 (1986), 269–300. M. F u j i i, T. F u r u t a, and E. K a m e i, Furuta’s inequality and its application to Ando’s theorem, Linear Algebra Appl. 179 (1993), 161–169. T. F u r u t a, A ≥ B ≥ 0 assures (B r Ap B r )1/q ≥ B (p+2r)/q for r ≥ 0, p ≥ 0, q ≥ 1 with (1 + 2r)q ≥ p + 2r, Proc. Amer. Math. Soc. 101 (1987), 85–88. LOG-MAJORIZATIONS AND NORM INEQUALITIES [29] [30] [31] [32] [33] [34] [35] [36] [37] [38] [39] [40] [41] [42] [43] [44] [45] [46] [47] [48] [49] [50] [51] [52] 179 T. F u r u t a, Applications of order preserving operator inequalities, in: Operator Theory: Adv. Appl. 59, Birkhäuser, Basel, 1992, 180–190. —, Extension of the Furuta inequality and Ando-Hiai log-majorization, Linear Algebra Appl. 219 (1995), 139–155. I. G o h b e r g, S. G o l d b e r g, and M. A. K a a s h o e k, Classes of Linear Operators I , Birkhäuser, Basel, 1990. I. C. G o h b e r g and M. G. K r e i n, Introduction to the Theory of Linear Nonselfadjoint Operators, Trnansl. Math. Monographs 18, Amer. Math. Soc., 1969. S. G o l d e n, Lower bounds for Helmholtz function, Phys. Rev. 137 (1965), B1127–B1128. U. H a a g e r u p, The standard form of von Neumann algebras, Math. Scand. 37 (1975), 271–283. —, Lp -spaces associated with an arbitrary von Neumann algebra, in: Colloq. Internat. CNRS 274, 1979, 175–184. F. H a n s e n and G. K. P e d e r s e n, Jensen’s inequality for operators and Löwner’s theorem, Math. Ann. 258 (1982), 229–241. B. H e l f f e r, Around the transfer operator and the Trotter–Kato formula, in: Operator Theory: Adv. Appl. 78, Birkhäuser, Basel, 1995, 161–174. F. H i a i, Majorization and stochastic maps in von Neumann algebras, J. Math. Anal. Appl. 127 (1987), 18–48. —, Spectral majorization between normal operators in von Neumann algebras, in: Operator Algebras and Operator Theory, W. B. Arveson et al. (eds.), Pitman Res. Notes in Math. Ser. 271, Longman, Harlow, 1992, 78–115. —, Trace norm convergence of exponential product formula, Lett. Math. Phys. 33 (1995), 147–158. F. H i a i and Y. N a k a m u r a, Majorization for generalized s-numbers in semifinite von Neumann algebras, Math. Z. 195 (1987), 17–27. —, —, Distance between unitary orbits in von Neumann algebras, Pacific J. Math. 138 (1989), 259–294. —, —, Closed convex hulls of unitary orbits in von Neumann algebras, Trans. Amer. Math. Soc. 323 (1991), 1–38. F. H i a i and D. P e t z, The Golden–Thompson trace inequality is complemented , Linear Algebra Appl. 181 (1993), 153–185. M. H i l s u m, Les espaces Lp d’une algèbre de von Neumann définies par la derivée spatiale, J. Funct. Anal. 40 (1981), 151–169. T. I c h i n o s e and H. T a m u r a, Error estimate in operator norm for Trotter–Kato product formula, Integral Equations Operator Theory 27 (1997), 195–207. —, —, Error bound in trace norm for Trotter–Kato product formula of Gibbs semigroups, preprint, 1996. E. K a m e i, Double stochasticity in finite factors, Math. Japon. 29 (1984), 903–907. —, An order on statistical operators implicitly introduced by von Neumann, Math. Japon. 30 (1985), 891–895. T. K a t o, A generalization of the Heinz inequality, Proc. Japan Acad. 37 (1961), 305–308. —, Trotter’s product formula for an arbitrary pair of self-adjoint contraction semigroups, in: Topics in Functional Analysis, I. Gohberg and M. Kac (eds.), Adv. Math. Suppl. Stud. 3, Academic Press, New York, 1978, 185–195. F. K i t t a n e h, Norm inequalities for fractional powers of positive operators, Lett. Math. Phys. 27 (1993), 279–285. 180 [53] [54] [55] [56] [57] [58] [59] [60] [61] [62] [63] [64] [65] [66] [67] [68] [69] [70] [71] [72] [73] [74] [75] [76] [77] [78] F. HIAI H. K o s a k i, Non-commutative Lorentz spaces associated with a semi-finite von Neumann algebra and applications, Proc. Japan Acad. Ser. A Math. Sci. 57 (1981), 303–306. —, Relative entropy of states: a variational expression, J. Operator Theory 16 (1986), 335–348. —, An inequality of Araki–Lieb–Thirring (von Neumann algebra case), Proc. Amer. Math. Soc. 114 (1992), 477–481. —, private communication, 1995. F. K u b o and T. A n d o, Means of positive linear operators, Math. Ann. 246 (1980), 205– 224. A. L e n a r d, Generalization of the Golden–Thompson inequality Tr(eA eB ) ≥ Tr eA+B , Indiana Univ. Math. J. 21 (1971), 457–467. E. H. L i e b, Convex trace functions and the Wigner–Yanase–Dyson conjecture, Adv. in Math. 11 (1973), 267–288. E. H. L i e b and W. T h i r r i n g, in: Studies in Mathematical Physics, E. H. Lieb, B. Simon, and A. S. Wightman (eds.), Princeton Univ. Press, Princeton, 1976, 301–302. A. S. M a r k u s, The eigen- and singular values of the sum and product of linear operators, Russian Math. Surveys 19 (4) (1964), 91–120. A. W. M a r s h a l l and I. O l k i n, Inequalities: Theory of Majorization and Its Applications, Academic Press, New York, 1979. L. M i r s k y, Symmetric gauge functions and unitarily invariant norms, Quart. J. Math. Oxford Ser. (2) 11 (1960), 50–59. Y. N a k a m u r a, An inequality for generalized s-numbers, Integral Equations Operator Theory 10 (1987), 140–145. H. N e i d h a r d t and V. A. Z a g r e b n o v, The Trotter–Kato product formula for Gibbs semigroups, Comm. Math. Phys. 131 (1990), 333–346. E. N e l s o n, Notes on non-commutative integration, J. Funct. Anal. 15 (1974), 103–116. J. v o n N e u m a n n, Some matrix-inequalities and metrization of matrix-space, Tomsk Univ. Rev. 1 (1937), 286–299 (Collected Works, Vol. IV, 205–218). M. O h y a and D. P e t z, Quantum Entropy and Its Use, Springer, Berlin, 1993. D. P e t z, Spectral scale of self-adjoint operators and trace inequalities, J. Math. Anal. Appl. 109 (1985), 74–82. —, A variational expression for the relative entropy, Comm. Math. Phys. 114 (1988), 345–349. —, A survey of certain trace inequalities, in: Functional Analysis and Operator Theory, Banach Center Publ. 30, Inst. Math., Polish Acad. Sci., Warszawa, 1994, 287–298. R. T. P o w e r s and E. S t ø r m e r, Free states of the canonical anticommutation relations, Comm. Math. Phys. 16 (1970), 1–33. M. R e e d and B. S i m o n, Methods of Modern Mathematical Physics II , Academic Press, New York, 1975. —, —, Methods of Modern Mathematical Physics IV , Academic Press, New York, 1978. D. L. R o g a v a, Error bounds for Trotter-type formulas for self-adjoint operators, Functional Anal. Appl. 27 (1993), 217–219. M. B. R u s k a i, Inequalities for traces on von Neumann algebras, Comm. Math. Phys. 26 (1972), 280–289. R. S c h a t t e n, Norm Ideals of Completely Continuous Operators, Springer, Berlin, 1970. I. S e g a l, A non-commutative extension of abstract integration, Ann. of Math. (2) 57 (1953), 401–457. LOG-MAJORIZATIONS AND NORM INEQUALITIES [79] [80] [81] [82] [83] [84] [85] [86] [87] [88] [89] [90] 181 B. S i m o n, Trace Ideals and Their Applications, London Math. Soc. Lecture Note Ser. 35, Cambridge Univ. Press, Cambridge, 1979. S. S t r ă t i l ă, Modular Theory in Operator Algebras, Editura Academiei and Abacus Press, Tunbridge Wells, 1981. M. S u z u k i, Quantum statistical Monte Carlo methods and applications to spin systems, J. Statist. Phys. 43 (1986), 883–909. —, Convergence of general decompositions of exponential operators, Comm. Math. Phys. 163 (1994), 491–508. K. S y m a n z i k, Proof and refinements of an inequality of Feynman, J. Math. Phys. 6 (1965), 1155–1156. C. J. T h o m p s o n, Inequality with applications in statistical mechanics, ibid. 6 (1965), 1812–1813. —, Inequalities and partial orders on matrix spaces, Indiana Univ. Math. J. 21 (1971), 469–480. A. U h l m a n n, Relative entropy and the Wigner–Yanase–Dyson–Lieb concavity in an interpolation theory, Comm. Math. Phys. 54 (1977), 21–32. H. U m e g a k i, Conditional expectation in an operator algebra, IV (entropy and information), Kōdai Math. Sem. Rep. 14 (1962), 59–85. B.-Y. W a n g and M.-P. G o n g, Some eigenvalue inequalities for positive semidefinite matrix power products, Linear Algebra Appl. 184 (1993), 249–260. K. W a t a n a b e, Some results on non-commutative Banach function spaces, Math. Z. 210 (1992), 555–572. —, Some results on non-commutative Banach function spaces II (infinite cases), Hokkaido Math. J. 22 (1993), 349–364. Editorial note: See also M. F i e d l e r and V. P t á k, A new positive definite geometric mean of two positive definite matrices, Linear Algebra Appl. 251 (1997), 1–20; W. P u s z and S. L. W o r o n o w i c z, Functional calculus for sesquilinear forms and the purification map, Rep. Math. Phys. 8 (1975), 159–170; and R. B h a t i a, Matrix Analysis, Springer, New York, 1997.
© Copyright 2025 Paperzz