Majorization and weak log-majorization #
This file develops the theory of majorization for finite sequences, leading to the key singular value inequality needed for the Schatten–Hölder inequality.
Main results #
sum_rpow_singularValues_mul_le: forr > 0, the singular values ofA * Bsatisfy∑ σᵢ(AB)^r ≤ ∑ σ↓ᵢ(A)^r · σ↓ᵢ(B)^r.holder_step_for_singularValues: the Hölder step giving∑ σ↓ᵢ(A)^r · σ↓ᵢ(B)^r ≤ (∑ σᵢ(A)^p)^{r/p} · (∑ σᵢ(B)^q)^{r/q}.
Sorted singular values #
The singular values of a square complex matrix A, defined as the square
roots of the eigenvalues of A†A. These are indexed by d without a
particular ordering.
Note: We use A.conjTranspose as the argument to isHermitian_mul_conjTranspose_self
so that the underlying Hermitian matrix is A† * A (matching the convention in schattenNorm).
Equations
- singularValues A i = √(⋯.eigenvalues i)
Instances For
Singular values are nonneg.
The sorted singular values of a square complex matrix, in decreasing order,
indexed by Fin (Fintype.card d).
We define them by sorting the multiset of singular values.
Equations
- singularValuesSorted A i = ((Multiset.map (singularValues A) Finset.univ.val).sort fun (x1 x2 : ℝ) => x1 ≥ x2).get ⟨↑i, ⋯⟩
Instances For
Sorted singular values are nonneg.
The sum ∑ singularValues A i ^ p equals the sum over sorted singular values.
Weak log-majorization and its consequences #
Sorted singular values are antitone (decreasing).
Compound matrices and auxiliary lemmas for Horn's inequality #
The proof of Horn's inequality uses the compound matrix (or k-th exterior power
of a matrix). For a d × d matrix M and k ≤ card d, the k-th compound matrix
C_k(M) is indexed by k-element subsets of the index type, with entry (S, T) being
the minor det M[S, T].
The key properties are:
- Cauchy–Binet:
C_k(M N) = C_k(M) · C_k(N). - Spectral characterisation: the largest singular value of
C_k(M)is∏_{i=1}^k σ↓ᵢ(M). - Operator-norm submultiplicativity:
σ₁(A B) ≤ σ₁(A) · σ₁(B).
Combining these gives Horn's inequality:
∏ σ↓(AB) = σ₁(C_k(AB)) = σ₁(C_k(A) C_k(B)) ≤ σ₁(C_k(A)) σ₁(C_k(B)) = (∏ σ↓(A))(∏ σ↓(B)).
A LinearOrder on any Fintype, obtained classically via well-ordering.
Instances For
The k-th compound (exterior-power) matrix of M.
Equations
- compoundMatrix M k S T = (M.submatrix (fun (i : Fin k) => ((↑S).orderEmbOfFin ⋯) i) fun (j : Fin k) => ((↑T).orderEmbOfFin ⋯) j).det
Instances For
Cauchy–Binet formula for rectangular matrices: if A is m × n and B is
n × m, then det(A * B) = ∑_S det(A[:,S]) * det(B[S,:]) where the sum is
over m-element subsets S of the column/row index.
The compoundMatrix of a product is the product of the compoundMatrixs.
compoundMatrix commutes with conjTranspose.
The compound matrix of a diagonal matrix is diagonal, with entries being products of eigenvalues over k-subsets.
The eigenvalues of the compound matrix of a Hermitian matrix are the products
of eigenvalues over k-subsets. More precisely, the singular values of
compoundMatrix M k are the square roots of products of eigenvalues of M†M
over k-subsets.
The 0th sorted singular value is the maximum of the singular values.
Each singular value appears in the sorted list.
Each sorted singular value appears among the original singular values.
Stronger version of singularValues_compoundMatrix_eq that exposes the permutation.
Converse of singularValues_compoundMatrix_eq: every singular value of the
compound matrix is a product of singular values of M over some k-subset.
There exists a bijection σ : Fin (card d) ≃ d such that
singularValues M (σ i) = singularValuesSorted M i for all i.
For any k-subset S of d, the product of singular values over S is ≤ the product of the top k sorted singular values.
The Rayleigh quotient bound:
For a Hermitian matrix H with eigenvalues λ, we have
v† H v ≤ (max λ) · v† v for all v.
The quadratic form of A† A is bounded by (max singular value)² * ‖v‖².
The largest singular value of a matrix product is at most the product of the
largest singular values: σ₁(M * N) ≤ σ₁(M) * σ₁(N).
This is operator-norm submultiplicativity.
Horn's inequality (weak log-majorization of singular values):
For all k, ∏_{i<k} σ↓ᵢ(AB) ≤ ∏_{i<k} σ↓ᵢ(A) · σ↓ᵢ(B).
This follows from submultiplicativity of the operator norm applied to
exterior powers of the matrices.
Weak log-majorization implies sum of powers inequality #
Weak log-majorization is preserved under positive powers.
Weak log-majorization of nonneg antitone sequences implies the sum of powers inequality.
Key singular value inequality for products #
Hölder inequality for singular values #
The finite-sum Hölder inequality applied to sequences of r-th powers of sorted singular values.
With conjugate exponents p' = p/r > 1 and q' = q/r > 1 (which satisfy
1/p' + 1/q' = 1 when 1/r = 1/p + 1/q), this gives:
∑ σ↓ᵢ(A)^r · σ↓ᵢ(B)^r ≤ (∑ σ↓ᵢ(A)^p)^{r/p} · (∑ σ↓ᵢ(B)^q)^{r/q}
Note: The sums on the RHS don't depend on the ordering, so we can replace sorted singular values with unsorted ones.