ICTS 2025 - Tensor norms for quantum entanglement
My name is Ion Nechita. I am a researcher at the CNRS, LPT Toulouse, part of the Tiq-Toqs team. Here's my webpage and my X account.
Most of the material in these lectures is from standard textbooks and review papers. Some of my favorites are:
I will be happy to discuss with you in person, or by email.
The goal of these lectures is to introduce you to the framework of tensor norms and to their applications in quantum information theory.
Here are some results I will present in detail:
You will learn about the following topics:
A normed space
To a normed space we associate its unit ball
The unit ball
Studying normed spaces is, loosely speaking, equivalent to studying the convex geometry of their unit balls.
These are the most important normed spaces that you will encounter. In the real case, we have
The complex case is very similar, adding absolute values in the formulas above when needed. Of special importance in the complex situation is the case
In the case of
In
The Schatten classes are the equivalent of the
The Schatten norms can either be defined as
Let us recall here the singular value decomposition of matrices, arguably the most important result in linear algebra. Given a matrix
Going back to the Schatten norm, let us emphasize the three most important cases:
What are
norms I could have mentioned?
The diamond norm.
The notion of duality fundamental in the study of normed spaces (or, more generally, Banach spaces). The corresponding notion for convex sets, polarity, plays an equally important role in convex geometry.
At the level of vector spaces, recall that the dual space
A scalar product
At the level of convex sets, the corresponding notion is defined as follows. Given a convex set
The duality relation has many interesting properties. For example, it naturally defines a bijection between extremal points of the primal set and facets of the dual set. In the case of
ball ofUnit |
Number of extremal points | Number of facets |
---|---|---|
In standard quantum mechanics textbooks, quantum states are represented by unit vectors inside a complex Hilbert space
A mixed quantum state (or a density matrix) is a positive semidefinite operator of trace
Apart the normalization condition (which, for most problems, is irrelevant and/or easy to deal with), the main feature of (mixed) quantum states is the positive semidefinite condition above. Importantly for us, this condition can be stated with the help of the
A self-adjoint matrix
The case of qubit density matrices (
Any such matrix of unit trace can be thus decomposed as
More generally, we have the following isomorphism of normed spaces:
Consider the following state discrimination scenario:
Bob receives an unkown quantum state
Using a measurement defined by an operator
Bob's best guess proability is
Let us first prove that the RHS of the equation above provides an upper bound for the success probability of any strategy Bob might use. The form of the optimal
Let
Back to our state distinguishability problem, we have obtained
Note that in the unbiased case
The Holevo-Helstrom theorem shows that the nuclear norm (or the 1-norm, or Schatten's 1-norm) is the "correct" norm to use when trying to measure how different two quantum states are.
The nuclear norm is contracting under quantum channels (trace preserving, completely positive maps):
How about more than two states? Alice produces, with probability
Discriminating between multiple states and the pretty good measurement.
Consider the scenario with two possible states
The relative entropy appears as a large deviations exponent:
from The tangled state of quantum hypothesis testing.
A tensor can mean many things, depending on the person you ask.
Here are DeepSeek-R1-Distill-Qwen-7B's answers when prompted as follows:
what is a tensor? give me a one-sentence definition
``A tensor is a mathematical object representing a multi-dimensional array of numerical values, which generalizes scalars, vectors, and matrices into higher dimensions, often used in physics and machine learning.
you are a nobel prize winning physicist. what is a tensor? give me a one-sentence definition
``A tensor is a mathematical generalization of scalars, vectors, and matrices, representing multi-dimensional quantities with specific transformation rules under coordinate changes, essential for describing complex physical systems involving multiple dimensions or directions.
you are an abel prize winning mathematician. what is a tensor? give me a one-sentence definition
``A tensor is a versatile mathematical object that generalizes concepts from vector calculus, serving as a bridge between geometry and algebra by efficiently representing complex, multi-dimensional data and transformations. It can be thought of as an extension of vectors and matrices into higher dimensions, used across various fields including physics, engineering, and machine learning to model intricate systems and phenomena with precision.
you are a turing prize winning computer scientist. what is a tensor? give me a one-sentence definition
``A tensor is a mathematical object that extends the concept of scalars, vectors, and matrices to an arbitrary number of dimensions, enabling efficient representation and manipulation of multidimensional numerical data.
In this lecture, a tensor will simply be an element of a tensor product space
Here is one way (computer science-y and not very nice from a mathematical point of view) to define the tensor product of vector spaces. For every
You are familiar with the tensor product space
Here is a more mathematical definition of the tensor product; we state it for two vector spaces for the sake of readability. Let
The rank of the matrix multiplication tensor represents the minimal number of scalar multiplications needed to compute the product of two matrices. In more detail, if the tensor can be expressed as a sum of
This universality property ensures that the tensor product captures in a minimal and unique way all bilinear information from
Tensors can be seen as higher order matrices. Indeed, we have the following hierarchy of mathematical objects as a function of the order
We said earlier that matrices are order 2 tensors. This is true if one considers matrices as an array of numbers, i.e. a linear map expressed in some basis.
From a theoretical perspective, expressing linear maps using coordinates is not recommended. Instead, consider a linear transformation between two vector spaces
Firstly, let us use coordinates. If
Secondly, there is a more mathematically elegant way to understand the isomorphism above. Consider the evaluation tensor
More generally, linear applications
The unit tensor is defined, for some order
For
For
The GHZ (Greenberger-Horne-Zeilinger) state is one of the paradigmatic examples of a maximally entangled state in the multipartite setting. It is defined by
The following is (slightly) adapted from an answer on math.stackexchange by Jordan Taylor, see also the arXiv paper, Wikipedia, or Hand-waving and interpretive dance: an introductory course on tensor networks.
In graphical notation, tensors are represented as shapes (small disks below) with legs sticking out of them. A vector can be represented as a shape with one leg, a matrix can be represented as a shape with two legs, and so on:
Each leg corresponds to an index of the tensor - specifying an integer value for each leg of the tensor addresses a number inside of it:
where 0.157 happens to be the number in the python
). The amount of memory required to store a tensor grows exponentially with the number of legs, so tensors with lots of legs are usually represented only implicitly: decomposed as operations between many smaller tensors.
Connecting the legs of two tensors indicates a tensor contraction (also known as an Einstein summation). Here are the most common kinds of contractions between vectors and matrices:
In every case you can tell how many legs (i.e. what is the order) the resulting tensor will have by the number of uncontracted "free" legs on the left column.
But graphical notation is most useful for representing unfamiliar operations between many tensors. One example in this direction is
The middle part of the graphical notation here shows that the number in each
Graphical notation really comes into its own when dealing with larger networks of tensors. For example, consider the contraction
and we can immediately see which tensors are to be contracted, and that the result will be a single number (no un-contracted, or dangling legs). Contractions like this can be performed in any order. Some contraction orders are much more efficient than others, but they all get the same answer eventually.
What is the most efficient way to multiply
In the general case, contracting a tensor network is known to be a computationally intractable problem: it is
Using the graphical notation, we can denote the unit tensor by a dot
The maximally entangled state (or the Bell state) is the pure bipartite state given by
The corresponding maximally entangled density matrix
Show graphically that the partial trace of the maximally entangled density matrix is a maximally mixed state.
The matrix multiplication tensor is a compact way of encoding the bilinear operation of matrix multiplication. It serves as a bridge between the algebraic description of multiplying matrices and the study of tensor rank, which is deeply connected to the complexity of matrix multiplication algorithms.
Consider matrices:
Let us now use the isomorphism we have seen earlier (assume also that the matrices are real valued):
where the
Note that the tensor products in the first and second operation above are not the same!
Graphically, we find the form of the
This implies that
In quantum information theory, the study of entanglement in multipartite systems is essential for understanding non-classical correlations between subsystems. A multipartite pure state
is said to be entangled if it cannot be expressed as a simple tensor product of states from each individual subsystem. In other words, if there are no states
then
We shall discuss next two ways of quantifying how entangled a pure multipartite quantum state is:
Not all elements
It the bipartite case, unit rank tensors
For example, one can easily show that, for three qubit states,
In his seminal work, Håstad (1990) proved that the decision problem of determining whether a given tensor (of order 3) has rank at most
Here is another aspect in which tensor rank is very different than the usual matrix rank. In matrix (
Let us focus in this section on square matrices:
The rank of the matrix multiplication tensor represents the minimal number of scalar multiplications needed to compute the product of two matrices. In more detail, if the tensor can be expressed as a sum of
Let us now focus on the
In his breakthrough paper from 1969, Volker Strassen showed that the rank of the
Later in 1971, Winograd showed that decomposition with only 6 term cannot exist, hence
where
In the case of the matrix multiplication tensor, the asymptotic rank is deeply connected to the matrix multiplication exponent
Recently, progress on similar questions have been achieved by machine learning systems. AlphaTensor improved the best know upper bound for the multiplication of
One of the key tools for quantifying the entanglement in multipartite pure states is the injective tensor norm. This norm measures the maximum overlap of the state
Here, each
This property makes the injective tensor norm a natural candidate for quantifying how "far" a state is from being separable.
In the bipartite case (
What is the injective norm of the GHZ and W states for three qubits? Which one is further away from the set of separable states?
The geometric measure of entanglement (GME) is another important concept used to quantify the entanglement of multipartite pure states. It is defined based on the maximal overlap between the state
The underlying idea is as follows: the closer the state
The goal of this section is to show how to construct larger normed spaces out of smaller ones using the tensor product construction.
Motivation: Let
For example: $\|(x, y)\|=\max \left(\|x\|_{X},\|y\|_{Y}\right)$ or $\|(x, y)\|=\|x\|_{X}+\|y\|_{Y}$
Remember that any norm space
A tensor norm on
Let us start with a simple example. Take the two spaces
Let
We write
Let us consider some simple examples. Given a linear map
This norm turns
Hence, we have shown the following equality of normed spaces.
Induced norms are injective tensor products:
In particular for
The operator norm is the injective tensor product norm of the euclidean (i..e.
Let us consider now the nuclear norm (or the trace norm)
where
Show
$$
\begin{aligned}
A & =\sum s{i} a{i} \otimes b{i}^{*} \
& =\sum s{i}\left|a{i}\right\rangle\left\langle b{i}\right|
\end{aligned}
proving the second inequality.
Therefore, the nuclear norm is the projective tensor product norm of the euclidean (i..e.
x=\sum{i=1}^{n} \ket{i} \otimes x{i}=\left(\begin{array}{c}
x{1} \
x{2} \
\vdots \
x_{n}
\end{array}\right)
\begin{aligned}
|x|{\ell{\infty}^n \otimes{\varepsilon} X} & = \sup {|\alpha|{\ell{1}^{n} \leq 1}, , |\beta|_{X^{*}} \leq 1}\langle\alpha \otimes \beta, x\rangle \
& =\sup{|\beta|{X^{*}} \leq 1,, i \in[n],, \varepsilon= \pm 1}\left\langle\varepsilon e{i} \otimes \beta, x\right\rangle \
& =\sup {|\beta|{X^{\circ}} \leq 1, i \in[n]}\left|\left\langle e{i} \otimes \beta, x\right\rangle\right| \
& =\sup {i \in[n]} \underbrace{\sup {|\beta|{X^{\circ}} \leq 1}\left|\left\langle\beta, x{i}\right\rangle\right|}{\left.| \beta, x{i}\right\rangle} \
& =\max {i \in[n]}\left|x{i}\right|_{X} .
\end{aligned}
Prove this theorem.
The following result shows that the injective (resp. the projective) tensor norm is the smallest (resp. the largest) tensor norm on
A norm
We need to prove two inequalities:
Thus
Let us revisit the example of the injective norm, used to measure the entanglement of pure quantum states. Recall its definition:
By the previous observations, the injective tensor product norm being the smallest tensor norm, we have
with equality if and only if the state
Find
Recall that mixed quantum states are represented by density matrices
A density matrix
with
Our goal in this section is to characterize the separability of a density matrix using an appropriate tensor norm. Note that the definition of separability is based on the notion of positivity. Recall the following useful characterization of positivity in terms of the trace norm:
In the multipartite case
We come now to the main result of this section.
Consider a density matrix
(1)
(2)
(3)
where we write
In order to keep the notation concise, let us write
Let us prove the different implications. We start with (1)
On the other hand,
The implication
For the final implication
where we use
with
where
Let us make on final important observation. Since
we can further decompose the
expressing separability in terms of the projective tensor product of
Unlike the situation for pure states where deciding separability is easy, Gurvits has shown that the membership problem for the set of separable states is
In order to prove that a given state is entangled, one needs to use entanglement criteria: easily computable conditions that imply entanglement, without the converse being guaranteed. We discuss next the most important entanglement criteria.
The positive partial transposition (or PPT) criterion states that
In coordinates, we have
The second most important entanglement criterion is the realignment criterion:
Note that
Prove the realignment criterion.
These two entanglement criteria were invented long before the use of tensor norms in quantum information theory. We shall show next that they appear very naturally in this framework, as a relaxation of the projective norm approach to entanglement.
Recall that a bipartite quantum state
and that the separability of
Grop the four spaces into two groups of two, and use
There are
Choice 1:
We have
so nothing interesting, we recover the trace norm of
Choice 2:
We have
So if
Choice 3:
We have
where
The notion of entanglement testers comes from the previous observations by extending and generalizing them to the multipartite setting and more general operations on the matrix.
Let
Recall that
A linear map
The following is the key observation:
Hence, we obtain the following entanglement criterion induced by the testers
Let us consider some examples. The map
is a tester, since
and thus
The entanglement criterion we obtain in the bipartite case is the realignment criterion from before:
One can recover some entanglement criteria studied in the literature using more general testers of the form
Entanglement testers are powerful enough to detect pure state entanglement:
If