Brunn-Minkowski inequality

In this short note, I posed a conjecture on Brunn-Minkwoski inequality and explain why we could be interested in this inequality, what is it meaning for further developing of some fully nonlinear elliptic equation come from geometry. The main part of the note devoted to discuss several different proof of classical Brunn-Minkowski inequality.

Brunn-Minkowski inequality

1. Introduction

I believe, every type of Brunn-Minkowski inequality, type of Brunn-Minkowski inequality is in some special sense and will be explained later, will be crucial with a corresponding regularity result of a fully nonlinear elliptic equation which could be realizable by geometric way which will also explained in further note.

So the key point is that Brunn-Minkowski inequality is crucial and have potential application, I posed a problem there and then consider the classical Brunn-Minkowski inequality, we give several proof of the classical Brunn-Minkowski inequality, everyone could help us to have a more refine understanding of the original difficulty with different angle.

Theorem 1 (conjecture) We have a map

$latex \displaystyle f:{\mathbb R}^d\times {\mathbb R}^d \longrightarrow {\mathbb R}^d \ \ \ \ \ (1)&fg=000000$

 


$latex \displaystyle (x,y) \longrightarrow f(x,y)&fg=000000$

We are willing to called the function $latex {f}&fg=000000$ as the hamiltonian function. then we could consider the hamiltonian flow of the function $latex {f}&fg=000000$, but this could only true for a even dimension manifold to make there exists $latex {f}&fg=000000$ that $latex {df}&fg=000000$ is a non-degenerate closed $latex {2-}&fg=000000$form.

Anyway we consider the level set of $latex {f}&fg=000000$, we get a foliation i.e $latex {{\mathbb R}^d=\{\amalg_{p\in {\mathbb R}}f^{-1}(p)\}}&fg=000000$. we consider the gradient flow with $latex {f}&fg=000000$, called the gradient flow begin with $latex {q\in {\mathbb R}}&fg=000000$ as $latex {\{\phi_q(t)\}_{t\in {\mathbb R}}}&fg=000000$. And we wish the gradient flow have a addition structure on itself then we could consider what is the Brunn-Minkowski inequality in this setting, the condition is a group structure on the space of level set $latex {L_f=\{\amalg_{p\in {\mathbb R}}f^{-1}(p)\}}&fg=000000$, i.e.

$latex \displaystyle \forall t_1,t_2\in {\mathbb R}, \forall q\in R^d, \phi_{q}(t_1+t_2)=\phi_{q}(t_1)\circ \phi_{q}(t_2) \ \ \ \ \ (2)&fg=000000$

Remark 1 take $latex {f(x,y)=x+y}&fg=000000$ in 1, this conjecture reduce to the toy model, i.e. classical Brunn-Minkowski inequality.

Remark 2 We could generate the problem to the problem which is charged by several energy function $latex {f_1,…,f_k}&fg=000000$, if the induced gradient flow is amenable, then this is somewhat similar with the one dimension case, I wish if we could do something for the single function $latex {f_1}&fg=000000$, then we can say something for the several functions involved case.

Remark 3 This could also generate to amenable group action case and quantization of it.

Meaning, the cohomology induce by a hamiltonian system on some special foliation on fiber of geometric bundle. This type of result could help to establish the vanish of the cohomology, the get the existence theory and regularity result for corresponding elliptic nonlinear differential equation. And solve the original problem I consider.

Now we given the statement of Brunn-Minkowski inequality.

Theorem 2 (brunn minkowski inequality) For $latex {A,B}&fg=000000$ measurale set in $latex {{\mathbb R}^d}&fg=000000$. we have following,

$latex \displaystyle \mu^{\frac{1}{d}}(A+B)\geq \mu^{\frac{1}{d}}\mu(A)+\mu^{\frac{1}{d}}\mu(B) \ \ \ \ \ (3)&fg=000000$

 

minkowski_functional.png“>

The general spproach of Brunn-Minkwoski inequality is following,

  1. divide the measurable set $latex {A,B}&fg=000000$ into small cubes.
  2. Shinking trick, transform the set into convex one.

for the first one, we have the following lemma,

Lemma 3 $latex {\forall 0<\lambda<1}&fg=000000$, $latex {\exists \epsilon>0}&fg=000000$ $latex {A_{\epsilon}}&fg=000000$ measurable set, $latex {A_{\epsilon}\subset A}&fg=000000$, $latex {\mu(A-A_{\epsilon})<\epsilon}&fg=000000$, and $latex {A=(A-A_{\epsilon})\amalg \cup_{i\in I}(c_i\cap A_{\epsilon} ) }&fg=000000$, and

$latex \displaystyle \frac{\mu(c_i\cap A_{\epsilon})}{\mu(c_i)}>\lambda, \forall i\in I \ \ \ \ \ (4)&fg=000000$

Proof: The proof of the lemma is a easy corollary of the construction of Lesbegue(or Borel) measurable $latex {\sigma}&fg=000000$ algebra. $latex \Box&fg=000000$

Remark 4 The existence of the property given in the lemma is not the key point, the key point is $latex {d(A_{\epsilon},A)\rightarrow 0, as\ \ \epsilon\rightarrow 0}&fg=000000$.

Has this two simplify in hand, we could give several approach to proof the inequality and these proof carry information more than just a proof, they carry some information with the structure of space $latex {(\{ 0,1 \}_{{\mathbb R}^d},\mu^{\frac{1}{d}},+)}&fg=000000$. \newpage

2. A proof with discretization

There is a lots of ways to attack the Brunn-Minkowski inequality, the most natural one is discretization. But unfortunately there is some technique obstacle for proof or even state the discretization version of “Brunn-Minkowski” inequality.

The “boundary” and “area” should not compatible.

$latex \displaystyle \mu_{d-1}(\partial E)<<\mu_{d}(\mu(E)) \ \ \ \ \ (5)&fg=000000$

And we need use the fact,

$latex \displaystyle A_{\epsilon}\overset{G-H \ sence}\longrightarrow A \ \ \ \ \ (6)&fg=000000$

Now we just state what we expect it should transform in, because we have a fully understanding with the discretization model, there is a result named Cauchy-Daveport inequality.

Theorem 4 (cauchy-daveport inequality) There are two case, one in $latex {{\mathbb Z}}&fg=000000$, one in finite field $latex {{\mathbb Z}_p}&fg=000000$.

  1. $latex {{\mathbb Z}}&fg=000000$ case, $latex {\forall A,B\subset {\mathbb Z}}&fg=000000$ are finite set,we have,

    $latex \displaystyle |A+B|\geq |A|+|B|-1. \ \ \ \ \ (7)&fg=000000$

  2. $latex {{\mathbb Z}_p}&fg=000000$ case, $latex {\forall A,B\subset {\mathbb Z}}&fg=000000$ are finite set,we have,

    $latex \displaystyle |A+B|\geq \max\{|A|+|B|-1.p\}. \ \ \ \ \ (8)&fg=000000$

 

Proof: for the $latex {{\mathbb Z}}&fg=000000$ case, the story is more or less trivial, just do to a observation, if $latex {A=\{a_i,a_1<a_2<…<a_n\},B=\{b_i,b_1<b_2<…<b_m\}}&fg=000000$, then

$latex \displaystyle a_1+b_1<\min\{a_1+b_2+a_2+b_1\}<….<a_n+a_m&fg=000000$

There exists a strictly increasing chain of length at least $latex {n+m-1}&fg=000000$.

For the $latex {{\mathbb Z}_p}&fg=000000$ case, following is a graph to explain what happen, basically we define a operation on tuples, i.e. $latex {T:(A,B)\rightarrow (T(A),T(B))}&fg=000000$, and make the additive energy $latex {E_{A,B}:=|A+B|}&fg=000000$ decreasing. after induction with this transform and the transform from a tuple to the minimum additive energy by translation, the additive energy decreasing and decreasing then arrive the global minimum. But it is easy to conclude in this case one of $latex {A,B}&fg=000000$ become null set and then the inequality 8 follows. $latex \Box&fg=000000$

But when we discrete the Brunn-Minkowski inequality, we expect a high dimension generation of the inequality 4. Naively we wish,

Theorem 5 (naive generation of cauchy-daveport inequality) For $latex {d\in {\mathbb N}}&fg=000000$, and $latex {\forall A,B\subset {\mathbb Z}_d}&fg=000000$ are finite sets,

$latex \displaystyle |A+B|^{\frac{1}{d}}\geq |A|^{\frac{1}{d}}+|B|^{\frac{1}{d}} \ \ \ \ \ (9)&fg=000000$

But this is not the case, there is a counterexample for 5. We could construct some $latex {A,B}&fg=000000$ such that $latex {|A+B|\sim |A|+|B|}&fg=000000$, consider they be very thin line.

So why we are in this worse situation? because we lose the information of $latex {A_{\epsilon}\overset{G-H}\longrightarrow A }&fg=000000$, $latex {B_{\epsilon}\overset{G-H}\longrightarrow B }&fg=000000$. So they have the trend tending to make the “boundary” campatible with “area”. Two thin line in the same direction is exactly the worst case, which is just a equal condition of 1-dimension case.

One natural way to except the situation is to bounded the “isperimetric constant”, to assume $latex {A,B}&fg=000000$ varies in a subset of measurable set, with addition condition that $latex {\frac{\mu_{d-1}(\partial A)}{\mu_d(A)}}&fg=000000$ is bounded by some constant. But this is also not the suitable set for our inequality, I explain how to capture the information of the G-H coverage.

Now assume $latex {A,B}&fg=000000$ are convex bounded set, and we take a global orthogonal basis in $latex {{\mathbb R}^d}&fg=000000$. named $latex {(e_1,…,e_d)}&fg=000000$. We give the definition of $latex {\epsilon-}&fg=000000$discretization of $latex {A}&fg=000000$, named $latex {A_{\epsilon}}&fg=000000$.

Definition 6 ($latex {\epsilon-}&fg=000000$ discretization) The construction of $latex {A_{\epsilon}}&fg=000000$ from $latex {A}&fg=000000$ is following:

  1. divide $latex {A}&fg=000000$ into $latex {\amalg_{i\in I}c_i\cap A}&fg=000000$, $latex {c_i}&fg=000000$ is the $latex {\epsilon}&fg=000000$ cubes.
  2. use $latex {c_i}&fg=000000$ or $latex {\emptyset}&fg=000000$ instead of $latex {c_i\cap A}&fg=000000$ depending on iff $latex {\frac{\mu(A\cap c_i)}{\mu(c_i)}>\lambda}&fg=000000$, where $latex {\lambda<1}&fg=000000$ is a given number only rely on $latex {\epsilon}&fg=000000$. i.e.

    $latex \displaystyle A\cap c_i\rightarrow D_{\epsilon} (A\cap c_i) \ \ \ \ \ (10)&fg=000000$

  3. glue them, define $latex {A_{\epsilon}:=\amalg_{i\in I}D_{\epsilon}(c_i\cap A)}&fg=000000$.

 

Now we describe the condition of $latex {A_{\epsilon}\overset{G-H}\rightarrow A}&fg=000000$ rigorous meaning campatible with $latex {\epsilon-}&fg=000000$discretization.

Under the basis, there is a coordinate we could know iff $latex {c_i=c(\lambda_1,…,\lambda_d)}&fg=000000$ is the cube center at $latex {(\epsilon\lambda_1,…,\epsilon\lambda_d )}&fg=000000$ if it is in $latex {A_{\epsilon}}&fg=000000$. Due to $latex {A}&fg=000000$ is convex, $latex {\partial A}&fg=000000$ is lipchitz. So you will have some locolization property, said, at every fix discretization scale $latex {\epsilon}&fg=000000$, the position of $latex {(\lambda_1\epsilon ,…,\lambda_d\epsilon)}&fg=000000$ is morally known so the number of cubes in $latex {A_{\epsilon}}&fg=000000$ in the one dimensional affine space $latex {\Omega^{\epsilon}_{a_1,…,a_{k-1},a_{k+1},…,a_n}}&fg=000000$ which is the subspace of $latex {{\mathbb Z}^d}&fg=000000$ the number $latex {\Omega^{\epsilon}_{a_1,…,a_{k-1},a_{k+1},…,a_n}}&fg=000000$ is asymptopic to the $latex {1-}&fg=000000$dimensional hausdorff measure of $latex {A\cap \Omega_{a_1,…,a_{k-1},a_{k+1},…,a_n}}&fg=000000$. So at least,

$latex \displaystyle |\Omega^{\epsilon}_{a_1,…,a_{k-1},a_{k+1},…,a_n}|=O((\frac{1}{\epsilon})) \ \ \ \ \ (11)&fg=000000$

 

Property 11 is crucial, which mean $latex {A_{\epsilon}}&fg=000000$ is really a n-dimensional space and automatically we have the bounded on isoperimetric constant $latex {\frac{\mu_{d-1}(\partial A)}{\mu_d(A)}}&fg=000000$.

Now we can look at every $latex {A_{\epsilon}}&fg=000000$ and take limit $latex {\epsilon\rightarrow \infty}&fg=000000$. In fact we a in the situation with Accumulation of wood to make the product have smallest volume. Not to optimized the tuples $latex {(A,B)}&fg=000000$ but fix one of it, said $latex {B}&fg=000000$, optimized the other one, said $latex {A}&fg=000000$. This is the key point of proof, a little bit different from the argument of $latex {1-}&fg=000000$dimensional 4 where we optimized the tuple.

Key point:

  1. we can ignore “small core”.
  2. This inequality is said, due to $latex {A+B=(\frac{A+B}{2})+\frac{A+B}{2}}&fg=000000$, the convex of the functional $latex {\mu^{\frac{1}{d}}}&fg=000000$ on convex set.

The way of discretization could not handle the problem but definitely said that the difficulty occur with the shape of boundaries $latex {\partial A,\partial B}&fg=000000$.

3. A proof with “central of mass” and Minkowski functional

Definition 7 (Central of mass) The central of mass $latex {p}&fg=000000$ of measurable set $latex {A}&fg=000000$, if exist, satisfied, $latex {\forall e\in S^d}&fg=000000$, there is a subspace $latex {L_e}&fg=000000$ with codimension 1 divide $latex {A}&fg=000000$ into two connected part $latex {A_{1,L_e},A_{2,L_e}}&fg=000000$ such that

$latex \displaystyle \mu(A_{1,L_e})=\mu(A_{2,L_e}) \ \ \ \ \ (12)&fg=000000$

then $latex {p\in l_e}&fg=000000$.

Remark 5 For a measurable set $latex {A}&fg=000000$ , if central of mass $latex {p}&fg=000000$ exists, then there exist only one. This is a easy observation do to the definition of $latex {p}&fg=000000$, i.e. the intersection of suitable affine subspace in every direction.

Theorem 8 (existence of central of mass)

Proof: It is easy to attain $latex {p}&fg=000000$ by take $latex {n+1}&fg=000000$ different directions in $latex {S^d}&fg=000000$, then easy to proof every line $latex {L_e}&fg=000000$ across it be definition of $latex {L_e}&fg=000000$. $latex \Box&fg=000000$

Definition 9 (Minkowski functional) for a measurable set $latex {X\subset {\mathbb R}^d}&fg=000000$ and a point $latex {p}&fg=000000$, define $latex {M_{X,p}}&fg=000000$ on $latex {S^{d-1}}&fg=000000$, such that

$latex \displaystyle M_X(e)=\sup_{\lambda>0,\lambda e+p\in X}\lambda \ \ \ \ \ (13)&fg=000000$

 

.

Remark 6 If $latex {A}&fg=000000$ is convex, then $latex {M_A}&fg=000000$ is a convex function on $latex {S}&fg=000000$, so it is lipchitz.

we have following formula for the measure of $latex {A}&fg=000000$.

Theorem 10

$latex \displaystyle \mu(A)=\frac{1}{\mu(S^{d-1})}\int_{S^{d-1}}M_A(e)de \ \ \ \ \ (14)&fg=000000$

Proof: trivial. $latex \Box&fg=000000$

Now the task reduce fixing $latex {B}&fg=000000$ and $latex {\mu(A)}&fg=000000$ to optimized $latex {A}&fg=000000$ make $latex {\mu(A+B)}&fg=000000$ small. It is the same as make $latex {\frac{1}{\mu(S^{d-1})}\int_{S^{d-1}}M_{A+B}(e)de}&fg=000000$ small when fix $latex {\frac{1}{\mu(S^{d-1})}\int_{S^{d-1}}M_A(e)de}&fg=000000$ and $latex {M_B}&fg=000000$. Due to

$latex \displaystyle M_{A+B}(x)=\sup_{x_1,x_2\in S^{d-1}}M_A(x_1)+M_B(x_2) \ \ \ \ \ (15)&fg=000000$

This lead to the whole story, given a proof of 3.

4. A proof with multi-scale analysis

This approach is a nonstandard one, due to I believe the renormlization or continue fractional or multilinear estimate is everywhere. We first play with a toy model, the rectangle.

Theorem 11 Brunn-Minkowski inequality is right for $latex {A,B}&fg=000000$ are rectangles.

Proof:

$latex \displaystyle (\Pi(a_1+b_i) )^{\frac{1}{d}}\geq (\Pi(a_1) )^{\frac{1}{d}}+(\Pi(a_1+b_i) )^{\frac{1}{d}} &fg=000000$

$latex \displaystyle \Leftrightarrow 1\geq [\frac{\Pi a_i}{\Pi(a_i+b_i)}]^{\frac{1}{d}}+[\frac{\Pi a_i}{\Pi(a_i+b_i)}]^{\frac{1}{d}} \ \ \ \ \ (16)&fg=000000$

 

In fact,

$latex \displaystyle \begin{array}{rcl} 16 RHS & \overset{A-G}\leq &\frac{1}{d}\sum_i\frac{a_i}{a_i+b_i}+ \frac{1}{d}\sum_i\frac{b_i}{a_i+b_i}\\ & = & 1. \end{array} &fg=000000$

$latex \Box&fg=000000$

The story is following,

5. connection of Brunn-Minkowski inequality and Sobolev inequality, the firth proof

We begin with a calculate based on intuition and it is not rigorous.

$latex \displaystyle \begin{array}{rcl} \mu(A+B)^{\frac{1}{d}} & = & [\int_{{\mathbb R}}(\int_{{\mathbb R}^{d-1}}\chi_{A+B}(\xi_1,…,\xi_{n-1})d\xi_1…d\xi_{n-1} )d\xi_n]^{\frac{1}{d}}\\ & \sim & \frac{1}{\mu(A_n)}\int_{{\mathbb R}}(\int_{{\mathbb R}^{d-1}}\chi_{A+B}(\xi_1,…,\xi_{n-1})d\xi_1…d\xi_{n-1})^{\frac{1}{d-1}}d\xi_n\\ &\overset{induction \ on \ d} \geq & \frac{1}{\mu(A_n)}(\mu^{\frac{1}{d-1}}(A(\xi_n))+\mu^{\frac{1}{d-1}}(B(\xi_n)))d\xi \\ &\overset{A-G \ inequality} \geq & \mu^{\frac{1}{d}}(A)+\mu^{\frac{1}{d}}(B). \end{array} &fg=000000$

The second line is due to I believe there $latex {\exists \ A,B}&fg=000000$ such that it is a equality, by the equal condition of Minkowski inequality, in fact this is morally inverse of Minkowski inequality. The second reason in general case why the second inequality is true is due to a rescaling argument, change $latex {(\xi_1,…,\xi_n)\rightarrow (\lambda\xi_1,…,\lambda\xi_n ), \forall \lambda\in {\mathbb R}}&fg=000000$, by the rescaling argument we conclude if there is a such inequality, the index of it must be the case.