Yang-Mills equations: Difference between revisions

From DispersiveWiki
(Added a clarification)
m (→‎Heisenberg equations: Fixed Wiki syntax)
 
(35 intermediate revisions by 4 users not shown)
Line 1: Line 1:
{{equation
{{equation
  | name = Yang-Mills  
  | name = Yang-Mills  
  | equation = <math>D_\alpha  F^{\alpha \beta} = 0</math>
  | equation = <math>\, D_\alpha  F^{\alpha \beta} = 0\!</math>
  | fields = <math>A_\alpha: \R^{1+d} \to \mathfrak{g}</math>
  | fields = <math>A_\alpha: \R^{1+d} \to \mathfrak{g}</math>
  | data = <math>A_\alpha[0] \in H^s(\R^d) \times H^{s-1}(\R^d)</math>
  | data = <math>A_\alpha[0] \in H^s(\R^d) \times H^{s-1}(\R^d)</math>
Line 9: Line 9:
  | critical = <math>\dot H^{d/2 - 1}(\R^d)</math>
  | critical = <math>\dot H^{d/2 - 1}(\R^d)</math>
  | criticality = energy critical for d=4
  | criticality = energy critical for d=4
  | covariance = [[Lorentzian]], [[gauge]]
  | covariance = [[Lorentzian]], [[gauge]], [[conformal]]
  | lwp = varies | gwp = varies
  | lwp = varies | gwp = varies
  | parent = [[DNLW]]
  | parent = [[DNLW]]
Line 16: Line 16:
}}
}}


====The Yang-Mills equation====
==The Yang-Mills equations==
 
===Classical equations===


Let <math>A</math> be a connection on <math>R^{d+1}</math> which takes values in the Lie algebra g of a compact Lie group G. Formally, the connection A is said to obey the ''Yang-Mills equation'' if it is a critical point for the Lagrangian functional
Let <math>A</math> be a connection on <math>R^{d+1}</math> which takes values in the Lie algebra g of a compact Lie group G. Formally, the connection A is said to obey the ''Yang-Mills equation'' if it is a critical point for the Lagrangian functional
Line 28: Line 30:
where <math>\nabla_{x,t} A = \partial_ a  A^ a</math>  is the spacetime divergence of <math>A</math>. A more succinct (but less tractable) formulation of this equation is
where <math>\nabla_{x,t} A = \partial_ a  A^ a</math>  is the spacetime divergence of <math>A</math>. A more succinct (but less tractable) formulation of this equation is


<center><math>D_\alpha  F^{\alpha \beta} = 0</math>.</center>
<center><math>\, D_\alpha  F^{\alpha \beta} = 0\!</math>.</center>


It is often convenient to split <math>A</math> into temporal and spatial components as <math>A = (A_0, A_i)</math>.
It is often convenient to split <math>A</math> into temporal and spatial components as <math>\, A = (A_0, A_i)\!</math>.


As written, the Yang-Mills equation is under-determined because of the gauge invariance
As written, the Yang-Mills equation is under-determined because of the gauge invariance


<center><math>A -> U^{-1} dU + U^{-1} A U</math></center>
<center><math>\, A\rightarrow U^{-1} dU + U^{-1} A U\!</math></center>
<center><math>F -> U^{-1} F U</math></center>
<center><math>\, F\rightarrow U^{-1} F U\!</math></center>


in the equation, where U is an arbitrary function taking values in <math>G</math>. In order to correctly formulate a Cauchy problem, one must impose a further constraint on the gauge. There are three standard ones:
in the equation, where U is an arbitrary function taking values in <math>G</math>. In order to correctly formulate a Cauchy problem, one must impose a further constraint on the gauge. There are three standard ones:


<center>[[Temporal gauge]]: <math>A^0 = 0</math></center>
<center>[[Temporal gauge]]: <math>\, A^0 = 0\!</math></center>
<center>[[Coulomb gauge]]: <math>\partial_i A_i = 0</math></center>
<center>[[Coulomb gauge]]: <math>\partial_i A_i = 0</math></center>
<center>[[Lorenz gauge]]: <math>\nabla_{x,t} A = 0</math></center>
<center>[[Lorenz gauge]]: <math>\nabla_{x,t} A = 0</math></center>
Line 88: Line 90:


====Yang-Mills on R<sup>3</sup>====
====Yang-Mills on R<sup>3</sup>====


* Scaling is s_c = 1/2.
* Scaling is s_c = 1/2.
Line 115: Line 116:
** For small smooth compactly supported data, one can obtain global existence from the [[QNLW|general theory of quasi-linear equations]].
** For small smooth compactly supported data, one can obtain global existence from the [[QNLW|general theory of quasi-linear equations]].
** For large data Yang-Mills, numerics suggest that blowup does occur, with the solution resembling a rescaled instanton at each time [[BizTb2001]], [[Biz-p]].
** For large data Yang-Mills, numerics suggest that blowup does occur, with the solution resembling a rescaled instanton at each time [[BizTb2001]], [[Biz-p]].
*** Further numerics suggests that the radius of the instanton in fact decays like <math>C t / \sqrt(\log t)</math> [[BizOvSi-p]].
*** Further numerics suggests that the radius of the instanton in fact decays like <math>C t / \sqrt{\log t}</math> [[BizOvSi-p]].
** GWP for small <math>B^{1,1}</math> data (with an additional angular derivative of regularity) in the [[Lorenz gauge]] is in [[Stz-p2]].
** GWP for small <math>B^{1,1}</math> data (with an additional angular derivative of regularity) in the [[Lorenz gauge]] is in [[Stz-p2]].


Line 133: Line 134:
* In the supercritical case p>5 one probably has LWP for <math>s \geq s_c </math>(because this is true for the Yang-Mills and NLW equations separately), but this has not been rigorously shown. No large data global results are known, but this is also true for the supposedly simpler supercritical NLW. It seems possible however that one could obtain small-data GWP results.
* In the supercritical case p>5 one probably has LWP for <math>s \geq s_c </math>(because this is true for the Yang-Mills and NLW equations separately), but this has not been rigorously shown. No large data global results are known, but this is also true for the supposedly simpler supercritical NLW. It seems possible however that one could obtain small-data GWP results.


====Yang-Mills spectrum on R<sup>4</sup>====
===Quantum equations===
 
Unlike the classical case, the corresponding quantum theory has not yet been proved to exist. The main difficulties stem from gauge invariance, which makes even a Euclidean formulation, otherwise potentially manageable through Wiener path integrals, problematic at best. From a physical standpoint, one sets these difficulties aside and writes down a quantum field theory that is studied through perturbation theory in a small coupling, which restricts the analysis to high momenta (short distances). This approach has proved fruitful in understanding the phenomenology observed in laboratory experiments and in lattice computations. At small momenta (large distances), on the other hand, the difficulties are overwhelming, and a numerical approach on the lattice is the most suitable tool. Thus even non-rigorous methods seem to fail to give a clear understanding of this regime. In what follows we therefore give only a formal presentation of the material, with the idea that it could eventually help settle fundamental questions such as the existence of the mass gap or of the theory itself. We use [[FaSl1980]] as a reference.
 
====Hamiltonian formulation====
 
The Yang-Mills equations can be cast in Hamiltonian form. The canonical variables are <math>(E^a_k,A^b_l)</math>, with <math>a,b</math> running over the [[Lie group]] indices and <math>k,l</math> labelling the spatial coordinates. One has
 
<math> E_k=F_{k0} </math>
<math> B_k=-\frac{1}{2}\epsilon_{ijk}F_{ij} </math>
<math> C = DE </math>
 
and the [[Hamiltonian]] can easily be written down as
 
<math> H=\int d^{D-1}x{\rm Tr}(E^2+B^2) </math>
 
The dynamics has a constraint and we need a gauge condition to get the equations of motion. This can also be seen in the corresponding Lagrangian formulation, after a Legendre transform, where a Lagrange multiplier indeed appears.
 
Now, one can write down the corresponding [[Poisson bracket|Poisson brackets]] obtaining
 
<math>\{E_k^a(x),A_l^b(y)\}=\delta_{kl}\delta_{ab}\delta^{D-1}(x-y) </math>
 
<math>\{E_k^a(x),E_l^b(y)\}=0,\ \{A_k^a(x),A_l^b(y)\}=0</math>
 
<math>\{C^a(x),C^b(y)\}=f^{abc}C^c(x)\delta^{D-1}(x-y) </math>
 
where <math>f^{abc}</math> are the structure constants of the [[Lie group]]. We further note that <math>\{C^a,H\}=0</math> and so <math>\partial_t C^a=0</math>.
 
====Heisenberg equations====
 
In order to obtain the quantum dynamics we can use [[Dirac quantization|Heisenberg equations]] of motion (<math>\hbar=1</math>)
 
<math>\partial_0E_k^a=i[H,E_k^a]</math>
<math>\partial_0A_l^b=i[H,A_l^b]</math>


Yang-Mills Lagrangian functional can be mapped on a NLKG Lagrangian functional after some conditions are set. This has been shown in [[FraE2007]]. To see this let us consider the Yang-Mills functional as
and the set of commutation relations obtained from the Poisson brackets through [[Dirac quantization]]


<center>
<math>[E_k^a(x),A_l^b(y)]=i\delta_{kl}\delta_{ab}\delta^{D-1}(x-y)</math>
<math>
    S=\int d^4x\left[\frac{1}{2}\partial_\mu A^a_\nu\partial^\mu A^{a\nu}+f^{abc}\partial_\mu A_\nu^aA^{b\mu}A^{c\nu}
+\frac{1}{4}f^{abc}f^{ars}A^b_\mu A^c_\nu A^{r\mu}A^{s\nu}\right]
</math>
</center>


being <math>f^{abc}</math> the structure constant of the Lie group and with the gauge choice <math>\partial_\mu A^{a\mu}=0</math> (other choices boil down to a change of coordinates that does modify our conclusions). We can assume for <math>A_\mu^a</math> that a "mapping theorem" holds [[FraE2007]] with some components being zero and the others are all equal. Then, the functional takes the form
<math>[E_k^a(x),E_l^b(y)]=0,\ [A_k^a(x),A_l^b(y)]=0</math>


<center>
<math>[C^a(x),C^b(y)]=if^{abc}C^c(x)\delta^{D-1}(x-y).</math>
<math>
    S=-C_1\int d^4x\left[\frac{1}{2}\partial_\mu A\partial^\mu A - \frac{C_2}{4}A^4\right]
</math>
</center>


being <math>C_1</math> and <math>C_2</math> two constants depending on the Lie group. We can recognize the NLKG functional already seen in [[Perturbation theory]]. This means that at a leading order of a gradient expansion the two quantum theories share the spectrum as
A further condition to fix the gauge is also needed.


<center>
These are operator equations, so a suitable state space must be constructed to give them meaning. At present no rigorous proof that this construction exists is available; such a proof would amount to a proof of existence of quantum Yang-Mills theory itself. The best one can do is to carry out small-coupling perturbation theory for these equations on a Fock space built from the solutions of the leading-order equations. This construction also lacks a rigorous proof, but it is common practice in the physics community and has had considerable success. In practice, however, such computations are carried out by a different approach that relies heavily on path integration, and no sound foundation for path integrals in Minkowski space exists yet. Finally, the existence of the theory is not guaranteed for every <math>D</math>. It is known that for <math>D>4</math> the theory is not renormalizable and small perturbation theory cannot be carried out.
<math>
\mu_n=(2n+1)\frac{\pi}{2K(i)}\left(\frac{C_2}{2}\right)^\frac{1}{4}\Lambda
</math>
</center>


giving a mass gap in both cases. It should be noted that a weak perturbation expansion does not share this property for the spectrum of a NLKG equation due to quantum corrections. A first indication was given in [[FraD2007]] and a full mathematical proof is given in [[FraE2007]].


[[Category:Geometry]]
[[Category:Geometry]]
[[Category:Wave]]
[[Category:Wave]]

Latest revision as of 15:53, 1 July 2018

Yang-Mills
Description
Equation: <math>D_\alpha F^{\alpha \beta} = 0</math>
Fields: <math>A_\alpha: \R^{1+d} \to \mathfrak{g}</math>
Data class: <math>A_\alpha[0] \in H^s(\R^d) \times H^{s-1}(\R^d)</math>
Basic characteristics
Structure: Hamiltonian
Nonlinearity: semilinear with derivatives
Linear component: Wave
Critical regularity: <math>\dot H^{d/2 - 1}(\R^d)</math>
Criticality: energy critical for d=4
Covariance: Lorentzian, gauge, conformal
Theoretical results
LWP: varies
GWP: varies
Related equations
Parent class: DNLW
Special cases: Yang-Mills on R2, R3, R4
Other related: MKG, Cubic NLW, Yang-Mills-Higgs


The Yang-Mills equations

Classical equations

Let <math>A</math> be a connection on <math>\R^{d+1}</math> which takes values in the Lie algebra <math>\mathfrak{g}</math> of a compact Lie group <math>G</math>. Formally, the connection <math>A</math> is said to obey the Yang-Mills equation if it is a critical point for the Lagrangian functional

where <math>F</math> is the curvature of the connection <math>A</math>. The Euler-Lagrange equations for this functional have the schematic form

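where <math>\nabla_{x,t} A = \partial_\alpha A^\alpha</math> is the spacetime divergence of <math>A</math>. A more succinct (but less tractable) formulation of this equation is

<center><math>D_\alpha F^{\alpha \beta} = 0</math>.</center>

It is often convenient to split <math>A</math> into temporal and spatial components as <math>A = (A_0, A_i)</math>.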

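As written, the Yang-Mills equation is under-determined because of the gauge invariance

<center><math>A \rightarrow U^{-1} dU + U^{-1} A U</math></center>
<center><math>F \rightarrow U^{-1} F U</math></center>

in the equation, where <math>U</math> is an arbitrary function taking values in <math>G</math>. In order to correctly formulate a Cauchy problem, one must impose a further constraint on the gauge. There are three standard ones: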

Temporal gauge: <math>A^0 = 0</math>
Coulomb gauge: <math>\partial_i A_i = 0</math>
Lorenz gauge: <math>\nabla_{x,t} A = 0</math>

There are also several other useful gauges, such as the Cronstrom gauge Cs1980 centered around a point in spacetime.

The Lorenz gauge has the advantage of being invariant under conformal transformations, but it appears that the Yang-Mills equation is not well-behaved in this gauge for rough data. (For smooth data one can obtain local well-posedness in this gauge by energy estimates). The Coulomb gauge is the simplest to work with technically, and in this gauge the bilinear expression acquires a null structure KlMa1995 which allows for a satisfactory analysis of the equation. Unfortunately there are often difficulties in creating a global Coulomb gauge, and one often has to rely instead on local Coulomb gauges pieced together using finite speed of propagation; see KlMa1995. The Temporal gauge is fairly close to the Coulomb gauge, and one can develop a parallel theory for this gauge. The temporal gauge has the advantage of being easy to establish globally, but the null form structure is less obvious (one needs to partition the connection into divergence-free and curl-free components). See e.g. Ta2003.
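
The gauge covariance <math>F \rightarrow U^{-1} F U</math> stated above can be checked numerically in a very restricted special case. The sketch below is only an illustration under assumptions that are not taken from this page: the group is SU(2), the connection components and the gauge function <math>U</math> are constant (so that <math>dU = 0</math> and all derivative terms in the curvature vanish), and the curvature convention is <math>F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu + [A_\mu, A_\nu]</math>. All function names are hypothetical.

```python
import numpy as np

# Pauli matrices; (i/2) * sigma_a span su(2) in this sketch's (assumed) convention.
sigma = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)

def su2_element(coeffs):
    """Anti-Hermitian traceless matrix sum_a coeffs[a] * (i/2) * sigma_a."""
    return 0.5j * np.tensordot(coeffs, sigma, axes=1)

def curvature_constant(A):
    """F_{mu nu} = [A_mu, A_nu] for a constant connection (derivative terms vanish)."""
    n = len(A)
    return np.array([[A[m] @ A[k] - A[k] @ A[m] for k in range(n)] for m in range(n)])

def random_su2_group_element(rng):
    """Unit quaternion q0*I + i*(q . sigma), which is a matrix in SU(2)."""
    q = rng.normal(size=4)
    q /= np.linalg.norm(q)
    return q[0] * np.eye(2) + 1j * np.tensordot(q[1:], sigma, axes=1)

rng = np.random.default_rng(0)
A = np.array([su2_element(rng.normal(size=3)) for _ in range(4)])  # A_0, ..., A_3
U = random_su2_group_element(rng)
U_inv = U.conj().T  # U is unitary, so its inverse is its conjugate transpose

# Constant U: the U^{-1} dU term vanishes and A -> U^{-1} A U.
A_gauged = np.array([U_inv @ a @ U for a in A])
F = curvature_constant(A)
F_gauged = curvature_constant(A_gauged)
F_expected = np.array([[U_inv @ F[m, k] @ U for k in range(4)] for m in range(4)])

covariance_error = np.abs(F_gauged - F_expected).max()

# The trace of F_{mu nu} F_{mu nu}, summed over index pairs (no metric signs),
# is unchanged by conjugation.
trace_before = sum(np.trace(F[m, k] @ F[m, k]) for m in range(4) for k in range(4))
trace_after = sum(np.trace(F_gauged[m, k] @ F_gauged[m, k]) for m in range(4) for k in range(4))
print(covariance_error, abs(trace_before - trace_after))
```

Both printed numbers should be at the level of floating-point round-off. A check with a non-constant <math>U</math> would need the <math>U^{-1} dU</math> term and the derivative terms of the curvature, which this sketch deliberately omits.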

In the Coulomb or Temporal gauges, one can create a model equation for the Yang-Mills system by ignoring cubic terms and any contribution from the "elliptic" portion of the gauge ( in the Coulomb gauge, or the curl-free portion of in the Temporal gauge). The resulting model equation is

where is some null form such as

.

The results known for the model equation are slightly better than those known for the actual Yang-Mills or Maxwell-Klein-Gordon equations.

The Yang-Mills equations come with a positive definite conserved Hamiltonian

which mostly controls the norm of and the norm of . However, there are some portions of the norm which are not controlled by the Hamiltonian (in the Coulomb gauge, it is ; in the Temporal gauge, it is the norm of the curl-free part of ). This causes some technical difficulties in the global well-posedness theory.

The Yang-Mills equations can also be coupled with a g-valued scalar field , with the Lagrangian functional of the form

where are covariant derivatives and is some potential function (e.g. . The corresponding Euler-Lagrange equations have the schematic form

and are generally known as the Yang-Mills-Higgs system of equations. This system may be thought of as a Yang-Mills equation coupled with a semi-linear wave equation. The Maxwell-Klein-Gordon system is a special case of Yang-Mills-Higgs.

The theory of Yang-Mills connections is considerably more advanced in the elliptic case (when the Minkowski metric is replaced by a Riemannian one), especially in the critical case of four dimensions, but a discussion of this topic is beyond our expertise.

Attention has mostly focused on the three and four dimensional cases; the one-dimensional case is trivial (e.g. in the temporal gauge it collapses to ). In higher dimensions n=5,7,9 singularities can develop from large smooth radial data CaSaTv1998 (see also Biz-p). Numerics suggest this phenomenon is generic, and one also appears to have blowup at the critical dimension BizTb2001, Biz-p.

The Yang-Mills equations can also be coupled with a spinor field. In the abelian case <math>G = U(1)</math> this becomes the Maxwell-Dirac equation.

The Yang-Mills equations in dimension d have many formal similarities with the wave maps equation at dimension d-2 (see e.g. CaSaTv1998 for a discussion).


Yang-Mills on R2

  • Scaling is s_c = 0.
  • One can use the method of descent and finite speed of propagation to infer R2 results from the R^3 results. Thus, for instance, one has LWP for s > 3/4 in the temporal gauge and GWP in the temporal gauge for . These results are almost certainly non-optimal, however, and probably have much simpler proofs (for instance, one can obtain the LWP result from the general theory of DNLW without using any null form structure).

Yang-Mills on R3

  • Scaling is s_c = 1/2.
  • LWP for s > 3/4 in the Temporal gauge if the norm is sufficiently small Ta2003. The main tools are bilinear estimates involving both spaces and product Sobolev spaces.
    • Presumably the small data assumption can be removed, but the usual methods to do this fail because there are too many time derivatives in the non-linearity in the temporal gauge.
    • For in the Temporal or Coulomb gauges LWP for large data was shown in KlMa1995.
    • For s > 1 LWP for the Temporal, Coulomb, or Lorenz gauges follows from Strichartz estimates PoSi1993.
    • For s > 3/2 LWP for the Temporal, Coulomb, or Lorenz gauges follows from energy estimates EaMc1982.
    • There is a tentative conjecture that one in fact has ill-posedness in the energy class for the Lorenz gauge.
    • For the model equation LWP fails for s < 3/4 MaStz-p
    • The endpoint s = 1/2 looks extremely difficult, even for Besov space variants.
  • GWP is known for data with finite Hamiltonian (morally, this is for <math>s \geq 1</math>) in the Coulomb or Temporal gauges KlMa1995.


MKG and Yang-Mills in R4

  • Scaling is s_c = 1.
  • For the MKG equations in the Coulomb gauge, LWP is known for s > 1 Sb-p5. This is still not known for Yang-Mills.
    • For the model equations this is in KlTt1999
      • For general quadratic DNLW this is only known for s > 5/4 (e.g. by the estimates in FcKl2000). Strichartz estimates need s > 3/2 PoSi1993, while energy estimates need s > 2.
    • The latter two results (Strichartz and energy) easily extend to the actual MKG and YM equations in all three standard gauges.
  • It is conjectured that one has global well-posedness results for small energy, but this is open.
    • For small smooth compactly supported data, one can obtain global existence from the general theory of quasi-linear equations.
    • For large data Yang-Mills, numerics suggest that blowup does occur, with the solution resembling a rescaled instanton at each time BizTb2001, Biz-p.
      • Further numerics suggests that the radius of the instanton in fact decays like <math>C t / \sqrt{\log t}</math> BizOvSi-p.
    • GWP for small <math>B^{1,1}</math> data (with an additional angular derivative of regularity) in the Lorenz gauge is in Stz-p2.

MKG and Yang-Mills in Rd, d>4

  • Scaling is s_c = d/2 - 1.
  • LWP is almost certainly true for MKG-CG for s > s_c by adapting the results in Sb-p5. The corresponding question for Yang-Mills is still open.
    • For the model equations one can probably achieve this by adapting the results in Tt1999
  • For dimensions , GWP for small H^{d/2} data in MKG-CG is in RoTa-p. The corresponding question for Yang-Mills is still open, but a Besov result follows (in the Lorenz gauge) from Stz-p3.
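
The scaling exponents quoted in the dimension-by-dimension lists above all follow from the single formula s_c = d/2 - 1. The short sketch below tabulates it. The function names are hypothetical, and identifying the energy class with s = 1 (so that "energy critical" means s_c = 1) is an assumption of this sketch, chosen to agree with the statement that the equation is energy critical for d=4.

```python
from fractions import Fraction

def critical_regularity(d):
    """Scaling-critical Sobolev exponent s_c = d/2 - 1 for Yang-Mills on R^{1+d}."""
    return Fraction(d, 2) - 1

def is_energy_critical(d):
    """Assumes the energy class corresponds to s = 1, so criticality means s_c = 1."""
    return critical_regularity(d) == 1

for d in (2, 3, 4, 5):
    print(f"d={d}: s_c={critical_regularity(d)}, energy critical: {is_energy_critical(d)}")
```

Exact fractions are used instead of floats so that half-integer exponents such as s_c = 1/2 on R3 compare exactly.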

Yang-Mills-Higgs on R3

  • Suppose the potential energy V( f ) behaves like (i.e. defocussing p^th power non-linearity). When , the Higgs term is negligible, and the theory mimics that of the ordinary Yang-Mills equation. The most interesting case is p=5, since the Higgs component is then H^1-critical.
  • There is no perfect scale-invariance to this equation (unless p=3); the critical regularity is .
  • In the sub-critical case p<5 one has GWP for smooth data EaMc1982, GiVl1982b. This can be pushed to H^1 by the results in Ke1997. The local theory might be pushed even further.
  • In the critical case p=5 one has GWP for Ke1997.
  • In the supercritical case p>5 one probably has LWP for <math>s \geq s_c</math> (because this is true for the Yang-Mills and NLW equations separately), but this has not been rigorously shown. No large data global results are known, but this is also true for the supposedly simpler supercritical NLW. It seems possible however that one could obtain small-data GWP results.

Quantum equations

Unlike the classical case, the corresponding quantum theory has not yet been proved to exist. The main difficulties stem from gauge invariance, which makes even a Euclidean formulation, otherwise potentially manageable through Wiener path integrals, problematic at best. From a physical standpoint, one sets these difficulties aside and writes down a quantum field theory that is studied through perturbation theory in a small coupling, which restricts the analysis to high momenta (short distances). This approach has proved fruitful in understanding the phenomenology observed in laboratory experiments and in lattice computations. At small momenta (large distances), on the other hand, the difficulties are overwhelming, and a numerical approach on the lattice is the most suitable tool. Thus even non-rigorous methods seem to fail to give a clear understanding of this regime. In what follows we therefore give only a formal presentation of the material, with the idea that it could eventually help settle fundamental questions such as the existence of the mass gap or of the theory itself. We use FaSl1980 as a reference.

Hamiltonian formulation

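The Yang-Mills equations can be cast in Hamiltonian form. The canonical variables are <math>(E^a_k,A^b_l)</math>, with <math>a,b</math> running over the Lie group indices and <math>k,l</math> labelling the spatial coordinates. One has

<math> E_k=F_{k0} </math>
<math> B_k=-\frac{1}{2}\epsilon_{ijk}F_{ij} </math>
<math> C = DE </math>

and the Hamiltonian can easily be written down as

<math> H=\int d^{D-1}x{\rm Tr}(E^2+B^2) </math>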

The dynamics has a constraint and we need a gauge condition to get the equations of motion. This can also be seen in the corresponding Lagrangian formulation, after a Legendre transform, where a Lagrange multiplier indeed appears.

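Now, one can write down the corresponding Poisson brackets, obtaining

<math>\{E_k^a(x),A_l^b(y)\}=\delta_{kl}\delta_{ab}\delta^{D-1}(x-y) </math>

<math>\{E_k^a(x),E_l^b(y)\}=0,\ \{A_k^a(x),A_l^b(y)\}=0</math>

<math>\{C^a(x),C^b(y)\}=f^{abc}C^c(x)\delta^{D-1}(x-y) </math>

where <math>f^{abc}</math> are the structure constants of the Lie group. We further note that <math>\{C^a,H\}=0</math> and so <math>\partial_t C^a=0</math>.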
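
For the bracket of the constraints to define a consistent algebra, the structure constants must be antisymmetric and satisfy the Jacobi identity. The sketch below checks both properties numerically for one concrete example that is an assumption of this sketch and not taken from this page: the group SU(2), whose structure constants are (in a standard normalization) the Levi-Civita symbol. The helper name `levi_civita` is hypothetical.

```python
import numpy as np

def levi_civita():
    """Totally antisymmetric symbol eps[a, b, c] on three indices."""
    eps = np.zeros((3, 3, 3))
    eps[0, 1, 2] = eps[1, 2, 0] = eps[2, 0, 1] = 1.0
    eps[0, 2, 1] = eps[2, 1, 0] = eps[1, 0, 2] = -1.0
    return eps

# Assumed example: su(2), with f^{abc} equal to the Levi-Civita symbol.
f = levi_civita()

# Antisymmetry in the first two indices: f^{abc} + f^{bac} = 0.
antisym_error = np.abs(f + f.transpose(1, 0, 2)).max()

# Jacobi identity: f^{abe} f^{ecd} + f^{bce} f^{ead} + f^{cae} f^{ebd} = 0.
jacobi = (np.einsum('abe,ecd->abcd', f, f)
          + np.einsum('bce,ead->abcd', f, f)
          + np.einsum('cae,ebd->abcd', f, f))
jacobi_error = np.abs(jacobi).max()

print(antisym_error, jacobi_error)
```

The same two checks apply to the structure constants of any other compact Lie group once they are stored as a three-index array.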

Heisenberg equations

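In order to obtain the quantum dynamics we can use the Heisenberg equations of motion (<math>\hbar=1</math>)

<math>\partial_0E_k^a=i[H,E_k^a]</math>
<math>\partial_0A_l^b=i[H,A_l^b]</math>

and the set of commutation relations obtained from the Poisson brackets through Dirac quantization

<math>[E_k^a(x),A_l^b(y)]=i\delta_{kl}\delta_{ab}\delta^{D-1}(x-y)</math>

<math>[E_k^a(x),E_l^b(y)]=0,\ [A_k^a(x),A_l^b(y)]=0</math>

<math>[C^a(x),C^b(y)]=if^{abc}C^c(x)\delta^{D-1}(x-y).</math>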

A further condition to fix the gauge is also needed.
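
The structure of the Heisenberg equations, in which the time derivative of an operator is <math>i[H,\cdot]</math> applied to it, can be illustrated on a finite-dimensional toy model. The sketch below is not Yang-Mills theory: it assumes a single harmonic oscillator with <math>H = a^\dagger a + 1/2</math>, truncated to the first N number states, and all variable and function names are hypothetical. It only shows that the operator equations reproduce the classical equations of motion <math>\dot x = p</math>, <math>\dot p = -x</math>.

```python
import numpy as np

N = 8  # truncation of the oscillator Fock space (an arbitrary choice for this sketch)
n = np.arange(N)

a = np.diag(np.sqrt(n[1:]), k=1).astype(complex)  # annihilation operator
a_dag = a.conj().T                                # creation operator
x = (a + a_dag) / np.sqrt(2)                      # position-like operator
p = 1j * (a_dag - a) / np.sqrt(2)                 # momentum-like operator
H = a_dag @ a + 0.5 * np.eye(N)                   # H = a^dagger a + 1/2

def heisenberg_rhs(H, O):
    """Right-hand side i[H, O] of the Heisenberg equation (hbar = 1)."""
    return 1j * (H @ O - O @ H)

dx_dt = heisenberg_rhs(H, x)
dp_dt = heisenberg_rhs(H, p)

# The canonical commutator [x, p] = i holds only away from the truncation edge.
commutator = x @ p - p @ x

print(np.allclose(dx_dt, p), np.allclose(dp_dt, -x))
```

In the truncated space the equations of motion hold exactly, while the canonical commutator fails in the last diagonal entry. This is a small reminder that the choice of state space matters for giving operator equations a meaning.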

These are operator equations, so a suitable state space must be constructed to give them meaning. At present no rigorous proof that this construction exists is available; such a proof would amount to a proof of existence of quantum Yang-Mills theory itself. The best one can do is to carry out small-coupling perturbation theory for these equations on a Fock space built from the solutions of the leading-order equations. This construction also lacks a rigorous proof, but it is common practice in the physics community and has had considerable success. In practice, however, such computations are carried out by a different approach that relies heavily on path integration, and no sound foundation for path integrals in Minkowski space exists yet. Finally, the existence of the theory is not guaranteed for every <math>D</math>. It is known that for <math>D>4</math> the theory is not renormalizable and small perturbation theory cannot be carried out.