# Introduction to instanton counting
Note prepared for [Geometry of instantons and quivers](https://www.mathconf.org/giq2026), 22-26 June, 2026, Lille
----
The aim of these lectures is to explain basic notions of instantons in relation to various mathematical concepts in algebraic geometry, representation theory, and enumerative geometry. Here is the plan of four lectures.
1. Yang-Mills theory in four dimensions and instantons
We start with a quick review of Yang-Mills (YM) theory and then introduce the instanton with emphasis on its motivation and also the role of Anti-Self-Dual (ASD) connections.
2. ADHM construction of instantons on $\mathbb{S^4}$
We discuss the Atiyah-Drinfeld-Hitchin-Manin (ADHM) construction of instantons on $\mathbb{S^4}$. We see the construction of ASD connections and computation of the 2nd Chern class integral. We will briefly mention a relation to the Atiyah-Singer index theorem and the twistor.
3. Instanton moduli space
We introduce the instanton moduli space based on the ADHM construction, discussing the equivariant action, GIT quotient, and its relation to the Hilbert scheme of points on $\mathbb{C}^2$ (and also its higher-rank version). We will mention a general framework of Nakajima's quiver variety.
4. Equivariant integral and localization formula
We discuss general ideas to compute the (equivariant) integral over the instanton moduli space, based on the equivariant localization formula, analysis of fixed points under the torus action, etc. We derive Nekrasov's formula of the instanton partition function (equivariant volume of the instanton moduli space) and discuss its higher-dimensional analogues yielding enumerative invariants of algebraic varieties in three and four dimensions, e.g., Donaldson-Thomas invariants and Pandharipande-Thomas invariants.
#### Main references
- Donaldson, Kronheimer, *[The Geometry of Four-manifolds](https://books.google.fr/books/about/The_Geometry_of_Four_manifolds.html?id=LbHmMtrebi4C)*
- Freed, Uhlenbeck, *[Instantons and Four-Manifolds](https://doi.org/10.1007/978-1-4613-9703-8)*
- Blaine Lawson, Michelsohn, *[Spin geometry](https://doi.org/10.1515/9781400883912)*
- Nakajima, *[Lectures on Hilbert Schemes of Points on Surfaces](https://doi.org/10.1090/ulect/018)*
## Lecture 1
### Setup
Let
- $X$ : a closed oriented Riemannian four-manifold
- $P \to X$ : a principal $G$-bundle with the structure group $G$. We denote the corresponding Lie algebra by $\mathfrak{g} = \operatorname{Lie} G$.
Let $\operatorname{Ad}: G \to \operatorname{Aut}(\mathfrak{g}) \subset \mathrm{GL}(\mathfrak{g})$. We define an adjoint bundle $\operatorname{ad}(P) = P \times_{\operatorname{Ad}} \mathfrak{g}$. We denote a connection on $P$ by $A$. Then, with an exterior covariant derivative on adjoint-valued forms, $d_A = d + A$, we define a curvature $$F_A = d_A^2 \in \Omega^2(X,\operatorname{ad}(P))$$
which is obstruction to flatness. Concretely, it is understood as $d_A^2 = [F_A,\cdot]$.
Locally, on a trivializing open set $U \subset X$, we may regard the connection as a $\mathfrak{g}$-valued one-form, $A \in \Omega^1(X,\mathfrak{g})$. Therefore, we have
$$F_A = dA + A \wedge A.$$
Two connections differing by gauge transformation should be regarded as equivalent. We define the gauge group,
$$\mathscr{G} = \operatorname{Aut}(P)$$
(In the physics literature, the structure group $G$ is often called the gauge group.) The gauge group consists of bundle automorphisms covering the identity. After choosing a trivialization, write
$$g:X\to G$$
and then,
$$A \longmapsto g^{-1}Ag+g^{-1}dg$$
Globally, gauge transformations are automorphisms of the principal bundle; locally they appear as $G$-valued functions.
### Variational problem
We define the **Yang--Mills (YM) functional** of the connection,
$$\operatorname{YM}(A) = ||F_A||^2 = \int_X \left< F_A,F_A \right> = \int_X \operatorname{tr} (F_A \wedge *F_A)$$
which is often called the YM action denoted by $S_\text{YM}[A]$ in the physics literature. Here we denote by $*$ the Hodge star oprator, and the norm uses the Riemannian metric on $X$ and the invariant inner product on $\mathfrak{g}$ (Killing form).
We look for critical points of this functional. Write $A_t = A + t a$ with $a \in \Omega^1(X,\operatorname{ad}(P))$. We vary the connection in the direction of an ajoint-valued one-form. Using the local formula, we have
$$ F_{A_t} = d(A + t a) + (A + ta) \wedge (A + ta) = F_A + t d_A a + O(t^2)$$
from which we conclude the linearization of the curvature,
$$\left.\frac{d}{dt} F_{A_t}\right|_{t = 0} = d_A a$$
Then, the derivative of the YM functional is given by
$$\frac{d}{dt} \operatorname{YM}(A) = 2 \int_X \left< d_A a, F_A \right> = 2 \int_X \left< a, d_A^* F_A \right>$$
where we used the formal adjoint of $d_A$, given by $d_A^* = -* d_A *$, which is essentially the integral by parts. Therefore, the critical point equation reads
$$ d_A^*F_A = 0$$
This is the **Yang--Mills equation**, a non-linear second order partial differential equation, which is elliptic modulo gauge transformation.
### Self-duality in four dimensions
In four dimensions, we have
$$ * : \Omega^2(X) \to \Omega^2(X)$$
and $*^2 = 1$ with eigenvalues, $\pm 1$. Every two-form decomposes uniquely into self-dual (SD) and anti-self-dual (ASD) parts:
$$\Omega^2(X) = \Omega_+^2(X) \oplus \Omega_-^2(X)$$
$$ \omega = \omega^+ + \omega^-$$
such that $* \omega^\pm = \pm \omega^\pm$.
Curveture therefore splits into two geometrically distinguished pieces,
$$ F_A = F_A^+ + F_A^- , \quad F_A^\pm \in \Omega^\pm(X,\operatorname{ad}(P))$$
Then, since the Hodge star is self-adjoint involution, $F_A^\pm$ are orthogonal with respect to the inner product,
$$||F_A||^2 = ||F_A^+||^2 + ||F_A^-||^2$$
Meanwhile, we have
\begin{align}
\left< F_A, *F_A \right>
& = \operatorname{tr} F_A \wedge F_A \\
& = \operatorname{tr} F_A^+ \wedge F_A^+ + \operatorname{tr} F_A^- \wedge F_A^- \\
& = \operatorname{tr} F_A^+ \wedge * F_A^+ - \operatorname{tr} F_A^- \wedge * F_A^- \\
& = \left< F_A^+, F_A^+ \right> - \left< F_A^-, F_A^- \right>
\end{align}
Combining these two expressions, we have
\begin{align}
\operatorname{YM}(A)
& = - \int_X \operatorname{tr} F_A \wedge F_A + 2 ||F_A^+||^2 \\
& = 8 \pi^2 k + 2 ||F_A^+||^2
\end{align}
where
$$k = \frac{-1}{8 \pi^2} \int_X \operatorname{tr} F_A \wedge F_A = \left< c_2(P), [X] \right> \in \mathbb{Z}$$
is called the *second Chern number* (second Chern class evaluated with the fundamental class), also called the *instaton charge*, or *topological charge*. By Chern--Weil theory, this four-form represents the second Chern class, $c_2 = (-1/8\pi^2) \operatorname{tr} F \wedge F \in H^4(X)$, so although it is written in terms of curvature, its integral depends only on the bundle topology.
Since $||F_A^+||^2 \ge 0$, we have
$$\operatorname{YM}(A) \ge 8 \pi^2 k$$
The equality holds for the **anti-self-dual (ASD) connection**, a connection obeying the **anti-self-dual Yang--Mills (ASDYM) equation**, $F_A^+ = 0$. The ASD connection is also called the **instanton**, while the SD connection is instead called the *anti-instanton*. Thus, the instanton minimizes the YM functional in a fixed topological sector.
We remark that the YM equation is a second order PDE, while the ASDYM equation is a first order PDE, both elliptic modulo gauge transformation.
We also remark that, by the **Bianchi identity**, $d_A F_A = 0$, we immediately see that the instanton solves the YM equation,
$$d_A^* F_A = - * d_A * F_A = * d_A F_A = 0$$
The (A)SDYM equation, $F_A^\pm = 0$, is interpreted as *a half* of the flatness equation, $F_A = 0$, which is a reasonable equation in four dimensions: $A$ has three degrees of freedom modulo gauge transformation, and $F_A^\pm$ also has three, while $F_A$ has six.
## Lecture 2
### Finiteness condition and topology
Set $X = \mathbb{R}^4$ and $G=$ SU(2). We are in particular looking for finite-action connections on $\mathbb R^4$,
$$ \operatorname{YM}(A) = \int_{\mathbb{R}^4} \left< F_A, F_A \right> < \infty $$
This finiteness condition implies that curvature decays sufficiently fast at infinity: We have asymptotically flat connection, $F_A \to 0$, hence the connection is asymptotically pure gauge (Maurer--Cartan form), $A \to g^{-1} dg$. This gives a map,
$$ g : \mathbb{S}^3 \longrightarrow \mathrm{SU}(2) $$
which determines the bundle topology. In this way, we may compactify $\mathbb{R}^4$ by one point, $\mathbb{S}^4 = \mathbb{R}^4 \cup \{\infty\}$, turning infinity into an equatorial $\mathbb{S}^3$. This is consistent with the stereographic projection, which identifies $\mathbb R^4$ with $\mathbb S^4$ minus one point, $\mathbb S^4\setminus\{\infty\} \cong \mathbb R^4$, and finite action guarantees that the missing point can be filled back in topologically.
**Proposition** : The instanton charge is given by
$$k = \frac{1}{24 \pi^2} \int_{\mathbb{S}^3} \operatorname{tr} \left( g^{-1} dg \wedge g^{-1} dg \wedge g^{-1} dg \right)
$$
This integral is called the *Wess--Zumino term* in the physics literature.
*Proof* : We compute
\begin{align}
k = \frac{-1}{8 \pi^2} \int_{\mathbb{R}^4} \operatorname{tr} F_A \wedge F_A
& = \frac{-1}{8 \pi^2} \int_{\mathbb{R}^4} d \omega_3
\end{align}
where we denote the *Chern--Simons 3-form* by $$\omega_3 = \operatorname{tr} \left(A \wedge F_A - \frac{1}{3} A \wedge A \wedge A\right) \xrightarrow{A \to g^{-1} dg} - \frac{1}{3} \operatorname{tr} \left( g^{-1} dg \wedge g^{-1} dg \wedge g^{-1} dg \right)$$
Then, we apply Stokes' theorem to rewrite the integral on the boundary $\mathbb{S}^3$ to obtain the result. $\square$
### Instanton solutions on $\mathbb{S}^4$
We then show that the ASDYM equation has nontrivial solutions for $X = \mathbb{S}^4$. Our idea is a quaternion analogue of the Hopf fibration,
\begin{align}
\mathrm{U}(1) & = \mathbb{S}^1 \longrightarrow \mathbb{S}^3 \longrightarrow \mathbb{S}^2 = \mathbb{CP}^1 \\
\mathrm{SU}(2) = \mathrm{Sp}(1) & = \mathbb{S}^3 \longrightarrow \mathbb{S}^7 \longrightarrow \mathbb{S}^4 = \mathbb{HP}^1
\end{align}
#### Two dimensions $\mathbb{S}^2 = \mathbb{CP}^1$
From the homogeneous coordinate, $[z_0:z_1] \in \mathbb{CP}^1$, we define the affine corrdinate of $\mathbb{CP}^1$, $z = z_1/z_0 \in \mathbb{C}$, which is interpreted as the complex coordinate of the stereographic projection. In this setup, curvature of the principal $\mathrm{U}(1)$-bundle on $\mathbb{CP}^1$ is identified with the Kähler form of the Fubini--Study metric,
$$F = 2\omega = \iota \partial \bar{\partial} \log (1 + |z|^2) = \iota \frac{d \bar{z} \wedge d{z}}{(1+|z|^2)^2}$$
where $\iota = \sqrt{-1}$ and $\partial$, $\bar{\partial}$ are the Dolbeault operators. In our convention, we have
$$\int_{\mathbb{CP}^1} \omega = \operatorname{vol}(\mathbb{CP}^1) = \pi$$
from which we obtain $c_1 = \frac{\iota}{2\pi} F$, so that $\left< c_1, [\mathbb{CP}^1] \right> = 1$.
#### Four dimensions $\mathbb{S}^4 = \mathbb{HP}^1$
In order to study the instanton on $\mathbb{S}^4$, the quaternion notation is natural because $\mathbb R^4 \cong \mathbb H$, and $\mathrm{SU}(2) = \mathrm{Sp}(1)$ is the group of unit quaternions. We write
$$x = x_1 + x_2 i + x_3 j + x_4 k \in \mathbb{H}$$
where $(i,j,k)$ are the standard imaginary quaternion units, $i^2 = j^2 = k^2 = ijk = -1$, and its conjugate denoted by
$$ \bar{x} = x_1 - x_2 i - x_3 j - x_4 k $$
The norm is given by
$$|x|^2 = x \bar{x} = \sum_{a=1}^4 x_a^2$$
We remark the identification of imaginary quaternions, $\operatorname{Im} \mathbb{H} = \mathfrak{sp}(1) = \mathfrak{su}(2)$.
As in the case of $\mathbb{CP}^1$, we identify the curvature of the $\mathrm{SU}(2)$-bundle with the *quaternionic Kähler form*,
$$ F = \frac{d\bar{x} \wedge dx}{(1 + |x|^2)^2}$$
which consists of a triple of two-forms, seen as an $\mathfrak{su}(2)$-valued two-form under the above-mentioned identification. $\mathbb{HP}^1 = \mathbb{S}^4$ is not (hyper-)Kähler, but quaternion-Kähler. Each two-form is not closed independently, but rather under the covariant one, $d_A F = 0$, which is nothing but the Bianchi identity. The curvature is clearly ASD from the quaternionic structure,
\begin{align}
\frac{1}{2} d\bar{x} \wedge dx
& = i (dx_1 \wedge dx_2 - dx_3 \wedge dx_4) \\
& \ + j (dx_1 \wedge dx_3 - dx_2 \wedge dx_4) \\
& \ + k (dx_1 \wedge dx_4 - dx_2 \wedge dx_3)
\end{align}
and hence $*F = - F$. We remark that there is no contribution proportional to the unit "1", corresponding to the $\mathfrak{u}(1)$-factor. Changing the orientation, $F \propto dx \wedge d\bar{x}$, it becomes SD, $*F = F$. In the current convention, we have $$\left< c_2, [\mathbb{HP}^1] \right> = \frac{-1}{8\pi^2} \int_{\mathbb{HP}^1} F \wedge F = 1$$
This curvature agrees with the so-called **BPST (Belavin--Polyakov--Schwartz--Tyupkin) instanton** solution of charge $k = 1$.
#### Twistor description
We have seen that the quaternionic Hopf fibration, $\mathbb{S}^3 \to \mathbb{S}^7 \to \mathbb{S}^4 = \mathbb{HP}^1$, plays a crucial role in the construction of the instanton solution. There is a complex projective analogue of this construction
$$
\mathbb{CP}^1 \to \mathbb{CP}^3 \to \mathbb{S}^4
$$
where $\mathbb{CP}^3 = S^7 / \mathrm{U}(1)$ and $\mathbb{CP}^1 = S^3 / \mathrm{U}(1)$. In this context, $\mathbb{CP}^3$ is called the **twistor space** of $\mathbb{S}^4$. There is a one-to-one correspondence between the ASD connection on $\mathbb{S}^4$ and the holomorphic vector bundle on $\mathbb{CP}^3$, $E \to \mathbb{CP}^3$, under the trivialization condition, $\left. E \right|_{L_x} \cong \mathscr{O}^{\oplus n}$ on the twistor line $L_x \cong \mathbb{CP}^1$ for each $x \in \mathbb{S}^4$ for the rank $n$ case. This setup is further related to the torsion-free sheaf on $\mathbb{CP}^2$ appearing at the boundary of $\mathbb{CP}^3$.
### Moduli problem
The previous expression of curvature allows us to consider a modified solution,
$$ F = \frac{d\bar{x} \wedge dx}{(\lambda + |x - x_0|^2)^2}$$
where $\lambda \in \mathbb{R}_{>0}$ and $x_0 \in \mathbb{H}$, corresponding to rescale and translation of the configuration. Hence, we have a 5-parameter family of the instanton solution. We remark that, under the gauge transformation, the curvature behaves $F \mapsto g^{-1} F g$, preserving $\lambda$ and $x_0$ in the above expression.
We define the moduli space as the space of ASD connections modulo gauge equivalence,
$${\mathscr{M}}_k=\{A \mid F_A^+=0, \left< c_2,[X] \right> = k \}/\mathscr{G}$$
The BPST instanton shows that this moduli space is nonempty for $X = \mathbb{R}^4$ ($\mathbb{S}^4$) with $k=1$. On $\mathbb R^4$, one often fixes the gauge transformation at infinity (the global part of the gauge choice),
$$\mathscr G_0=\{g:X\to G\mid g(\infty)=1\}$$
leading to the framed moduli space,
$${\mathscr M}_k^\text{fr}=\{A\mid F_A^+=0\}/\mathscr G_0$$
Framing remembers the asymptotic trivialization. In general, we have $\operatorname{dim} {\mathscr{M}}_k = 8k-3$, while $\operatorname{dim} {\mathscr{M}}_k^\text{fr} = 8k$ for $G = \mathrm{SU}(2)$. The deformation equation is given by the linearization, $d_A^+ a = 0$, together with the gauge fixing, $d_A^* a = 0$, and hence $\ker(d_A^* \oplus d_A^+)$, which may be also written as $\ker(d_A^+)/\operatorname{Im}(d_A)$. This is the elliptic operator governing infinitesimal moduli. In this way, we may know the moduli space dimension via the index theorem. See also the discussion in the ADHM moduli space below.
## Lecture 3
Let us discuss the **Atiyah--Drinfeld--Hitchin--Manin (ADHM) construction** of instantons on $\mathbb{S}^4$. For this purpose, we recall how to obtain the non-trivial connection for the Hopf fibration, $\mathrm{U}(1) \hookrightarrow \mathbb{S}^3 \to \mathbb{S}^2 = \mathbb{CP}^1$.
Let
$$\mathbb{S}^3 = \{ \xi = (\xi_1,\xi_2) \in \mathbb{C}^2 \mid \left< \xi, \xi \right> = 1 \}$$
and $x \in \operatorname{Im}\mathbb{H} \cong \mathfrak{su}(2)$. In a matrix notation, we may write
$$x = \iota \begin{pmatrix} x_3 & x_1 + \iota x_2 \\ x_1 - \iota x_2 & - x_3 \end{pmatrix}$$
We define $D = x - \lambda 1_2$ with $\lambda \in \mathbb{C}$ and consider the kernel, $\ker D$. In order to have a non-null kernel, we need to put $\lambda = \pm \iota |x|$. Then, we may write
$$\xi = \frac{1}{\sqrt{1 + |z|^2}} \begin{pmatrix} z \\ 1 \end{pmatrix} \in \ker D$$
where $z = (x_1 + \iota x_2)/(x_3 \pm |x|) \in \mathbb{C}$ is the corresponding affine coordinate of $\mathbb{CP}^1$. Then, the connection is given by
$$A = \left< \xi, d \xi \right> = \frac{1}{2} \frac{\bar{z} dz - z d\bar{z}}{1 + |z|^2} = \iota \Im \frac{\bar{z}dz}{1+|z|^2}$$
which leads to the curvature mentioned before.
### ADHM construction
Let us discuss the ADHM construction of $k$-instanton solution on $\mathbb{R}^4 \cong \mathbb{C}^2$ for $G = \mathrm{SU}(n)$. We write the coordinate $(z_1,z_2) \in \mathbb{C}^2$ where $z_1 = x_1 + \iota x_2$ and $z_2 = x_3 + \iota x_4$.
Let $N = \mathbb{C}^n$, $K = \mathbb{C}^k$. We define linear maps, $B_i \in \operatorname{End}(K)$ $(i=1,2)$, $I \in \operatorname{Hom}(N,K)$, $J \in \operatorname{Hom}(K,N)$. We define the so-called (zero-dimensional) Dirac operator,
$$ D = \begin{pmatrix} B_1 - z_1 1_K & B_2 - z_2 1_K & I \\ - B_2^\dagger + \bar{z}_2 1_K & B_1^\dagger - \bar{z}_1 1_K & - J^\dagger \end{pmatrix} : K \otimes \mathbb{C}^2 \oplus N \to K \otimes \mathbb{C}^2$$
We may also write it in a quaternion notation,
$$
\begin{pmatrix} B_1 - z_1 1_K & B_2 - z_2 1_K \\ - B_2^\dagger + \bar{z}_2 1_K & B_1^\dagger - \bar{z}_1 1_K \end{pmatrix} \cong B - x 1_K , \quad B \in \operatorname{End}(K) \otimes \mathbb{H}, \ x \in \mathbb{H}
$$
We impose the condition $D D^\dagger = \Delta \otimes 1_{\mathbb{C}^2}$ (quaternion scalar condition) with $\Delta : K \to K$ (asymptotically $\Delta \to |x| 1_K$ at infinity), leading to the ADHM equations (also called the moment map equations),
\begin{align}
\mu_{\mathbb{R}} & := [B_1,B_1^\dagger] + [B_2,B_2^\dagger] + II^\dagger - J^\dagger J = 0 \\
\mu_{\mathbb{C}} & := [B_1, B_2] + IJ = 0
\end{align}
Moreover, we assume that $\Delta$ is invertible ($D$ : full-rank), which will be related to the stability condition of the moduli space.
From the full-rank condition, we have $\dim \ker D = n$. We write the orthonormal basis of $\ker D$ as $\{\psi_a\}_{a=1,\ldots,n}$, such that $\left< \psi_a, \psi_b \right> = \delta_{a,b}$, and the corresponding basis matrix,
$$ \Psi = (\psi_1 \ldots \psi_n) : N \to K \otimes \mathbb{C}^2 \oplus N $$
We also define a projector from $K \otimes \mathbb{C}^2 \oplus N$ onto $N$,
$$P := \Psi \Psi^\dagger = 1_{K \otimes \mathbb{C}^2 \oplus N} - D^\dagger(\Delta^{-1} \otimes 1_{\mathbb{C}^2}) D $$
**Proposition** : $A = \left<\Psi,d \Psi\right>$ is ASD.
*Proof* : The corresponding curvature is given by
$$F = dA + A \wedge A = d \Psi^\dagger (1 - P) d\Psi = \Psi^\dagger (d D^\dagger) (\Delta^{-1} \otimes 1_{\mathbb{C}^2}) (dD) \Psi $$
Recalling that $dD = - 1_K dx$, $dD^\dagger = - 1_K d\bar{x}$ in the quaternion basis, the curvature is proportional to $d\bar{x} \wedge dx$, which is ASD. $\square$
**Proposition** : The instanton charge is given by $k = \dim K$.
*Proof* : From the curvature formula above, we may have (Osborn's formula),
$$ \operatorname{tr} F \wedge F = - \partial^2 \partial^2 \log \det \Delta^{-1}$$
From the asymptotic behavior of $\Delta$, $\Delta \to |x| 1_K$ at infinity, we obtain $c_2 = \operatorname{tr} 1_K = k$. $\square$
Here is another derivation. We have the following complex (ADHM complex), on the twistor description on $\mathbb{CP}^3$,
$$ 0 \to K \otimes \mathscr{O}(-1) \to (K \oplus K \oplus N) \otimes \mathscr{O} \to K \otimes \mathscr{O}(1) \to 0$$
Denoting the Chern character of $\mathscr{O}(\pm 1)$ by $e^{\pm H}$, we have the Chern character of the instanton bundle denoted by $E$, $\operatorname{ch}(E) = n - k H^2$, from which, together with $c_1(E) = 0$, we identify $c_2(E) = k H^2 \leadsto \int c_2(E) = k$.
### ADHM moduli space
In the ADHM construction, there is an analogue of the gauge transformation,
$$\mathrm{U}(k) : (B_{1,2},I,J) \mapsto (g B_{1,2} g^{-1}, g I, J g^{-1})$$
which preserves the ADHM equation, $\mu = (\mu_{\mathbb{R}},\Re \mu_{\mathbb{C}}, \Im \mu_{\mathbb{C}}) = 0$, since $\mu \mapsto g \mu g^{-1}$. There is another group action,
$$\mathrm{SU}(n) : (B_{1,2},I,J) \mapsto (B_{1,2}, I h^{-1}, h J)$$
which corresponds to rotation of the framing at infinity.
We then define the **ADHM moduli space** as follows,
$$\mathscr{M}_{n,k}^\text{ADHM} = \{(B_{1,2},I,J) \mid \mu = 0 \}/\mathrm{U}(k)$$
This moduli space is known to be hyper-Kähler due to the triple of the moment maps.
**Proposition** : All the ASD connections on $\mathbb{S}^4$ can be constructed by the ADHM construction (surjective), and the corresponding ADHM data, obtained from the inverse ADHM construction, are unique (injective).
From the existence and the uniqueness of the construction, we may identify the framed instanton moduli space on $\mathbb{S}^4$ with the ADHM moduli space,
$$ {\mathscr{M}}_{n,k}^\text{fr} = \mathscr{M}_{n,k}^\text{ADHM}$$
We count degrees of freedom of the moduli space, which will be the corresponding (virtual) dimension,
$$ \dim_\mathbb{R} {\mathscr{M}}_{n,k}^\text{fr} = 2 \cdot 2 k^2 + 2 \cdot 2 nk - 3 k^2 - k^2 = 4 nk $$
We also write $\dim_\mathbb{C} {\mathscr{M}}_{n,k}^\text{fr} = 2 nk$. The $\mathrm{SU}(n)$-quotient gives rise to the *unframed* moduli space of the dimention, $\dim {\mathscr{M}}_{n,k} = 4nk - \dim \mathrm{SU}(n)$. Recall that, for $G = \mathrm{SU}(2)$, $k = 1$, the dimension of the unframed moduli space was given by $4 \cdot 2 \cdot 1 - 3 = 5$, as seen before.
### Singularity and compactification
In the previous example,
$$ F = \frac{d\bar{x} \wedge dx}{(\lambda + |x - x_0|^2)^2}$$
the parameter $\lambda > 0$ gives the size of instanton, and the limit $\lambda \to 0$ causes the so-called *small instanton singularity*. In general, we have such a singularity at the boundary of the moduli space,
$$ \mathscr{M}_{n,k}^\text{fr} \leadsto \mathscr{M}_{n,k-1}^\text{fr} \times \mathbb{R}^4 $$
This leads to the **Uhlenbeck compactification** of the instanton moduli space,
$$\overline{\mathscr{M}}_{n,k} = \sqcup_{m=0}^k \mathscr{M}_{n,k-m}^\text{fr} \times \operatorname{Sym}^m (\mathbb{R}^4)$$
which is analogous to compactification of $\mathbb{R}^4$ with one point.
### Stability condition
The Uhlenbeck compactification does not resolve the small instanton singularities. In order to resolve them, we consider the following deformation of the moduli space introduced by Nakajima [[Nakajima '94](https://doi.org/10.1142/2407)],
$$\mathscr{M}_{n,k}^\zeta = \{ (B_{1,2},I,J) \mid \mu_\mathbb{R} = \zeta 1_K, \mu_\mathbb{C} = 0 \}/U(k), \quad \zeta \neq 0$$
which also appeared as the moduli space of instantons on the non-commutative space in the work of Nekrasov and Schwarz [[Nekrasov--Schwarz '98](https://doi.org/10.1007/s002200050490)].
**Definition** : Stability condition on the linear maps (quiver representation) is: No proper $S \subset K$ invariant under $B_{1,2}$ and containing $\operatorname{Im}(I)$, i.e., $$K = \mathbb{C}[B_1,B_2]I$$
Co-stability condition is similarly given by
$$ K = \mathbb{C}[B_1^\dagger, B_2^\dagger] J^\dagger $$
**Lemma** : The real moment map equation $\mu_\mathbb{R} = \zeta 1_K$ is equivalent to the (co-)stability condition when $\zeta > 0$ ($<0$).
*Proof* : We write $K' = \mathbb{C}[B_1,B_2] I$ and $K'' = \mathbb{C}[B_1^\dagger,B_2^\dagger] J^\dagger$ and define $K'_\perp$ and $K'_\perp$ as complements of $K'$ and $K''$ in $K$. We also define the projectors $P'$, $P''$ on $K_\perp'$ and $K_\perp''$. $P'$ annihilates all elements in a form of $B_1^a B_2^b I$, from which we have $[P',B_{1,2}] = 0$ and $P' I = 0$. Then, we have
$$ \zeta 1_{K_\perp'} = P' \mu_\mathbb{R} P' = P' ([B_1,B_1^\dagger] + [B_2,B_2^\dagger] - J^\dagger J) P' $$
When $\zeta > 0$, taking a trace yields
$$ 0 \le \operatorname{tr} \zeta_{>0} 1_{K_\perp'} = - \operatorname{tr} P' J^\dagger J P' \le 0 $$
Therefore, $K_\perp' = 0$. The same discussion applies to $K''$. $\square$
**Theorem** : The stable moduli space of framed rank-$n$ torsion-free sheaves on $\mathbb{P}^2$ is isomorphic to the $\zeta_{>0}$-deformed framed moduli space of $\mathrm{SU}(n)$-ASD connections on $\mathbb{S}^4$, i.e.,
$$ \widetilde{\mathscr{M}}_{n,k} = \mu_\mathbb{C}^{-1}(0)^\text{st}/\mathrm{GL}(K) \cong \mu_\mathbb{R}^{-1}(\zeta_{>0} 1_K) \cap \mu_\mathbb{C}^{-1}(0)/\mathrm{U}(K) = \mathscr{M}^\zeta_{n,k} $$
The left one is understood as the GIT quotient with the stability condition mentioned above.
*Proof* : This is an application of Kempf-Ness theorem: There is a bijection between the set of closed $\mathrm{GL}(K)$-orbits in $K$ and $\mu^{-1}(0)/\mathrm{U}(K)$. See [Nakajima, Corollary 3.22] for details. $\square$
In particular, $\widetilde{\mathscr{M}}_{n=1,k}$ agrees with the Hilbert scheme of $k$ points on $\mathbb{C}^2$ denoted by $\operatorname{Hilb}^k(\mathbb{C}^2)$.
### Nakajima quiver variety
The ADHM moduli space is now known to be the first example of **Nakajima's quiver variety** associated with $\widehat{A}_0$ quiver, which consists of only a single affine node. Let us briefly discuss a generalization of the ADHM construction by Kronheimer and Nakajima for *Asymptotically Locally Euclidean (ALE) spaces* to see its quiver description [[Kronheimer--Nakajima '90](https://doi.org/10.1007/BF01444534)].
Let $\Gamma$ be a finite subgroup of $\mathrm{SU}(2)$. The ALE space associated with $\Gamma$ is the minimal resolution of the quotient singularity given by $\widetilde{\mathbb{C}^2/\Gamma} \to \mathbb{C}^2/\Gamma$, which has a deep connection with the affine Dynkin diagram of type ADE via the *McKay correspondence*.
Let us focus on the cyclic group $\Gamma = \mathbb{Z}_m = \mathbb{Z} / m \mathbb{Z}$, which corresponds to the Dynkin diagram of type $\widehat{A}_{m-1}$. We define $K = \oplus_{i=0}^{m-1} K_i$, $N = \oplus_{i=0}^{m-1} N_i$ and the linear maps, $B_i \in \operatorname{Hom}(K_i,K_{i+1})$, $\overline{B}_i \in \operatorname{Hom}(K_{i+1},K_i)$, $I_i \in \operatorname{Hom}(N_i,K_i)$, $J_i \in \operatorname{Hom}(K_i,N_i)$. We write $\mathrm{U}(K) = \prod_{i=0}^{m-1} \mathrm{U}(K_i)$, and $n = (n_i)_{i=0}^{m-1}$, $k = (k_i)_{i=0}^{m-1}$ with $n_i = \dim N_i$, $k_i = \dim K_i$. Then, we define the moduli space as follows,
$$
\mathscr{M}_{\widehat{A}_{m-1};n,k} = \mu^{-1}(0) / \mathrm{U}(K)
$$
together with the moment maps given by
\begin{align}
\mu_\mathbb{R} & = (B_{i-1} B_{i-1}^\dagger - B_i^\dagger B_i + \overline{B}_i \overline{B}_i^\dagger - \overline{B}_{i+1}^\dagger \overline{B}_{i+1} + I_i I^\dagger - J_i^\dagger J_i)_{i=0}^{m-1} \\
\mu_\mathbb{C} & = (B_{i-1} \overline{B}_{i-1} - \overline{B}_i B_i + I_i J_i)_{i=0}^{m-1}
\end{align}
The index $i$ is understood as modulo $m$, $i \equiv i + m$ (mod $m$). We may consider the deformation, $\mu_{\mathbb{R},i} = \zeta_i 1_{K_i}$, leading to the corresponding stability condition. The resulting moduli space is identified with the Nakajima quiver variety of type $\widehat{A}_{m-1}$. This construction applies to generic quivers. See, e.g., Kirillov, [*Quiver Representations and Quiver Varieties*](https://doi.org/10.1090//gsm/174) for details.
## Lecture 4
The goal of this lecture is to derive the **Nekrasov's formula** for the equivariant volume of the instanton moduli space, a.k.a., the instanton partition function.
**Definition** : Let $|\mathfrak{q}|<1$. Nekrasov partition function is a generating function of the equivariant volume of the instanton moduli space,
$$ Z_n = \sum_{k=0}^\infty \mathfrak{q}^k Z_{n,k} , \qquad Z_{n,k} = \int_{\widetilde{\mathscr{M}}_{n,k}} 1$$
where the integral is understood equivariantly.
**Remark** : For the rank one case, we have $\widetilde{\mathscr{M}}_{n=1,k} = \operatorname{Hilb}^k(\mathbb{C}^2)$. Hence, the integral is given by
$$ Z_k = Z_{n=1,k} = \int_{\operatorname{Hilb}^k(\mathbb{C}^2)} 1 $$
which is analogous to the Donaldson--Thomas invariant of $\mathbb{C}^3$, given by an integral over the virtual fundamental class of the corresponding Hilbert scheme of points,
$$ \mathrm{DT}_k[\mathbb{C}^3] = \int_{[\operatorname{Hilb}^k(\mathbb{C}^3)]^\text{vir}} 1 $$
Since $\operatorname{vdim} \operatorname{Hilb}^k(\mathbb{C}^3) = 0$, this integral makes sense in the non-equivariant setting as well, while the equivariant setting is mandatory for $\mathbb{C}^2$.
In the following, we simply write $\mathscr{M} = \widetilde{\mathscr{M}}_{n,k}$.
### Equivariant localization formula
The key idea to compute the integral is the so-called *equivariant localization formula*.
**Theorem** [Berline--Vergne, Atiyah--Bott] : Let $X$ be a smooth manifold, which admits an action of a compact Lie group $G$. We denote by $F$ the fixed point loci of the $G$-action on $X$ and the inclusion $i : F \hookrightarrow X$. Let $N_F$ be the normal bundle to $F$ and the equivariant Euler class $e_\mathsf{T}(N_F)$. Then, integral of an equivariant close form $\alpha$ over $X$ localizes on $F$,
$$ \int_X \alpha = \sum_F \int_{F} \frac{i^* \alpha}{e_\mathsf{T}(N_F)} $$
When $F$ is a point, $F = \{p\}$, we have
$$ \int_X \alpha = \sum_p \frac{i^* \alpha}{e_\mathsf{T}(T_p X)} $$
Let us study the equivariant action on the moduli space,
$$ \mathscr{M} = \{ (B_{1,2},I,J) \mid \mu_{\mathbb{C}} = 0, \text{stability condition} \}/\mathrm{GL}(K) $$
We have
\begin{align}
\mathrm{GL}(N) : & (B_{1,2},I,J) \mapsto (B_{1,2},I h^{-1},h J) \\
\mathrm{GL}(K) : & (B_{1,2},I,J) \mapsto (g B_{1,2} g^{-1},gI,Jg^{-1}) \\
\mathrm{GL}(\mathbb{C}^2) : & (B_1,B_2,I,J) \mapsto ( q_1^{-1} B_1, q_2^{-1} B_2,I , q_{12}^{-1} J) , \quad q_{12} = q_1 q_2
\end{align}
We parametrize $g = e^\phi$, $h = e^a$, $q_i = e^{\epsilon_i} \in \mathbb{C}^\times$ ($i=1,2$), where $\phi \in \mathfrak{gl}(K)$, $a \in \mathfrak{gl}(N)$. We may assume $a = \oplus_{\alpha=1}^n a_\alpha$, $I = \oplus_{\alpha=1}^n I_\alpha$, $J = \oplus_{\alpha=1}^n J_\alpha$, such that $I_\alpha a = a_\alpha I_\alpha$, $a J_\alpha = a_\alpha J_\alpha$, without loss of generality. Then, the *fixed point equations* are given as follows,
$$
[\phi,B_i] - \epsilon_i B_i = 0 , \quad
\phi I - I a = 0 , \quad - J \phi + a J - \epsilon_{12} J = 0 , \quad \epsilon_{12} = \epsilon_1 + \epsilon_2
$$
**Lemma** : The left and right eigenvectors of $\phi$ are given by $B_1^i B_2^j I_\alpha$ and $J_\alpha B_1^i B_2^j$ with the eigenvalues,
$$
a_\alpha + i \epsilon_1 + j \epsilon_2, \quad
a_\alpha - (i+1) \epsilon_1 - (j+1) \epsilon_2
$$
*Proof* : It follows from the fixed point equations. $\square$
We then apply the stability condition, $K = \mathbb{C}[B_1,B_2]I = \oplus_{\alpha=1}^n \mathbb{C}[B_1,B_2]I_\alpha =: \oplus_{\alpha=1}^n K_\alpha$, and hence $J_\alpha B_1^i B_2^j = 0$ for any $i$, $j$, leading to $J = 0$. Since $\dim K = k < \infty$, $B_1^i B_2^j I_\alpha = 0$ for sufficiently large $i$, $j$. Indeed, this corresponds to an $n$-tuple monomial ideal parametrized by partitions, $\lambda = \{\lambda_\alpha\}_{\alpha=1,\ldots,n}$ with $|\lambda_\alpha| = \dim K_\alpha$.
### Instanton partition function
We have the complex on the moduli space (ADHM complex), $C^\bullet : C^0 \to C^1 \to C^2$, where
\begin{align}
C^0 & = \operatorname{End}(K) \\
C^1 & = \operatorname{End}(K) \otimes \mathbb{C}^2 \oplus \operatorname{Hom}(N,K) \oplus \operatorname{Hom}(K,N) \otimes \det \mathbb{C}^2 \\
C^2 & = \operatorname{End}(K) \otimes \det \mathbb{C}^2
\end{align}
together with maps,
\begin{align}
d_0(\phi) & = ([\phi,B_{1,2}],\phi I, - J \phi) \\
d_1(b_{1,2},i,j) & = [b_1,B_2] + [B_1,b_2] + i J + I j
\end{align}
We see that $d_1 \circ d_0(\phi) = [\phi,\mu_\mathbb{C}]$. Then, we have
$$
H^0(C^\bullet) = \ker d_0 , \quad H^1(C^\bullet) = \ker d_1 / \operatorname{im} d_0 , \quad H^2(C^\bullet) = \operatorname{coker} d_1
$$
corresponding to the spaces of symmetry, deformation, and obstruction. In this case, $H^0 = 0$, $H^2 = 0$. Hence, as a K-theory class, we have
\begin{align}
[T\mathscr{M}] & = [C^1] - [C^0] - [C^1] \\
& = N^\vee K + q_{12} K^\vee N - (1-q_1)(1-q_2) K^\vee K
\end{align}
under the abuse of notation, $N^\vee K = [N^\vee \otimes K]$, etc.
**Remark** : From this expression, we see the (virtual) dimension of the tangent space, $\dim_\mathbb{C} T\mathscr{M} = 2 nk = \dim_\mathbb{C} \mathscr{M}$.
Let $\lambda^\vee$ be the transposition of $\lambda$. We define the *arm length* and the *leg length*,
$$
a_\alpha(i,j) = \lambda_{\alpha,i} - j , \quad
\ell_\alpha(i,j) = \lambda^\vee_{\alpha,j} - i
$$
Then, we have the following.
**Lemma** : The equivariant Chern character of the tangent bundle at the fixed point $\lambda$ is given as follows,
$$
\operatorname{ch}_\mathsf{T} T_\lambda \mathscr{M} = \sum_{\alpha,\beta=1}^n e^{a_\beta - a_\alpha} \left( \sum_{s \in \lambda_\alpha} q_1^{\ell_\beta(s)} q_2^{-a_\alpha(s)-1} + \sum_{s \in \lambda_\beta} q_1^{-\ell_\alpha(s)-1} q_2^{a_\beta(s)} \right)
$$
*Proof* : Recall
$$
\operatorname{ch}_\mathsf{T} N = \sum_{\alpha=1}^n e^{a_\alpha} \quad
\operatorname{ch}_\mathsf{T} K = \sum_{\alpha=1}^n \sum_{(i,j) \in \lambda_\alpha} e^{a_\alpha} q_1^{i-1} q_2^{j-1}
$$
Then, the remaining is a combinatorial computation. See, e.g., [Nakajima, Proposition 5.8]. $\square$
**Theorem** [Nekrasov's formula] :
$$
Z_{n,k} = \sum_{\lambda \mid |\lambda| = k} \prod_{\alpha,\beta=1}^n \left[\prod_{s \in \lambda_\alpha} (a_\beta - a_\alpha + \epsilon_1 \ell_\beta(s) - \epsilon_2 a_\alpha(s) - \epsilon_2)^{-1} \prod_{s \in \lambda_\beta} (a_\beta - a_\alpha - \ell_\alpha(s) - \epsilon_1 + \epsilon_2 a_\beta(s))^{-1} \right]
$$
*Proof* : We may use the equivariant localization formula. From the previous Lemma, we read off the Chern roots of the tangent bundle at $\lambda$, $\operatorname{ch} T_\lambda \mathscr{M} = \sum_x e^x$, and the Euler class is given by the product of them, $e_\mathsf{T}(T_\lambda \mathscr{M}) = \prod x$. $\square$
We are in particular interested in the asymptotic behavior of the partition function,
$$
\mathscr{F} := \lim_{\epsilon_{1,2} \to 0} \epsilon_1 \epsilon_2 \log Z
$$
which is identified with the so-called **Seiberg--Witten prepotential**. For $n = 1$, corresponding to the Hilbert scheme of points on $\mathbb{C}^2$, it depends only on $\epsilon_{1,2}$. The generating function has a closed formula,
$$
Z_{n=1} = \sum_{\lambda} \mathfrak{q}^{|\lambda|} \prod_{s \in \lambda} (\epsilon_1 \ell(s) - \epsilon_2 a(s) - \epsilon_2)^{-1} (- \ell(s) - \epsilon_1 + \epsilon_2 a(s))^{-1} = \exp \left( \frac{\mathfrak{q}}{\epsilon_1 \epsilon_2} \right)
$$
Hence, in this case, the prepotential is simply given by $\mathscr{F} = \mathfrak{q}$.
### Further studies
#### Tautological insertion
We use the same notation $K$ for the tautological bundle over $\mathscr{M}$. Let $\alpha$ be a characteristic class associated with $K$. In general, we may consider the integral,
$$
\int_{\mathscr{M}} \alpha
$$
- $\alpha = \operatorname{ch}_\mathsf{T}(K)$ : generating function of chiral rings
- $\alpha = e_\mathsf{T}(M \otimes K)$ : hypermultiplet contribution (fundamental represetation), $\dim M \le 2n$
- $\alpha = e_\mathsf{T}(M \otimes T\mathscr{M})$ : hypermultiplet contribution (adjoint represetation), $\dim M = 1$. In the untwist limit $\operatorname{ch}_\mathsf{T} M = 1$, it computes the Euler characteristic of $\mathscr{M}$ (Vafa--Witten theory).
#### Equivariant K-theory
The K-theoretic Nekrasov partition function is given by the equivariant holomorphic Euler characteristic of the structure sheaf of the moduli space,
$$
Z_{n,k}^{(K)} = \chi(\mathscr{M},\mathscr{O}) = \int_{\mathscr{M}} \operatorname{td}(T\mathscr{M})
$$
We remark that $\operatorname{ch}_\mathsf{T}(O)=1$. We can in general consider the insertion,
$$
\chi(\mathscr{M},\mathscr{E}) = \int_{\mathscr{M}} \operatorname{ch}_\mathsf{T} (\mathscr{E}) \operatorname{td}(T\mathscr{M})
$$
Physically, the K-theoretic integral computes a gauge theory partition function on $\mathbb{R}^4 \times \mathbb{S}^1$. There is a further uplift to the equivariant elliptic cohomology corresponding to a six-dimensional theory compactified on an elliptic curve. In this case, the partition function is identified with the equivariant elliptic genus of the moduli space.
#### Higher-dimensions
As seen before, the rank-one instanton moduli space is identified with the Hilbert scheme of points on $\mathbb{C}^2$. This point of view allows us to study higher-dimensional generalizations. The equivariant integral over $\operatorname{Hilb}^k(X)$ computes the (equivariant) **Donaldson--Thomas invariants** of $X$ for $X = \mathbb{C}^3$ and $\mathbb{C}^4$. The higher-rank version is based on the so-called Quot-schemes.