
Byers–Yang theorem



In quantum mechanics, the Byers–Yang theorem states that all physical properties of a doubly connected system (a ring) enclosing a magnetic flux Φ through the opening are periodic in the flux with period Φ_0 = h/e (the so-called flux quantum). The theorem was first stated and proven by Nina Byers and Chen-Ning Yang (1961), and later by Felix Bloch.

The Hellmann–Feynman theorem relates the derivative of the total energy with respect to a parameter to the expectation value of the derivative of the Hamiltonian with respect to that same parameter. According to the theorem, once the spatial distribution of the electrons has been determined by solving the Schrödinger equation, all the forces in the system can be calculated using classical electrostatics.

Proof

The flux Φ can be eliminated by a gauge transformation.

This proof of the Hellmann–Feynman theorem requires that the wavefunction be an eigenfunction of the Hamiltonian under consideration; however, one can also prove more generally that the theorem holds for non-eigenfunction wavefunctions that are stationary (partial derivative is zero) for all relevant variables (such as orbital rotations). The Hartree–Fock wavefunction is an important example of an approximate eigenfunction that still satisfies the Hellmann–Feynman theorem. A notable example where the Hellmann–Feynman theorem is not applicable is finite-order Møller–Plesset perturbation theory, which is not variational (Frank Jensen, Introduction to Computational Chemistry, John Wiley & Sons, 2007, p. 322). The proof also employs an identity of normalized wavefunctions – that derivatives of the overlap of a wavefunction with itself must be zero. Using Dirac's bra–ket notation these two conditions are written as

\hat{H}_\lambda|\psi_\lambda\rangle = E_\lambda|\psi_\lambda\rangle,

\langle\psi_\lambda|\psi_\lambda\rangle = 1 \quad\Rightarrow\quad \frac{\partial}{\partial\lambda}\langle\psi_\lambda|\psi_\lambda\rangle = 0.

The proof then follows through an application of the derivative product rule to the expectation value of the Hamiltonian viewed as a function of λ:
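Assuming ψ_λ is normalized, so that ⟨ψ_λ|ψ_λ⟩ = 1, a standard sketch of this step is

\frac{dE_\lambda}{d\lambda}
= \frac{d}{d\lambda}\langle\psi_\lambda|\hat{H}_\lambda|\psi_\lambda\rangle
= \Big\langle\frac{d\psi_\lambda}{d\lambda}\Big|\hat{H}_\lambda\Big|\psi_\lambda\Big\rangle
+ \Big\langle\psi_\lambda\Big|\hat{H}_\lambda\Big|\frac{d\psi_\lambda}{d\lambda}\Big\rangle
+ \Big\langle\psi_\lambda\Big|\frac{\partial\hat{H}_\lambda}{\partial\lambda}\Big|\psi_\lambda\Big\rangle
= E_\lambda\frac{d}{d\lambda}\langle\psi_\lambda|\psi_\lambda\rangle
+ \Big\langle\psi_\lambda\Big|\frac{\partial\hat{H}_\lambda}{\partial\lambda}\Big|\psi_\lambda\Big\rangle
= \Big\langle\psi_\lambda\Big|\frac{\partial\hat{H}_\lambda}{\partial\lambda}\Big|\psi_\lambda\Big\rangle,

and the final equality is the Hellmann–Feynman theorem, referred to below as Eq. (1).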

For a deep critical view of the proof, see [1].

Alternate proof

The Hellmann–Feynman theorem is actually a direct, and to some extent trivial, consequence of the variational principle (the Rayleigh–Ritz variational principle) from which the Schrödinger equation can be derived. This is why the Hellmann–Feynman theorem holds for wavefunctions (such as the Hartree–Fock wavefunction) that, though not eigenfunctions of the Hamiltonian, do derive from a variational principle. This is also why it holds, e.g., in density functional theory, which is not wavefunction-based and for which the standard derivation does not apply.

According to the Rayleigh–Ritz variational principle, the eigenfunctions of the Schrödinger equation are stationary points of the functional (which we nickname the Schrödinger functional for brevity):
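Allowing unnormalized trial functions ψ, a standard form of this functional is

E[\psi,\lambda] = \frac{\langle\psi|\hat{H}_\lambda|\psi\rangle}{\langle\psi|\psi\rangle}. \qquad (2)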

The eigenvalues are the values that the Schrödinger functional takes at the stationary points:
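At a stationary point ψ_λ this presumably takes the form

E_\lambda = E[\psi_\lambda,\lambda]. \qquad (3)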

where ψ_λ satisfies the variational condition:
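A standard way of writing this condition, as a functional derivative with respect to the wavefunction, is

\frac{\delta E[\psi,\lambda]}{\delta\psi(x)}\bigg|_{\psi=\psi_\lambda} = 0. \qquad (4)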

Let us differentiate Eq. (3) using the chain rule:
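Assuming ψ_λ depends smoothly on λ, the chain rule gives the sketch

\frac{dE_\lambda}{d\lambda} = \frac{\partial E[\psi_\lambda,\lambda]}{\partial\lambda} + \int\frac{\delta E[\psi_\lambda,\lambda]}{\delta\psi(x)}\,\frac{d\psi_\lambda(x)}{d\lambda}\,dx. \qquad (5)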

Due to the variational condition, Eq. (4), the second term in Eq. (5) vanishes. In one sentence, the Hellmann–Feynman theorem states that the derivative of the stationary values of a function(al) with respect to a parameter on which it may depend can be computed from the explicit dependence only, disregarding the implicit one. Because the Schrödinger functional can only depend explicitly on an external parameter through the Hamiltonian, Eq. (1) trivially follows.
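Explicitly, the surviving first term of Eq. (5) is presumably

\frac{\partial E[\psi_\lambda,\lambda]}{\partial\lambda} = \frac{\big\langle\psi_\lambda\big|\,\partial\hat{H}_\lambda/\partial\lambda\,\big|\psi_\lambda\big\rangle}{\langle\psi_\lambda|\psi_\lambda\rangle},

which reduces to Eq. (1) for normalized ψ_λ.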

Example applications

Molecular forces

The most common application of the Hellmann–Feynman theorem is the calculation of intramolecular forces in molecules. This allows for the calculation of equilibrium geometries – the nuclear coordinates where the forces acting upon the nuclei, due to the electrons and other nuclei, vanish. The parameter λ corresponds to the coordinates of the nuclei. For a molecule with N electrons (1 ≤ i ≤ N) with coordinates {r_i}, and M nuclei (1 ≤ α ≤ M), each located at a specified point R_α = (X_α, Y_α, Z_α) and with nuclear charge Z_α, the clamped-nucleus Hamiltonian is
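In Gaussian units, a standard form of this clamped-nucleus Hamiltonian (electronic kinetic energy plus electron–electron, electron–nucleus, and nucleus–nucleus Coulomb interactions) is

\hat{H} = -\sum_{i=1}^{N}\frac{\hbar^{2}}{2m_{e}}\nabla_{i}^{2}
+ \sum_{i=1}^{N}\sum_{j>i}^{N}\frac{e^{2}}{|\mathbf{r}_{i}-\mathbf{r}_{j}|}
- \sum_{i=1}^{N}\sum_{\alpha=1}^{M}\frac{Z_{\alpha}e^{2}}{|\mathbf{r}_{i}-\mathbf{R}_{\alpha}|}
+ \sum_{\alpha=1}^{M}\sum_{\beta>\alpha}^{M}\frac{Z_{\alpha}Z_{\beta}e^{2}}{|\mathbf{R}_{\alpha}-\mathbf{R}_{\beta}|}.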

The force acting on the x-component of a given nucleus is equal to the negative of the derivative of the total energy with respect to that coordinate. Employing the Hellmann–Feynman theorem, this is equal to
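In symbols, with E the total energy and ψ the electronic wavefunction, this sketch reads

F_{X_{\alpha}} = -\frac{\partial E}{\partial X_{\alpha}}
= -\Big\langle\psi\Big|\frac{\partial\hat{H}}{\partial X_{\alpha}}\Big|\psi\Big\rangle.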

Only two components of the Hamiltonian contribute to the required derivative – the electron-nucleus and nucleus-nucleus terms. Differentiating the Hamiltonian yields[2]
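Carrying out the differentiation for nucleus α in the Gaussian-unit Hamiltonian assumed above gives

\frac{\partial\hat{H}}{\partial X_{\alpha}}
= Z_{\alpha}e^{2}\left(\sum_{i=1}^{N}\frac{X_{\alpha}-x_{i}}{|\mathbf{r}_{i}-\mathbf{R}_{\alpha}|^{3}}
- \sum_{\beta\neq\alpha}Z_{\beta}\,\frac{X_{\alpha}-X_{\beta}}{|\mathbf{R}_{\alpha}-\mathbf{R}_{\beta}|^{3}}\right).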

Insertion of this into the Hellmann–Feynman theorem returns the force on the x-component of the given nucleus in terms of the electronic density ρ(r), the atomic coordinates, and the nuclear charges:
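Written out with the electronic density ρ(r), a standard form of this result is

F_{X_{\alpha}} = Z_{\alpha}e^{2}\left(\sum_{\beta\neq\alpha}Z_{\beta}\,\frac{X_{\alpha}-X_{\beta}}{|\mathbf{R}_{\alpha}-\mathbf{R}_{\beta}|^{3}}
- \int\rho(\mathbf{r})\,\frac{X_{\alpha}-x}{|\mathbf{r}-\mathbf{R}_{\alpha}|^{3}}\,d^{3}r\right).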

Expectation values

An alternative approach for applying the Hellmann–Feynman theorem is to promote a fixed or discrete parameter which appears in a Hamiltonian to be a continuous variable solely for the mathematical purpose of taking a derivative. Possible parameters are physical constants or discrete quantum numbers. As an example, the radial Schrödinger equation for a hydrogen-like atom is
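In Gaussian units, with reduced mass μ and nuclear charge Z, a standard form of this radial Hamiltonian is

\hat{H}_{l} = -\frac{\hbar^{2}}{2\mu r^{2}}\left(\frac{d}{dr}\left(r^{2}\frac{d}{dr}\right) - l(l+1)\right) - \frac{Ze^{2}}{r}.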

which depends upon the discrete azimuthal quantum number l. Promoting l to be a continuous parameter allows for the derivative of the Hamiltonian to be taken:
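Only the centrifugal term depends on l, so the derivative is presumably

\frac{\partial\hat{H}_{l}}{\partial l} = \frac{\hbar^{2}(2l+1)}{2\mu r^{2}}.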

The Hellmann–Feynman theorem then allows for the determination of the expectation value of 1/r^2 for hydrogen-like atoms:[3]
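Using the hydrogen-like energies E_n = -\mu Z^2 e^4/(2\hbar^2 n^2) with n = n_r + l + 1 (so that ∂E_n/∂l = ∂E_n/∂n), a standard sketch of the result is

\frac{\partial E_{n}}{\partial l} = \frac{\mu Z^{2}e^{4}}{\hbar^{2}n^{3}}
= \Big\langle\frac{\partial\hat{H}_{l}}{\partial l}\Big\rangle
= \frac{\hbar^{2}(2l+1)}{2\mu}\Big\langle\frac{1}{r^{2}}\Big\rangle
\quad\Longrightarrow\quad
\Big\langle\frac{1}{r^{2}}\Big\rangle = \frac{\mu^{2}Z^{2}e^{4}}{\hbar^{4}n^{3}\,(l+\tfrac{1}{2})}.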

Van der Waals forces

At the end of Feynman's paper, he states that "Van der Waals's forces can also be interpreted as arising from charge distributions with higher concentration between the nuclei. The Schrödinger perturbation theory for two interacting atoms at a separation R, large compared to the radii of the atoms, leads to the result that the charge distribution of each is distorted from central symmetry, a dipole moment of order 1/R^7 being induced in each atom. The negative charge distribution of each atom has its center of gravity moved slightly toward the other. It is not the interaction of these dipoles which leads to van der Waals's force, but rather the attraction of each nucleus for the distorted charge distribution of its own electrons that gives the attractive 1/R^7 force."

Hellmann–Feynman theorem for time-dependent wavefunctions

For a general time-dependent wavefunction satisfying the time-dependent Schrödinger equation, the Hellmann–Feynman theorem is not valid. However, a related identity does hold for a wavefunction Ψ_λ(t) that obeys iℏ ∂Ψ_λ(t)/∂t = Ĥ_λ Ψ_λ(t).
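In bra–ket notation, a standard statement of this identity is

\Big\langle\Psi_\lambda(t)\Big|\frac{\partial\hat{H}_\lambda}{\partial\lambda}\Big|\Psi_\lambda(t)\Big\rangle
= i\hbar\,\frac{\partial}{\partial t}\Big\langle\Psi_\lambda(t)\Big|\frac{\partial\Psi_\lambda(t)}{\partial\lambda}\Big\rangle.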

Proof

The proof only relies on the Schrödinger equation and the assumption that partial derivatives with respect to λ and t can be interchanged.
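A sketch of that computation, assuming Ĥ_λ is Hermitian and that ∂/∂t and ∂/∂λ commute:

i\hbar\,\frac{\partial}{\partial t}\Big\langle\Psi_\lambda\Big|\frac{\partial\Psi_\lambda}{\partial\lambda}\Big\rangle
= i\hbar\Big\langle\frac{\partial\Psi_\lambda}{\partial t}\Big|\frac{\partial\Psi_\lambda}{\partial\lambda}\Big\rangle
+ i\hbar\Big\langle\Psi_\lambda\Big|\frac{\partial}{\partial\lambda}\frac{\partial\Psi_\lambda}{\partial t}\Big\rangle
= -\big\langle\hat{H}_\lambda\Psi_\lambda\big|\partial_\lambda\Psi_\lambda\big\rangle
+ \big\langle\Psi_\lambda\big|\partial_\lambda(\hat{H}_\lambda\Psi_\lambda)\big\rangle
= \Big\langle\Psi_\lambda\Big|\frac{\partial\hat{H}_\lambda}{\partial\lambda}\Big|\Psi_\lambda\Big\rangle,

since the terms containing Ĥ_λ ∂_λΨ_λ cancel by Hermiticity.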

Notes

  1. ^ Carfì, David (2010). "The pointwise Hellmann–Feynman theorem". AAPP Physical, Mathematical, and Natural Sciences. 88 (1): C1A1001004. doi:10.1478/C1A1001004. ISSN 1825-1242.
  2. ^ Piela, Lucjan (2006). Ideas of Quantum Chemistry. Amsterdam: Elsevier Science. p. 620. ISBN 0-444-52227-1.
  3. ^ Fitts, Donald D. (2002). Principles of Quantum Mechanics : as Applied to Chemistry and Chemical Physics. Cambridge: Cambridge University Press. p. 186. ISBN 0-521-65124-7.