Monday, 26 March 2018

Simple Quantum Chemistry: Hartree-Fock in Python

HF-Copy1

Computational chemistry allow properties of molecules to be determined with incredible accuracy. This includes how much energy is stored in a molecule (and how much is released when you combust it), at which frequencies does it vibrate (which can be used to identify chemicals), how quickly a reaction will occur etc. . One of the most widely used method to calculate these properties as a first approximation is the Hartree-Fock (HF) method. Many very successful widely used density functional methods (B3LYP, PBE0, B97) will add in exact exhange calculated using HF to improve the calculation. One of the classic texts in computational chemistry is Modern Quantum Chemistry by Szabo and Ostlund. The book includes a simple calculation of the molecule HeH+ which has the same electronic configuration as the hydrogen molecule but due to the +2 charge on the helium nucleus it is not symmetrical and must be calculated iteratively by converging on a consistent set of parameters which is why the method is often called self consistent field theory (SCF). I have rewritten this example in the python language based on the original Fortran code and try to explain how the calculations are performed and why the operations are done. I have also added in figures to show what the results look like.

Molecular Schrodinger equation

We begin with the time-independent Schrodinger equation for a molecule is

HΨ=EΨ

The Hamiltonian H performs some mathematical operations on the function Ψ and returns a number the energy and the Ψ function unchanged. This equation is what is called a eigenvalue equation and turns out to be fairly straightforward to solve particularly if you make use of matrix algebra which is the method used in with the Roothmaan-Hall equations.

[ii22AA22A,iZArAi+A>BZAZBRAB+i>j1rij]Ψ(r;R)=EΨ(r;R)

or in a more compact form

[T^e(r)+T^N(R)+V^eN(r;R)+V^NN(R)+V^ee(r)]Ψ(r;R)=EΨ(r;R)

You will notice that there are no units because we are making use of atomic units (a.u.) which simplifies the equations by equating these constants to one me=e=h/2π=1/(4πϵ0)=1.

The sums Σ have electron i,j and nuclei A,B indices. With the electron positions r and nuclei position R being used to calculate distances between nuclei-electrons rAi, nuclei-nuclei rAB and electron-electron rij. Atomic charges are needed ZA/B in our case ZHe=+2 and ZH=+1.

The terms are, in order;

  • Sum of each electron's kinetic energy
    T^e(r)=ii22
    • Using classical mechanics we can determine the kinetic energy from the velocity v and mass m of the particle (multiplied together to give the momentum p=mv)
      Ek=T=12mv2=p22m
      .
    • This classical definition cannot be applied in quantum mechanics as you cannot define a definite position r due to the uncertainity principle which states that the momentum and position cannot be known to any degree of accuracy.
    • We must make use of the quantum mechanical momentum operator p^=i which comes from the wave description of a free particle.
      Te=p22m=(i)22m=22m2
      ( is the symbol used to specify derivatives in multivariable calculus. i.e /x)
    • Think of this as a kinetic energy pressure which due to quantum uncertainity holds the electron away from the nuclei allowing it not to collapse to the positive nuclei.
    • This fluctuation is dependent inversely on the mass so the fluctuation in the light electron is much larger than the heavy nuclei which means it is only neccessary (in most cases) to consider the qunatum mechanical fluctutations of the electrons around fixed nuclei.
  • Sum of each nuclei's kinetic energy
    T^N(R)=AA22
    • Just as the electron does not have a definitive position so also the nuclei do not.
    • The uncertainity of the electron is much greater than the uncertainty of the nuclei due to the much greater mass of the later that we often will ignore the quantum uncertainity in the nuclei.
    • However, for light nuclei such as hydrogen this is important and can lead to interesting quantum tunneling effects.
  • Sum of energy from nuclei-electron attraction
    V^eN(r;R)=A,iZArAi
    • this is an attractive energy between nuclei and electrons and is just coulombs law which is usually written ECoulomb=qAqB4πϵ0RAB with two charge qA and qB a distance apart RAB
    • remember atomic units removes the 4πϵ0 constants, where ϵ0 is the electric permittivity.
  • Sum of energy from nuclei-nuclei repulsion
    V^NN(R)=+A>BZAZBRAB
    • a repulsive term this is just coulombs law usually written ECoulomb=qAqB4πϵ0RAB with two nuclear charges qA and qB a distance RAB apart
  • Sum of electron-electron repulsion
    V^ee(r)=+i>j1rij
    • electron-electron repulsion similar to the electrostatic term between the nuclei and electrons.

Most often, however, we only consider the electronic problem for a given set of nuclear positions R as the electrons are significantly lighter and therefore can be considered to instantanously adjust for any nuclear position. (However, the energy here does not include the nuclear repulsions which need to be added at the end to get the total energy.)

[T^e(r)+V^eN(r;R)+V^ee(r)]Ψ(r)=EelΨ(r)

Hartree-Fock equation

One of the challenges in solving this equation is that you do not know where the electron resides only where it is statisically likely to be if you took a measurement - called the many-body effect. This makes the terms involving the electron repulsions (+i>j1rij) difficult to calculate.

In the Hartree-Fock approach the first approximation used is to treat each electron separately interacting with an averaged distribution of the other electrons (mean field approach).

[ii22A,iZArAi+vHF(i)]Ψ(r)=ϵijΨ(r)

The functions chosen to represent each electron is based on the hydrogen atomic solutions, which are exact, for an electron around an atom of hydrogen the atomic orbitals.

So we consider a product of one-electron wavefunctions ψi(r) interacting with a mean field of all the other electrons.

Ψ(r)=ψ1(r1)ψ2(r2)ψ3(r3)...=iψi(ri)

However this does not have the correct symmetry requirements which means if you swap electrons the sign of the wavefunction inverts. This requirement means that the Fermions (electrons) cannot occupy the same quantum states and do not collapse towards the nucleus. We can construct something called a slater determinant which does have the correct symmetry properties.

Ψ(r1,r2,,rN)=1N!|ψ1(r1)ψ2(r1)ψN(r1)ψ1(r2)ψ2(r2)ψN(r2)ψ1(rN)ψ2(rN)ψN(rN)||ψ1,ψ2,,ψN

|ψ1,ψ2,,ψN is often written simply as ψi(ri)

This means we can write a set of i equations which can be solved for a set of one-electron wavefunctions.

[ii22A,iZArAi+vHF(i)]ψi(ri)=ϵijψi(ri)

What then is vHF(i) we can consider two interactions a coulomb interaction J (repulsion between the "averaged out" one-electron orbitals) and an exchange interactions K which is not classical and is due to electrons of the same spin being indistinguishable which lowers the energy of the system.

vHF(i)=j[2JjKj]

Where the sum is over all of the other j electrons. There is a double contribution from the coulomb operator as we are considering closed shell systems where there are two electrons per orbital but the exhange operator is only considered once as this interaction can only be between electrons with like-spins. This means for our HeH+ system it will not contribute as we have

Jj=ψj(rj)1ri,jψj(rj)dr
Kjψi(ri)=[ψj(rj)1ri,jψi(ri)dr]ψj(rj)

You will notice the indices have switched for the exchange operator.

The first two terms are often considered as the core terms as they are the kinetic energy and potential energy for an isolated system such as the hydrogen atom and therefore they are refered to as HCORE

HCORE=ii22A,iZArAi

Putting this all together we get the Hartree-Fock equation

(HCORE+j[2JjKj])ψi(ri)=jϵijψi(ri)

Or simply

fiψi(ri)=ϵiψi(ri)

where fi are the fock operators

Orbitals

The most successful method to construct these one-electron wavefunctions ψi is to consider them delocalised over the whole molecule. Therefore there will be a set of orbitals, one for each electron, which are spread over the whole molecule. We call this type of treatment molecular orbital theory. The question then is what functions do we use to treat these one-electron delocalised orbitals. We know the solutions exactly for a hydrogen like atom so we can use atomic orbitals ϕi(ri) centred at each atom ri to be the basis for our delocalised molecular orbitals. We need to be able to optimise the amounts of each of these atomic orbitals in each of our one-electron delocalised orbitals so we consider a linear combination of atomic orbitals (LCAO).

ψi(ri)=μcμϕμ(ri)

where the constants ci are going to be optimised to minimise the energy.

The method relies on the vartionial principle which is true for the Hartree-Fock equations which is that any approximation you make (i.e. describing the electrons as a LCAO) will raise the energy above the real energy of the system. The advantage of a variational method is that you can systematically improve the wavefunction and improve your result knowing that you will be able to converge upon a value (which will be above the actual energy).

The atomic orbitals are called the slater type orbitals for the 1s orbital this can be written as

ϕSLA(r)=(ζ3/π)1/2eζr

Let's have a look at the function by plotting it.

In [1]:
# Need to import some libraries to have the maths functions and plotting
import numpy as np
import scipy.special as sp
import matplotlib.pyplot as plt
%matplotlib notebook # Allows plotting in the notebook

x = np.linspace(-5,5,num=1000)
r = abs(x)

zeta = 1.0

psi_STO = (zeta**3/np.pi)**(0.5)*np.exp(-zeta*r)

plt.figure(figsize=(4,3))
plt.plot(x,psi_STO)
Out[1]:
[<matplotlib.lines.Line2D at 0x1065af0b8>]

Slater type orbitals (STO) are the exact solutions for the hydrogen atom and provide an accurate basis set for many electron molecules however the calculations of the integrals are expensive as their is no simple exact solution for the integrals. One way around this is to approximate the Slater type orbitals using a sum of contracted Gaussian functions (CGF). There are simple analytical expressions for the integrals between two Gaussians so this can save a lot of computing time. Let's look at this for the case of the 1s orbital

ϕGF(α)=(2α/π)3/4exp(αr2)
ϕCGF(r)=ndnϕnGF(α)

We will make use of three Gaussians to approximate the slater type orbitals.

ϕSTO3GCGF(r)=n3dnϕnGF(α)
In [19]:
# Coeff is the d_n variable in the equation above
Coeff = np.array([[1.00000,0.0000000,0.000000],
                  [0.678914,0.430129,0.000000],
                  [0.444635,0.535328,0.154329]])

# Expon is the alpha variable in the equation above
Expon = np.array([[0.270950,0.000000,0.000000],
                  [0.151623,0.851819,0.000000],
                  [0.109818,0.405771,2.227660]]) 

psi_CGF_STO1G = Coeff[0,0]*(2*Expon[0,0]/np.pi)**(0.75)*np.exp(-Expon[0,0]*r**2)
psi_CGF_STO2G = Coeff[1,0]*(2*Expon[1,0]/np.pi)**(0.75)*np.exp(-Expon[1,0]*r**2) \
                + Coeff[1,1]*(2*Expon[1,1]/np.pi)**(0.75)*np.exp(-Expon[1,1]*r**2) \
                + Coeff[1,2]*(2*Expon[1,2]/np.pi)**(0.75)*np.exp(-Expon[1,2]*r**2)
psi_CGF_STO3G = Coeff[2,0]*(2*Expon[2,0]/np.pi)**(0.75)*np.exp(-Expon[2,0]*r**2) \
                + Coeff[2,1]*(2*Expon[2,1]/np.pi)**(0.75)*np.exp(-Expon[2,1]*r**2) \
                + Coeff[2,2]*(2*Expon[2,2]/np.pi)**(0.75)*np.exp(-Expon[2,2]*r**2)
    
# Plot the three functions
plt.figure(figsize=(5,3))
plt.title("Approximations to a STO with CGF")
plt.plot(x,psi_STO,label="STO")
plt.plot(x,psi_CGF_STO1G,label="STO-1G")
plt.legend()
plt.figure(figsize=(5,3))
plt.plot(x,psi_STO,label="STO")
plt.plot(x,psi_CGF_STO2G,label="STO-2G")
plt.legend()
plt.figure(figsize=(5,3))
plt.plot(x,psi_STO,label="STO")
plt.plot(x,psi_CGF_STO3G,label="STO-3G")
plt.legend()
Out[19]:
<matplotlib.legend.Legend at 0x106ddda58>

As the number of Gaussians is increased the function more closely describes the slater type orbitals. You will also see that nearest the centre (x=0) the approximation is poorest. This region is called the cusp.

We will use this very poor basis set for our HeH+ molecule for ease of calculation but for accurate calculations at the very least a 6-311G(d,p) basis set will used which would have 6 contracted Gaussian functions to describe hydrogen and helium 1s orbital.

Roothaan-Hall equations

The Hartree-Fock equations need to be converted into a tabular form (matrix form) using the basis set of atomic orbitals to allow for a solution to be determined using the computer. We insert our basis set of orbitals iciϕiCGF(ri) (we will drop the superscript and just have the orbitals denoted as ϕi) We are also going to be building up a table by combining different basis set indices so will use the subscripts ν and μ to denotes the indices as we build up the table/matrix. We will also replace (rν) with (1) for ease of reading.

fiψi(ri)=ϵiψi(ri)
fiν=1Kcν1ϕν(1)=ϵiν=1Kcνiϕν(1)

We left multiply the other basis set index of our table and integrate.

ν=1Kcν1dν1ϕμ(1)fiϕν(1)=ϵiν=1Kcνidν1ϕμ(1)ϕν(1)

This last term dν1ϕμ(1)ϕν(1) is called the overlap integral written as Sμν. This term will help to illustrate the population of a table if we want to know how much the 1s orbital on the He interacts with the 1s orbital on the H atom we would calculate the values on the diagonal of the overlap matrix (for our calculation below this is 0.45077041 in the code the variable is called S12). However, the overlap of the basis function with itself is one by definition.

Sμν=[1.00.450770410.450770411.0]

We have another matrix called the Fock matrix Fμν which is made up of the core contributions HCORE and the effective potential vHF lets see how the matrices are constructed from these.

dν1ϕμ(1)HCOREϕν(1)=dν1ϕμ(1)[ii22A,iZArAi]ϕν(1)=HμνCORE

Where HμνCORE is another matrix that has to be calculated containing a kinetic energy and potential energy contribution. These are called one electron integrals (as they are the interactions of a single electron in a molecular orbital). In the code the core contributions are broken up into kinetic terms (T11, T12 and T22, note we do not need T21 as the matrix is symmetrical) and potential terms (V11A, V12A, V22A for atom one and V11B, V12B, V22B for the second atom, note the sum is also over the nuclei for this term)

The effective Hartree potential is a bit tricker to calculate.

dν1ϕμ(1)vHFϕν(1)=j=1N/2dν1ϕμ(1)[2JjKj]ϕν(1)=Gμνλσ2e

Where N is the total number of electrons and for closed shell molecules we want half of these electrons, due to two electrons residing in each occupied molecular orbital. This integrate is tricky as it involves the impact of exchange which requires that electrons are indistingishable and therefore instead of a matrix we have 3D matrix or a cube of values (often called a tensor in mathematics). We therefore have to introduce two new indices λ and σ to denote these indistinguishable versions of the electrons. Which creates quite a construction.

Gμνλσ2e=j=1N/2λ=1Kσ=1Kcλjcσj[2dν1dν2ϕμ(1)ϕν(1)1r12ϕλ(2)ϕσ(2)dν1ν2ϕμ(1)ϕλ(1)1r12ϕν(2)ϕσ(2)]

Or in short hand

Gμνλσ2e=j=1N/2λ=1Kσ=1Kcλjcσj[2(μν|λσ)(μλ|νσ)]

The code uses the variable names V1111, V2111, V2121, V2211, V2221 and V2222 and then constructs Gμνλσ2e with the variable name G.

Another shorthand that will be helpful is to consider the charge density matrix P

Pμν=2i=1N/2cμicνiandPλσ=2i=1N/2cλicσi

The electron density can be determined from the density matrix as

ρ(r)=μ=1Kν=1Kcμ(r)cν(r)

The Fock matrix Fμν is then calculated as for closed shell molecules

Fμν=HμνCORE+λ=1Kσ=1KPλσ[2(μν|λσ)(μλ|νσ)]=HμνCORE+Gμνλσ2e

In the code G is used when calculating this term

Putting this back into the Hartree-Fock equation we have now transformed into five matrices/tables or the Roothaan-Hall equations

FC=SCE

The coefficient matrix C is a table with K×K entries with coefficients where K are the number of electrons.

C=(c1,1c1,2...c1,Kc2,1c2,2...c2,KcK,1cK,2...cK,K)

E is a diagonal matrix of the energies of the molecular orbitals

E=(ϵ10...00ϵ2...000...ϵK)

We are trying to find out the values of C which minimise the energy however C is on both sides of the equation so we have to iterative solve the equations until the C on both sides are equal and the energy is the lowest possible this is called the self consistent field method or SCF for short.

Computers are best at solving equations in the form of FC=CE, however, that overlap integral is a bit of a problem (FC=SCE) so we have to do some matrix algebra to rearrange the equation into an equivalent equation FC=CE (where F=S1/2FS1/2 and C=S1/2C (In the code S1/2 and S1/2 are denoted as X and X.T )

Integrals

The computation of the various integrals can be calculated as simple functions when Gaussians are used (for a detailed description I recommend the appendix in Modern Quantum Chemistry).

In [2]:
def S_int(A,B,Rab2):
    """
    Calculates the overlap between two gaussian functions 
    
    """
    return (np.pi/(A+B))**1.5*np.exp(-A*B*Rab2/(A+B))
In [3]:
def T_int(A,B,Rab2):
    """
    Calculates the kinetic energy integrals for un-normalised primitives
    
    """
    return A*B/(A+B)*(3.0-2.0*A*B*Rab2/(A+B))*(np.pi/(A+B))**1.5*np.exp(-A*B*Rab2/(A+B))
In [4]:
def V_int(A,B,Rab2,Rcp2,Zc):
    """
    Calculates the un-normalised nuclear attraction integrals
    """
    V = 2.0*np.pi/(A+B)*F0((A+B)*Rcp2)*np.exp(-A*B*Rab2/(A+B))
    return -V*Zc
In [5]:
# Mathematical functions

def F0(t):
    """
    F function for 1s orbital
    """
    if (t<1e-6):
        return 1.0-t/3.0
    else:
        return 0.5*(np.pi/t)**0.5*sp.erf(t**0.5)
    
def erf(t):
    """
    Approximation for the error function
    """
    P = 0.3275911
    A = [0.254829592,-0.284496736,1.421413741,-1.453152027,1.061405429]
    T = 1.0/(1+P*t)
    Tn=T
    Poly = A[0]*Tn
    for i in range(1,5):
        Tn=Tn*T
        Poly=Poly*A[i]*Tn
    return 1.0-Poly*np.exp(-t*t)
In [6]:
def TwoE(A,B,C,D,Rab2,Rcd2,Rpq2):
    """
    Calculate two electron integrals
    A,B,C,D are the exponents alpha, beta, etc.
    Rab2 equals squared distance between centre A and centre B
    """
    return 2.0*(np.pi**2.5)/((A+B)*(C+D)*np.sqrt(A+B+C+D))*F0((A+B)*(C+D)*Rpq2/(A+B+C+D))*np.exp(-A*B*Rab2/(A+B)-C*D*Rcd2/(C+D))
In [7]:
def Intgrl(N,R,Zeta1,Zeta2,Za,Zb):
    """
    Declares the variables and compiles the integrals.
    """
    
    global S12,T11,T12,T22,V11A,V12A,V22A,V11B,V12B,V22B,V1111,V2111,V2121,V2211,V2221,V2222
    
    
    S12 = 0.0
    T11 = 0.0
    T12 = 0.0
    T22 = 0.0
    V11A = 0.0
    V12A = 0.0
    V22A = 0.0
    V11B = 0.0
    V12B = 0.0
    V22B = 0.0
    V1111 = 0.0
    V2111 = 0.0
    V2121 = 0.0
    V2211 = 0.0
    V2221 = 0.0
    V2222 = 0.0
    
    R2 = R*R
    
    # The coefficients for the contracted Gaussian functions are below
    Coeff = np.array([[1.00000,0.0000000,0.000000],
                      [0.678914,0.430129,0.000000],
                      [0.444635,0.535328,0.154329]])
    
    Expon = np.array([[0.270950,0.000000,0.000000],
                      [0.151623,0.851819,0.000000],
                      [0.109818,0.405771,2.227660]])
    D1 = np.zeros([3])
    A1 = np.zeros([3])
    D2 = np.zeros([3])
    A2 = np.zeros([3])
    
    # This loop constructs the contracted Gaussian functions
    for i in range(N):
        A1[i] = Expon[N-1,i]*(Zeta1**2)
        D1[i] = Coeff[N-1,i]*((2.0*A1[i]/np.pi)**0.75)
        A2[i] = Expon[N-1,i]*(Zeta2**2)
        D2[i] = Coeff[N-1,i]*((2.0*A2[i]/np.pi)**0.75)
    
    # Calculate one electron integrals 
    # Centre A is first atom centre B is second atom
    # Origin is on second atom
    # V12A - off diagonal nuclear attraction to centre A etc.
    for i in range(N):
        for j in range(N):
            # Rap2 - squared distance between centre A and centre P
            Rap = A2[j]*R/(A1[i]+A2[j])
            Rap2 = Rap**2
            Rbp2 = (R-Rap)**2
            S12 = S12 + S_int(A1[i],A2[j],R2)*D1[i]*D2[j]
            T11 = T11 + T_int(A1[i],A1[j],0.0)*D1[i]*D1[j]
            T12 = T12 + T_int(A1[i],A2[j],R2)*D1[i]*D2[j]
            T22 = T22 + T_int(A2[i],A2[j],0.0)*D2[i]*D2[j]
            V11A = V11A + V_int(A1[i],A1[j],0.0,0.0,Za)*D1[i]*D1[j]
            V12A = V12A + V_int(A1[i],A2[j],R2,Rap2,Za)*D1[i]*D2[j]
            V22A = V22A + V_int(A2[i],A2[j],0.0,R2,Za)*D2[i]*D2[j]
            V11B = V11B + V_int(A1[i],A1[j],0.0,R2,Zb)*D1[i]*D1[j]
            V12B = V12B + V_int(A1[i],A2[j],R2,Rbp2,Zb)*D1[i]*D2[j]
            V22B = V22B + V_int(A2[i],A2[j],0.0,0.0,Zb)*D2[i]*D2[j]
    
    # Calculate two electron integrals
    
    for i in range(N):
        for j in range(N):
            for k in range(N):
                for l in range(N):
                    Rap = A2[i]*R/(A2[i]+A1[j])
                    Rbp = R - Rap
                    Raq = A2[k]*R/(A2[k]+A1[l])
                    Rbq = R - Raq
                    Rpq = Rap - Raq
                    Rap2 = Rap*Rap
                    Rbp2 = Rbp*Rbp
                    Raq2 = Raq*Raq
                    Rbq2 = Rbq*Rbq
                    Rpq2 = Rpq*Rpq
                    V1111 = V1111 + TwoE(A1[i],A1[j],A1[k],A1[l],0.0,0.0,0.0)*D1[i]*D1[j]*D1[k]*D1[l]
                    V2111 = V2111 + TwoE(A2[i],A1[j],A1[k],A1[l],R2,0.0,Rap2)*D2[i]*D1[j]*D1[k]*D1[l]
                    V2121 = V2121 + TwoE(A2[i],A1[j],A2[k],A1[l],R2,R2,Rpq2)*D2[i]*D1[j]*D2[k]*D1[l]
                    V2211 = V2211 + TwoE(A2[i],A2[j],A1[k],A1[l],0.0,0.0,R2)*D2[i]*D2[j]*D1[k]*D1[l]
                    V2221 = V2221 + TwoE(A2[i],A2[j],A2[k],A1[l],0.0,R2,Rbq2)*D2[i]*D2[j]*D2[k]*D1[l]
                    V2222 = V2222 + TwoE(A2[i],A2[j],A2[k],A2[l],0.0,0.0,0.0)*D2[i]*D2[j]*D2[k]*D2[l]
    return 
In [8]:
def Colect(N,R,Zeta1,Zeta2,Za,Zb):
    """
    Takes the basic integrals and assembles the relevant matrices, 
    that are S,H,X,XT and Two electron integrals
    """
    # Form core hamiltonian
    H[0,0] = T11+V11A+V11B
    H[0,1] = T12+V12A+V12B
    H[1,0] = H[0,1]
    H[1,1] = T22+V22A+V22B

    # Form overlap matrix
    S[0,0] = 1.0
    S[0,1] = S12
    S[1,0] = S12
    S[1,1] = 1.0
    
    # This is S^-1/2
    X[0,0] = 1.0/np.sqrt(2.0*(1.0+S12))
    X[1,0] = X[0,0]
    X[0,1] = 1.0/np.sqrt(2.0*(1.0-S12))
    X[1,1] = -X[0,1]
    
    # This is the coulomb and exchange term (aa|bb) and (ab|ba)
    TT[0,0,0,0] = V1111
    TT[1,0,0,0] = V2111
    TT[0,1,0,0] = V2111
    TT[0,0,1,0] = V2111
    TT[0,0,0,1] = V2111
    TT[1,0,1,0] = V2121
    TT[0,1,1,0] = V2121
    TT[1,0,0,1] = V2121
    TT[0,1,0,1] = V2121
    TT[1,1,0,0] = V2211
    TT[0,0,1,1] = V2211
    TT[1,1,1,0] = V2221
    TT[1,1,0,1] = V2221
    TT[1,0,1,1] = V2221
    TT[0,1,1,1] = V2221
    TT[1,1,1,1] = V2222

Self consistent field calculation

A common method to solve these equations is to follow the steps below

  1. Guess an initial density matrix P (for this example will use the initial guess P=0)
  2. From P calculate the Fock matrix
  3. Calculate F=S1/2FS1/2
  4. Solve the eigenvalue problem using the secular equation |FEI|=0 giving the eigenvalues E and eigenvectors C which can be solved by diagonalising F
  5. Calculate the molecular orbitals coefficients by reverting C=S1/2C
  6. Calculate the new density matrix P from the matrix C
  7. Check to see if the energy has converged if not then head back to step 6 and repeat.

I have labelled the steps in the program below using the tag ######## STEP 1. ########

In [143]:
def SCF(N,R,Zeta1,Zeta2,Za,Zb,G):
    """
    Performs the SCF iterations
    """
    Crit = 1e-11 # Convergence critera
    Maxit = 250 # Maximum number of iterations
    Iter=0
    
    ######## STEP 1. Guess an initial density matrix ########
    # Use core hamiltonian for initial guess of F, I.E. (P=0)
    P = np.zeros([2,2])
    
    Energy = 0.0
    
    while (Iter<Maxit):
        Iter += 1
        print(Iter)
        
        ######## STEP 2. calculate the Fock matrix ########
        # Form two electron part of Fock matrix from P
        G = np.zeros([2,2]) # This is the two electron contribution in the equations above
        for i in range(2):
            for j in range(2):
                for k in range(2):
                    for l in range(2):
                        G[i,j]=G[i,j]+P[k,l]*(TT[i,j,k,l]-0.5*TT[i,j,k,l])

        # Add core hamiltonian H^CORE to get fock matrix
        F = H+G
        
        # Calculate the electronic energy
        Energy = np.sum(0.5*P*(H+F))
        
        print('Electronic energy = ',Energy)
        
        ######## STEP 3. Calculate F' (remember S^-1/2 is X and S^1/2 is X.T) ########
        G = np.matmul(F,X)
        Fprime = np.matmul(X.T,G)
        
        ######## STEP 4. Solve the eigenvalue problem ########
        # Diagonalise transformed Fock matrix
        Diag(Fprime,Cprime,E)
        
        ######## STEP 5. Calculate the molecular orbitals coefficients ########
        # Transform eigen vectors to get matrix C
        C = np.matmul(X,Cprime)
        
        ######## STEP 6. Calculate the new density matrix from the old P ########
        Oldp = np.array(P)
        P= np.zeros([2,2])
        
        # Form new density matrix
        for i in range(2):
            for j in range(2):
                #Save present density matrix before creating a new one
                for k in range(1):
                    P[i,j] += 2.0*C[i,k]*C[j,k]

        ######## STEP 7. Check to see if the energy has converged ########
        Delta = 0.0
        # Calculate delta the difference between the old density matrix Old P and the new P
        Delta = (P-Oldp)
        Delta = np.sqrt(np.sum(Delta**2)/4.0)
        print("Delta",Delta)
        
        #Check for convergence
        if (Delta<Crit):
            # Add nuclear repulsion to get the total energy
            Energytot = Energy+Za*Zb/R
            print("Calculation converged with electronic energy:",Energy)
            print("Calculation converged with total energy:",Energytot)
            print("Density matrix", P)
            print("Mulliken populations",np.matmul(P,S))
            print("Coeffients",C)
            
            break
In [144]:
def FormG():
    """
    Calculate the G matrix from the density matrix and two electron integals
    """
    for i in range(2):
        for j in range(2):
            G[i,j]=0.0
            for k in range(2):
                for l in range(2):
                    G[i,j]=G[i,j]+P[k,l]*(TT[i,j,k,l]-0.5*TT[i,j,k,l])
                    
def Mult(A,B,C_,IM,M):
    """
    Multiples two square matrices A and B to get C
    """
    for i in range(M):
        for j in range(M):
            for k in range(M):
                C_[i,j] = A[i,j]*B[i,j]
                
def Diag(Fprime,Cprime,E):
    """
    Diagonalises F to give eigenvectors in C and eigen values in E, theta is the angle describing the solution
    """
    # 
    import math
    # Angle for heteronuclear diatonic
    Theta = 0.5*math.atan(2.0*Fprime[0,1]/(Fprime[0,0]-Fprime[1,1]))
    #print('Theta', Theta)
    
    Cprime[0,0] = np.cos(Theta)
    Cprime[1,0] = np.sin(Theta)
    Cprime[0,1] = np.sin(Theta)
    Cprime[1,1] = -np.cos(Theta)
    
    E[0,0] = Fprime[0,0]*np.cos(Theta)**2+Fprime[1,1]*np.sin(Theta)**2+Fprime[0,1]*np.sin(2.0*Theta)
    E[1,1] = Fprime[1,1]*np.cos(Theta)**2+Fprime[0,0]*np.sin(Theta)**2-Fprime[0,1]*np.sin(2.0*Theta)
    
    if (E[1,1] <= E[0,0]):
        Temp = E[1,1]
        E[1,1] = E[0,0]
        E[0,0] = Temp
        Temp = Cprime[0,1]
        Cprime[0,1] = Cprime[0,0]
        Cprime[0,0] = Temp
        Temp = Cprime[1,1]
        Cprime[1,1]=Cprime[1,0]
        Cprime[1,0]=Temp
    return
In [145]:
def HFCALC(N,R,Zeta1,Zeta2,Za,Zb,G):
    """
    Calculates the integrals constructs the matrices and then runs the SCF calculation
    """
    # Calculate one and two electron integrals
    Intgrl(N,R,Zeta1,Zeta2,Za,Zb)
    # Put all integals into array
    Colect(N,R,Zeta1,Zeta2,Za,Zb)
    # Perform the SCF calculation
    SCF(N,R,Zeta1,Zeta2,Za,Zb,G)
    return
In [146]:
"""
Let's set up the variables and perform the calculations
"""
global H,S,X,XT,TT,G,C,P,Oldp,F,Fprime,Cprime,E,Zb

H = np.zeros([2,2])
S = np.zeros([2,2])
X = np.zeros([2,2])
XT = np.zeros([2,2])
TT = np.zeros([2,2,2,2])
G = np.zeros([2,2])
C = np.zeros([2,2])

P = np.zeros([2,2])
Oldp = np.zeros([2,2])
F = np.zeros([2,2])
Fprime = np.zeros([2,2])
Cprime = np.zeros([2,2])
E = np.zeros([2,2])

Energy = 0.0
Delta = 0.0

N = 3
R = 1.4632
Zeta1 = 2.0925
Zeta2 = 1.24
Za = 2.0
Zb = 1.0
HFCALC(N,R,Zeta1,Zeta2,Za,Zb,G)
1
Electronic energy =  0.0
Delta 0.882866853014
2
Electronic energy =  -4.14186287613
Delta 0.42637666285
3
Electronic energy =  -4.21250515815
Delta 0.169751444038
4
Electronic energy =  -4.22491713481
Delta 0.0726766265559
5
Electronic energy =  -4.22705796346
Delta 0.0305577508572
6
Electronic energy =  -4.22744516532
Delta 0.0129689170661
7
Electronic energy =  -4.22751420209
Delta 0.00548417493289
8
Electronic energy =  -4.22752659937
Delta 0.00232279034282
9
Electronic energy =  -4.22752881931
Delta 0.000983151834497
10
Electronic energy =  -4.22752921732
Delta 0.000416249714185
11
Electronic energy =  -4.22752928864
Delta 0.000176211995738
12
Electronic energy =  -4.22752930143
Delta 7.46000228255e-05
13
Electronic energy =  -4.22752930372
Delta 3.15815291915e-05
14
Electronic energy =  -4.22752930413
Delta 1.33699962565e-05
15
Electronic energy =  -4.2275293042
Delta 5.66014751815e-06
16
Electronic energy =  -4.22752930421
Delta 2.3962102418e-06
17
Electronic energy =  -4.22752930422
Delta 1.01442931622e-06
18
Electronic energy =  -4.22752930422
Delta 4.29456113385e-07
19
Electronic energy =  -4.22752930422
Delta 1.81809149079e-07
20
Electronic energy =  -4.22752930422
Delta 7.69684434551e-08
21
Electronic energy =  -4.22752930422
Delta 3.2584395203e-08
22
Electronic energy =  -4.22752930422
Delta 1.37945213101e-08
23
Electronic energy =  -4.22752930422
Delta 5.83987605061e-09
24
Electronic energy =  -4.22752930422
Delta 2.47229696259e-09
25
Electronic energy =  -4.22752930422
Delta 1.04664054128e-09
26
Electronic energy =  -4.22752930422
Delta 4.43092627596e-10
27
Electronic energy =  -4.22752930422
Delta 1.87582070145e-10
28
Electronic energy =  -4.22752930422
Delta 7.94123469356e-11
29
Electronic energy =  -4.22752930422
Delta 3.36188760201e-11
30
Electronic energy =  -4.22752930422
Delta 1.42322864473e-11
31
Electronic energy =  -4.22752930422
Delta 6.02525212134e-12
Calculation converged with electronic energy: -4.22752930422
Calculation converged with total energy: -2.8606621637
Density matrix [[ 1.28614168  0.54017322]
 [ 0.54017322  0.22687011]]
Mulliken populations [[ 1.52963579  1.11992783]
 [ 0.64243955  0.47036421]]
Coeffients [[ 0.80191698 -0.78226577]
 [ 0.33680121  1.06844537]]

Results

Let's have a look at what the energy comes out to be (units in Hartrees)

EHFCurrent(STO3G) = -2.8606621637

EHFPySCF(STO3G) = -2.8418364990824458

EHFPySCF(631G(d)) = -2.9098394146425748

EHFPySCF(ccpVTZ) = -2.9322482557926945

EHFPySCF(augccpVTZ) = -2.9322713663802804

EHFPySCF(augccpVQZ) = -2.932878077558255

Comparing these in a bar plot we have.

In [135]:
plt.bar([1,2,3,4,5,6],[-2.8669283534318275,-2.8418364990824458,-2.9098394146425748,-2.9322482557926945,-2.9322713663802804,-2.932878077558255],tick_label=['STO-3G','STO-3G','6-31G(d)','cc-pVTZ','aug-cc-pVTZ','aug-cc-pVQZ'])
Out[135]:
<Container object of 6 artists>

The energy calculated is getting close to the Hartree-Fock limited which is not the real energy of the molecule as it can get even lower if the correlation energy is included. These are the many-body effects which are being ignored in the mean-field approximation to speed up the calculations. The most accurate energy for HeH+ is __. So the mean-field approximation is getting very close to the full energy.

Molecular orbitals

Let's consider the shape of the molecular orbtials and electron density for this molecule along the bond length.

In [247]:
C = np.matmul(X,Cprime)
P = np.array([[ 1.28614168,0.54017322],
 [ 0.54017322 ,0.22687011]])

x = np.linspace(-8,8,num=1000)
r1 = abs(x+R/2)
r2 = abs(x-R/2)

psi_CGF_STO3G_He = Coeff[2,0]*(2*Expon[2,0]/np.pi)**(0.75)*np.exp(-Expon[2,0]*r1**2) \
                + Coeff[2,1]*(2*Expon[2,1]/np.pi)**(0.75)*np.exp(-Expon[2,1]*r1**2) \
                + Coeff[2,2]*(2*Expon[2,2]/np.pi)**(0.75)*np.exp(-Expon[2,2]*r1**2)
        
psi_CGF_STO3G_H = Coeff[2,0]*(2*Expon[2,0]/np.pi)**(0.75)*np.exp(-Expon[2,0]*r2**2) \
                + Coeff[2,1]*(2*Expon[2,1]/np.pi)**(0.75)*np.exp(-Expon[2,1]*r2**2) \
                + Coeff[2,2]*(2*Expon[2,2]/np.pi)**(0.75)*np.exp(-Expon[2,2]*r2**2)

density = np.zeros(x.shape)
        
density = density + P[0,0]*psi_CGF_STO3G_He*psi_CGF_STO3G_He
density = density + P[1,1]*psi_CGF_STO3G_H*psi_CGF_STO3G_H
density = density + 2*P[0,1]*psi_CGF_STO3G_He*psi_CGF_STO3G_H
In [193]:
plt.figure(figsize=(7,4))
plt.title("Molecular orbitals")

plt.plot(x,C[0,0]*psi_CGF_STO3G_He+C[1,0]*psi_CGF_STO3G_H,label="Bonding")
plt.plot(x,C[0,1]*psi_CGF_STO3G_He+C[1,1]*psi_CGF_STO3G_H,label="Anti-bonding")
plt.legend(loc=4)
Out[193]:
<matplotlib.legend.Legend at 0x111c98400>
In [248]:
plt.figure(figsize=(7,4))
plt.title("Electron density $\Psi^{2}$")
plt.plot(x,density)
Out[248]:
[<matplotlib.lines.Line2D at 0x114e35400>]

Let's compare these results with a larger basis set. This is the input script for the program Gaussian which can use much larger basis sets.

#p RHF/aug-cc-pVQZ SP

HeH+_energy

1 1 He -1.41702 0.00000 0.00000 H -0.82048 0.00000 0.00000

Output energy is -2.932878077558184 which is close to pySCF solution.

Let's plot the wavefunctions and the electron density.

In [250]:
LUMO = np.genfromtxt('LUMOHeH+.txt')
HOMO = np.genfromtxt('HOMOHeH+.txt')
density = np.genfromtxt('densityHeH+.txt')

plt.figure(figsize=(7,4))
plt.title("Molecular orbitals RHF/aug-cc-pVQZ")
plt.plot(-LUMO[750:2250,2]/0.529177*2,LUMO[750:2250,4],label='LUMO')
plt.plot(-HOMO[750:2250,2]/0.529177*2,HOMO[750:2250,4],label='HOMO')
plt.figure(figsize=(7,4))
plt.title("Electron density $e\AA^3$ RHF/aug-cc-pVQZ")
plt.plot(-density[750:2250,2]/0.529177*2,density[750:2250,4],label='density')
Out[250]:
[<matplotlib.lines.Line2D at 0x1162b8198>]

The larger basis set provides more electron density on the Helium atom. You can also see how well the cusp is modelled with many basis sets added together.

If we grab the fchk file we can plot the molecular orbitals in three-dimensions using Avogadro software. Here is the occupied molecular orbital HOMO the surface is the isosurface at the 0.02 density value.

Occupied orbital HOMO

And the unoccupied orbital LUMO

We can also plot the electron density at the 0.002 eÅ3 value which is roughly where the exchange energy is very strong and you have repuslion between the molecules.

Conclusions

You may be considering that these calculations overly complicated for a simple diatomic such as HHe+ and attempting anything larger would be a nightmare. Fortunately many free publically accessible codes are available for these calculations. If you want to get your hands on something right now there is the website called http://molcalc.org that allows you to do calculations on simple molecules right from your web browser. If you want to do some proper calculations I would recommend getting a copy of the software packages NWChem or GAMESS. To construct your own geometries you will need a program like Avogadro which can be used to build molecules and the input text files you need as input to these quantum chemistry codes.

In [108]:
# Stylings from http://lorenabarba.com
from IPython.core.display import HTML
def css_styling():
    styles = open("custom.css", "r").read()
    return HTML(styles)
css_styling()
Out[108]:

16 comments:

  1. Great example, thanks for sharing. To complete it, is there a download link available of the LUMO, HOMO and density input files?

    ReplyDelete
    Replies
    1. Here you go https://www.dropbox.com/s/wtevsqcjr18gmwo/HOMOHeH%2B.zip?dl=0

      Delete
  2. Hello, this post helps me a lot, because you let, in a very clear way the corresponding calculations, thanks a lot.
    I have a question, How do you obtain the coefficients for the Gaussian functions?

    best regards

    ReplyDelete
    Replies
    1. These are given by the matrix C[]. You can see how I made use of these coefficients to draw the molecular orbitals plt.plot(x,C[0,0]*psi_CGF_STO3G_He+C[1,0]*psi_CGF_STO3G_H,label="Bonding")

      Delete
  3. Hi,
    Thanks for your blog. It is really helpful. Can you explain, why the Hartree-Fock explained in Modern Quantum Chemistry by Szabo and Ostlund for HeH+ does not produce the same result as PhSCF(STO-3G). For Zzabo and Ostlund algorithm, the total energy is -2.8606621637 but for PhSCF (STO-3G), it is -2.8418364990824458. What is the difference between these two and what should be changed in Zzabo and Ostlund algorithm so it can produce PhSCf result.

    ReplyDelete
    Replies
    1. I think it has to do with the algorithm used to perform the integration. PySCF uses an advanced library for calculating the integrals see https://onlinelibrary.wiley.com/doi/full/10.1002/jcc.23981 (or preprint https://arxiv.org/pdf/1412.0649.pdf). This example, however, makes use of approximations from the 1970s to solve the integrals (see https://aip.scitation.org/doi/10.1063/1.432807), which would be expected to be less accurate. If you are able to find out anything more about this please let me know!

      Delete
    2. Thanks, Let me see

      Delete
  4. Hi,
    Thank you for this blog.

    I understand that T11 is the kinetic energy operator for the first atom and T22 is the kinetic energy operator for the second atom. But I can't understand why are we calculating T12.

    T11 = T11 + T_int(A1[i],A1[j],0.0)*D1[i]*D1[j]
    T12 = T12 + T_int(A1[i],A2[j],R2)*D1[i]*D2[j]
    T22 = T22 + T_int(A2[i],A2[j],0.0)*D2[i]*D2[j]

    ReplyDelete
    Replies
    1. Thank you for your interest. The diagonal entries of the one-electric kinetic energy operator T11 and T22 are the QM kinetic energy of each electron. The off-diagonal terms T12 describe the quantum mechanical effects of bonding. These terms go to zero at infinity but are non-zero with overlap due to constructive interference. I would recommend the discussion in Szabo and Ostlund textbook section 3.5.2 and the paper https://aip.scitation.org/doi/pdf/10.1063/1.4875735

      Delete
  5. Hello,
    Thanks a lot for the great efforts. I have a question and I wish you can help me with. According to you and to Szabo, the calculations of HF on HEH+ gives the same result of the ground state energy of He. Am I right, or there is sth I missed?

    ReplyDelete
    Replies
    1. They are isoelectronic meaning they have the same number of electrons in the same orbitals (s-orbital in this case) however the potential will be different for He and HeH+ and therefore they will have different energies. Using Gaussian 16 at the HF/cc-pVTZ level of theory I got HeH+=-2.932253 and He=-2.8611533

      Delete
    2. Very helpful. Thank you so much.

      Delete
  6. Hi, this is great! Can you provide the original Jupyter file, so that we can play around with it? Cheers!

    ReplyDelete
    Replies
    1. Thanks!

      Unfortunately, I am in lockdown in another country without my laptop so can't give you the version on the blog. But I did find an early version of the code in an email that I pushed to github (https://github.com/nzjakemartin/HF/blob/master/HF.ipynb).

      I will update this when I get back to my laptop.

      Thanks again

      Delete
    2. Thank you! Pls, whenever you are with your laptop, please post the current version here! Saty safe and cheers!

      Delete

Originaltext