Purpose of the Calculation
One of the quantities of interest when doing DFT calculations is the electronic density of states (DOS). The DOS is defined as the number of electron states with an energy in the interval E + dE, and is one method for determining the electronic state of the material without examining the full band structure. Our test case for studying the convergence of the DOS will be silver, a metal. The indicative feature of a metal in the DOS is that there are a continuum of available states from below the Fermi energy to above the Fermi energy, which is defined as the energy of the highest energy occupied electron state at zero temperature. This post addresses the topic of properly converging the DOS calculation with respect to k-point mesh, and how that convergence varies from the convergence of the system energy with respect to k-point mesh.
Theory of Our Calculation
As with any result that we might report, if we choose to compute the DOS, we must ensure that the result is appropriately converged with respect to the input parameters. To calculate the DOS, we integrate the converged electron density of the system in k-space [1]. Naively, we might then expect that the convergence of the electronic DOS would be similar to the system energy. However, we need to consider the fact that the DOS is a continuous function of energy that needs to be well-resolved for the entire range which we are covering.
The only points at which we calculate the electron density are the points on our k-point mesh. Computationally, we evaluate the integral by doing a weighted sum over the points that we have, using Simpson’s Rule or a similar algorithm. However, when doing something like a Simpson’s Rule algorithm, the function is interpolated in between the points with known values. With a function that rapidly varies in narrow regions, Simpson’s Rule will not give at accurate depiction of the true integral, unless the region is well-sampled. As a result, we should expect that the DOS will not be well-converged without many more k-points than is necessary for the convergence of the energy of the system. We should also expect that this convergence should take the form of the density of states becoming more smoothly varying in the regions where it is non-zero, as the under-sampled regions become appropriately represented.
Calculation Methodology
We use CASTEP [2], a plane-wave basis set package for computing the energies and DOS for our system. Our system is the system depicted in Figure 1, a unit cell of silver atoms on a fcc lattice, with lattice parameter a = 4.086 angstroms. The GGA PBE functional [3] was used for all calculations. The SCF tolerance is 5 x 10^-7 eV, the Koelling-Harmon treatment is used for the atomic solutions [4], and ultrasoft pseudopotentials are generated on the fly. The atoms used in the pseudopotential are 4s2 4p6 4d10 5s1. The integration for the DOS is performed using the 0.2 smearing.
Results
First, a convergence study was performed for the cutoff energy. The results are shown in Figure 2.
Above a cutoff energy of 600 eV, the system energy is converged to the order of tenths of an eV, specifically to 0.15 eV. We select 600 eV as the cutoff energy for the rest of our calculations due to being the most computational efficient value converged to this degree. We then calculate both the system energy and the density of states for different k-point meshes. Because our unit cell is cubic, all k-point meshes are of the form NxNxN, where N is an integer, and we will characterize each mesh simply by the number N. Calculations were performed for N=6 to N=20, covering all possible values.
The relevant feature of the convergence results for the system energy with respect to the k-point mesh is that the variance from the 7x7x7 mesh results is never greather than 0.05 eV for higher N, which is already more converged than our results for the cutoff energy. So we can say that our system energy is sufficiently converged at N = 7.
Before comparing the convergence of the DOS, we need to determine a metric for the convergence. It is clear that the behavior of the DOS should qualitatively change in a certain way with an increasing number of k-points, but it is harder to quantitatively measure this change than the change in system energy. We can no longer compare the change in a single value, as the calculated DOS is a curve for each k-point mesh, as show in Figures 4 and 5.
To compare these curves, we take the difference of the DOS value at every point on the curve. We then square these differences, and take the average for each curve. This quantity, defined as (1/N)*Sum(DOS(k, i) – DOS(k-1,i)) is the mean squared deviation between the two DOS curves. Here, N is the number of points on the curve, k or k-1 is the number of points in the k-point mesh used in that DOS calculation, and i is the eV value of a specific point on the curve. The mean squared deviation is a measure of how different two curves are, and minimizing this quantity will imply that the DOS curves are more similar.
![](https://sites.psu.edu/dftap/files/2018/04/Ag-DOS-N-6-2dxmfwd-1024x334.png)
Figure 4: The DOS for silver, calculated with a 6x6x6 k-point mesh at 600 eV. The 0 of the x-axis is the Fermi energy.
![](https://sites.psu.edu/dftap/files/2018/04/Ag-DOS-N-20-1def1ck-1024x333.png)
Figure 5: The DOS for silver, calculated with a 20x20x20 k-point mesh at 600 eV. The 0 of the x-axis is the Fermi Energy.
Because of the extent of the DOS, much of which has no states, we restrict our analysis from eight eV below the Fermi energy to eight eV above the Fermi energy. This portion of the DOS curves are plotted in figures 6 and 7.
As expected, when the DOS is calculated with a denser sampling of k-space, the curve is smoother and more detailed, rather than sharply varying due to poor sampling.
Quantitatively, as we move from one value of N to the next, the average change in a single point on the curve decreases. It is noteworthy for the some steps in N, there is no change at any point on the DOS curve. This is not indicative of the DOS being fully converged, as shown by later changes to the average deviation. This is representative of the sampling of the new k-point mesh not being significantly different from the previous mesh. The results of this analysis are shown in figures 8 and 9.
It is noteworthy that these curves are not quite the same as those given in figures 2 and 3. In the case of the system energy, we can simply plot the resulting system energy against k-point number and see the overall trend of the values. We can measure the convergence and fit for behavior at large numbers of k-points. In contrast, we cannot make a truly analogous comparison for the DOS curves.
When we speak about the convergence of the system energy with respect to k-point number, we are addressing the variation between calculation results at a given number of k-points and the result we would obtain with an infinite number of k-points. For a variational parameter, such as the variation of system energy with respect to cutoff energy, this is a relatively easy comparison, since the change in the system energy will always decrease for every additional increase of the energy cutoff by a given amount. For the case of the variation of system energy with respect to k-points, we often test a computationally efficient range of k-points and show that after a certain number of k-points, the system energy does not vary beyond a certain interval, such as 0.05 eV.
In the case of the DOS in this post, we use the mean squared deviation between subsequent curves as a method analogous to comparing variation in the system energy from k-point number to k-point number. The analogous convergence condition is then that the mean squared deviation above a certain number of k-points is below a certain threshold. To facilitate this comparison, we plot the sum of the mean squared deviation in figure 10.
Here, we compare the summed mean squared deviation against N=20, our most computationally intensive calculation. This figure makes it clear how the convergence of our calculation proceeds, but we cannot extract predictions about the behavior of the DOS curve at arbitrarily high number of k-points from this figure.
Conclusions
As predicted by the theory, we need many more k-points to appropriately converge the DOS results than we do for the system energy. This should be clear from the clear qualitative differences between figures 6 and 7. More quantitatively, the sum of the mean squared deviations of the DOS curves is approximately 0.005 from N=13 onward. From N=18 onward, the sum of the mean squared deviations is 0.001. If the DOS curve itself is being reported, then the calculation performed at 13x13x13 k-points would give a well-converged result, whereas a lower number of k-points would leave the the sum of the mean squared deviations above 0.18. Meanwhile, the convergence of the system energy with respect to k-points needs only a 6x6x6 k-point mesh.
Additional work in this topic area can investigate the area in an even narrower region around the Fermi energy, which would require an even denser sampling of k-space in that region. This part of the density of states is of particular interest for thermodynamic properties, but is beyond the scope of this post.
References
[1] D. Sholl and J. Steckel, Density Functional Theory: A Practical Introduction. (Wiley 2009)
[2] S. J. Clark, M. D. Segall, C. J. Pickard, P. J. Hasnip, M. J. Probert, K. Refson, M. C. Payne, “First principles methods using CASTEP”, Zeitschrift fuer Kristallographie 220(5-6) pp. 567-570 (2005)
[3] John P. Perdew, Kieron Burke, and Matthias Ernzerhof, “Generalized Gradient Approximation Made Simple”, Phys. Rev. Lett. 77, 3865 – Published 28 October 1996; Erratum Phys. Rev. Lett. 78, 1396 (1997)
[4] D D Koelling and B N Harmon 1977 J. Phys. C: Solid State Phys. 10 3107