Polarons from first principles, without supercells
Pith reviewed 2026-05-25 20:06 UTC · model grok-4.3
The pith
Polarons are computed from first principles by solving a secular equation using DFPT phonons and electron-phonon matrix elements, without supercells.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We develop a formalism and a computational method to study polarons in insulators and semiconductors from first principles. Unlike in standard calculations requiring large supercells, we solve a secular equation involving phonons and electron-phonon matrix elements from density-functional perturbation theory, in a spirit similar to the Bethe-Salpeter equation for excitons. We show that our approach describes seamlessly large and small polarons, and we illustrate its capability by calculating wavefunctions, formation energies, and spectral decomposition of polarons in LiF and Li2O2.
What carries the argument
The secular equation constructed from DFPT phonons and electron-phonon matrix elements, which is solved to obtain the polaron states and energies.
If this is right
- Polaron wavefunctions and formation energies become accessible without constructing supercells whose size scales with the polaron radius.
- Large (delocalized) and small (localized) polarons are obtained from the same eigenvalue problem.
- Spectral decomposition of the polaron states can be extracted directly from the eigenvectors of the secular equation.
- The computational cost is set by the DFPT calculation on the primitive cell rather than by supercell size.
Where Pith is reading between the lines
- The approach could be applied to materials with small primitive cells where supercell methods become impractical due to computational scaling.
- Direct comparison of the computed polaron binding energies against measured values in LiF would provide an external test of the secular equation's accuracy.
- Because the formalism is built on standard DFPT outputs, it can be implemented in existing first-principles codes that already generate phonon and electron-phonon data.
Load-bearing premise
The secular equation built from DFPT phonons and electron-phonon matrix elements is sufficient to capture the essential polaron physics for the materials considered.
What would settle it
A side-by-side comparison of the formation energies and wavefunction spreads obtained from this secular-equation method versus converged large-supercell calculations for the same materials LiF and Li2O2.
Figures
read the original abstract
We develop a formalism and a computational method to study polarons in insulators and semi-conductors from first principles. Unlike in standard calculations requiring large supercells, we solve a secular equation involving phonons and electron-phonon matrix elements from density-functional perturbation theory, in a spirit similar to the Bethe-Salpeter equation for excitons. We show that our approach describes seamlessly large and small polarons, and we illustrate its capability by calculating wavefunctions, formation energies, and spectral decomposition of polarons in LiF and Li2O2.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper develops a formalism for calculating polaron properties in insulators and semiconductors by solving a secular equation that incorporates phonons and electron-phonon matrix elements obtained from density-functional perturbation theory (DFPT). This approach is presented as an alternative to supercell-based methods and is claimed to handle both large and small polarons seamlessly. The method is illustrated through calculations of wavefunctions, formation energies, and spectral decompositions for polarons in LiF and Li2O2.
Significance. If the central construction holds, the work offers a computationally efficient first-principles route to polaron studies that could enable investigations in a broader range of materials. The seamless description across polaron sizes is a potential strength, and the use of standard DFPT quantities is advantageous for reproducibility with existing codes.
major comments (2)
- [Secular equation construction] The kernel of the secular equation is built from harmonic phonons and linear electron-phonon matrix elements. For the small polaron in LiF, where lattice distortions are large and localized, this linear approximation may not suffice; the manuscript should demonstrate that higher-order terms do not affect the reported formation energies by more than the claimed precision.
- [Results for LiF] No explicit validation against a fully relaxed supercell calculation for the same material is provided to test the assumption that the linear-response basis captures the essential physics.
minor comments (1)
- [Abstract] The abstract mentions 'spectral decomposition of polarons' without defining what this quantity represents in the context of the method.
Simulated Author's Rebuttal
We thank the referee for their careful reading of the manuscript and for highlighting potential limitations of the linear-response framework. We address each major comment below.
read point-by-point responses
-
Referee: [Secular equation construction] The kernel of the secular equation is built from harmonic phonons and linear electron-phonon matrix elements. For the small polaron in LiF, where lattice distortions are large and localized, this linear approximation may not suffice; the manuscript should demonstrate that higher-order terms do not affect the reported formation energies by more than the claimed precision.
Authors: The secular equation is constructed from the harmonic phonon frequencies and the first-order electron-phonon matrix elements obtained via DFPT, as is standard for this class of methods. The self-consistent solution of the secular equation permits substantial lattice relaxation through the polaron coefficients even within the linear coupling. For the small polaron in LiF the reported formation energy is given to the numerical precision of the underlying DFPT data. We agree that anharmonic and higher-order electron-phonon contributions could become relevant for strongly localized distortions; the present work does not include such terms. In the revised manuscript we will add an explicit paragraph discussing the linear-response approximation and its expected range of validity for the systems considered. revision: partial
-
Referee: [Results for LiF] No explicit validation against a fully relaxed supercell calculation for the same material is provided to test the assumption that the linear-response basis captures the essential physics.
Authors: A central motivation of the formalism is to obtain polaron properties without recourse to supercells, by working directly with DFPT quantities in reciprocal space. A fully relaxed supercell calculation for the small polaron in LiF would require cells large enough to isolate the localized distortion, rendering such a benchmark both computationally prohibitive and subject to its own finite-size errors. The internal consistency of the results—namely the seamless description of both large and small polarons in LiF and Li2O2—provides the primary validation within the scope of the paper. We will insert a short clarifying paragraph explaining why a direct supercell comparison is not performed. revision: partial
- Quantitative demonstration that higher-order (anharmonic or nonlinear) terms do not shift the reported formation energies beyond the stated precision for the small polaron in LiF, as this would require extending the formalism beyond the linear DFPT framework used throughout the manuscript.
Circularity Check
Derivation self-contained from standard DFPT inputs
full rationale
The paper constructs a secular equation from DFPT phonons and first-order electron-phonon matrix elements to obtain polaron wavefunctions, formation energies, and spectral weights. This is presented as a direct first-principles method analogous to the Bethe-Salpeter equation, applied to LiF and Li2O2. No quoted equations or steps reduce the output to a fitted parameter, self-citation chain, or input by construction. The central claim remains independent of the target results.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
L. D. Landau, Phys. Z. Sowjet. 3, 664 (1933)
work page 1933
-
[2]
J. G. Bednorz and K. A. M¨ uller, Z. Phys. B 64, 189 (1986)
work page 1986
- [3]
-
[4]
Nat. Mater. 15, 835 (2016)
work page 2016
-
[5]
C. Cancellieri, A. S. Mishchenko, U. Aschauer, A. Fil- ippetti, C. Faber, O. Bariˇ si´ c, V. Rogalev, T. Schmitt, N. Nagaosa, and V. N. Strocov, Nat. Commun. 7, 10386 (2016)
work page 2016
-
[6]
C. Chen, J. Avila, E. Frantzeskakis, A. Levy, and M. C. Asensio, Nat. Commun. 6, 8585 (2015)
work page 2015
-
[7]
Y. F. Nie, D. Di Sante, S. Chatterjee, P. D. C. King, M. Uchida, S. Ciuchi, D. G. Schlom, and K. M. Shen, Phys. Rev. Lett. 115, 096405 (2015)
work page 2015
-
[8]
H. Zhu, K. Miyata, Y. Fu, J. Wang, P. P. Joshi, D. Nies- ner, K. W. Williams, S. Jin, and X.-Y. Zhu, Science 353, 1409 (2016)
work page 2016
-
[9]
R. P. Feynman, Phys. Rev. 97, 660 (1955)
work page 1955
- [10]
-
[11]
M. Bahrami, A. Großardt, S. Donadi, and A. Bassi, New J. Phys. 16, 115007 (2014)
work page 2014
- [12]
-
[13]
H. Fr¨ ohlich, H. Pelzer, and S. Zienau, Lond. Edinb. Dubl. Phil. Mag. J. Sci. 41, 221 (1950)
work page 1950
-
[14]
T. D. Lee, F. E. Low, and D. Pines, Phys. Rev. 90, 297 (1953)
work page 1953
- [15]
- [16]
- [17]
-
[18]
A. S. Alexandrov and P. E. Kornilovitch, Phys. Rev. Lett. 82, 807 (1999)
work page 1999
-
[19]
A. S. Alexandrov, Phys. Rev. B 61, 12315 (2000)
work page 2000
-
[20]
A. S. Alexandrov and B. Y. Yavidov, Phys. Rev. B 69, 073101 (2004)
work page 2004
-
[21]
C. Perroni, V. Cataudella, and G. De Filippis, J. Phys. Condens. Matter 16, 1593 (2004)
work page 2004
- [22]
-
[23]
L. Cruzeiro-Hansson, J. Eilbeck, J. Marın, and F. Rus- sell, Phys. Lett. A 266, 160 (2000)
work page 2000
-
[24]
A. S. Alexandrov, Theory of superconductivity: from weak to strong coupling (IOP, Bristol, 2003)
work page 2003
-
[25]
F. Ortmann, F. Bechstedt, and K. Hannewald, Phys. Rev. B 79, 235206 (2009)
work page 2009
-
[26]
H. Fehske and S. Trugman, in Polarons in Advanced Ma- terials, Springer Series in Material Sciences 103, edited by A. S. Alexandrov (Springer Verlag, Dordrecht, 2007) pp. 393–461
work page 2007
- [27]
- [28]
-
[29]
P. Kornilovitch, in Polarons in Advanced Materials , Springer Series in Material Sciences 103, edited by A. S. Alexandrov (Springer Verlag, Dordrecht, 2007) pp. 192– 230
work page 2007
-
[30]
A. S. Mishchenko, N. V. Prokof’ev, A. Sakamoto, and B. V. Svistunov, Phys. Rev. B 62, 6317 (2000)
work page 2000
- [31]
-
[32]
T. Hahn, S. Klimin, J. Tempere, J. T. Devreese, and C. Franchini, Phys. Rev. B 97, 134305 (2018)
work page 2018
-
[33]
A. S. Alexandrov, Polarons in advanced materials , Springer Series in Material Sciences, Vol. 103 (Springer, Dordrecht, 2007)
work page 2007
-
[34]
J. T. Devreese and A. S. Alexandrov, Rep. Prog. Phys. 72, 066501 (2009)
work page 2009
-
[35]
A. S. Alexandrov and J. T. Devreese, Advances in polaron physics (Springer, Berlin, 2010)
work page 2010
-
[36]
Emin, Polarons (Cambridge University Press, Cam- bridge, 2013)
D. Emin, Polarons (Cambridge University Press, Cam- bridge, 2013)
work page 2013
- [37]
-
[38]
C. Franchini, G. Kresse, and R. Podloucky, Phys. Rev. Lett. 102, 256402 (2009)
work page 2009
- [39]
-
[40]
B. Himmetoglu, A. Janotti, L. Bjaalie, and C. G. Van de Walle, Phys. Rev. B 90, 161102 (2014)
work page 2014
-
[41]
N. Bondarenko, O. Eriksson, and N. V. Skorodumova, Phys. Rev. B 92, 165119 (2015)
work page 2015
-
[42]
M. Reticcioli, M. Setvin, X. Hao, P. Flauger, G. Kresse, M. Schmid, U. Diebold, and C. Franchini, Phys. Rev. X 7, 031053 (2017)
work page 2017
-
[43]
M. Reticcioli, M. Setvin, M. Schmid, U. Diebold, and C. Franchini, Phys. Rev. B 98, 045306 (2018)
work page 2018
-
[44]
J. M. Morbec and G. Galli, Phys. Rev. B 93, 035201 (2016)
work page 2016
- [45]
-
[46]
C. Spreafico and J. VandeVondele, Phys. Chem. Chem. Phys. 16, 26144 (2014)
work page 2014
- [47]
-
[48]
W. H. Sio, C. Verdi, S. Ponc´ e, and F. Giustino, (un- published)
- [49]
- [50]
- [51]
- [52]
-
[53]
Z. Feng, V. Timoshevskii, A. Mauger, C. M. Julien, K. H. Bevan, and K. Zaghib, Phys. Rev. B 88, 184302 (2013)
work page 2013
-
[54]
J. P. Perdew, K. Burke, and M. Ernzerhof, Phys. Rev. Lett. 77, 3865 (1996)
work page 1996
- [55]
-
[56]
D. R. Hamann, Phys. Rev. B 88, 085117 (2013)
work page 2013
- [57]
-
[58]
N. Marzari, A. A. Mostofi, J. R. Yates, I. Souza, and D. Vanderbilt, Rev. Mod. Phys. 84, 1419 (2012)
work page 2012
-
[59]
A. A. Mostofi, J. R. Yates, G. Pizzi, Y.-S. Lee, I. Souza, D. Vanderbilt, and N. Marzari, Comput. Phys. Commun. 185, 2309 (2014)
work page 2014
-
[60]
F. Giustino, M. L. Cohen, and S. G. Louie, Phys. Rev. B 76, 165108 (2007)
work page 2007
-
[61]
S. Ponc´ e, E. R. Margine, C. Verdi, and F. Giustino, Comput. Phys. Commun. 209, 116 (2016)
work page 2016
- [62]
-
[63]
N. F. Mott, Rev. Mod. Phys. 40, 677 (1968)
work page 1968
- [64]
- [65]
- [66]
-
[67]
J. Kang, Y. S. Jung, S.-H. Wei, and A. C. Dillon, Phys. Rev. B 85, 035210 (2012)
work page 2012
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.