Benchmark studies on the building blocks of DNA: I. Superiority of

The Journal of Physical Chemistry
Benchmark studies on the building blocks of DNA: I.
Superiority of Coupled Cluster methods in describing the
excited states of nucleobases in the Franck-Condon region
Journal:
Manuscript ID:
Manuscript Type:
Date Submitted by the
Author:
Complete List of Authors:
The Journal of Physical Chemistry
Draft
Article
n/a
Szalay, Peter; Institute of Chemistry, Eoetvoes Lorand University
Watson, Thomas; University of Florida,
Perera, Ajiith; University of Florida, Department of Chemistry
Lotrich, Victor; University of Florida, Quantum Theory Project
Bartlett, Rodney; University of Florida, Department of Chemistry
Note: The following files were submitted by the author for peer review, but cannot be converted to
PDF. You must view these files (e.g. movies) online.
paper1_submitted.pdf
ACS Paragon Plus Environment
Page 1 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
Benchmark studies on the building blocks of DNA: I.
Superiority of Coupled Cluster methods in
describing the excited states of nucleobases in the
Franck-Condon region
Péter G. Szalay,∗,† Thomas Watson,‡ Ajith Perera,‡ Victor F. Lotrich,‡ and Rod J.
Bartlett‡
Institute of Chemistry, Eötvös University, H-1518 Budapest, P.O.Box 32, Hungary, and Quantum
Theory Project, University of Florida, Gainesville, FL
E-mail: [email protected]
∗ To
whom correspondence should be addressed
of Chemistry, Eötvös University, H-1518 Budapest, P.O.Box 32, Hungary
‡ Quantum Theory Project, University of Florida, Gainesville, FL
† Institute
1
ACS Paragon Plus Environment
The Journal of Physical Chemistry
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Abstract
Equation of Motion Excitation Energy Coupled-Cluster (EOMEE-CC) methods including
perturbative triple excitations have been used to set benchmark results for the excitation energy and oscillator strength of the building units of DNA, i.e. cytosine, guanine, adenine and
thymine. In all cases the lowest twelve states have been considered including valence and
Rydberg ones. Triple-ζ basis sets with diffuse functions have been used and the results are
compared to CC2, CASPT2, TDDFT and DFT/MRCI results from the literature. The results
clearly show that it is only the EOMEE-CCSD(T) which is capable to provide accuracy of
about 0.1 eV. EOMEE-CCSD systematically overshoots all type of excited states by 0.1-0.3
eV, while CC2 is surprisingly accurate for π → π ∗ excited states but fails (often badly) for
n → π ∗ and Rydberg states. DFT and CASPT2 seems to give reliable results for the lowest
state but the error increases fast with the excitation level. The differences in the excitation
energies often change the energy ordering of the states which should even influence the conclusions of excited state dynamics obtained with these approximate methods. The results call
for further benchmark calculations on larger building blocks of DNA (nucleosides, basis pairs)
at the CCSD(T) level.
Introduction
Electronic properties of DNA and RNA lies at the focus of biochemical research. 1–4 The basic
building blocks of these systems are the nucleobases. Therefore it is important to obtain detailed
information on the structure, excitation energy, and the associated spectra of these molecules. For
earlier results we refer to the detailed review by Shukla and Leszczynski. 5 Recent results include
the accurate determination of the ground state geometry of uracil, 6 the simulation of the spectra of
cytosine, 7–9 potential energy surfaces of excited states to locate minima, conical intersections and
relaxation pathways for several of the nucleobases. 10–21 There are also several studies reporting
simulations of the excited state dynamics. 3,21–39
Due to the size of nucleobases, mostly lower level approaches such as MCSCF, CASPT2, 40–43
2
ACS Paragon Plus Environment
Page 2 of 27
Page 3 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
CC2, 43,44 TDDFT, 45,46 eventually MR-CI with limited reference or excitation space 3,14,47 have
been used to describe the excitation spectrum, but higher level calculations with different versions
of EOM-CC methods 7,9,13,43,48 are also available. To answer the question what happens with DNA
after excitation, dynamics simulations in the excited states need to be performed. These simulations can supply additional information on how the relaxation of excited molecules takes place
and what are the structural changes associated with this process. 3 Because of the high computational demand, however, these simulations are usually based on lower level calculations, like
CASSCF, 3,24,25,31 MR-CIS, 3,30 TDDFT 22,27,35 or even semi-empirical methods. 27,29,33,34,36 The
reliability of these calculations depends on the quality of the underlying potential energy surfaces,
and without accurate account of the relative energies of the electronic states involved, the picture
obtained from the simulation can be unreliable. Therefore it is very important to know how accurately the methods describe details of the surfaces, such as the order of excited states, location of
conical intersections etc. To that end calibration against high level calculations is important.
It is clear that in order to understand the electronic and spectral properties of DNA and RNA,
larger fragments than the monomers of nucleobases need to be studied. Such calculations are,
however, very expensive and approximate methods, provided they are accurate enough, can be
quite useful. In this respect, too, benchmarking by reliable and accurate methods is unavoidable.
EOMEE-CC (Equation of Motion Excitation Energy Coupled Cluster) method 49 (see also
Ref 50 ) is a powerful tool for calculating excitation energy within CC theory. In particular, the
version limited to single and double excitation (EOM-CCSD) 51,52 and the related linear response
CCSD-LR method 53 provide accurate results for excitation energies. 54,55 Extension to triple excitation have been done in different versions. The full EOM-CCSDT was presented by Kucharski,
et al. 56 and serves as the benchmark for triples methods. 54 However, in practice CCSDT is an ∼n8
method, and building its EOM matrix is also ∼n8 , to give a matrix of rank ∼n6 , which will require
partial, iterative diagonalization. Consequently, simplified methods reminiscent of the ground state
hierarchy, CCSDT-n 57 have been developed, EOMEE-CCSDT-1, 58 EOMEE-CCSDT-3, 59 along
with CC3-LR. 60 In these methods the triple excitation amplitudes are formally obtained from
3
ACS Paragon Plus Environment
The Journal of Physical Chemistry
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
an ∼n7 iterative procedure. However, they still suffer from the matrix diagonalization of rank
∼n6 . Hence, it is very important to formulate non-iterative versions, and two have been proposed,
EOMEE-CCSD(T) 58 and EOM-CCSD(Te). 59 Both add a single non-iterative evaluation of triples,
analogous to CCSD(T) 61,62 and only require an iterative diagonalization of a matrix of the same
size as that for EOM-CCSD, i.e. ∼n4 . Hence, they are vastly faster than the iterative triples models.
Recently, one of us 9 presented EOM-CC (Equation of Motion Coupled Cluster) simulation of
the UV spectrum of cytosine and compared it to a newly measured matrix-isolation (MI) spectrum.
After considering several tautomers, excellent agreement between the measured and calculated
spectra has been obtained. It was concluded that at the EOM-CCSD level the discrepancy in
excitation energy can be as large as 0.1-0.3 eV and inclusion of triple excitations are necessary to
get an accuracy of 0.1 eV. 9
The aim of this paper is to present a detailed analysis of the performance of different methods
by studying the excited states of nucleobases. In particular, the molecular building blocks of DNA,
i.e. cytosine, adenine, guanine, and thymine, will be investigated. The methods applied are all from
the Coupled-Cluster family, i.e. EOMEE-CC methods. The primary CCSD approximation will
be extended with triple excitations using the iterative EOMEE-CCSDT-3, 59 and CC3 60 methods
as well as the non-iterative EOMEE-CCSD(T) variant 58 which is based on perturbation theory
arguments (for more detail see below). (In what follows, for brevity, just EOM will be used for
EOMEE and even the EOM prefix will some times be omitted in the text and tables.)
The Coupled Cluster results will also be compared with various literature data. For studying
nucleobases, the most often used methods are TDDFT, CC2 and CASSCF; for review see Ref. 5
Since there are quite a large number of calculations with these methods, we have chosen those
which are in our opinion representative for the given method, available for most nucleobases and
represent a solid ground for an unbiased comparison. In particular we compare our CC results to
the CC2 results of Fleig et al., 44 to TDDFT results by Shukla and Leszczynski, 45 to the DFT/MRCI
results by Silva-Junior et al. 46 and the CASPT2 results from a large benchmark study by Schreiber
et al. 43 The later paper does not discuss guanine, therefore for this one molecule we compare with
4
ACS Paragon Plus Environment
Page 4 of 27
Page 5 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
the CASPT2 results by Yamazaki et al. 26
The large CC calculations presented in this paper should therefore serve as benchmarks and
a methodological guide for future studies on larger pieces of DNA. Thus, we include up to 12
excited states, both valence and Rydberg, in the calculations, because, in our opinion, an accurate
description of the valence states requires the inclusion of the proper interaction with Rydberg states,
as well (see discussion below). Description of Rydbeg states requires the inclusion of diffuse
functions in the basis sets. To that end aug-pVDZ and aug-pVTZ basis sets will be used. 63 Careful
assignment of the calculated states based on the characteristics of the wave function (dominant
excitations and form of corresponding orbitals; natural orbitals of the density differences) will be
performed to obtain relationship between the accuracy and types of the excitation.
We note in passing that the methodological conclusion will in some cases differ from that
obtained for small molecules in earlier benchmark studies. 58,59
Methodology
Two program systems have been used in the calculations, namely ACES III 64,65 and CFOUR. 66
CFOUR was used for the calculation on cytosine at all levels, except EOM-CCSD(T), while all
other calculations have been performed with ACES III.
To maintain compatibility with earlier and following projects, the geometries of the four nucleobases have been obtained at slightly different levels of theory. In the cases of cytosine and adenine
the optimization was performed at the CCSD level using cc-pVDZ basis, 63 for guanine MBPT(2)
(MP2) level using cc-pVDZ basis sets, 63 while for thymine the MBPT(2)/aug-cc-pVDZ level was
used. The slight difference in the geometry caused by the different methods, should not influence
the results. Only the biologically relevant, canonical tautomers of the four nucleobases have been
included in this study. The optimization of cytosine and adenine have been performed by CFOUR
using internal coordinates 67 and the GDIIS algorithm. 68 The other structures were optimized with
ACES III using redundant internal coordinates. 69,70
5
ACS Paragon Plus Environment
The Journal of Physical Chemistry
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Page 6 of 27
Excitation energies are the main focus of this paper and the calculations have been performed at
different levels of EOM-CC theory. These include the singles-doubles level (EOM-CCSD) 51,52 and
various levels including triple excitations: EOM-CCSDT-3, 59 CC3-LR 60 with iterative and EOMCCSD(T) 58 with non-iterative triples. EOM-CCSDT-3, CC3-LR as well as CC2-LR 71 calculations
have been performed with CFOUR, while all EOM-CCSD(T) calculations were done with ACES
III. Two basis sets are used in the excitation energy calculations, the aug-cc-pVDZ and aug-ccpVTZ. 63 While both basis sets are used in the EOM-CCSD and EOM-CCSD(T) calculations, only
the smaller bases are used in conjunction with the iterative triple methods.
Oscillator strengths have been calculated from transition moments obtained according to Ref. 51
at the EOM-CCSD level. We also report the CC3-LR oscillator strength 60 for cytosine.
A detailed description of the EOM-CCSD(T) method and its implementation in ACES III will
be presented separately. 72 For the sake of fully documenting the results we give here the formula
used to calculate the correction of the excitation energy (ωk ) with respect to the CCSD one:
CCSD(T )
ωk
(1)
(1)
(2)
(2)
(2)
(2)
= hLk3 | f (0) |Rk3 i + hLk3 |V (1) |Rk i + hLk |V (1) |Rk3 i
with f (0) = ∑ f pp {p† p}, V (1) = ∑ fai {a† i} + {i† a} +
p
ai
∑ hpq||rsi{p†q†sr}.
pqrs
(1)
Rk
(1)
(1)
and Lk
are the EOM-CCSD right and left eigenvectors of state k, respectively, being necessarily first order
(2)
(2)
quantities in V (1) (superscripts in parenthesis stand for the order of the quantity). Rk3 and Lk3
are the second order contributions to the eigenvectors in the triples space and calculated from:
(2)
(1)
(2)
Lk3 D3 = Lk V (1) Q3
(3)
D3 Rk3
= Q3V (1) Rk
(2)
(1)
with Q3 being a projection onto the triply excited state, D3 is the corresponding difference of the
occupied and virtual orbital energies. With this the final correction to the excitation energy reads:
CCSD(T )
ωk
(1)
(1)
(1)
(1)
(0) −1
(1)
= hLk |V (1) Q3 D−1
D3 Q3V (1) |Rk i + hLk |V (1) Q3 D−1
|Rk i
3 f
3 V
6
ACS Paragon Plus Environment
Page 7 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
(1)
(1)
(1)
+hLk |V (1) D−1
|Rk i
3 Q3V
(4)
Originally, Watts and Bartlett 58 presented this correction as ’second-order’ with respect to the
EOM-CCSD eigenvalue. Here an other choice was preferred: to be consistent with the treatment
of the ground state Λ-CCSD(T) method 61,73 henceforth, we consider it as a “fourth-order” triples
correction in the perturbation V (1) . We are also free to incorporate the ω CCSD value in the previous
expression into H0 , since it simply accounts for a shift. Numerically, there is virtually no difference in the answers. With these choices, we have analogous diagrams for the fourth-order triples
corrections in the ground state and the EOM excited states. We also retain the occupied-occupied
and virtual-virtual orbital rotation invariance for the approximation and the final equations can be
viewed as generalization of the so called Λ-CCSD(T) correction 54,61,62,73,74 for the ground state
using instead of the ground state quantities T3 , Λ1 and Λ2 the corresponding excited state quantities
R3 , R1 , R2 , and L3 , L1 and L2 , respectively. 72
ACES III was essential for the success of this study and those following this paper on larger
fragments of the DNA. 75,76 ACES III 64,77 is a newly written program to perform high level quantum chemical calculations with many-body methods. The program operates on blocks of numbers,
instead of individual floating point numbers, and parallelizes over these blocks. This offers a high
degree of tune-ability resulting in superior scaling across thousands to hundreds of thousands of
computing cores. 77 For more details see Refs. 64,77,78
Results
The choice of basis set is essential to get reliable and accurate results. It is generally assumed
that valence excitation energies are less sensitive to the basis set choice, and diffuse functions
are required only to describe the Rydberg states. However, description of the proper interaction
of the valence states with Rydberg ones might be also important which again necessitate diffuse
basis functions. For example, it has been shown for cytosine 7 that valence excitations energies are
7
ACS Paragon Plus Environment
The Journal of Physical Chemistry
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
influenced up to 0.2 eV by diffuse functions, due to some interaction of valence and Rydberg states.
Of the results obtained with the two basis sets (see previos section), only the aug-cc-pVTZ results
are presented below, while the aug-cc-pVDZ results can be found in supplementary materials.
The comparison of the results obtained by these two basis sets shows that they are essentially
the same for valence states (deviation is in the range of 0.01-0.03 eV), and even for the lowenergy Rydberg states only a small difference could be noticed: the triple-ζ excitation energies
are larger by up to 0.15-0.17 eV only. This means that valence-Rydberg interactions are already
well described with the aug-cc-pVDZ basis and in later studies on larger systems the smaller basis
can be used. Methodological comparison with calculations that do not include diffuse functions
should, however, be handled with care (see below).
It has been shown in Ref. 7 that the triple excitations have an effect of 0.1-0.3 eV on the excitation energies of cytosine. This means that if one seeks an accuracy of about 0.1 eV, inclusion
of triple excitations is an absolute necessity. There is no reason to doubt that this conclusion is
valid for the other nucleobases, as well. In Ref. 7 the CC3-LR method 60 was used to include triples
effect. It is an iterative method, and like the other ∼n7 methods we use, the storage of the triple
excitation amplitudes is not necessary, but the large matrix diagonalization remains making such
calculations substantially more expensive than CCSD calculations. Therefore we prefer the noniterative methods like CCSD(T) 58 but to document their accuracy, we also consider the iterative
CCSDT-3 method 59 in the case of cytosine. The results obtained by the various methods and the
aug-cc-pVDZ basis are listed in Table Table 1.
All three triples methods give very similar results, the excitation energies are lower by 0.2-0.3
eV if triple excitations are considered. CC3 always gives the lowest energy, but the difference
between CCSDT-3 and CC3 is no larger than 0.1 eV. As there is no variational bound on any of
these results for excitation energies, the only way to assess accuracy is calibration to experiment,
or in the cases where they are not available, to EOM-CCSDT or full CI. Due to the size of the
molecule these later calculations are not possible, but we know the excellent agreement of the CC3
excitation energy is for the π → π ∗ states of cytosine. 9 The non-iterative CCSD(T) results in most
8
ACS Paragon Plus Environment
Page 8 of 27
Page 9 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
cases fall between the CCSDT-3 and CC3 results. Considering the fact the CCSD(T) method is
non-iterative and therefore the most cost-effective, it seems to be reasonable to use it in the further
studies.
Tables Table 2-Table 5 present EOM-CCSD, EOM-CCSD(T) results with aug-cc-pVTZ basis
set obtained for cytosine, adenine, guanine and thymine, respectively along with the results from
the literature. Unfortunately different basis sets have been used in these latter studies: while the
results of the CC2 calculations 44 have also been obtained by the aug-cc-pVTZ basis set (allowing
direct comparison), the 6-311++(d,p) basis was used in the TDDFT study, while the TZVP basis
was used in the CASPT2 calculations. It is expected that the 6-311++(d,p) basis gives results of
similar quality as the aug-cc-pVTZ (both being valence triple zeta with polarization functions),
but the TZVP basis is certainly not capable of describing Rydberg states due to the lack of diffuse
functions. Yamazaki et al. 26 also use a tripel-ζ basis with polarization factions, so comparison is
appropriate in the case of guanine. These facts need to be kept in mind during the course of the
discussion below.
Beside the calculated excitation energies and oscillator strengths, Tables Table 2-Table 5 also
list the assignment of the excited states. Our assignment is based on the inspection of the EOMEE
vectors and orbitals as well as difference densities. Literature data are listed according to the
assignment given in the original papers (if not noted otherwise).
Now we start a systematic discussion of the results on the various nucleobases.
Cytosine
In a recent study by Bazsó et al. 7 it has been shown that the MI/UV spectrum of cytosine can
only be explained if the three lowest energy tautomers are included in the spectrum simulation. In
the present study we include only the canonical form, which is not the lowest energy conformer
in the gas phase, 79 but it is the only conformer in solid phase and it is the only one considered in
biological systems. The reason for the stability difference in gas and condensed phase is the larger
stabilization of the canonical form by hydrogen bonds. 80
A first inspection of Table Table 2 reveals that all methods predict the π → π ∗ state to be the
9
ACS Paragon Plus Environment
The Journal of Physical Chemistry
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
lowest followed by the n → π ∗ . According to our CC calculations, there are three more π → π ∗ and
two other n → π ∗ type excitations below 7 eV. The strongest excitation is the π → 2π ∗ transition
at 6.53 eV (CCSD) and 6.33 (CCSD(T)), respectively. CASPT2 gives similar results. The CC2
calculation of Ref. 44 does not include this latter state. According to our own CC2/aug-cc-pVDZ
calculations (see supplementary materials), CC2 gives similar excitation energy as CCSD(T) for
this state, the oscillator strength is, however, somewhat lower (0.294). TDDFT excitation energies
for the first two π → π ∗ states are similar to the other methods, the oscillator strength are, however,
substantially smaller. In the case of the third π → π ∗ excitation, two states appear with substantial
oscillator strength, as a result of a pronounced interaction between valence and Rydberg states, not
shown by the other methods. The excitation energy corresponding to the stronger one of these two
transitions appears to be too high. Even considering these differences one can conclude that all
methods would give very similar (dipole-allowed) spectrum with some differences in the details.
It is particularly striking to see how well CC2 and EOM-CCSD(T) results agree for valence
π → π ∗ states. Since theoretically CC2 is an approximation to CCSD and does not include any
triple excitation effect, the results suggest that there is some kind of error compensation. CC2
excitation energies are always lower than CCSD, as are results with triple excitations. While in
the latter case the lowering comes from the additional contribution of the triple excitations, in case
of CC2 the lowering is caused by some approximations introduced with respect to CCSD. Since
EOM-CC methods (including CC2-LR) can be viewed as direct methods to obtain excitation energies (i.e. energy differences), one could argue that the good agreement of CC2 is a result of
providing a better balanced description of ground and excited states than in CCSD. When considering also the n → π ∗ states, CC2 results differ considerably from the CCSD(T) results which
makes the latter argument weak. In fact, CC2 energies for these n → π ∗ states are again lower than
the CCSD ones but the amount is much larger: while it is about 0.2-0.3 eV for the π → π ∗ sates,
it is 0.3-0.8 eV for the n → π ∗ states. Since triples corrections remain in the range of 0.2-0.3 eV
for the latter case as well, as a result, CC2 highly underestimates the excitation energy of these
states. One does not find good agreement between the CC2 and CCSD(T) results for the Rydberg
10
ACS Paragon Plus Environment
Page 10 of 27
Page 11 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
states, either. Here again CC2 excitation energies are consistently lower than the CCSD(T) ones,
the difference is about 0.1-0.3 eV. Therefore, it does not seem to be proven that the good agreement observed between CCSD(T) and CC2 results for the π → π ∗ states comes from a balanced
description of ground and excited states; rather it appears that there is fortunate error cancelation in
this case which does not happen for states with other character, in particular not for n → π ∗ states.
Finding the reasons for this different performance is out of the scope of present paper and will be
investigated in a separate one where other model systems and other second order methods will be
investigated.
Note that this unbalanced description of the states with different character has the unwanted
effect that CC2 gives a different ordering of some of the states than CCSD and CCSD(T). In the
case of cytosine, the states designated as π−1 → π ∗ and n, n−1 → 2π ∗ are flipped by CC2. Possible
consequences are discussed below.
TDDFT gives very good agreement for the first excited state (π → π ∗ ), however, all other
excitation energies seem to be underestimated, considerably for some of the states. While the
resulting TDDFT spectrum would show some resemblance with the coupled-cluster spectra for
the lower states (in particular with CC2), for higher states it does not seem to be correct even
qualitatively.
Quite good agreement can be observed between CASPT2 and coupled cluster results for the
valence states listed in Table Table 2. Again, the n → π ∗ states are somewhat underestimated.
One has to remember, however, that the inclusion of the diffuse function would further lower the
excitation energies by about 0.2 eV resulting in too low values for the entire spectrum. Note also
that very different CASPT2 numbers are available in the literature for the states of cytosine. 41,41
The good agreement found in Ref. 43 was a result of a systematic procedure (selection of reference
space, choice of the denominator shift) used for a series of molecules. The present results suggest,
however, that this good agreement between calculated CASPT2 values 43 and experiment could be
worse if diffuse functions are also included in the basis.
Adenine
11
ACS Paragon Plus Environment
The Journal of Physical Chemistry
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Results for adenine are listed in Table Table 3. Here again, only the canonical tautomer (known
as adenine-N9H 5 ) has been investigated. See Guerra et al. 81 for a discussion of other tautomers.
The lowest three states, two π → π ∗ and an n → π ∗ states, are very close in energy. Beside several
π → R states there is another n → π ∗ around 6 eV. At the higher energy end of the studied region
there are again one n → π ∗ and two π → π ∗ states close in energy. Unfortunately, Ref. 44 does not
report CC2 energies above 6.14 eV, and the n → π ∗ state is missing from the CASPT2 list. 43
The difficulty with adenine’s spectrum is that there is a pair of close lying π → π ∗ states (first
and second excited states with CCSD and CCSD(T)). CC2 cannot separate them with the aug-ccpVTZ basis giving a pair of complex eigenvalues (see Ref.; 44 we have found the same problem
with aug-cc-pVDZ basis, see supplements). In case of TDDFT the lowest excited state is n → π ∗
but the energy of the two π → π ∗ excited states agree well with the CCSD(T) results. Note,
however, that the order of the two states are reversed: in the TDDFT case the brighter one is lower
in energy. CASPT2 43 gives a splitting of 0.1 eV of the two π → π ∗ states, but both states come out
with similar oscillator strength, a discrepancy to the CC results. Note that the oscillator strength
reported in Ref. 43 comes from a MCSCF calculation, so it does not include dynamic correlation.
Of the overall performance of the different methods, one can draw similar conclusions as for
cytosine: for π → π ∗ states CC2 excitation energies are close to the CCSD(T) ones, but that of
the n → π ∗ states are far too low. Except for the first two π → π ∗ states the TDDFT excitation
energies are too low with discrepancy as much as 0.5 eV. CASPT2 energies for the valence states
are in good agreement with CCSD(T). Note again, inclusion of diffuse functions might change this
conclusion.
Underestimation of the n → π ∗ excitation energies has the consequence that both CC2 and
TDDFT incorrectly give the n → π ∗ state as the lowest excited state contrary to CCSD and
CCSD(T) where both π → π ∗ states are below the n → π ∗ one. This can have severe consequences on relaxation dynamics where conical interactions between these states are of crucial
importance. 30
Guanine
12
ACS Paragon Plus Environment
Page 12 of 27
Page 13 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
In the case of guanine, the biologically relevant canonical form is the keto-N9H. 5 Table Table 4
shows that the lowest excited state predicted by all but the CCSD method is a π → π ∗ state; in
case of the CCSD, the first and second states are practically degenerate. There is, however, a
discrepancy in the assignment of the other component of this pair of states: while the CC methods
(CCSD, CCSD(T) and CC2) predict substantial π → R character for this state (A”symmetry), the
assignment based on the CASPT2 26 and also on the TDDFT 45 calculations is given as π → σ ∗ .
Note that the CASPT2 calculations by Yamazaki, Domcke and Sobolewski 26 reported in Table
Table 4 uses a triple-ζ basis with polarization functions, so the description of Rydberg states are
possible in this case. Nevertheless, all methods agree that the two lowest states for guanine are
close in energy, the brighter one is a π → π ∗ state, while the other one is weak. The largest gap
between them is at the CASPT2 level, in fact the energy of the π → π ∗ state seems to be far too
low.
It is interesting to note that CCSD(T) and CC2 results compare somewhat differently than
for cytosine: we cannot find a perfect agreement between CC2 and CCSD(T) excitation energies
for the π → π ∗ states, while for the π → R states CC2 and CCSD results almost match. The
n → π ∗ excitations energies are again too low at the CC2 level: while for the first n → π ∗ state
the underestimation with respect to CCSD(T) is small (0.05 eV), it is as much as 0.30 eV for the
n → 2π ∗ state (in Ref. 44 this state is assigned as n → R, along with another close lying state at 6.13
eV). As a result, CC2 does not seem to predict the order of the n → π ∗ and second π → π ∗ states
correctly. Note that these later two states are very close in energy even at the CCSD level; it is only
the CCSD(T) which pulls them somewhat more apart. Due to near-degeneracy, CC2 distributes
the oscillator strength of the π → 2π ∗ state between this and a π → R state. No such effect can be
observed for CCSD where these two states are further apart.
As mentioned already, the CASPT2 study by Yamazaki et al. 26 used a basis set which can describe Rydberg sates. Unfortunately, only valence states are reported in Ref. 26 The results for these
show, however, a different pattern from what we observed for other molecules by comparing to the
CASPT2 results of Schreiber et al. 43 The reason can be assigned to the fact that CASPT2 results
13
ACS Paragon Plus Environment
The Journal of Physical Chemistry
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
largely depend on several choices such as reference space, level shift, etc. and these have been
carefully chosen by Schreiber et al., 43 therefore their results seem to be more systematic. Another
important difference is that there are no diffuse functions included in the basis in Ref., 43 while
diffuse functions are included in the calculations of Ref.; 26 as stated above and in Ref., 9 diffuse
functions lower the excitation energy of the valence states by about 0.2 eV, as well. Therefore, it
seems to be proven that CASPT2 excitation energies are too low.
Thymine
Finally, the results on thymine are summarized in Table Table 5. It is seen there that all methods
predict the n → π ∗ state to be lowest followed by the π → π ∗ state. Comparison of CC2 and
CCSD(T) results shows much better overall agreement than before: while for π → π ∗ states one
does not find the perfect agreement observed for the other molecules (maximum discrepancy is less
than 0.1 eV though), the excitation energy of the first n → π ∗ state agrees perfectly and even in
case of the n−1 → 2π ∗ state the underestimation by the CC2 method is only 0.1 eV. All of TDDFT
excitation energies are too low. For higher states the discrepancy grows fast, for the π → 2π ∗ it is
already more than 0.3 eV. Thus, TDDFT does not seem to be very useful to predict the spectrum
of thymine. As before, CASPT2 results are reasonable for all valence states, although the splitting
of the n−1 → 2π ∗ and π → 2π ∗ states are largely overestimated in this case. One should not
forget about the possible effect of diffuse functions which presumably would lower all valence
state energies resulting in poorer agreement.
Discussion and Conclusions
The results presented above can be summarized as follows. The excited states of nucleobases
can be described by EOM-CC methods including triple excitations very accurately. The error of
the vertical excitation energies is assumed to be about 0.1 eV or less (aug-cc-pVTZ or aug-ccpVDZ basis) in this case. Among the methods including triple effects, the most cost effective,
non-iterative CCSD(T) performs well. In the case of cytosine we have found between the non-
14
ACS Paragon Plus Environment
Page 14 of 27
Page 15 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
iterative EOM-CCSD(T) and the iterative versions (CC3, CCSDT-3) a perfect agreement. EOMCCSD excitation energies are too high by about 0.1-0.3 eV. No relation between the size of this
discrepancy and the character of the states could be found. Therefore it seems to be essential to
include triple excitations in the wave function if accuracy of about 0.1 eV is required.
Concerning the accuracy of the approximate CC2 method we observed very good agreement
with the CCSD(T) results for the π → π ∗ excited states for all four molecules. In most cases, however, both n → π ∗ and π → R excitation energies are underestimated, sometimes by as much as 0.4
eV. In other cases, like thymine, there is good agreement even for the n → π ∗ states. Such reduced
accuracy of CC2 for n → π ∗ states has not been observed for other molecules. For example, in
the comprehensive study by Schreiber et al. 43 CC2 works equally well for n → π ∗ and π → π ∗
states of a large number of molecules including heterocycles. Note that in case of the nucleobases
Ref. 43 does not report triples results, however, CCSD and CC2 excitation energies agree very well.
This contradicts our findings, since we observe substantial differences between CCSD and CC2
excitation energies in most cases. We could not find any explanation for this discrepancy, but we
note that one important difference between the two studies is the lack of diffuse function in Ref. 43
It is out of the scope of the present paper to explain the failure of CC2, in particular for n →
π ∗ states. However, a quick analysis of the character of the excited states shows that for most
nucleobases the n → π ∗ states arise either from a nitrogen or an oxygen lone pair. They are the
two states of cytosine (showing the largest discrepancy) where strong interaction of the N and O
lone pairs could be observed for the hole orbital. On the other hand, in case of thymine, where
the error of CC2 seems to be the smallest, both n → π ∗ excitations are localized on one of the CO
fragments. More detailed analysis involving other model systems will be presented elsewhere.
TDDFT/B3LYP gives good excitation energies for the lowest π → π ∗ states (cytosine, adenine
and guanine) but it fails already if there is another lower state, like in case of thymine where there
is an n → π ∗ state close by. The discrepancy to CCSD(T) grows with the excitation energy so
TDDFT/B3LYP does not seem to be good enough to predict even the observable UV spectrum
of the nucleobases. Note that we compare only with one set of results obtained by the B3LYP
15
ACS Paragon Plus Environment
The Journal of Physical Chemistry
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
functional and we do not investigate whether other functionals are more reliable. In this respect
the recent paper of Silva-Junior et al. 46 states that B3LYP clearly outperforms other functionals
like BP86 and BHLYP. Since this conclusion is based on a large number of exited states (same
set as the one used by Schreiber et al. 43 ), this conclusion supports the representativeness of the
TDDFT/B3LYP numbers. Note that according to the Table III of Ref., 46 DFT/MRCI 82 results
also are no better than the TDDFT/B3LYP results for singlet states. Tables Table 2-Table 5 of the
present paper confirm this findings.
The data presented in this paper suggest that only the EOM-CC methods with the triple excitation correction can give a balanced description of all of the states of these four nucleobases. If the
goal is to obtain very precise results with uncertainty of about 0.1 eV, triple excitations in the form
of the EOM-CCSD(T) method needs to be included. The often used CC2, TDDFT and CASPT2
methods do not seem to fullfil this requirement.
In our forthcoming papers 75,76 we will investigate the effect of hydration, the effect of sugar
bound in nucleotides, and that of nearby nucleobases either as Watson-Crick base pairs, or in
stacked configurations. To gain unambiguous information on the electronic properties of these
systems the EOM-CCSD(T) method will be used. These calculation even at the CCSD(T) level
are, however, very expensive but considering the above describe uncertainty of the lower level
methods, one has to go into this endeavor and establish a benchmark set for these systems, as well.
These can then be used to evaluate the cheaper methods like CC2 and various TDDFT variants. A
detailed understanding of the phenomenon like electron and charge transfer certainly requires this
kind of precision at least in the early stage of the research.
Acknowledgements
P.G.S. acknowledges financial support by the Hungarian American Enterprise Scholarship Fund
(HAESF) during his sabbatical stay at the University of Florida. Contribution from Orszagos
Tudomanyos Kutatasi Alap (OTKA; Grant No. F72423) and TAMOP, supported by the European
16
ACS Paragon Plus Environment
Page 16 of 27
Page 17 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
Union and cofinanced by the European Social Fund (Grant Agreement No. TAMOP 4.2.1/B09/1/KMR-2010-0003) is also acknowledged. Most of the calculations have been performed on
DoD computer Chugach at the Arctic Region Supercomputer Center and financed by the U.S.
ARO, grant No.: 54344CH.
References
(1) Crespo-Hernandez, C.; Cohen, B.; Hare, P.; Kohler, B. Chem. Rev. 2004, 104, 1977–2019.
(2) Middleton, C. T.; de La Harpe, K.; Su, C.; Law, Y. K.; Crespo-Hernandez, C. E.; Kohler, B.
Annu. Rev. Phys. Chem. 2009, 60, 217–239.
(3) Barbatti, M.; Aquino, A. J. A.; Szymczak, J. J.; Nachtigallova, D.; Hobza, P.; Lischka, H.
Proc. Natl. Acad. Sci. USA 2010, 107, 21453–21458.
(4) Shukla, M.; Leszczynski, J. Radiation Induced Molecular Phenomena in Nucleic Acids,
Springer; 2008.
(5) Shukla, M. K.; Leszczynski, J. J. Biomol. Struct. Dyn. 2007, 25, 93–118.
(6) Puzzarini, C.; Barone, V. Phys. Chem. Chem. Phys. 2011, 13, 7158–7166.
(7) Tajti, A.; Fogarasi, G.; Szalay, P. G. ChemPhysChem 2009, 10, 1603–1606.
(8) Barbatti, M.; Aquino, A. J. A.; Lischka, H. Phys. Chem. Chem. Phys. 2010, 12, 4959–4967.
(9) Bazso, G.; Tarczay, G.; Fogarasi, G.; Szalay, P. G. Phys. Chem. Chem. Phys. 2011, 13, 6799–
6807.
(10) Chen, H.; Li, S. J. Phys. Chem. A 2005, 109, 8443–8446.
(11) Perun, S.; Sobolewski, A.; Domcke, W. Chem. Phys. 2005, 313, 107–112.
(12) Perun, S.; Sobolewski, A.; Domcke, W. J. Am. Chem. Soc 2005, 127, 6257–6265.
17
ACS Paragon Plus Environment
The Journal of Physical Chemistry
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
(13) Zgierski, M.; Patchkovskii, S.; Fujiwara, T.; Lim, E. J. Phys. Chem. A 2005, 109, 9384–9387.
(14) Tomic, K.; Tatchen, J.; Marian, C. J. Phys. Chem. A 2005, 109, 8410–8418.
(15) Marian, C. J. Chem. Phys. 2005, 122, 104314.
(16) Gustavsson, T.; Banyasz, A.; Lazzarotto, E.; Markovitsi, D.; Scalmani, G.; Frisch, M.;
Barone, V.; Improta, R. J. Am. Chem. Soc 2006, 128, 607–619.
(17) Serrano-Andres, L.; Merchan, M.; Borin, A. Proc. Natl. Acad. Sci. USA 2006, 103, 8691–
8696.
(18) Chung, W. C.; Lan, Z.; Ohtsuki, Y.; Shimakura, N.; Domcke, W.; Fujimura, Y. Phys. Chem.
Chem. Phys. 2007, 9, 2075–2084.
(19) Marian, C. M. J. Phys. Chem. A 2007, 111, 1545–1553.
(20) Zechmann, G.; Barbatti, M. J. Phys. Chem. A 2008, 112, 8273–8279.
(21) Asturiol, D.; Lasorne, B.; Robb, M. A.; Blancafort, L. J. Phys. Chem. A 2009, 113, 10211–
10218.
(22) Langer, H.; Doltsinis, N.; Marx, D. ChemPhysChem 2005, 6, 1734–1737.
(23) Merchan, M.; Gonzalez-Luque, R.; Climent, T.; Serrano-Andres, L.; Rodriuguez, E.;
Reguero, M.; Pelaez, D. J. Phys. Chem. B 2006, 110, 26471–26476.
(24) Groenhof, G.; Schaefer, L. V.; Boggio-Pasqua, M.; Goette, M.; Grubmueller, H.; Robb, M. A.
J. Am. Chem. Soc 2007, 129, 6812–6819.
(25) Hudock, H. R.; Levine, B. G.; Thompson, A. L.; Satzger, H.; Townsend, D.; Gador, N.;
Ullrich, S.; Stolow, A.; Martinez, T. J. J. Phys. Chem. A 2007, 111, 8500–8508.
(26) Yamazaki, S.; Domcke, W.; Sobolewski, A. L. J. Phys. Chem. A 2008, 112, 11965–11968.
(27) Lei, Y.; Yuan, S.; Dou, Y.; Wang, Y.; Wen, Z. J. Phys. Chem. A 2008, 112, 8497–8504.
18
ACS Paragon Plus Environment
Page 18 of 27
Page 19 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
(28) Hudock, H. R.; Martinez, T. J. ChemPhysChem 2008, 9, 2486–2490.
(29) Fabiano, E.; Thiel, W. J. Phys. Chem. A 2008, 112, 6859–6863.
(30) Barbatti, M.; Lischka, H. J. Am. Chem. Soc 2008, 130, 6831–6839.
(31) Szymczak, J. J.; Barbatti, M.; Hoo, J. T. S.; Adkins, J. A.; Windus, T. L.; Nachtigallova, D.;
Lischka, H. J. Phys. Chem. A 2009, 113, 12686–12693.
(32) Mitric, R.; Werner, U.; Wohlgemuth, M.; Seifert, G.; Bonacic-Koutecky, V. J. Phys. Chem. A
2009, 113, 12700–12705.
(33) Lan, Z.; Fabiano, E.; Thiel, W. ChemPhysChem 2009, 10, 1225–1229.
(34) Lan, Z.; Fabiano, E.; Thiel, W. J. Phys. Chem. B 2009, 113, 3548–3555.
(35) Improta, R.; Barone, V.; Lami, A.; Santoro, F. J. Phys. Chem. B 2009, 113, 14491–14503.
(36) Alexandrova, A. N.; Tully, J. C.; Granucci, G. J. Phys. Chem. B 2010, 114, 12116–12128.
(37) Barbatti, M.; Szymczak, J. J.; Aquino, A. J. A.; Nachtigallova, D.; Lischka, H. J. Chem. Phys.
2011, 134, 014304.
(38) Barbatti, M.; Aquino, A. J. A.; Szymczak, J. J.; Nachtigallova, D.; Lischka, H. Phys. Chem.
Chem. Phys. 2011, 13, 6145–6155.
(39) Nachtigallova, D.; Aquino, A. J. A.; Szymczak, J. J.; Barbatti, M.; Hobza, P.; Lischka, H. J.
Phys. Chem. A 2011, 115, 5247–5255.
(40) Lorentzon, J.; Fulscher, M.; Roos, B. O. J. Am. Chem. Soc 1995, 117, 9265–9273.
(41) Fulscher, M.; Roos, B. O. J. Am. Chem. Soc 1995, 117, 2089–2095.
(42) Fulscher, M.; SerranoAndres, L.; Roos, B. O. J. Am. Chem. Soc 1997, 119, 6168–6176.
(43) Schreiber, M.; Silva, M. R. J.; Sauer, S. P. A.; Thiel, W. J. Chem. Phys. 2008, 128, 134110.
19
ACS Paragon Plus Environment
The Journal of Physical Chemistry
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
(44) Fleig, T.; Knecht, S.; Haettig, C. J. Phys. Chem. A 2007, 111, 5482–5491.
(45) Shukla, M.; Leszczynski, J. J. Comput. Chem. 2004, 25, 768–778.
(46) Silva-Junior, M. R.; Schreiber, M.; Sauer, S. P. A.; Thiel, W. J. Chem. Phys. 2008, 129,
104103.
(47) Matsika, S. J. Phys. Chem. A 2004, 108, 7584–7590.
(48) Kowalski, K.; Valiev, M. J. Phys. Chem. A 2008, 112, 5538–5541.
(49) Sekino, H.; Bartlett, R. J. Int. J. Quantum Chem. 1984, S18, 255.
(50) Monkhorst, H. J. Int. J. Quantum Chem. 1977, S11, 421.
(51) Stanton, J. F.; Bartlett, R. J. J. Chem. Phys. 1993, 98, 7029–7039.
(52) Comeau, D. C.; Bartlett, R. J. Chem. Phys. Lett. 1993, 207, 414.
(53) Koch, H.; Jørgensen, P. J. Chem. Phys. 1990, 93, 3333.
(54) Bartlett, R. J.; Musial, M. Rev. Mod. Phys. 2007, 79, 291–352.
(55) Bartlett, R. J. WIREs Comput. Mol. Sci. 2012, 2, 126–138.
(56) Kucharski, S.; Wloch, M.; Musial, M.; Bartlett, R. J. J. Chem. Phys. 2001, 115, 8263–8266.
(57) Noga, J.; Bartlett, R. J.; Urban, M. Chem. Phys. Lett. 1987, 134, 126–132.
(58) Watts, J. D.; Bartlett, R. J. Chem. Phys. Lett. 1995, 233, 81–87.
(59) Watts, J. D.; Bartlett, R. J. Chem. Phys. Lett. 1996, 258, 581–588.
(60) Christiansen, O.; Koch, H.; Jørgensen, P. J. Chem. Phys. 1995, 103, 7429–7441.
(61) Taube, A. G.; Bartlett, R. J. J. Chem. Phys. 2008, 128, 044110.
(62) Bartlett, R. J. Mol. Phys. 2010, 108, 2905–2920.
20
ACS Paragon Plus Environment
Page 20 of 27
Page 21 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
(63) Dunning, T. H. J. Chem. Phys. 1989, 90, 1007.
(64) Lotrich, V.; Flocke, N.; Ponton, M.; Yau, A. D.; Perera, A.; Deumens, E.; Bartlett, R. J. J.
Chem. Phys. 2008, 128, 194104.
(65) ACES III is a massively parallel program for coupled-cluster and MBPT calculations
for molecular structure and spectra. Authors, V. Lotrich, M. Ponton, A. Perera, T. Watson, N. Jindal, T. Hughes, D. Lyakh, R. Bhoj, E. Deumens, B. Sanders, and R. J.
Bartlett, Quantum Theory Project, University of Florida. The program is available at
http://www.qtp.ufl.edu/ACES/index.shtml.
(66) CFOUR, a quantum chemical program package written by J.F. Stanton, J. Gauss, M.E. Harding, P.G. Szalay with contributions from A.A. Auer, R.J. Bartlett, U. Benedikt, C. Berger,
D.E. Bernholdt, O. Christiansen, M. Heckert, O. Heun, C. Huber, D. Jonsson, J. Jusélius, K.
Klein, W.J. Lauderdale, D. Matthews, T. Metzroth, D.P. O’Neill, D.R. Price, E. Prochnow, K.
Ruud, F. Schiffmann, S. Stopkowicz, A. Tajti, M.E. Varner, J. Vázquez, F. Wang, J.D. Watts
and the integral packages MOLECULE (J. Almlöf and P.R. Taylor), PROPS (P.R. Taylor),
ABACUS (T. Helgaker, H.J. Aa. Jensen, P. Jørgensen, and J. Olsen), and ECP routines by A.
V. Mitin and C. van Wüllen. For the current version, see http://www.cfour.de.
(67) Fogarasi, G.; Zhou, X. F.; Taylor, P. W.; Pulay, P. J. Am. Chem. Soc 1992, 114, 8191–8201.
(68) Császár, P.; Pulay, P. J. Mol. Struct. 1984, 114, 31.
(69) Pulay, P.; Fogarasi, G. J. Chem. Phys. 1992, 96, 2856–2860.
(70) Bakken, V.; Helgaker, T. J. Chem. Phys. 2002, 117, 9160–9174.
(71) Christiansen, O.; Koch, H.; Jørgensen, P. Chem. Phys. Lett. 1995, 243, 409–418.
(72) Watson, T.; Lotrich, V. F.; Szalay, P. G.; Bartlett, R. J. 2012, Triple Excitations in EOM-CC
methods, to be bulished.
21
ACS Paragon Plus Environment
The Journal of Physical Chemistry
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
(73) Kucharski, S.; Bartlett, R. J. J. Chem. Phys. 1998, 108, 5243–5254.
(74) Crawford, T. D.; Stanton, J. F. Int. J. Quantum Chem. 1998, 70, 601–611.
(75) Szalay, P. G.; Watson, T.; Lotrich, V. F.; Perera, A.; Fogarasi, G.; Bartlett, R. J. 2012, Effect
of biological environment on the excitation spectrum of cytosine, to be submitted.
(76) Szalay, P. G.; Watson, T.; Lotrich, V. F.; Perera, A.; Bartlett, R. J. 2012, Systematic Coupled
Cluster study of Watson-Crick and stacked base pairs, to be submitted.
(77) Lotrich, V. F.; Ponton, J. M.; Perera, A. S.; Deumens, E.; Bartlett, R. J.; Sanders, B. A. Mol.
Phys. 2010, 108, 3323–3330.
(78) Deumens, E.; Lotrich, V. F.; Perera, A.; Ponton, M. J.; Sanders, B. A.; Bartlett, R. J. WIREs
Comput. Mol. Sci. 2011, 1, 895–901.
(79) Fogarasi, G. J. Phys. Chem. A 2002, 106, 1381–1390.
(80) Fogarasi, G.; Szalay, P. G. Chem. Phys. Lett. 2002, 356, 383–390.
(81) Guerra, C.; Bickelhaupt, F.; Saha, S.; Wang, F. J. Phys. Chem. A 2006, 110, 4012–4020.
(82) Grimme, S.; Waletzke, M. J. Chem. Phys. 1999, 111, 5645–5655.
22
ACS Paragon Plus Environment
Page 22 of 27
Page 23 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
The Journal of Physical Chemistry
Table 1: Comparsion of various triples methods for the excitation energy (eV) and oscillator
strength of cytosine. All calculations with aug-pVDZ bases, frozen core
2A0
3A0
4A0
5A0
6A0
7A0
1A”
2A”
3A”
4A”
5A”
6A”
7A”
8A”
CCSD
4.94 0.064
5.86 0.164
6.50 0.508
6.70 0.026
6.88 0.181
7.05 0.007
5.56 0.004
5.46 0.003
6.04 0.002
6.06 0.005
6.19 0.007
6.34 0.000
6.51 0.005
6.82 0.000
CC3
4.71 0.065
5.55 0.138
6.30 0.426
6.43 0.025
6.62
5.46
5.18
5.94
5.97
5.60
CCSD(T)
4.74
5.62
6.35
6.57
6.69
6.93
5.49
5.25
5.91
5.96
6.08
5.90
6.43
6.73
23
ACS Paragon Plus Environment
CCSDT-3
4.79
5.65
6.38
6.57
6.7
5.52
5.29
6.00
6.05
5.79
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
π → π∗
n → π∗
π →R
π−1 → π ∗
n, n−1 → 2π ∗
π−1 → R
π →R
n−1 → π ∗
π → 2π ∗
π →R
n→R
π−1 → 2π ∗
EOMEE-CCSD
EOMEE-CCSD(T)
CC2-LRa
TDDFTb
aug-cc-pVTZ
aug-cc-pVTZ
aug-cc-pVTZ
B3LYP/6-311++(d,p)
4.96
0.066
4.69
4.66 0.052
4.63
0.046
5.44
0.003
5.18
4.87 0.002
4.73
0.003
5.73
0.004
5.59
5.53 0.005
5.11
0.001
5.89
0.171
5.60
5.61 0.138
5.44
0.074
6.06
0.000
5.82
5.26 0.002
5.28e
0.005
6.23
0.006
6.10
6.08 0.026
5.65
0.001
e
6.35
0.007
6.18
5.70
0.015
6.40
0.000
5.88
5.83 0.000
6.40 f
0.023
f
6.53
0.507
6.33
6.25
0.149
f
6.65
0.004
6.50
6.62
0.419
6.87
0.026
6.68
5.95 0.031
6.90
0.127
6.69
a Ref. 44 b Ref. 45 c Ref. 46 d Ref. 43 e Strong π → π ∗ component. f π → σ ∗ .
5.54 0.352
5.54 0.002
6.40
6.98
5.43
5.32
6.38
6.74
ACS Paragon Plus Environment
24
0.366
0.623
CASPT2d
TZVP
4.68 0.093
5.12 0.003
DFT/MRCIc
TZVP
4.62
4.86
Table 2: Comparsion of various methods for the excitation energy (eV) and oscillator strength of cytosine
The Journal of Physical Chemistry
Page 24 of 27
π → π∗
π → 2π ∗
n → 2π ∗
π →R
π →R
n → π∗
π →R
n−1 → 2π ∗
π−1 → π ∗ + 2π ∗
π−2 → π ∗ + 2π ∗
n→R
EOMEE-CCSD
EOMEE-CCSD(T)
CC2-LRa
TDDFTb
DFT/MRCIc
aug-cc-pVTZ
aug-cc-pVTZ
aug-cc-pVTZ
B3LYP/6-311++(d,p)
TZVP
5.30
0.021
5.04
5.25
–
4.98
0.205
4.99
5.47
0.275
5.23
5.25 0.302
5.21
0.023
5.15
5.54
0.001
5.28
5.12 0.007
4.88
0.013
5.11
5.70
0.009
5.58
5.53 0.011
5.28
0.008
6.04
0.001
5.91
5.86 0.004
5.59
0.007
6.10
0.003
5.84
5.75 0.003
5.55
0.002
5.72
6.49
0.001
6.38
6.52
0.002
6.27
6.14 0.001
5.82
0.001
6.58
0.478
6.36
6.14
0.095
6.29
6.68
0.035
6.53
6.20
0.200
6.19
–e
6.08 0.030
a Ref. 44 b Ref. 45 c Ref. 46 d Ref. 43 e n → R state has not been found among the first 12 states.
0.002
6.35 0.538
6.64 0.001
5.97
CASPT2d
TZVP
5.20 0.146
5.30 0.201
5.21 0.001
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Table 3: Comparsion of various methods for the excitation energy (eV) and oscillator strength of adenine
Page 25 of 27
The Journal of Physical Chemistry
ACS Paragon Plus Environment
25
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
EOMEE-CCSD
EOMEE-CCSD(T)
CC2-LRa
TDDFTb
CASPT2c
aug-cc-pVTZ
aug-cc-pVTZ
aug-cc-pVTZ
B3LYP/6-311++(d,p)
TZ with diffuse
π →R
5.07
0.002
4.91
5.08 0.028
5.08d
0.008
4.89d 0.005
π → π∗
5.10
0.160
4.86
4.98 0.132
4.88
0.122
4.51
0.164
π →R
5.47
0.005
5.32
5.43 0.141
π → 2π ∗ + R 5.61
0.366
5.37
5.47 0.179
5.18
0.224
5.25
0.080
∗
n→π
5.64
0.000
5.43
5.38 0.003
5.30
0.002
5.22
0.001
π →R
5.98
0.001
5.84
5.99 0.003
5.69
0.002
π →R
6.11
0.001
5.98
5.75
0.000
π →R
6.29
0.010
6.15
5.92
0.002
π → R(π)
6.41
0.000
6.28
6.67
0.038
π → 3π ∗
6.49
0.024
6.26
n → 2π ∗
6.59
0.003
6.37
6.07 0.006
6.06
0.005
π →R
6.79
0.001
6.51
n→R
6.13
0.008
a Ref. 44 b Ref. 45 c Ref., 26 (10s6p1d/5s1p)/[5s3p1d/3s1p] basis set of triple-ζ plus polarization quality. d Assigned as π → σ ∗
Table 4: Comparsion of various methods for the excitation energy (eV) and oscillator strength of guanine
The Journal of Physical Chemistry
ACS Paragon Plus Environment
26
Page 26 of 27
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
n → π∗
π → π∗
π →R
n−1 → 2π ∗
π → 2π ∗
π →R
π → 3π ∗
π →R
π →R
n→R
π−1 → R
n−1 → π ∗
EOMEE-CCSD
aug-cc-pVTZ
5.09
0.000
5.36
0.213
5.90
0.001
6.46
0.00
6.59
0.060
6.68
0.001
6.76
0.236
6.84
0.000
7.11
0.029
7.30
0.010
7.47
0.000
7.51
0.001
EOMEE-CCSD(T)
CC2-LRa
TDDFTb
aug-cc-pVTZ
aug-cc-pVTZ
B3LYP/6-311++(d,p)
4.85
4.82 0.000
4.71
0.000
5.15
5.20 0.182
4.95
0.138
5.77
5.74 0.000
5.44
0.000
6.27
6.16 0.000
5.82
0.000
6.26
6.27 0.037
5.92
0.068
6.54
6.49 0.000
6.15
0.000
6.46
6.53 0.178
6.22
0.130
6.70
6.97
7.08
6.39 0.061
7.29
6.96
6.20
0.000
a Ref. 44 b Ref. 45 c Ref. 46 d Ref. 43
6.38
6.15
6.52
6.86
6.42
6.43
ACS Paragon Plus Environment
27
0.000
0.356
0.000
0.067
CASPT2d
TZVP
4.94 0.000
5.06 0.334
5.93
5.98
DFT/MRCIc
TZVP
4.48
5.18
Table 5: Comparsion of various methods for the excitation energy (eV) and oscillator strength of thymine
Page 27 of 27
The Journal of Physical Chemistry