Journal of Molecular Graphics and Modelling 20 (2001) 36–53 Molecular structure–property relationships for alkenes夽 Steven D. Nelson, Paul G. Seybold∗ Department of Chemistry, Wright State University, Dayton, OH 45435, USA Received 28 November 2000; received in revised form 7 February 2001; accepted 7 February 2001 Abstract Structure–property relationships were obtained for 11 physical and chemical properties (boiling points (bp), melting points (mp), molar refractions (MR), molar volumes (MV), heats of combustion (HCKJ), molar heats of vaporization (HVMOL), flashpoints (FLASHK), second virial coefficients (VIRC2), critical temperatures (Tc ), critical pressures (Pc ), and viscosities (VISC)) for a data set consisting of 162 C4–C9 monoalkenes. Both molecular connectivity indices and ad hoc descriptors were tested as structural descriptors, and both produced high-quality regression equations for most of the properties. As was observed in an earlier study of alkanes [J. Am. Chem. Soc. 110 (1988) 4186], mp were not well described by either descriptor set. For most properties, the mass/bulk of the molecule was found to be the most important structural feature determining the property, suggesting that dispersion forces play a dominant role in determining those properties influenced by intermolecular interactions. The amount of branching in the molecule and the nature of the double bond environment were also found to be influential features. © 2001 Elsevier Science Inc. All rights reserved. Keywords: Structure–property relationships; Alkenes; Principal component analysis (PCA) 1. Introduction One of the most fundamental ideas of chemistry is that the physical and chemical properties of a substance are determined, somehow, by its molecular structure — the term ‘structure’ taken here in its broadest sense to include both geometric and electronic aspects. Historically, the difficulty associated with this simple proposition has centered on how to implement it in practice, since finding appropriate mathematical terms to describe ‘molecular structure’ has not been easy. Such mathematical terms (‘descriptors’) are necessary if quantitative relationships are to be constructed between the structure of a compound and its properties. In principle, a full quantum mechanical treatment for a bulk property such as the boiling point (bp) should yield a solution to this conundrum, but in practice a full-blown, accurate quantum mechanical treatment of a sizable collection of molecules is presently out of the question. One turns, therefore, to simpler approaches. One such approach is to attempt to identify suitable quantum chemical descriptors obtained from studies of isolated molecules. This approach, although quite successful in some cases, has in many other cases produced only mixed results. A simpler, alternative approach is to employ 夽 Taken in part from the Honors Thesis of SDN at Wright State University. ∗ Corresponding author. Tel.: +1-937-775-2407; fax: +1-937-775-2717. E-mail address: [email protected] (P.G. Seybold). topological descriptors, which fortunately often prove quite adequate to the task and, moreover, highly informative. Several benefits can be derived from structure–property relationship studies. For example, unmeasured properties of related compounds can be estimated using the equations derived from a structure–property study. On a more fundamental level, a clearer understanding of the roles that specific structural features play in determining properties can be drawn from the equations. And once such an understanding is achieved, this information can be used to design hypothetical structures that might have desirable property values. Finally, the structure–property equations can serve as a useful check on the accuracy of property values already reported in the literature, some of which may be measured incorrectly or misreported. In an earlier report [1] we applied topological descriptors in a study of eight physical properties of a set of normal and branched alkanes. In that study, it was found that of the topological descriptors examined, molecular connectivity indices [2,3] and ad hoc descriptors [1,4], were especially successful in yielding high-quality structure–property relationships. Good regression equations were obtained for seven of the physical properties of the alkanes (the melting points (mp), traditionally a subtle and difficult property to represent, were an exception). In this report, we employ these same descriptor types as structural measures for a study of the physical and chemical properties of a set of monoalkenes, where a new structural feature, the double bond, is introduced. Only 1093-3263/01/$ – see front matter © 2001 Elsevier Science Inc. All rights reserved. PII: S 1 0 9 3 - 3 2 6 3 ( 0 1 ) 0 0 0 9 9 - 7 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 a few previous QSPR studies have been devoted to the properties of this class of compounds [5–8], and these have generally been limited to single properties. 37 Molecular connectivity indices were originally developed by Randic’ [15], Kier and Hall [2,3]. The form of these descriptors is m v χt 2. Methods The properties examined in this study were: bp, molar refractions (MR), molar volumes (MV) at 20◦ C, heats of combustion (HCKJ), mp, molar heats of vaporization (HVMOL) at 25◦ C, flashpoints (FLASHK), second virial coefficients (VIRC2) at 25◦ C, critical temperatures (Tc ), critical pressures (Pc ), and viscosities (VISC) at 20◦ C. The property values were taken from reference sources [9–14]. MV were calculated as Mw /d, where Mw is the molecular weight, and d is the density (g/cm3 ) at 20◦ C. MR were calculated using the Lorentz–Lorenz expression (n2 − 1)/(n20 + 2) MR = 0 Mw /d where n0 is the index of refraction. The compounds examined and their shorthand abbreviations are listed in Appendix A. The property values for each compound are shown in Appendix B. The parameter values for each compound are shown in Appendix C. Two types of parameters were used in this study: ad hoc descriptors [1,4] and molecular connectivity indices [2,3]. The ad hoc descriptors used were as follows: the number of carbon atoms (NC) represents the mass or bulk of the molecule. The square of the number of carbons (NCSC) and the square root (NCSR) account for non-linear aspects of the bulk dependence. The number of terminal methyl groups (TM) is a measure of the amount of branching in the molecule. The number of paths of length three carbon–carbon single bonds (P3S) is a steric index. The descriptor exterior double bonds (DBE) indicates whether the double bond is exterior or interior. The number of carbon atoms bonded to the double bond carbons (NCDB) measures the amount of crowding in the double bond environment. Several additional ad hoc indices were examined, but did not significantly improve the regression results. where m is the order of the substructure, indicating the number of bonds, and t is the substructure type indicator. The substructure types included were path (p), cluster (c), path/cluster (pc), double bond (DB), and total (t) [1]. Only the valence (v) type indices, i.e. those which take explicit account of heteroatoms and multiple bonds, have been used in this study. The construction of these indices is described in detail in the alkane study [1] and elsewhere [2,3]. The molecular connectivity indices, used in this study, were calculated using Hall’s MOLCONN2 software package [16]. Here, these indices are identified as XV(t)(m)(f ) where f is the functional form (I = inverse, SQ = square, SR = square root). For example, XV1SR is the square root of the first order valence index, and XVPC4 is the fourth order pc index. Also included were the numbers of three-bond clusters (NXC3) and paths length of six bonds (NXP6). Altogether, 22 connectivity index terms were included in the initial screening. Regression equations and other statistical measures were obtained using options in the SAS software package [17] on the Wright State University IBM 8083E computer. The final equations were selected on the basis of their standard errors (S.E.) and F-statistics. Principal component analysis (PCA) was used to determine the inherent dimensionality of the groups of properties. Orthogonal and oblique rotations did not significantly improve the results. 3. Results 3.1. Correlation analysis The correlations among the properties examined are shown in Table 1. As can be seen, most of the properties Table 1 Correlations among the properties examined Bp MR MV HCKJ HVMOL FLASHK VIRC2 Tc Pc VISC20 Mp Bp MR MV HCKJ HVMOL FLASHK VIRC2 Tc Pc VISC20 Mp 1.000 0.967 0.946 0.970 0.993 0.933 −0.960 0.996 −0.950 0.659 0.664 1.000 0.992 0.996 0.921 0.905 −0.981 0.965 −0.940 0.824 0.640 1.000 0.992 0.903 0.844 −0.969 0.979 −0.958 0.804 0.614 1.000 0.938 0.897 −0.976 0.989 −0.958 0.823 0.525 1.000 0.878 −0.974 0.998 −0.920 0.602 0.482 1.000 −0.975 0.997 −0.825 0.431 0.518 1.000 −0.942 0.903 −0.981 −0.543 1.000 −0.954 0.925 0.607 1.000 −1.000 −0.467 1.000 0.844 1.000 38 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 Table 2 Multiple regression equation for the properties using ad hoc descriptors Bp (◦ C) −275.58(±3.51) + 139.31(±1.40) × NCSR + 7.49(±0.36) × NCDB − 7.64(±0.26) × TM − 5.93(±0.62) × DBE + 1.74(±0.19) × P3S; n = 162, r2 = 0.9941, S.E. = 2.28, F = 5252 MR (cm3 /mol) 20.65(±2.20) + 7.44(±0.35) × NC − 14.53(±1.76) × NCSR − 0.0862(±0.0130) × P3S; n = 156, r2 = 0.9976, S.E. = 0.2224, F = 21114 MV (cm3 /mol) 26.27(±0.66) + 17.33(±0.12) × NC − 1.99(±0.11) × NCDB − 1.27(±0.07) × P3T + 0.87(±0.11) × TM; n = 156, r2 = 0.9971, S.E. = 0.8547, F = 13105 HCKJ (kJ/mol) 84.57(±2.89) + 659.83(±0.45) × NC − 8.83(±0.53) × TM; n = 65, r2 = 1.0000, S.E. = 3.37, F = 1000000 Mp (◦ C) −169.58(±7.06) + 0.9947(±0.1508) × NCSO; n = 57, r2 = 0.4416, S.E. = 17.43, F = 44 HVMOL (J/mol) −27094(±1150) + 24224(±490) × NCSR + 1984(±192) × NCDB − 1972(±134) × TM − 1047(±237) × DBE; n = 34, r2 = 0.9896, S.E. = 503, F = 690 FLASHK (K) 22.08(±27.94) + 92.78(±11.25) × NCSR; n = 18, r2 = 0.8096, S.E. = 5.81, F = 68 VIRC2 (cm3 /mol) 3630(±597) + 111.5(±6.3) × NCSO + 2427(±343) × NCSR − 68.5(±16.7) × NCDB; n = 14, r2 = 0.9978, S.E. = 50.8, F = 1478 Tc (◦ C) −63.73(±11.54) + 64.51(±3.90) × NC·2.30(±0.31) × NCSO − 10.42(±1.50) × DBE; n = 13, r2 = 0.9981, S.E. = 2.45, F = 1607 Pc (MPa) 7.585(±0.319) − 1.789(±0.139) × NCSR; n = 12, r2 = 0.9432, S.E. = 0.128, F = 166 VISC20 (cP) 0.08661(±0.03813) + 0.00495(±0.00106) × NCSO + 0.00884(±0.00167) × TMSQ; n = 6, r2 = 0.9725, S.E. = 0.01843, F = 53 are highly correlated with one another, with the exception of mp, which is poorly correlated with the other properties. VISC is another possible exception. The remaining nine properties all have correlation coefficients greater than 0.82, and the subset of bp, MR, MV, and HV all have correlations greater than 0.90. 3.2. Multiple regression analysis Table 2 shows the most successful ad hoc descriptor models with their coefficients of determination (r2 ), S.E., and F-values, and Table 3 shows these measures for the most successful molecular connectivity index models. Most of the Table 3 Multiple regression equation for the properties using connectivity indices Bp (◦ C) 15.04(±16.81) + 74.70(±6.08) × XV1 + 307.40(±15.99) × XVDB·81.79(±5.36) × XV0 + 28.49(±1.69) × XVP3 + 51.20(±3.29) × XVZ + 157.89(±16.49) × XV1SR; n = 162, r2 = 0.9945, S.E. = 2.20, F = 46.80 MR (cm3 /mol) 1.91(±0.57) + 8.87(±0.14) × XV1 + 2.19(±0.04) × XV2 + 14.00(±1.26) × XVT; n = 156, r2 = 0.9973, S.E. = 0.2374, F = 18527 MV (cm3 /mol) 7.79(±0.89) + 14.19(±0.36) × XV1 + 14.21(±0.24) × XV0 + 59.32(±1.58) × XVDB + 0.14(±0.26) × XVP3; n = 156, r2 = 0.9973, S.E. = 0.8340, F = 13766 HCKJ (kJ/mol) 807.59(±10.34) + 1038.90(±3.81) × XV1 + 308.07(12.51) × XV2 + 103.80(±4.82) × XVP3 + 35.22(±2.68) × XVDB1; n = 65, r2 = 0.9998, S.E. = 6.44, F = 90708 Mp (◦ C) −414.50(±50.25) + 77.34(±12.22) × XV1 + 491.59(±107.89) × XVT; n = 57, r2 = 0.5510, S.E. = 15.77, F = 33 HVMOL (J/mol) −19604(±11.63) + 27409(±852) × XV1SR + 2644(±246) × XVDB1 + 1950(±282) × NXP6; n = 34, r2 = 0.9884, S.E. = 523, F = 852 FLASHK (K) −113.57(±59.27) + 208.28(±30.67) × XV1SR + 21291(±64.76) × XVT; n = 18, r2 = 0.8982, S.E. = 44.5, F = 64 VIRC2 (cm3 /mol) 1310.1(±145.7) + 405.7(±11.5) × XV1SQ + 3421(±402) × XVT − 336(±46) × XVPC4; n = 14, r2 = 0.9982, S.E. = 44.5, F = 1901 Tc (◦ C) −440.18(±56.79) + 416.76(±35.72) × XV1SR + 308.33(±50.29) × XVT + 5.65(±1.22) × XVT12; n = 13, r2 = 0.9974, S.E. = 2.90, F = 1151 Pc (MPa) 7.592(±0.336) + 1.051(±0.100) × XV0 + 0.141(±0.025) × XV1SQ + 4.135(±0.795) × XVDB2; n = 12, r2 = 0.9878, S.E. = 0.066, F = 215 VISC20 (cP) 0.2466(±0.0053) + 0.0454(±0.0023) × NXC3 + 0.1034(±0.0108) × NXP6; n = 6, r2 = 0.9028, S.E. = 0.0094, F = 208 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 Fig. 1. Plot of calculated (ad hoc) boiling points vs. experimental boiling points. Fig. 2. Plot of calculated (molecular connectivity) boiling points vs. experimental boiling points. 39 40 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 Table 4 Results of the principal components analysis for the properties: eigenvalues with cumulative fractional variance reproduced Factor Three properties (156 compounds) Four properties (57 compounds) Nine properties (11 compounds) 1 2 3 4 5 2.937(0.979) 0.058(0.998) 0.005(1.000) 3.446(0.862) 0.501(0.987) 0.047(0.999) 0.005(1.000) 7.632(0.848) 1.079(0.968) 0.148(0.984) 0.075(0.993) 0.055(0.999) properties are reasonably well modeled by these two types of descriptors (Figs. 1 and 2), with the exception of the mp, which were not well modeled by either descriptor set. The FLASHK are less well modeled than most other properties, but this is not entirely unexpected since this property is difficult to measure and some reported values may be relatively inaccurate. 3.3. Factor analysis Three factor analysis (PCA) studies were performed, focusing on differing sets of properties. These studies ranged from a ‘compound intensive’ study of just three properties (bp, MR, and MV), for which a large number of experimental values (156) were available, to a ‘property intensive’ study including nine properties, for which complete, matching data were available for only 11 compounds. The results from the first factor analysis, using only bp, MR and MV, are shown in Table 4. As can be seen above, the Table 5 PCA factor loadings for the physical properties Property Factor 1 Three properties Bp MR MV 0.981 0.997 0.990 2 3 0.191 −0.054 −0.135 0.014 −0.055 0.042 Four properties Bp MR MV Mp 0.976 0.982 0.972 0.766 −0.126 −0.174 −0.204 0.643 −0.177 0.056 0.109 0.015 Nine properties Bp MR MV Mp HCKJ VIRC2 Tc Pc VISC20 0.968 0.966 0.982 −0.062 0.998 −0.976 0.971 −0.954 0.996 0.163 −0.138 −0.093 0.993 −0.004 −0.143 0.106 0.078 −0.037 0.173 −0.110 −0.096 −0.100 0.017 −0.067 0.187 0.212 −0.036 three properties are highly correlated (Table 1) and a single factor dominates, accounting for 97.9% of the variation in these properties. As seen in Table 5, all three properties load strongly on the first factor. The data set is quite large (n = 156), and both low and high Mw compounds are well represented. Addition of mp to the analysis considerably reduced the size of the data set, to 57 compounds, but the set still retained a good sampling of both high and low Mw compounds and remained large enough for reliable statistical calculations. The results are shown in Tables 4 and 5. As was seen in Table 1, the mp were not highly correlated with the other properties. The property set now appears to be represented by two factors (Table 4). The first factor accounts for 86.2% of the variance. Including the second factor, related to mp, brings this total to 98.7%. Reminiscent of the alkane study [1], bp, MR, and MV load heavily on the first factor, whereas mp loads strongly on the second factor. Addition of the heat of combustion, the VIRC2, and the critical properties to the analysis markedly decreased the common compound population, and the population became strongly skewed toward the lower Mw monoalkenes. The molar heat of vaporization was excluded to increase the number of compounds from 8 to 11. The results are shown in Tables 4 and 5. This analysis is included only for the sake Table 6 Modeling of factor scores using structural descriptors Parameters r2 S.E. F Factor 1 NC NC, TMSQ NC, TMSQ, DBE XV1SR XV1SR, XV0 XV1SR, XV0, NXP6 0.9947 0.9966 0.9986 0.9684 0.9936 0.9960 0.0726 0.0585 0.0382 0.1783 0.0803 0.0638 29223 22553 35347 4720 11929 12630 Factor 2 NCDB NCDB, NCSR NCDB, NCSR, P3S XVTSQ XVTSQ, XV1SR XVTSQ, XV1SR, XV1 0.0757 0.1236 0.2225 0.1564 0.3121 0.3610 0.9701 0.9533 0.9063 0.9268 0.8446 0.8217 5 4 5 10 12 10 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 of completeness, since its statistical significance is questionable. However, it is interesting to note that the first factor continued to dominate for most properties, accounting for 83.7% of the variance. Addition of the second factor brought this up to 97.0%, and the addition of a third factor brought this up to 99.8%. In order to clarify, what the abstract factors represent, the factor scores were modeled using the ad hoc and connectivity descriptors. Factor 1 from the first analysis (n = 156) was employed to obtain a model for this factor, and Factor 2 was taken from the second analysis (n = 57) for the same purpose. The results are shown in Table 6. Factor 1 is clearly a bulk factor, depending on the leading descriptors, whereas Factor 2 was not well modeled by either of the descriptor sets. 4. Discussion The regression equations presented in Tables 2 and 3 are generally of high-quality for properties other than the mp. Therefore, property values estimated on the basis of these equations, with the exception of mp, should be sufficiently accurate for many practical purposes. The ad hoc descriptor equations in Table 2 show the relative influences of molecular mass/bulk (NC, NCSR and NCSQ), branching (TM), steric factors (P3S), and the double bond environment (DBE, NCDB) in determining the properties studied. As can be seen from the Table, the molecular mass/bulk clearly exerts the dominant influence for properties other than mp, suggesting that dispersion forces play a dominant role for those properties which depend on intermolecular forces. A similar conclusion was reached in the earlier alkane study [1,4]. This is a reasonable conclusion in the present case for bp, HVMOL, VIRC2, Tc , Pc , and VISC20. For MV the ‘mass/bulk’ dependence can be attributed directly to the larger volume of compounds with higher NC, as later modified by small corrections for branching and steric influences. Likewise, MR depends largely on the higher number of electrons in the larger compounds. For the two strictly “chemical” properties, the HCKJ and the flashpoints (FLASHK), the dependence on the mass/bulk dimension is more accurately attributed to the larger number of reacting bonds in the larger, higher NC compounds. Branching, steric factors, and the double bond environment exert smaller influences on the properties, as demonstrated by the coefficients in the ad hoc regression equations. Molecular branching, represented by the number of TM sequesters interior parts of these compounds and reduces the extent of contact between neighboring molecules. The latter effect is reflected in the positive influence of TM on the MV. Because dispersion forces are strongly dependent on distance — the interaction energies fall as 1/r6 , where r is the separation — a decrease in the amount of close contact decreases the cohesive forces 41 experienced by the compounds. Therefore, bp and HVKJ decrease as TM increases. Steric crowding, included here by the parameter P3S, leads to a small reduction in the MV, as reflected in the relatively small contributions to MV and MR from P3S. Whether the double bond is exterior or interior and the NSDB influence the accessibility of the double bond to its environment. When the double bond is exterior the bp, for example, is reduced on average by 6◦ C. Thus, DBE appear to exert a negative influence on the intermolecular forces. The NCDB exerts a positive influence on the intermolecular forces, possibly by effectively screening the effects of the double bond. Because of the above influences, those properties that depend on the strength of intermolecular forces, such as bp, HV, and Tc , show positive dependences on NC, NCSQ, NCSR, P3S, and NCDB, and negative dependences on TM and DBE. The failure of both parameter sets to model the mp is not surprising, since this property was also not well modeled by these same topological parameters in our earlier study of the alkanes [1,4]. This illustrates the greater subtlety of the melting transition as compared to the boiling and critical transitions. The latter transitions involve a direct dependence on the operative intermolecular forces, and so directly reflect the strengths of these forces. The melting transition, in contrast, maintains a condensed phase and involves a partial disruption of intermolecular orientations. Melting, thus, depends on geometric and other factors that are not well addressed by the present topological parameters. This dependence on shape and entropic factors, as opposed to a simple intermolecular force dependence, is reflected in the mp strong loading on the second factor, rather than the first (mass/bulk related) factor, in the factor analysis. Dearden has recently given a comprehensive review of mp predictions [18]. Factor analysis for the monoalkenes shows that a single factor dominates for most properties. In the worst case examined (Analysis 3), Factor 1 still accounted for nearly 84% of the variance. Addition of a second factor in this case raised the variance accounted to 97%. Factor 1 is clearly related to the mass/bulk of the molecules. As can be seen, in Table 6 both the ad hoc and connectivity index mass/bulk dependent descriptors give a good account of this factor. Factor 2, however, is a different story. From the results in Table 5, it is obvious that Factor 2 is related to the mp, but this factor was not well modeled by either the ad hoc or connectivity index descriptors employed in this study. Because of this, it is difficult to determine exactly what structural features are crucial for this dimension. Acknowledgements We thank Stephen Peterangelo for checking some of the regressions and preparing the figures. 42 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 Appendix A. Compounds Observation no. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 Name Shorthand name 1-Butene Cis-2-butene Trans-2-butene 2-Methyl propene 1-Pentene Cis-2-pentene Trans-2-pentene 2-Methyl-1-butene 3-Methyl-1-butene 2-Methyl-2-butene 1-Hexene Cis-2-hexene Trans-2-hexene Cis-3-hexene Trans-3-hexene 2-Methyl-1-pentene 3-Methyl-1-pentene 4-Methyl-1-pentene 2-Methyl-2-pentene 3-Methyl-cis-2-pentene 3-Methyl-trans-2-pentene 4-Methyl-cis-2-pentene 4-Methyl-trans-2-pentene 2-Ethyl-1-butene 2,3-Dimethyl-1-butene 3,3-Dimethyl-1-butene 2,3-Dimethyl-2-butene 1-Heptene Cis-2-heptene Trans-2-heptene Cis-3-heptene Trans-3-heptene 2-Methyl-1-hexene 3-Methyl-1-hexene 4-Methyl-1-hexene 5-Methyl-1-hexene 2-Methyl-2-hexene 3-Methyl-cis-2-hexene 3-Methyl-trans-2-hexene 4-Methyl-cis-2-hexene 4-Methyl-trans-2-hexene 5-Methyl-cis-2-hexene 5-Methyl-trans-2-hexene 2-Methyl-cis-3-hexene 2-Methyl-trans-3-hexene 3-Methyl-cis-3-hexene 3-Methyl-trans-3-hexene 2-Ethyl-1-pentene 3-Ethyl-1-pentene 2,3-Dimethyl-1-pentene 2,4-Dimethyl-1-pentene 3,3-Dimethyl-1-pentene 3,4-Dimethyl-1-pentene 4,4-Dimethyl-1-pentene 1N4 2C4 2T4 2M1N3 1N5 2C5 2T5 2M1N4 3M1N4 2M2N4 1N6 2C6 2T6 3C6 3T6 2M1N5 3M1N5 4M1N5 2M2N5 3M2C5 3M2T5 4M2C5 4M2T5 2E1N4 23M1N4 33M1N4 23M2N4 1N7 2C7 2T7 3C7 3T7 2M1N6 3M1N6 4M1N6 5M1N6 2M2N6 3M2C6 3M2T6 4M2C6 4M2T6 5M2C6 5M2T6 2M3C6 2M3T6 3M3C6 3M3T6 2E1N5 3E1N5 23M1N5 24M1N5 33M1N5 34M1N5 44M1N5 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 Appendix A (Continued) Observation no. Name Shorthand name 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 3-Ethyl-2-pentene 2,3-Dimethyl-2-pentene 2,4-Dimethyl-2-pentene 3,4-Dimethyl-cis-2-pentene 3,4-Dimethyl-trans-2-pentene 4,4-Dimethyl-cis-2-pentene 4, 4-Dimethyl-trans-2-pentene 2-Ethyl-3-methyl-1-butene 2,3,3-Trimethyl-1-butene 1-Octene Cis-2-octene Trans-2-octene Cis-3-octene Trans-3-octene Cis-4-octene Trans-4-octene 2-Methyl-1-heptene 3-Methyl-1-heptene 4-Methyl-1-heptene 5-Methyl-1-heptene 6-Methyl-1-heptene 2-Methyl-2-heptene 3-Methyl-cis-2-heptene 3-Methyl-trans-2-heptene 4-Methyl-cis-2-heptene 4-Methyl-trans-2-heptene 5-Methyl-cis-2-heptene 5-Methyl-trans-2-heptene 6-Methyl-cis-2-heptene 6-Methyl-trans-2-heptene 2-Methyl-cis-3-heptene 2-Methyl-trans-3-heptene 3-Methyl-cis-3-heptene 3-Methyl-trans-3-heptene 4-Methyl-cis-3-heptene 4-Methyl-trans-3-heptene 5-Methyl-cis-3-heptene 5-Methyl-trans-3-heptene 6-Methyl-cis-3-heptene 6-Methyl-trans-3-heptene 2-Ethyl-1-hexene 3-Ethyl-1-hexene 4-Ethyl-1-hexene 2,3-Dimethyl-1-hexene 2,4-Dimethyl-1-hexene 2,5-Dimethyl-1-hexene 3,3-Dimethyl-1-hexene 3,4-Dimethyl-1-hexene 3,5-Dimethyl-1-hexene 4,4-Dimethyl-1-hexene 4,5-Dimethyl-1-hexene 5,5-Dimethyl-1-hexene 3-Ethyl-cis-2-hexene 3-Ethyl-trans-2-hexene 3E2N5 23M2N5 24M2N5 34M2C5 34M2T5 44M2C5 44M2T5 2E3M1N4 233M1N4 1N8 2C8 2T8 3C8 3T8 4C8 4T8 2M1N7 3M1N7 4M1N7 5M1N7 6M1N7 2M2N7 3M2C7 3M2T7 4M2C7 4M2T7 5M2C7 5M2T7 6M2C7 6M2T7 2M3C7 2M3T7 3M3C7 3M3T7 4M3C7 4M3T7 5M3C7 5M3T7 6M3C7 6M3T7 2E1N6 3E1N6 4E1N6 23M1N6 24M1N6 25M1N6 33M1N6 34M1N6 35M1N6 44M1N6 45M1N6 55M1N6 3E2C6 3E2T6 43 44 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 Appendix A (Continued) Observation no. Name Shorthand name 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 4-Ethyl-cis-2-hexene 4-Ethyl-trans-2-hexene 2,3-Dimethyl-2-hexene 2,4-Dimethyl-2-hexene 2,5-Dimethyl-2-hexene 3,4-Dimethyl-cis-2-hexene 3,4-Dimethyl-trans-2-hexene 3,5-Dimethyl-cis-2-hexene 3,5-Dimethyl-trans-2-hexene 4,4-Dimethyl-cis-2-hexene 4,4-Dimethyl-trans-2-hexene 4,5-Dimethyl-cis-2-hexene 4,5-Dimethyl-trans-2-hexene 5,5-Dimethyl-cis-2-hexene 5,5-Dimethyl-trans-2-hexene 3-Ethy1-3-hexene 2,2-Dimethyl-cis-3-hexene 2,2-Dimethyl-trans-3-hexene 2,3-Dimethyl-cis-3-hexene 2,3-Dimethyl-trans-3-hexene 2,4-Dimethyl-cis-3-hexene 2,4-Dimethyl-trans-3-hexene 2,5-Dimethyl-cis-3-hexene 2,5-Dimethyl-trans-3-hexene 3,4-Dimethyl-cis-3-hexene 3,4-Dimethyl-trans-3-hexene 2-n-Propyl-1-pentene 2-Isopropyl-1-pentene 2-Ethyl-3-methyl-1-pentene 2-Ethyl-4-methyl-1-pentene 3-Ethyl-2-methyl-1-pentene 3-Ethyl-3-methyl-1-pentene 3-Ethyl-4-methyl-1-pentene 2,3,3-Trimethyl-1-pentene 2,3,4-trimethyl-1-pentene 2,4,4-Trimethyl-1-pentene 3,3,4-Trimethyl-1-pentene 3,4,4-Trimethyl-1-pentene 3-Ethyl-2-methyl-2-pentene 3-Ethyl-4-methyl-cis-2-pentene 3-Ethyl-4-methyl-trans-2-pentene 2,3,4-Trimethyl-2-pentene 2,4,4-Trimethyl-2-pentene 3,4,4-Trimethyl-cis-2-pentene 3,4,4-Trimethyl-trans-2-pentene 2-Isopropyl-3-methyl-1-butene 2-Ethyl-3,3-dimethyl-1-butene 1-Nonene Cis-2-nonene Trans-2-nonene Cis-3-nonene Trans-3-nonene Cis-4-nonene Trans-4-nonene 4E2C6 4E2T6 23M2N6 24M2N6 25M2N6 34M2C6 34M2T6 35M2C6 35M2T6 44M2C6 44M2T6 45M2C6 45M2T6 55M2C6 55M2T6 3E3N6 22M3C6 22M3T6 23M3C6 23M3T6 24M3C6 24M3T6 25M3C6 25M3T6 34M3C6 34M3T6 2NP1N5 21P1N5 2E3M1N5 2E4M1N5 3E2M1N5 3E3M1N5 3E4M1N5 233M1N5 234M1N5 244M1N5 334M1N5 344M1N5 3E2M2N5 3E4M2C5 3E4M2T5 234M2N5 244M2N5 344M2C5 344M2T5 21P3M1N4 2E33M1N4 1N9 2C9 2T9 3C9 3T9 4C9 4T9 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 45 46 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 47 48 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 49 50 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 51 52 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 S.D. Nelson, P.G. Seybold / Journal of Molecular Graphics and Modelling 20 (2001) 36–53 References [1] D.E. Needham, I.-C. Wei, P.G. Seybold, J. Am. Chem. Soc. 110 (1988) 4186–4194. [2] L.B. Kier, L.H. Hall, Molecular Connectivity in Chemistry and Drug Research, Academic Press, New York, 1976. [3] L.B. Kier, L.H. Hall, Molecular Connectivity in Structure–Activity Analysis, Wiley, New York, 1986. [4] P.G. Seybold, M. May, U.A. Bagal, J. Chem. Educ. 64 (1987) 575– 581. [5] R.H. Rohrbaugh, P.C. Jurs, Anal. Chem. 57 (1985) 2770–2773. [6] P.J. Hansen, P.C. Jurs, Anal. Chem. 59 (1987) 2322–2327. [7] S. Liu, R. Zhang, M. Liu, Z.J. Hu, Chem. Info. Comput. Sci. 37 (1997) 1146–1151. [8] S.P. Verevkin, D. Wandschneider, A. Heintz, J. Chem. Eng. Data 45 (2000) 618–625. [9] CRC Handbook of Chemistry and Physics, 68th Edition, CRC Press, Boca Raton, 1987. 53 [10] I.B.D. Smith, R. Srivastava, Thermodynamic Data for Pure Compounds. Part A. Hydrocarbons and Ketones, Elsevier, New York, 1986. [11] R.R. Dreisbach (Ed.), Physical Properties of Chemical Compounds. Part II. Advances in Chemistry Series, Vol. 22, American Chemical Society, Washington, DC, 1959. [12] Selected Values of Properties of Hydrocarbons and Related Compounds, A&M Research Foundation, Thermodynamic Research Center, College Station, TX, 1985. [13] M.P. Doss, Physical Constants of the Principal Hydrocarbons, 2nd Edition, The Texas Company, New York, 1939. [14] Aldrich Catalog of Fine Chemicals, Aldrich Chemical Co., New York, 1981–1982. [15] M. Randic’, J. Am. Chem. Soc. 97 (1975) 6609–6615. [16] L.H. Hall, Hall Associates, Quincy, MA. [17] SAS Institute, Box 8000, Cary, NC. [18] J.C. Dearden, The prediction of melting point, in: M. Charton (Ed.), Advances in Quantitative Structure Property Relationships, Vol. 2, JAI Press, New York, 1999, pp. 127–175.
© Copyright 2026 Paperzz