Town and Gown in Worcester, MA: Measuring the Distance to Colleges and its Effect on House Prices By Gael Carter December 2010 1 I. Introduction In a recent article in the Worcester Telegram and Gazette (Kotsopoulos, 2010), the citizens of Worcester condemned local college students for holding out of control off-campus parties. In the weeks following, many students from various colleges were arrested, including students from College of the Holy Cross and Assumption College, at various off-campus locations. In the news, it seems that the neighbors who live close to the colleges are the most directly affected by student behavior, but do the homeowners in neighborhoods further away from the schools feel the same effects? This study aims to answer the question: “To what extent does the location of a house, in relation to a Worcester college, affect the price of the house?” This paper also focuses on the separate effects of the colleges and universities in the city and their differing effects on the sales price of a house. In addition to the stereotype of wild, drunken behavior, students living in overcrowded off-campus residences also cause increased traffic and parking issues from student ownership of cars. Although it is easy to only see the negative impacts of living close to a college campus, there are many positive externalities as well. Colleges often host free cultural events such as speakers, art exhibitions, and movies on campus. They also hold athletic events, run summer camps, and allow access to the facilities such as libraries, swimming pools, tennis courts, and many more. Positive effects can also be seen in the community through student participation in community outreach programs. This balance of contradicting externalities has been referred to throughout history as “Town and Gown” (Mayfield, 2001). The term “town and gown” dates back to medieval Europe 2 identifying the town as the lay people in the communities and the gown as the universities. This term refers to the separation of town and gown and also implies conflict between the two groups. In 1862, the Morrill Act was passed in the United States, establishing a land-grant for colleges who provided a public service in return for federal aid (Mayfield, 2001). In this situation, farmers would identify an improvement, need, or specific problem and the experts at the university would solve the problem for the farmers. This became one of the first of many kinds of relationships between universities and the surrounding communities. Martin, Smith and Phillips’ (2005) paper explains that the separation of the universities in America began because of geographical separation from the larger towns and cities. Universities were usually located in rural areas and were often secluded. However, as the urban areas began to expand, the universities found themselves in the midst of the economic and social problems of the community around them. Throughout the history of American universities, there have been examples of successful college and community relationships, but for the majority of college towns the relationship continued to decline (Martin, Smith and Phillips, 2005). Ten years ago, the city of Worcester faced a similar situation in its local college neighborhoods. The debate at hand is whether or not colleges positively or negatively impact the community. In my paper, I use the distance of a given house from the nearest college or university in order measure a potential effect on the sales price of a house and if the effect is positive or negative. I also examine the schools individually to test if there is a difference in the effects between the diverse types of colleges in Worcester. This study finds that that the location of a house in relation to a college in the city of Worcester is significant. This distance is significant with and without college differentiation. This effect is also significant when the distances for the different colleges are controlled for, and 3 has a greater explanatory power than the original hedonic regression with the variable of interest being the distance to any college. This implies that the differences in the colleges impact local house values in Worcester, holding distance and all other neighborhood and house characteristics constant. II. Literature Review: No study exists that exactly examines the impact of colleges in a “college town” on house prices. However, the recent study conducted by Vandegrift, Lockshiss, and Lahr (2009) examines the value of a college on house values at the aggregate level. Their study looks at the entire state of New Jersey and they test to see if the presence of a college within the borders of a municipality causes an effect on home municipality tax and also tested to see if the presence of a college leads to higher house prices. They utilized a version of the hedonic pricing model to regress their data, which is explained in the Hedonic Model section of this paper. The presence of a college in the town corresponds with a tax base that is about 24% higher than that of a town in New Jersey without a college. This implies that the presence of a college within a town border causes a significant positive effect on the tax base per acre. This effect is greater in towns with a four-year college than towns containing a community college. They also conclude that the variables representing the size of the college and the degree to which the college is residential have virtually no effect on the tax base of the town. Vandegrift, Lockshiss, and Lahr also found that the presence of a college in a New Jersey town was associated with house prices that are 11% higher. They discovered that smaller colleges have a larger positive effect on house values and this effect disappears when the enrollment of any particular college reaches 12,500 students. This implies that while smaller colleges have an impact that increases house values in a town, once the enrollment of the college surpasses 12,500 students; this effect reverses to a negative 4 impact on house values. Similar to the influence of the tax base, the effects of a four-year college are higher than that of a community college and the difference is derived from the degree to which the college is residential. That is to say that a school with a larger percentage of the student body living in off-campus housing would have a greater negative effect on the prices of the surrounding houses in comparison to a college that has the entire student population in oncampus housing. They conclude that the overall effect for small sized, four-year colleges is a positive impact on house values and the tax base per acre. The effect is similar, but to a lesser degree in two-year colleges and the effect is negative for large schools. Other studies deal with the effects of environmental externalities, open space, and the quality of primary and secondary schools on house values. In Kiel’s (1995) research of the impact of hazardous waste sites on house values, she examines the distance to the closest hazardous waste site in the town of Woburn, MA. In Woburn there are two different hazardous waste sites. Similar to my process, Kiel used the hedonic approach using variables such as price, finished area in square feet, date of sale, year built, architectural style, and minimum distance from house to the Superfund sites. In order to measure the effect of distance from the Superfund sites on house values over time, the data were grouped by time periods. For example, the first period was prior to announcements in the media about the Superfund sites, the second time period represents the discovery of these sites, and the third time period represents the time period in which the EPA added the sites in Woburn to the Superfund list. The possibility of cleaning up the sites was made public during the fourth period, the fifth period was when the official cleanup plan was announced, and the final period is when the cleanup actually began. The regression results show that the information released on the toxic sites did impact the house prices. Unlike 5 the work of Vandegrift, Lockshiss, and Lahr, Kiel’s approach allows for the data to be viewed at a local level to gain the specific effects of the Superfund sites in Woburn over time. Downes and Zabel (2002) quantified the impact of school characteristics on house prices in Chicago from 1987-1991. In this study, they combined information from the American Housing Survey and the Illinois School Report Cards to form their own data set. They also used the hedonic method to estimate their findings. Due to the complexity of the data set, Downes and Zabel used 33 variables in their regression and found that in the district-level results, the perpupil expenditures and test scores have similar positive impacts on house values. The schoollevel results implied that potential buyers respond to the racial composition of a school when they are deciding the price that they are willing to pay for a given house. My study looks specifically at the city of Worcester and the twelve colleges and universities that it contains. The information that I have collected is on a microeconomic scale. Therefore, the effects of the colleges on the surrounding house prices will be much more accurately identified than in Vandegrift, Lockshiss, and Lahr’s paper. In Worcester the tax base will be constant for the entire city, so I do not include that variable in my study and I will simply focus on house sale prices from the year 2000 and the census level data from the corresponding year. Similar to their study, I also use the hedonic house price model in which I include variables such as percentage of students in off-campus housing, number of undergraduate students, and number of graduate students in my regression in order to account for differences in the various schools that I include in my data set. Based on the study conducted by Vandegrift, Lockshiss, and Lahr, the negative effects of the colleges on house values appear to be on a more local level, or within the given municipality. In their regressions, they used municipal-level data for the entire state of New Jersey to look at the effects on an aggregate level. The positive effects are 6 conspicuous at the aggregate level in the state of New Jersey. However, by looking at the sale prices of single family residences and their proximity to the nearest college, I am able to get a more accurate idea of the positive and negative impacts caused by the colleges on the house sales price. My economic model is very similar the model from Kiel’s paper. This study uses the the same house price hedonic method, and the price of the house as the dependent variable. However, I use distance as the variable of interest in my regression analysis. Based upon the assumption that effects of colleges on house prices would change very slowly over time, I only use data from a single year in order to determine a generalized effect of the colleges on house sale prices. Thus, by substituting in the distance from a given house to the nearest college in Worcester, I am able to see the effects on the year 2000 sales price of the house. Downes and Zabel present an interesting way to connect house prices and schools, but what I have done is different in nature from their study. The difference lies in the fact that homeowners who are choosing a home near a school are choosing based on the school itself. People buy houses located in good school districts because they will most likely have children to send to the schools. The purchase of a house near a particular primary or secondary school permits enrollment in the schools. On the other hand, the purchase of a house near a college or university is different because it does not guarantee enrollment in that particular college. For most colleges, in-state tuition is much less, which could a potential draw to living in close proximity to a certain college but not the sole reason. Also, the funding sources for colleges are much different than those of primary and secondary schools, who draw from local taxes for funding. Thus the decision to live near a particular college would not stem from the desire for the homeowner’s children to attend the college. However, since the study conducted by Downes and 7 Zabel proves that potential home buyers take school districts into account, it is reasonable to assume that the school district that the house is located in is a neighborhood effect that would affect the price of the house. They also include neighborhood variables such as the natural log of the median income in the house’s census tract, median age of individuals in the house’s census tract, proportion of nonwhite individuals in the census tract, and the proportion of blue-collar workers in the census tract, along with many other variables. These neighborhood variables are important for the hedonic method because they control for the differences in different parts of the city, which affect the house prices. I also use census level data for neighborhood variables in my regression, but my houses are divided up by census block groups rather than census tracts. III. Data In my research, I used information from the single family house sales from the year 2000 in the city of Worcester provided by the Warren Group. The Warren Group is a New England based company that provides its clients with comprehensive studies, based on detailed property and market information collected over time (www.thewarrengroup.com). The data set from the Warren Group initially contained 1450 observations of houses. Some of the houses in the data set were located outside of the Worcester city border; therefore they were eliminated from the usable data. In the column for house sale prices, some of the sale prices seem abnormally low. This low price may be a result of houses that were sold within a family for a price below market value. On the other hand, some of the houses were significantly above the mean house price. In order to correct for these outliers to some extent, I dropped the five lowest and the five highest sale prices. Another problem with the data set arose in the column that described the year in which the house was built. Thirty three of the houses in the data set were reported to be built after the 8 year 2000. However, since the data describe houses sold in the year 2000, it is impossible for the homes to be built in any year after the sale year. These houses have removed from the data set. After cleaning the data set, 1394 observations remain. The number of observations remains constant throughout the series of regressions. Similar to Kiel’s (1995) house price hedonic method in Woburn, MA, I originally had wanted to include variables such as lot size, interior square footage, number of floors, renovation year, style of the house, type of heating fuel, central air conditioning, basement designation, basement area, number of parking spaces, and if the house sale includes a garage. Unfortunately, the data from the Warren Group was limited in the house descriptive variables and only contained information on price, the year in which the house was built, the total number of rooms, number of bedrooms, the number of bathrooms, lot size, and if the house is mortgaged. Due to the restricted supply of house characteristic and information variables, I rely heavily on neighborhood information. In order to incorporate neighborhood statistics, I layered my house price data set with Census level data from the year 2000, collected from the Massachusetts Office of Geographic Information (MassGIS) website. I used Census data at the block group level, and each of the remaining houses in my data set was sorted into the Census block groups of the city of Worcester. The U.S Census Bureau defines “A census block group [as] a cluster of census blocks having the same first digit of their four-digit identifying numbers within a census tract” (www.census.gov). Therefore, the block groups are smaller than the census tracts and contain more specific neighborhood characteristics that I control for in my regression analysis. In general, census block groups contain any number of observations between 600 and 3,000 people, with an optimum size of 1,500 people; see Map 1 for a complete map of the block groups of Worcester, MA. 9 The information in the Census data that I collected included the total population, the median value of owned houses, and the median household income for each block group. I also included variables on the percentage of the population of the block groups broken down by race, age, high school graduate, college graduate, and others. Similarly, I included percentages of the total houses in each block group for variables such as occupied owned homes, occupied rented homes, the type of power in the homes, rented and owned homes without full plumbing, and the rented and owned homes without full kitchens. For a full list of variables and descriptive statistics, see Table 1. I ran a correlation matrix of all of my independent variables, a few notable correlations exist between the following neighborhood variables: the total population in a block group and the total housing units in a block group, the median house value of a block group and the median household income in a block group, and the percent of owned occupied houses and the percent of rented occupied houses. The median household income for a given block group is also highly correlated with the percent of owned occupied houses and the percent of rented occupied houses. The house characteristic variables for the total number of rooms and the total number of bedrooms were also highly correlated. The total number of rooms and the total number of bathrooms is also correlated but to a lesser degree than the variable for bedrooms. These correlations between the house characteristic variables are logical because the size of the house depends on the total number of rooms, a fraction of which are made up of bedrooms and bathrooms. I kept all of the variables in the regression because removing them would have caused omitted variable bias. All other variables had correlations of less than 0.5, and a correlation matrix of the significantly correlated variables is located in Table2. Finally, in order to create my variable of interest I used GIS software to calculate the distance of a given house to the nearest Worcester college or university. In order to do this, I 10 used a data layer from the MassGIS website which contained information about the colleges themselves. The data layer of colleges contained latitude and longitude locations for all of the schools in the data set. From those locations, I converted their locations to the corresponding coordinates on Massachusetts State Plane and calculated the distances from each house to the closest college, measured in meters. This variable is labeled as “Dist1” and each house is associated with the nearest college. The data layer of colleges contained variables such as the number of undergraduate and graduate students, the percentage of students living on campus, the length of the program, and if the college is classified as a liberal arts school. I then created variables to represent other information about the schools that could theoretically affect the sales price of the house. These variables are similar to the study conducted by Vandegrift, Lockshiss, and Lahr (2009) in that they controlled for two year versus four year colleges and the percentage of students living on campus. The correlation matrix of the college characteristic variables is shown on Table 3. The variable representing the number of undergraduate students is highly correlated with the following variables: the number of graduate students, the dummy variable for programs of four years and longer in length, and the dummy variable for liberal arts schools. Both dummy variables for programs of four years and longer in length, and the dummy variable for liberal arts schools are also highly correlated the percentage of students living in off-campus housing. The other correlations are less significant, and I control for the high correlation of the other variables in my regressions. IV. Hedonic Model The hedonic model originated in Rosen’s (1974) work on implicit markets. House price hedonic models are now widely accepted in economic studies, especially in environmental economics. The underlying foundation for the model uses the theory that the price of a house can 11 be derived from considering a house as a bundle of goods; including characteristics of the house and property itself, characteristics of the neighborhood and community, and environmental characteristics. Kiel (2006) explains that “if transactions in the housing market reflect the interaction of informed buyers and sellers, then the price that the house sells for is the sum of the prices that the buyer is willing to pay for each individual characteristic of the house”. Therefore, after controlling for non-environmental factors, any remaining differences in price can be attributed to differences in environmental quality. In essence, it is possible to assign a price for the said environmental quality. A general example of a hedonic regression would take the form: Sales Price= β0+ β1i House Characteristics+ β2i Neighborhood Characteristics + β3i Environmental Quality + ε. In order for this model to hold, four assumptions must be made. First, for the housing market to reach equilibrium, the observed prices must be an outcome of the buyers maximizing their utility functions and the sellers maximizing their profits. Even if this assumption is unlikely in a housing market, the prices should still reflect the preferences of the buyers and sellers of the houses. The next assumption is based upon the idea that the buyer of a house is aware of all of the characteristics of the house, so all of the house characteristics are included in the price that the buyer is willing to pay. In the case of bedrooms, bathrooms, square footage, and other similar house variables, it is a safe assumption that the information is equally available to both the buyer and seller. However, if there is radon in the basement or asbestos in the insulation, the buyer may not be aware of this information at the time of the purchase. In this case, the buyer’s purchase price would not reflect the true value of the house. The third assumption is that in the housing market, discrimination cannot exist. If this assumption fails, then the characteristics of the buyer affect the price of the house along with the house characteristics themselves and the resulting 12 price would not reflect the true value of the house. The fourth and final assumption is that the examined housing market cannot be a segmented market; it must be a single market. If a single equation is estimated, it requires a single market for the equilibrium condition (Kiel 2006). In order for the house price hedonic regression to hold and for the sales price of the house to fully represent the house characteristics, all four of the assumptions must be correct. V. Results: In my study I used the basic house price hedonic model; however I included distance to the colleges as one of my neighborhood characteristics in order to assist in the explanation of the variation of the sales prices of the houses in the data set. In the first regression, I used Dist1 to represent the minimum distance from the house to the nearest college or university, measured in meters. In this regression, I assumed that effects of the colleges on house prices jointly and did not separate out effects for the different schools. I regressed both price and ln(price) as the dependent variables, however I chose to use price as my dependent variable because it had more explanatory power than ln(price). Also, based on theory and the graph of price and dist1 in Graph 1, my variables do not appear to have a linear functional form. Therefore, I have chosen to use the variables dist1 squared and dist1 cubed, denoted dist1sq and dist1cubed respectively, in order to better explain the variation in the mean of the dependent variable of price in the regression. For the first regression, the estimated coefficients are shown in Table 4, and all results are represented with White’s Heteroskedasticity-Consistent Standard Errors. I used the dist1, dist1sq, and dist1cube to represent the distance to any college; this is to determine if the distance is significant without distinguishing between the colleges. The t-statistics imply that the variables 13 dist1, dist1sq, and dist1cube are all statistically significant at the 95% level in the regression. The coefficient for dist1 is negative, Therefore, being closer to a college has a positive effect on the sales price of a house. For example, if two completely identical houses are located in the same census block group, the house that is located closer to the college or university in Worcester would have a higher sale price than an identical house further away from the school. The dist1sq and dist1cube variables help to explain a more accurate picture of the impact that the distance variable has on the sale price of a house. Including all three distance variables, the combined coefficient is still negative, which implies that a house in close proximity to a college or university in Worcester will have a positive impact on the sale price of the house. More specifically, if a house is moved one meter closer any college in Worcester, the sales price of the house would increase by $28.877. All of the other included independent variables have the expected signs and the following variables have corresponding coefficients that are statistically significant at the 95% level: mortgage, t_yearbuilt, totrooms, bathrooms, lotsize, vl_med_own, p_pop_black, p_occ_owner, p_occ_renter, p_util_gas, p_tank_gas, p_elec, p_fuel, and p_other. The coefficients for t_yearbuiltsq and p_rnt_vhcl_1 are both statistically significant at the 90% level and all of the other coefficients of the remaining independent variables are statistically insignificant. In order to avoid omitted variable bias, I have kept all of the independent variables in the regression. The adjusted R-Squared for the regression is 0.6151, which implies that 61.5% of the variation in the sales price of the houses in the data set is explained by the independent variables, including distance to the nearest college. It is possible that there are different effects on the housing prices from the different universities in Worcester because of the many different types of colleges. For example, Bancroft School of Massage therapy may have a different impact on nearby house prices than Clark 14 University because of the characteristics of the schools. Therefore, after the initial regression I tested the colleges separately to see if the distance from a given house to a specific college is significant. In order to do this, first I created dummy variables for each college. Then, I generated an interaction term between the college dummy variable and the distance variable in order to test to see if the distance from a house to the nearest college is significant for each college individually. Based on the statistically significant results of dist1sq and dist1cube from the first regression, I also created interaction terms between the dummy variables of all of the colleges and the dist1sq and dist1cube variables respectively. This helps to control for the nonlinear functional form. The results of this regression are shown in Table 5.1. The adjusted R-Squared for this regression is 0.6309, which implies that this regression explains 63.09% of the variation around the mean of the dependent variable, which is still price. The distance variables for WPI, Salter, Clark, Becker, and Assumption are all statistically insignificant for the interaction terms between the schools and dist1, dist1sq, and dist1cube. Although the coefficients are significant for the schools, the overall signs of the coefficients are different for the different colleges. For example, the overall signs for the coefficients of WPI are positive, but the signs switch to negative when distance reaches 489.17 meters. The signs for the distance coefficients for WPI reverse again when the distance reaches 2134.476 meters. This implies that a house within 489.17 meters of WPI would have a lower sales price than an identical house that is greater than 489.17 meters away from WPI. On the other hand, this effect switches again at 2134.476 meters and the exact same house would have a lower price at any greater distance. A similar effect occurs in the prices of the houses surrounding Becker College. The signs for the coefficients of Salter, Clark and Assumption are generally negative. Holding all else constant, this implies that houses located in 15 the vicinity of WPI or Becker College have lower house prices than houses located closer to the Salter School, Clark University, and Assumption. More specifically, in the neighborhoods around Clark University, the coefficient for the distance to Clark variable is negative for any distance. This implies that any distance from a house to Clark is associated with an increase in house price. In the example of Assumption College, any distance less than 2137.567 meters has an increases house prices and any distance above 2137.567 meters causes a decrease in house prices. A similar effect is seen in the houses near the Salter School. Both coefficients for Holy Cross and the Bancroft School of Massage Therapy are statistically significant at the 90% level for the interaction term between the school and dist1. However, the interaction terms between the schools and dist1sq and dist1cube are both statistically insignificant for Holy Cross and Bancroft. The overall sign of the coefficients for Holy Cross is negative for distances under 977.62 meters and distances over 32,286.64 meters. Therefore, Holy Cross has a positive impact on house prices for distances under 977.62 meters and over 32,286.64 meters. This effect on house prices increases as distance from the college decreases. On the other hand, the coefficient for Bancroft is negative for distances under 103.463 meters and for distances over 2323.232 meters. Then, intuitively, the coefficient switches to positive between the 103.463 meters and 2323.232 meters. Therefore, the Bancroft School of Massage Therapy has a positive impact on house prices for distances below 103.463 meters and above 2323.232 meters and causes a negative effect on house prices for the distances located on the interval in-between. The coefficient for the interaction terms of Hair in Motion Beauty Academy, Worcester State, UMass Medical Center, and Quinsigamond Community College and the distance variable are all statistically significant, with the exception of worcesterstatedistcube coefficient, which is 16 not statistically significant. The overall coefficients for the Hair in Motion Beauty Academy, Worcester State, UMass Medical Center, and Quinsigamond Community College behave in the same manner as the coefficient for Bancroft. For example, the coefficient for U. Mass Medical Center is negative for distances below 791.419 meters and above 1755.195 meters. On the interval in between, the coefficient is positive. This implies that U. Mass Medical Center has a positive impact on house prices until distance from a house to the college reaches 791.419, when this impact reverses to a negative impact. This again reverses to a positive effect on house prices when distance reaches 1755.195 meters and all distances larger than this number are also positive. This reversal of coefficient signs is the same pattern for the Hair in Motion Beauty Academy, Worcester State, and Quinsigamond Community College, but the changes occur at different distances for each college. The full table of inflection point distances is located in Table 5.2. All three coefficients for the Rob Roy Academy are statistically significant at the 99% level, and the coefficients are positive, however the sign switches to negative when distance reaches 1263.59 meters. Therefore, the Rob Roy Academy has a general negative effect on house prices for houses within 1262.92 meters of the school. In other words, as the distance to from a house to the Rob Roy Academy decreases from 1262.92, the sales price of the house would decrease as well. Although the coefficient values of interaction terms are not statistically significant for all of the colleges, it is clear that the impacts of the distance to each college are different. This suggests that variations in the schools themselves have a diverse impact on the sales prices of houses. Next, I grouped the schools into three categories; two-year community colleges, programs of four-year and longer and non-academic career programs. I created an interaction 17 variable between these three group variables with the distance variable to separate out the effects of the different categories. In order to control for nonlinear functional form, I also included interactions between the groupings of college variables and dist1sq and dist1cube individually. Table 6 shows the regression results, the nonacademicdist and the nonacademicdistsq coefficients are statistically significant at the 99% level. The coefficient for nonacademicdistcube is also statistically significant at the 95% level; however the other two categories are not statistically significant. This implies that the colleges had more explanatory power when they were tested individually in the regression. The relationship between distance and house prices from the community college and four year programs were very different, so their combined effect is canceled out to some degree and not statistically significant. However, the nonacademic schools had similar effects on house prices from distance, which causes the additive effect to be statistically significant. The adjusted R-Squared for this regression is 0.6154, which implies that this regression explains 61.54% of the variation around the mean of the dependent variable. This is slightly less than the adjusted R-Squared of the previous regression, which is 0.6309. This also implies that the independent variables, the grouped colleges interacted with distance, distance squared, and distance cubed, have less explanatory power than the colleges interacted separately with distance, distance squared, and distance cubed. In the last set of regressions, I differentiated between the colleges and universities because they have many diverse characteristics that may impact house prices. I used variables, such as four-year colleges and community colleges, to account for the differences in school attributes. I created dummy variables for private schools, religious affiliation, two-year and under programs, liberal arts programs, and “non-academic” programs. I also included 18 information such as the number of undergraduate students, the number of graduate students, and the percentage of students living in off campus housing. The results for the third set of regressions are shown in Table 7, I have included the dummy variables for private, religious affiliation, programs of four years or longer, liberal arts colleges, and non-academic schools. I have also included variables for the number of undergraduate students, the number of graduate students, and the percent of the student population that resides in off-campus housing. The only variable that is statistically significant at the 95% level is the dummy variable for the non-academic schools. All other school variables are found to be statistically insignificant. The adjusted R-Squared for this regression is 0.6217, which implies that this regression explains 62.17% of the variation around the mean of the dependent variable. Since the variables for the four year programs and the liberal arts schools are highly correlated, for the next regression I dropped liberal arts dummy variable along with the religious affiliation dummy variable. These results are shown in Table 8. In this regression, the number of undergraduate students, the number of graduate students and the dummy variable for nonacademic schools are statistically significant at the 99% level. The coefficient for the private dummy variable is not statistically significant, but the presence of a private college increases house prices by $6,857.04. The coefficient for the number of undergraduate students is negative and significant at the 95% level. This is logical because it is the undergraduate students that are usually associated with many of the negative externalities for people who live near a college. The coefficient for the number of graduate students statistically significant at the 90% level and is positive, which is also logical because graduate students are generally older and more mature than undergraduate students. Also, graduate students usually live in off-campus housing, which 19 would cause an increase in demand in the surrounding neighborhoods of the colleges that have graduate programs. As a result of this increase in demand, house prices would increase near some colleges in Worcester. The dummy variable that represents the programs that are four years or longer is not statistically significant; however it is highly correlated with the number of undergraduate students, as seen in the correlation matrix in Table 3. The percentage of students living off campus is not statistically significant. This is a surprising outcome of the regression; however all of the students that attend the non-academic schools commute to the schools because they do not have on-campus housing. Therefore, the non-academic schools are likely skewing the results of this coefficient. The presence of a non-academic school has a negative impact on house prices, which is surprising since the non-academic distance to a house is significant and has a positive impact on house prices. This suggests that there are other amenities or neighborhood characteristics near the non-academic colleges that would cause an increase in house price since the presence of a non-academic school near a house has a negative impact on the price of a house. The adjusted R-Squared for this regression is 0.6195, which implies that this regression explains 61.95% of the variation around the mean of price. This R-Squared is slightly less than that of the previous regression, however by dropping the highly correlated variables, the effects of the remaining variables becomes more apparent. VI. Conclusion: The results suggest that there are definite and measurable effects from the different schools in Worcester that can be quantified in house prices. This study finds that the location of a house in relation to a college in the city of Worcester is significant. That is, the distance of a house from any college or university is significant. This effect is also true when the distances for the different colleges are controlled for, and has a greater explanatory power than the original 20 hedonic regression with the variable of interest being the distance to any college. When I controlled for distance separately, I found that the characteristics of the schools were significant. This implies that the schools themselves impact local house values in Worcester, holding distance and all other neighborhood and house characteristics constant. Unlike Vandegrift, Lockshiss, and Lahr (2009), this study yields results that are much more specific. Their paper also found that colleges in general have a positive impact on house prices. However, they only differentiate their colleges by the classification of a four-year college or a community college. In the city of Worcester, the colleges have different impacts on house prices within the categories of the four-year colleges. For example, I found that the College of the Holy Cross has a positive impact on house prices for distances under 977.62 meters, and switches to a negative impact on house prices until the distance of 32,286.64. On the other hand, Assumption changes from a positive effect to a negative effect on house prices once the distance of a house to the college surpasses 2137.567 meters. Both Holy Cross and Assumption are fouryear colleges, but they have different effects on house prices within the same city. Similar to this paper, the study conducted by Vandegrift, Lockshiss, and Lahr also controls for the different types of schools by including the number of students and the degree to which the college is residential. While their study did control for enrollment numbers, they did not differentiate between undergraduate and graduate students. As outlined in the results, there is a significant difference in the impact of undergraduate and graduate students in this study. This suggests that Vandegrift, Lockshiss, and Lahr’s paper does not capture the complete impact of students on the surrounding community. Also, I include variables such as dummy variables for private schools and non-academic schools. The differences between public and private colleges have a sizable impact on house prices in the surrounding neighborhoods. The schools that are identified as non21 academic are the more statistically significant than the schools that are identified as academic, and they are also classified as colleges by the state of Massachusetts in the data set. Therefore, these schools must be included in the data for an accurate idea of the full effects of colleges on house prices. While it was interesting to see the effects of all of the colleges in Worcester on house prices, I hope to continue my study and redefine what is meant by “colleges”. It is not a far stretch to imagine that the average person would not assign the term “college” to a school such as the Hair in Motion Beauty Academy. Therefore, I reconstructed my data set and to exclude the non-academic schools in my regression analysis. The effects for the academic colleges in Worcester are statistically significant and support the results of this study for the new definition of “college”. Worcester is such a unique city of colleges; it would be interesting to see this model applied to a city with fewer colleges to see if there is a similar effect. 22 References: Downes,Thomas A., Zabel,Jeffrey E.. “The Impact of School Characteristics on House Prices: Chicago 1987–1991”. J.Urban Econ., 2002, 52, 1, 1-25. Kiel, Katherine A. 2006. “Environmental Contamination and House Values.” College of The Holy Cross, Department of Economics research paper no. 06-01. Kiel, Katherine A.. "Measuring the Impact of the Discovery and Cleaning of Identified Hazardous Waste Sites on House Values". Land Economics, Vol. 71, No. 4 (Nov., 1995), pp. 428-435. Kotsopoulos, Nick. "Holy Cross President Tells Students Rowdy Behavior 'has to Stop'"Telegram.com - An Edition of the Worcester Telegram & Gazette and Sunday Telegram. Worcester Telegram and Gazette, 23 Nov. 2010. Web. 4 Dec. 2010. <http://www.telegram.com/article/20101123/NEWS/101129891>. Martin, L., H. Smith, and W. Phillips. 2005. Bridging 'Town & Gown' through innovative University-Community Partnerships. The Innovation Journal: The Public Sector Innovation Journal 10(2): Article 20. Mayfield, L. (2001). Town and gown in America: Some historical and institutional issues of the engaged university. Education for Health, 14, 231 – 240. Vandegrift, Donald, Amanda Lockshiss, and Michael Lahr. "Town versus Gown: The Effect of a College on Housing Prices and the Tax Base." Thesis. The College of New Jersey, 2009. MPRA. Dec. 2009. Web. <http://mpra.ub.uni-muenchen.de/18998/>. http://www.census.gov/geo/www/cob/bg_metadata.html http://www.mass.gov/mgis/laylist.htm 23 0 200000 price 400000 600000 Graph 1- Correlation of Price and Dist1 0 1000 2000 DIST1 3000 4000 Table 1- Summary and Explanation of Variables Variable price Explanation Mean Std. Dev. Min Max 136868.4 63886.14 5000 590000 1479.684 835.7461 49.60719 4386.892 dist1sq sales price of single family homes in the year 2000 distance in meters from a house to the closest college distance squared 2887434 3185106 2460.873 1.92E+07 dist1cube distance cubed 6.80E+09 1.18E+10 122077 8.44E+10 mtg mortgage of the house 102584.5 71169.7 0 1424000 t_yearbuilt 2000-yearbuilt 52.2231 35.0857 0 210 t_yearbuiltsq (2000-yearbuilt) squared 3957.375 4200.635 0 44100 totrooms total number of rooms in the house 6.263271 1.387665 4 13 bedrooms total number of bedrooms 3.022238 0.7711184 1 8 Bathrooms total number of bathrooms 1.8967 0.8131744 0 7 Lotsize the size of the lot 9787.54 9922.257 1573 240016 total_pop total population in a given block group 1399.878 837.5041 424 3538 vl_med_own median house value in a given block group 124432.1 27388.62 69100 261500 inc_med_hs Median household income in a given block group percentage of the population that is white in a given block group percentage of the population that is black in a given block group percentage of the population that is native american in a given block group percentage of the population that is asian in a given block group percentage of the population that is of any other race in a given block group 47910.01 14755.35 11887 91801 87.38704 10.98003 32.68156 100 4.61784 6.178511 0 44.34932 0.1731922 0.5049738 0 6.317689 3.068557 3.478741 0 18.23105 2.56002 4.230811 0 36.43533 dist1 p_pop_white p_pop_black p_pop_nat p_pop_asn p_pop_other 24 ptrav5_14 ptrav20_34 ptrav45_59 ptrav_gt90 Phsgrad Pbach p_poplt1_12 p_pop60_gt85 tot_hs_unt p_occ_owner p_occ_renter p_util_gas p_tank_gas p_elec p_fuel p_coal p_wood p_solar p_other p_own_vhcl_0 p_own_vhcl_1 p_own_vhcl_2 p_rnt_vhcl_1 p_rnt_vhcl_2 p_hs_noplumb p_own_noplumb p_hs_noktch p_own_noktch Wpidist Salterdist Robroydist Hairinmotiondist percentage of the population who commute 5 to 14 minutes to work in a given block group percentage of the population who commute 20 to 34 minutes to work in a given block group percentage of the population who commute 45 to 59 minutes to work in a given block group percentage of the population who commute > 90 minutes to work in a given block group percentage of the population who have a high school degree percentage of the population who have a bachelor degree percentage of the population between the ages of 0 to 12 in a block group percentage of the population between the ages of 60 to greater than 85 in a block group total number of housing units in a block group percentage of the owner occupied housing units in a block group percentage of the renter occupied housing units in a block group percentage of the houses in a block group that percentage of housing units heated by Bottled, tank, or LP gas percentage of housing units heated by electricity percentage of housing units heated by Fuel oil, kerosene, etc. percentage of housing units heated by Coal or coke percentage of housing units heated by wood percentage of housing units heated by solar energy percentage of housing units heated by other energy percentage of owner occupied housing units with no vehicle available percentage of owner occupied housing units with one vehicle available percentage of owner occupied housing units with two vehicles available percentage of renter occupied housing units with one vehicle available percentage of renter occupied housing units with two vehicles available percentage of housing units lacking complete plumbing facilities percentage of owner occupied housing units lacking complete plumbing facilities percentage of housing units lacking complete kitchen facilities percentage of owner occupied housing units lacking complete kitchen facilities interaction term between the dummy variable for WPI and the distance variable interaction term between the dummy variable for Salter and the distance variable interaction term between the dummy variable for Rob Roy and the distance variable interaction term between the dummy variable for Hair In Motion and the distance variable 25 11.91695 4.251587 0 24.87961 11.38936 3.937049 1.71969 24.31843 2.38751 1.580721 0 7.38255 0.8149164 0.9451505 0 3.914989 19.89729 6.181187 4.042716 36.01236 11.32368 5.262218 0 25.9334 15.95041 4.66628 0.686499 41.62679 20.02132 6.985242 2.513967 47.29064 566.2131 325.2542 162 1566 63.39662 21.94621 4.761905 94.58599 33.20107 20.89421 4.249668 92.2619 50.4312 14.51052 14.97976 82.20641 1.254863 2.150942 0 17.02586 13.70377 11.40389 0 60.76191 30.18372 14.02496 0 65.84615 0.0867292 0.3142741 0 1.842105 0.3084518 0.8517046 0 3.523035 0.0750145 0.7450353 0 7.469306 0.3473939 0.9011939 0 7.279694 4.750337 5.164563 0 55 38.12109 12.73393 0 100 44.90005 12.94604 0 100 54.61199 13.51995 0 100 24.36987 13.6662 0 70.58823 0.5690895 1.322179 0 21.19403 0.5551353 4.041682 0 100 0.7322192 1.627741 0 21.19403 0.3408535 1.326941 0 18.29268 67.39843 338.0859 0 2392.836 98.81541 339.1233 0 2041.725 3.337636 55.74801 0 1191.862 255.0557 706.2583 0 4277.911 Hcdist Clarkdist Beckerdist Bancroftdist Assumptiondist Worcesterstatedist Umassmeddist Quinsigamonddist Wpidistsq Salterdistsq Robroydistsq Hairinmotindistsq Hcdistsq Clarkdistsq Beckerdistsq Bancroftdistsq Assumptiondistsq Worcesterstatedistsq Umassmeddistsq Quinsigamonddistsq Wpidiscube Salterdistcube Robroydistcube Hairinmotiondistcube Hcdistcube Clarkdistcube Beckerdistcube Bancroftdistcube interaction term between the dummy variable for Holy Cross and the distance variable interaction term between the dummy variable for Clark and the distance variable interaction term between the dummy variable for Becker and the distance variable interaction term between the dummy variable for Bancroft and the distance variable interaction term between the dummy variable for Assumption and the distance variable interaction term between the dummy variable for Worcester State and the distance variable interaction term between the dummy variable for Umass Med and the distance variable interaction term between the dummy variable for Quinsigamond and the distance variable interaction term between the dummy variable for WPI and the distance variable squared interaction term between the dummy variable for Salter and the distance variable squared interaction term between the dummy variable for Rob Roy and the distance variable squared interaction term between the dummy variable for Hair In Motion and the distance variable squared interaction term between the dummy variable for Holy Cross and the distance variable squared interaction term between the dummy variable for Clark and the distance variable squared interaction term between the dummy variable for Becker and the distance variable squared interaction term between the dummy variable for Bancroft and the distance variable squared interaction term between the dummy variable for Assumption and the distance variable squared interaction term between the dummy variable for Worcester State and the distance variable squared interaction term between the dummy variable for Umass Med and the distance variable squared interaction term between the dummy variable for Quinsigamond and the distance variable squared interaction term between the dummy variable for WPI and the distance variable cubed interaction term between the dummy variable for Salter and the distance variable cubed interaction term between the dummy variable for Rob Roy and the distance variable cubed interaction term between the dummy variable for Hair In Motion and the distance variable cubed interaction term between the dummy variable for Holy Cross and the distance variable cubed interaction term between the dummy variable for Clark and the distance variable cubed interaction term between the dummy variable for Becker and the distance variable cubed interaction term between the dummy variable for Bancroft and the distance variable cubed 26 78.50254 386.237 0 3431.133 117.7479 497.3318 0 3010.97 15.00576 130.1521 0 1655.577 31.1403 203.4944 0 2287.123 273.9024 734.2812 0 4386.892 255.1696 644.8681 0 4361.995 60.88829 326.9612 0 2671.213 222.7197 589.2762 0 2604.93 118762.6 681239.9 0 5725664 124686.6 506355.4 0 4168641 3116.751 58582.47 0 1420536 563496.4 1989009 0 1.83E+07 155234.6 942603.9 0 1.18E+07 261026.1 1229283 0 9065942 17152.59 174529.5 0 2740935 42349.99 377240.8 0 5230933 613804.6 2274689 0 1.92E+07 480668.1 1691809 0 1.90E+07 110534.3 687798.8 0 7135381 396601.4 1227393 0 6785659 2.31E+08 1.47E+09 0 1.37E+10 1.78E+08 8.30E+08 0 8.51E+09 3214699 6.45E+07 0 1.69E+09 1.47E+09 6.40E+09 0 7.83E+10 3.57E+08 2.68E+09 0 4.04E+10 6.28E+08 3.20E+09 0 2.73E+10 2.23E+07 2.50E+08 0 4.54E+09 7.36E+07 7.91E+08 0 1.20E+10 Undergrad interaction term between the dummy variable for Assumption and the distance variable cubed interaction term between the dummy variable for Worcester State and the distance variable cubed interaction term between the dummy variable for Umass Med and the distance variable cubed interaction term between the dummy variable for Quinsigamond and the distance variable squared interaction term between the dummy variable for the community colleges and the distance variable interaction term between the dummy variable for the four year and longer programs and the distance variable interaction term between the dummy variable for non-academic schools and the distance variable interaction term between the dummy variable for academic schools and the distance variable dummy variable: 1 if the college is private, 0 if the college is public number of undergraduate students Grad number of graduate students d_religaff dummy variable: 1 if the college is religiously affiliated, 0 if the college is not religiously affiliated dummy variable: 1 if the college is a program of four years or longer, 0 if the college is two years or fewer dummy variable: 1 if the college is classified as liberal arts, 0 if the college is not a liberal arts school dummy variable: 1 if the college is a "nonacademic school", 0 if the college is an academic school percentage of the students residing in offcampus housing percentage of the students residing in oncampus housing Assumptiondistcube Worcesterstatedistcube Umassmeddistcube Quinsigamonddistcube d_communitcolldist d_4yeardist d_nonacademicdist d_academicdist d_private d_4year d_libarts d_nonacademic p_off_campus p_on_campus 1.71E+09 8.50E+09 0 8.44E+10 1.12E+09 5.79E+09 0 8.30E+10 2.24E+08 1.58E+09 0 1.91E+10 7.88E+08 2.74E+09 0 1.77E+10 222.7197 589.2762 0 2604.93 868.6148 1001.704 0 4386.892 289.5337 724.921 0 4277.911 1190.15 928.7803 0 4386.892 0.6312769 0.4826318 0 1 1543.224 1482.541 0 3982 288.1011 373.9063 0 1004 0.2080344 0.4060473 0 1 0.5502152 0.4976506 0 1 0.510043 0.5000785 0 1 0.197274 0.3980837 0 1 69.10689 38.04495 11 100 30.89311 38.04495 0 89 *Note: "non-academic" schools include: Salter, Rob Roy, Hair in Motion, and Bancroft School of Massage 27 Table 2- Correlation Matrix for Significant Correlations totrooms totrooms bedrooms bathrooms vl_med_own inc_med_hs p_tot_occ_hs p_occ_owner p_occ_renter 1 bedrooms 0.7828 bathrooms 0.5566 0.457 1 vl_med_own 0.3004 0.2295 0.2954 1 inc_med_hs 0.2008 0.1503 0.2299 0.6038 1 p_tot_occ_hs -0.0548 -0.0436 -0.0094 0.0037 0.245 1 p_occ_owner 0.0561 0.0227 0.1033 0.4025 0.7742 0.4569 1 p_occ_renter -0.0657 -0.0292 -0.1096 -0.4223 -0.7829 -0.3565 -0.994 1 1 Table 3- Correlation Matrix for College Variables d_private d_private undergrad grad p_off_campus d_4year d_libarts d_nonac. 1 undergrad -0.2305 1 grad -0.1987 0.8259 1 p_off_campus -0.4477 -0.4985 -0.4013 1 d_4year -0.0574 0.8616 0.6969 -0.7344 1 d_libarts 0.0481 0.8991 0.7555 -0.7962 0.9225 1 d_nonacademic 0.3789 -0.4934 -0.3821 0.4027 -0.5483 -0.5058 28 1 Map 1 – Worcester Block Groups 29 Map 2- Worcester Colleges The Salter School-Worcester Quinsigamond Community College Assumption College UMass Med Worcester Polytechnic Institute Becker College Worcester State College Bancroft School Of Massage Therapy Rob Roy Academy-Worcester Hair In Motion Beauty Academy Clark University College of the Holy Cross 30 Map 3- Worcester Colleges in relation to the Single Family House Sales in 2000 The Salter School-Worcester (! (! Quinsigamond Community College Assumption College (! Worcester Polytechnic Institute UMass Med (! (! Bancroft School Of Massage Therapy Becker College ! ( (! Worcester State College (! (! Rob Roy Academy-Worcester Hair In Motion Beauty Academy (! Clark University (! College of the Holy Cross (! *Houses represented by dots, * Colleges represented by stars, * Block Groups shown by black lines 31 Table 4 – First run regression Regression Number of obs = 1394 F(47, 1344) = 22.82 Prob > F = 0 R-squared = 0.6151 Root MSE = 40346 price dist1 dist1sq dist1cube mtg t_yearbuilt t_yearbuiltsq totrooms bedrooms bathrooms lotsize total_pop vl_med_own inc_med_hs p_pop_white p_pop_black p_pop_nat p_pop_asn p_pop_other ptrav5_14 ptrav20_34 ptrav45_59 ptrav_gt90 phsgrad pbach p_poplt1_12 p_pop60_gt85 tot_hs_unt p_occ_owner p_occ_renter p_util_gas p_tank_gas p_elec p_fuel p_coal p_wood p_solar Coef. Std.err t -28.90634** 0.014652** -2.10E-06** 0.294217*** -355.374*** 1.627906* 8739.121*** 749.1157 13503.23*** 1.023087** -0.2246738 0.5016043*** 0.3016507 832.0357 1207.852** -2038.844 132.1647 927.0958 335.4461 -19.14247 -722.8094 372.3882 -48.14107 422.6611 -31.45517 313.1355 -8.304475 3593.661** 3620.567** -3896.682*** -3942.5** -3988.075*** -3981.261*** -355.948 -1811.334 1827.4 12.53236 0.006808 1.03E-06 0.077051 113.6759 0.885238 1889.661 2788.253 2355.713 0.416871 6.874454 0.113288 0.211827 512.4422 555.7801 2197.948 570.7085 729.0399 313.0824 311.3587 862.5688 1384.403 330.882 361.012 329.7619 307.0879 15.81036 1623.358 1632.174 1485.385 1676.037 1520.413 1509.477 3494.379 2052.896 1208.122 -2.31 2.15 -2.02 3.82 -3.13 1.84 4.62 0.27 5.73 2.45 -0.03 4.43 1.42 1.62 2.17 -0.93 0.23 1.27 1.07 -0.06 -0.84 0.27 -0.15 1.17 -0.1 1.02 -0.53 2.21 2.22 -2.62 -2.35 -2.62 -2.64 -0.1 -0.88 1.51 32 p_other -6262.768*** 2306.535 -2.72 p_own_vhcl_0 -357.8544 265.4306 -1.35 p_own_vhcl_1 36.51372 199.3504 0.18 p_own_vhcl_2 194.7495 185.6174 1.05 p_rnt_vhcl_1 179.8301* 95.255 1.89 p_rnt_vhcl_2 125.6967 134.3859 0.94 p_hs_noplumb -1755.571 1157.142 -1.52 p_own_noplumb -213.1511 204.5949 -1.04 p_hs_noktch 955.9639 1203.072 0.79 p_own_noktch 254.711 789.3183 0.32 p_hshld15_64 -43.88334 224.9979 -0.2 _cons -110573 85302.62 -1.3 Note: *significant at the 90% level, ** significant at the 95% level, *** significant at the 99% level Table 5.1 - College-Distance Variables, Second Regression 1 Regression Number of obs. = 1394 F( 79, 1315) = 26.40 Prob > F = 0.0000 R-squared = 0.6309 Root MSE = 39961 price Coef. Std. Err. t mtg 0.287202*** 0.074909 3.83 t_yearbuilt -336.278*** 115.496 -2.91 37.23044 -50.9942*** -77.7228 641.9893*** -42.4932* -20.0464 47.65893 -93.9076* -13.4585 -95.8464*** -50.5661*** -63.5568** -0.04682 0.05644 -1.83473*** 72.91212 18.6238 54.32912 235.8619 22.59825 29.4129 90.5115 54.22715 23.32681 37.0027 19.08062 25.53092 0.084735 0.0691 0.477568 0.51 -2.74 -1.43 2.72 -1.88 -0.68 0.53 -1.73 -0.58 -2.59 -2.65 -2.49 -0.55 0.82 -3.84 ### all other variables### wpidist hairinmotindist Salterdist Robroydist Hcdist Clarkdist Beckerdist Bancroft dist Assumptiondist Umassmeddist worcesterstatedist quinsigamonddist Wpidistsq Salterdistsq Robroydistsq 33 hairinmotiondistsq 0.025865** 0.01145 2.26 Hcdistsq 0.022287 0.017698 1.26 Clarkdistsq 0.012721 0.024777 0.51 Beckerdistsq -0.08219 0.151979 -0.54 bancroftdistsq 0.065515 0.077141 0.85 assumptiondistsq 0.00135 0.014598 0.09 worcesterstatedistsq 0.020374* 0.011107 1.83 umassmeddistsq 0.087858** 0.038082 2.31 quinsigamonddistsq 0.059418** 0.023559 2.52 Wpidiscube 1.19E-05 2.36E-05 0.5 salterdistcube -6.22E-06 2.27E-05 -0.27 robroydistcube 0.001102*** 0.000246 4.49 hairinmotiondistcube -3.78E-06** 1.89E-06 -2 hcdistcube -3.71E-06 3.72E-06 -1 clarkdistcube -3.01E-06 5.60E-06 -0.54 beckerdistcube 0.000027 6.48E-05 0.42 bancroftdistcube -1.3E-05 0.000025 -0.52 assumptiondistcube 1.70E-07 2.29E-06 0.07 worcestersstate -2.00E-06 1.75E-06 -1.14 umassmeddistcube -2.3E-05** 1.05E-05 -2.16 quinsigamonddistcube -1.6E-05*** 6.18E-06 -2.62 _cons -153195 96085.49 -1.59 Note: All house and neighborhood characteristic variables were included in the regression, but not shown on the table. All variables not shown have the similar significance level as the first regression in Table 4. Full results are vailable upon request. Note: *significant at the 90% level, ** significant at the 95% level, *** significant at the 99% level Table 5.2 Coefficient Signs for Statistically Significant College-Distance Variables Distance Worcester State Quinsigamond Hair in Motion U Mass Med 0 0 0 0 between coefficient sign negative negative negative negative Rob Rob Academy 0 positive distance distance sign 1634.430 781.570 1440.956 791.419 between coefficient sign positive positive positive positive 5157.926 1694.17 3120.804 1755.195 negative negative negative negative 1263.590 negative ∞ negative *interpretation for Worcester State: Coefficient is negative from distance 0 to 1634.430 Coefficient is positive from 1634.430 to 5157.926 Coefficient is negative for all distances over 5157.926 34 Table 6- Grouped College-Distance Variables, Second Regression 2 Regression Number of obs = 1394 F( 52, 1341) = 21.37 Prob > F = 0 R-squared = 0.6154 Root MSE = 40379 price mtg t_yearbuilt Coef. Std. Err. t 0.292293*** -354.271*** 0.07771 117.3168 3.76 -3.02 ### All other Variables ### d_4yeardist -11.7131 10.35436 -1.13 d_nonacademicdist -38.4366*** 11.40585 -3.37 d_communitycolldist -20.616 17.30407 -1.19 d_4yeardistsq 0.003766 0.006605 0.57 d_nonacademicdistsq 0.022581*** 0.008451 2.67 d_communitycolldistsq 0.025495 0.019176 1.33 d_4yeardistcube -2.30E-07 1.08E-06 -0.21 d_nonacademicdistcube -3.46E-06** 1.49E-06 -2.32 d_communitycolldistbube -8.49E-06 5.36E-06 -1.58 _cons -131446 87409.67 -1.5 Note: All house and neighborhood characteristic variables were included in the regression, but not shown on the table. All variables not shown have the similar significance level as the first regression in Table 4. Full results are vailable upon request. Note: *significant at the 90% level, ** significant at the 95% level, *** significant at the 99% level 35 Table 7- College Variables, Third Regressions 1 Regression Number of obs. = 1394 F( 54, 1337) = 20.86 Prob > F = 0 R-squared = 0.6217 Root MSE = 40107 Price Coef. Std. Err. t dist1 -21.6345* 12.49811 -1.73 dist1sq 0.008861 0.00687 1.29 dist1cube -1.06E-06 1.06E-06 -1 mtg 0.289417*** 0.076748 3.77 t_yearbuilt -368.745*** 113.7281 -3.24 d_private 5143.946 5042.404 1.02 undergrad -5.09264 4.88857 -1.04 grad 8.163082 13.86759 0.59 p_off_campus 98.04768 220.5341 0.44 d_religaff -2779.32 14385.28 -0.19 d_4year -4261.03 6478.868 -0.66 d_libarts 5589.648 14445.91 0.39 -11868.9** 5221.336 -2.27 -157877 90908.06 -1.74 ###All other variables### d_nonacademic _cons Note: All house and neighborhood characteristic variables were included in the regression, but not shown on the table. All variables not shown have the similar significance level as the first regression in Table 4. Full results are vailable upon request. Note: *significant at the 90% level, ** significant at the 95% level, *** significant at the 99% level 36 Table 8– College Variables, Third Regressions 2 Regression Number of obs = 1394 F( 52, 1341) = 21.17 Prob > F = 0 R-squared = 0.6195 Root MSE = 40163 price dist1 dist1sq dist1cube mtg t_yearbuilt Coef. Std. Err. t -21.0707* 0.008508 -1.01E-06 0.292217 -372.481 12.57311 0.006929 1.07E-06 0.077346 115.0718 -1.68 1.23 -0.94 3.78 -3.24 6875.036 -4.67581** 11.05664* -83.018 -2037 -13145*** -156087 4492.823 2.08444 5.780309 80.74098 6016.986 4881.313 90947.79 1.53 -2.24 1.91 -1.03 -0.34 -2.69 -1.72 ### All other variables ### d_private undergrad grad p_off_campus d_4year d_nonacade~c _cons Note: All house and neighborhood characteristic variables were included in the regression, but not shown on the table. All variables not shown have the similar significance level as the first regression in Table 4. Note: *significant at the 90% level, ** significant at the 95% level, *** significant at the 99% level 37
© Copyright 2026 Paperzz