Quantile Regression Methodology Quantile regression is based on the minimization of weighted absolute deviations (also known as L_1 method) to estimate conditional quantile (percentile) functions (Koenker and Bassett, 1978; Koenker and Hallock, 2001). For the median (quantile = 0.5), symmetric weights are used, and for all other quantiles (e.g., 0.1, 0.2 ....., 0.9) asymmetric weights are employed. In contrast, classical OLS regression (also known as L_2 method) estimates conditional mean functions. Unlike OLS, quantile regression is not limited to explaining the mean of the dependent variable. It can be employed to explain the determinants of the dependent variable at any point of the distribution of the dependent variable. For hedonic price functions, quantile regression makes it possible to statistically examine the extent to which housing characteristics are valued differently across the distribution of housing prices. One may argue that the same goal may be accomplished by segmenting the dependent variable, such as house price, into subsets according to its unconditional distribution and then applying OLS on the subsets, as done, for example, in Newsome and Zietz (1992). However, as clearly argued by Heckman (1979), this “truncation of the dependent variable” may create biased parameter estimates and should be avoided. Since quantile regression employs the full data set, a sample selection problem does not arise. Quantile regression generalizes the concept of an unconditional quantile to a quantile that is conditioned on one or more covariates. Least squares minimizes the sum of the squared residuals, {}02,0minkjjkijjibijybx==−∑∑, 6where yi is the dependent variable at observation i, xj,i the jth regressor variable at observation i, and bj an estimate of the model’s jth regression coefficient. By contrast, quantile regression minimizes a weighted sum of the absolute deviations, {}0,0minkjjkijjiibijybxh==−∑∑, where the weight hi is defined as 2ihq=if the residual for the ith observation is strictly positive or as 22ihq=−if the residual for the ith observation is negative or zero. The variable q(01)q<< is the quantile to be estimated or predicted. The standard errors of the coefficient estimates are estimated using bootstrapping as suggested by Gould (1992, 1997). They are significantly less sensitive to heteroskedasticity than the standard error estimates based on the method suggested by Rogers (1993).3Quantile regression analyzes the similarity or dissimilarity of regression coefficients at different points of the distribution of the dependent variable, which is sales price in our case. It does not consider spatial autocorrelation that may be present in the data. Because similarly priced houses are unlikely to be all clustered geographically, one cannot expect that quantile regression will remove the need to account for spatial autocorrelation. 3 The quantile regressions employ the “sqreg” command in Stata for seed 1001. 7In this paper, spatial autocorrelation is incorporated into the quantile regression framework through the addition of a spatial lag variable. The spatial lag variable is defined as Wy, where W is a spatial weight matrix of size TxT, where T is the number of observations, and where yis the dependent variable vector, which is of size Tx1. Any spatial weight matrix can be employed, for example, one based on the ith nearest neighbor method, contiguity, or some other scheme. In the present application, a contiguity matrix is used.4Adding a spatial lag to an OLS regression is well known to cause inference problems owing to the endogeneity of the spatial lag (Anselin, 2001). This is not any different for quantile regression than for OLS. We follow the approach suggested by Kim and Muller (2004) to deal with this endogeneity problem in quantile regression. As instruments we employ the regressors and their spatial lags.5 However, instead of using a density function estimator for the derivation of the standard errors, we follow the well established route of bootstrapping the standard errors (Greene, 2000, pp. 400-401).64. Data and Estimation Results This study uses multiple listing service (MLS) data from the Orem/Provo, Utah
đang được dịch, vui lòng đợi..