Regression Methods in Biostatistics: Linear, Logistic, Survival and Repeated Measures Models

0
494

Being newly immersed in the upstream part of the oil business, I just recently had my first work session with data in ARC–GIS®. The project involves subsurface geographical modeling. Obviously I had considerable interest in discovering if the methodology in this book would enhance my modeling capabilities. The book is for social scientists, but I had no difficulty imagining my own important oil exploration application within the framework of geographically weighted regression (GWR). The first chapter nicely explains what is unique in this book. A standard regression model using geographically oriented data (the example is housing prices across all of England) is a global representation of a spatial relationship, an average that does not account for any local differences. In y = f (x), imagine a whole family of f ’s that are indexed by spatial location. That is the focus of this book. It is about one form of local spatial modeling, which is GWR. A more general resource for this topic is the earlier book by Fotheringham and Wegener (2000), which escaped the notice of Technometrics. Imagine a display of model parameters in a geographical information system (GIS) and you will understand the focus for this book. The authors note, “only where there is no significant spatial variation in a measured relationship can global models be accepted” (p. 10). The second chapter develops the basis of GWR. It analyzes the housing sales prices versus the 33 boroughs in London and begins by fitting a conventional multiple regression model versus housing characteristics. The GWR is motivated by differences in the regression models fitted separately by borough. The GWR is a spatial moving-window approach with all data distances weighted versus a specific data point using a weighting function and a bandwidth. A GIS can then be used to evaluate the spatial dependency of the parameters. As in kriging, local standard errors also are calculated. The chapter also provides all the math. Chapter 3 comprises several further considerations: parameters that are globally constant, outliers, and spatial heteroscedasticity. The first issue leads to hypothesis tests for model comparison using an Akaike information criterion (AIC). Local outliers are hard to detect. Studentized (deletion) residuals are recommended. The outliers can be plotted geographically. Robust regression is suggested as a less computationally intensive alternative. Hetereoscedasticity is harder to handle. Chapter 4 adds statistical inference to the capabilities of GWR: both a confidence interval approach using local likelihood and an AIC method. Four additional methodology chapters present various extensions of GWR. Chapter 5 considers the relationship between GWR and spatial autocorrelation, and includes a combined version of GWR and spatial regression using some complex hybrid models. Chapter 6 examines the relationship of scale and zoning problems in spatial analysis to GWR. Chapter 7 introduces the use of initial exploratory data analysis using geographically weighted statistics, which are based on the idea of using a kernel around each data point to create weights. Univariate statistics and correlation coefficients are defined for exploring local patterns in data. A final set of extensions in Chapter 8 discusses regression models with non-Gaussian errors, logistic regression, local principal components analysis, and local probability density estimation. The methods all use some kind of distributional model. The million-dollar question for me is always, “What about software?” The authors have a stand-alone program, GWR 3, available in CD–ROM by contacting the authors. Basically the drill with GWR 3 is to gather your data, use Excel to transform and reformat the data for GWR 3, use GWR 3 to produce a set of coefficients, and feed those coefficients to your favorite GIS to produce your maps. Forty pages of discussion about using the software are provided. A final epilogue chapter also discusses embedding GWR in R or Matlab and includes some references to people who have done that type of work. I probably would not have read this book if I had not happened to have had it in my briefcase on a visit with the exploration technologists. Though inclusive of appropriate mathematical development, this material is readily approachable because of the many illustrations and the pages and pages of GIS displays. The authors unabashedly present much of the material as their developmental work, so GWR offers a lot of opportunity for research and further development through novel applications and extensions.