NEIGHBORHOOD CRIME AND TRAVEL
BEHAVIOR:
AN INVESTIGATION OF THE INFLUENCE OF
NEIGHBORHOOD CRIME RATES ON MODE
Christopher E. Ferrell, Ph.D.
Shishir Mathur, Ph.D.
Emilia Mendoza
a publication of the
| 1. Report No. FHWA 2006-33 | 2. Government Accession No. | 3. Recipient’s Catalog No. | |
|---|---|---|---|
| 4. Title and Subtitle Neighborhood Crime and Travel Behavior: An Investigation of the Influence of Neighborhood Crime Rates on Mode Choice | 5. Report Date April 2008 | ||
| 6. Performing Organization Code | |||
| 7. Authors Christopher E. Ferrell, Ph.D.; Shishir Mathur, Ph.D.; Emilia Mendoza | 8. Performing Organization Report No. | ||
| 9. Performing Organization Name and Address Mineta Transportation Institute College of Business San José State University San José, CA 95192-0219 | 10. Work Unit No. | ||
| 11. Contract or Grant No. 65W136 | |||
| 12. Sponsoring Agency Name and Address California Department of Transportation Sacramento, CA 95815 U.S. Department of Transportation Research and Innovative Technology Administration 400 7th Street, SW Washington, DC 20590-0001 | 13. Type of Report and Period Covered | ||
| 14. Sponsoring Agency Code | |||
| 15. Supplementary Notes | |||
| 16. Abstract While much attention has been given to the influence of urban form on travel behavior in recent years, little work has been done on how neighborhood crimes affect this dynamic. This research project studied seven San Francisco Bay Area cities, and found substantiation for the proposition that neighborhood crime rates have an influence on the propensity to choose non-automotive modes of transportation for home-based trips. Specifically, high vice and vagrancy crime rates were associated with a lowered probability of choosing transit in suburban cities for both work and non-work trips, high property crime rates were associated with a lower probability of walking for work trips in urban cities and inner-ring suburban cities, high violent crime rates with a lower probability of walking for work trips in suburban study cities, while higher property crime rates in San Francisco were associated with an increased probability of walking for non-work trips. While the signs of these significant relationships generally conformed to the author’s expectations—i.e., that high crime rates reduce the probability of choosing non-automotive modes of travel—the authors did not find statistically significant relationships for all city/trip model runs, suggesting that these relationships differ depending on the urban form and trip type contexts. | |||
| 17. Keywords Crimes; Property crimes; Travel behavior; Travel by mode | 18. Distribution Statement No restriction. This document is available to the public through the National Technical Information Service, Springfield, VA 22161 | ||
| 19. Security Classif. (of this report) Unclassified | 20. Security Classif. (of this page) Unclassified | 21. No. of Pages 96 | 22. Price $15.00 |
All rights reserved
Library of Congress Catalog Card Number:
To order this publication, please contact the following:
Mineta Transportation Institute
College of Business
San José State University
San José, CA 95192-0219
Tel (408) 924-7560
Fax (408) 924-7565
E-mail: mti@mti.sjsu.edu
http://transweb.sjsu.edu
We would like to thank the leadership and staff at the Mineta Transportation Institute for their support of all phases of this work, including MTI Research Director Trixie Johnson and Research and Publications Assistant Sonya Carter-Cardenas (posthumously). We would also like to thank Emelia Mendoza, Charlie Chapin and Lakshimi Guruvayurappan, San José State University students who worked on and contributed to this project.
Additional thanks are offered to Dr. Anastasia Loukaitou-Sideris for allowing us to use her original photographs on pages 6 and 7.
Special thanks to Professor Earl Bossard and Steven Colman for their guidance and help in planing and launching this research effort. Many thanks to the city police departments that participated in and provided data for this study, including San Francisco, Oakland, Berkeley, Sunnyvale, Cupertino, Walnut Creek, Concord and Richmond.
Thanks are also offered to MTI staff, including Communications Director Donna Maurillo and Graphic Artist Sahil Rahimi. Editing and publication services were provided by Mary Buuck and Catherine Frazier.
OVERVIEW
STUDY PURPOSES
STUDY RESULTS
THE EFFECTS OF CRIME ON TRAVEL BEHAVIOR
THE EFFECTS OF CRIME ON PHYSICAL ACTIVITY
PERCEPTIONS AND CRIME
ENVIRONMENTAL FACTORS
RESEARCH OBJECTIVES
DATA SOURCES
TRAVEL BEHAVIOR MODELING APPROACH
DATASET PREPARATION
STUDY HYPOTHESES AND RESEARCH QUESTIONS
SUMMARY STATISTICS OF MODEL VARIABLES
TRANSIT MODEL TRIPS DATA
TRANSIT MODEL FACTOR ANALYSIS
PEDESTRIAN AND BICYCLE MODEL FACTOR ANALYSIS
BINARY LOGISTIC MODEL RUN RESULTS
SUMMARY AND CONCLUSIONS
APPENDIX C: COVER LETTER TO AGENCIES
ENDNOTES
ABBREVIATIONS AND ACRONYMS
ABOUT THE AUTHORS
PEER REVIEW
OVERVIEW
While there is a rich collection of studies that have investigated the connections between crimes, urban form, and socio-demographics, as well as a growing body of recent research on the links between urban form and the propensity to walk and exercise, little work has been done to look at the interactions between neighborhood crimes, urban form, and travel behavior. This exploratory research study collected individual crime records data from seven San Francisco Bay Area police departments for the year 2000 (or if unavailable, 2001) and calculated crime rate statistics by neighborhoods (as represented by Travel Analysis Zones, or TAZs) in these cities. These crime rates were then merged with travel survey data from the Bay Area Travel Survey for the year 2000, collected and provided by the Metropolitan Transportation Commission. A set of urban form and transit accessibility variables also were calculated and merged with the travel and crimes dataset in the form of a factor analytic score variable that removed most of the multicollinearity (where the predictor variables in a multiple regression are themselves highly correlated) effects found between these variables. These crime rate, urban form and transit accessibility, and other socio-demographic control variables were then analyzed using logistic regression techniques to identify the effects of neighborhood crime rates on mode choice.
The primary purpose of this study is to identify if there is a relationship between neighborhood crime rates and the propensity to choose non-automotive modes of travel for home-based trips. We hypothesize that people living in high-crime neighborhoods would be less likely to choose walking, bicycling, or transit.
This study also sought to look for relationships between urban form and crime rates. Because high-density, pedestrian-friendly, transit-rich neighborhoods tend to increase non-auto mode share, if crimes also tend to cluster in these areas, then we may find that there is a non-causal, positive correlation between crime rates and auto mode choice.
We also hypothesize that different crime types may have different spatial distributions. We could expect that violent and property crimes would have different patterns of distribution throughout urban space that may depend on the nature of the physical environment.
This project has the following policy and research implications:
This exploratory study covered seven San Francisco Bay Area cities—ranging from the urban core environment of San Francisco to suburban communities such as Concord and Sunnyvale— and found substantiation for the proposition that neighborhood crime rates have an influence on the propensity to choose non-automotive modes of transportation for home-based trips. Specifically, high vice and vagrancy crime rates were associated with a lowered probability of choosing transit in suburban cities for both work and non-work trips, high property crime rates were associated with a lower probability of walking for work trips in urban/inner-ring suburban cities, high violent crime rates with a lower probability of walking for work trips in suburban study cities, and higher property crime rates in San Francisco were associated with an increased probability of walking for non-work trips. While the signs of these significant relationships generally conformed to our expectations (i.e., that high crime rates reduce the probability of choosing non-automotive modes of travel), we did not find statistically significant relationships for all city/trip-type model runs, suggesting that these relationships differ depending on the urban-form and trip-type contexts.
Literature review activities for this project focused on identifying studies that measured the effects of neighborhood crime on travel behavior. Because there are very few studies that have specifically addressed this question, studies also were evaluated that measured the effects of crime on physical activities. These studies suggested that there may be a difference between the effects of the actual number of crimes in a neighborhood (i.e., the crime rate) and the perceptions people have of the threats of crime for them personally. Studies of perception and crime and the effects of the physical and social environments on crime rates and perceptions were identified and analyzed.
While there are very few studies that have directly investigated the effects of crime on travel behavior, a few are noteworthy. A common thread found in these studies is a complex interaction between urban environment, crime levels, perceptions of crime, and travel behavior.
Atkins et al. studied the effects of street lighting on neighborhood crime levels and perceptions of crime in the London borough of Wandsworth. They found no detectable changes in travel behavior among neighborhood residents. Specifically, people still seemed to engage in the same patterns of avoiding certain streets and places in both the before and after conditions of the study despite the fact that poor lighting fell from the most frequently cited reason for avoiding these areas to a minor ranking among reasons listed. 1 These results suggest that changes to the physical environment alone (i.e., improved street lighting) might not be sufficient to reduce residents’ fear of crime in certain locations and will not encourage them to walk in these areas.
Nevertheless, our perceptions of safety appear to be intimately connected to our assessment of our physical environment. Different behavioral responses to the introduction of street lighting were provided by Painter, who conducted surveys of residents in two neighborhoods in London, UK, “before” and “after” street lighting improvements were made. She found that incidents of crime and disorder as well as the general fear levels of crime dropped markedly while pedestrian activity increased significantly after dark in the study areas after lighting improvements. 2
Research conducted by Ingalls et al. 3 suggests that the different behavioral responses to street lighting found by Atkins et al. 4 and Painter 5 may be explained by differences in the urban context of the study areas they worked in. Ingalls et al. studied how concerns for personal safety affect people’s propensity to ride transit in small-city environments. Their results suggest that our culture’s perceptions of urban environments play a key role in determining a sense of personal safety and a willingness to use transit. They surveyed both residents and bus riders in Greensboro, North Carolina, and found that the city’s residents rarely used transit (i.e., most transit riders were from out of town). While both groups were found to be concerned for their personal safety and residents were two to three times as concerned as bus riders, neither was specifically concerned for the safety of the transit system itself, but rather were most concerned for their safety in their communities as a whole. The authors concluded that people associate their fear of crime and feelings of insecurity in downtown areas with the bus system even though they may feel that the bus system itself is safe. They further concluded that this fear of crime is a major impediment to transit ridership growth. 6
This conclusion is supported by the findings of Yoh et al. They studied the factors that contributed to the nationwide gains in transit ridership seen during the economic boom times of the 1990s. They found that among other factors, a major increase in immigrants living near subway stations and a reduction in crime and fare evasion contributed to increased ridership on some transit systems. 7
A related body of research has studied the effects of the physical environment and neighborhood crime on people’s propensity to engage in physical activities. This area of research has seen a burst of activity over the last decade. A common element in these studies are findings that women and ethnic minorities are likely to consider neighborhood crime levels as a significant impediment to their willingness to engage in regular, physical activities (e.g., bicycling and walking).
However, a number of studies have found seemingly contradictory results with regard to the importance of neighborhood crime levels in determining physical activity patterns. The results of Wilcox et al. are a good example. They compared the leisure time physical activity of rural and urban women in the United States, and found that the key environmental barriers to leisure time physical activity for urban women are a lack of sidewalks and streetlights, high crime, a lack of access to exercise facilities, and infrequently seeing others exercise in their neighborhood, among other factors. Rural women were significantly more likely to report the presence of unattended dogs as an important impediment. While these univariate statistical findings point to crime as one key factor that correlates with physical activity levels, multivariate analyses did not find crime among the significant determinants of a sedentary lifestyle for either rural or urban women. 8
The findings of several studies have suggested that gender, age, and race combine to form an intricate web of causality underlying how neighborhood crime levels affect the propensity to exercise. King et al. did not find a significant role for crime levels in influencing inactivity in middle-aged and older American women. 9 However, in a study that focused on the inactivity levels of adolescents in the United States, Gordon-Larsen et al. found that high neighborhood crime levels were associated with a decreased likelihood of study adolescents falling into the highest category of moderate to vigorous activity levels. 10
These apparent contradictions may be explained by the different effects of actual, reported crimes and a person’s perceptions of the dangers of crime in their neighborhood. The findings of Humpel et al. support this distinction. Their review of the existing literature found that while reported crime levels were not statistically related to the propensity to exercise, residents’ perceptions of their relative levels of safety from crime in their neighborhood was a statistically significant factor. 11 Race appears to be another important variable mediating perceptions of safety and physical activity levels. In another meta-analysis, Seefeldt et al. reviewed the research to date on the causes of physical inactivity. They found strong evidence that high crime rates and fears for personal safety were two important factors that have proven significant in reducing levels of physical activity among ethnic minorities. 12 Similarly, Eyler et al. studied physical activity levels in women from a variety of ethnic backgrounds and found that safety from crime (i.e., the perception of crime levels) and the presence of sidewalks were two of a small number of significant environmental factors that correlated with higher levels of physical activity among African American women. 13 These findings suggest that perceptions of crime can be just as important (if not more so) as reported crimes in affecting how willing people are to walk or bicycle in their neighborhoods. They also suggest that just as perceptions are influenced by our physical environment (e.g., crime levels), they also may be influenced by the social and psychological constructs that result from race, gender, education levels, and age.
Evidence that perceptions of crime may be a more important determinant of travel behavior than reported crime levels (e.g., Seefeldt et al., 14 Eyler et al. 15 ) leads to two questions:
Research suggests that there are two primary factors that influence both our perceptions of neighborhood crime: the physical and the social environments.
Wilson and Kelling proposed the now famous “broken windows” theory of neighborhood deterioration and crime. They suggested that neighborhoods that provide a space where “small” or relatively less serious crimes are tolerated or go unpunished send a message to criminals that this is an area where they can successfully commit more serious crimes. Therefore, signs of neighborhood disrepair—such as a broken window that remains un-repaired or an abandoned car that is not towed away—cause residents to feel less safe and leads to a reduced level of community involvement and vigilance, creating a fertile environment for more serious criminal activity. 16 This theory has had a profound impact on the approach to crime deterrence in the United States. While previous efforts largely concentrated on crime deterrence through punishments of the penal system, Wilson and Kelling’s theory turned attention towards preventing crimes by altering our perceptions of the physical environment and its likelihood to support or deter criminal behavior. Kelling and Sousa provide support for the broken windows theory in their study of the causes of sharp declines in crimes seen in New York City in the 1990s. They found that these declines were not due to the improving economy, an aging population, and declining crack cocaine use, as had been suggested, but rather that laws against minor crimes, known as “broken windows” policing, was a statistically significant cause of the decline in violent crimes. 17
Research by Doran and Lees has drawn a direct link between perceptions of neighborhood disorder and crime levels in New South Wales, Australia. Their findings suggest that graffiti, one of the most prevalent forms of physical disorder found, was most spatially correlated with concentrations of crime. 18
Research on crime at the ten most dangerous (from a crimes perspective) Los Angeles bus stops by Loukaitou-Sideris found a long list of “negative” environmental attributes that contribute to a sense of fear on the part of bus riders, including a lack of “defensible space” at these locations ( Figure 2 ). Most of these ten bus stops were located in downtown commercial areas at the intersections of multi-lane streets, and are often not visible from nearby shops and lack adequate lighting, pubic phones, or a nearby police presence. Many are located near vacant lots and abandoned buildings, with easy escape routes for criminals in alleys and mid-block connections, and generally dilapidated conditions (i.e., “broken windows”). 19
Figure 1 Typical High-Crime Bus Stop
Figure 2 Typical Low-Crime Bus Stop
However, several researchers have concluded that perceptions of neighborhood disorder (i.e., the physical environment) are less important than the social and economic conditions of the neighborhood in question. Sampson and Raudenbush performed a longitudinal study of crime and neighborhood disorder in 1,966 Chicago neighborhoods. They found that both crime and physical disorder were a result of two other social factors: concentrated poverty and what they termed “collective efficacy.” They defined collective efficacy as the level of social cohesion among neighborhood residents and their ability to establish and maintain a set of accepted norms that govern the control of public spaces there. 20 These results suggest that while perceptions of the physical environment may play a role in determining crime levels, the social and economic constructs of the neighborhood may play a more important role.
As suggested in the previous discussion of perceptions of crime and levels of physical activities, perceptions of neighborhoods and their relative safety from crime are determined by both the characteristics of the perceiver and the characteristics of the neighborhood. Taylor conducted a longitudinal study of the links between social disorder, physical disorder, fear of crime, and incidence of crime. He found that in neighborhoods with high property values, property crimes decreased faster or increased more slowly than they did in less economically well-off neighborhoods. In general, the amount of physical and social disorder in each neighborhood at the beginning of the study period did not affect changes in the fear of crime in study neighborhoods; rather, the economic status of the neighborhoods appeared to play the most important role in the levels of fear of crime there. 21
Transit Environs
Cozens et al. used virtual reality walkthrough scenes to test people’s fear of crime in the British rail system environs, and found that rail station designs that provided high levels of visibility for passengers were perceived as offering high levels of perceived safety. 22 They concluded that station designs that provide high visibility are good examples of effective crime prevention through environmental design.
The Effects of Street Lighting on Crime and Perceptions of Crime
Seen as a relatively inexpensive means to reduce neighborhood crime, a number of researchers have studied the effects of enhanced street lighting on crime levels. As described earlier, Atkins et al. found no change in people’s perceptions of safety in the study neighborhood after street lighting was introduced. 23
Wallace et al. studied the effects of transit safety measures, including improved lighting in transit facilities and vehicles, on passenger levels of perceived safety. They found that increased police presence and improved lighting were two of the most highly visible interventions studied and the most effective in terms of reducing the safety concerns of transit patrons. 24
According to Farrington and Welsh, there are two hypothetical reasons for why improved street lighting would have a beneficial effect on crime levels. The first reason is that improved lighting encourages surveillance of potential offenders on the street both through improved visibility and by increasing the number of people on the street in general. The second reason is that improved lighting sends a signal to potential criminals and the community in general that the neighborhood is improving and that there will be increased community pride, cohesiveness, and informal social controls. They performed a meta-analysis of sixteen studies of the effects of street lighting on crime. In their analysis of eight U.S. studies, they found mixed results: roughly half showed a significant effect of improved lighting on crime whereas the other half found none. They found no clear reasons for these differing results, although those studies that found a significant effect were more likely to have measured the crime levels during both daytime and nighttime periods. 25 This suggests that the beneficial effects of street lighting may be due to the second reason mentioned by Farrington and Welsh—that improved lighting affects community perceptions of the neighborhood, sending a signal that the area is improving and strengthening the informal social controls there.
Crime Prevention Through Environmental Design (CPTED)
Even before Wilson and Kelling first proposed their broken windows theory of neighborhood deterioration and crime, 26 researchers had begun to investigate ways of altering the physical environment to reduce crime. According to Clarke, traditional criminological theories concentrated on criminality and delinquency and did not pay attention to crime itself. More specifically, any theory of crime should explain and describe the interactions between the propensity for criminal behavior (i.e., criminality) and the opportunities for crime presented in the environment. Traditional criminology has assumed that explaining the behavioral dispositions for criminal behavior is the same as explaining crime. Based on this opportunity-based theoretical perspective, Clarke listed four different objectives to reduce crime opportunities: (1) to increase the perceived difficulty of crime, (2) to increase the perceived risks of crime, (3) to reduce the anticipated rewards of crime, and (4) to remove excuses for crime. 27
Some of the first researchers to articulate the relationships between crime and environment were Mayhew et al. 28 and Jeffery, 29 who proposed that crime prevention should be approached from the perspective of reducing the opportunities for crime rather than on enforcement and sentencing. Crime prevention was therefore a matter of redesigning urban physical spaces to reduce the opportunities for crime. This approach has become known as “Crime Prevention Through Environmental Design” or “CPTED.” Since the early 1970s, a number of crime researchers and practitioners have worked to articulate and refine specific CPTED interventions, techniques, and principles.
Newman was the first to articulate the theory of “defensible space,” which has become an organizing principle of CPTED. Defensible space is the concept that people feel safe from crime in environments that allow them to mark out and protect their territory, and feel that they can easily see and monitor all non-private spaces around them. His initial research focused on large, high-rise apartment buildings. Newman found that high-rise buildings with lobbies, fire escapes, roofs, and corridors that are hidden from public view had much higher crime rates than did low-rise buildings. He proposed that apartment blocks should be designed to maximize the amount of public space under public surveillance at all times. He also proposed three critical factors that linked crime and public housing design: territoriality, natural surveillance, and image and milieu. The first, territoriality, asserted that people naturally mark out and protect their territory. He proposed that physical design should encourage this tendency and that there should be clear demarcations between spaces intended for public, private, and other shared uses. His conceptualization of natural surveillance proposed that people who are engaging in their natural territorial tendencies should be encouraged by a physical design that allows them to easily see all non-private parts of their housing development. Image and milieu refer to the poor image of many housing projects, which in turn create opportunities for criminal activities there. To counteract these negative images, housing projects must be well integrated into the surrounding neighborhoods. 30
Geason and Wilson placed emphasis on physical design changes to residences and neighborhoods as opposed to increased police activities as an important and effective means to reducing crime. They noted that traditionally, increasing criminal activities have been met with increased policing and tougher sentencing to punish criminals after the crimes already have taken place. They listed a number of physical design elements that are potentially effective at reducing neighborhood crime: houses and their entrances are situated so they are clearly visible from the street, sufficient street and property lighting, children’s play areas that are clearly visible from residences, wide and straight streets that are easy for patrolling police to observe, residences have off-street parking that is visible from the owner’s house, the use of cul-de-sacs to control access to homes, residences are designed with “defensible space” by providing adequate building setbacks, clustered houses, the intended use of space is clear, and adequate recreational space for social cohesion. 31
Newman and Franck used path analysis to identify a number of factors influencing crime and instability in housing sites in urban areas across the United States: socioeconomic characteristics, management effectiveness, quality of city police and security services, and form of ownership. Supporting the CPTED perspective of Newman’s earlier work, they found that both physical (i.e., built environment) and social factors accounted for the most variation in the path analysis models. The two physical factors were the size of the development and the number of units sharing a common building entrance. The two social factors were the number of families on welfare and the ratio of teens to adults in the development. These factors together accounted for roughly 69 percent of the fear, 67 percent of the community’s instability, and 39 percent of the crime against persons. 32
Newman also reported on the results of an effort to reduce crime in the Dayton, Ohio, neighborhood of Five Oaks. Newman’s plan as implemented was to restrict automobile traffic to the neighborhood and break it up into “mini-neighborhoods,” thereby enhancing its defensible space. Gates were installed at key entrance points to the new mini-neighborhoods, excluding cut-through automobile traffic but allowing pedestrian access. One year after implementation, the city found that 67 percent of cut-through traffic through Five Oaks had been reduced and that traffic accidents had been reduced by 40 percent. Reported crimes in the neighborhood also had been reduced by 26 percent and violent crimes by 50 percent while crime in the city of Dayton as a whole went up 1 percent over the same period. Fears of crime displacement from the study area to surrounding neighborhoods also were shown to be unfounded since crime in the communities immediately surrounding Five Oaks dropped by
1.2 percent during the same period. A university survey of residents in Five Oaks found that 53 percent thought that there was less crime and that 4 percent felt safer, 33 suggesting that neighborhood design can play an important role in preventing crimes.
Further support for the CPTED perspective comes from Carter et al. They studied the effects of zoning, physical design changes, and community policing initiatives in the “crime-ridden” North Trail area of Sarasota, Florida. With local-resident and business-owner cooperation, city planners created a new zoning ordinance that required all new developments to submit site plans with design elements based on CPTED principles. Recommendations (which were often willingly complied with) included outside lighting, landscaping that allowed visibility, mixed uses, porches, balconies, and residential space above retail to allow “eyes on the street,” and shared parking. Analysis of local land-use links to crime revealed that prostitution was enabled in the area by the presence of an abundance of small hotels. Review of these sites revealed that many were unable to renovate and expand due to restrictive street-setback requirements, and parking and drainage requirements that greatly increased the costs of renovating old businesses or building new ones. Focused police interventions included working closely with local business owners and residents, high-visibility patrols, and undercover investigations to identify and arrest pimps and drug dealers. The study looked at changes in four measures of crime over a 9-year period in the study area and the rest of Sarasota: calls for police service, crimes against persons or property, narcotics crimes, and prostitution. Using linear regression techniques, the researchers found that calls for police service fell in the North Trail area and rose in the rest of the city. The changes in the number of crimes against people or property fell in both the study area and the city, and were statistically indistinguishable. While the changes in the number of narcotics crimes in both areas rose during the study period, the rate of increase in the North Trail area was significantly lower than that for the city. Finally, the number of prostitution police reports during the study period fell in the North Trail area and rose in the city as a whole. 34
A study by Loukaitou-Sideris et al. speaks directly to the influences of the social environment on crimes and focused in particular on neighborhoods surrounding transit stations. They found that there were more crimes against people at stations within low-income neighborhoods, with more persons per household, and higher concentrations of youth than in comparison neighborhoods. The researchers also found a strong correlation between station crime and the presence of liquor stores in the station neighborhood. In addition, they found that the busiest stations (i.e., those with the highest transit ridership) tended to incur the most serious crimes. Less serious crimes, such as vandalism, tended to be concentrated at stations in dense neighborhoods with high percentages of the population with less than a high school education. Taken together, these two studies indicate that the ridership levels, station area design and environmental characteristics, and neighborhood characteristics play a role in determining crime levels at transit stations. 35
Transit Crime
When the decline of transit use in the United States during the post-war period is considered, explanations often point to people’s associations of transit with dense, often crime-ridden, urban areas. With the growth of the suburbs came the commonly held perception of these new neighborhoods as sanctuaries from the crime that resides in older urban areas. Furthermore, the lack of transit in suburbs often leads people to associate transit with crime. The expansion of transit lines into wealthy, suburban areas is often fought by locals fearing that transit services will import crime into their neighborhood.
Research on this subject provides somewhat conflicting evidence on whether there is a causal link between transit and crime. Liggett et al. studied the effects of the introduction of light rail service along the Los Angeles Green Line on crime levels in its surrounding neighborhoods. This line passes through low-income, high-crime areas and terminates in the affluent areas of west Los Angeles. The researchers analyzed five years of crime data in the neighborhoods surrounding the Green Line “before” and “after” its introduction. They found that the transit line did not have a significant effect on crime trends or crime dislocations in the station areas, and did not transport crimes from high-crime areas to low-crime areas. 36
However, Block and Davis mapped and compared street robberies in four Chicago police districts with rapid transit stations—two with low overall crime rates and two with high-crime rates. In the low-crime districts, street robberies were concentrated near the rapid transit stations while in the high-crime districts, street robberies tended to be more dispersed. Street robberies near the stations in the low-crime districts also tended to have a more temporal pattern, with most incidents occurring during the off-peak transit ridership hours when there were fewer police patrols and observers. 37 These findings suggest that crimes may indeed concentrate around rapid transit stations in low-crime areas, taking advantage of the spatial and temporal concentration of pedestrians. These conclusions are supported by Loukaitou-Sideris et al. They studied crime patterns at light rail stations in Los Angeles and found that the busiest stations (i.e., those with the highest transit ridership) tended to suffer from concentrations of the most serious crimes. Less serious crimes, such as vandalism, tended to be concentrated at stations in dense neighborhoods with high percentages of the population with less than a high-school education. Taken together, these studies indicate that the ridership levels, station area design and environmental characteristics, and neighborhood characteristics play a role in determining crime levels at transit stations. 38
RESEARCH OBJECTIVES
This research project has several objectives, which focus on different aspects of identifying the relationships between neighborhood crime and mode choice and identifying the potential for routinely collecting, analyzing, and incorporating crime measures into the transportation modeling and policy arenas. The objectives are:
The objectives listed earlier served to guide our efforts at identifying and collecting the appropriate data sources for this project. Accordingly, this research focused on developing binomial logistic mode choice regression models to determine the influence of neighborhood crime and urban form on the choice of non-automotive modes. We sought disaggregate crime data, ideally geocoded to specific street addresses. Starting in January 2006, the police departments of thirty-six cities in the San Francisco Bay Area were contacted via e-mail or a letter requesting crime data for the year 2000. Of the 36 cities contacted, seven (Berkeley, Concord, Oakland, Santa Clara, Walnut Creek, San Francisco, and Sunnyvale) ultimately shared their data.
Crime Categories
The UCR Program was established by the federal government to coordinate the collection of crime data at local, state, and federal levels. The UCR defines two categories of crimes: Parts 1 and 2.
Part 1 crimes are considered the more serious crimes and are therefore most likely to be reported by law enforcement agencies. 40
Part 1 crimes are considered the more serious crimes and include the following offenses:
Abbreviations for these categories are respectively P1V (Part 1 violent) and P1P (Part 1 property).
As Part 2 crimes are described as all other crimes outside of Part 1 crimes, the list given in the UCR Handbook is comprehensive. Based on these UCR categories, we developed a more fine-grained list of crime categories for the purposes of this study to group Part 2 crimes.
The five Part 2 categories were determined as follows.
Final List of Crime Categories
Thus altogether, seven categories were developed to group Part 1- and Part 2-type crimes. The seven categories and their abbreviations are:
A detailed list of these crime categories and their constituent crime types is provided in Table 32 in Appendix A
In searching for a travel survey data source for this research, priority was placed on obtaining data that reported the amount of each individual’s activity and travel behavior as discrete records, including detailed individual and household demographic information for survey participants and geographically precise data on residential, employment, and other recorded activity information. Because we requested crime data from San Francisco Bay Area police departments, we needed travel and activity data for Bay Area residents as well. Data sources that were reviewed included U.S. Census Journey to Work data, and the Metropolitan Transportation Commission’s (MTC) Bay Area Travel Survey (BATS) conducted in 2000. We ultimately selected the BATS 2000 dataset for two reasons: First, because Journey to Work data are provided in aggregate form, they are not suitable for use in a disaggregate mode choice model. Second, it is a distinct possibility that neighborhood crime rates may have different effects on different trip purposes. Because the Census data only reports commute trips and the BATS 2000 data survey and report the full spectrum of trip types, we felt our research would benefit from a wider range of trip purposes.
This dataset provides detailed activity diary records for 14,563 households, which represents roughly 0.6 percent of the 2,429,257 total households in 1998 in the San Francisco Bay Area. The surveyors utilized a geographically stratified sample, with the stratification based on counties and MTC’s predefined traffic “superdistricts” within counties. To ensure a representative sample of the two counties with the lowest population densities—Napa and Marin—the surveyors chose to fix a minimum number of households (n = 600) for each of these counties. The other seven counties were randomly sampled according to the stratification method mentioned earlier.
These data are used by MTC to calibrate the region’s travel demand model. Because it contains detailed activity records for each individual, including travel purpose and mode choice, and detailed geographical location information for each activity, including trips, it can be combined with data on the distribution of employment to establish the relative accessibility of each surveyed residence to retail shopping opportunities.
To determine the influence of urban form on transit, pedestrian and bicycle mode choice, three measures of urban form were developed: the number of four-legged intersections per acre, the residential population per acre, and the residential and employment population per acre. For the residential and employment population density variables, we hypothesized that higher density values would promote the provision and use of non-auto modes by providing more local opportunities to use transit, walk and ride bicycles. For the four-legged intersection density measure, we hypothesized that the higher the density value, the more the neighborhood street network conforms to a traditional “gridiron” design that provides the greatest level of point-to-point connectivity within the neighborhood, reducing travel distances and encouraging the use of non-automotive mores. The greater point-to-point connectivity offered by a gridiron street network with a large number of four-legged intersections is shown in Figure 2, which shows the street patterns in a 9-square-mile area of San Francisco and Walnut Creek.
Figure 3 Gridiron Versus Suburban Street Network Patterns
The number of four-legged intersections per acre variable was calculated by counting the number of four-legged intersections per TAZ and then dividing the total count by the area of the TAZ. The street intersection map and the TAZ GIS map data files were both obtained from the MTC, and the number of employees per census tract data were obtained from the Association of Bay Area Governments (ABAG). Employment census tract data were converted to TAZ-level data using census tract to TAZ correspondence tables, also provided by MTC.
Both the residential and the residential plus employment population density variables were calculated by dividing the total residential or residential plus employment population of each study TAZ by the area of that TAZ. The TAZ-level residential data were obtained from MTC and the employment population data were obtained from the ABAG in census tract form. Using census tract to TAZ correspondence tables also provided by MTC, the employment per census tract estimates were converted to employment per TAZ estimates.
To determine the influence of urban geography and travel times on the transit, pedestrian, and bicycle mode choice, a measure of the relative accessibility to attractions around the Bay Area (e.g., shopping centers, central business districts, etc.) for each survey respondent in the BATS 2000 dataset was developed. Data on the geographical distribution of shopping opportunities were obtained from the ABAG, which provides estimates of employees at the TAZ level for the Bay Area. 41
Each household’s accessibility to attraction opportunities was calculated using a gravity-based measure based on the total number of employees, as shown in the following formula:
where:
Fij=Timeij
Jobs=# of jobs in TAZ
Time=network travel
i=residential zone
j= employment zone
–_= an empirically calculated friction factor using BATS 2000 data.
Preliminary pedestrian and bicycle mode choice models showed a high degree of multicollinearity between crime variables and the urban form/accessibility variables, and among the urban form/accessibility variables themselves. As a result, model runs with these variables included would often result in unanticipated changes in the sign and significance of these variables when one of its collinear partners was inserted or removed from the list of independent variables included in a model run. A number of approaches were tested, including factor variables that were developed from factor analysis using all of these crime and urban form/accessibility variables together to create two factor component variables: one representing crimes and the other representing urban form/accessibility. However, in testing these factor variables in the logistic model runs, we found that the crime factor variable would often produce a positive sign, indicating that increased crime rates were associated with an increased probability of survey participants choosing transit, bicycle, or pedestrian modes. Based on our earlier tests of logistic model runs, we suspected that the violent crimes variable was causing these unexpected results. Earlier logistic model tests with the violent crime variable would often produce a statistically significant and strongly positive relationship between pedestrian mode choice and high crime rates while the property crime variable was often negative and significant. We suspect that this may be due to an ecological fallacy, where violent crimes have a tendency to cluster in crime “hot spots” more than do property crimes, which are more spatially dispersed. Because these violent crime hot-spot clusters are likely to locate in dense, urban traffic analysis zones/neighborhoods where people are more likely to choose walking, bicycling, or transit, this may explain why we have found these counterintuitive results and lead use to conclude that the violent crime rate variable may not be a good choice for transportation mode choice modeling efforts.
As a result of these initial factor analysis runs, we chose to run the final logistic regression models with only the property crime rate variable, which we have concluded is more spatially dispersed and is less likely to cause an ecological fallacy. Multicollinearity between the three urban form/accessibility variables was removed by running factor analysis and creating a single urban form/accessibility factor variable. Each model type (i.e., pedestrian, bicycle, and transit) exhibited different patterns of multicollinearity between its independent variables. Furthermore, each model run with different groups of cities similarly displayed somewhat different patterns of multicollinearity between the urban form/accessibility and the crime variables. Therefore, for each model run, a separate factor analysis run and separate set of factor score variables were produced. After testing a number of combinations of urban form and accessibility measure variables, a combination of the following input variables yielded results consistent with theoretical assumptions without substantial collinear effects. These are the number of four-legged intersections per acre in each TAZ, the number of jobs plus the number of residents per acre in each TAZ, and the transit accessibility measure for each TAZ.
This study used a binary logistic regression modeling approach to estimate the impact of a set of independent variables on a person’s probability of taking a particular transportation mode. Maximum likelihood estimation technique was used to estimate the coefficient parameters.
A binary logit model is defined as:
where P is the probability of a binary outcome [e.g., a person taking transit (P = 1) or not taking transit (P = 0)], and Z = α + βX, where X is a vector of individual, household, urban form, and transit accessibility, and crime characteristics, and β is the slope of the variables.
A person’s individual characteristics included age, income, race, employment status, and status as the head of household. The household characteristics included the number of vehicles per licensed driver and the number of bicycles per household. The urban form & transit accessibility was a factor variable that grouped such urban form variables as number of four-legged intersections per acre and population per acre, with the transportation accessibility variable measured as accessibility to transit. The neighborhood crime characteristics were measured as the number of crimes per 1,000 residents of each TAZ (i.e., the crime rate).
Three sets of logistic regression models were run. The three model sets estimated the impact of independent variables on a person’s probability to take transit, bike, and walk for work and non-work trips.
BATS 2000 data were prepared for analysis by first importing the BATS 2000 data files into a Microsoft Access database. Because BATS 2000 data are distributed by MTC as text files, these files were converted into Access format. The BATS data are provided as three separate files:
1 = Driving, riding, walking, biking, flying 2 = Household chores and personal care 3 = Meals (at home, take-out, restaurant, etc.) 4 = Recreation/Entertainment 5 = Sleep 6 = Work or work related (in or out of home) 7 = School or school related (college/day care) 8 = Shopping (at home)(by Internet, catalog or television) 9 = Shopping (away from home) 10 = Personal services/bank/government 11 = Social activities 12 = Relaxing 13 = Volunteer/civic/religious services 14 = Sick or ill/medical appointment 15 = Non-work (non-shopping) Internet use 16 = Pick-up/drop off passenger
996 = Other
998 = Don’t know
999 = Refused
Source: MTV BATS 2000 Activity Survey File
The location of each activity also is identified by TAZ number, and if an activity is a trip, the origin and destination TAZs as well as the mode used for each trip also are provided.
Our first step was to create data tables that combined data from the various files described earlier. Mode choice analysis is typically done at a disaggregated level, meaning that each data record in the analysis tables needs to represent a single trip taken by a single person; however, each trip record needs to have data from multiple data files—household, person, and trip data—all in one record on one table. Therefore, we organized the BATS 2000 data tables into a relational database structure in Access, linking different data file records by common identifiers for household, person, and activity.
Because the largest share of trips taken by a person during a typical travel day are home-based and because the mode of travel chosen for a home-based trip plays an important role in determining the mode choice of trips throughout the travel day, it is our assumption that neighborhood crime levels will have their greatest effect on mode choice in a person’s home neighborhood. Therefore, we selected trip data records for analysis that were home-based.
Trips were categorized into five categories: auto, transit, walk, bicycle, and other. Only trips identified as auto, transit, walk, or bicycle were used for our analysis. To run the pedestrian binary logistic regression model, a “dummy” variable was constructed where pedestrian trips were coded with a “1” and all other trip types were coded with a “0.” Similar dummy variables were constructed for each of the other three modes of travel to use as dependent variables in the transit and bicycle binary logistic regression models.
There are several peculiarities of how trips are coded in the BATS 2000 dataset. We chose to use the Unlinked Trips File for our pedestrian and bicycle binary logistic model runs while we used the linked trips file for our transit analysis. We came to the conclusion that this was the most efficacious approach because home-based transit trips are under-represented in the Unlinked Trips File. Because very few people step directly out of their front doors onto a waiting transit vehicle, the transit trip is often the second, third, or later link in a trip chain, and the origin of this trip link will therefore not be coded as the home but rather as the bus stop, BART station, ferry terminal, or other transit station where the transit trip started. To reliably link the home’s neighborhood data (i.e., crime rates and transit accessibility) to each transit trip that began as a linked trip from the home, we used the Linked Trip File for the transit mode choice analysis. This way, transit trips that required a short walk or bicycle ride from home to reach the transit stop would be coded as home-based despite the fact that the origins of these individual trip links are actually located at the transit stop where the traveler boarded the transit vehicle. Pedestrian and bicycle trips were analyzed using the Unlinked Trip File because these modes are most likely to be used directly from the home.
Starting in January 2006, the police departments of 36 cities in the Bay Area were contacted via e-mail or hard copy letter requesting crime data for the year 2000. At each police department, follow-up phone calls were made to inquire as to the status of our request. Of the 36 cities contacted, five cities ultimately shared their Parts 1 and 2 data for the year 2000 or 2001 while three additional cities shared just the Part 1 data for the year 2000 or 2001.
Most city police departments that were unable to comply with our requests cited a lack of available staff to do the work of compiling and sending us their data. Some departments indicated that their policy was to charge the data requester for the staff time required to gather and send their data. In one case, we were told they would need to charge us up to $2,000 to gather and send their data.
Reasons given for not providing data also included:
While there were many negative or slow responses to our requests, one city responded within a week with the data and the GIS information. It is our assumption that two factors contribute to the willingness and ability of a police department to provide these detailed crime data records. The first is, as suggested by the negative responses summarized earlier, a lack of staff resources to collect and send the data. This includes both the availability of staff to perform the tasks as well as the expertise among staff to produce the data. The second factor is likely the degree to which a police department has made a successful transition from paper records or older generation database systems to more up-to-date and user-friendly systems. The data used in the study were collected by September 2006, 9 months after the process was started.
To facilitate future work involving crime data analysis (both academic research and analysis at the local, police-department level), we recommend that a national- or state-level policy be developed that would (perhaps as a supplement to the UCR system) encourage local police departments to digitally store historical crimes data. These data are extremely valuable, and our experience with police departments that routinely purge their historical data indicates that some departments do not understand the importance of storing and maintaining historical datasets. Although resources are often limited, researchers and other individuals who can analyze trends and patterns for reducing crime would be greatly helped by having comprehensive access to this information. This need presents opportunities for joint funding of crime database improvements by research institutions, universities, non-profit organizations, and government agencies in cooperation with individual police departments and the criminal justice agencies within state and federal governments. These opportunities should be explored and pilot-tested as a potential avenue for enhancing access to this potentially rich source of data.
Five cities used in this study provided both Part 1 and Part 2 crime data. The cities of Berkeley, Concord, Oakland, and Walnut Creek provided both Part 1 and Part 2 data for the year 2000. The city of Santa Clara provided Part 1 and Part 2 data for the year 2001. The cities of San Francisco and Sunnyvale were able to provide only Part 1 data. Details regarding the coding process for these cities are given next.
Berkeley
The crime data for Berkeley had 12,818 records of police activity for the year 2000. Each record has sufficient descriptive information for easy categorizing into the seven crime groupings. A total of 9,306 of the records provided (or 72.6 percent) were successfully geocoded and used for this study.
Concord
The crime data for the City of Concord contained 22,528 records of police activity for the year 2000. Of these records, 703 had addresses outside of the city of Concord. These records were not included in the analysis.
After the geocoding was done, 19,216 records, 85.3 percent, remained that were successfully geocoded with sufficient descriptive information for each record for the purposes of categorizing. All records had unique case numbers.
Oakland
The City of Oakland provided the most comprehensive dataset. We received 193,131 records of Part 1 and Part 2 crimes and incidents for the year 2000; however, these records included entries with either follow-up information on crimes which had been reported previously or entries with supplemental information for all persons involved in one crime. These duplicate and supplemental entries were removed from the dataset.
After these records were removed, other entries were found where either the crime description or the incident location was left blank. In some cases, the incident location given was unknown. City of Oakland personnel were unavailable for questions regarding these data. Consequently, these records also were removed from the dataset.
The remaining records were categorized and geocoded. A number of records were found to fall outside of the Oakland city limits. These records were removed from the study. The final number of records successfully geocoded and included in this study for the city of Oakland was 68,513 (or 35.5 percent).
Santa Clara
The City of Santa Clara provided 15,634 records of Part 1 and Part 2 crimes for the year 2001. Because this was the earliest year for which data were available, we used 2001 data as a proxy for 2000 data. While crime levels and geographic distributions undoubtedly change from year to year, we believe that these changes over the course of a single year are minimal. These data came with only code numbers to describe crimes. For this reason, categorizing these data was more challenging. Personnel at the city of Santa Clara made themselves available to help with interpreting and understanding the crime codes. For Santa Clara, 12,644 records (or 80.9 percent) were geocoded successfully.
Walnut Creek
The City of Walnut Creek provided 33,981 records of Part 1 and Part 2 crimes for the year 2000. Of these records, 25,023 (or 73.7 percent) geocoded successfully and fell within the bounds of the city limits.
The cities of San Francisco and Sunnyvale provided only Part 1 crime data for our study. Sunnyvale provided Part 1 data for the year 2000 while San Francisco provided Part 1 data for 2001.
Sunnyvale
A total of 2,123 Part 1 crime data records were provided for the year 2000 by the City of Sunnyvale. Street addresses were not provided for these crimes—only police department Reporting District information was provided; therefore, we were not able to geocode crimes in Sunnyvale at the address- or even intersection-level. However, an electronic map outlining Reporting Districts was made available, and we used it to create a GIS shapefile for Reporting Districts. This shapefile was then used to geocode a total of 2,120 records (or 99.9 percent) of the original dataset provided.
San Francisco
The City of San Francisco provided 22,429 Part 1 crime records for the year 2001. Data from San Francisco were received with no case numbers. A case number was created by concatenating the date, time, and address information for each record. For the concatenation, the Excel program transformed the given date from the date format into the numerical date value. For example, “12/7/2001” became the numerical date value 37232.
Addresses provided in the San Francisco data were “blocked” for reasons of confidentiality.
To geocode the San Francisco addresses, “XX”s were replaced with “00.” This effectively placed all crime locations that fell on a particular block at the corner adjacent to the lowest, even-numbered address on that block. While this reduced the accuracy of our crime geocoding for San Francisco, the fact that all crimes were aggregated to and summarized at the TAZ level made this loss of accuracy virtually inconsequential.
Cases where a range of addresses were given, such as “0001–2499 STOCKTON ST.”, the leading characters (in this case, namely “0001–”) were removed, leaving “2499 STOCKTON ST.” as the address for the geocoding. The final number of geocoded records was 19,169 (or
85.5 percent) of the San Francisco dataset.
Because the nature of this research effort was primarily exploratory, the methods we applied were primary exploratory in nature as well. Therefore, our approach, methods, and working hypotheses are somewhat informal and do not seek “ironclad” scientific confirmation of a set of working hypotheses but can be more accurately described as an exploration of a set of research questions and expectations. Our research questions and expectations can be summarized as follows:
density, high transit service levels, and lower crime rates). However, research related to this question provides somewhat mixed results. Cozens et al. found that well-lit transit station areas provide an enhanced feeling of safety from crime, and Loukaitou-Sideris et al. found that the introduction of the Green Line light rail system to Los Angeles did not increase crime rates in the station areas. Ingalls et al. found that people’s perceptions of high crime rates in urban, transit-rich environments was an impediment to transit ridership growth, and Block and Davis46 found that crimes tend to concentrate around rapid rail stations in Chicago. Therefore, we will explore two alternative relationships between density, transit service levels, and crime:
Because the nature of this research effort was primarily exploratory, the methods we applied were primary exploratory in nature as well. Therefore, our approach, methods, and working hypotheses are somewhat informal and do not seek “ironclad” scientific confirmation of a set of working hypotheses but can be more accurately described as an exploration of a set of research questions and expectations. Our research questions and expectations can be summarized as follows:
This section provides descriptive statistics for each study city, including the crimes data provided by each study city’s police department as well as an overview of the travel data for each city grouping obtained from the BATS 2000 survey. We present socio-demographic data in this section (e.g., household income) using Census 2000 data to provide an overview of each city’s socio-demographic makeup. We then give similar data using the BATS 2000 data used to run the logistic regression models to compare and contrast the trip data populations with the city-level census data.
Regarding Part 1 violent and Part 2 violent and vice-vagrancy crimes, the counts of crimes per 1,000 residents for Walnut Creek are the lowest of all the study cities. For property crimes, Part 1 property, Part 2 property, and broken window crimes, Walnut Creek is comparable to or above the average of the other study cities ( Table 2 ).
Table 2 Crime Rates for All Study Cities
| PIV | PIP | P2V | P2P | BROKWIN | VICEVAG | NOTAFF | |
| Berkeley | 1.2945 | 55.376 | 8.7695 | 0.4867 | 9.7914 | 7.8837 | 7.2122 |
| Concord | 2.2556 | 40.1062 | 13.9285 | 2.8318 | 10.7510 | 11.6482 | 76.5656 |
| Oakland | 7.5098 | 48.9440 | 14.8569 | 1.2191 | 10.7691 | 26.6423 | 61.5655 |
| San Francisco | 6.4308 | 18.2482 | |||||
| Santa Clara | 1.1536 | 26.1721 | 6.1156 | 0.1075 | 8.3821 | 12.8955 | 60.0131 |
| Sunnyvale | 1.1536 | 14.9362 | |||||
| Walnut Creek | 0.9332 | 30.3751 | 6.1901 | 9.0985 | 10.6850 | 6.3768 | 325.5257 |
BROKWIN = “Broken Windows” Crimes Category
VICEVAG = “Vice & Vagrancy” Cirmes Category
NOTAFF = “Does Not Affect Mode Choice” Crimes Category
Table 3 provides general descriptive statistics for each study city for the purposes of comparison and analysis. More detailed descriptive statistics and discussion are provided in Appendix B .
Median % % %
# crimedata
Income White Black Hispanic
sq. mi. int/ac
Berkeley 102,743 9,823.20 0.1108 9,306 44,485 63.7 15.3 9.7
Concord 121,780 4,041.00 0.0130 19,204 55,597 75.8 3.8 21.8
Oakland 399,484 7,126.66 0.0321 68,513 40,055 34.7 37.6 21.9
San Francisco 776,733 16,634 0.1274 19,169* 55,221 53.0 8.6 14.1
Santa Clara 102,361 5,566.20 0.0328 11,771 69,466 59.6 2.8 16.0
Sunnyvale 131,760 6,006.50 0.0213 2,120* 74,409 56.7 2.7 15.5
Walnut Creek 64,296 3,229.60 0.0075 25,023 63,238 86.8 1.5 6.0
* part 1 data only
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
Based on analysis of the data summaries just presented, pilot test logistic regression model runs, and the local knowledge of the research team members, we summarized and analyzed the travel diary survey data based on grouping the study cities into four sub-groups: urban core (San Francisco only); inner-ring (Oakland & Berkeley); suburbs with Parts 1 and 2 data (Walnut Creek, Santa Clara, & Concord); and All Suburbs Combined (Walnut Creek, Santa Clara, Concord, and Sunnyvale). Preliminary logistic model runs showed that combining the data into these sub-groups improved overall goodness-of-fit results. Furthermore, the relationships between the crime, density, and travel behavior were the most internally consistent (within groups) when the cities were organized in this fashion. Groupings of individual variables (e.g., race into “white” and “non-white” and household income into four groups) were based on a combination of the researchers’ past analytic experience, understanding of the research literature, and the iterative process of testing various combinations of variable groupings in the process of determining the ultimate structure and components of the final logistic regression models. The summary data and model results that follow are reported using these groups for the purposes of facilitating comparisons between the descriptive statistics on travel behavior and the model outputs.
Frequency distributions of categorical-level variables are listed in Table 4 for work trips using public transportation or transit mode and in Table 5 for non-work trips using transit mode. These models were run for San Francisco alone; for Oakland and Berkeley; for Concord, Santa Clara, and Walnut Creek; and for all suburban cities, Concord, Santa Clara, Sunnyvale, and Walnut Creek.
The distributions closely parallel the Census Journey to Work data in all cases. For example, the census data describe San Francisco as having 31.1 percent using transit for their work commute while Table 4 lists 26.7 percent for this model’s dataset (i.e., all trip records less than 5 miles in length). Comparing household income data, the distributions listed in and for the transit models do not follow the patterns seen in the census data. In the models, the percentages for household income below $15,000 are much lower than that of the general population described in the census data. The percentages for the highest income level, above $75,000, are all higher than that of the general population described in census data. However, for incomes above $75,000, the suburban cities have higher percentages than San Francisco, and Oakland and Berkeley, as was the case in the census data.
Table 4 Frequency Distribution of Categorical-Level Variables: Transit File—Work Trips
| Variable | Percentage of Observations | ||
|---|---|---|---|
| SF Only | Oakland & Berkeley Walnut Creek, Santa Clara, Concord | All Suburbs | |
| Trips | |||
| Transit | 26.72 | 17.46 9.64 | 7.04 |
| Non-Transit | 73.28 | 82.54 90.36 | 92.96 |
| Income | |||
| Household income under $15,000 | 1.11 | 2.29 0.92 | 0.67 |
| Household income $15,000–49,999 | 22.16 | 27.54 19.95 | 16.93 |
| Household income $50,000–74,999 | 22.93 | 24.45 22.30 | 20.33 |
| Household income above $75,000 | 53.80 | 45.72 56.83 | 62.07 |
| Age | |||
| Age 19–39 | 52.44 | 45.71 34.97 | 40.44 |
| Age 40–59 | 42.68 | 46.97 56.50 | 51.97 |
| Age over 59 | 4.88 | 7.32 8.53 | 7.59 |
| Gender | |||
| Male | 57.01 | 52.64 54.08 | 55.54 |
| Female | 42.99 | 47.36 45.92 | 44.46 |
| Head of Household | 63.13 | 60.58 60.27 | 59.93 |
| Race | |||
| White | 76.05 | 74.56 80.99 | 77.06 |
| Non-White | 23.95 | 25.44 19.01 | 22.94 |
| Number of bicycles in household | |||
| 0 | 39.04 | 35.07 37.49 | 37.33 |
| 1 | 24.09 | 23.09 18.90 | 18.39 |
| 2 | 23.72 | 20.25 22.60 | 23.10 |
| 3 | 7.41 | 11.11 9.22 | 9.33 |
| 4 | 4.45 | 5.28 7.56 | 6.95 |
| 5 | 0.68 | 2.36 2.42 | 3.02 |
| 6 | 0.49 | 1.81 1.36 | 1.33 |
| 7 | 0.00 | 0.47 0.23 | 0.32 |
| 8 | 0.00 | 0.00 0.23 | 0.18 |
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
Table 5 Frequency Distribution of Categorical-Level Variables: Transit File—Non-Work
Trips
Variable Percentage of Observations
| SF Only | Oakland & Berkeley | Walnut Creek, Santa Clara, Concord | All Suburbs | |
|---|---|---|---|---|
| Trips | ||||
| Transit | 13.96 | 6.48 | 2.05 | 1.55 |
| Non-Transit | 86.04 | 93.52 | 97.95 | 98.45 |
| Income | ||||
| Household income under $15,000 | 5.53 | 7.07 | 0.90 | 0.91 |
| Household income $15,000–49,000 | 26.34 | 30.62 | 23.52 | 21.86 |
| Household income $50,000–74,000 | 20.56 | 18.97 | 20.94 | 20.91 |
| Household income above $75,000 | 47.58 | 43.34 | 54.64 | 56.33 |
| Age | ||||
| Age 19–39 | 43.31 | 38.94 | 27.96 | 33.09 |
| Age 40–59 | 38.85 | 38.07 | 44.20 | 41.12 |
| Age over 59 | 17.85 | 22.99 | 27.84 | 25.80 |
| Gender | ||||
| Male | 48.97 | 44.18 | 43.25 | 44.12 |
| Female | 51.03 | 55.82 | 56.75 | 55.88 |
| Head of Household | 67.46 | 65.62 | 58.38 | 57.85 |
| Race | ||||
| White | 80.10 | 79.04 | 85.86 | 83.73 |
| Non-White | 19.90 | 20.96 | 14.14 | 16.27 |
| Number of bicycles in household | ||||
| 0 | 42.11 | 39.82 | 36.97 | 36.23 |
| 1 | 24.76 | 21.96 | 16.53 | 17.47 |
| 2 | 20.73 | 19.14 | 21.29 | 21.58 |
| 3 | 6.86 | 8.00 | 9.47 | 10.29 |
| 4 | 3.86 | 5.59 | 9.58 | 8.25 |
| 5 | 1.14 | 2.00 | 3.25 | 3.48 |
| 6 | 0.38 | 1.95 | 2.13 | 2.07 |
| 7 | 0.11 | 1.03 | 0.34 | 0.30 |
| 8 | 0.00 | 0.00 | 0.39 | 0.30 |
| 9 | 0.05 | 0.51 | 0.00 | 0.04 |
| 10 | 0.00 | 0.00 | 0.00 | 0.00 |
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
Descriptive statistics for the transit mode datasets (all trips greater than 0.5 miles in length) are shown in Table 6 through Table 13 . As expected, the number of vehicles per licensed driver is lowest in San Francisco and highest in the suburban cities.
Table 6 Descriptive Statistics of Categorical-Level Variables: Transit File—Non-Work
Trips
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0 3 0.8218 0.4502 Total number of bicycles in household 0 9 1.1902 1.2651 Part 1 property crimes per person in TAZ 0.0007 0.2059 0.0137 0.0156 Urban form & transit accessibility factor score -2.3369 4.9703 -0.0084 0.9762 N = 1599
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
Table 7 Descriptive Statistics for Continuous Variables for Berkeley and Oakland: Work Trips—Transit Mode (All Trips Greater than 0.5 Miles)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0 4 0.9570 0.3904 Total number of bicycles in household 0 9 1.4901 1.6091 Part 1 property crimes per person in TAZ 0.0118 0.4141 0.0444 0.0293 Urban Form & Transit Accessibility Factor Score -2.8738 4.7595 -0.0007 0.9661 N = 1255
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
Table 8 Descriptive Statistics for Continuous Variables for Concord, Santa Clara and
Walnut Creek: Work Trips—Transit Mode (All Trips Greater than 0.5 Miles)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0 4 1.0639 0.3713 Total number of bicycles in household 0 8 1.4565 1.5473 Vice & vagrancy crimes per Jobs + Population in TAZ 0 0.0259 0.0061 0.0054 Urban Form & Transit Accessibility Factor Score -1.7124 4.4424 0.1898 1.1061 N = 1314
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
Table 9 Descriptive Statistics for Continuous Variables for All Suburbs:Work Trips—Transit Mode (All Trips Greater than 0.5 Miles)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0 4 1.0637 0.3608 Total number of bicycles in household 0 8 1.4723 1.5499 Part 1 violent crimes per person in TAZ 0 0.0125 0.0012 0.0012
Table 9 Descriptive Statistics for Continuous Variables for All Suburbs:Work
Trips—Transit Mode (All Trips Greater than 0.5 Miles)
Standard
Minimum Maximum Mean
Deviation
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
Table 10 Descriptive Statistics for Continuous Variables for San Francisco Only: Non-Work Trips—Transit Mode (All Trips Greater than 0.5 Miles)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0 4 0.7912 0.4576 Total number of bicycles in household 0 9 1.1148 1.2609 Part 1 property crimes per person in TAZ 0.0007 0.2059 0.0151 0.0178 Urban Form & Transit Accessibility Factor Score -2.3369 4.9703 0.0543 1.0146 N = 1795
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
Table 11 Descriptive Statistics for Continuous Variables for Berkeley and Oakland: Non-Work Trips—Transit Mode (All Trips Greater than 1 Mile)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0 3 0.9242 0.3992 Total number of bicycles in household 0 9 1.4012 1.6610 Part 1 property crimes per person in TAZ 0.0118 0.4141 0.0447 0.0301 Urban Form & Transit Accessibility Factor Score -2.8738 4.4784 0.0364 0.9643 N = 1923
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
Table 12 Descriptive Statistics for Continuos Variables for Concord, Santa Clara and
Walnut Creek: Non-Work Trips—Transit Modes (All Trips Greater than 1 Mile)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0 3 1.0426 0.3185 Total numbers of bicycles in household 0 11 1.6095 1.6956 Part 1 property crimes per person in TAZ 0 0.0259 0.0053 0.0052 Urban Form & Transit Accessibility Factor Score -1.7124 4.4424 -0.0502 0.9750 N = 1781
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
| Descriptive Statistics | ||||
|---|---|---|---|---|
| Minimum | Maximum | Mean | Standard Deviation | |
| Total number of vehicles per licensed driver in household | 0 | 4 | 1.0415 | 0.387 |
| Total numbers of bicycles in household | 0 | 11 | 1.5918 | 1.6573 |
| Part 1 violent crimes per person in TAZ | 0 | 0.0053 | 0.0010 | 0.0010 |
| Urban Form & Transit Accessibility Factor Score | -2.0542 | 4.2971 | -0.0355 | 0.9919 |
| N = 2698 | ||||
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
Descriptive statistics for the variables used in the pedestrian work trip binary logistic regression model are shown in Table 14 . For work trips starting in suburban cities less than 1 mile in length, 30 percent are pedestrian trips while the corresponding figures for San Francisco and Oakland, 80 percent and 69 percent, respectively, conform to expectations that people who live in more pedestrian-friendly, urban cities tend to choose walking more often than do people in the suburbs. This pattern is repeated for the non-work pedestrian trip variables as seen in Table 15 . From the same dataset, the percentage of households in each income level group is roughly consistent across the three city groupings, indicating that the BATS 2000 sampling techniques were effective.
Table 14 Frequency Distribution of Categorical-Level Variables: Persons Reporting Work Trips Under One Mile in Length
| Variable | Percentage of Observations | ||||
|---|---|---|---|---|---|
| SF Only | Oakland & Berkeley | Suburbs | |||
| Trips | |||||
| Pedestrian trips | 80.16 | 69.07 | 30.32 | ||
| Non-pedestrian trips | 19.84 | 30.93 | 69.77 | ||
| Income | |||||
| Household income under $15,000 | 0.83 | 2.85 | 2.21 | ||
| Household income $15,000–49,000 | 25.07 | 30.70 | 27.21 | ||
| Household income $50,000–74,999 | 27.00 | 23.32 | 28.68 | ||
| Household income above $75,000 | 47.11 | 43.12 | 41.91 | ||
| Age | |||||
| Age 19–39 | 56.67 | 57.56 | 43.75 | ||
| Age 40–59 | 38.97 | 38.02 | 45.00 | ||
| Age over 59 | 4.36 | 4.43 | 11.25 | ||
| Gender | |||||
| Male | 52.05 | 51.30 | 45.63 | ||
| Female | 47.95 | 48.70 | 54.38 | ||
Table 14 Frequency Distribution of Categorical-Level Variables: Persons Reporting Work
Trips Under One Mile in Length
| Variable | Percen | tage of Observations | |
|---|---|---|---|
| SF Only | Oakland & Berkeley | Suburbs | |
| White | 80.16 | 75.27 | 2.28 |
| Non-white | 19.84 | 24.73 | 17.72 |
| Number of bicycles in household | |||
| 0 | 47.18 | 40.92 | 31.88 |
| 1 | 26.67 | 25.04 | 16.88 |
| 2 | 13.59 | 16.03 | 20.63 |
| 3 | 7.69 | 9.31 | 13.13 |
| 4 | 4.10 | 5.34 | 8.75 |
| 5 | 0.51 | 2.29 | 6.88 |
| 6 | 0.26 | 0.46 | 0.63 |
| 7 | 0.00 | 0.15 | 0.63 |
| 8 | 0.00 | 0.00 | 0.63 |
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
| SF Only | Oakland & Berkeley | Suburbs | |
|---|---|---|---|
| Trips | |||
| Pedestrian trips | 60.00 | 42.65 | 18.77 |
| Non-pedestrian trips | 40.00 | 57.35 | 81.23 |
| Income | |||
| Household income under $15,000 | 6.67 | 2.85 | 3.57 |
| Household income $15,000–49,000 | 32.00 | 30.70 | 20.66 |
| Household income $50,000–74,999 | 21.94 | 23.32 | 20.41 |
| Household income above $75,000 | 39.39 | 29.38 | 55.36 |
| Age | |||
| Age 19–39 | 49.30 | 47.04 | 33.26 |
| Age 40–59 | 37.95 | 35.70 | 38.81 |
| Age over 59 | 12.76 | 17.27 | 27.93 |
| Employed persons | 70.38 | 65.54 | 57.05 |
| Gender | |||
| Male | 48.54 | 45.23 | 39.45 |
| Female | 51.46 | 54.77 | 60.55 |
| Head of Household | 67.35 | 69.85 | 60.77 |
Table 15 Frequency Distribution of Categorical-Level Variables: Persons Reporting
Non-Work Trips Under One Mile in Length
Variable Percentage of Observations
Suburbs
SF Only Berkeley
White 83.20 79.58 87.61
Non-white 16.80 20.42 12.39
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
The frequency distributions for the work and non-work mode trips for all cities as used in the bicycle binary logistic regression models are listed in Table 16 .
Table 16 Frequency Distribution of Categorical-Level Variables: Persons Reporting Work and Non-Work Trips Under Five Miles in Length
| Variable | Percentage of Observations | ||
|---|---|---|---|
| All Cities—Work | All Cities— Non-Work | ||
| Trips | |||
| Bike trips | 5.13% | 2.14% | |
| Non-Bike Trips | 94.87% | 97.86% | |
| Income | |||
| Household income under $15,000 | 1.99 | 5.34 | |
| Household income $15,000–$49,000 | 24.85 | 27.11 | |
| Household income $50,000–74,999 | 23.01 | 19.93 | |
| Household income above $75,000 | 50.15 | 47.63 | |
| Age | |||
| Age 19–39 | 47.47 | 39.36 | |
| Age 40–59 | 45.73 | 38.69 | |
| Age over 59 | 6.80 | 21.95 | |
| Employed Persons | XXX | 63.64 | |
| Gender | |||
| Male | 52.87 | 44.33 | |
| Female | 47.13 | 55.67 | |
| Head of Household | 62.37 | 64.36 | |
Table 16 Frequency Distribution of Categorical-Level Variables: Persons Reporting Work
and Non-Work Trips Under Five Miles in Length
Variable Percentage of Observations
| All Cities—Work | All Cities— Non-Work | |
|---|---|---|
| White | 78.58 | 83.11 |
| Non-White | 21.42 | 16.89 |
| Number of bicycles in household | ||
| 0 | 37.50 | 39.94 |
| 1 | 21.83 | 21.76 |
| 2 | 21.63 | 19.44 |
| 3 | 9.40 | 8.15 |
| 4 | 5.37 | 5.88 |
| 5 | 2.67 | 2.41 |
| 6 | 1.17 | 1.66 |
| 7 | 0.20 | 0.46 |
| 8 | 0.03 | 0.08 |
| 9 | 0.20 | 0.19 |
| 10 | XX | XX |
| 11 | 0.00 | 0.02 |
Source: Census 2000, Metropolitan Transportation Commission 2000 Travel Analysis Zones (1454)
As expected, a higher percentage of people in the study cities bicycled for work purposes (roughly five percent) than for non-work purposes (roughly two percent)—a function of the relative predictability of work trips compared to non-work trips, and the fact that non-work trips often require carrying purchased goods home—an activity that can be difficult on a bicycle. There also appears to be a slightly higher percentage of people from lower income categories who traveled for non-work purposes, most likely a result of the fact that unemployed people have lower incomes and will, by circumstance, take only non-work trips.
Regarding the descriptive statistics of continuous variables for the pedestrian work-trip and non-work trip models, the information is consistent with what has been observed in the census data.
For example, as to be expected, for the total number of vehicles per licensed driver in household, the mean is closer to “1” for the suburban cities (See Table 19 ) than it is in the San Francisco (See Table 17 ) or the Oakland/Berkeley models (See Table 18 ), indicating that suburban residents tend to have more cars per licensed driver than those in urban cities.
Interestingly, in San Francisco’s pedestrian work trips model’s dataset, that city has mean of 0.9744 bicycles per household while in the suburban model, the figure is 1.8 bicycles. This range also is seen in the non-work pedestrian model datasets.
Also of note, Part 1 property crime rates are consistently highest in Oakland for the work and non-work pedestrian models when compared to the other city groupings. Descriptive statistics for all these models are listed in the next eight tables ( Table 17 through Table 24 ).
Table 17 Descriptive Statistics for San Francisco: Work Trips—Pedestrian Model (All
Trips Less than 1 Mile)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0.3333 2.5 0.8697 0.3342 Total number of bicycles in household 0.0 6.0 0.9744 1.1974 Part 1 property crimes/TAZ population 0.0007 0.1800 0.0136 0.0169 Urban Form & Transit Accessibility Factor Score -2.0704 4.9979 0.3108 1.1106 N = 273
Source: Census 2000, Metropolitan Transportation Commission 2000 Transit Analysis Zones (1454)
Table 18 Descriptive Statistics for Oakland and Berkeley: Work Trips—Pedestrian Model (All Trips Less than 1 Mile)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0.20 2.5 0.8791 0.3300 Total number of bicycles in household 0 9 1.2580 1.4794 Part 1 property crimes/TAZ population 0.00066 0.41 0.0298 0.0381 Urban Form & Transit Accessibility Factor Score -1.44394 5.9904 0.4264 1.0914 N = 499
Source: Census 2000, Metropolitan Transportation Commission 2000 Transit Analysis Zones (1454)
Table 19 Descriptive Statistics for Suburban Cities: Work Trips—Pedestrian Model (All Trips Less than 1 Mile)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0.3333 3 0.9995 0.3581 Total number of bicycles in household 0 8 1.8 1.7331 Part 1 Violent crimes/TAZ population 0 0.0053 0.0018 0.0016 Urban Form & Transit Accessibility Factor Score -1.4486 4.4804 0.3870 01.0732 N = 155
Source: Census 2000, Metropolitan Transportation Commission 2000 Transit Analysis Zones (1454)
Table 20 Descriptive Statistics For San Francisco: Non-Work Trips—Pedestrian Model
(All Trips Less than 1 Mile)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0.3333 4 0.8898 0.3841 Total number of bicycles in household 0 6 1.0335 1.2015 Part 1 Property crimes/TAZ population 0.0007 0.1700 0.0136 0.0139 Urban Form & Transit Accessibility Factor Score -2.0704 4.1007 0.2572 1.0087 N = 689
Source: Census 2000, Metropolitan Transportation Commission 2000 Transit Analysis Zones (1454)
Table 21 Descriptive Statistics for Oakland and Berkeley: Non-Work Trips—Pedestrian Model (All Trips Less than 1 Mile)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0.2 4 0.9240 0.3271 Total number of bicycles in household 0 9 1.4407 1.7624 Part 1 Property crimes/TAZ population 0.01 0.25 0.0462 0.0329 Urban Form & Transit Accessibility Factor Score -2.0704 4.1007 0.4877 1.1355 N = 643
Source: Census 2000, Metropolitan Transportation Commission 2000 Transit Analysis Zones (1454)
Table 22 Descriptive Statistics for Suburban Cities: Non-Work Trips—Pedestrian Model (All Trips Less than 1 Mile)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0.3333 2 1.0265 0.2861 Total number of bicycles in household 0 8 1.7484 1.8059 Part 1 Violent crimes/TAZ population 0 0.0053 0.0012 0.0012 Urban Form & Transit Accessibility Factor Score -1.5070 4.4804 0.1750 1.0201 N = 463
Source: Census 2000, Metropolitan Transportation Commission 2000 Transit Analysis Zones (1454)
Table 23 Descriptive Statistics for All Cities: Work Trips—Bicycle Model (All Trips Less than 5 Miles)
| Minimum | Maximum | Mean | Standard Deviation | |
|---|---|---|---|---|
| Total number of vehicles per licensed driver in household | 0.2 | 4 | 0.9531 | 0.3336 |
| Total number of bicycles in household | 0 | 9 | 1.3857 | 1.4957 |
Table 23 Descriptive Statistics for All Cities: Work Trips—Bicycle Model (All Trips Less
than 5 Miles)
Standard
Minimum Maximum Mean
Deviation
N = 2666
Source: Census 2000, Metropolitan Transportation Commission 2000 Transit Analysis Zones (1454)
Table 24 Descriptive Statistics for All Cities Except Richmond: Non-Work Trips—Bicycle Model (Trips Less than 5 Miles)
Standard
Minimum Maximum Mean
Deviation
Total number of vehicles per licensed driver in household 0.2 4 0.9789 0.3455 Total number of bicycles in household 0 11 1.3643 1.5692 Part 1 Property crimes/TAZ population 0 0.41 0.0238 0.0252 Urban Form & Transit Accessibility Factor Score -1.5194 5.7514 0.1811 1.0519 N = 5275
Source: Census 2000, Metropolitan Transportation Commission 2000 Transit Analysis Zones (1554)
TRANSIT MODEL FACTOR ANALYSIS
The dataset used for each factor analysis run was defined by selecting all trip records in the transit trips dataset with trip origins in the group of study cities being analyzed (i.e., trips with origins in San Francisco for the San Francisco model, Oakland or Berkeley for the Oakland & Berkeley model, and Walnut Creek, Santa Clara, Concord, or Sunnyvale for the Suburbs Only model, and the same suburban cities minus Sunnyvale for the Suburbs Not Sunnyvale model).
The factor loadings for each factor analysis output variable and the variance in the input variables explained by the factor analysis component variables is shown in Table 25 .
Table 25 Urban Form & Transit Accessibility Factor Analysis Component Loading for
Transit Model Runs
Variables Component Factor Loadings
| SF Only Oakland & Berkeley | Suburbs Only | Suburbs Not Sunnyvale |
|---|---|---|
| Jobs+Pop Density 0.794 0.747 | 0.732 | 0.792 |
| 4-legged intersections per acre 0.844 0.819 | 0.784 | 0.818 |
In general, the component loading values for all four models/city groupings show that the Jobs+Pop Density and the four-legged Intersection per Acre variables have the highest factor loading coefficients and, therefore, generally have the dominant role in contributing to the final factor variable score. The relative contribution of the Transit Accessibility variable varies in importance from city grouping to city grouping, attaining its maximum influence for the San Francisco Only model and the lowest for the Suburbs Not Sunnyvale model. The variability in the input variables explained by the component output variable ranges from roughly 54 to 62 percent.
Factor scores for each component were saved as variables in the pedestrian/bicycle dataset and used as independent variables in the pedestrian and bicycle binary logistic model runs.
The dataset used for each factor analysis run was defined by selecting all trip records in the pedestrian/bicycle trips dataset with trip origins in the group of study cities being analyzed (i.e., trips with origins in San Francisco for the San Francisco model, Oakland or Berkeley for the Oakland & Berkeley model, and Walnut Creek, Santa Clara, Concord, or Sunnyvale for the Suburbs Only model, and all of these cities together for the All Cities model [used only for the bicycle mode choice model runs because there were not enough bicycle trip records in the BATS 2000 dataset to perform model runs for individual cities or city sub-groupings]).
The factor loadings for each factor analysis output variable and the variance in the input variables explained by the factor analysis component variables are shown in Table 26.
Table 26 Urban Form & Transit Accessibility Factor Analysis Component Loadings for
Pedestrian & Bicycle Model Runs
Variables Component Factor Loadings
All Cities
SF Only Berkeley Only
In general, the component loading values for all four models/city groupings show that the Jobs+Pop Density and the four-legged Intersection per Acre variables have the highest factor loading coefficients and, therefore, generally have the dominant role in contributing to the final factor variable score. The relative contribution of the Transit Accessibility variable varies in importance from city grouping to city grouping, attaining its maximum influence for the San Francisco Only model and the lowest for the Oakland & Berkeley and Suburbs Only models. In the All Cities model, the component loading coefficient for Transit Accessibility is roughly on par with those for the other two variables. The variability in the input variables explained by the component output variable ranges from roughly 53 to 62 percent.
Factor scores for each component were saved as variables in the pedestrian/bicycle dataset and used as independent variables in the pedestrian and bicycle binary logistic model runs.
Because transit trips are typically more than a half-mile in length, trip records were selected for all transit mode choice analysis model runs a half-mile in length or longer, regardless of the mode of travel reported. Four separate models were run. As mentioned earlier, only Part 1 crime data were available for San Francisco and for Sunnyvale whereas both Part 1 and Part 2 crime data were available for Berkeley, Oakland, Walnut Creek, Santa Clara, and Concord.
Hence, a separate model, examining the impacts of both Part 1 and Part 2 crimes, was run for Berkeley, Oakland, Walnut Creek, Santa Clara, and Concord.
| Walnut | ||||
|---|---|---|---|---|
| SF Only | Oakland & Berkeley | Suburbs Only | Creek/Santa Clara/ | |
| Concord | ||||
| Household Income | ||||
| Under $15,000 | -0.1497 | -1.0773 | 1.0294 | 0.7684 |
| $15,000–49,000 | -0.1138 | -0.3764* | -0.3665 | -0.4985* |
| $50,000–74,999 | 0.1981 | -0.3005 | -0.1411 | -0.3386 |
| Over $75,000 | Referent | Referent | Referent | Referent |
| Household vehicles per licensed driver | -1.2430** * | -1.2117** * | -.06954 | -0.7963*** |
| Household bicycles | -0.2016** * | -0.0412 | -0.1009 | -0.1113 |
| Age | ||||
| 19–39 years | Referent | Referent | Referent | Referent |
| 40–59 years | -0.3163** | -0.4166** | -0.5242** | -0.4245 |
| Over 59 years | -0.8961** | -1.0936** | -1.3301** | -1.1899** |
| Gender (1= Male, 0 = Female) | 0.0899 | 0.0457 | -0.1510 | -0.0917 |
| Householder (HHR) (1=HHR, 0=Non-HHR) | 0.1131 | 0.2486 | 0.3460* | 0.3821* |
| Race (1= White, 0= Non-White) | -0.3205** | -0.3791** | 0.0629 | 0.0099 |
| Urban Form & Transit Accessibility Factor Score | 0.0656 | 0.127 | -0.6142** * | -.05879*** |
| Violent Crimes per Person | N/A | N/A | -67.8786 | N/A |
| Property Crimes per Person | 2.7136 | 1.0748 | N/A | N/A |
| Vice Crimes per Person | N/A | N/A | N/A | -51.2522* |
| Constant | 0.3779 | 0.0812 | -1.4331** * | -0.7554 |
| N | 1392 | 1111 | 1903 | 1162 |
| Nagelkerke R Square | 0.125 | 0.078 | 0.078 | 0.097 |
Notes:
* = p < 0.10
** = p < 0.05
*** = p < 0.01
N/A = Not applicable
Goodness of Fit
Nagelkerke R2 results for the four transit logistic model runs indicate that the models explain between 8 and 13 percent of the variation in the dataset.
Household Income Results
While household income is generally thought to play an important role in determining mode choice, among the four binary mode choice models run to predict transit work trips, the income dummy variables were statistically insignificant on two occasions—in the Oakland & Berkeley model and in the Walnut Creek/Santa Clara/Concord model—where the people with incomes between $15,000 and $49,999 were found to have a lower probability of taking transit to work than those with incomes greater than $75,000. These counter-intuitive negative and statistically insignificant findings may suggest that if transit is available and then controlling for other exogenous factors such as age and race, people of all income groups are equally likely to use it.
Household Vehicle and Bicycle Availability Results
Consistent with our theoretical assumptions, the higher the number of vehicles, the less likely a household member will choose to take transit to work. The variable measuring the number of household vehicles per licensed driver had a negative sign and was highly statistically significant for all four transit work trip models. The variable measuring the number of bicycles per household was statistically significant only for the San Francisco model, indicating that bicycles might not be a very good substitute for transit except for very dense urban environments such as those found in San Francisco.
Person Age Results
Consistent with our theoretical assumptions, taking transit to work appears to be an activity for the young. In all four city groupings’ model runs, dummy variables representing people aged 40 years and older were generally statistically significant, and all age variables possessed negative signs. These findings suggest that in general, people aged 19 to 39 years are more likely to take transit to work than those who are older.
Gender Results
Results for the Gender (Male) dummy variable from all four models were statistically insignificant, indicating that gender does not play an important role influencing transit mode choice in these city groupings.
Householder Status
Those who were identified in the BATS 2000 survey as the “householder” were represented in the analysis datasets with a “1” whereas all other survey participants received a “0.” This variable was statistically insignificant for the San Francisco, and Oakland & Berkeley models. However, those from the suburbs who identified themselves as the “householder” were significantly less likely to take transit to work than were other household members. This finding suggests that the more limited modal options in the suburbs (i.e., higher car dependence) affects the most “time-starved” and busy persons in the typical household—the householder.
Race Variable Results
Those who were identified in the BATS 2000 survey as “White” or “Caucasian” were represented in the analysis datasets with a “1” whereas all other survey participants received a “0.” Results for this variable for the San Francisco only model and for the Oakland & Berkeley model were statistically significant, indicating that in San Francisco and in Berkeley and Oakland, white residents are less likely to take transit to work. For the models run on suburban cities, this variable was statistically insignificant, indicating that race does not play an important role influencing transit mode choice in these city groupings.
Urban Form & Transit Accessibility Factor Score Variable Results
Because the variables developed to represent various aspects of urban form and transit accessibility were found in initial model runs to be collinear, we used factor analysis to develop a combined factor score variable that represents all three of these individual variables (See description of how these variables were developed for each city grouping/model run in the “Pedestrian and Bicycle Model Factor Analysis” section). For all two of the four transit model runs, this factor variable was statistically significant at the p = .01 level or better and possessed a counter-intuitive negative sign. Our theoretical assumption was that the higher the urban form/accessibility variable score, the more likely a household member is to take transit to work from all four city groupings/model runs; however, the models findings suggest that this variable either had no or counter-intuitive negative impact on the probability that a resident will take transit to work. To determine the specific urban from/accessibility component variable or variables that may be causing this counter-intuitive result, we ran the work model without the urban form/accessibility factor variable and instead entered in the component variables of this factor variable. We found that while the transit accessibility variable had a positive sign and was statistically significant, the population density and four-legged intersection density variables had negative signs and were statistically significant. From these exploratory model runs, we determined that the two density (i.e., population and intersection) variables were causing the factor variable to have a negative sign. Because the variables only had negative signs for the two suburban cities’ transit model runs, we interpreted these findings as indicative of the unique land use and street network configurations in suburban TAZs with high levels of transit service (and transit ridership). Because all four of our suburban study cities (Walnut Creek, Concord, Sunnyvale, and Santa Clara) have either BART or light rail stations, and because the TAZs for these station neighborhoods are likely to have some of the highest levels of transit ridership in these cities, they also have counter-intuitive relationships between urban form and the propensity for residents to choose transit. Furthermore, these high transit service/high transit ridership suburban TAZs tend to have freeways running adjacent to or surrounding their rail transit stations (e.g., in Walnut Creek and Concord), with park-and-ride lots surrounding these stations, and/or tend to be located in TAZs dominated by employment uses and low-density, arterial street networks. Therefore, while the proximity of rail transit stations tends to increase the probability that people living in these zones will choose transit, the dominance of suburban, auto-oriented development patterns there result in low population densities, low four-legged intersection densities, and a negative relationship between these two variables and the probability of choosing transit for work trips.
Neighborhood Crime Rate Variable Results
Neighborhood crime variables were selected for each model run based on performance in preliminary modeling exercises and on theoretical considerations. For three models—San Francisco Only; Oakland & Berkeley; and Suburbs Only—crime variables calculated as the number of crimes per TAZ resident/population tended to yield statistically significant results. For the Walnut Creek/Santa Clara/Concord model, the number of crimes per TAZ resident/population plus the number of TAZ employees yielded statistically significant results. For the San Francisco Only and the Oakland & Berkeley model runs, property crime rates were found to yield the best results; for the Suburbs Only model run, violent crime rates worked best; and for the Walnut Creek/Santa Clara/Concord model, vice crime rates worked best.
For three of the four model runs—San Francisco Only, Oakland & Berkeley, and Suburbs Only—the crime variables were statistically insignificant. The crime variable, measured as vice crime rate per TAZ, was statistically highly significant for the Walnut Creek/Santa Clara/Concord model and possessed a theoretically expected negative sign. These results suggest that although Part 1 crimes do not seem to impact the probability of a resident taking transit to work, certain Part 2 crimes (e.g., vice and vagrancy crimes in the case of Walnut Creek/Santa Clara/Concord model) may be associated with a lower probability of a neighborhood resident taking transit to work. We think it is important for future studies to estimate the impact of all crime types, not just major violent and property crimes (i.e., Type 1 crimes), on residents’ probability of taking transit for work trips.
Since transit trips are typically more than a half-mile in length, trip records were selected for all transit mode choice analysis model runs a half-mile in length or longer, regardless of the mode of travel reported. Four separate models were run. As mentioned earlier, only Part 1 crime data were available for San Francisco and for Sunnyvale whereas both Part 1 and Part 2 crime data were available for Berkeley, Oakland, Walnut Creek, Santa Clara, and Concord. Hence, a separate model examining the impacts of both Part 1 and Part 2 crimes was run for Berkeley, Oakland, Walnut Creek, Santa Clara, and Concord.
| Walnut | ||||
| SF Only | Oakland & Berkeley | Suburbs Only | Creek/Santa Clara/ | |
| Concord | ||||
| Household Income | ||||
| Under $15,000 | 0.7648** | -0.0655 | 3.1275*** | 3.5688*** |
| $15,000–49,999 | 0.3985** | 0.3063 | 0.9024** | 0.9549* |
| $50,000–74,999 | -0.1806 | 0.0394 | 0.1937 | 0.4395 |
| Over $75,000 | Referent | Referent | Referent | Referent |
| Household vehicles per licensed driver | -1.2599** * | -2.0745** * | -1.5337 | -0.9428 |
| Household bicycles | -0.1086 | -0.1356 | -0.0170 | -0.0565 |
| Age | ||||
| 19–39 years | Referent | Referent | Referent | Referent |
| 40–59 years | -0.1919 | -0.5637** | -0.8456* | -0.8470 |
| Over 59 years | -0.6924** | -0.6471* | -0.2097 | -0.4092 |
| Gender (1 = Male, 0 = Female) | -0.4141** * | -0.2763 | -0.2097 | -0.1819 |
| Householder (HHR) (1 = HHR, 0 = Non-HHR) | -0.1530 | 0.2817 | -0.0756 | 0.0879 |
| Race (1 = White, 0 = Non-White) | -0.3934** | -.6633*** | 0.4286 | 0.1276 |
| Employed (1 = yes, 0 = no) | 0.0210 | -0.1013 | -0.1104 | -0.0214 |
| Urban Form & Transit Accessibility Factor Score | -0.0145 | 0.0674 | -0.0858 | 0.22152 |
| Violent crimes per person | N/A | N/A | -149.0680 | N/A |
| Property crimes per person | 5,2930 | -3.0513 | N/A | N/A |
| Vice crimes per person | N/A | N/A | N/A | -87.4330 |
| Constant | -0.7185** | -0.1436 | -2.6405 | 0.2152 |
| N | 1533 | 1621 | 2258 | 1511 |
* = p < 0.10
** = p < 0.05
*** = p < 0.01
N/A = Not applicable
Goodness of Fit
Nagelkerke R2 results for the four non-work trip purpose transit logistic model runs indicate that the model runs explained between 8 and 17 percent of the variation in the dataset.
Household Income Results
The four binary mode choice models run to predict transit non-work trips found that the income dummy variables were generally statistically significant, indicating that lower income residents are more likely to take transit for non-work trips. However, the findings were statistically insignificant for the Oakland & Berkeley model.
Household Vehicle and Bicycle Availability Results
Consistent with our theoretical assumptions and the findings from the work trip transit models, the higher the number of vehicles available to the household, the less likely a household member will choose to take transit for non-work purposes. However, availability of bicycles, except for the Oakland & Berkeley model, did not seem to have a statistically significant impact on residents’ probability of taking transit for non-work trips. The overall findings for the availability of bicycles are consistent with the transit work models: a bicycle does not seem to be a viable alternative to taking transit.
Person Age Results
Consistent with our theoretical assumptions and the pedestrian work trip model runs, taking transit for non-work trips appears to be primarily for the young. In three of four city groupings’ model runs, dummy variables representing people aged 40 to 59 years were statistically significant (with the exception of Suburbs Only model) and possessed negative signs. In two of four city groupings’ model runs, dummy variables representing people aged 59 years and older were statistically significant (with the exception of the Suburbs Only model and the Walnut Creek/Santa Clara/Concord model) and possessed negative signs. These findings suggest that in general, people aged 19 to 39 years are more likely to take transit for non-work trips than those who are older.
Gender Results
Results for the Gender (Male) dummy variable for three of four models were statistically insignificant, indicating that gender does not play an important role influencing transit mode choice in these city groupings. The gender variable was statistically significant and a possessed positive sign for the San Francisco model, indicating that males are more likely to take transit to non-work activities in San Francisco.
Householder Status
This variable was statistically insignificant for all four city groupings’ model runs, suggesting that this variable is not useful for predicting the mode choice of non-work trips.
Race Variable Results
Results for this variable for the suburban cities models for non-work transit model runs were statistically insignificant, but highly significant (at the p < .05 level and above) and negative for the San Francisco Only model and the Oakland & Berkeley model. This finding suggests that survey participants who identified themselves as “White” were less likely than people who described themselves as members of some Non-White category to take transit to non-work activities in San Francisco and in Oakland & Berkeley. This may be due to the fact that a large majority of the non-white population in these large urban cities are African American—a group very likely to be the primary user of transit for non-work activities.
Employment Status Variable Results
Those persons identified as “employed” in the survey were represented with a “1” while all other survey participants were coded with a “0.” Results for this variable were statistically insignificant for all transit non-work trip model runs.
Urban Form & Transit Accessibility Factor Score Variable Results
Contrary to our theoretical assumptions, the urban form & accessibility variable score did not have a statistically significant impact on residents’ probability of taking transit to non-work activities in any of the four models.
Neighborhood Crime Rate Variable Results
Neighborhood crime variables were selected for each model run based on performance in preliminary modeling exercises and on theoretical considerations. For three models—San Francisco Only, Oakland & Berkeley, and Suburbs Only—crime variables calculated as the number of crimes per TAZ resident/population tended to yield statistically significant results. For the Walnut Creek/Santa Clara/Concord model, the number of crimes per TAZ resident/population plus the number of TAZ employees yielded statistically significant results. For the San Francisco Only and Oakland & Berkeley model runs, property crime rates were found to yield the best results; for the Suburbs Only model run, violent crime rates worked best; and for the Walnut Creek/Santa Clara/Concord model, vice crime rates worked the best.
Consistent with the findings of the transit work-trip model runs, for three of the four model runs for the transit non-work trips—San Francisco Only, Oakland & Berkeley, and Suburbs Only—the crime variables were statistically insignificant. The crime variable, measured as vice crime rate per TAZ, was statistically highly significant for the Walnut Creek/Santa Clara/Concord model and possessed a theoretically expected negative sign. These results suggest that although Part 1 crimes do not seem to impact the probability of a resident taking transit for non-work trips, certain Part 2 crimes (e.g., vice and vagrancy crimes in the case of the Walnut Creek/Santa Clara/Concord model) may be associated with a lower probability of a neighborhood resident taking such transit trips. We think it is important for future studies to estimate the impact of all crime types, not just major violent and property crimes (i.e., Type 1 crimes), on residents’ probability of taking transit for non-work trips.
Because walk trips are typically less than 1 mile in length, trip records were selected for all pedestrian mode choice analysis model runs 1 mile in length or shorter, regardless of the mode of travel reported.
Table 29 Binomial Logistic Regression Results for Transit Non-Work Trips
Oakland & Suburbs
Household income Under $15,000 N/D -0.7859 -15.9758 $15,000–49,000 -0.800 -0.2919 1.5469** $50,000–74,000 -0.4008 -0.0719 1.7556*** Over $75,000 Referent Referent Referent -1.1291**
Household vehicles per licensed driver -1.1895** *
-2.4607**
-0.3381**
Household bicycles -0.3175** * -0.2925*
Age 19–39 years Referent Referent Referent -0.8163**
40–59 years -0.1488 *
-0.9891*
-2.4040** -2.2562**
Over 59 years * * -0.4527
Gender (1 = Male, 0 = Female) 0.2374 0.1712 -0.6456
Householder (HHR) (1 = HHP, 0 = Non-HHR) 0.2078 0.1786 -1.1310**
Race (1 = White, 0 = Non-White) 0.6191 0.2614 0.6697 -0.4806**
Urban Form & Transit Accessibility Factor Score 0.6004** * 0.5460**
-277.3119
Violent crimes per person N/A N/A *
-12.0730*
Property crimes per person -21.43323 * N/A
Constant 2.7785*** 2.7978 2.0275
N 247 443 129
-2 Log likelihood 209.74 443 129
Nagelkerke R Square 0.217 0.260 0.394
Notes:
* = p < 0.10
** = p < 0.05
*** = p < Not Applicable
N/D = No Data
Goodness of Fit
While the Nagelkerke R2 results for the three pedestrian logistic model runs indicate that their predictive power is relatively low, our experience with these models and similar model results from our review of the literature suggests that this model is performing at a high level, explaining between 22 and 39 percent of the variation in the dataset.
Household Income Results
While household income is generally thought to play an important role in determining mode choice, the three binary mode choice models run to predict pedestrian work trips found that the income dummy variables were statistically significant only in the Suburbs model, where people from households with incomes between $15,000 and $75,000 per year have a higher probability of walking to work than do those with higher household incomes. The lack of statistically significant findings for more urban city models (i.e., the San Francisco and Berkeley & Oakland models) might suggest that due to the relative lack of pedestrian- and transit-friendly environments and infrastructure in suburban cities, those with low or moderate incomes are more likely to get out of their cars and walk to work as a way to save on commute costs.
Household Vehicle and Bicycle Availability Results
Consistent with our theoretical assumptions, the higher the number of vehicles and bicycles available to the household, the less likely a household member will choose to walk to work. These two variables had a negative sign and were highly statistically significant for all three pedestrian work trip models.
Person Age Results
Consistent with our theoretical assumptions, walking to work appears to be an activity for the young. In all three city groupings’ model runs, dummy variables representing people aged 40 years and older were generally statistically significant (with the exception of ages 40 to 59 in the San Francisco Only model and those over 59 in the Suburbs Only model), and all age variables possessed negative signs. These findings suggest that in general, people aged 19 to 39 years are more likely to walk to work than are those who are older.
Gender Results
Results for the Gender (Male) dummy variable from all three models were statistically insignificant, indicating that gender does not play an important role in influencing pedestrian mode choice in these city groupings.
Householder Status
This variable was statistically insignificant for the San Francisco Only and Oakland & Berkeley Only models. However, those from the suburbs who identified themselves as the “head of the household” were significantly less likely to walk to work than other household members. This finding suggests that the more limited modal options in the suburbs (i.e., higher car dependence) affects the most “time-starved” and busy persons in the typical household—the householder.
Race Variable Results
Results for this variable from all three models were statistically insignificant, indicating that race does not play an important role in influencing pedestrian mode choice in these city groupings.
Urban Form & Transit Accessibility Factor Score Variable Results
Because the variables developed to represent various aspects of urban form and transit accessibility were found in initial model runs to be collinear, we used factor analysis to develop a combined factor score variable that represents all three of these individual variables (See description of how these variables were developed for each city grouping/model run in the “Pedestrian and Bicycle Model Factor Analysis” section). For all three pedestrian model runs, this factor variable was statistically significant at the p =.05 level or better and possessed a positive sign. Therefore, consistent with our theoretical assumptions, the higher the urban form/accessibility variable score, the more likely a household member is to walk to work from all three city groupings/model runs. This suggests that neighborhoods with a high density of population and employment, with a traditional grid street network (with a high density of four-legged intersections), and with high transit accessibility tend to increase the probability that a resident will walk to work. These consistent findings also suggest that our factor analytic variable has effectively mitigated the multicollinearity problems found between the three urban form/accessibility variables.
Neighborhood Crime Rate Variable Results
Neighborhood crime variables were selected for each model run based on performance in preliminary modeling exercises and on theoretical considerations. For all pedestrian model runs, crime variables calculated as the number of crimes per TAZ resident/population tended to yield statistically significant results consistent with our theoretical expectations more than those calculated as the number of crimes per TAZ resident/population plus the number of TAZ employees. For the San Francisco Only and Oakland & Berkeley Only model runs, property crime rates were found to yield the best results whereas for the Suburbs Only model run, violent crime rates worked best.
Of the three model runs, the crime variables were statistically significant and possessed a negative sign for the Oakland & Berkeley Only and Suburbs Only model runs. While the crime variable in the San Francisco Only model run was statistically insignificant, its sign also was negative, lending some consistency to the results for this variable across the three model runs. These results suggest that higher crime rates were associated with a lower probability of a neighborhood resident walking to work.
Because walk trips are typically less than 1 mile in length, trip records were selected for all pedestrian mode choice analysis model runs 1 mile in length or shorter, regardless of the mode of travel reported.
Table 30 Binomial Logistic Regression Results for Pedestrian Non-Work Trips
| Variable | SF Only | Oakland & Berkeley | Suburbs Only |
|---|---|---|---|
| Household income | |||
| Under $15,000 | 0.0357 | 0.2101 | -0.3486 |
| $15,000–49,000 | 0.2084 | -0.8301** * | 0.3863 |
| $50,000–74,999 | 0.2127 | -0.1752 | 0.4254 |
| Over $75,000 | Referent | Referent | Referent |
| Household vehicles per licensed driver | -0.4380 | -1.1127** * | -1.0848* |
| Household bicycles | -0.0551 | -0.1265** | -0.0662 |
| Age | |||
| 19–39 years | Referent | Referent | Referent |
| 40–59 years | -0.3431* | -0.2984 | -0.5468* |
| Over 59 years | -1.1220** * | -0.9909** * | -1.2944** * |
| Gender (1 = Male, 0 = Female) | -0.1073 | 0.0424 | -0.0975 |
| Householder (HHR) (1 = HHR, 0 = Non-HHR) | -0.1612 | 0.3166 | -0.0887 |
| Race (1 = White, 0 = Non-White) | 0.6159*** | 0.0379 | 0.2188 |
| Employed (1 = Yes, 0 = No) | -0.3293 | -0.2009 | -0.4689 |
| Urban Form & Transit Accessibility Factor Score | 0.1361 | 0.7186*** | 0.3724*** |
| Violent crimes per person | N/A | N/A | -57.8984 |
| Property crimes per person | 31.4834** | -2.7835 | N/A |
| Constant | 0.5965 | 1.1880** | 0.1739 |
| N | 605 | 544 | 373 |
* = p < 0.10 ** = p < 0.05 *** = p < 0.01 N/A = Not Applicable N/D = No Data
Goodness of Fit
Nagelkerke R2 results for the three non-work trip purpose pedestrian logistic model runs indicate that their predictive power is low compared to that found for the pedestrian work trip model runs. While the work trip model runs explained between 22 and 39 percent of the variation in the dataset, the non-work trip model runs explained between 9 and 13 percent. In general, our experience is that non-work mode choice models tend to have somewhat lower goodness-of-fit results than do the work trip models. From a theoretical perspective, this makes sense because work trips are more regimented in terms of their origins, destinations, trip times, and travel choices available and therefore lead to a more consistent choice of travel mode.
Household Income Results
Similar to the findings from the work trip pedestrian models, the three binary mode choice models run to predict pedestrian non-work trips found that the income dummy variables were generally statistically insignificant. For the non-work pedestrian models, the only statistically significant finding for these variables was in the Oakland & Berkeley model, where people from households with incomes between $15,000 and $75,000 per year have a lower probability of walking to work than do those with higher household incomes. This finding is somewhat in contrast to the findings of a statistically significant, positive relationship from the suburban cities pedestrian work model for the same income category. These different findings may be due to the different lifestyles of those living in the older, more urban cities of Oakland and Berkeley versus those living in more suburban cities.
Household Vehicle and Bicycle Availability Results
Consistent with our theoretical assumptions and the findings from the work trip pedestrian models, the higher the number of vehicles and bicycles available to the household, the less likely a household member will choose to walk for non-work activities. However, while all the work trip pedestrian models had statistically significant coefficients and negative signs for these two variables and while all the signs for these variables were negative for the non-work model runs, significant findings were only found for vehicle availability and household bicycles from the Oakland & Berkeley model and for the Household Vehicles per Licensed Driver for the suburban non-work model runs. The fewer number of statistically significant findings for the non-work pedestrian model runs most likely reflects the higher difficulty we have found in predicting non-work trip mode choice.
Person Age Results
Consistent with our theoretical assumptions and the pedestrian work trip model runs, walking to non-work activities appears to be primarily for the young. In all three city groupings’ model runs, dummy variables representing people ages 40 years and older were statistically significant (with the exception of ages 40 to 59 in the Oakland & Berkeley model), and all age variables possessed negative signs. These findings suggest that in general, people aged 19 to 39 years are more likely to walk to non-work activities than are those who are older.
Gender Results
Results for the Gender (Male) dummy variable from all three models were statistically insignificant, indicating that gender does not play an important role in influencing pedestrian mode choice in these city groupings.
Householder Status
This variable was statistically insignificant for all three city groupings/model runs, suggesting that this variable is not useful for predicting the mode choice of non-work trips.
Race Variable Results
Results for this variable for the Oakland & Berkeley and Suburbs Only non-work pedestrian model runs were statistically insignificant, but highly significant (at the p < .01 level) and positive for the San Francisco Only model. This finding suggests that survey participants who identified themselves as “White” were more likely than people who described themselves as members of some Non-White category to walk to non-work activities in San Francisco.
Employment Status Variable Results
Results for this variable were statistically insignificant for all pedestrian non-work trip model runs.
Urban Form & Transit Accessibility Factor Score Variable Results
Consistent with our theoretical assumptions, the higher the urban form/accessibility variable score, the more likely a household member is to walk to non-work activities from the Oakland & Berkeley and Suburbs city groupings/model runs. This suggests that neighborhoods with a high density of population and employment, with a traditional grid street network (with a high density of four-legged intersections), and with high transit accessibility tend to increase the probability that a resident will walk to work. The fact that the San Francisco model did not produce a statistically significant finding for this factor variable may be explained by the relatively high levels of density and transit accessibility throughout that city. As a result, there is not enough meaningful variation in this factor variable to explain the differences in mode choice there.
Neighborhood Crime Rate Variable Results
Neighborhood crime variables were selected for each model run based on performance in preliminary modeling exercises and on theoretical considerations. For all pedestrian model runs, crime variables calculated as the number of crimes per TAZ resident/population tended to yield statistically significant results consistent with our theoretical expectations more than those calculated as the number of crimes per TAZ resident/population plus the number of TAZ employees. For the San Francisco Only and Oakland & Berkeley Only model runs, property crime rates were found to yield the best results whereas for the Suburbs Only model run, violent crime rates worked best.
Of the three non-work model runs, the crime variables were statistically significant only for the San Francisco model and, contrary to our assumptions, possessed a positive sign indicating that higher property crime rates are associated with a higher probability of selecting transit for non-work trips. There are two major interpretations of this finding that we can offer. First, the variation of crime rates within a San Francisco TAZ may be high due to the very dense urban fabric in that city. The tendency of crimes to cluster into “hot spots” may mean that the calculation of crime rates using TAZ boundaries (which are drawn for the purposes of analyzing travel behavior and not crime patterns) is inappropriate for very dense urban environments. This dynamic leads to an “ecological fallacy” that might have resulted in a counter-intuitive sign for the crime variable.
A second explanation for this result is that it is due to residential self-selection. In general, there is a tendency of crimes to cluster in neighborhoods and along arterial streets with high transit accessibility. Often these neighborhoods and arterials have liquor stores and other uses that have a tendency to attract crimes, increasing their crime rates. While these higher crime rates would ordinarily dissuade residents from walking and using transit, in San Francisco most residents have chosen to live in this dense urban environment despite the high crime rates, possibly in part because they value the walkable neighborhoods and high levels of transit service. Therefore, people who live in San Francisco already have “discounted” the high crime rates in their neighborhoods and have decided that they will not be dissuaded from walking in them. As a result, neighborhoods in San Francisco with high crime rates also have high levels of pedestrian activities for non-work purposes, but the relationship does not appear to be causal.
Overall, of the modes of travel recorded in the BATS 2000 dataset, bicycle trips had the fewest records. Because our study cities comprised only seven of the over one hundred city and county jurisdictions in the nine-county San Francisco Bay Area, the number of bicycle trip records was scarce. Accordingly, to ensure that we had an adequate number of bicycle trip records for the binary logistic regression model runs, we reasoned it necessary to group all study cities together into a single analysis pool. In addition, to maximize the number of bicycle trip records at our disposal to analyze, we reasoned that most bicycle trips are typically less than 5 miles in length. Trip records were selected for all bicycle mode choice analysis model runs 5 miles in length or shorter, regardless of the mode of travel reported.
Table 31 Binomial Logistic Regression Results for Bicycle Trips—All Cities Work and
Non-Work
| Variable | Work | Non-Work |
|---|---|---|
| Household income | ||
| Under $15,000 | 0.1681 | 1.3296** |
| $15,000–49,999 | 1.0185*** | 0.5465* |
| $50,000–74,999 | 0.5940** | 0.9292*** |
| Over $75,000 | Referent | Referent |
Table 31 Binomial Logistic Regression Results for Bicycle Trips—All Cities Work and
Non-Work
| Variable | Work | Non-Work |
|---|---|---|
| Age | ||
| 19–39 years | Referent | Referent |
| 40–59 years | -0.7910** * | -1.0797 |
| Over 59 years | -1.7768** | -2.3648** * |
| Gender (1 = Male, 0 = Female) | 1.1912*** | 0.8114*** |
| Householder (HHR) (1 = HHR, 0 = non-HHR) | -0.0883 | -0.2730 |
| Race (1 = White, 0 = Non-White) | 06.881** | 0.5788* |
| Employed (1 = Yes, 0 = No) | N/A | -0.2949 |
| Urban Form & Transit Accessibility Factor Score | 0.1894* | 0.3017*** |
| Property crimes per person | 2.5631 | 4.0898 |
| Constant | -4.6227 | -4.3991** * |
| N | 2323 | 4433 |
| -2 Log likelihood | 727.10 | 725.95 |
| Nagelkerke R Square | 0.262 | 0.241 |
Notes:
* = p < 0.10
** = p < 0.05
*** = p < 0.01
N/A = Not Applicable
N/D = No Data
Goodness of Fit
Nagelkerke R2 results for the work and non-work bicycle trip purpose logistic model runs indicate their predictive power is somewhat low compared to that found for the pedestrian work trip model runs and somewhat high compared to the pedestrian non-work runs. The work trip model run explained roughly 26 percent of the variation in the dataset whereas the non-work trip model run explained between 9 and 24 percent.
Household Income Results
While the income dummy variables included in all the logistic model runs were generally statistically insignificant for the pedestrian model runs, they were virtually all statistically significant in the work and non-work bicycle trips models; the only insignificant finding was for the $15,000 to $49,000 income category in the work trip model. The signs for all the income category variables in these two models were positive, indicating that those persons from households earning less than $75,000 a year are more likely to ride a bicycle for work and non-work purposes than is a person from a higher income household. This suggests that people partially choose to ride a bicycle to save on travel costs.
Household Vehicle and Bicycle Availability Results
Consistent with our theoretical assumptions and the findings from the work trip pedestrian models, the higher the number of vehicles per licensed driver in the household, the less likely a household member will choose to bicycle for work and non-work purposes. However, while all the work trip pedestrian models had negative signs for these two variables—consistent with our expectations—the positive sign for the Household Bicycles variable suggests that the more bicycles available to the household, the more likely a household member will choose to bicycle. However, because this variable was statistically significant only for the bicycle work trips model, we can only confirm this relationship for work trips.
Person Age Results
Consistent with our theoretical assumptions and the pedestrian work and non-work trip model runs, people aged 19 to 39 years are more likely to choose riding a bicycle for both work and non-work trips than are people from older age groups. In both bicycle work and non-work trip models, dummy variables representing people 40 years and older were statistically significant and all age group dummy variables possessed negative signs. These findings suggest that in general, people 19 to 39 years of age are more likely to bicycle for all trip purposes than are those who are older.
Gender Results
Results for the Gender (Male) dummy variable from both bicycle models were statistically significant at the p < .01 level, indicating that males are more likely to ride bicycles for both work and non-work purposes than are females.
Householder Status
This variable was statistically insignificant for both work and non-work bicycle model runs, suggesting that this variable is not useful for predicting the mode choice of bicycle trips.
Race Variable Results
Results for this variable for both work and non-work bicycle model runs were highly significant (at the p < .01 level) and positive. This finding suggests that survey participants who identified themselves as “White” were more likely than people who described themselves as members of some Non-White category to bicycle to both work and non-work activities in the study cities.
Employment Status Variable Results
Results for this variable were statistically insignificant for the bicycle transit non-work trip model run.
Urban Form & Transit Accessibility Factor Score Variable Results
Consistent with our theoretical assumptions, the higher the urban form/accessibility variable score, the more likely a household member is to bicycle to both work and non-work activities in the study cities. These results suggest that neighborhoods with a high density of population and employment, with a traditional grid street network (with a high density of four-legged intersections), and with high transit accessibility tend to increase the probability that a resident will choose to bicycle.
Neighborhood Crime Rate Variable Results
Neighborhood crime variables were selected for each model run based on performance in preliminary modeling exercises and on theoretical considerations. For all bicycle model runs, crime variables calculated as the number of crimes per TAZ resident/population tended to yield statistically significant results consistent with our theoretical expectations more than those calculated as the number of crimes per TAZ resident/population plus the number of TAZ employees. For the All Cities bicycle mode choice model runs, property crime rates were found to yield the best results. Of the work and non-work bicycle model runs, the crime variables were statistically insignificant.
In general, this study found substantiation for the proposition that neighborhood crime rates have an influence on the propensity to choose non-automotive modes of transportation for home-based trips. Specifically, high Vice and Vagrancy crime rates were associated with a lowered probability of choosing transit in Walnut Creek, Concord, and Santa Clara for both work and non-work trips, high Part 1 Property crime rates were associated with a lower probability of walking for work trips in Oakland and Berkeley, high Part 1 Violent crime rates were associated with a lower probability of walking for work trips in the four suburban study cities, and higher Part 1 Property crime rates in San Francisco were associated with an increased probability of walking for non-work trips.
While the signs of these significant relationships conformed to our expectations (i.e., that high crime rates would tend to reduce the probability of people to choose non-automotive modes of travel), we did not find statistically significant relationships for all city/trip type model runs, suggesting that these relationships differ depending on the urban form and trip type contexts. This conclusion is further substantiated by the finding that different crime types were better predictive variables for certain city/trip type model runs, and by the positive, statistically significant relationship found for San Francisco non-work pedestrian trips. This San Francisco finding in particular challenged our assumptions about the nature of the relationship between neighborhood crime rates and mode choice. To fully understand this finding within the context of the other significant findings, which generally conformed to our expectations, will require additional and more focused research. However, we suspect that this finding is related to the very high densities of San Francisco coupled with the correlation between dense, transit-rich neighborhoods, and high crime rates. In particular, we hypothesize that these urban transit neighborhoods in San Francisco attract residents who are aware of the crime challenges in these environments and, to some extent, have discounted these concerns for personal security. They have explicitly chosen to live in neighborhoods where they can enjoy the benefits of walkable, transit-rich, dense urban environments and have learned to live with or disregard the high crime rates in these areas. This spatial correlation is particularly pronounced with regard to violent crimes and is detectable in a visual examination of Figure 3. Violent Crimes, shown with black triangles, tend to cluster on or near main arterials, which also tend to carry transit service, shown with dashed lines. While significantly more dispersed, the gray circles, showing property crimes, tend to cluster near the transit lines as well, suggesting that blocks with high transit accessibility tend to also have a high incidence of crimes.
The difference between the distribution of crimes in these San Francisco TAZs and the distribution of crimes in four Oakland TAZs is detectable through comparison of Figure 4 and Figure 5 .
In Oakland, there appears to be a greater distribution of both violent and property crimes throughout the city’s TAZs, with only a slight spatial correlation between violent crimes and transit lines, and no detectable relationship between property crimes and transit lines.
Therefore, we suggest that our travel data sample for San Francisco suffers from a certain degree of self-selection bias in that a high proportion of this city’s residents prefer non-auto modes of travel and choose to live in San Francisco—despite the challenges of its higher crime rates—to enjoy its high-quality transit and pedestrian-oriented environments. This would explain the counter-intuitive finding of a significant, positive relationship between crime rates and the probability of walking to non-work activities in San Francisco.
Figure 4 Distribution of Part 1 Violent and Property Crimes in Four San Francisco TAZs
In Oakland, there appears to be a greater distribution of both violent and property crimes throughout the city’s TAZs, with only a slight spatial correlation between violent crimes and transit lines, and no detectable relationship between property crimes and transit lines.
Therefore, we suggest that our travel data sample for San Francisco suffers from a certain degree of self-selection bias in that a high proportion of this city’s residents prefer non-auto modes of travel and choose to live in San Francisco—despite the challenges of its higher crime rates—to enjoy its high-quality transit and pedestrian-oriented environments. This would explain the counter-intuitive finding of a significant, positive relationship between crime rates and the probability of walking to non-work activities in San Francisco.
Figure 5 Distribution of Part 1 Violent and Property Crimes in Four Oakland TAZs
In addition to the possibility of self-selection bias, we also hypothesized that there may be difficulties associated with the measurement of crime rates using TAZs, particularly in dense, urban, transit-rich environments such as San Francisco. Due to the tendency of crimes to cluster along transit lines and where TAZ borders also tend to be drawn, crime “hot spots” may fall into zones where transit levels are high because the neighborhood overall has fewer crimes in its core residential areas. Because TAZs were drawn to describe travel behavior and not with reference to crime rates or distributions, the possibility exists that using TAZs to aggregate crimes is an “ecological fallacy,” where it is erroneously assumed that members of a group (e.g., individuals who live in a TAZ) exhibit the characteristics of the group at large (e.g., those represented by an aggregation of individuals in a TAZ).
This study has verified a statistically significant influence of neighborhood crime on the propensity to walk and ride transit. In three model runs, we found confirmation of our hypothesis that high crime rates are associated with a reduced propensity to walk or ride transit. In the San Francisco non-work trip model, we found a positive relationship between the propensity to walk and property crime rates—a finding for which we have proposed two hypothetical explanations. First, this could be the result of a self-selection bias in the BATS 2000 data in San Francisco, where those more likely to use transit and walk will cluster in neighborhoods with high transit accessibility and densities—the same neighborhoods where crime clusters or “hot spots” will occur. The second explanation is that there is an ecological fallacy at work in our use of TAZs to aggregate crime data because TAZs were not drawn to explain the spatial distribution of crimes.
Our analysis of the availability of crime data for use in transportation planning and policy research found that while the data collection and storage practices are improving, they are inconsistent and “spotty,” with considerable differences in data management and dissemination practices between jurisdictions. As discussed earlier, we strongly recommend that a national- or state-level policy be developed that would (perhaps as a supplement to the UCR system) encourage local police departments to digitally store historical crimes data. To encourage these activities, we see considerable opportunities for pilot-testing and joint funding of crime database improvements by research institutions, universities, non-profit organizations, and government agencies in cooperation with individual police departments and the criminal justice agencies within state and federal governments.
While the statistical results of this study show a significant effect of neighborhood crime rates on travel behavior, the difficulties in obtaining crime data from jurisdictions as well as some of the remaining questions we have that will require further study mean that at this time, we do not believe crime data should be routinely incorporated into travel demand modeling practices. Our recommendations for further research include a more disaggregated approach to measuring crimes, where crime hot spots are identified and the distances from these hot spots to the residences of each survey household are measured. More precise household locations may be available from MTC for BATS 2000 data—GIS locations that could provide a more precise measurement of a survey household’s distance to a crime hot spot. We further recommend investigation of the potential for a self-selection bias in San Francisco households. Additional survey work that collects travel, demographic, and attitudinal data could provide insights into whether San Francisco residents tend to be predisposed to travel by walking or transit, irrespective of the high crime conditions they may experience in their neighborhoods.
Finally, we recommend extending the analytic techniques explored here beyond home-based trips to include those with origins outside the home and developing crime, urban form, and transit accessibility variables for trip destinations as well as for trip origins. Such follow-up studies would improve the predictive strength of the models because it is likely that people choose mode