Comparing Twitter and LODES Data for Detecting Commuter Mobility Patterns

Transportation policy decisions have historically relied on understanding commuting patterns, the journey-to-work. While scholars often acknowledge the value inherent in examining commuting patterns holistically by considering both work and non-work trips, the difficulty of assembling data about non-work trips has restricted their inclusion in regional/large scale analyses. The overarching goal of this research project is to examine the feasibility of using geo-referenced tweets and Open Street Map (OSM) as proxy measures to understand and explain a robust range of commuting behaviors and spatio-temporal movement patterns.

Specifically, the research team uses LEHD Origin Destination Employment Statistics (LODES), a dataset created by the US Census Bureau to develop a baseline of movement patterns in the nine-county San Francisco Bay Area region. The LODES dataset provides spatial distributions of workers' employment and residential locations and the relation between the two at the Census block-level. Working with our research partners at the University of Salzburg and Hunter College, CUNY, we extract commuter patterns from social media data (in this case, geo-referenced tweets). We use semantic clustering methods (interpreting the text of the tweets themselves) to identify the purpose of the trip. We compare Census-derived data with social media-derived data in order to identify spatial movement clusters. In other words, the research team uses social media data to explain movement patterns that fall outside of conventional journey-to-work trips. The incorporation of address-level points-of-interest data derived from OSM provides an additional opportunity to characterize the trip purpose. In addition to the locational references, the social media data also contains date/time stamps that allow for a spatio-temporal analysis (weekday-weekend, seasonal) of movement patterns. The unique shutdown as a result of COVID-19 allows the research team an additional opportunity to examine non-work trips.

This research project investigates the benefits and limitations of using continuously and publicly available volunteered geographic information (tweets and OSM) to complement official census-derived data to conduct data-driven transportation policy research.

University: 
Mineta Consortium for Transportation Mobility
San José State University
Principal Investigator: 
Dr. Laxmi Ramasubramanian; Dr. Jochen Albrecht; Dr. Bernd Resch
PI Contact Information: 

Mineta Transportation Institute
San José State University
210 N. 4th St., 4th Floor
San José, CA 95112
laxmi.ramasubramanian@sjsu.edu

Funding Source(s) and Amounts Provided (by each agency or organization): 

U.S. Department of Transportation, Office of the Assistant Secretary for Research and Technology – $9,472.10
State of California SB1 - $18,860.00

Total Project Cost: 
$28,332.10
Agency ID or Contract Number: 
69A3551747127
Dates: 
May 2020 to February 2021
Project Number: 
2037

-

CSUTC
MCEEST
MCTM
NTFC
NTSC

Contact Us

SJSU Research Foundation   210 N. 4th Street, 4th Floor, San Jose, CA 95112    Phone: 408-924-7560   Email: mineta-institute@sjsu.edu