Tracking human mobility using Twitter through natural language processing techniques

Date of Publication

2018

Document Type

Master's Thesis

Degree Name

Master of Science in Computer Science

College

College of Computer Studies

Department/Unit

Computer Science

Thesis Adviser

Charibeth K. Cheng

Defense Panel Chair

Ethel C. Ong

Defense Panel Member

Charibeth K. Cheng
Courtney Anne M. Ngo
Rafael A. Cabredo

Abstract/Summary

Social media has been proven to be a reliable source of user-generated data that can be used to extract important information regarding different topics, one of which is tracking human mobility. Information on human mobility, which pertains to human travel patterns, can be retrieved from social media, since people of all ages post all sorts of updates on various social media sites, including updates on the places they visit all throughout the day. This research built a system that extracts location, activity and time information from tweets using natural language processing techniques in order to present and visualize human mobility patterns on a layered map. It was discovered that people who post in Manila seldom use the GPS on their phones, so there is a need to extract the information from the tweet content itself. Based on the results, it is possible to extract location and activity information using POS tags. It was also observed in this study that the activity done by the user is usually related to the location. However, it was also revealed that activities are usually not explicitly stated in the text. With that being said, the time information must not be dependent on the presence or the tense of the verb in the tweet. Lastly, it was discovered that tweets are not sufficient and cannot be the sole source of data for human mobility. It also lacks information for urban planning, but the information retrieved and the patterns observed from the visualization reveal that these information may be useful for other fields such as advertising and marketing.

Abstract Format

html

Language

English

Format

Electronic

Accession Number

CDTG007692

Shelf Location

Archives, The Learning Commons, 12F Henry Sy Sr. Hall

Physical Description

1 computer disc ; 4 3/4 in.

Keywords

Natural language processing (Computer science); Social networks

This document is currently not available here.

Share

COinS