Vision System for Hand Gesture Recognition (VISOR)

Date of Publication

2006

Document Type

Bachelor's Thesis

Degree Name

Bachelor of Science in Computer Science

Subject Categories

Computer Sciences

College

College of Computer Studies

Department/Unit

Computer Science

Thesis Adviser

Joel P. Ilao

Defense Panel Member

Alexis V. Pantola
Rigan Ap-Apid
Jesus E. Gonzalez

Abstract/Summary

Vision System for Hand Gesture Recognition (VISOR) is a software application that uses computer vision algorithms to recognize dynamic, continuous single-hand gestures from a predefined vocabulary. The system is user-independent, copes with different backgrounds, and does not require the user to wear a long-sleeved garment (a limitation commonly found in similar systems). A standard USB web camera, placed near the workstation that runs the application, captures the gesture sequences. In every captured frame, the hand is detected and then tracked. Detection combines skin-color segmentation with shape analysis based on convexity defects and several empirically established heuristics about the shape of the human hand. Once the hand is detected, the user is given a time limit within which the gesture sequence must be performed. The gesture is recognized by extracting temporal and structural information, such as motion, hand shape, and orientation, from the image sequence. After feature extraction, scoreboarding over these features is applied to classify the sequence as one of the valid gestures.
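The abstract does not spell out the scoreboarding step, so the following is only a minimal sketch of one way such a scheme could work, not the thesis's implementation. It assumes each gesture in the vocabulary is described by a single expected motion, hand shape, and orientation, and that an observed sequence earns points for every feature that matches. The gesture names, feature labels, weights, and the MIN_SCORE threshold are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GestureTemplate:
    """Expected feature values for one gesture (hypothetical representation)."""
    name: str
    motion: str
    hand_shape: str
    orientation: str


# Hypothetical vocabulary; the thesis's actual gesture set is not given in the abstract.
VOCABULARY = [
    GestureTemplate("swipe_left", "left", "open_palm", "vertical"),
    GestureTemplate("swipe_right", "right", "open_palm", "vertical"),
    GestureTemplate("push", "forward", "fist", "vertical"),
]

# Assumed weights: points awarded when an observed feature matches the template.
WEIGHTS = {"motion": 2, "hand_shape": 1, "orientation": 1}

# Assumed rejection threshold: below this score, no gesture is reported.
MIN_SCORE = 3


def score(template, observed):
    """Sum the weights of all features on which the observation matches the template."""
    return sum(
        weight
        for feature, weight in WEIGHTS.items()
        if getattr(template, feature) == observed.get(feature)
    )


def recognize(observed, vocabulary=VOCABULARY):
    """Return the name of the best-scoring gesture, or None if the score is too low."""
    best = max(vocabulary, key=lambda template: score(template, observed))
    return best.name if score(best, observed) >= MIN_SCORE else None


if __name__ == "__main__":
    features = {"motion": "left", "hand_shape": "open_palm", "orientation": "vertical"}
    print(recognize(features))
```

In this sketch, an observation that matches a template on all three features scores 4 and is accepted, while one that matches only on hand shape scores 1 and is rejected. A real system would derive these features from the tracked image sequence rather than from hand-written labels.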

The system's performance was evaluated in an indoor environment. The prototype was able to detect a user's bare hand and track it throughout the duration of the gesture. During testing, it achieved a hand detection rate of 77.65% with 6.47% false positives. The system recognized gestures with a success rate of 70% for the old approach and 80% for the new approach while operating in real time.

Abstract Format

html

Language

English

Format

Print

Accession Number

TU13554

Shelf Location

Archives, The Learning Commons, 12F, Henry Sy Sr. Hall

Physical Description

1 v. (various foliations) : ill. (some col.) ; 28 cm.

Keywords

Motion perception (Vision)--Computer simulation; Computer vision

This document is currently not available here.