Date of Publication

3-1-2019

Document Type

Master's Thesis

Degree Name

Master of Science in Computer Science

Subject Categories

Computer Sciences

College

College of Computer Studies

Department/Unit

Computer Science

Thesis Adviser

Arnulfo P. Azcarraga

Defense Panel Chair

Conrado D. Ruiz, Jr.

Defense Panel Member

Joel P. Ilao
Arnulfo P. Azcarraga

Abstract/Summary

Despite the remarkable results and numerous advancements on neural style transfer, enabling artistic freedom through the control over perceptual factors such as pattern density and stroke strength remains a challenging problem. A recent work on fast stylization networks is able to offer some degree of controllability on the pattern density by changing the resolution of the inputs. However, their solution requires a dedicated network architecture that can only accommodate a predefined set of resolutions. In this work, we propose a much simpler solution by addressing the fundamental limitation of neural style transfer models that uses the Gram matrix as its style representation. More specifically, we replace the Gram matrix with a covariance matrix in order to better capture negative spatial correlations. We show that this simple modification allows the model to handle a wider range of input resolutions. We also show that selectively manipulating the covariance matrix allows us to control the stroke strengths independently from the pattern density. Our method compares favorably against several state-of-the-art neural style transfer models. Moreover, since our approach is focused on manipulating and improving the Gram matrix, it is not dependent on any network architecture. This means that all the advancements on neural style transfer that use the Gram matrix as its style representation can directly benefit from our findings.

Abstract Format

html

Language

English

Format

Electronic

Accession Number

CDTG007956

Keywords

Image processing; Image transmission

Upload Full Text

wf_yes

Embargo Period

12-5-2022

Share

COinS