Skip to main content
Article
Manifold Learning for Multivariate Variable-Length Sequences With an Application to Similarity Search
IEEE Transactions on Neural Networks and Learning Systems (2016)
  • Shen-Shyang Ho, Rowan University
  • Peng Dai
  • Frank Rudzicz
Abstract
Multivariate variable-length sequence data are becoming ubiquitous with the technological advancement in mobile devices and sensor networks. Such data are difficult to compare, visualize, and analyze due to the nonmetric nature of data sequence similarity measures. In this paper, we propose a general manifold learning framework for arbitrary-length multivariate data sequences driven by similarity/distance (parameter) learning in both the original data sequence space and the learned manifold. Our proposed algorithm transforms the data sequences in a nonmetric data sequence space into feature vectors in a manifold that preserves the data sequence space structure. In particular, the feature vectors in the manifold representing similar data sequences remain close to one another and far from the feature points corresponding to dissimilar data sequences. To achieve this objective, we assume a semisupervised setting where we have knowledge about whether some of data sequences are similar or dissimilar, called the instance-level constraints. Using this information, one learns the similarity measure for the data sequence space and the distance measures for the manifold. Moreover, we describe an approach to handle the similarity search problem given user-defined instance level constraints in the learned manifold using a consensus voting scheme. Experimental results on both synthetic data and real tropical cyclone sequence data are presented to demonstrate the feasibility of our manifold learning framework and the robustness of performing similarity search in the learned manifold.
Keywords
  • tropical cyclone.,
  • Application,
  • embedding,
  • feature extraction,
  • isometric feature mapping (ISOMAP),
  • longest common subsequence (LCSS),
  • metric learning,
  • similarity learning,
  • similarity search
Disciplines
Publication Date
June, 2016
DOI
10.1109/TNNLS.2015.2399102
Citation Information
Shen-Shyang Ho, Peng Dai and Frank Rudzicz. "Manifold Learning for Multivariate Variable-Length Sequences With an Application to Similarity Search" IEEE Transactions on Neural Networks and Learning Systems Vol. 27 Iss. 6 (2016) p. 1333 - 1344 ISSN: 2162-237X
Available at: http://works.bepress.com/shen-shyang-ho/1/