Skip to main content
Article
Linear Correlation Discovery in Databases: A Data Mining Approach
Data and Knowledge Engineering
  • Roger H. L. Chiang
  • Cecil Eng Huang Chua, Missouri University of Science and Technology
  • Ee-Peng Lim
Abstract

Very little research in knowledge discovery has studied how to incorporate statistical methods to automate linear correlation discovery (LCD). We present an automatic LCD methodology that adopts statistical measurement functions to discover correlations from databases' attributes. Our methodology automatically pairs attribute groups having potential linear correlations, measures the linear correlation of each pair of attribute groups, and confirms the discovered correlation. The methodology is evaluated in two sets of experiments. The results demonstrate the methodology's ability to facilitate linear correlation discovery for databases with a large amount of data.

Department(s)
Business and Information Technology
Keywords and Phrases
  • Association measurement,
  • Data mining,
  • Knowledge discovery in database,
  • Linear correlation
Document Type
Article - Journal
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2005 Elsevier B.V., All rights reserved.
Publication Date
6-1-2005
Publication Date
01 Jun 2005
Disciplines
Citation Information
Roger H. L. Chiang, Cecil Eng Huang Chua and Ee-Peng Lim. "Linear Correlation Discovery in Databases: A Data Mining Approach" Data and Knowledge Engineering Vol. 53 Iss. 3 (2005) p. 311 - 337 ISSN: 0169-023X
Available at: http://works.bepress.com/cecil-chua/31/