Skip to main content
Article
Effects of Normalization Techniques on Logistic Regression in Data Science
Journal of Information Systems Applied Research
  • Nureni Adekunle Adeyemo, Georgia Southern University
  • Hayden Wimmer, Georgia Southern University
  • Loreen Powell, Bloomsburg University
Document Type
Article
Publication Date
8-1-2019
Disciplines
Abstract

The improvements in the data science profession have allowed the introduction of several mathematical ideas to social patterns of data. This research seeks to investigate how different normalization techniques can affect the performance of logistic regression. The original dataset was modeled using the SQL Server Analysis Services (SSAS) Logistic Regression model. This became the baseline model for the research. The normalization methods used to transform the original dataset were described. Next, different logistic models were built based on the three normalization techniques discussed. This work found that, in terms of accuracy, decimal scaling marginally outperformed min-max and z-score scaling. But when Lift was used to evaluate the performances of the models built, decimal scaling and z-score slightly performed better than min-max method. Future work is recommended to test the regression model on other datasets specifically those whose dependent variable are a 2-category problem or those with varying magnitude independent attributes.

Copyright

Copyright © Information Systems and Computing Academic Professionals (ISCAP). Permission to make digital or hard copies of all or part of this journal for personal or classroom use is granted without fee provided that the copies are not made or distributed for profit or commercial use. All copies must bear this notice and full citation. Permission from the Editor is required to post to servers, redistribute to lists, or utilize in a for-profit or commercial use. Permission requests should be sent to Scott Hunsinger, Editor at editor@jisar.org

Creative Commons License
**Select License for Reuse**
Citation Information
Nureni Adekunle Adeyemo, Hayden Wimmer and Loreen Powell. "Effects of Normalization Techniques on Logistic Regression in Data Science" Journal of Information Systems Applied Research Vol. 12 Iss. 2 (2019) p. 37 - 44 ISSN: 1946-1836
Available at: http://works.bepress.com/hayden-wimmer/97/