Development phases of a generic data mining life cycle (DMLC)

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Data mining projects are complex and have a high failure rate. In order to improve project management and success rates of such projects a life cycle is vital to the overall success of the project. This paper reports on a research project that was concerned with the life cycle development for large scale data mining projects. The paper provides a detailed view of the design and development of a generic data mining life cycle called DMLC. The life cycle aims to support all members of data mining project teams as well as IT managers and academic researchers and can improve project success rates and strategic decision support. An extensive analysis of eight life cycles leads to a list of advantages, disadvantages, and characteristics of the life cycles. This is extended and generates a conglomerate of several guidelines which serve as the foundation for the development of a new generic data mining life cycle. The new life cycle is further developed to incorporate process, people and data aspects respectively. A detailed study of the human resources involved in a data mining project enhances the DMLC.

Original languageEnglish
Title of host publicationInternational Conference on Software Engineering Theory and Practice 2007, SETP 2007
Pages5-11
Number of pages7
Publication statusPublished - 2007
Event2007 International Conference on Software Engineering Theory and Practice, SETP 2007 - Orlando, FL, United States
Duration: 9 Jul 200712 Jul 2007

Publication series

NameInternational Conference on Software Engineering Theory and Practice 2007, SETP 2007

Conference

Conference2007 International Conference on Software Engineering Theory and Practice, SETP 2007
Country/TerritoryUnited States
CityOrlando, FL
Period9/07/0712/07/07

Keywords

  • Data mining
  • Data mining life cycle
  • DMLC
  • Knowledge discovery
  • Life cycle analysis

Fingerprint

Dive into the research topics of 'Development phases of a generic data mining life cycle (DMLC)'. Together they form a unique fingerprint.

Cite this