Data mining in space physics: MineTool algorithm

H. Karimabadi, T. B. Sipes, H. White, M. Marinucci, A. Dmitriev, J. K. Chao, J. Driscoll, N. Balac

Research output: Contribution to journalArticlepeer-review

12 Scopus citations

Abstract

A novel data mining method called MineTool is introduced which, by virtue of automating the modeling process and model evaluations, makes it more accessible to nonexperts. The technique aggregates the various stages of model building into a four-step process consisting of (1) data segmentation and sampling, (2) variable preselection and transform generation, (3) predictive model estimation and validation, and (4) final model testing. Optimal strategies are chosen for each modeling step. However, the modular design of the MineTool enables the substitution of alternative strategies in any of the four modeling steps. A notable feature of the technique is that the final model is always in closed analytical form rather than "black box" form of most other techniques. MineTool can be used for analysis of data (e.g., time series) as well as images. The utility of the technique is illustrated through several examples based on synthetic data. Application of the technique to analysis of spacecraft data will be presented in subsequent papers.

Original languageEnglish
Article numberA11215
JournalJournal of Geophysical Research: Space Physics
Volume112
Issue number11
DOIs
StatePublished - 1 Nov 2007

Fingerprint

Dive into the research topics of 'Data mining in space physics: MineTool algorithm'. Together they form a unique fingerprint.

Cite this