Missing value imputation on multiple measurements for prediction of liver cancer recurrence: A comparative study

Xiao Ou Ping, Yi Ju Tseng, Ja Der Liang, Guan Tarn Huang, Pei Ming Yang, Feipei Lai

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The problem of missing values frequently occurs during data analysis. Imputation is one of the solutions to handle missing data. Clinical data often contain multiple measurements such as laboratory test results which are measured at different time points. In this study, we compared three imputation methods and their effects on different multiple measurement data sets with different sampling time periods. Data sets of liver cancer were used in this study for classification of liver cancer recurrence based on two types of classification models built by support vector machine (SVM) and random forests. The results report appropriate combinations of imputation methods and sampling time periods which achieve better classification results than those of other imputation methods and periods. These reported the leading imputation method with SVM is significantly different (P<0.001) from mean imputation with SVM which is frequently used by data sets with missing values.

Original languageEnglish
Title of host publicationIntelligent Systems and Applications - Proceedings of the International Computer Symposium, ICS 2014
EditorsWilliam Cheng-Chung Chu, Stephen Jenn-Hwa Yang, Han-Chieh Chao
PublisherIOS Press
Pages1930-1939
Number of pages10
ISBN (Electronic)9781614994831
DOIs
StatePublished - 2015
EventInternational Computer Symposium, ICS 2014 - Taichung, Taiwan
Duration: 12 Dec 201414 Dec 2014

Publication series

NameFrontiers in Artificial Intelligence and Applications
Volume274
ISSN (Print)0922-6389

Conference

ConferenceInternational Computer Symposium, ICS 2014
Country/TerritoryTaiwan
CityTaichung
Period12/12/1414/12/14

Keywords

  • imputation
  • Missing values
  • random forests
  • support vector machine

Fingerprint

Dive into the research topics of 'Missing value imputation on multiple measurements for prediction of liver cancer recurrence: A comparative study'. Together they form a unique fingerprint.

Cite this