PreAugNet: improve data augmentation for industrial defect classification with small-scale training data

Isack Farady, Chih Yang Lin, Ming Ching Chang

Research output: Contribution to journalArticlepeer-review

4 Scopus citations


With the prevalence of deep learning and convolutional neural network (CNN), data augmentation is widely used for enriching training samples to gain model training improvement. Data augmentation is important when training samples are scarce. This work focuses on improving data augmentation for training an industrial steel surface defect classification network, where the performance is largely depending on the availability of high-quality training samples. It is very difficult to find a sufficiently large dataset for this application in real-world settings. When it comes to synthetic data augmentation, the performance is often degraded by incorrect class labels, and a large effort is required to generate high-quality samples. This paper introduces a novel off-line pre-augmentation network (PreAugNet) which acts as a class boundary classifier that can effectively screen the quality of the augmented samples and improve image augmentation. This PreAugNet can generate augmented samples and update decision boundaries via an independent support vector machine (SVM) classifier. New samples are automatically distributed and combined with the original data for training the target network. The experiments show that these new augmentation samples can improve classification without changing the target network architecture. The proposed method for steel surface defect inspection is evaluated on three real-world datasets: AOI steel defect dataset, MT, and NEU datasets. PreAugNet significantly increases the accuracy by 3.3% (AOI dataset), 6.25% (MT dataset) and 2.1% (NEU dataset), respectively.

Original languageEnglish
Pages (from-to)1233-1246
Number of pages14
JournalJournal of Intelligent Manufacturing
Issue number3
StatePublished - Mar 2024


  • CNN
  • Data augmentation
  • Decision boundary
  • PreAugNet
  • Surface defect classification
  • Synthetic sample generation


Dive into the research topics of 'PreAugNet: improve data augmentation for industrial defect classification with small-scale training data'. Together they form a unique fingerprint.

Cite this