Home About APEM Events News Sponsorship
Advances in Production Engineering & Management

Archives > Volume 20 | Number 2 | June 2025 > pp 157–172

Advances in Production Engineering & Management
Volume 20 | Number 2 | June 2025 | pp 157–172

https://doi.org/10.14743/apem2025.2.533

Enhanced product defect forecasting using partitioned attributes and ensemble machine learning
Sun, Y.Y.; Yang, J.H.; Zhai, L.Y.; Liu, N.
ABSTRACT AND REFERENCES (PDF)  |  FULL ARTICLE TEXT (PDF)

A B S T R A C T
This study addresses a critical challenge in industrial big data analytics for smart manufacturing: conventional machine learning methods often fail to account for data discontinuities caused by scrapped defective intermediates in multi-stage production processes, inadvertently treating non-conforming products as qualified during model training. We propose a novel process-aware data analytics framework specifically designed for process industries, featuring: (1) intelligent attribute partitioning based on information flow discontinuity points, and (2) an ensemble modelling approach combining Random Forest and C5.0 Decision Tree algorithms to generate interpretable prediction rules with quantified feature importance rankings. Validated using real-world production data from a Chinese rail steel manufacturer, our methodology demonstrates superior performance by explicitly incorporating process-specific data correlations. The proposed solution effectively mitigates information distortion caused by scrapped intermediates while maintaining operational interpretability – a crucial requirement for industrial implementation. The research results increased the accuracy rate of the test set of the random forest experiment from 88.39 % to 92.69 %, and the accuracy rate of the test set of the decision tree experiment from 71.89 % to 79.15 %. Additionally, the experimental results verify that, compared with the traditional methods, our framework has better applicability in capturing product quality in the manufacturing industry when process attributes are considered.

A R T I C L E   I N F O
Keywords • Intelligent manufacturing; Process industry; Industrial data mining; Defect prediction; C5.0 decision tree; Random forest;Process-oriented analytics; Machine learning
Corresponding authorYang, J.H.
Article history • Received 27 March 2025, Revised 28 May 2025, Accepted 10 June 2025
Published on-line • 29 July 2025

E X P O R T   C I T A T I O N
» RIS format (EndNote, ProCite, RefWorks, and most other reference management software)
» BibTeX (JabRef, BibDesk, and other BibTeX-specific software)
» Plain text

< PREVIOUS ISSUE PAPER   |   NEXT PAPER >