Advances in Production Engineering & Management

Volume 18 | Number 3 | September 2023 | pp 303–316

https://doi.org/10.14743/apem2023.3.474

An improved deep reinforcement learning approach: A case study for optimisation of berth and yard scheduling for bulk cargo terminal
Ai, T.; Huang, L.; Song, R.J.; Huang, H.F.; Jiao, F.; Ma, W.G.

A B S T R A C T
Ship handling is the cornerstone of port production operations and requires careful allocation of diverse production resources to improve the efficiency of loading and unloading. This paper introduces an optimisation method based on deep reinforcement learning for scheduling berths and yards at a bulk cargo terminal. A Markov decision process (MDP) model is formulated by analysing the scheduling processes and unloading operations of the bulk port import business. The study presents an enhanced reinforcement learning algorithm called PS-D3QN (Prioritised Experience Replay and Softmax strategy-based Dueling Double Deep Q-Network), which combines the strengths of the Double DQN and Dueling DQN algorithms. The proposed solution is evaluated using actual port data and benchmarked against two other algorithms considered in the paper. The numerical experiments and comparative analysis show that the PS-D3QN algorithm significantly improves the efficiency of berth and yard scheduling in bulk terminals, reduces port operating costs, and eliminates errors associated with manual scheduling. With suitable adjustments, the algorithm can be adapted to scheduling problems in production and manufacturing, such as the job shop scheduling problem and its extensions.
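
The abstract names four building blocks of PS-D3QN: dueling aggregation, the Double DQN target, a softmax action-selection strategy, and prioritised experience replay. The following is a minimal sketch of the standard textbook form of each component, not the authors' implementation. All function names, the `temperature`, `alpha` and `eps` parameters, and their default values are assumptions made for illustration; the paper's network architecture, reward design and hyperparameters are not reproduced here.

```python
import math
import random
from typing import List, Sequence


def dueling_q(value: float, advantages: Sequence[float]) -> List[float]:
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a')."""
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]


def softmax_probs(q_values: Sequence[float], temperature: float = 1.0) -> List[float]:
    """Softmax (Boltzmann) action probabilities; temperature is an assumed knob."""
    q_max = max(q_values)  # subtract the maximum for numerical stability
    exps = [math.exp((q - q_max) / temperature) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(q_values: Sequence[float], temperature: float = 1.0, rng=random) -> int:
    """Draw an action index with probability given by the softmax of its Q-value."""
    probs = softmax_probs(q_values, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def double_dqn_target(
    reward: float,
    gamma: float,
    done: bool,
    q_online_next: Sequence[float],
    q_target_next: Sequence[float],
) -> float:
    """Double DQN target: the online network selects the next action,
    the target network evaluates it."""
    if done:
        return reward
    best = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return reward + gamma * q_target_next[best]


def per_probabilities(
    td_errors: Sequence[float], alpha: float = 0.6, eps: float = 1e-6
) -> List[float]:
    """Proportional prioritised replay: P(i) is proportional to (|delta_i| + eps) ** alpha."""
    priorities = [(abs(d) + eps) ** alpha for d in td_errors]
    total = sum(priorities)
    return [p / total for p in priorities]
```

In a scheduling setting such as the one described, the action index would presumably correspond to a berth or yard assignment, but that mapping depends on the MDP formulation given in the full article.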

A R T I C L E   I N F O
Keywords • Bulk cargo terminal; Scheduling; Optimisation; Markov decision process (MDP) model; Deep reinforcement learning; Prioritised experience replay and softmax strategy-based dueling double deep Q-network
Corresponding author • Huang, L.
Article history • Received 22 August 2023, Revised 5 November 2023, Accepted 7 November 2023
Published on-line • 19 November 2023
