TAP-Net: Transport-and-Pack using Reinforcement Learning

Abstract

We introduce the transport-and-pack(TAP) problem, a frequently encounteredinstance of real-world packing, and develop a neural optimization solutionbased on reinforcement learning. Given an initial spatial configuration ofboxes, we seek an efficient method to iteratively transport and pack the boxescompactly into a target container. Due to obstruction and accessibilityconstraints, our problem has to add a new search dimension, i.e., finding anoptimal transport sequence, to the already immense search space for packingalone. Using a learning-based approach, a trained network can learn and encodesolution patterns to guide the solution of new problem instances instead ofexecuting an expensive online search. In our work, we represent the transportconstraints using a precedence graph and train a neural network, coinedTAP-Net, using reinforcement learning to reward efficient and stable packing.The network is built on an encoder-decoder architecture, where the encoderemploys convolution layers to encode the box geometry and precedence graph andthe decoder is a recurrent neural network (RNN) which inputs the currentencoder output, as well as the current box packing state of the targetcontainer, and outputs the next box to pack, as well as its orientation. Wetrain our network on randomly generated initial box configurations, withoutsupervision, via policy gradients to learn optimal TAP policies to maximizepacking efficiency and stability. We demonstrate the performance of TAP-Net ona variety of examples, evaluating the network through ablation studies andcomparisons to baselines and alternative network designs. We also show that ournetwork generalizes well to larger problem instances, when trained onsmall-sized inputs.

Quick Read (beta)

loading the full paper ...