In the last few years, researchers have applied machine learning strategies in the context of vehicular platoons to increase the safety and efficiency of cooperative transportation. Reinforcement Learning methods have been employed in the longitudinal spacing control of Cooperative Adaptive Cruise Control systems, but to date, none of those studies has addressed the problem of disturbance rejection in such scenarios. Characteristics such as uncertain model parameters and external interference may prevent agents from reaching zero spacing error when traveling at cruising speed. In addition, complex communication topologies lead to topology-specific training processes that cannot be generalized to other contexts, demanding retraining every time the configuration changes. Therefore, in this paper, we propose an approach to generalize the training process of a vehicular platoon, such that the acceleration command of each agent becomes independent of the network topology. We also model the acceleration input as a term with integral action, such that the Artificial Neural Network is capable of learning corrective actions when the states are disturbed by unknown effects. We illustrate the effectiveness of our proposal with experiments using different network topologies, uncertain parameters, and external forces. Comparative analyses, in terms of steady-state error and overshoot, were conducted against the state-of-the-art literature. The findings offer new insights concerning the generalization and robustness of Reinforcement Learning in the control of autonomous platoons.