Abstract
This paper explores Reinforcement learning (RL) policy robustness bysystematically analyzing network parameters under internal and externalstresses. Inspired by synaptic plasticity in neuroscience, synaptic filteringintroduces internal stress by selectively perturbing parameters, whileadversarial attacks apply external stress through modified agent observations.This dual approach enables the classification of parameters as fragile, robust,or antifragile, based on their influence on policy performance in clean andadversarial settings. Parameter scores are defined to quantify thesecharacteristics, and the framework is validated on PPO-trained agents in Mujococontinuous control environments. The results highlight the presence ofantifragile parameters that enhance policy performance under stress,demonstrating the potential of targeted filtering techniques to improve RLpolicy adaptability. These insights provide a foundation for futureadvancements in the design of robust and antifragile RL systems.