Abstract
Reusing pre-collected data from different domains is an appealing solutionfor decision-making tasks that have insufficient data in the target domain butare relatively abundant in other related domains. Existing cross-domain policytransfer methods mostly aim at learning domain correspondences or correctionsto facilitate policy learning, such as learning domain/task-specificdiscriminators, representations, or policies. This design philosophy oftenresults in heavy model architectures or task/domain-specific modeling, lackingflexibility. This reality makes us wonder: can we directly bridge the domaingaps universally at the data level, instead of relying on complex downstreamcross-domain policy transfer models? In this study, we propose the Cross-DomainTrajectory EDiting (xTED) framework that employs a specially designed diffusionmodel for cross-domain trajectory adaptation. Our proposed model architectureeffectively captures the intricate dependencies among states, actions, andrewards, as well as the dynamics patterns within target data. By utilizing thepre-trained diffusion as a prior, source domain trajectories can be transformedto match with target domain properties while preserving original semanticinformation. This process implicitly corrects underlying domain gaps, enhancingstate realism and dynamics reliability in the source data, and allowingflexible incorporation with various downstream policy learning methods. Despiteits simplicity, xTED demonstrates superior performance in extensive simulationand real-robot experiments.