Autonomous Vehicles (AVs) are required to operate safely and efficiently indynamic environments. For this, the AVs equipped with JointRadar-Communications (JRC) functions can enhance the driving safety byutilizing both radar detection and data communication functions. However,optimizing the performance of the AV system with two different functions underuncertainty and dynamic of surrounding environments is very challenging. Inthis work, we first propose an intelligent optimization framework based on theMarkov Decision Process (MDP) to help the AV make optimal decisions inselecting JRC operation functions under the dynamic and uncertainty of thesurrounding environment. We then develop an effective learning algorithmleveraging recent advances of deep reinforcement learning techniques to findthe optimal policy for the AV without requiring any prior information aboutsurrounding environment. Furthermore, to make our proposed framework morescalable, we develop a Transfer Learning (TL) mechanism that enables the AV toleverage valuable experiences for accelerating the training process when itmoves to a new environment. Extensive simulations show that the proposedtransferable deep reinforcement learning framework reduces the obstacle missdetection probability by the AV up to 67% compared to other conventional deepreinforcement learning approaches.