While reinforcement learning has achieved considerable successes in recentyears, state-of-the-art models are often still limited by the size of state andaction spaces. Model-free reinforcement learning approaches use some form ofstate representations and the latest work has explored embedding techniques foractions, both with the aim of achieving better generalization andapplicability. However, these approaches consider only states or actions,ignoring the interaction between them when generating embedded representations.In this work, we propose a new approach for jointly embedding states andactions that combines aspects of model-free and model-based reinforcementlearning, which can be applied in both discrete and continuous domains.Specifically, we use a model of the environment to obtain embeddings for statesand actions and present a generic architecture that uses these to learn apolicy. In this way, the embedded representations obtained via our approachenable better generalization over both states and actions by capturingsimilarities in the embedding spaces. Evaluations of our approach on severalgaming and recommender system environments show it significantly outperformsstate-of-the-art models in discrete domains with large state/action space, thusconfirming the efficacy of joint embedding and its overall superiorperformance.