We introduce a novel learning method for 3D pose estimation from colorimages. While acquiring annotations for color images is a difficult task, ourapproach circumvents this problem by learning a mapping from paired color anddepth images captured with an RGB-D camera. We jointly learn the pose fromsynthetic depth images that are easy to generate, and learn to align thesesynthetic depth images with the real depth images. We show our approach for thetask of 3D hand pose estimation and 3D object pose estimation, both from colorimages only. Our method achieves performances comparable to state-of-the-artmethods on popular benchmark datasets, without requiring any annotations forthe color images.