In this paper, we propose a method for training neural networks when we havea large set of data with weak labels and a small amount of data with truelabels. In our proposed model, we train two neural networks: a target network,the learner and a confidence network, the meta-learner. The target network isoptimized to perform a given task and is trained using a large set of unlabeleddata that are weakly annotated. We propose to control the magnitude of thegradient updates to the target network using the scores provided by the secondconfidence network, which is trained on a small amount of supervised data. Thuswe avoid that the weight updates computed from noisy labels harm the quality ofthe target network model.