A wide variety of deep reinforcement learning (DRL) models have recently beenproposed to learn profitable investment strategies. The rules learned by thesemodels outperform the previous strategies specially in high frequency tradingenvironments. However, it is shown that the quality of the extracted featuresfrom a long-term sequence of raw prices of the instruments greatly affects theperformance of the trading rules learned by these models. Employing a neuralencoder-decoder structure to extract informative features from complex inputtime-series has proved very effective in other popular tasks like neuralmachine translation and video captioning in which the models face a similarproblem. The encoder-decoder framework extracts highly informative featuresfrom a long sequence of prices along with learning how to generate outputsbased on the extracted features. In this paper, a novel end-to-end model basedon the neural encoder-decoder framework combined with DRL is proposed to learnsingle instrument trading strategies from a long sequence of raw prices of theinstrument. The proposed model consists of an encoder which is a neuralstructure responsible for learning informative features from the inputsequence, and a decoder which is a DRL model responsible for learningprofitable strategies based on the features extracted by the encoder. Theparameters of the encoder and the decoder structures are learned jointly, whichenables the encoder to extract features fitted to the task of the decoder DRL.In addition, the effects of different structures for the encoder and variousforms of the input sequences on the performance of the learned strategies areinvestigated. Experimental results showed that the proposed model outperformsother state-of-the-art models in highly dynamic environments.