Video super-resolution (VSR) technology excels in reconstructing low-qualityvideo, avoiding unpleasant blur effect caused by interpolation-basedalgorithms. However, vast computation complexity and memory occupation hampersthe edge of deplorability and the runtime inference in real-life applications,especially for large-scale VSR task. This paper explores the possibility ofreal-time VSR system and designs an efficient and generic VSR network, termedEGVSR. The proposed EGVSR is based on spatio-temporal adversarial learning fortemporal coherence. In order to pursue faster VSR processing ability up to 4Kresolution, this paper tries to choose lightweight network structure andefficient upsampling method to reduce the computation required by EGVSR networkunder the guarantee of high visual quality. Besides, we implement the batchnormalization computation fusion, convolutional acceleration algorithm andother neural network acceleration techniques on the actual hardware platform tooptimize the inference process of EGVSR network. Finally, our EGVSR achievesthe real-time processing capacity of [email protected] Compared with TecoGAN, themost advanced VSR network at present, we achieve 85.04% reduction ofcomputation density and 7.92x performance speedups. In terms of visual quality,the proposed EGVSR tops the list of most metrics (such as LPIPS, tOF, tLP,etc.) on the public test dataset Vid4 and surpasses other state-of-the-artmethods in overall performance score. The source code of this project can befound on https://github.com/Thmen/EGVSR.