Image recognition and quality assessment are two important viewing tasks,while potentially following different visual mechanisms. This paperinvestigates if the two tasks can be performed in a multitask learning manner.A sequential spatial-channel attention module is proposed to simulate thevisual attention and contrast sensitivity mechanisms that are crucial forcontent recognition and quality assessment. Spatial attention is shared betweencontent recognition and quality assessment, while channel attention is solelyfor quality assessment. Such attention module is integrated into Transformer tobuild a uniform model for the two viewing tasks. The experimental results havedemonstrated that the proposed uniform model can achieve promising performancefor both quality assessment and content recognition tasks.