Abstract
Retrieval-augmented generation models offer many benefits over standalone language models: besides a textual answer to a given query, they provide provenance items retrieved from an updateable knowledge base. However, they are also more complex systems and need to handle long inputs. In this work, we introduce FiD-Light to strongly increase the efficiency of the state-of-the-art retrieval-augmented FiD model, while maintaining the same level of effectiveness. Our FiD-Light model constrains the information flow from the encoder (which encodes passages separately) to the decoder (using concatenated encoded representations). Furthermore, we adapt FiD-Light with re-ranking capabilities through textual source pointers, to improve the top-ranked provenance precision. Our experiments on a diverse set of seven knowledge-intensive tasks (KILT) show FiD-Light consistently improves the Pareto frontier between query latency and effectiveness. FiD-Light with source pointing sets substantial new state-of-the-art results on six KILT tasks for combined text generation and provenance retrieval evaluation, while maintaining reasonable efficiency.