Abstract
Artificial intelligence systems significantly impact the environment,particularly in natural language processing (NLP) tasks. These tasks oftenrequire extensive computational resources to train deep neural networks,including large-scale language models containing billions of parameters. Thisstudy analyzes the trade-offs between energy consumption and performance acrossthree neural language models: two pre-trained models (T5-base and BART-base),and one large language model (LLaMA 3-8B). These models were fine-tuned for thetext summarization task, focusing on generating research paper highlights thatencapsulate the core themes of each paper. A wide range of evaluation metrics,including ROUGE, METEOR, MoverScore, BERTScore, and SciBERTScore, were employedto assess their performance. Furthermore, the carbon footprint associated withfine-tuning each model was measured, offering a comprehensive assessment oftheir environmental impact. This research underscores the importance ofincorporating environmental considerations into the design and implementationof neural language models and calls for the advancement of energy-efficient AImethodologies.