code_transformed: The Influence of Large Language Models on Code

Abstract

Coding remains one of the most fundamental modes of interaction betweenhumans and machines. With the rapid advancement of Large Language Models(LLMs), code generation capabilities have begun to significantly reshapeprogramming practices. This development prompts a central question: Have LLMstransformed code style, and how can such transformation be characterized? Inthis paper, we present a pioneering study that investigates the impact of LLMson code style, with a focus on naming conventions, complexity, maintainability,and similarity. By analyzing code from over 19,000 GitHub repositories linkedto arXiv papers published between 2020 and 2025, we identify measurable trendsin the evolution of coding style that align with characteristics ofLLM-generated code. For instance, the proportion of snake\_case variable namesin Python code increased from 47% in Q1 2023 to 51% in Q1 2025. Furthermore, weinvestigate how LLMs approach algorithmic problems by examining their reasoningprocesses. Given the diversity of LLMs and usage scenarios, among otherfactors, it is difficult or even impossible to precisely estimate theproportion of code generated or assisted by LLMs. Our experimental resultsprovide the first large-scale empirical evidence that LLMs affect real-worldprogramming style.

Quick Read (beta)

loading the full paper ...