Interpretable Stylistic Variation in Human and LLM Writing Across Genres, Models, and Decoding Strategies

April 15, 20262604.14111

Swati Rallapalli, Shannon Gallagher, Ronald Yurko, Tyler Brooks, Chuck Loughin + 2 more

cs.CL

TLDR

This paper analyzes stylistic variations in human and LLM writing across genres, models, and decoding strategies, finding genre and model are key.

Key contributions

LLM text differentiators are robust to generation conditions like prompting.
Genre has a stronger influence on stylistic features than the text's source.
Chat model variants generally cluster together in stylistic space.
Model type impacts style more significantly than decoding strategy.

Why it matters

This research provides crucial insights into how LLMs generate text stylistically. It highlights that model and genre are more influential than prompting or decoding, guiding better LLM application and detection efforts.

Original Abstract

Large Language Models (LLMs) are now capable of generating highly fluent, human-like text. They enable many applications, but also raise concerns such as large scale spam, phishing, or academic misuse. While much work has focused on detecting LLM-generated text, only limited work has gone into understanding the stylistic differences between human-written and machine-generated text. In this work, we perform a large scale analysis of stylistic variation across human-written text and outputs from 11 LLMs spanning 8 different genres and 4 decoding strategies using Douglas Biber's set of lexicogrammatical and functional features. Our findings reveal insights that can guide intentional LLM usage. First, key linguistic differentiators of LLM-generated text seem robust to generation conditions (e.g., prompt settings to nudge them to generate human-like text, or availability of human-written text to continue the style); second, genre exerts a stronger influence on stylistic features than the source itself; third, chat variants of the models generally appear to be clustered together in stylistic space, and finally, model has a larger effect on the style than decoding strategy, with some exceptions. These results highlight the relative importance of model and genre over prompting and decoding strategies in shaping the stylistic behavior of machine-generated text.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers