Uncanny Semantics. How AI and Human Authors Use Language Differently in Academic Writing

Dennis Wegerhoff

doi:10.62408/ai-ling.v5i1.32

DOI

https://doi.org/10.62408/ai-ling.v5i1.32

Keywords

artificial intelligence, human–machine authorship, word embeddings, semantic analysis, epistemic stance, commitment

Published

February 18, 2026

Journal

Published in Vol. 5 No. 1 (2026) of AI-Linguistica. Linguistic Studies on AI-Generated Texts and Discourses.

AI-Linguistica. Linguistic Studies on AI-Generated Texts and Discourses is a new scholarly journal that aims to provide a publishing platform for researchers from all areas of Linguistics (interfacing with neighboring fields: Communication Science, Media and Journalism Studies, Computational Linguistics) to reflect on generated texts from a variety of perspectives: theoretical, descriptive, and applied.

We understand ‘generated texts’ in a broad sense, including formats as diverse as texts generated by Large Language Models, AI-powered smart agents (e.g., chatbots, voice assistants, and social bots), writing assistance tools, template-based software, and neural machine translation services.

Abstract

This study explores the semantic differences between human-written and AI-generated academic texts by applying word embedding techniques to a curated corpus of 325 introductions from linguistic articles. The corpus includes human-authored texts and AI-generated texts produced by six language models (OpenAI, Google, and DeepSeek; base and advanced). Each topic was prompted in two different ways: plain and academic. Using cosine similarity, the most frequently occurring lemmas were grouped into semantic categories. The analysis reveals that AI-generated texts, especially under academic prompts, overuse positive-evaluative and methodological vocabulary (e.g., central, crucial, analysis, methodology) and explicitly refer to text structure more often than the plainly prompted texts (e.g., section, chapter). In contrast, human authors employ more epistemically cautious, critical, evaluative, and connective language (e.g., possibly, inconsistent, by no means). I propose that the relative absence of such epistemic markers in AI texts, combined with their tendency to exaggerate the importance of certain topics or data, reflects a pattern of pseudo-commitment: the models produce syntactically assertive, formally academic prose but only weakly modulate epistemic stance and critical engagement, which may contribute to the reported sense of weirdness in AI-generated academic writing.
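The grouping step mentioned in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the article: the three-dimensional vectors are invented toy values (real word embeddings have hundreds of dimensions), the 0.8 threshold is arbitrary, and the greedy grouping procedure in the hypothetical `group_lemmas` helper is only one simple way to turn pairwise cosine similarities into categories. The abstract does not specify how the categories were actually formed.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_u * norm_v)


def group_lemmas(embeddings, threshold=0.8):
    """Greedy grouping (hypothetical helper, not the article's procedure).

    Each lemma joins the first group whose seed lemma (the group's first
    member) is at least `threshold` similar; otherwise it starts a new group.
    """
    groups = []
    for lemma, vec in embeddings.items():
        for group in groups:
            seed = group[0]
            if cosine_similarity(vec, embeddings[seed]) >= threshold:
                group.append(lemma)
                break
        else:
            groups.append([lemma])
    return groups


# Invented toy vectors; the lemmas are examples quoted in the abstract.
TOY_EMBEDDINGS = {
    "central": [0.9, 0.1, 0.0],
    "crucial": [0.85, 0.15, 0.05],
    "analysis": [0.1, 0.9, 0.1],
    "methodology": [0.15, 0.85, 0.1],
    "possibly": [0.0, 0.1, 0.95],
}

groups = group_lemmas(TOY_EMBEDDINGS)
print(groups)
```

With these toy values, the evaluative lemmas, the methodological lemmas, and the epistemic marker end up in three separate groups. In practice the vectors would come from a trained embedding model, and the result would depend on the model, the threshold, and the grouping strategy chosen.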

Details

Issue

Vol. 5 No. 1 (2026): AI-Linguistica

Section

Full-Length Article

How to Cite

Wegerhoff, D. (2026). Uncanny Semantics. How AI and Human Authors Use Language Differently in Academic Writing. AI-Linguistica. Linguistic Studies on AI-Generated Texts and Discourses, 5(1). https://doi.org/10.62408/ai-ling.v5i1.32

License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
