Benchmarking AI acceptability and grammaticality in German: A study of ChatGPT and human judgments

Nicholas Catasso

doi:10.62408/ai-ling.v3i1.35

DOI

https://doi.org/10.62408/ai-ling.v3i1.35

Keywords

Large Language Models, ChatGPT, grammaticality, acceptability, German

Published

February 19, 2026

Journal

Published in Vol. 3 No. 1 (2026) of AI-Linguistica. Linguistic Studies on AI-Generated Texts and Discourses.

AI-Linguistica. Linguistic Studies on AI-Generated Texts and Discourses is a new scholarly journal that aims to provide a publishing platform for researchers from all areas of Linguistics (interfacing with neighboring fields: Communication Science, Media and Journalism Studies, Computational Linguistics) to reflect on generated texts from a variety of perspectives: theoretical, descriptive, and applied.

We understand ‘generated texts’ in a broad sense, covering formats as diverse as texts generated by Large Language Models, AI-powered smart agents (e.g. chatbots, voice assistants, social bots), writing assistance tools, template-based software, and neural machine translation services.

Abstract

The rapid development of large language models has opened new avenues for linguistic research, including areas traditionally reliant on native-speaker intuitions. One such domain is grammaticality and acceptability judgment, where speakers assess whether sentences are structurally well-formed and contextually appropriate. This study investigates the extent to which ChatGPT-4 can approximate human judgments in German, focusing on a diverse range of grammatical and usage-related phenomena. A carefully designed set of test items was presented to both the model and native speakers, allowing for a direct comparison. The results show a high degree of alignment in many cases, but also reveal systematic divergences, particularly in contexts involving gradience, sociolinguistic markedness or context-dependent acceptability. These findings demonstrate both the analytical potential and the current limitations of large language models in linguistic research, and contribute to ongoing discussions about their ability to approximate native speaker competence.

Details

Issue

Vol. 3 No. 1 (2026): Natural Language and AI. New Perspectives for Linguistic Studies

Section

Special Issue

How to Cite

Catasso, N. (2026). Benchmarking AI acceptability and grammaticality in German: A study of ChatGPT and human judgments. AI-Linguistica. Linguistic Studies on AI-Generated Texts and Discourses, 3(1). https://doi.org/10.62408/ai-ling.v3i1.35

License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
