This study constructs a systematic and quantifiable comparative framework to investigate the word-formation patterns of Internet slang in China and Japan by integrating quantitative text analysis with pre-trained language models (PLMs). Utilizing computational linguistics tools, we conducted fundamental metric analyses – including word frequency distribution, co-occurrence network modeling, and correspondence analysis – on large-scale corpora harvested from prominent social media platforms. Furthermore, the research introduces pre-trained models to transform core lexical items into high-dimensional semantic vectors, enabling advanced computations such as semantic similarity matrix construction, hierarchical clustering, and dimensionality reduction visualization. These methods allow for a deep exploration of the underlying semantic generation and association mechanisms of online neologisms. By comparing the divergences between the two, this study reveals the similarities and differences in linguistic innovation paths within Chinese and Japanese cyber-contexts, providing an empirical analytical framework for applying computational linguistics to the comparative study of dynamic sociolinguistic phenomena.
References
[1] Carney, T. R. (2023) Drafting definitions with polisemy and semantic change in mind. Obiter, 44(3), 561-574.
[2] Tuptim, N. (2023) Japanese industrial technical terms: Word formation, word type and word pedagogical applications. Kasetsart Journal of Social Sciences, 44(3), 797-806.
[3] Meilantari, N. L. G., Aritonang, B. (2024) Clipping and blending words in Japanese internet Slang: philosophical perspective. Mahadaya: Jurnal Bahasa, Sastra, Dan Budaya, 4(1), 11-18.
[4] Al Hikmah, I., Machmoed, H., Sahib, H. (2024) An analysis of word formation processes found in TikTok application. ELS Journal on Interdisciplinary Studies in Humanities, 7(1), 160-169.
[5] Albladi, A., Islam, M., Das, A., Bigonah, M., Zhang, Z., Jamshidi, F., Seals, C. (2025) Hate speech detection using large language models: a comprehensive review. IEEE Access, 13, 20871-20892.
[6] Meilantari, N. L. G., Aritonang, B. (2024) Clipping and blending words in Japanese internet Slang: philosophical perspective. Mahadaya: Jurnal Bahasa, Sastra, Dan Budaya, 4(1), 11-18.
[7] Hamasalih, H. A., Ghafoory, D. S. (2025) Word formation processes used in Kurdish community Facebook platform. Zanco Journal of Human Sciences, 29(SpA), 275-297.
[8] Yuniarto, H. (2025) Evolving Neologisms and Ephemeral Discourse: WeChat index analysis of 2021’s Chinese Internet buzzwords. PAROLE: Journal of Linguistics and Education, 15(1), 90-99.
[9] Yang, Y., Foucault Welles, B. (2026) Subversive humor and platformed Asianness: How Asians use neologisms to define themselves and others online. The Communication Review, 1-36.
[10] Fitria, T. N. (2022) Analysis of word formation process in online shop’s terminologies. Kajian Linguistik Dan Sastra, 7(2), 67-80.
Share and Cite
Gao, P. (2026) A Comparative Study of Word-Formation Patterns in Chinese and Japanese Internet Slang – A Computational Linguistics Perspective. Journal of Social Development and History, 2(1), 35-41.
