Content Knowledge Base for AI-based Chinese Computer-assisted Pronunciation Training
Abstract
Integrating the state-of-the-art Automatic Technology (AI) technology into the Computer-Assisted Pronunciation Training (CAPT) system to assist language acquisition could address the various drawbacks of conventional pronunciation and enhance its effectiveness. However, little attention has been given to their application in Chinese language acquisition, and the lack of guidelines grounded in Second Language Acquisition (SLA) theories and pronunciation pedagogy has led to the tension between technology and pedagogy, limited the understanding of their application beyond English, and hindered their broader application. Grounded in the knowledge framework of AI-based Chinese CAPT system based on the Richey and Klein’s (2007) Design and Development Research (DDR) framework, the Monitor Theory and the Speech Acquisition Model (SAM), this study aimed to establish the content knowledge base informed by Chinese pronunciation pedagogy and derive evidence-based, pedagogically sound design guidelines for the AI-based Chinese CAPT system for non-native Chinese learners. Alongside the guidelines previously derived from the SLA Interactive theories and Oral Corrective Feedback (OCF) research in the Chinese setting, these could inform future design and development of the system. Employing the qualitative method of thematic analysis, three key themes were identified from a literature review on Chinese pronunciation acquisition, leading to the formulation of pedagogical design guidelines. These findings offer valuable insights into Chinese pronunciation practices and learners’ needs, providing a robust theoretical and empirical foundation for future design and development research. Finally, the study discussed its limitations and contributions.
Downloads
References
Amrate, M. & Tsai, P-h., (2024). Computer-assisted pronunciation training: A systematic review. ReCALL, 37(1), 22–42. https://doi.org/10.1017/S0958344024000181
Anderson, C., Bjorkman, B., Denis, D., Doner, J., Grant, M., Sander, S., & Taniguchi, A. (2022). Essentials of Linguistics(2nd ed.). eCampusOntario. https://openlibrary-repo.ecampusontario.ca/jspui/bitstream/123456789/497/1/Essentials-of-Linguistics-1539375869.pdf
Braun, V., & Clarke, V. (2006). Using thematic analysis in psychology. Qualitative Research in Psychology, 3(2), 77–101. https://doi.org/10.1191/1478088706qp063oa
Broersma, M., & Cutler, A. (2008). Phantom word activation in L2. System, 36, 22-34. DOI: https://doi.org/10.1016/j.system.2007.11.003
Cengiz, B. C. (2023). Computer-Assisted Pronunciation Teaching: An Analysis of Empirical Research. Participatory Educational Research, 10(3). 72-88. https://doi.org/10.17275/per.23.45.10.3
Chen, C. H., Koong, C. S., & Liao, C. (2022). Influences of integrating dynamic assessment into a speech recognition learning design to support students’ English speaking skills, learning anxiety and cognitive load. Educational Technology & Society, 25(1), 1–14. https://www.jstor.org/stable/48647026.
Chen, H. (2020). New understanding of Chinese tone teaching based on cognitive theory. Journal of Tianjin University (Social Sciences), 22(04), 361–367. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk3aQQg3mh3ZtOQ7zHW5fGb-LLPraP_uNirJIQfNG_RWKVm5QkIflCXIOzV5hK3taoeshOxIS0bs5a3BVyX3JEhafMV7--XDgriOcfyU8DmLW_h_suko5gzSiuK6kOH6C5st-w8JorfkiCT-NENNiYsOmOOTmauCGWk=&uniplatform=NZKPT
Chen, M. (2022). Computer-aided feedback on the pronunciation of Mandarin Chinese tones: Using Praat to promote multimedia foreign language learning. Computer Assisted Language Learning, 37(3), 363-388 https://doi.org/10.1080/09588221.2022.2037652
Chen, M., & Wang, J. Q. (2008). Initial Development of Chinese Monosyllabic Tone Awareness of Non-native Chinese Speakers: a Case Study. Nankai Linguistics. 02:63-70+165. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk34prxhfAxBOaqQnU82RhfQ_RgRUdX5yTHkbk408oQ0ZTgmgQE-zvoTEdaUvejjg1XZ9fECcuKP4qAjMI3_yXxluEs0WsYttU5iyy0mCTKku9A-cuJink2QjgtYCcQVy3rjiceWhAvJNHNxRuwFQCl1cmk6jExn6gc=&uniplatform=NZKPT
Deng, D., &Lin, Y.Q. (2017). Application of perceptual training to Chinese tone teaching. Journal of Yunnan Normal University (Teaching& Studying Chinese as a Foreign Language Edition) 15 (03), 21-27. Doi:10.16802/j.cnki.ynsddw.2017.03.003. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk03_8NL0obQ1--s6agu5Q36e85AgdCL8Os5RKkpHaXDmVWwlQmV7hT8HPFg7GrTKlIOYQSZUifIH7XcP3gi5PeFLC7luJ8Lh0C8cfFDFwuCQajmSX8Fh7jjlEsRlyUr_AfPLGnXwqBQCw3DpcIKxVdEp5T2-6C3LEQ=&uniplatform=NZKPT
Ding, A. Q., Cheng, H. T., Yu Y., Ding, H, & Cheng, W. J. (2021). The present situation and problems of global Chinese learners’ oral performance: Based on big data analysis on HSKK. Language Teaching and Linguistic Studies (6), 13-23. https://kns.cnki.net/kcms2/article/abstract?v=A-1EuXenf_rmlqHUdsUe0-HUeFr7T3AKW3RgC0W72Ym4EZCBmVk5AmEh__B3rfH3117R6zBUT5OvNdl-dlIcGFtUyBcT7BXI-2T99UlREsEKrIVKUILQY6UAHCulPRH33uviB_5us-dZ_2-bpygW5s7cjiAsd7V18a1_q9XbnFI=&uniplatform=NZKPT
Flege, J. E. (1999). The relation between L2 production and perception. In Proceedings of ICPhs99 (San Francisco). https://www.internationalphoneticassociation.org/icphs-proceedings/ICPhS1999/papers/p14_1273.pdf
Gao, Y. J., &Li, B. G. (2018). An experimental study on the errors made by Russian students in learning Chinese tones. Journal of Yunnan Normal University (Teaching &Studying Chinese as a Foreign Language Edition) 16 (05), 18-26. Doi:10.16802/j.cnki.ynsddw.2018.05.003. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk0QKMJCjSjVarJh4XV8ZiOxOa8GVeyJsYWR_MHHZzI0WJ2LAeV26F5FaiohAXLPEVK9SRvA2Srzssk4ecKujviJv1DIqre03finCINCwifKRpSMS7DK-MwIgL3qQ1WJ2yr6UhJeHI8_58MZILZ1VBusR7Y33H7boWw=&uniplatform=NZKPT
Gui, M. C. (2000). The interference of English intonation to Mandarin tones perception: the linguistic analysis and the empirical solutions. Chinese Teaching in the World (01), 89–93. Doi:10.13724/j.cnki.ctiw.2000.01.013. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk2AsgMfWa6LKUGjN-AdCbwp4BVDrFR5MTku2d203o5GIwY_RoMGC2vO6mc3gGp6uJCQO3ALwSHSnoNS-UOifxD6MMgEa6d7fLvtttDAiJizw5lFDX8FJzYAII84E07sL1GvP3xTW3XAJFzXIjrEiMbx&uniplatform=NZKPT
Guo, L., & Xuan, W. H. (2013). American students’ acquisition of Chinese tones at the beginning stage. Journal of Yunnan Normal University (Teaching & Studying Chinese as a Foreign Language Edition 11(02), 25–31. Doi: 10.16802/ j.cnki. ynsddw. 2013.02.005. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk0JoOhBniGX8b4OWJG5QyfXeHI0nUVZllU9OVl2JzcdydknvZ6n_Aic-Kwk1kM76B5RsHsO_7YaNTCZPzEI2FwjP0NLgx9ODUH731sgZiBgEZFY-PbMfzUtwSHC8bFN4gggbZuwZfrD9xvGjxgdyuhV&uniplatform=NZKPT
Huang, B. (2022). The investigation about tonal learning of multinational learners. Journal of Social Science of Harbin Normal University, 12 (2), 96–100. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk0-0GFxD20BScBlv30GMRAt6GjWic-_LYwWAhBnlBQl5pEC9hzt4EJhJq9hikYEsZB4u0uL52-OryLJ01kzzRi2rsaFZ_hBLURRbhr7r01uIsmDta5q0v1lg7jre08IEX_kUV27xRf0lyPlv7r4pjcaN2F-BaYKbAw=&uniplatform=NZKPT
Huang, B., & Liao, X. (2021). The Modern Chinese (7th ed.). Higher Education Press. https://www.scribd.com/document/657889462/%E7%8E%B0%E4%BB%A3%E6%B1%89%E8%AF%AD%E7%AC%AC%E5%85%AD%E7%89%88%E4%B8%8A%E5%86%8C
Hsiao, H.-S., Chang, C.-S., Lin, C.-Y., & Hsu, H.-L. (2015). “iRobiQ”: The influence of bidirectional interaction on kindergarteners’ reading motivation, literacy, and behavior. Interactive Learning Environments, 23(3), 269–292. https://doi.org/10.1080/10494820.2012.745435
Immonen, K., Alku, P. & Peltola, M. S. (2022). Phonetic listen-and-repeat training alters 6–7-year-old children’s non-native vowel contrast production after one training session. Journal of Second Language Pronunciation, 8(1): 95–115. https://doi. org/10.1075/jslp.21005.imm
Jiang, D. M., Goh, C. S., & Airil Haimi, M. A. (2026). Interactive Knowledge Base for
Designing an AI-based Chinese Computer-assisted Pronunciation Training: A
Systematic Review, WSEAS Transactions on Computer Research,14. 223-236.
https://doi. org/10.37394/232018.2026.14.20.
Kang, O., Thomson, R. I. & Moran, M. (2018). Empirical approaches to measuring the intelligibility of different varieties of English in predicting listener comprehension. Language Learning, 68(1), 115–146. https://doi.org/10.1111/lang.12270
Lee, B., Plonsky, L. & Saito, K. (2020). The effects of perception- vs. production-based pronunciation instruction. System, 88, 102185. https://doi.org/10.1016/j.system.2019.102185
Lee, D., Kim, Hh. & Sung, SH (2023). Development research on an AI English learning support system to facilitate learner-generated-context-based learning. Educational Technology Research and Development, 71, 629–666. https://doi.org/10.1007/s11423-022-10172-2
Leeuwestein, H., Barking, M., Sodacı, H., Oudgenoeg-Paz, O., Verhagen, J., Vogt, P., ... Leseman, P. (2021). Teaching Turkish-Dutch kinder gartners Dutch vocabulary with a social robot: Does the robot’s use of Turkish translations benefit children’s Dutch vocabulary learning? Journal of Computer Assisted Learning, 37(3), 603–620. https://doi.org/10.1111/jcal.12510
Levis, J. (2007). Computer technology in teaching and researching pronunciation. Annual Review of Applied Linguistics 27, 184–202. https://doi.org/10.1017/S0267190508070098
Li, Y. H. (1995). Pronunciation error committed by Thai Chinese learners. Chinese Teaching in the World, (02), 66-71. DOI:10.13724/j.cnki.ctiw.1995.02.013. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk2jMaHnq0ENj9_B4jpokT3rsvx8uHSb7CZ9mGHa4ifnlmhQYmZkMtM_y_G4-4o0EHB6rJZpnVeofEWZ2SdF6ehBVXcKpkIHxGXSZYB-yOv0eukeoiQEfQbh_owrMrJmD0KD_9dp9EGOvO14qLL6g5yv&uniplatform=NZKPT
Li, Z. Q., &, Lin, M. C. (2018). Phonetic Issues in CFL Tone and Intonation Teaching. Journal of International Chinese Teaching, 3,26-36. https://xueshu.baidu.com/ndscholar/browse/detail?paperid=124g02h05s650ga0x00y0g102k780546
Lin, J. P. (2016). Positioning and key points on teaching Chinese phonetics as an international language. International Journal of Chinese Studies, 7(02), pp.1–7..https://doi.org/10.14095/b.cnki.jics.2016.02.001. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk2LXyHshOx8j-59J8IjCZskX4_kWPXc1HF3E9Th9uRY3ZOUjo9iyDXf4lIsSWpGDBy7DAmZr2tser7AYn6-LppgZOvn9sZFg-LkPyS8N_-zHoMkayXdHngyQMVAnqUql5S--Imyax81dBjI1ai_JI_bxnkCS4ul0GI=&uniplatform=NZKPT
Liu, Y. L., Ma J. F., &Zhao, G. CH. (2021). Chinese Proficiency Grading Standards for International Chinese Language Education: Application and Interpretation, 2021, pp3-12. Beijing Language and Culture University Press. https://www.blcup.com/EnPInfo/index/10867#001
Luckin, R., & Cukurova, M. (2019). Designing educational technologies in the age of AI: A learning sciences-driven approach. British Journal of Educational Technology, 50(6), 2824–2838. https://doi.org/ 10.1111/bjet.12861
Munro, M. J. & Derwing, T. M. (1995). Foreign accent, comprehensibility, and intelligibility in the speech of second language learners. Language Learning, 45(1): 73–97. https://doi.org/10.1111/j.1467-1770.1995.tb00963.x
Nagle, C. L. (2018b). Motivation, comprehensibility, and accentedness in L2 Spanish: Investigating motivation as a time-varying predictor of pronunciation development. The Modern Language Journal, 102(1), 199–217. https://doi.org/10.1111/modl.12461
Nagle, C. L. (2021). Revisiting perception–production relationships: Exploring a new approach to investigate perception as a time-varying predictor. Language Learning, 71(1), 243–279. https://doi.org/10.1111/lang.12431
Neri, A., Cucchiarini, C., Strik, H. (2002). Feedback in computer assisted pronunciation training: technology push or demand pull? Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002), 1209-1212. Doi: 10.21437/ICSLP.2002-246
O’Brien, M. G., Derwing, T. M., Cucchiarini, C., Hardison, D. M., Mixdorff, H., Thomson, R. I., Strik, H., Levis, J. M., Munro, M. J., Foote, J. A. & Levis, G. M. (2018). Directions for the future of technology in pronunciation research and teaching. Journal of Second Language Pronunciation, 4(2): 182–207. https://doi.org/10.1075/jslp.17001.obr
Pikhart, M. (2020). Intelligent information processing for language education: The use of artificial intelligence in language learning apps. Procedia Computer Science, 176, 1412–1419. https://doi.org/10.1016/j.procs.2020.09.151
Ran, Q.B. & Yu (2019). Characteristics and patterns of Chinese speech errors: analysis and exploration into the error item database from 25 L1 backgrounds. Chinese Teaching in the World, 33 (03), 417–432. https://doi.org/10.13724/j.cnki.ctiw.2019.03.012.https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk297Bo36bPErFK8blRrh-tCrE2B7uHXhq9T0pTfZkp3tmOYwF6v8UCvkWRAGoZxx6noUYwvjptK0amhH0LiKiWYdXZarOuT13U9pBvD1dAy8DgB0XhTN324DEzfEuGt97ht3vlpGS5-8jiHSoyH0DDOo1DzZ5JL-0c=&uniplatform=NZKPT
Richey, R. C., & Klein, J. D. (2007). Design and Development Research: Methods, Strategies, and Issues (1st ed.). Routledge. https://doi.org/10.4324/9780203826034
Saito, K., Trofimovich, P. & Isaacs, T. (2016). Second language speech production: Investigating linguistic correlates of comprehensibility and accentedness for learners at different ability levels. Applied Psycholinguistics, 37(2): 217–240. https:// doi.org/10.1017/S0142716414000502
Saito, K. & Plonsky, L. (2019) Effects of second language pronunciation teaching revisited: A proposed measurement framework and meta-analysis. Language Learning, 69(3): 652–708. https://doi.org/10.1111/LANG.12345
Shen, X. N. (1989). American learn Chinese language tone. Chinese Teaching in the World (03), 158-168. https://doi.org/10.13724/j.cnki.ctiw.1989.03.006. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk2_4UqBrcEq_J3PNAwTQSBZlWuV-zT8gKA5UGRCZs6QCuWF2I-C41aMk8Hkn_V5XxsWoRmYeMwmi8hsDX5N-NiBmrVsxXc-IQm53eGBbqKnj3auvTkM_5JmmLr2D9N6_OusgzB2kWiDtsHjC-urcxOo&uniplatform=NZKPT
Song, Y. D. (2009). Strategies on teaching tone in Chinese as a foreign language. Language Teaching and Linguistic Studies (03), 48-53. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk1seTLsXAtOudk1165yDRbEeIBV7iANlw_t6A7MvYk4M5TimtSzjgdv-W8gH7FJiQsESMPejB3z6sjjpyxyQD0FFqu1Die4F6bOi4_G-mhavoi_D4kXBEbvi_iYXtbTYUfI4sASuPsg8Iy4RSdlPfyn&uniplatform=NZKPT
Wang, F., Cheung, A. C. K., & Chai, C. S. (2024). Language learning development in human-AI interaction: A thematic review of the research landscape. System, 125(000), 24. https://doi.org/10.1016/j.system.2024.103424.
Wang, Y. (2016). Study on the strategy of teaching Chinese tone to intermediate and advanced-level learners from Korea, American and Japanese. Language Planning 18, 21-22. Doi:10.16412/j.cnki.1001-8476.2016.18.012. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk1GeANakpa8svhrw4TlvCAZZilC96fj5y86QHMmZn26xECi9cE2W47F5KEgWHIxDQCs5l3H1SaSAb-avTYY0VoXpB1ov4hXxejJrRulpr1NsTetNv4l9o2uR1TnWqPaa3Mnt9r2iYTVbYqOVo0SzXZCI9LPTOalSRM=&uniplatform=NZKPT
Wu, M. J., & Hu, M.G. (2004). Explore the reason of tone errors committed by Vietnamese. Chinese Teaching in the World 02, 81-87+4. DOI:10.13724/j.cnki.ctiw.2004.02.009. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk1UFTLVytYpFGmg82qpvTZn-Da-ejrrLsCM2rUcqYjOmVyrjjmIxHXuQ5cTF3w-Sk3z1_C4mZPqEG20_SKUTLNh3wAfeUxhNeuQaeS18ZWgc5nZyxf0AC4nCw_Px45SBgdqzROlwf0SuzHkQFOzCwlM&uniplatform=NZKPT
Xu, Y. Y. (2009). On difficulties for Thai students to Learn Chinese pinyin and teaching methods. Journal of Guangxi Teachers Education University (Social Science Edition) 30 (3), 139-142. DOI:10.16601/j.cnki.issn1002-5227.2009.03.018. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk2NfXfyFHojaFlZ2PHbv7oMQmq7Qs2JgYjPe2-HLCSo4878mqjVHRkdCpI_ADOtZ7HN7aX81-pkO9Ar0oJIFiZ_L5elLXpqnuME9pC_xbbrnVThhhb4jLsshceed8SXY5W7PbhuR9KhMSZXK1nA_S92&uniplatform=NZKPT
Yang, B. (2021), Audio-visual training for tone production: considering form and meaning. Language Teaching and Linguistic Studies 3, 31-44. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk0f8jQ5zZC6yPUe0yXP79dZ6oW-DAEvkB8M1Rxr0rPuiA7xtL1dnSxtzE9V9Y6d5SgY1SsHIXsT-muKMCsiQmrlhDYRJBb0FesVAjODsitxXW82Cge66iefqHFIxVaea2rPmRQPH7uSmtw5cHL76lcox-keiij4x9E=&uniplatform=NZKPT
Yi, B. (2021). Study on perception of Chinese tone by the Chinese learner from Malaysia. Nankai Linguistics 2, 52-60. https://kns.cnki.net/kcms2/article/abstract?v=OLEU9YGVhk2HubR81a9ziUXHTn-eKGxdukiyY_63ZbxQ51ekigLIAYTf8R3-tWoToKAp7Yx8DzAai2N3NkbIYPJOqmtEoZwcWkpFscBz4IDFKVr4I5tnFe_OgZxeFjwutLM_sBZWo0l0J09M7XuzmOYY3TtRkarJB_ikd0pV7-0=&uniplatform=NZKPT
Zawacki-Richter, O., Marín, V. I., Bond, M., & Gouverneur, F. (2019). Systematic review of research on artificial intelligence applications in higher education–where are the educators? International Journal of Educational Technology in Higher Education, 16(1), 1–27. https://doi.org/10.1186/s41239-019-0171-0
Zhai, X., Chu, X., Chai, C. S., Jong, M. S. Y., & Li, Y. (2021). A review of artificial intelligence (AI) in education from 2010 to 2020. Complexity, 6, 1–18. https://doi.org/10.1155/2021/8812542















