SEMI-COMPLEX SENTENCES WITH ING-FORMS IN MEDICAL DISCOURSE: CHALLENGES FOR NEURAL MACHINE TEANSLATION

The study is related to semi-complex sentences with ing-forms that are typical for the English language. Being an analytic language with very little inflection and strict word order, English has a wide range of constructions requiring mental operations for understanding their meanings. Nowadays with the increasing amount of texts being processed and translated using artificial intelligence, it is of particular interest to analyse how these sophisticated constructions function in the text and in which ways does the machine “understand” them.

The main objective of this study is to reveal the features of semi-complex sentences in particular (medical) discourse and identify some patterns (if there are any) of neural machine translation of these sentences from English into Russian.

Two types of methods are used in the study in order to achieve the goal: computational (Coh-Metrix tool, Neural Machine Translation) and manual (Analysis of Translation Errors). Coh-Metrix is a tool focused on linguistic features closely associated with deeper level of comprehension. It provides the text analysis on multiple characteristics and language-discourse levels. Neural Machine Translation (NMT) is one of the machine translation approaches which uses the large artificial neural network and deep learning models.

Five medical research articles on topics related to different spheres such as Lifestyle Medicine, Vaccination, Intensive Care, Medical Ethics and Speech Therapy were analysed, processed with Coh-Metrix tool, translated by PROMT neural machine translation system with medical profile and evaluated.

The study found that 1) semi-complex sentences with ing-forms (gerunds, participles and verbal nouns) are frequently used in medical research articles, contributing in diversity (but not always complication) of grammatical structures; 2) gerunds and verbal nouns are more typical for medical discourse due to prevalence of noun phrases over verb phrases; 3) neural machine translation system demonstrated certain patterns in translating ing-forms from English into Russian in some cases potentially causing mistakes; 4) a number of frequent word combinations have their own patterns in NMT translation presumably due to their existence in NMT translation memory 4) there is a necessity to improve the overall quality of the neural machine translation in terms of translating semi-complex sentences with ing-forms.

Keywords: semi-complex sentences, medical discourse, neural machine translation, non-finite verbs

Natalya S. Pak

Northern (Arctic) Federal University named after M.V. Lomonosov (NArFU) Arkhangelsk, Russia

e-mail: n.pak@narfu.ru

Alhaidary, A. 2019. Treatment of Speech Sound Disorders in children: Nonspeech Oral Exercises. International Journal of Pediatrics and Adolescent Medicine. URL: https://doi.org/10.1016/j.ijpam.2019.07.008. [Accessed October 16, 2020].

Bentivogli, L., Bisazza, A., Cettolo, M., Federico, M. Neural versus PhraseBased Machine Translation Quality: a Case Study. URL: https://www.research gate.net/publication/306226354_Neural_versus_Phrase-Based_Machine_Translation_ Quality_a_Case_Study. [Accessed October 10, 2020].

Brown, A., Ballal, A., Al-Haddad, M. 2019. Recognition of the critically ill patient and escalation of therapy. Anaesthesia & Intensive Care Medicine 20(1): 1–5.

CohMetrics.com. 2020. URL: http://cohmetrix.com/ [Accessed October 14, 2020].

DeepAI.org. Neural Machine Translation. 2020. URL: https://deepai.org/ machine-learning-glossary-and-terms/neural-machine-translation [Accessed October 28, 2020].

Hordern, J. 2016. Religion and culture. Medicine (Abingdon) 44(10): 589–592.

Kaushanskaya, V.L. 2008. A grammar of the English language, 5 th ed. Iris Press, Moscow, 384 p.

Langley, J.M. 2015. Adolescent immunization — Protecting youth and preparing them for a healthy future. International Journal of Pediatrics and Adolescent Medicine 2(1): 1–6.

Rippe, J.M. 2019. Are We Ready to Practice Lifestyle Medicine? The American Journal of Medicine 132(1): 6–8.

Smeth, H., Heyvaert, L. The meaning of the English present participle. English Language and Linguistics 15(03): 473–498.

Tannen, D., Hamilton, H.E., Schiffrin, D. (ed.) 2015. The Handbook of Discourse Analysis, 2 nd ed. JohnWiley & Sons, Inc., UK, 952 p.

Wilce, J.M. 2009. Medical Discourse. Annual Review of Anthropology 38: 199–215.