Now that we know how neural networks function, let us return to the practical use of neural machine translation.
Prior to the deployment of neural machine translation, the organization needs to decide whether to use a generic system or a customizable system. Generic translation systems are those that usually come to our mind first: Google Translate, DeepL, Microsoft Translator, Amazon Translate, etc. These systems are trained with huge amounts of data (aligned, parallel corpora) from various subject areas (domains). The resulting translations are fluent and usually good, but they are of limited use for translating highly specialized (technical) texts.
In contrast, customizable translation systems are trained with customer-specific data. Therefore, they can consider the individual corporate language and terminology and thus deliver more accurate translations. The resulting raw translations are better and require less post-editing. The amount of training data required for a customized engine depends on the provider. Usually, the customized engine is first trained with generic and domain-specific texts and then "enriched" with internal texts. In any case, due maintenance of the translation memories and terminology databases is a key precondition for the customization.
Leading providers of customizable systems include SYSTRAN, Textshuttle, SmartMATE, KantanMT, and Omniscien. DeepL also belongs to this category to a certain extent, as the online interface of the premium version allows the creation of custom glossaries. If this function is activated, the engine is forced to use the stored terminology. A correctly created glossary definitely contributes to better results.
Unfortunately, the Starter package only allows the creation of one glossary. Depending on the text type, the stored terminology may not necessarily be suitable. To create several glossaries, you need to order the Advanced package. What is more, the Advanced package enables the connection of DeepL to your translation management system via API. However, the glossary function is only available in the browser version.
Further information on generic and customizable translation systems, the selection of a suitable machine translation provider, MT quality, training data for MT systems, costs of MT, and the collaboration with translators is available in our detailed article "Machine translation for companies".
Advantages of generic translation systems
- Relatively inexpensive
- Quick implementation
- Good translations of "normal" texts
Disadvantages of generic translation systems
- Poor translation quality of specialized texts
- More post-editing required
Advantages of customized translation systems
- Good translation of specialized texts
- Less post-editing required
Disadvantages of customized translation systems
- More expensive implementation
- Longer lead time
- Not enough data available for the customization