Character Handling
By default, crossMining filters all special characters such as apostrophes, hyphens, and punctuation marks from the determined source and target-language equivalents. The character handling settings enable the definition of special characters not to be removed by crossMining.
Special characters can be defined globally for all languages or specifically for individual languages. To add characters for individual languages, select the desired language and click Add.
To add a special character not to be removed by crossMining, select a language or All languages in the left part of the dialog window. Insert the character in the input field in the right part of the dialog window and click Add.
To add special characters, you can insert the actual characters or the corresponding Unicode value – introduced by the character string U- – e.g. U-0027 for an apostrophe.
The characters contained in the All languages section apply to all languages and may be complemented with the language-specific characters.
Example
An English-German lexicon is to be created. The following characters are defined under Character handling:
- All languages: ' -
- English: !
- German: ;
In the English texts of the lexicon, the following characters are retained during the creation of the lexicon: ' - !
In the German texts of the lexicon, the following characters are retained: ' - ;