Date, Time, and Number Formats
The date, time, and number formats can be defined in the Formats section of the language settings.
Across always analyzes the numbers contained in the source text and highlights them with a blue overline in the Context/Source View.
The detection and highlighting of numbers is based on the rules defined in the Formats section of language settings.
If a number such as a date in the source text does not correspond to the language settings, the date will not be identified as such. Therefore, the individual components of the date will be interpreted and displayed as separate numbers.
The following example shows an English source text containing a date:
Valid date specification
Invalid date specification
The date matches the specification for English dates in the language settings. The date is identified as a valid date and marked as such by the overline.
The date dies not match the specification for English dates in the language settings. Thus, the individual components of the date are not interpreted as belonging together and are displayed separately.
The following format types can be defined:
Long date format with four-digit year specification
Short date format with two-digit year specification
German 31. Dezember 2018
English Dec. 31, 2018
In addition to these language-specific settings (under Tools > System Settings > General > Language Settings > Language Sets > Standard language set > Languages > desired language > Formats), Across also takes into consideration language-independent settings defined in the standard language set. As a result of these language-independent rules, Across detects both the German and the English number format, for instance 10,50 and 10.50, in the same German sentence. Should you wish to cancel this behavior, you can delete the corresponding rules in the standard language set.
In the case of a language which does not have its own language set in the language settings, the settings in Tools > System Settings > General > Language settings > Language Sets > Standard Language Sets > Formats will automatically apply.
For each format type (e.g. for alphanumeric dates), it is possible to define several formats. In this case, you can use the option "Prefer this format for pre-translation" to ensure use of the preferred format for automatic adjustments, e.g. during pre-translation and when transferring a format from the source text to the target text in crossDesk (e.g. using the keyboard shortcut Ctrl+Alt+0). In the format area of a language, the preferred format is marked with an *.
Apart from short dates (with two-digit years, e.g. German 31.12.18) and long dates (with four-digit years, e.g. German 31.12.2018), Across also identifies alphanumeric dates, i.e. dates containing the names of the months, e.g. "December 31, 2018" or "April 25, 2018" or abbreviated names of months, e.g. "Dec 31, 2018".
Across supports alphanumeric dates without a year, e.g. "1 January" or "31 Dec".
The alphanumeric date must first be set up for the desired languages.
Alphanumeric Dates and Sentence Detection
After setting up alphanumeric dates, the sentence detection will work correctly in segments containing this type of dates. Demonstrated below on the basis of the date "01. Juni 2014" in a German source document:
User-Defined Month Names
For most languages, Across already contains full and abbreviated month names. The management of the month names takes place in the Months subsection of a language in the language settings.
Click Edit to access the management of the month names. In this dialog, you can edit the existing month names and add new ones if necessary.
In case there are several variants for a month, these must be separated by commas. The first month name is the one that will be used for pre-translations or when transferring dates from the source text to the target text (e.g. using the keyboard shortcut Ctrl+Alt+0).
Handling of symbols
Symbols entered under Treat following symbols as characters will be interpreted as word parts during a search in crossTank and crossTerm. This option has no effect on the gross analysis result, e.g. with regard to the word count in the Across reports.
Please note that you should normally not modify this option or only after consulting the Across support desk.
For example, the copyright symbol © may be included in the list of symbols to be interpreted as characters. In addition, it is assumed that, in crossTerm, the term Across© may exist. In this case, if a sentence of a source text contains Across© , Across will notice that a corresponding term exists. This term will be displayed in the crossTerm window of the crossDesk. If the list does not contain the copyright symbol, the terminology detection will not work in this case, and the term will not be displayed in the crossTerm window.