Date, Time, and Number Formats

The date, time, and number formats can be defined in the Formats section of the language settings.

Across always analyzes the numbers contained in the source text and highlights them with a blue overline in the Context/Source View.

The detection and highlighting of numbers is based on the rules defined in the Formats section of language settings.

If a number such as a date in the source text does not correspond to the language settings, the date will not be identified as such. Therefore, the individual components of the date will be interpreted and displayed as separate numbers.

The following example shows an English source text containing a date:

Valid date specification
Invalid date specification
cDesk_datum-gueltig
cDesk_datum-ungueltig
The date matches the specification for English dates in the language settings. The date is identified as a valid date and marked as such by the overline.
The date dies not match the specification for English dates in the language settings. Thus, the individual components of the date are not interpreted as belonging together and are displayed separately.

The following format types can be defined:

Format type
Example 1
Example 2
Number
German 123.456
English 123,456
Long date format with four-digit year specification
German 31.12.2018
English 2018-12-31
Short date format with two-digit year specification
German 31.12.18
English 18/12/31
Alphanumeric date
German 31. Dezember 2018
English Dec. 31, 2018
Time
German 12:58
Italian 12.58

In addition to these language-specific settings (under Tools > System Settings > General > Language Settings > Language Sets > Standard language set > Languages > desired language > Formats), Across also takes into consideration language-independent settings defined in the standard language set. As a result of these language-independent rules, Across detects both the German and the English number format, for instance 10,50 and 10.50, in the same German sentence. Should you wish to cancel this behavior, you can delete the corresponding rules in the standard language set.

In the case of a language which does not have its own language set in the language settings, the settings in Tools > System Settings > General > Language settings > Language Sets > Standard Language Sets > Formats will automatically apply.

For each format type (e.g. for alphanumeric dates), it is possible to define several formats. In this case, you can use the option "Prefer this format for pre-translation" to ensure use of the preferred format for automatic adjustments, e.g. during pre-translation and when transferring a format from the source text to the target text in crossDesk (e.g. using the keyboard shortcut Ctrl+Alt+0). In the format area of a language, the preferred format is marked with an *.

Alphanumeric dates

Apart from short dates (with two-digit years, e.g. German 31.12.18) and long dates (with four-digit years, e.g. German 31.12.2018), Across also identifies alphanumeric dates, i.e. dates containing the names of the months, e.g. "December 31, 2018" or "April 25, 2018" or abbreviated names of months, e.g. "Dec 31, 2018".

Across supports alphanumeric dates without a year, e.g. "1 January" or "31 Dec".

The alphanumeric date must first be set up for the desired languages.

Alphanumeric Dates and Sentence Detection

After setting up alphanumeric dates, the sentence detection will work correctly in segments containing this type of dates. Demonstrated below on the basis of the date "01. Juni 2014" in a German source document:

No detection of the date:
Correct detection of the date:
cDesk_source-view_datum-nicht-erkannt
cDesk_source-view_datum-erkannt
As the date "01. Juni 2014" is not identified correctly, Across wrongly detects two separate sentences.
As the date "01. Juni 2014" is identified correctly, Across also detects the sentence correctly.

User-Defined Month Names

For most languages, Across already contains full and abbreviated month names. The management of the month names takes place in the Months subsection of a language in the language settings.

sysset_allg_spracheinstellungen_sprachfamilien_monate

Click Edit to access the management of the month names. In this dialog, you can edit the existing month names and add new ones if necessary.

sysset_allg_spracheinstellungen_sprachfamilien_monate_bearbeiten

In case there are several variants for a month, these must be separated by commas. The first month name is the one that will be used for pre-translations or when transferring dates from the source text to the target text (e.g. using the keyboard shortcut Ctrl+Alt+0).

Handling of symbols

Symbols entered under Treat following symbols as characters will be interpreted as word parts during a search in crossTank and crossTerm. This option has no effect on the gross analysis result, e.g. with regard to the word count in the Across reports.

Attention

Please note that you should normally not modify this option or only after consulting the Across support desk.

Example

For example, the copyright symbol © may be included in the list of symbols to be interpreted as characters. In addition, it is assumed that, in crossTerm, the term Across© may exist. In this case, if a sentence of a source text contains Across© , Across will notice that a corresponding term exists. This term will be displayed in the crossTerm window of the crossDesk. If the list does not contain the copyright symbol, the terminology detection will not work in this case, and the term will not be displayed in the crossTerm window.