System Settings: Terminology > Stopwords

In Stopwords you can add stopwords for each of the languages created in Across.

Stopwords are words that are filtered out during term extraction and are not offered as term candidates. Typically, stopwords are, for example, articles, expletives or conjunctions. The larger the list of stopwords, the more precise the results of the term extraction will be.

Apart from the possibility to define stopwords within the scope of term extraction tasks, you can enter stopwords manually in the corresponding section of the system settings by selecting the stopword list of the desired language in the drop-down list, typing in the word in the lower input box, and clicking Add.

You can also import stopword lists in XML format. To do this, click Import and define whether the imported file should replace existing lists or be appended to them. Select the file with stopword lists to be imported and click Open to import it.

Please observe the following:

  • If you determine that the existing stopword lists are to be replaced (and not complemented), all existing stopword lists will be deleted and replaced by the stopword lists to be imported. An existing stopword list in a particular language will be deleted even if no stopword list is defined for this language in the file to be imported.
  • The XML file to be imported should have the following structure:

    <EverydayWords>

    <Language Name="LCID">

    <Word>Word 1</Word>

    <Word>Word 2</Word>

    ...

    </Language>

    </EverydayWords>

    LCID should be replaced with the LCID of the respective language. (LCID stands for Locale Identifier and is assigned to each language. For example, German (Germany) has the LCID 1031 and English (USA) the LCID 1033.)

You can also export the stopword lists of all languages in XML format. Click Export all, assign a name for the list to be exported, and click Save.