External terminology (or glosarry) files can be imported into Memsource term base in Excel or TBX file formats.
To launch the Import dialogue window, first click on the name of your term base. Then, click the Import button in the Import/Export section of the page.
- Select your file (Excel or TMX)
- Select to add new terms (importing the same file twice will create duplicate terms)
- or update existing ones (see Update Terminology)
- Select strict match locale (If you don't want to add for example "en" terms to "en_US" termbase)
Note: The size limit for a file that can be uploaded is 1GB.
Import Excel File
If you do not have a ready TBX file, using an Excel file is the easiest way to import terms into a term base. A plain list of terms can be imported, but more complex terminology imports are also supported (see below: importing synonyms, morphology, terms with various attributes, etc.).
- Only terms from the first sheet in your Excel file are imported.
- The size limit for a file that can be uploaded is 1GB.
How to prepare your Excel file for import:
- All terms need to be organized into columns, where each column represents one language.
- In the first row, use the appropriate language code for each language (for example en, de, and it)
- When you're done, save the Excel file and click on the Import button and select "Create new terms"
Importing synonyms in Excle is possible. The synonym must be placed in a separate column using the same language code header.
For example using "contract" as a synonym for "agreement" will need two en columns: en, en, de, and it. Using three synonyms in English will need three en columns, using synonyms in English and German will result in en, en, de, de, it.
Importing Terms with Attributes
The XLS/XLSX format also makes it possible to import terms with specific attributes. Some of them are generadet by Memsource and are available only in files exported from Memsource TB.
The attributes are:
- CID - Memsource Concept ID (needed for reimporting updated terms). The Term concept includes the source and all its targets and synonyms
- TID - Memsource Term Id (needed for reimporting updated terms). The ID of the specific term in the specific language
- status - either New or Approved)
- forbidden - True or False)
- preferred - True or False)
- case - meaning "case sensitive." The case can be either True or False.
- exact - meaning "exact match." This can be either True or False.
- note - only target note will be displayed in the Editor
- usage - only target usage will be displayed in the Editor
- created_by - only Memsource usernames are supported
- created_at - date and time of term's creation
- modified_by - only Memsource usernames are supported
- modified_at - date and time of the last modification of the term
Creating a new terms with attributes:
- Add an new attribute columns after each term column (or synonym column)
- Put the name of the attribute in the header and the attribute values in the rows below.
- Save the Excel file and import it to TB.
Importing Terms with Challenging Morphology
The morphology of the terms for import follows the same rules as Term Morphology morphology for terms created directly in Memsource TB. Please see details in our Term Morphology article.
Appart from working with synonyms and Fuzzy / Exact matches, you can also add a pipe character | as a boundary between the word stem (the part that does not change) and the suffix (the part that does change). This is especially useful for words whose endings can change. For example, the term smíšen|ý in Czech can also come up as smíšeného, smíšenou, etc. Putting the "|" character before the ý ensures that all three endings will be considered matches.
TBX Import Format
Memsource also supports the TBX format for terminology imports (and exports). The TBX standard is a "loose" standard, which means that if a TBX file is imported from non-Memsource software, some of the metadata may not get imported. However, we have optimized the import from Multiterm TBX and the following metadata should get imported correctly into Memsource:
- Timestamps (created at, last modified at)
- Value in element <descrip type="usageNote"> to the attribute usageExample of all the terms of the concept
- Value in element <descrip type="note"> to the attribute note of all the terms of the concept
Note: If you need to import terminology between two Memsource term bases, use the TBX format because inside the Memsource environment, the data will be imported correctly.
SDL TBX.XML file
The SDL Trados uses a special TBX.XML format instead of the standard TBX format. Because of this, it has different specifications for import.
Articles related to Term Base: