BIK Terminology

Solving the terminology puzzle, one posting at a time

Doublettes—such a pretty term, yet such a bad concept

June 10, 2011 by Barbara Inge Karsch

Sooner rather than later, terminologists need to think about database maintenance. Initially, with few entries in the database, data integrity is easy to ensure: the terminologist may well remember every entry they ever compiled; my Italian colleague, Licia, remembered just about any entry she ever opened in the database. But even the best human brains will eventually ‘run out of memory’, and blunders will happen. One such blunder is the so-called doublette.

According to ISO TR 26162, a doublette is a “terminological entry that describes the same concept as another entry.” Sometimes these entries are also referred to as duplicates or duplicate entries, but the technical term in standards is doublette. It is important to note that homonyms do not equal doublettes. In other words, two terms that are spelt the same way and that are in two separate entries may refer to the same concept and may therefore be doublettes. But they may also justifiably be listed in separate entries, because they denote slightly or completely different concepts.

As an example, I deliberately set up doublettes in i-Term, a terminology management system developed by DANTERM: The terms automated teller machine and electronic cash machine can be considered synonyms and should be listed in one terminological entry. In this setup, automated teller machine and its abbreviated form, ATM, have one definition and definition source, while electronic cash machine and its abbreviated form, cash machine, are listed in a separate entry with another, yet similar, definition and its own definition source. During database maintenance, these entries should be consolidated into one terminological entry that contains all the synonyms.
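The consolidation step can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions, not how i-Term stores or merges data: the `Entry` class, its fields and the `consolidate` function are hypothetical, and the definitions are placeholders rather than the real texts.

```python
from dataclasses import dataclass, field


@dataclass
class Entry:
    """Hypothetical, simplified terminological entry: one concept, many terms."""
    entry_id: int
    terms: list[str]
    # Each item is a (definition, definition source) pair.
    definitions: list[tuple[str, str]] = field(default_factory=list)


def consolidate(keep: Entry, drop: Entry) -> Entry:
    """Merge `drop` into `keep`, keeping every term and definition exactly once."""
    terms = list(keep.terms)
    for term in drop.terms:
        if term not in terms:
            terms.append(term)
    definitions = list(keep.definitions)
    for definition in drop.definitions:
        if definition not in definitions:
            definitions.append(definition)
    return Entry(keep.entry_id, terms, definitions)


# Placeholder definitions; the real ones live in the termbase.
atm = Entry(1, ["automated teller machine", "ATM"], [("<definition A>", "<source A>")])
cash = Entry(2, ["electronic cash machine", "cash machine"], [("<definition B>", "<source B>")])

merged = consolidate(atm, cash)
print(merged.terms)
```

The sketch keeps both definitions so that a terminologist can still decide which one survives; choosing the better definition remains a human task.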

It is much easier to detect homographs that turn out to be doublettes. Rather, it should be easier to avoid them in the first place (after all, every new entry in a database starts with a search for the term denoting the concept; if it already exists with the same spelling, the search returns a hit). An example of ‘homograph doublettes’ comes from the Microsoft Language Portal. While we can’t see the ID, the definition shows pretty clearly that the two entries are describing the same concept.
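The search described above can also be run over a whole database to list homograph candidates. The following Python sketch assumes a hypothetical export of the termbase as a mapping from entry ID to its terms; `normalize` and `homograph_candidates` are illustrative names, and entry 3 is invented for the example. Since homonyms do not equal doublettes, the result is only a list of candidates that a terminologist still has to review.

```python
from collections import defaultdict


def normalize(term: str) -> str:
    """Fold case and collapse whitespace so trivially different spellings match."""
    return " ".join(term.casefold().split())


def homograph_candidates(entries: dict[int, list[str]]) -> dict[str, list[int]]:
    """Return each normalized term that occurs in more than one entry, with the entry IDs."""
    index = defaultdict(set)
    for entry_id, terms in entries.items():
        for term in terms:
            index[normalize(term)].add(entry_id)
    return {term: sorted(ids) for term, ids in index.items() if len(ids) > 1}


# Hypothetical export: entry ID -> terms in that entry.
entries = {
    1: ["automated teller machine", "ATM"],
    2: ["electronic cash machine", "cash machine"],
    3: ["Cash  Machine"],  # invented entry that differs only in case and spacing
}

print(homograph_candidates(entries))  # {'cash machine': [2, 3]}
```

A check like this only catches entries that share a spelling. Synonym doublettes, such as automated teller machine and electronic cash machine, share no term and would not be flagged.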

Doublettes happen, particularly in settings where more than one terminologist adds and approves entries in a database. But even if one terminologist approves all new concepts, s/he cannot guarantee that a database remains free of doublettes. The right combination of skills, processes and tool support can help limit the number, though.

Filed Under: iTerm, Maintaining a database, Microsoft Language Portal, Process, Setting up entries Tagged With: doublette, ISO 26162

Comments

  1. Sue says

    June 15, 2011 at 12:15 pm

    I’ve been working hard to consolidate doublettes in our termbank. My biggest problem is homographs that have software-specific definitions that, in the eye of the term contributor, constitute ‘different concepts.’ Often, the definition also contains usage details that are considered ‘essential information’ (but not in a lexicographical sense, if I can say that). Sometimes, the two definitions are SO different, and approach the concept in such different ways, that the terminologist can’t even tell that they are the same concept without a lot of research.

    For example, we had a half dozen different entries for ‘path’–all of which meant some sort of directory path to a file or object. It just had different characteristics in this or that product. Sometimes when I have this problem I ask the writer if there is a phrase that we can use as the ‘long form’ and that will serve to differentiate the two concepts. But often that’s artificial, when said ‘long form’ is rarely or never used.
    Even if I am successful in boiling down all the product-specific definitions into a generic definition that works for all of them, the complaint then is that the definition is SO generic as to be useless. “Everyone knows what a path is.” Exactly.

    It’s a big problem.
    Add to that the possibility that nuanced differences at some point could arguably lead to “different concepts”… it’s ultimately subjective, right? I can sometimes argue that if all those instances of ‘path’ are translated in language X or Y with a single word, then they must be considered the same concept. But some terms in English will have multiple translations in other languages, depending on the domain. And vice versa, of course. Makes terminological work tricky even if, as in my case for the moment, I work only on English language terminology.

  2. Barbara Inge Karsch says

    June 16, 2011 at 1:23 am

    Sue, thanks for your comment. How about “if everyone knows what a path is, why do we have to define it six different ways?” :-) There is obviously a need for discussion about the definition or the terms when you have that many entries. And it is not an easy job for exactly the reasons that you state.
    I am working on collecting methods for dealing with doublettes or avoiding them in the first place. Not sure when I will have it ready.
    Best,
    Barbara

  3. Sue says

    June 16, 2011 at 2:12 pm

    I look forward to that, Barbara!

Copyright © 2023 BIK Terminology. All Rights Reserved.