Delete duplicates i TM?
Autor de la hebra: Profdoc
Profdoc
Local time: 14:42
sueco al inglés
Aug 13, 2010

Is it recommendable to delete duplicates in a memoQ translation memory?

I translate technical documentation for a software company and at the moment I'm creating a new TM by aligning older original and translated documents. A lot of phrases and sentences are used repeatedly throughout the documents (which of course is why I use memoQ in the first place).

Anyway, the TM ends up having a lot of repeats in it. I'm not sure how many, but I wouldn't be surprised if 25-50% of the translation units are duplicates. Some of them are short, just one word, but there are a lot of them.

Is this a problem or does the TM work just fine anyway? If it is recommendable to remove the duplicates, what is the best method of doing it?

Thanks!


Direct link Reply with quote
 

Grzegorz Gryc  Identity Verified
Local time: 14:42
francés al polaco
+ ...
Not necessary Aug 13, 2010

Profdoc wrote:

Anyway, the TM ends up having a lot of repeats in it. I'm not sure how many, but I wouldn't be surprised if 25-50% of the translation units are duplicates. Some of them are short, just one word, but there are a lot of them.

Is this a problem or does the TM work just fine anyway?


It's not a problem, leave it as is.

By default, MQ stores this kind of duplicates in order to save the context information.
So, if one day, you receive an updated document, you'll not be forced to revise it thouroughly, MQ will remember the correct sequence of TUs and you'll need only to check the efective changes.

BTW, in a lot of target languages (as Polish), the 100% matches should be always checked and revised because they may be translated in a different way depending of the context.
E.g., basically, in English you have no grammatical gender, the Polish translation of "it" may be translated in 3 different ways, MQ permits to apply the correct one if an appropriate contextual information is present.

Cheers
GG


Direct link Reply with quote
 

Soonthon LUPKITARO(Ph.D.)  Identity Verified
Tailandia
Local time: 19:42
Miembro 2004
inglés al tailandés
+ ...
OCR converted document Aug 14, 2010

I recently translated the OCR converted (untidy) document where TM hits were very bad. Format change made 100% match impossible and those many TU in TM did tricks for me. This is known as fuzzy in CAT tool algorithm.

Best regards,

Soonthon Lupkitaro


Direct link Reply with quote
 

Péter Tófalvi  Identity Verified
Hungría
Local time: 14:42
inglés al húngaro
+ ...
How this feature actually works? Apr 21

I see that one can mark duplicates for merging or deletion, but then you must delete/merge the marked entries one by one, there is no one-step solution,

Direct link Reply with quote
 

hhgygy
Hungría
Local time: 14:42
inglés al húngaro
+ ...
Deleting duplicates Apr 27

Péter Tófalvi wrote:

I see that one can mark duplicates for merging or deletion, but then you must delete/merge the marked entries one by one, there is no one-step solution,


Actually you can mark them all first and then delete them all by clicking on merge/delete once more.


Direct link Reply with quote
 

Elif Baykara  Identity Verified
Turquía
Local time: 15:42
Miembro 2015
alemán al turco
+ ...
Olifant Apr 27

I could not figure out the solution to the very same problem in memoQ.

I use the Olifant software from Okapi. Olifant is straightforward and quick.

Elif


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Delete duplicates i TM?

Advanced search






Across v6.3
Translation Toolkit and Sales Potential under One Roof

Apart from features that enable you to translate more efficiently, the new Across Translator Edition v6.3 comprises your crossMarket membership. The new online network for Across users assists you in exploring new sales potential and generating revenue.

More info »
Déjà Vu X3
Try it, Love it

Find out why Déjà Vu is today the most flexible, customizable and user-friendly tool on the market. See the brand new features in action: *Completely redesigned user interface *Live Preview *Inline spell checking *Inline

More info »



All of ProZ.com
  • All of ProZ.com
  • Búsqueda de términos
  • Trabajos