Differences in word count
Posted by Péter Botta on 18 January 2013 12:20 PM
The word count that memoQ reports can differ from that of SDL Trados or MS Word.
This comes down to differences in how text is tagged and in whether whitespace
and numbers are included or excluded. The tag weight you specify may also
influence your analysis result.
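As a rough illustration of why such settings move the total, here is a minimal Python sketch. It is not memoQ's (or any other tool's) actual counting algorithm; the function count_words and its parameters count_numbers, tag_count and tag_weight are hypothetical names chosen for this example only.

```python
import re

# Hypothetical rule: a token made only of digits, dots and commas is a "number".
NUMBER = re.compile(r"[\d.,]+")

def count_words(text, count_numbers=True, tag_count=0, tag_weight=0.0):
    """Toy word count: whitespace-separated tokens plus weighted tags."""
    tokens = text.split()
    if not count_numbers:
        # Drop tokens that consist of digits and separators only.
        tokens = [t for t in tokens if not NUMBER.fullmatch(t)]
    # Each tag adds tag_weight "words" to the total (assumed model).
    return len(tokens) + tag_count * tag_weight

sample = "Order 25 units before 2013"
with_numbers = count_words(sample)
without_numbers = count_words(sample, count_numbers=False)
with_tags = count_words(sample, tag_count=2, tag_weight=0.5)
print(with_numbers, without_numbers, with_tags)
```

The same five-token sentence yields three different totals depending on whether numbers count and how much weight each tag carries, which is the kind of discrepancy you may see between tools.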
One scenario involves cascading filters. Suppose you have an Excel file whose cells contain HTML code. memoQ lets you use a cascading filter to exclude this HTML code on import: you choose the Excel filter first and then the HTML filter. memoQ then filters out the HTML code, so the code is also excluded from your analysis. Other tools might not be able to exclude such tags and may count them as characters in the analysis.
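The effect of the second filter stage can be sketched in Python with the standard html.parser module. This only mimics the idea of stripping markup from a cell value before counting; it is not how memoQ implements its filters, and the names TextOnly, strip_html and the sample cell are assumptions made for the example.

```python
from html.parser import HTMLParser

class TextOnly(HTMLParser):
    """Collects only the text between tags, discarding the markup."""
    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)

def strip_html(cell):
    """Second stage of the hypothetical cascade: remove HTML from a cell."""
    parser = TextOnly()
    parser.feed(cell)
    parser.close()
    # Join the text pieces and normalise runs of whitespace.
    return " ".join(" ".join(parser.parts).split())

# Pretend this string was read from an Excel cell by the first stage.
cell = '<p class="note">Click <b>Save</b> to continue.</p>'
filtered = strip_html(cell)

raw_chars = len(cell)           # what a tool counts if it keeps the markup
filtered_chars = len(filtered)  # what is counted once the markup is excluded
print(filtered, raw_chars, filtered_chars)
```

For this cell the raw string is more than twice as long as the translatable text, so a tool that cannot exclude the tags would report a noticeably higher character count.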
The word count can also differ when you import the same document once as DOCX and once as RTF. memoQ uses a different import filter for each of these file formats, which can result in a different word count.
memoQ’s filter functionality lets you specify what to import from files for translation and what to leave out. Excluding XML attributes, excluding code with the Regex text filter or cascading filters, and excluding text that is not for translation all help you cut translation costs.
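To show how a regular expression can keep non-translatable code out of the count, here is a small Python sketch. The pattern and the placeholder syntax ({{...}} and %d/%s) are assumptions for illustration; memoQ's Regex text filter is configured in its own dialogs and is not reproduced here.

```python
import re

# Hypothetical pattern for code that should not be translated:
# double-brace template variables and printf-style %d / %s placeholders.
NON_TRANSLATABLE = re.compile(r"\{\{.*?\}\}|%[sd]")

def translatable_words(line):
    """Return the words left after code matches are removed from the line."""
    return NON_TRANSLATABLE.sub(" ", line).split()

line = "Welcome back {{user_name}} you have %d new messages"
raw_count = len(line.split())
filtered_count = len(translatable_words(line))
print(raw_count, filtered_count)
```

Here two of the eight tokens are placeholders, so excluding them reduces the count that would be billed for translation.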