Does OmegaT create search indexes for TM?
Thread poster: Руслан И.
Руслан И.
Russian Federation
Local time: 00:55
Turkish > Russian
Nov 29, 2018

When I import a new TM into OmegaT in the form of large TMX files, OmegaT seems to lag for some time.
My question is: does OmegaT create search indexes?

Second question: is it better to split a large TMX file into smaller ones?

My main issue is poor OmegaT performance with large TM files.


 
Didier Briel
France
Local time: 21:55
English > French
No search index Nov 29, 2018

Руслан И. wrote:
When I import new TM into OmegaT

OmegaT doesn't import them, it just loads them (i.e., it doesn't transform them into anything else).

in the form of large TMX files, OmegaT seems to lag for some time.
My question is: does OmegaT create search indexes?

No.
The lag time you are seeing is the time needed to load the TMX, or an issue with memory.

Second question, is it better to split large tmx into smaller ones?

It shouldn't really make a difference.

My main issue is low OmegaT performance with large tm files.

Are you sure you have allocated enough memory to OmegaT?

You can check the memory available in Help > About.

If not, you are likely to experience performance issues.

To increase memory, see:
https://omegat.sourceforge.io/manual-standard/en/chapter.installing.and.running.html#launch.command.arguments
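
As a rough cross-check of the memory advice above, the following minimal Java sketch prints the heap limits of the JVM it runs in. It is not OmegaT code, and it is only an assumption that the figure shown in Help > About corresponds to these values; the class name `HeapCheck` and the helper `toMiB` are made up for illustration. The heap limit of a Java application is normally raised with the JVM's standard `-Xmx` option, which is the kind of launch argument the manual page above covers.

```java
// Hypothetical helper, not part of OmegaT: reports the heap limits of the current JVM.
class HeapCheck {
    /** Converts a byte count to whole mebibytes (rounding down). */
    static long toMiB(long bytes) {
        return bytes / (1024L * 1024L);
    }

    public static void main(String[] args) {
        Runtime rt = Runtime.getRuntime();
        // maxMemory(): the most heap the JVM will attempt to use (governed by -Xmx)
        System.out.println("Max heap:       " + toMiB(rt.maxMemory()) + " MiB");
        // totalMemory(): heap currently reserved by the JVM
        System.out.println("Reserved heap:  " + toMiB(rt.totalMemory()) + " MiB");
        // freeMemory(): unused part of the currently reserved heap
        System.out.println("Free (of that): " + toMiB(rt.freeMemory()) + " MiB");
    }
}
```

Running it once with the default settings and once with a larger `-Xmx` value shows whether the setting is actually taking effect. If the maximum is close to the size of the TMX files being loaded, memory pressure is a plausible cause of the lag.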

Didier


 
Руслан И.
Russian Federation
Local time: 00:55
Turkish > Russian
TOPIC STARTER
Solved Nov 29, 2018

Thank you for your reply! Very clear answer!

 
tcordonniery
France
Local time: 21:55
Lucene Indexes for OmegaT Jan 22, 2019

Hi Ruslan, sorry that I did not see this message earlier.

I experimented with creating indexed translation memories for OmegaT using Apache Lucene. This library is already bundled with OmegaT (where it is used for tokenizers), but I added the ability to use it for indexing, which is its original purpose.

You can download a patched version of OmegaT here:
http://www.silvestris-lab.org/node/43
To learn how to use Lucene indexes, see:
http://www.silvestris-lab.org/node/26

As far as I could test, these indexes are comparable in speed to a TMX loaded in memory (search is sometimes faster, sometimes slower), except that they do not need to be loaded: you can have gigabytes of such files without increasing the memory used by OmegaT. So they are definitely useful for large files but pointless for smaller ones.
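
To make the idea of an index concrete, here is a toy inverted index in plain Java. It is only an illustration of the general technique: it is not Lucene's API, not the code of the patched OmegaT, and unlike the Lucene indexes described above it lives entirely in memory. It also does exact all-words lookup rather than the fuzzy matching a translation memory needs. The class name `ToyTmIndex` and its methods are hypothetical.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical toy example: maps each word to the ids of the segments containing it.
class ToyTmIndex {
    private final List<String> segments = new ArrayList<>();
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    /** Lowercases the text and splits it on anything that is not a letter or digit. */
    static String[] tokenize(String text) {
        return text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+");
    }

    /** Stores a source segment and records it under each of its words. */
    void add(String sourceSegment) {
        int id = segments.size();
        segments.add(sourceSegment);
        for (String token : tokenize(sourceSegment)) {
            if (!token.isEmpty()) {
                postings.computeIfAbsent(token, k -> new TreeSet<>()).add(id);
            }
        }
    }

    /** Returns the segments that contain every word of the query, in insertion order. */
    List<String> search(String query) {
        Set<Integer> hits = null;
        for (String token : tokenize(query)) {
            if (token.isEmpty()) {
                continue;
            }
            Set<Integer> ids = postings.getOrDefault(token, new TreeSet<>());
            if (hits == null) {
                hits = new TreeSet<>(ids);   // first word: start from its posting list
            } else {
                hits.retainAll(ids);         // later words: intersect
            }
        }
        List<String> result = new ArrayList<>();
        if (hits != null) {
            for (int id : hits) {
                result.add(segments.get(id));
            }
        }
        return result;
    }

    public static void main(String[] args) {
        ToyTmIndex index = new ToyTmIndex();
        index.add("Open the file menu");
        index.add("Close the file");
        index.add("Save the project");
        System.out.println(index.search("file"));
        System.out.println(index.search("the file menu"));
    }
}
```

The lookup touches only the posting lists of the query words instead of scanning every segment. A library such as Lucene applies the same principle but keeps the posting lists on disk, which is why index size does not have to translate into heap usage.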

This is still experimental; use it at your own risk. Don't hesitate to report the results of your tests here.

Note: this currently cannot be done as a plugin because OmegaT does not provide an API for translation memories (it provides one only for machine translation, which is not the same thing); that is why it looks like a fork. But if you are interested, we can try to convince the core team to include it.

Regards
Thomas