Menu

Show posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.

Show posts Menu

Messages - arsen_a

#16
Hi Gert,

Sorry for my delayed reply but I have some good news for you. Today I managed to install the dictionary with size of 346kb and it ran normally but I have deleted 3 index files from the output directory so it is not the right solution. I also checked to install a file with size of 356kb and it didn't run. I suppose the limit of file size for my Nokia is 350kb. Now, the question is, can we make the dictionary smaller by 10kb?
I had a look into dictionary functions and found some "unnecessary" things, for example if I set to on: Use bitmap fonts, then russian words become unreadable :( Can't we cut this section from the empty dictionary file? Also there are some "unnecessary" languages, for example Vietnamese and Japanese, I think I don't need them.
The next important thing- As I can understand, in the settings field, when we choose the language for the program interface, Russian is not completed yet! You know, Gert I can update that section, I speak russian very well for already 25 years! :) Please just tell me in which format (I guess in Unicode) you would like to have that missing and incorrect words? Seems that's all, thank you very much for your help, I will wait for your answer!
#17
Hi again,

Regarding to the precise file limit for jar applications on my Nokia, I think it can be 300kb, because once I have installed a EuroMap program,which size was 296kb and it ran normally to my astonishment. After that I thought that my Nokia can run even more, installed another dictionary ( I can't remember which one but I remember the size was ~365kb ) and my Nokia could not identify that file. The generated message was: incorrect file.
As to the index file size, as I have mentioned before, each file weights 10kb.
What do you mean by saying: Are the index files well compressed in the JAR-file ? For making the dictionary I use automatic method that is described in the how to. I use this command:
java -jar JarCreator.jar dictionarydirectory emptyjar outputdirectory
( of course with correction for my directory names ).  Do I have to do this step manually? I just checked to compress the finally created jar file with zip and the file size decreased with 5kb only.
I forget to tell you that I have also checked the content of index files and did not find any other unnecessary information there :(
#18
Hello Gert,
Thank you for your help, I have added the line (language1DictionaryUpdateClassName=de.kugihan. etc) and now my properties file looks like this:
infoText:  Spanish-Russian dictionary
                dictionaryAbbreviation: IDP
                numberOfAvailableLanguages: 2
                language1DisplayText: Spanish
                language2DisplayText: Russian
                language1FilePostfix: Esp
                language2FilePostfix: Rus
                language2IsSearchable: false
                language2GenerateIndex: false
                language1DictionaryUpdateClassName=de.kugihan.dictionaryformids.dictgen.dictionaryupdate.DictionaryUpdatePartialIndex
                dictionaryGenerationSeparatorCharacter: ':'
                indexFileSeparationCharacter: ':'
                searchListFileSeparationCharacter: ':'
                dictionaryFileSeparationCharacter: ':'
                dictionaryGenerationInputCharEncoding: UTF-8
                indexCharEncoding: UTF-8
                searchListCharEncoding: UTF-8
                dictionaryCharEncoding: UTF-8
                language1NormationClassName: de.kugihan.dictionaryformids.translation.normation.NormationLat
                language2NormationClassName: de.kugihan.dictionaryformids.translation.normation.NormationRus

now the final .jar file size is 356kb, we have gained another 30kb but it still can not run on my Nokia :(
BTW, here is some information, maybe it can be helpful

Creating: ./output/dictionary/DictionaryForMIDs.properties
Property searchListFileMaxSize set to 235
Property indexFileMaxSize set to 11999
Property dictionaryFileMaxSize set to 6783
Property language1IndexNumberOfSourceEntries set to 9779
Done: property file
#19
Hi Gert,

I just had a look into index files again and found something interesting, as I already said, I have added DictionaryUpdatePartialIndex: {{ }} line into properties file. I discovered, that in there are two big ~40kb index files in the output directory, I looked into them and found that the letters 'm' and 'f' had a lot of indexes or pointers for the directory files. I think you know, that this letters are used to describe the gender of noun, so I think our program did not identified information in the  {{ }} correctly. Dear Gert, can you tell me, did I write that line correctly, is teh syntax right?
DictionaryUpdatePartialIndex: {{ }}
Another question is, here is a line from the index file:
millonario m :5-1215-B
and here is the same word in the directory5 file:
millonario_{{(m)}}:(here are the characters in Unicode format)
Is this correct? BTW, after compilation I ran the program on a PC emulator and when I enter a word in Spanish, after translation I got the word in such format: some word {{(m)}}, is that normal? Should I see this braces or not?
Thanks a lot!
#20
Hi, Gert!

Sorry for delayed answer, I just tried to recompile with the settings that you advised, I have added DictionaryUpdatePartialIndex: {{ }} in the properties file and now I got dictionary with the size of 385kb, so we have gained another 100kb :) Regarding to the icons, I forget to tell you that I am using the light version of empty dictionary, which is 112kb and seems, does not contain icons. Do you have any other ideas to decrease the file size? Thank you very much!
#21
 I have just made a unidirectional dictionary as you advised, but now the file size is 377 :(( I think the problem is in the index files.
I would like to provide this information for comparision: my dictionary source .txt file is 300kb, because it is in Unicode format. The original file was in ISO-8859-1 format, size was 200kb but after compilation, I ran that program on PC Mobile phone emulator and russian letters were unreadable so I decided to change the encoding into Unicode and the file became bigger for 100kb. I have English-Russian dictionary installed on my phone, that is 230kb, the source dictionary for that program weights 560kb and it contains almost 25000 words. So, I am wondering, why this Spanish dictionary, that contains only 10000 words, is becoming 485kb?   
#22
Thanks for your reply!

I have modified the .properties file as you suggested, removed language1DictionaryUpdateClassName and language2DictionaryUpdateClassName, then changed NormationEng into NormationLat and NormationLat into NormationRus. Regarding to complicated syntax, I used as a source for the dictionary, a .txt file containing special characters, for example ñ, ó etc, can this be the cause of my problem?
I have recompiled now and got the same result, file size is 485kb, bigger than before, also the size of index and directory files have been changed, directories-298kb, index-567kb. You advised to "double check that only the desired words from the inputdictionaryfile are indexed ? Maybe there are unnecessary index entries"! Does this program add some unnnecessary files to the original dictionary? I mean not in the .txt file but in the final file? From where does it take that words? I think all the words in Spanish dictionary are necessary :) What can I do next?
#23
Hi, Gert!

Of course I used JarCreator and gained almost 150kb, because the outpout directory size was 600kb and now the .jar file size is 450kb. Here is my DictionaryForMIDs.properties file

infoText:  Spanish-Russian dictionary
      dictionaryAbbreviation: IDP
      numberOfAvailableLanguages: 2
      language1DisplayText: Spanish
      language2DisplayText: Russian
      language1FilePostfix: Esp
      language2FilePostfix: Rus
      dictionaryGenerationSeparatorCharacter: ':'
      indexFileSeparationCharacter: ':'
      searchListFileSeparationCharacter: ':'
      dictionaryFileSeparationCharacter: ':'
      dictionaryGenerationInputCharEncoding: UTF-8
      indexCharEncoding: UTF-8
      searchListCharEncoding: UTF-8
      dictionaryCharEncoding: UTF-8
      language1DictionaryUpdateClassName:

de.kugihan.dictionaryformids.dictgen.dictionaryupdate.DictionaryUpdateIDP
      language2DictionaryUpdateClassName:

de.kugihan.dictionaryformids.dictgen.dictionaryupdate.DictionaryUpdateIDPSpa
      language1NormationClassName: de.kugihan.dictionaryformids.translation.normation.NormationEng
      language2NormationClassName: de.kugihan.dictionaryformids.translation.normation.NormationLat

I have modified it a bit, because characters in .txt file are in Unicode format and the separator is ':'.
Regarding to the other values, for example index files, that you have mentioned, what can I change there to make the size smaller? Do you mean this variables?
•   searchListFileMaxSize/indexFileMaxSize/dictionaryFileMaxSize

I had a look at the index.files but did not find anything because their also contain the same words from directory.files! Do I need this files? I have checked, all the index.files weight 500kb, and directory.files weight 100kb.
Thank you for your help :)
#24
Hello everybody!

Recently I came across to this project and I really appreciate it. I have already made a Spanish-Russian dictionary by your tools, but unfortunately file size is 465kb and it can not run on my Nokia6230. Is it possible to make the dictionary smaller than 300kb? If I make it unidirectional, will it decrease the file size? How can I do that (make unidirectional)? Thanks.