Menu

Show posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.

Show posts Menu

Messages - dreamingsky

#61
That's great.  Thanks for adding that.  I'm working on updating the Japanese dictionary now.  I'll see if the new DictionaryGeneration works.  I should have time to do it this weekend.

I'll see about adding the {{}} information on the "Setting up a New Dictionary" webpage.

Jeff
#62
Huong

I'm happy you like the Japanese-English dictionary.  Actually I didn't make the dictionary.  I just converted the dictionary from it's original format to the DictionaryforMIDs format.  Jim Breen created the dictionary.  You can submit changes to the dictionary here http://www.csse.monash.edu.au/~jwb/cgi-bin/wwwjdic.cgi?17.

Jeff
#63
QuoteI can't type Vietnamese vocabulary. The Vietnamese dictionaries require a cell phone/PDA with a Vietnamese font & Input?

This is a problem we've been trying to fix for years.  We've been waiting for someone with Java programming experience who could make code to type in writing systems other than roman letters.  Now several dictionaries can only be searched from English -> Dictionary 2.  They cannot search from Dictionary 2 -> English.  Some dictionaries are: Vietnamese, Russian, Arabic, Thai, Hindi, and Khmer.

The Japanese dictionary can search from Japanese -> English, but it must be searched in romaji.  Seaching in hiragana would be much better than romaji.  But we must wait for the code to be added.

So, until someone can add the feature to search in non-roman scripts, you will only be able to search from English to Vietnamese (unless you can install a 3rd party program for Vietnamese input).  Sorry.

QuoteHow can I type づ with Romaji?

I'm sorry.  I made a mistake with づ and ぢ in the DictionaryforMIDs program.  Thank you for catching my mistake.  I will try to fix the mistake within 1 week.  Then I will upload a new version of the Japanese dictionary.

Jeff
#65
The DictionaryUpdatePartialIndex DictionaryUpdateClass is very useful.  I've used it on several dictionaries.  I was wondering if the code could be added to the default DfM code instead of a separate UpdateClass?

If users want to use a custom DictionaryUpdateClass such as DictionaryUpdateThaiNIUEng or DictionaryUpdateEDICTJpn, then they can't use DictionaryUpdatePartialIndex without referencing it in the new UpdateClass.

We use brackets [] for ContentDeclarations.  So maybe write the code for the curly brackets {{}} similar in DfM.

Then users could write a dictionary using {{}} without having to reference anything in the DictionaryForMIDs.properties.

Jeff
#66
Here you go.  It's ready to go.

I added this to your infoText in the DictionaryForMIDs.properties:
infoText: Deutsch - Tagalog (by Piero Peruzzi) 02Feb04 http://www.de-it.ch/lexikon-deutsch-tagalog.htm

I included a neat little utility called "Command Prompt Here" released by Microsoft back with Windows 2000.  Unzip the file, right-click it, and click Install.  This will let you start the Command Prompt from any directory.

Extract "DictionaryForMIDs_3.4.0_GerTgl_Peruzzi(source)" to your "C:\" (root) directory.

In Windows Explorer, right click on the C:\Temp\Deu-Tgl directory and click on "CMD Prompt Here".

type setup [Enter]
type jar [Enter]

Then you're done.  The JAR and JAD file will come up in the C:\Temp\Deu-Tgl\JAR directory.

If you get an error message about not finding Java, then you might need to add Java to your Windows Path:
Start -> Command Prompt -> System -> Advanced System Settings -> Environment Variables -> System Variables -> Path -> Edit

Add this to the end of your path (you need the ";"):
;C:\Program Files (x86)\Java\jre6\bin\
Then reboot and you should be good to go.

You might want to upload your dictionary to the DictionaryforMIDs website.  E-mail Peter Kmet (see the Contacts page http://dictionarymid.sourceforge.net/contact.html) to e-mail the file to be uploaded.

Have fun
#67
Yes, I can help finish the last step.  I just need to know where you got the word list.

QuoteP. S. There is no special internet site where I found the vocabulary, I was using everything I could find
No problem, can you just post links here of the top 3 webpages you used to make the word list?  Please sort the list by writing the webpage you used the most at the top of the list.

QuoteCan I later change the vocabulary if there might be a mistake? And can I put more words to complete the dictonary?
Yes, no problem.  Please upload the final version of the word list you want on the forum.
#68
Setting up a dictionary is a 3 step process.

Step 1: Prepare the data file.

Setup the Excel file with 2 columns: headword and definition
Save the file as "Unicode text" (Unicode UTF-16LE)
Open the file in Microsoft Office and then resave the file as "plain text" (other encoding: Unicode UTF-8)
(the data file will be ready to go with a tab between the headword and definition)

Step 2: Create the DictionaryForMIDs.properties file

Paste this into Microsoft Word:

infoText: Deutsch - Tagalog
dictionaryAbbreviation: Deu-Tgl
numberOfAvailableLanguages: 2

language1DisplayText: Deutsch
language2DisplayText: Tagalog
language1FilePostfix: Deu
language2FilePostfix: Tgl

language1IsSearchable: true
language2IsSearchable: true
language1GenerateIndex: true
language2GenerateIndex: true

language1HasSeparateDictionaryFile: false
language2HasSeparateDictionaryFile: false

dictionaryGenerationSeparatorCharacter: '\t'
indexFileSeparationCharacter: '\t'
searchListFileSeparationCharacter: '\t'
dictionaryFileSeparationCharacter: '\t'

fileEncodingFormat: plain_format1
dictionaryGenerationInputCharEncoding: UTF-8
indexCharEncoding: UTF-8
searchListCharEncoding: UTF-8
dictionaryCharEncoding: UTF-8

language1NormationClassName=de.kugihan.dictionaryformids.translation.normation.NormationGer
language2NormationClassName: de.kugihan.dictionaryformids.translation.NormationFil


Change "infoText: Deutsch - Tagalog" to the name of the dictionary you downloaded and include the website where you got the dictionary.
For example:
infoText: EDICT by Jim Breen http://www.csse.monash.edu.au/~jwb/j_edict.html (version 25Oct06)

Then save the file as a Unicode UTF-8 plain text file again.

Step 3:
Run DictionaryforMIDs

Download 3 files:
http://dictionarymid.sourceforge.net/newdict.html

1. DictionaryForMIDs_DictionaryGeneration_3.1.0.zip
2. DictionaryForMIDs_3.3.0_empty.zip
3. DictionaryForMIDs_JarCreator_3.1.2.zip


Extract the 3 files and put them plus your data file and DictionaryForMIDs.properties file in the 'C:\Temp\Dict\Deu_Tgl\' directory.

Paste this into Microsoft Word:
java -jar C:\Temp\Dict\Deu_Tgl\DictionaryGeneration.jar C:\Temp\Dict\Deu_Tgl\Deu_Tgl.txt C:\Temp\Dict\Deu_Tgl\dictionary C:\Temp\Dict\Deu_Tgl

Change 'Deu_Tgl.txt' to the name of your data file.
Save the file as plain text (Western European encoding) and save it as 'setup.bat'

Paste this into Microsoft Word:
java -jar C:\Temp\Dict\Deu_Tgl\JarCreator.jar C:\Temp\Dict\Deu_Tgl\dictionary C:\Temp\Dict\Deu_Tgl C:\Temp\Dict\Deu_Tgl\JAR\

Save the file as plain text (Western European encoding) and save it as 'jar.bat'
Then go Start -> Programs -> Accessories -> Command Prompt

Type setup [Enter]
Type jar [Enter]

That's it.  The dictionary is ready to go in C:\Temp\Dict\Deu_Tgl\JAR\.  If you want you can upload it to Sourceforge for others to enjoy.  E-mail one of us on the Contacts page and we can upload it.

Hope that helps
#69
funfun

Thanks for all your hard work.  So the dictionary fails at 108,503 entries (out of 120,179 entries).

I'm wondering if the number of entries might affect the performance.  The "Webster's Revised Unabridged Dictionary" English-English is 11MB (the EDICT dictionary is only 9MB).  But, I'm thinking the EDICT dictionary might have more entries with shorter definitions, whereas the Webster's dictionary has fewer entries with longer definitions.

Gert, maybe changing the settings in the DictionaryForMIDs.properties file might help.  Right now I have the settings at:
dictionaryGenerationMinNumberOfEntriesPerDictionaryFile: 200
dictionaryGenerationMinNumberOfEntriesPerIndexFile: 500

Would different numbers work better?

Changing this might fix the "...max nr of hits reached" problem.  But I still have no idea what is causing the "cannot read the jar file" error.

Also, funfun, are you using the Bitmap font option in DfM?  Or did you add a 3rd party program to add Japanese fonts on your phone?

Jeff
#70
funfun

I can e-mail you the source files for the EDICT dictionary.  Then you can make changes until the JAR and JAD files will work on your phone.  Send me an empty e-mail to jeffkrengel (at) hotmail.com.  Then I'll e-mail you the source files.

Once you get them then:

1. run c:\temp\dict\EDICT(P)\setup.bat
2. run c:\temp\dict\fontgenerator.jar
- font: select MSMincho
- directory: select c:\temp\dict\EDICT(P)\Dictionary
3. run c:\temp\dict\EDICT(P)\jar.bat
4. install DictionaryForMIDs_EngJpn_EDICT-P.jad and DictionaryForMIDs_EngJpn_EDICT-P.jar to your phone and see if it works

This 1st setup should work OK since it is the exact setup on the DfM website.

- Then copy about 10,000 lines from the "Japanese_EDICT_(non-P-only).txt" file to the end of the "Japanese_EDICT(P).txt" file.
- Then delete the "Dictionary" directory and make a new "Dictionary" directory in its place (it is quicker to delete the directory then to delete all the CSV files).
- Then run steps #1 - #4 again.
- Keep doing that until the JAR and JAD files no longer work on your phone.
- Then let us know when it stopped working.

Jeff
#71
Would you mind finding the "JAR-file size limit" of your phone?  Maybe Google for the infomation.  If the 2MB dictionary works OK, but the 9MB dictionary doesn't, then I'm thinking the phone can't handle 9MB Java files.

If your phone can do 9MB Java files, then I guess I'm out of ideas.  Maybe Gert or someone might have more ideas.
#72
Hmm, have you tried installing other dictionaries?  Do any of the other ones work OK?

Also, for clarification - is your phone a Sony Ericsson P990i? (any phone that starts with a "P" and ends with an "i" and wants a Japanese dictionary leads me to guess you might have a DoCoMo FOMA phone).

Also, are you installing the full version (8.9MB) dictionary?  Or are you installing the reduced "(P)" (2MB) dictionary?
#73
I posted the new version yesterday.  I didn't do a thorough test on the new romaji inputs, however.
#74
I was waiting to get a response from oxxide before posting a new build.  I'll wait another week then I'll go ahead and post it.

I don't need a special build for it.  I'll just use WinRAR to update the JAR file.

Thanks
#75
Problems / Re: Bitmap Font Setup
17. January 2008, 05:13:21
I don't think the bitmap fonts from version 3.0.3 will work with version 3.2.0.  The bitmap fonts in version 3.0.3 only has the file font.bmf.  In version 3.1.0 the bitmap fonts are in the PNG files.

The file size for the 12-point BMF file is 126KB (MSMincho font).  The file size for the 12-point PNG files (MSMincho font) is 128KB + a 19KB BMF file (147KB total).  So the PNG files are a bit larger than the old BMF file.  But, out of a 8.9MB dictionary file, I think 147KB isn't bad.

I was thinking of having 3 sizes of bitmap fonts for the next release: 12pt, 14pt, 16pt.  But the file sizes gets pretty large: 147KB + 182KB + 220KB (549KB total).

12pt font is good enough for most older cell phones, I think.  But, high resolution screens might do better with 14pt font.

If you extract the "Dictionary" directory from version 3.0.3 and then run the newest version 3.1.0 of the fontgenerator then use the new version of the jarcreator (3.1.0) with the new JAR and JAD files, then you should get a working file.