Menu

Show posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.

Show posts Menu

Messages - jn0101

#16
Quote from: Gert on 05. June 2010, 08:37:18
@jn0101
I just noted:

{{[02~s {{[06ad]}}, ~ hinanden]}}

I believe that nesting of {{ and }} is not supported by the current implementation (see class DictionaryUpdate.java). That may be part of the problem; well, maybe there is an additional issue.

Would it be hard for you to avoid nesting of {{ and }} ?

What I need is to exclude all text in []'s. Therefore I have these replacements as a part of the preprocessing:

# - ni enmetu {{ kaj }} por eviti indeksigon de io en [ kaj ].
   sed 's/\[/{{[/g' |
   sed 's/\]/]}}/g' |

I think it would take quite some time to exclude nesting, as the [ and ]'s are nested.

Anyway the phrase {{[02~ nogen af med affaldet]}} does not have nestings, so I cant see how it should happen.
I might try with only this line and with no nesting... but I expect it to make no difference....

Jacob
#17
General discussions / Re: Build environment
05. June 2010, 08:46:29
Hey, I just tried downloading. I can't extract the files.
Could you please distribute a ZIP file and not an EXE file?

There is no reason for keeping me and other Linux users out of this. This is Java, we can use the same JARs so lets also use the same build environment...

Jacob
#18
Im using the SVN version:

/*
* Note: this class is obsolete starting with DictionaryGeneration 3.5.0 because
* with version 3.5.0 of DictionayGeneration the behaviour from DictionaryUpdatePartialIndex
* is already included in class DictionaryUpdate. The class DictionaryUpdatePartialIndex
* is only retained for compatibility reasons.
*/
package de.kugihan.dictionaryformids.dictgen.dictionaryupdate;


public class DictionaryUpdatePartialIndex extends DictionaryUpdate {
   
}


so.... yes :-)
#19
I think there is either a problem (bug) or I just don't understand how to use it:

I have for example the line:

hjælpe  {{[051]}}{{[01tr]}} helpi; {{[02understøtte]}} subteni; {{[02assistere]}} asisti; {{[02~s {{[06ad]}}, ~ hinanden]}} helpi unu la alian, interhelpi sin, {{[02også fx: ~s ad med at trække]}} kune tiri, kunlabori tirante; {{[02~ {{[06af]}} med, fx: ~ nogen af med jakken]}} helpi iun demeti la jakon, {{[02~ nogen af med affaldet]}} helpi iun seniĝi pri la rubo; {{[02~ {{[06med/]}} til]}} kunhelpi, asisti; {{[02fx: ~ nogen jakken {{[06på]}}]}} helpi iun surmeti la jakon; {{[02fx: ~ {{[06til]}} en stilling]}} helpi dungiĝi; {{[02{{[06stå]}} til at ~]}} esti helpebla/ savebla; {{[052]}}{{[01tr]}} {{[02~ hen, fx: ~ frem]}} helpi antaŭeniri/ progresi, {{[02~ ind i huset]}} helpi veni en la domon, {{[02~ op]}} helpi stariĝi/ supreniri, {{[02~ op på hesten]}} helpi surĉevaliĝi; {{[02se også: komme, gå]}}; {{[053]}}{{[02gavne]}} {{[01itr]}} utili, {{[02det kan ikke ~ noget]}} tio ne utilas, estas senutile, {{[02hvad ~r det?]}} por kio utilas?; {{[02gøre nemmere]}} {{[01tr]}} plifaciligi, {{[02fx: det skal nok ~ på hans forståelse]}} tio certe utilos al li por kompreni, tio certe plifaciligos lian komprenon

The string "nogen af med affaldet" is here enclosed as {{[02~ nogen af med affaldet]}} so it SHOULDNT be indexed at all when reversing and going Esperanto->Danish


Anyway, I have this gigantic redundancy. It seems the phrase is included TWENTY-TWO times in the Esperanto index.

$ grep -l "nogen af med affaldet" *
directory58.csv
indexEpo104.csv
indexEpo115.csv
indexEpo118.csv
indexEpo12.csv
indexEpo144.csv
indexEpo152.csv
indexEpo159.csv
indexEpo166.csv
indexEpo170.csv
indexEpo172.csv
indexEpo1.csv
indexEpo26.csv
indexEpo2.csv
indexEpo45.csv
indexEpo60.csv
indexEpo65.csv
indexEpo68.csv
indexEpo7.csv
indexEpo83.csv
indexEpo84.csv
indexEpo86.csv
indexEpo87.csv

$ grep "nogen af med affaldet" *
directory58.csv:hjælpe   [051][01tr] helpi; [02understøtte] subteni; [02assistere] asisti; [02~s [06ad], ~ hinanden] helpi unu la alian, interhelpi sin, [02også fx: ~s ad med at trække] kune tiri, kunlabori tirante; [02~ [06af] med, fx: ~ nogen af med jakken] helpi iun demeti la jakon, [02~ nogen af med affaldet] helpi iun seniĝi pri la rubo; [02~ [06med/] til] kunhelpi, asisti; [02fx: ~ nogen jakken [06på]] helpi iun surmeti la jakon; [02fx: ~ [06til] en stilling] helpi dungiĝi; [02[06stå] til at ~] esti helpebla/ savebla; [052][01tr] [02~ hen, fx: ~ frem] helpi antaŭeniri/ progresi, [02~ ind i huset] helpi veni en la domon, [02~ op] helpi stariĝi/ supreniri, [02~ op på hesten] helpi surĉevaliĝi; [02se også: komme, gå]; [053][02gavne] [01itr] utili, [02det kan ikke ~ noget] tio ne utilas, estas senutile, [02hvad ~r det?] por kio utilas?; [02gøre nemmere] [01tr] plifaciligi, [02fx: det skal nok ~ på hans forståelse] tio certe utilos al li por kompreni, tio certe plifaciligos lian komprenon
indexEpo104.csv:med at trække kune tiri kunlabori tirante af med f nogen af med jakken helpi iun demeti la jakon nogen af med affaldet helpi iun senigi pri la rubo med til kunhelpi asisti f nogen jakken på helpi iun surmeti la jakon f til en stilling helpi dungigi stå til at esti helpebla savebla 2 tr hen f frem helpi antaueniri progresi ind i huset helpi veni en la domon op helpi starigi supreniri op på hesten helpi surcevaligi se også komme gå 3 gavne itr utili det kan ikke noget tio ne utilas estas senutile hvad r det por kio utilas gøre nemmere tr plifaciligi f det skal nok på hans forståelse tio certe utilos al li por kompreni tio certe plifaciligos lian komprenon   58-7928-S


indexEpo104.csv:med f nogen af med jakken helpi iun demeti la jakon nogen af med affaldet helpi iun senigi pri la rubo med til kunhelpi asisti f nogen jakken på helpi iun surmeti la jakon f til en stilling helpi dungigi stå til at esti helpebla savebla 2 tr hen f frem helpi antaueniri progresi ind i huset helpi veni en la domon op helpi starigi supreniri op på hesten helpi surcevaligi se også komme gå 3 gavne itr utili det kan ikke noget tio ne utilas estas senutile hvad r det por kio utilas gøre nemmere tr plifaciligi f det skal nok på hans forståelse tio certe utilos al li por kompreni tio certe plifaciligos lian komprenon   58-7928-S

indexEpo104.csv:med jakken helpi iun demeti la jakon nogen af med affaldet helpi iun senigi pri la rubo med til kunhelpi asisti f nogen jakken på helpi iun surmeti la jakon f til en stilling helpi dungigi stå til at esti helpebla savebla 2 tr hen f frem helpi antaueniri progresi ind i huset helpi veni en la domon op helpi starigi supreniri op på hesten helpi surcevaligi se også komme gå 3 gavne itr utili det kan ikke noget tio ne utilas estas senutile hvad r det por kio utilas gøre nemmere tr plifaciligi f det skal nok på hans forståelse tio certe utilos al li por kompreni tio certe plifaciligos lian komprenon   58-7928-S
#20
Ok, I changed the implementation, now there is an encryption password, which will make it an order of magnitude harder to break encryption.

Gert, could you please send me a DictionaryForMIDs_3.5.0_empty.zip compiled with these changes ?

It seems that my own version cannot handle the large files in my emulator (I get a OutOfMemoryError), but your obfuscated version runs file.

Thanks,
Jacob
#21
Argh, I will have to change the encryption. The author played with breaking encryption, unfortunately, and thinks its too easy :-(

I trust noone has actually used the weakCrypt and I can just change the implementation.

Jacob
#22
General discussions / Re: Smal bug
04. June 2010, 15:46:12
Cool you found out Achim!

Yes, seems a good idea.

In general I think you can assume that noone are using 1.5 anymore (but 1.6 yes).

Jacob
#23
General discussions / Re: Build environment
04. June 2010, 15:44:39
Great idea to set up a ready to run ZIP file with all the tools.

What is definitely also missing is an full example of doing everything, complete with all input files and a script for producing the output files. Consider including that as a sample. 'evrything' includes setting languageNormationClassName and languageContentDisplayText etc, and with an input dictionary included entries like [01foo [02bar]] that actually uses these things.

And perhaps all a small minimal example sample.

If you smash togehter the ZIP file and a Windows script for the minimal example sample, I could do the 'complete' sample.


Pls note that for Java-enabled developers Ive put a small README.txt in the SVN root, to help you get started compiling the source (I was missing that)

Jacob
#24
Ive experienced the same. Properties should be ISO-8859-1 and I wanted special chars like ĉ, ĝ, ŭ.

I solved it by using UTF-8 in my scripts and then just pipe it thru the command "native2ascii -encoding UTF-8", like this:

echo "
infoText: blabla. Kiel serĉi:\nPor tajpi literojn kun akcentoj (ekz ĉ, ĝ, ŭ) vi simple tajpu la bazan varianton de la litero (ekz c, g, c).

... (stuff)

" | native2ascii -encoding UTF-8 > DictionaryForMIDs.properties

In the file will be the escaped values.

infoText: blabla. Kiel ser\u0109i:\nPor tajpi literojn kun akcentoj (ekz \u0109, \u011d, \u016d) vi simple tajpu la bazan varianton de la litero (ekz c, g, c).
#25
I have some desires for the Android version:

- A new release with the updated Esperanto translations (no hurry, though :-)

- In the "download dictionaries" section it'd be great to be able to download more than one dictionary at a time (subsequent clicks should be queued)

- When long pressing the input text field there is a "Choose all", "Cut", "Copy" etc, but the option "Paste" is missing.

- Believe it or not, but I could not change language directions the first few times. I was looking in the menu. Your "turn"-button is fine, but consider also adding a "switch direction" to the menu, for new users to not get stuck.

I might do the changes myself, if you like the ideas.


Yours,
Jacob
#26
Yes, this is definitely becaurse the normation class is not in 3.4. I created it 3 weeks ago, so it should be available from 3.5.0 and onward.

I was using JarCreator from SVN, therefore I didnt get that error.

Jacob
#27
Quote from: Gert on 02. June 2010, 06:54:28
@jn0101
Thank you for your great work !!

QuoteGuys, I believe my work is finished
:o  - hey, we need you to do so many more things on the DictionaryForMIDs project  ;)

Ah, Im not leaving, just continiuing to the next thing :-)
I have a non-GPL Danish-Esperanto dictionary to set up, plus a lot of Apertium dictionaries.


Quote from: Gert on 02. June 2010, 06:54:28
... besides, about the Esperanto dictionaries that you did set up: would you like to send your dictionaries to Jeff for his "dictionary archive" ? See http://dictionarymid.sourceforge.net/dictionaryArchive.html

Sure, I'll do that. But my expectation is that Lernu! will publish all source under GPL at some time in the future, and I will provide the scripting to generate the dictionaries from scratch at any time.
#28
Guys, I believe my work is finished, and I want to thank you all very much for all your great support!!

Ive done a small manual in Esperanto. Its not finished, but the preliminary version can already be seen on the permanent Lernu! address of the dictionaries: http://lernu.net/vortolistoj/.

(a crude machine translation of the page to English, using Apertium, can be seen at http://xixona.dlsi.ua.es/apertium-www/common/browser.php?mark=&dir=eo-en&inurl=http://lernu.net/vortolistoj/ )

Thanks again,
Jacob
#29
General discussions / Re: Smal bug
01. June 2010, 21:50:49
Esperanto and Danish uses (like most european countries) the , to seperate the fraction from the whole part.
What shoud the pattern be, in this case?

Probably a lot of translators will have missed this...
#30
General discussions / Smal bug
01. June 2010, 13:08:15
See here

This is for "Esperanto-Czech", 320827 bytes.

I suspect the string
   <string name="file_size">%1$,.2f %2$s</string>
is somehow causing trouble.

I always get it when unit is kilobytes.
I never get it when unit is megabytes.