Re: Words of Bug in Leopard (Japanese environment)
Re: Words of Bug in Leopard (Japanese environment)
- Subject: Re: Words of Bug in Leopard (Japanese environment)
- From: Takaaki Naganoya <email@hidden>
- Date: Tue, 12 Aug 2008 13:24:28 +0900
I made a tiny Japanese word parsing app by AppleScript studio in 10
minutes.
It calls MeCab in Leopard. Everybody can enjoy Japanese parsing (this
work with Intel mac and Leopard).
Attachment:
mecabTest.zip
Description: Zip archive
But...MeCab in Leopard seems not to refer learning dictionary of
Kotoeri Japanese input system.
So, proper nouns and person's names are not prepared correctly.
On 2008/08/12, at 11:22, Takaaki Naganoya wrote:
What is at the heart of the matter?
I feel the lack of policy. Which is the right way? Is there need
to write misdirected bug-report in vain ?
Do you want to *fix* the word parsing function with *natural*
language ?
Do you know the existence of the "MeCab" Japanese Morphological
Analysis engine in Leopard? (/usr/lib/libmecab*) It is working for
making Spotlight dictionary in Leopard.
I think the most biggest theme is detecting the situation by yourself.
At first, you'd better to talk with Mr.Yasuo Kida in Apple US.
On 2008/08/12, at 4:04, Christopher Nebel wrote:
Well, I encourage you to file bugs. Word breaking in Japanese is,
by definition, sort of hard, since it doesn't use any sort of word
separators. (Though I have no idea why it got that one wrong,
since as far as I can tell "一番高" isn't a real word.) The
word break behavior isn't specific to AppleScript -- as I said
before, it relies on the system utilities for word breaking, which
rely on data that comes from the Unicode specification. In other
words, it may be wrong, but at least it's consistent -- try using
option-arrow to move by words through text, and you'll see what I
mean.
--Chris Nebel
AppleScript Engineering
On Aug 8, 2008, at 6:56 PM, Takaaki Naganoya wrote:
It is a nice Joke! I am laughing out loud.
AppleScript's word breaking ability for *natural* language ?
It is useless for parsing with "natural language".
I don't expect natural language parsing ability to AppleScript.
Japanese word breaking?
<AppleScript>
set a to "富士山は日本で一番高い山です。"
set b to words of a
</AppleScript>
--> {"富士山", "は", "日本", "で",
"一番高", "い", "山", "です"}
It is crazy and useless *word* breaking. We thought word breaking
ablitity in AppleScript is a kind of joke.
On 2008/08/06, at 15:20, Christopher Nebel wrote:
Basically, don't use "word" elements to process anything other
than natural language text. In your case, you probably want text
items breaking on ":". Alternatively, you could set your word
break preference (System Preferences > International > Language >
Word break) to "English (United States, Computer)", which always
treats ":" as a word break, but that would probably do horrible
things to your Japanese word breaking.
--
Takaaki Naganoya
Piyomaru Software
http://piyo.piyocast.com
email@hidden
PiyoCast Web (Podcasting with Music!)
http://www.piyocast.com
Free AppleScript Library "AS Hole"
http://www.piyocast.com/as/
_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (email@hidden
)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users
This email sent to email@hidden
--
Takaaki Naganoya
Piyomaru Software
http://piyo.piyocast.com
email@hidden
PiyoCast Web (Podcasting with Music!)
http://www.piyocast.com
Free AppleScript Library "AS Hole"
http://www.piyocast.com/as/
_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (applescript-
email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users
This email sent to email@hidden
--
Takaaki Naganoya
Piyomaru Software
http://piyo.piyocast.com
email@hidden
PiyoCast Web (Podcasting with Music!)
http://www.piyocast.com
Free AppleScript Library "AS Hole"
http://www.piyocast.com/as/
_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users
This email sent to email@hidden