Re: Words of Bug in Leopard (Japanese environment)
Re: Words of Bug in Leopard (Japanese environment)
- Subject: Re: Words of Bug in Leopard (Japanese environment)
- From: Philip Aker <email@hidden>
- Date: Mon, 11 Aug 2008 20:21:01 -0700
On Aug 11, 2008, at 7:22 PM, Takaaki Naganoya wrote:
What is at the heart of the matter?
I feel the lack of policy. Which is the right way? Is there need
to write misdirected bug-report in vain ?
Do you want to *fix* the word parsing function with *natural*
language ?
Do you know the existence of the "MeCab" Japanese Morphological
Analysis engine in Leopard? (/usr/lib/libmecab*) It is working for
making Spotlight dictionary in Leopard.
I think the most biggest theme is detecting the situation by yourself.
At first, you'd better to talk with Mr.Yasuo Kida in Apple US.
Also, according to the previous explanation, the specific 'ifconfig'
call only works with a POSIX locale. So something's fundamentally
wrong if you can't word break at ':' in a US English locate receiving
UTF-8 from a shell command call.
I'm just wondering, since I've noticed a lot of folks having
connection problems in Leopard (Xcode documentation update links e.g.)
if there's some misinterpretation of particular byte codes somewhere
along the way. That is, ':' will not have the high bit set, but if the
byte before that is a byte which denotes a continuation character is
to follow then one can understand why the errors occur.
Furthermore, in the case of ifconfig, everyone will have a different
en0 -> ether value, so that could explain why some machines return a
bad value while others do not. Theoretically, making for a situation
where someone at Apple can't reproduce the reported error because
their ether ID contains no bytes which would trigger misinterpretation.
On 2008/08/12, at 4:04, Christopher Nebel wrote:
Well, I encourage you to file bugs. Word breaking in Japanese is,
by definition, sort of hard, since it doesn't use any sort of word
separators. (Though I have no idea why it got that one wrong,
since as far as I can tell "一番高" isn't a real word.) The
word break behavior isn't specific to AppleScript -- as I said
before, it relies on the system utilities for word breaking, which
rely on data that comes from the Unicode specification. In other
words, it may be wrong, but at least it's consistent -- try using
option-arrow to move by words through text, and you'll see what I
mean.
--Chris Nebel
AppleScript Engineering
On Aug 8, 2008, at 6:56 PM, Takaaki Naganoya wrote:
It is a nice Joke! I am laughing out loud.
AppleScript's word breaking ability for *natural* language ?
It is useless for parsing with "natural language".
I don't expect natural language parsing ability to AppleScript.
Japanese word breaking?
<AppleScript>
set a to "富士山は日本で一番高い山です。"
set b to words of a
</AppleScript>
--> {"富士山", "は", "日本", "で",
"一番高", "い", "山", "です"}
It is crazy and useless *word* breaking. We thought word breaking
ablitity in AppleScript is a kind of joke.
On 2008/08/06, at 15:20, Christopher Nebel wrote:
Basically, don't use "word" elements to process anything other
than natural language text. In your case, you probably want text
items breaking on ":". Alternatively, you could set your word
break preference (System Preferences > International > Language >
Word break) to "English (United States, Computer)", which always
treats ":" as a word break, but that would probably do horrible
things to your Japanese word breaking.
--
Takaaki Naganoya
Piyomaru Software
http://piyo.piyocast.com
email@hidden
PiyoCast Web (Podcasting with Music!)
http://www.piyocast.com
Free AppleScript Library "AS Hole"
http://www.piyocast.com/as/
_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (email@hidden
)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users
This email sent to email@hidden
--
Takaaki Naganoya
Piyomaru Software
http://piyo.piyocast.com
email@hidden
PiyoCast Web (Podcasting with Music!)
http://www.piyocast.com
Free AppleScript Library "AS Hole"
http://www.piyocast.com/as/
_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (applescript-
email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users
This email sent to email@hidden
Philip Aker
echo email@hidden@nl | tr a-z@. p-za-o.@
Democracy: Two wolves and a sheep voting on lunch.
_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users
This email sent to email@hidden