• Open Menu Close Menu
  • Apple
  • Shopping Bag
  • Apple
  • Mac
  • iPad
  • iPhone
  • Watch
  • TV
  • Music
  • Support
  • Search apple.com
  • Shopping Bag

Lists

Open Menu Close Menu
  • Terms and Conditions
  • Lists hosted on this site
  • Email the Postmaster
  • Tips for posting to public mailing lists
Re: Words of Bug in Leopard (Japanese environment)
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Words of Bug in Leopard (Japanese environment)


  • Subject: Re: Words of Bug in Leopard (Japanese environment)
  • From: Philip Aker <email@hidden>
  • Date: Mon, 11 Aug 2008 20:21:01 -0700

On Aug 11, 2008, at 7:22 PM, Takaaki Naganoya wrote:

What is at the heart of the matter?

I feel the lack of policy. Which is the right way? Is there need to write misdirected bug-report in vain ?
Do you want to *fix* the word parsing function with *natural* language ?


Do you know the existence of the "MeCab" Japanese Morphological Analysis engine in Leopard? (/usr/lib/libmecab*) It is working for making Spotlight dictionary in Leopard.

I think the most biggest theme is detecting the situation by yourself.

At first, you'd better to talk with Mr.Yasuo Kida in Apple US.

Also, according to the previous explanation, the specific 'ifconfig' call only works with a POSIX locale. So something's fundamentally wrong if you can't word break at ':' in a US English locate receiving UTF-8 from a shell command call.


I'm just wondering, since I've noticed a lot of folks having connection problems in Leopard (Xcode documentation update links e.g.) if there's some misinterpretation of particular byte codes somewhere along the way. That is, ':' will not have the high bit set, but if the byte before that is a byte which denotes a continuation character is to follow then one can understand why the errors occur.

Furthermore, in the case of ifconfig, everyone will have a different en0 -> ether value, so that could explain why some machines return a bad value while others do not. Theoretically, making for a situation where someone at Apple can't reproduce the reported error because their ether ID contains no bytes which would trigger misinterpretation.



On 2008/08/12, at 4:04, Christopher Nebel wrote:

Well, I encourage you to file bugs. Word breaking in Japanese is, by definition, sort of hard, since it doesn't use any sort of word separators. (Though I have no idea why it got that one wrong, since as far as I can tell "一番高" isn't a real word.) The word break behavior isn't specific to AppleScript -- as I said before, it relies on the system utilities for word breaking, which rely on data that comes from the Unicode specification. In other words, it may be wrong, but at least it's consistent -- try using option-arrow to move by words through text, and you'll see what I mean.


--Chris Nebel AppleScript Engineering

On Aug 8, 2008, at 6:56 PM, Takaaki Naganoya wrote:

It is a nice Joke!   I am laughing out loud.
AppleScript's word breaking ability for *natural* language ?
It is useless for parsing with "natural language".

I don't expect natural language parsing ability to AppleScript.

Japanese word breaking?

<AppleScript>
set a to "富士山は日本で一番高い山です。"
set b to words of a
</AppleScript>

--> {"富士山", "は", "日本", "で", "一番高", "い", "山", "です"}

It is crazy and useless *word* breaking. We thought word breaking ablitity in AppleScript is a kind of joke.

On 2008/08/06, at 15:20, Christopher Nebel wrote:

Basically, don't use "word" elements to process anything other than natural language text. In your case, you probably want text items breaking on ":". Alternatively, you could set your word break preference (System Preferences > International > Language > Word break) to "English (United States, Computer)", which always treats ":" as a word break, but that would probably do horrible things to your Japanese word breaking.

-- Takaaki Naganoya Piyomaru Software http://piyo.piyocast.com email@hidden

PiyoCast Web (Podcasting with Music!)
http://www.piyocast.com
Free AppleScript Library "AS Hole"
http://www.piyocast.com/as/



_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (email@hidden )
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users


This email sent to email@hidden


-- Takaaki Naganoya Piyomaru Software http://piyo.piyocast.com email@hidden

PiyoCast Web (Podcasting with Music!)
http://www.piyocast.com
Free AppleScript Library "AS Hole"
http://www.piyocast.com/as/



_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (applescript- email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users


This email sent to email@hidden


Philip Aker
echo email@hidden@nl | tr a-z@. p-za-o.@

Democracy: Two wolves and a sheep voting on lunch.

_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users

This email sent to email@hidden
References: 
 >Words of Bug in Leopard (Japanese environment) (From: Takaaki Naganoya <email@hidden>)
 >Re: Words of Bug in Leopard (Japanese environment) (From: Christopher Nebel <email@hidden>)
 >Re: Words of Bug in Leopard (Japanese environment) (From: Takaaki Naganoya <email@hidden>)
 >Re: Words of Bug in Leopard (Japanese environment) (From: Christopher Nebel <email@hidden>)
 >Re: Words of Bug in Leopard (Japanese environment) (From: Takaaki Naganoya <email@hidden>)

  • Prev by Date: Re: Words of Bug in Leopard (Japanese environment)
  • Next by Date: Re: Words of Bug in Leopard (Japanese environment)
  • Previous by thread: Re: Words of Bug in Leopard (Japanese environment)
  • Next by thread: Re: Words of Bug in Leopard (Japanese environment)
  • Index(es):
    • Date
    • Thread