Re: How to tell if text is of another language
Re: How to tell if text is of another language
- Subject: Re: How to tell if text is of another language
- From: "Mark J. Reed" <email@hidden>
- Date: Mon, 21 Apr 2008 12:50:54 -0400
On Sun, Apr 20, 2008 at 10:24 PM, Stephen Lanza <email@hidden> wrote:
> Is there a way to determine this. The character set value is empty
? What "character set value"?
In any case, the character encoding is probably Unicode, which tells
you nothing about the language. You could search for character whose
ids fall in specific ranges from Unicode, e.g. any character with an
id between 19,968 and 40,869 is Chinese; a series of them would tend
to make one suspect that the text was in Chinese (or Japanese or
Korean, especially if interspersed with Kana or Hangeul,
respectively)... anything between 1072 and 1273 is Cyrillic . . . etc.
-
Mark J. Reed <email@hidden>
_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users
This email sent to email@hidden