Re: NSLinguisticTagger usable for non-english?
Re: NSLinguisticTagger usable for non-english?
- Subject: Re: NSLinguisticTagger usable for non-english?
- From: "Gerriet M. Denkmann" <email@hidden>
- Date: Mon, 05 Aug 2013 18:43:25 +0700
On 5 Aug 2013, at 18:21, Igor Elland <email@hidden> wrote:
> Since you’re getting some result, I’ll assume you’re doing this in Cocoa,
Correct. I should have mentioned 10.8.4
> not CocoaTouch (that only really supports English). In this case, I don’t see why you need the lexical tagging, when the lemma tagging would suffice and should yield the expected result without your need to post-process the tagged results.
1. of my 133 000 words only 14 000 have lemmas.
2. "käme", "kämest" have no lemma, "kämen" has (correct) "kommen".
3. "weiß" has Lemma "wissen" which is only correct, if this is a verb. Might be an adjective as well. Or the lemma might be the verb "weißen".
possibleTagsAtIndex returns an array, but this never has more than one lemma.
I also want to sort my words into categories like Verb, Noun, etc.
> Change the scheme to NSLinguisticTagSchemeLemma. Hope this helps.
>
> On Aug 5, 2013, at 1:18 PM, Gerriet M. Denkmann <email@hidden> wrote:
>
>> I am trying to use NSLinguisticTagger with German.
>> Has anybody ever tried this and found it usable?
>>
>> E.g. "Ich motivier dich" ends with Verb, Pronoun (ok).
>> But: "Ich motivier dich." ends with two Adjectives (wrong).
>> And "motivier" is a Number, "motivier." is an Adjective (both wrong).
>>
>> What I am trying to do:
>> I got a list of 133 000 German words and want to convert these to their base forms, like: "käme" → "kommen", "Häuser" → "Haus", "weißem" → "weiß".
_______________________________________________
Cocoa-dev mailing list (email@hidden)
Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden