Re: Most efficient character parsing.
Re: Most efficient character parsing.
- Subject: Re: Most efficient character parsing.
- From: Wolfram Stebel <email@hidden>
- Date: Thu, 16 Feb 2006 00:17:09 +0100
- Thread-topic: Most efficient character parsing.
Am 15.02.2006 22:43 Uhr schrieb "Eric Stewart" unter
<email@hidden>:
> I've got a WOApp that needs to deal with 200 strings of approximately
> 250-500 characters per string.
>
> I need to strip out all characters that are not ISO-8859-1 legal
> characters. So basically any character that is not decimal ascii
> character 9, 10, 13, 32-126, 160-255 need to be removed from the
> string. This process is happening roughly 2.5 million times a day and
> I'm trying to figure out what is the most efficient way to do it.
>
> Right now I'm tearing the strings apart character-by-character and
> checking it's ascii decimal value against the values I know are good.
> Is there a more efficient way to do it?
There should be a regular expression for this to use it in
String.replaceAll.
Precompile it and reuse it for all of them lucky 2.5 millions of strings :-)
I can't write such an expression right from my head... sorry.
Regular Regards
Wolfram
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Webobjects-dev mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden