Re: Issue with NSXMLDocument and bad HTML
Re: Issue with NSXMLDocument and bad HTML
- Subject: Re: Issue with NSXMLDocument and bad HTML
- From: John Stiles <email@hidden>
- Date: Tue, 18 Oct 2005 13:51:39 -0700
On Oct 18, 2005, at 12:48 PM, Brian Stern wrote:
I'm trying to load an HTML page as an XML object like this:
NSURL *url = [NSURL URLWithString:@"http://wwwa.accuweather.com/
forecast.asp?zipcode=94025&partner=accuweather"]];
NSXMLDocument *xml = [[[NSXMLDocument alloc]
initWithContentsOfURL:url options:NSXMLDocumentTidyXML
error:anError]autorelease];
The page seems to be malformed, so the NSXMLDocument becomes nil and
'anError' contains a lot of errors:
NSError "line 78 column 26 - Warning: unescaped & or unknown entity
"&city"
The page isn't malformed. All those ampersands are inside string
constants
in javascript that's on that page. You probably need to download
the page
and remove all the javascript content before passing it to
NSXMLDocument.
Actually, I think it is malformed. It claims to be XHTML, but
contains JavaScript that is not inside a comment block. Look for
"adPosTop". It's valid HTML, but that's not really the same thing.
That was just the first thing I found; it could be malformed in lots
of other creative ways as well :)
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Cocoa-dev mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden