Re: Extracting data from HTML
Re: Extracting data from HTML
- Subject: Re: Extracting data from HTML
- From: Andrew Oliver <email@hidden>
- Date: Mon, 09 Feb 2004 13:43:21 -0800
I agree with Doug - at least on the 'curl' part.
While it is possible to get the source of a page in Safari (get source of
document 1), using curl, or other faceless application has several benefits,
not least of which you don't have the overhead of having to render the page,
and you don't interrupt the users' workflow while you load the page.
If you use Safari you have to launch the app if it isn't running, and open
the page. What happens if the poor sop whose machine the script is running
on is in the middle of browsing some other site? Tough luck for him, I
guess.
I don't have a problem with using AppleScript (vs. Perl) to parse the page,
but I do think that using Safari as the front end is too much.
Andrew
:)
On 2/9/04 12:47 PM, "Doug McNutt" <email@hidden> wrote:
>
At 13:37 -0600 2/9/04, Irwin Poche wrote:
>
> Quite sometime ago I scripted IE to load a page ever so often, pull some
>
> information from the page, and write out a summary of that information.
>
>
Take the time to learn curl and perl. Curl will download the page and perl
>
with its regular expression capability is great for extracting repeated
>
information. The whole thing can be scripted and run with cron. A simple
>
AppleScript can initiate the whole thing in Mac OS neXt but it will also run
>
on a $20 Linux box without modification.
>
>
Ask off line for some samples which are quite simple so long as you don't get
>
involved with https and certification of public keys.
_______________________________________________
applescript-users mailing list | email@hidden
Help/Unsubscribe/Archives:
http://www.lists.apple.com/mailman/listinfo/applescript-users
Do not post admin requests to the list. They will be ignored.