• Open Menu Close Menu
  • Apple
  • Shopping Bag
  • Apple
  • Mac
  • iPad
  • iPhone
  • Watch
  • TV
  • Music
  • Support
  • Search apple.com
  • Shopping Bag

Lists

Open Menu Close Menu
  • Terms and Conditions
  • Lists hosted on this site
  • Email the Postmaster
  • Tips for posting to public mailing lists
Re: Extracting text from html
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Extracting text from html


  • Subject: Re: Extracting text from html
  • From: Doug McNutt <email@hidden>
  • Date: Tue, 3 Sep 2002 10:05:54 -0600

At 13:13 +0200 9/3/02, Gvran Ehn wrote:
>Can someone please illustrate on how to grab text between some specific html tags in a html file?

I did some of that in a simple way using the offset function to look for key text items. It works when the content of the html doesn't change much from day to day. The content is still up on my ftp site where I put it for use by some beginning AppleScripters in the local user group:

ftp://ftp.macnauchtan.com/SMMUG/Get_SMMUG_news_four.bin (11kB)

If you want to be serious then get thee into perl and its LWP module.

And watch out for a feature in AppleScript involving reads "using delimiter". I was unable to make it work with a UNIX linefeed as the delimiter. Read "until" is OK though.


--
-> The U. S. Patent Office issues protection for methods used in stem cell research. Is that an immoral use of taxpayer's money? <-
_______________________________________________
applescript-users mailing list | email@hidden
Help/Unsubscribe/Archives: http://www.lists.apple.com/mailman/listinfo/applescript-users
Do not post admin requests to the list. They will be ignored.

  • Follow-Ups:
    • Re: Extracting text from html
      • From: Ron Bishop <email@hidden>
  • Prev by Date: Re: What's system attribute "__CF_USER_TEXT_ENCODING"?
  • Next by Date: bash now default shell in 10.2; and reading files
  • Previous by thread: Re: Extracting text from html
  • Next by thread: Re: Extracting text from html
  • Index(es):
    • Date
    • Thread