Re: OT: A web link crawler/checker


  • Subject: Re: OT: A web link crawler/checker
  • From: Kieran Kelleher <email@hidden>
  • Date: Sat, 25 Feb 2006 13:32:26 -0500

Just guessing... Jakarta Commons HttpClient, maybe? It's a programmable HTTP client, something like a browser you can drive from Java. Fetch the pages with it, then use regex to pull the links and anchor text out of what you get back.
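
A minimal sketch of the regex step only, assuming the page has already been fetched into a String by whatever client you settle on (the fetch itself is left out here). The names LinkExtractor, Link, and extract are made up for illustration and are not part of HttpClient or any other library. A regex like this is fragile against messy real-world HTML (unquoted attributes, nested anchors, attributes such as data-href), so treat it as a starting point rather than a robust parser.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class LinkExtractor {

    /** A link target paired with its visible anchor text (hypothetical helper type). */
    public static final class Link {
        public final String href;
        public final String text;

        Link(String href, String text) {
            this.href = href;
            this.text = text;
        }
    }

    // Matches <a ... href="..." ...>text</a>; the href value may use single
    // or double quotes. Group 2 is the href, group 3 is the raw anchor body.
    private static final Pattern ANCHOR = Pattern.compile(
        "<a\\b[^>]*?\\bhref\\s*=\\s*([\"'])(.*?)\\1[^>]*>(.*?)</a>",
        Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Used to strip nested markup such as <b>...</b> from the anchor body.
    private static final Pattern TAG = Pattern.compile("<[^>]+>");

    /** Returns every quoted-href anchor found in the given HTML, in document order. */
    public static List<Link> extract(String html) {
        List<Link> links = new ArrayList<>();
        Matcher m = ANCHOR.matcher(html);
        while (m.find()) {
            String href = m.group(2).trim();
            // Drop inner tags, collapse runs of whitespace, and trim the ends.
            String text = TAG.matcher(m.group(3)).replaceAll("")
                             .replaceAll("\\s+", " ")
                             .trim();
            links.add(new Link(href, text));
        }
        return links;
    }

    public static void main(String[] args) {
        // Sample markup standing in for a fetched page.
        String html = "<p><a href=\"http://example.com/a\">First <b>link</b></a> and "
                    + "<A class='x' HREF='/b'>\n second </A></p>";
        for (Link l : extract(html)) {
            System.out.println(l.href + " -> " + l.text);
        }
    }
}
```

Once each page is reduced to a list of (href, text) pairs like this, comparing what a page actually contains against the links and anchor text you expect becomes an ordinary list or map comparison.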

On Feb 25, 2006, at 9:22 AM, James Cicenia wrote:

Hello -

On a new project I will be starting, I have to check remote site pages for links. Basically, I have to crawl through a page and collect and compare both the link and the anchor text.

Any ideas on approach, software, frameworks, etc., would be greatly appreciated.

- James Cicenia
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Webobjects-dev mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
email@hidden


This email sent to email@hidden


  • Follow-Ups:
    • Re: OT: A web link crawler/checker
      • From: James Cicenia <email@hidden>
References: 
 >OT: A web link crawler/checker (From: James Cicenia <email@hidden>)
