projects / schoolwork / webcrawler


webcrawler

This Java program takes a webpage within the winthrop.edu domain and parses the anchor tags to find links within the page. It looks for these tags and compares them to a state table, built with a finite state machine. If a tag is found, it finds the filename of the link and adds it to the list to be printed.

This works for download links as well, due to the indifference of context when it comes to anchor tags.



code


Loading...