Posted By

rowntreerob on 11/03/09

Tagged

perl

Versions (?)

Last Edited at 11/03/09 07:07pm

Statistics

Viewed 984 times

Favorited by 0 user(s)

Related snippets

run 'linkcheck' on bookmarks 1 of 2

/ Published in: Bash

export bookmarks.html file, extract links from that file to a list , using the list as INPUT to linkchecker that reports dead links

Expand | Embed | Plain Text

Copy this code and paste it in your HTML

grep 'HREF\=.http' bookmarks.html | \  # get the links or hrefs from the file
awk '{print $2}' | \                   # save just that word w/ link
sed 's/......//' | \                   # remove prefix
sed 's/"$//' | \                       # remove suffix
grep -v https | \
grep -v mozilla > linkcheck_in         # save file for step 2 INPUT 
 
perl check_links_1.pl <  linkcheck_in  #  STEP 2 , run the perl linkcheker
 
<<STEP 2 STDOUT>>
 
Answer for 'http://www.warnerbros.com/hipclips/' was    404: Not Found
Answer for 'http://www.informit.com/articles/article.aspx?p=353736&seqNum=4&rll=1' was  200: OK
Answer for 'http://www.cs.washington.edu/homes/amp/opine/emnlp05_opine.pdf' was         403: Forbidden
Answer for 'http://java.sun.com/javase/technologies/desktop/javawebstart/index.jsp' was         200: OK
Answer for 'http://www.sfgate.com/eguide/' was  200: OK
Answer for 'http://oedb.org/library/college-basics/invisible-web' was   200: OK
Answer for 'http://linuxmafia.com/bale/' was    200: OK
Answer for 'http://www.oracle.com/index.html' was       200: OK

URL: http://search.cpan.org/~marclang/ParallelUserAgent-2.57/lib/LWP/Parallel.pm

Report this snippet Tweet

Comments

Subscribe to comments

Comment:

You need to login to post a comment.