Here is some added info. There is a lot of noise in it, but it is a start if you need to log in.

Here's the command I'm using: wget -k -m -E -p -np -R memberlist.php*,faq.php*,viewtopic.php*p=*,posting.php*,search.php*,ucp.php*,viewonline.php*,*sid*,*view=print*,*start=0* -o log.txt

I wanted to strip out those pesky session id parameters (sid=blahblahblah). They seem to get added automatically by the index page, and then get attached to all the links in a virus-like fashion. The one exception is a link squirreled away somewhere, which points to a plain index.php; from there the crawl continues with no sid= parameter. (Perhaps there's a way to force the recursive wget to start from index.php - I don't know.)

I have also excluded some other pages that lead to a lot of cruft being saved. In particular, memberlist.php and viewtopic.php where p= is specified can create thousands of files!

Due to this bug in wget, it will still download an astounding number of those useless files - especially the viewtopic.php?p= ones - before simply deleting them. So this is going to burn a lot of time and bandwidth.
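To see which pages the -R list in the command above is meant to throw away, here is a rough Python sketch. It is not how wget itself is implemented: it assumes the patterns are compared against the saved file name including the query string, which is only an approximation of wget's behavior. The host forum.example and the helper names is_rejected and strip_sid are made up for illustration.

```python
from fnmatch import fnmatchcase
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Reject patterns copied from the -R list in the wget command.
REJECT_PATTERNS = [
    "memberlist.php*", "faq.php*", "viewtopic.php*p=*", "posting.php*",
    "search.php*", "ucp.php*", "viewonline.php*", "*sid*",
    "*view=print*", "*start=0*",
]


def is_rejected(url: str) -> bool:
    """Approximate the -R check (assumption: match on file name plus query)."""
    parts = urlsplit(url)
    name = parts.path.rsplit("/", 1)[-1]
    if parts.query:
        name += "?" + parts.query
    return any(fnmatchcase(name, pattern) for pattern in REJECT_PATTERNS)


def strip_sid(url: str) -> str:
    """Return the URL with any sid= query parameter removed."""
    parts = urlsplit(url)
    query = [(key, value)
             for key, value in parse_qsl(parts.query, keep_blank_values=True)
             if key != "sid"]
    return urlunsplit(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    # forum.example is a placeholder host, not a real forum.
    for url in [
        "http://forum.example/viewtopic.php?t=42",
        "http://forum.example/viewtopic.php?p=123",
        "http://forum.example/viewtopic.php?t=42&sid=abc123",
    ]:
        print(url, "rejected" if is_rejected(url) else "kept", strip_sid(url))
```

One thing the sketch makes visible: *sid* is a very broad pattern, so it would also reject any file that merely has "sid" somewhere in its name, not just links carrying a session id.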