It will, it's a new version
All reports, lists, and graphics in the program use an embedded copy of Internet Explorer to display HTML files. For example, when you ask for a list of links, it writes the list to disk in HTML format, and then displays it.
Make sure that you use the full website address. For example, try
(note the http:// at the front, and the / at the end)
The program works only on sites which use well-formed html. For example, using the program on yahoo.com will not work, because they don't use quotes around attribute names, hence their page fails an HTML validator.
To check whether a site has valid HTML, try the W3C validator (click the Valid-HTML! button at the bottom of this page to see what the validator does)
If the site uses frames, try entering the URL of the document itself, rather than the overall "frames" page. To find out what this is, right-click somewhere on the page, and click "properties" or "open frame in new window" (or the equivalent in your favorite browser)
If the site uses URLs of the form http://download.cnet.com/downloads/0-3282264-100-5550892.html?tag=st.dl.10001-103-1.lst-7-1.5550892
then the program should still work - although the URL looks like noise,
it is actually valid.
Scan Site will take ages - it has to download every single HTML document on the web site. For example, blibbleblobble is about 50Mb big, about a third of which is HTML pages. Try dividing that by 5k/sec for a modem connection.
Also, watch out that you've not found a ridiculously big HTML file. I was having problems running ScanGraphics on this site, until I realised that it was parsing my entire 3rd-year project (all 200 pages of it) for links and graphics.
Scan pictures is the same as scan-site, except that it downloads each document twice (once to look for links, once to look for pictures) so it will take twice as long.
I plan to fix this by caching the most-recently-loaded web page, which should double the speed of this function
"Scan Site" and "Scan Pics" also work, but they will take quite a while to run, especially on a large site, or a slow internet connection.
It won't, it hasn't been programmed yet. You're welcome to write the code yourself and mail it to me for inclusion in the next release.
It won't, it hasn't been programmed yet. You're welcome to write the code yourself and mail it to me for inclusion in the next release.
Try using the "save page as" function on internet explorer
That's Visual Basic for you. Try the perl version.
This is just a bug in Internet Explorer (well, I assume it's a bug, it could be a 'feature' )
Given a long list of image files on a page, internet explorer will often give-up half way through, and not display the rest. Try using images.google.com in a low-memory situation to see examples of this.
Try the perl version.