I'm trying to grab a page with a lot of these: <form action="dir_compnay_detail.asp" method="post" > <font color="#FFFFFF" size="4"> <input type="hidden"...
Dima Nemchenko
Dima.Nemchenko@...
Mar 7, 2001 5:54 am
457
Hello, first of all: pavuk is a great tool! Thanks to all contributors! Now let me describe my problem: The best way to understand it is by example, so please...
Karakas@...
Mar 9, 2001 4:09 pm
458
The following patch adds: 1. -egd_socket <path> command-line option. I have *not* added a GUI entry for this. 2. --egd-socket=<path> autoconf option to provide...
pavuk@...
Mar 12, 2001 11:18 pm
459
... Ok, more patches. 5. Use <sys/stat.h> instead of <sys/mode.h>. -- albert chin (china@...) -- snip snip ... +++ src/myssl.c Mon Mar 12...
pavuk@...
Mar 13, 2001 3:38 am
460
... 6. Add --with-zlib-includes=DIR and --with-zlib-libraries=DIR to specify location of zlib library. -- albert chin (china@...)...
pavuk@...
Mar 13, 2001 3:40 am
462
In the pavuk manual, the line: AllowedSufixes: ---> -asfx has a typo, AllowedSufixes should be AllowedSuffixes, double 'f'. Otherwise pavuk won't...
ha shao
hashao@...
Mar 15, 2001 4:07 am
463
... This was done incorrectly. Patch below is correct. -- albert chin (china@...) -- snip snip ... +++ src/myssl.c Mon Mar 12 17:02:23 2001 @@...
pavuk@...
Mar 15, 2001 5:53 am
464
The pavuk segfault randomly when download a large amound of pages. Mostly I cannot get a backtrace of the core because gdb cannot access the address. But I got...
ha shao
hashao@...
Mar 15, 2001 11:27 am
465
Here is another bt. I do use multithread under limited RAM(64MB). Program terminated with signal 11, Segmentation fault. #0 0x4051a9d4 in ?? () (gdb) bt #0...
ha shao
hashao@...
Mar 15, 2001 11:31 am
466
Say, if I want to download recursively all html docs matching tag_rpattern_a and those inline objects with <img src="url_rpattern_b">, can this be done in one...
Huaxin Wang
wanghx@...
Mar 22, 2001 12:27 pm
467
I am having a problem getting pavuk to delete files that are no longer on the server using remove_old/subdir/cdir whilst using base_level to flatten the...
Tony Gale
gale@...
Mar 23, 2001 6:28 pm
468
Hello. I am new to this group and to pavuk as well so this might have been answered a milion times but anyhow.. I am trying to download part of a server. As I...
de-bit-el@...
Mar 30, 2001 9:55 am
469
Hi all! How can I make a mirror from a ftp server preserving symbolic links ? (going through a squid proxy and a packet filter) I tried: pavuk -mode sync \ ...
Richard Ems
r.ems.mtg@...
Apr 4, 2001 12:15 pm
470
I am having a problem using pavuk with -use_http11, When the web server sends Content-Length bytes, pavuk just sits there with 100% completed until it times...
Larry Riedel
Larry@...
Apr 11, 2001 10:13 pm
471
Hi, How to get the directory and files's info only(from a ftp site)? Or this it not support. I know I can use -store_info/-nostore_info , but I don't want to ...
mwen@...
Apr 18, 2001 4:07 pm
472
1. Can choose links to download based on regular expression match on the text content quoted by html tag pairs, for example, the "foo" in <a href=...>foo</a>....
Huaxin Wang
wanghx@...
Apr 26, 2001 4:07 am
473
I would like to save in a file all the external links which doesn't belong to the current host. For example, if I crawl only the site : www.host1.com I would...
orion orion
orion30@...
May 1, 2001 9:25 am
474
Hello, wyh are there always a lot of 'rejected' files ? Is there anything I can do to get these files ? have a nice day Klaus...
Klaus
colonius@...
May 8, 2001 6:15 pm
475
Hi, first of all I'd like to send many greetings to the author of Pavuk (srdecne pozdravy do Slovenska :)) which is probably the most advanced mirroring tool...
Data Tonis
data@...
May 9, 2001 11:43 am
476
Hi, I have another question - how pavuk can synchronize local tree of documents when I use fnrules? IMHO there is no way how pavuk can retrieve an original url...
Data Tonis
data@...
May 9, 2001 1:42 pm
477
Hi, I found a bug - just go to: http://www.werbebanner.de/pavuktest/9/test.html and click on the link - it is a very simple example. And now start pavuk (I...
castor@...
Jun 2, 2001 8:08 pm
478
Hello people! Sorry for so long time of silence from my side. I was quite busy and frustrated from things around me so I stopped some of my freetime activities...
Stefan Ondrejicka
ondrej@...
Jun 22, 2001 11:08 am
479
... Welcome back! -- albert chin (china@...)...
pavuk@...
Jun 22, 2001 8:39 pm
480
hello, I just subscribed, with a question, naturally, that is: how to make the output of mode remind to go appended to a file instead of sending mail ? thank...
Andrea Tasso
atasso@...
Jun 23, 2001 12:27 pm
481
On Thu, 8 Feb 2001 galanga@... wrote: Hi! ... This is hopefully fixed now in the two latest pavuk testing versions. It all was caused by adding the...
Stefan Ondrejicka
ondrej@...
Jun 25, 2001 3:35 pm
482
On Mon, 12 Feb 2001 zajc2@... wrote: Hi! ... The problem is with Microsoft origins of NTLM authorization. When you use it from your browser (only MSIE...
Stefan Ondrejicka
ondrej@...
Jun 25, 2001 3:46 pm
483
On Wed, 14 Feb 2001, Tony Gale wrote: Hi! ... Currently not possible, but I will add to next version new function for -fnrules option which will allow that :-)...
Stefan Ondrejicka
ondrej@...
Jun 26, 2001 11:07 am
484
On Wed, 28 Feb 2001, Raun wrote: Hi Raun! ... This is known issue of MS ftp servers ... ftp command REST is supported by those servers, but the transfer always...
Stefan Ondrejicka
ondrej@...
Jun 26, 2001 11:38 am
485
On Wed, 7 Mar 2001, Dima Nemchenko wrote: Hi Dima! ... There were bugs in 0.9pl27 which caused segfaulting, when there were not specified any fields in the...
Stefan Ondrejicka
ondrej@...
Jun 26, 2001 11:57 am
486
On Fri, 9 Mar 2001 Karakas@... wrote: Hello! ... Hmmm ... for me everything works well with latest testing version... The layout of the tree is now...