Greets, I wanted to build an indexing system to tie together the pavuk web crawler info with a keyword search system. I was looking at SourceForge, and it...
Hello group! I have run into strange behaviour in pavuk and I think that it is a bug. I use pavuk 0.9.31 lomov@theor:~$ pavuk --version pavuk 0.9.31...
Vladimir Lomov
lomov@...
Sep 30, 2005 7:05 am
795
Hello, ... [...] ... This bug is fixed in 0.9.33. Ciao --...
Hi, I'm a new pavuk user, and as usual am up against the learning curve. I want to sync an ftp site (basically going from my file server to a backup server),...
I answered my question by changing strategies. Using a shell script, I'm using mount_smbfs to just mount the file server shares on my BSD server, then copying...
Hi, Is it possible to use fnrules so that downloaded files are renamed using the text within the link rather than the link itself? For example the file...
Hi, I read through the pavuk man page but was not able to figure out how I may rename a downloaded file (using fnrules) so that the name is taken from the text...
Hello all, the current CVS version learnt to deal with 64bit systems, but has some 64bit related GTK problems. Also pavuk has problems with GTK2. There are...
I plan to use it to replace wget but need to know the following: 1. I need wget-like URL translation (which is no translation?). How do I make pavuk behave this way...
Hello, ... Please describe what you want. pavuk has a lot of options for translating and also not translating. ... Well, I used pavuk with much larger sites and...
Thanks for your reply! Wget stores the URL http://sites/dir1/dir2/file.htm as follows: sites/ dir1/ dir2/ file.htm but pavuk will save the same file(s) as ...
Dear Ciao, Many thanks for your reply. I use -noRelocate and -fnrules F '*' '/%h/%d/%n?%s' to solve my problem (I need the search string.). One caveat is...
Hello everyone! I need to extract pages from a complex shopping site. I need only its - product detail page (w/ product name, price, etc.) - product...
Hello, I am trying to filter by file type. For example: save only *.rar, *.zip and *.gz files. I tried setting some options in Limits Config / Documents / Suffix but...
Hi. I'm using pavuk 0.9.32 2006-01-07T21:26 i686-pc-linux-gnu on a Gentoo linux system. I created a scenario by executing the following command: pavuk...
Hello, Some URLs contain the HTTP session id: http://myhost/mypage?cat=12&sessionId=jkkjnfjnfjfiNJJHJNHdnjddjhdnj&doc=12 I would like to keep for local...
... Well, I would think this is a bug. Could you modify your report so that I can reproduce it? E.g. find an internet page that I can also access and adapt the...
Hello, pavuk 0.9.34 has been released. The main news is much-improved 64-bit support (it now works :-). Still missing is updated GTK support. Any help...
... so, that I ... and adapt ... Please try the following command line : pavuk -fnrules R '(.*)session=.*prod=([0-9]*).*' '$1/$2' -url_rpattern 'call_cat=3'...
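To see what the regular expression in that -fnrules R rule captures, here is a minimal sketch that applies the same pattern and replacement with Python's re module. It only mimics the substitution and is not pavuk itself: pavuk's regex dialect may differ in details, the function name rewrite_url is made up for illustration, and the sample URL is hypothetical rather than taken from the thread.

```python
import re

# Pattern and replacement copied from the command line quoted above.
# pavuk writes back-references as $1, $2; Python's re module uses \1, \2.
PATTERN = re.compile(r"(.*)session=.*prod=([0-9]*).*")
REPLACEMENT = r"\1/\2"


def rewrite_url(url: str) -> str:
    """Return the rewritten name if the rule matches, else the URL unchanged."""
    match = PATTERN.fullmatch(url)
    if match is None:
        return url
    return match.expand(REPLACEMENT)


if __name__ == "__main__":
    # Hypothetical URL, not taken from the thread.
    print(rewrite_url("http://myhost/shop?session=abc123&prod=42&x=1"))
```

Under these assumptions the session value is dropped and only the part before session= plus the numeric prod value survive, so two visits with different session ids map to the same local name. Note that the pattern looks for session= literally, so a URL that uses sessionId= would not match it.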
Hi, I recently switched from Wget to Pavuk in the hope of fixing some problems, but I am still having trouble mirroring a website. Files that no longer exist...
I've installed Pavuk and got it to grab some content from a site, but can't persuade it to do a login. I'm very new to Pavuk and would like to know if it can...
Luc - Personal
luc@...
Jan 21, 2006 5:44 pm
820
Hello, ... You can do the login using another browser and use the cookie file with pavuk. For normal (non-cookie-based) systems, you can give the username and ...
... but can't persuade it to do a login. I'm very new to Pavuk and would like to know if it can allow me to grab a site given a user name and password. You can...