Search the web
Sign In
New User? Sign Up
pavuk · Pavuk Webgrabber Mailing List
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Message search is now enhanced, find messages faster. Take it for a spin.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Modifying Pavuk   Message List  
Reply | Forward Message #757 of 988 |
Re: Modifying Pavuk

----- Original Message -----
From: "Noah" <noah@...>

: 1) I would like to fetch the list of sites to spider and the
various parameters (domains to
: skip, extentions, depth, etc.) from a databse instead of a
text file. Where in the code
: would it be best to set this up so that the apporpriate
internal variables are set for Pavuk?
:
: 2) I want to post process the html that I get. (No need to
save to disk), I imagine that at
: some point all the HTML is in a varialbe that I can easily
access and then pass to a few
: functions of my own.

Perhaps there's no need to modify the program at all, but
instead focus on using the existing features to their fullest
potential. You can dump the SQL query results to a file. Pavuk
will also save file descriptors for you, just for the purpose
of further processing with another tool.

: Anybody feel like helping out a newbie still learning his way
around C??

Nobody wants to help newbies, other than to encourage them to
keep trying. Get the latest distro (or better, CVS snapshot),
compile it and start tinkering with it. Use C reference manual,
grep, gdb, some syntax highlighting editor, and you can't go
wrong, except to run out of motivation.

Paul





Wed Sep 15, 2004 5:36 pm

wiedzmin
Offline Offline
Send Email Send Email

Forward
Message #757 of 988 |
Expand Messages Author Sort by Date

I have a question for the group (or the Pavuk authors.) I am considering using Pavuk for a project that would require a bit of modification on how it runs....
Noah
noah977
Offline Send Email
Sep 14, 2004
2:16 pm

... From: "Noah" <noah@...> ... various parameters (domains to ... text file. Where in the code ... internal variables are set for Pavuk? ... save...
Paul Slusarz
wiedzmin
Offline Send Email
Sep 16, 2004
3:08 pm
Advanced

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help