Search the web
Sign In
New User? Sign Up
pavuk · Pavuk Webgrabber Mailing List
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Show off your group to the world. Share a photo of your group with us.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
download trouble   Message List  
Reply | Forward Message #861 of 988 |
Re: [pavuk] download trouble

Thanks for reporting. I found a wicked bug in my new pavuk/chunky code
thanks to this URL.

Anyway, back to your problem.

Currently pavuk does not support RFC2616 (HTTP/1.1) 'chunked' content
download, so you'll have to trick the server into feeding you the data
as one block instead of multiple 'chunks'.

You can do this by forcing pavuk to act like a HTTP/1.0 client with the
commandline argument '-nouse_http11' .
I checked and that works in this case.

BTW: which pavuk version are you using?

Best regards,

Ger


(For completeness sake: when won't this commandline argument work as
expected? When you're accessing virtual hosted web servers which are
sharing a single IP number. Those will not like this, but that does
apply here.)



user nx wrote:
> I have problem with some URLs.
>
> For example:
>
>
http://www.fanlib.ru/GetBook.ashx?type=html&Id=a40e7088-40a5-4c03-8b97-2bde224b7\
b72&UserId=00000000-0000-0000-0000-000000000000

>
> When I use it with Mozilla I receive file Paradise_Lost.zip
>
> But when I use Pavuk it say me this:
>
> URL: 1(0) of 1
>
http://www.fanlib.ru/GetBook.ashx?type=html&Id=a40e7088-40a5-4c03-8b97-2bde224b7\
b72&UserId=00000000-0000-0000-0000-000000000000

> Error reading document with "chunked" transfer encoding!
> Document transfer data: Success
> download: ERROR: HTTP document is truncated
> URL: 1(0) of 1
>
http://www.fanlib.ru/GetBook.ashx?type=html&Id=a40e7088-40a5-4c03-8b97-2bde224b7\
b72&UserId=00000000-0000-0000-0000-000000000000

> Trying to resume from position 184948
> download: ERROR: HTTP server doesn't support partial content retrieving
>
>
>
>
________________________________________________________________________________\
____
> Bored stiff? Loosen up...
> Download and play hundreds of games for free on Yahoo! Games.
> http://games.yahoo.com/games/front
>
>
>
> Yahoo! Groups Links
>
>
>
>
>
>
>



Fri May 11, 2007 1:27 am

i_a42
Offline Offline
Send Email Send Email

Forward
Message #861 of 988 |
Expand Messages Author Sort by Date

I have problem with some URLs. For example: ...
user nx
nx.user
Offline Send Email
May 10, 2007
5:52 am

Thanks for reporting. I found a wicked bug in my new pavuk/chunky code thanks to this URL. Anyway, back to your problem. Currently pavuk does not support...
Gerrit E.G. Hobbelt
i_a42
Offline Send Email
May 11, 2007
1:27 am

Correction to what I said just now: pavuk _does_ support 'chunked' content, but it looks like both pavuk and IIS 6.0 (the webserver version we're taking about...
Gerrit E.G. Hobbelt
i_a42
Offline Send Email
May 11, 2007
2:03 am

... commandline argument '-nouse_http11' . Yes it work, thanks. ... Sorry, I forgot to specify. Version is 0.9.35. Compile from source on Debian 3.1. ...
user nx
nx.user
Offline Send Email
May 12, 2007
11:20 am
Advanced

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help