xenu-usergroup · Xenu Linkchecker Usergroup

Group Information

  • Members: 493
  • Category: Internet
  • Founded: Jul 20, 2004
  • Language: English
Messages

Messages 1249 - 1279 of 1639
#1249 From: "Mark Findlay" <mfindlay@...>
Date: Sun Feb 7, 2010 12:07 am
Subject: Xenu keeps deleting itself?
 

Hi all,

I am running Xenu 1.3.4 and about every 7 days or so, the executable disappears on me. The icon remains on my desktop, and all the other files that make up the installation remain, but the actual xenu.exe file is gone.

Is the 1.3.4 version of Xenu a trial version? Does anyone else have any similar experiences with Xenu auto-deleting itself?

Thanks!
Mark

#1250 From: Tilman Hausherr <tilman@...>
Date: Sun Feb 7, 2010 6:56 am
Subject: Re: Xenu keeps deleting itself?
 
No, Xenu does not delete itself. Maybe you're on a corporate system with
some weird policies enabled.

Tilman


#1251 From: "Mark Findlay" <mfindlay@...>
Date: Sun Feb 7, 2010 7:33 am
Subject: RE: Xenu keeps deleting itself?
 

No, I’m just on my home computer, and all other programs install and run just fine. Oddest thing I ever saw.

 

Mark

 



#1252 From: Tilman Hausherr <tilman@...>
Date: Sun Feb 7, 2010 7:44 am
Subject: Re: Xenu keeps deleting itself?
 
Another thing to try: after it happens, look into the Windows event logs
(Event Viewer), and into the logs of any security software
(antivirus, antispy) you have installed to see if it mentions anything.

Btw, the current version is 1.3.5,
http://home.snafu.de/tilman/XENU.ZIP
and there's also a nice beta here.
http://home.snafu.de/tilman/tmp/xenubeta.zip

Tilman


#1254 From: Tilman Hausherr <tilman@...>
Date: Thu Feb 25, 2010 9:47 pm
Subject: mail address check
 
I've started a partial check of mail addresses. The current beta
http://home.snafu.de/tilman/tmp/xenubeta.zip
will check the domains of mail addresses (the part after the "@"). Xenu
makes a DNS request for MX records.

If Xenu can't find the file DNSAPI.DLL, it will check nothing. If Xenu
cannot find a host or an MX record for it, it will report the error "no
such host". If Xenu does find an MX record, it will output "skip type"
as usual.

Thus, the new version will only find broken domains of mail addresses.
It doesn't check whether the user exists. The reason is that I doubt
that there is an exact check (for an obvious reason: it would help
spammers).

Tilman
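
The decision logic described in this message can be sketched roughly as follows. This is only an illustration in Python, not Xenu's actual code: the names `mail_domain`, `check_mail_domain`, and `lookup_mx` are hypothetical, and the DNS lookup itself is passed in as a function so that the sketch does not assume any particular resolver library. Passing `None` as the lookup stands in for the case where DNSAPI.DLL cannot be found.

```python
from typing import Callable, Optional, Sequence

# Hypothetical type: a function that returns the MX hosts for a domain
# (an empty sequence means no host or no MX record was found).
MxLookup = Callable[[str], Sequence[str]]


def mail_domain(address: str) -> str:
    """Return the part after the "@" of a mail address or mailto: link."""
    if address.lower().startswith("mailto:"):
        address = address[len("mailto:"):]
    # Drop anything after "?" (e.g. ?subject=...), which is not part of the address.
    address = address.split("?", 1)[0]
    _local, separator, domain = address.rpartition("@")
    if not separator or not domain:
        raise ValueError(f"not a mail address: {address!r}")
    return domain.strip().lower()


def check_mail_domain(address: str, lookup_mx: Optional[MxLookup]) -> str:
    """Classify a mail address the way the message describes.

    - no resolver available      -> nothing is checked
    - no host / no MX record     -> "no such host"
    - MX record found            -> "skip type" (reported as usual)
    """
    if lookup_mx is None:
        return "not checked"
    hosts = lookup_mx(mail_domain(address))
    return "skip type" if hosts else "no such host"
```

Note that, as in the message, nothing here checks whether the user part of the address exists.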

#1255 From: "andrewcswift" <ac@...>
Date: Fri Apr 9, 2010 1:23 pm
Subject: Xenu on XP crashes immediately when generating XML site map
 
Hello,

In the past, I have used Xenu to index a site with about 800,000 pages. It took
a while to generate the XML site map, but worked fine (this was a year ago).

Recently I went to update the site map, and Xenu crashes immediately when I try
to generate the XML file. I get the cute volcano icon with barely any delay.

I've tried both the latest stable release (1.3.5) and the beta (1.3.6), and I've
tried reducing the number of pages indexed. The program still crashes
immediately even if I'm only indexing 200,000 pages or less.

Is there any way I can find out what is causing Xenu to crash? I've looked in
the XP event viewer and there's nothing.

As far as I know, except for a year's worth of XP updates, both my computer and
Xenu are configured exactly the way they were last year when the program worked.

Does anyone have any ideas? Is it possible that there is a malformed URL that's
being indexed, and if so, is there an easy way to isolate it in the list of all
the pages?

#1256 From: Tilman Hausherr <tilman@...>
Date: Fri Apr 9, 2010 1:27 pm
Subject: Re: Xenu on XP crashes immediately when generating XML site map
 
Send a ZIPped .XEN file to my mail address (not to the list): tilman at
snafu dot de.  If the file is too large, upload it to rapidshare dot com
or a similar service.

Tilman


#1257 From: Tilman Hausherr <tilman@...>
Date: Mon Apr 12, 2010 2:58 pm
Subject: Re: Xenu on XP crashes immediately when generating XML site map
 
On Fri, 09 Apr 2010 13:23:04 -0000, andrewcswift wrote:

>Recently I went to update the site map, and Xenu crashes immediately when I try
>to generate the XML file. I get the cute volcano icon with barely any delay.

This has now been solved; it was a misunderstanding. The volcano means
that Xenu is working, not that it is crashing :-)

Tilman

#1258 From: "Lars" <baloo5419@...>
Date: Mon Apr 12, 2010 3:04 pm
Subject: link to wikipedia
 
I have quite a few links to Wikipedia.

They give this message, which is a little annoying:
  error code: 403 (forbidden request)

#1259 From: "daniel norton, teeny tiny websites" <daniel@...>
Date: Mon Apr 12, 2010 3:12 pm
Subject: Re: link to wikipedia
 
On Mon, Apr 12, 2010 at 10:04 AM, Lars <baloo5419@...> wrote: 

> I have quite some links to Wikipedia.
>
> They give this message. A little annoying
> error code: 403 (forbidden request)

...
User-agent: Xenu
Disallow: /
...

--
Daniel
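
To see what the two robots.txt lines quoted above mean for a crawler, here is a small sketch using Python's standard `urllib.robotparser`. The helper name `allowed` and the agent name "SomeOtherBot" are made up for illustration, and only the two-line excerpt from this message is parsed, not the full file. Also keep in mind that robots.txt is advisory; the 403 reported above comes from the server itself, which this sketch does not model.

```python
from urllib.robotparser import RobotFileParser

# Only the excerpt quoted in this thread, not the complete robots.txt.
ROBOTS_EXCERPT = """\
User-agent: Xenu
Disallow: /
"""


def allowed(agent: str, url: str, robots_text: str = ROBOTS_EXCERPT) -> bool:
    """Return True if robots_text permits `agent` to fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    page = "http://en.wikipedia.org/wiki/Main_Page"
    print("Xenu:", allowed("Xenu", page))
    print("SomeOtherBot:", allowed("SomeOtherBot", page))
```

With this excerpt alone, every path is disallowed for an agent named Xenu, while agents that are not listed fall through to "allowed".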



#1260 From: "Vince Thacker" <vince@...>
Date: Mon Apr 12, 2010 4:47 pm
Subject: Re: link to wikipedia
 
I've raised a similar query before about many Google links that return a
403, especially their RSS feeds. It doesn't seem you can do much about it,
as these sites have a policy of forbidding software such as Xenu from getting
access.

Presumably if a page is generating a 403, you know at least that you have a
valid link.

Vince.


#1261 From: "Fischer, Thomas" <fischer@...>
Date: Tue Apr 13, 2010 6:53 am
Subject: AW: link to wikipedia
 
Hello,
 
since checking links to Wikipedia seems to be a legitimate task for Xenu, shouldn't someone contact them and ask for the removal of the robots.txt exclusion? Or is there a reason that Xenu and Wikipedia don't work together smoothly, e.g. because of the internal redirects in Wikipedia?
 
By the way,

User-agent: Xenu
Disallow: /

is also contained in http://de.wikipedia.org/robots.txt.

All the best
Thomas
 



#1262 From: "Wolfgang" <w.peters@...>
Date: Tue Apr 13, 2010 8:45 am
Subject: missusing canonical tag
 
Hi,
I started using the canonical tag to convince Google to take the primary domain
into its index.
<link rel="canonical" href="..." />


Unfortunately the domain doesn't exist yet, but it will exist in a few days.
Unfortunately, XENU checks the canonical tag and lists an error for each page of my
website.
Is there some setting to tell XENU to ignore the canonical?
My XENU-Version: 1.3.5

best regards
Wolfgang

#1263 From: Tilman Hausherr <tilman@...>
Date: Tue Apr 13, 2010 4:40 pm
Subject: Re: missusing canonical tag
 
On Tue, 13 Apr 2010 08:45:45 -0000, Wolfgang wrote:

>Hi,
>I started using the canonical tag to convince Google to take the primary domain
>into its index.
><link rel="canonical" href="..." />
>
>
>Unfortunately the domain doesn't exist yet, but it will exist in a few days.
>Unfortunately, XENU checks the canonical tag and lists an error for each page of my
>website.
>Is there some setting to tell XENU to ignore the canonical?

Use the exclusion feature in the initial dialog box. Xenu will then
ignore links to that "tomorrow" domain.

Tilman
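
For readers who want to see the idea outside of Xenu, here is a rough Python sketch of "find the canonical link, then skip it if it points to an excluded domain". It is an illustration only: the names `CanonicalFinder`, `find_canonical`, and `is_excluded` are hypothetical, the example URLs are placeholders, and matching exclusions by URL prefix is an assumption about how such a filter could work, not a description of Xenu's internals.

```python
from html.parser import HTMLParser
from typing import Iterable, Optional


class CanonicalFinder(HTMLParser):
    """Remember the href of the first <link rel="canonical"> in a page."""

    def __init__(self) -> None:
        super().__init__()
        self.canonical: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        if tag != "link" or self.canonical is not None:
            return
        attributes = dict(attrs)
        # rel may hold several space-separated values.
        rel_values = (attributes.get("rel") or "").lower().split()
        if "canonical" in rel_values:
            self.canonical = attributes.get("href")


def find_canonical(html: str) -> Optional[str]:
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


def is_excluded(url: str, excluded_prefixes: Iterable[str]) -> bool:
    """Assumed rule: a URL is skipped if it starts with any excluded prefix."""
    return any(url.startswith(prefix) for prefix in excluded_prefixes)
```

A checker built this way would call `is_excluded(find_canonical(page), ["http://www.example.org/"])` and simply not request the canonical URL while the new domain is not yet live.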


>My XENU-Version: 1.3.5
>
>best regards
>Wolfgang

#1264 From: "Ron Jones" <ron@...>
Date: Tue Apr 13, 2010 6:59 pm
Subject: Re: link to wikipedia
 
Fischer, Thomas wrote:
> Hello,
>
> since checking links to Wikipedia seems to be a legitimate task for
> Xenu, shouldn't someone contact them and as for the removal of the
> robots.txt exclusion?. Or is there a reason that Xenu and Wikipedia
> don't work together smoothly, e.g because of the internal redirects
> in Wikipedia?
>
> By the way,
>
> User-agent: Xenu
> Disallow: /
>
> is also contained in http://de.wikipedia.org/robots.txt.
>
> All the best
> Thomas

I would suggest it's all about load on their servers - at the end of the day
they are still a charity, so they won't have the finest and fastest servers
in the world.  Also remember that pages are stored in Wiki markup - each
page you ask for has to be converted to html for your browser to display.
One wikipedia page is likely to lead to a huge tree of pages being
requested - editors are always asked to make sure that a page links to
plenty of other pages.
They do allow Google to search, not sure if anyone else can do so.

Ron Jones
Process Safety & Development Specialist
Don't repeat history, unreported chemical lab/plant near misses at
http://www.crhf.org.uk Only two things are certain: The universe and
human stupidity; and I'm not certain about the universe. ~ Albert
Einstein

#1265 From: "Ron Jones" <ron@...>
Date: Tue Apr 13, 2010 7:58 pm
Subject: Re: link to wikipedia
 

Just remembered - all the pages are all set for "no follow" - this is to
stop unscrupulous companies adding spam links to build up their Google
ranking.  Any links added have no effect on the Google ranking.

Ron Jones
Process Safety & Development Specialist
Don't repeat history, unreported chemical lab/plant near misses at
http://www.crhf.org.uk Only two things are certain: The universe and
human stupidity; and I'm not certain about the universe. ~ Albert
Einstein

#1266 From: Tilman Hausherr <tilman@...>
Date: Sun Apr 18, 2010 11:03 am
Subject: new beta with milliseconds
 
Although Xenu isn't an SEO tool, it is being "misused" as such. A guy
asked to get the duration in milliseconds, and Google has recently
announced that the loading time of websites would be taken into
consideration.

A new beta version is here:
http://home.snafu.de/tilman/tmp/xenubeta.zip

This is just a test so you can see how it looks and give feedback. The
milliseconds value isn't saved in the .XEN file, nor in the export file.
(This will be done at a later time.) If you need the milliseconds
feature, please test it and give feedback about whether this is usable
or annoying.

Below are all the changes since the last regular version. If you'd like to
support me, please test it and give feedback.

Tilman
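
As a rough idea of what "duration in milliseconds" means for a single request, here is a small Python sketch. It is not how Xenu measures time; the helper name `timed_ms` is made up, and the clock is injectable only so the sketch can be tried without a network.

```python
import time
from typing import Callable, Tuple, TypeVar

T = TypeVar("T")


def timed_ms(action: Callable[[], T],
             clock: Callable[[], float] = time.perf_counter) -> Tuple[T, int]:
    """Run `action` and return (its result, elapsed time in whole milliseconds)."""
    start = clock()
    result = action()
    elapsed_ms = round((clock() - start) * 1000)
    return result, elapsed_ms


if __name__ == "__main__":
    # Stand-in for a page request; replace with a real fetch to time a URL.
    _result, duration = timed_ms(lambda: time.sleep(0.05))
    print(f"duration: {duration} ms")
```

The measured value includes everything the action does (connection setup, waiting for the server, transfer), so it is only a coarse indicator of page loading time.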

=====================

Major improvements:
24.2.2010: Check the domains of mail addresses (DNS lookup for MX record)

Minor improvements:
7.12.2009: Include PARSETEST4 section in general release (convert characters >80H to %XX, for "international" URLs)
19.12.2009: For "international" characters in local files: Use Unicode for local directory search, URL launch in browser, read/check local files
20.12.2009: But not for Windows 95/98/ME
22.12.2009: add ".class" for applets if needed, replace "." with "/".
            example: http://www.colorado.edu/physics/2000/applets/bec.html
27.12.2009: updated to NSIS 2.46
10.1.2010: use version 6 list column sort arrows on XP and higher
14.1.2010: added Description column
15.1.2010: added warning when settings overwritten by profile
16.1.2010: attempt at decoding .jar files for APPLET ARCHIVE thanks to
            http://www.codeguru.com/cpp/cpp/cpp_mfc/article.php/c4049/
            However:
            - only one .jar archive per applet
            - no unicode in file names
            - name of archive must end with .jar
            - .jar file must be internal, or the class link will remain broken
            - .class "in Jar" property isn't saved in .XEN file (which prevents standard access in favor of waiting for .jar lookup)
24.1.2010: added <video src=
27.1.2010: improved list control divider double click (title is the minimum)
26.2.2010: improved extra text in domain mail check
13.3.2010: Get page body only if not redirection or redirection but no "Location:" in header
            (should make PARSETEST3 fix superfluous)
16.3.2010: ...
30.3.2010: Abort box for ftp orphan search
2.4.2010: [Options] Accept="*/*"  (default value)
14.4.2010: milliseconds in duration
            (in progress; missing: export, save/load)

Bug fixes:
15.12.2009: PARSETEST4 section: replaced "> 80X" with ">= 80X"
20.12.2009: added version check for Unicode Clipboard and Sitemap for Windows 95/98/ME (like 27.1.2009)
21.12.2009: corrected broken banner links
22.12.2009: tell "anchor occurs multiple times" only once per URL
4.1.2010: remove stuff after "?" in mailto: due to Microsoft error in AfxParseURLEx()
10.1.2010: fixed list column sort arrows wrongly displayed in unsorted columns (on 7, but not on XP)
12.1.2010: fixed "//" bug in applet codebase in local url
15.1.2010: disabled and unchecked "Inactive" checkbox after loading new profile
18.1.2010: fixed title line of tab export
20.1.2010: Don't assume URLs to be UTF-8, use current charset instead
            However: this solution isn't perfect, because the correct charset of an URL would be the referring URL
            But in most cases it will work, because URLs usually have the same charset
            Known bug: Root URL with exotic characters
20.1.2010: Corrected exotic URLs in sitemap
26.1.2010: Fixed % in file: URLs, only convert %XX
27.1.2010: "Conversion to lowercase" option uses codepage for conversion
31.1.2010: Fixed bug in report (max size + max size url), probably introduced on 15.1.2010
15.3.2010: vNormalizeURL() with conversion to UTF8 prior to AfxMyParseURL()
            store URLs in UTF8, unless already ANSI or ISO-8859-1 (1252)
            vRemovePercents for display only
3.4.2010:  prevent reentrant calls to vDoIdle();
            set fileNotFound status if tmp URL content file deleted by antivirus software
10.4.2010: replaced "> 80X" with ">= 80X" in vAnsi2EntityEscaped()
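
Several entries above concern converting characters from 80H upward to %XX, including the "> 80X" versus ">= 80X" fixes. The following Python sketch shows the general idea and why the boundary matters. It is an illustration only: `escape_high_bytes` is a hypothetical name, and the `charset` parameter mirrors the changelog's point that URLs should not be assumed to be UTF-8.

```python
def escape_high_bytes(url: str, charset: str = "utf-8") -> str:
    """Percent-encode every byte >= 0x80, leaving plain ASCII untouched."""
    pieces = []
    for byte in url.encode(charset):
        # ">=" rather than ">": the byte 0x80 itself must be escaped too.
        if byte >= 0x80:
            pieces.append(f"%{byte:02X}")
        else:
            pieces.append(chr(byte))
    return "".join(pieces)


if __name__ == "__main__":
    # The same character yields different escapes depending on the charset.
    print(escape_high_bytes("http://example.com/caf\u00e9"))
    print(escape_high_bytes("http://example.com/caf\u00e9", charset="latin-1"))
```

Because the result depends on the charset, a checker has to pick the encoding the target server expects, which is the difficulty described in the 20.1.2010 entry.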

#1267 From: "Andy Mabbett" <andy@...>
Date: Tue Apr 13, 2010 10:11 am
Subject: Re: AW: link to wikipedia
 
There would be no point - Wikipedia always returns "200 OK" never "404
page not found".


--
Andy Mabbett
@pigsonthewing
http://pigsonthewing.org.uk
** via webmail **

#1268 From: "Fischer, Thomas" <fischer@...>
Date: Mon Apr 26, 2010 10:23 am
Subject: AW: AW: link to wikipedia
 
Hi Andy,

> There would be no point - Wikipedia always returns "200 OK"
> never "404 page not found".

That is not true.
While this may hold for searches, trying to access specific pages will give
error messages.
E.g. http://en.wikipedia.org/wiki/QWER
gives "HTTP/1.0 404 Not Found"
before starting a redirect.
I assume that this would be the kind of link somebody might want to check using
Xenu.

Thomas
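
Anyone who wants to check the status line for themselves can do so with a HEAD request, which is also the request type Tilman says Xenu uses (see his message further down). The sketch below uses Python's standard `http.client`; the function name `head_status` and the User-Agent string are made up for illustration. It deliberately does not follow redirects, so it reports the first status the server sends.

```python
import http.client
from urllib.parse import urlsplit


def head_status(url: str,
                user_agent: str = "example-link-checker/0.1",
                timeout: float = 10.0) -> int:
    """Send one HEAD request and return the HTTP status code (no redirects followed)."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        connection = http.client.HTTPSConnection(parts.hostname, parts.port, timeout=timeout)
    else:
        connection = http.client.HTTPConnection(parts.hostname, parts.port, timeout=timeout)
    try:
        path = parts.path or "/"
        if parts.query:
            path += "?" + parts.query
        connection.request("HEAD", path, headers={"User-Agent": user_agent})
        return connection.getresponse().status
    finally:
        connection.close()
```

For example, `head_status("http://en.wikipedia.org/wiki/QWER")` would return whatever status the server currently sends for that page; the result may differ from what is reported in this thread.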


#1269 From: Jack Stringer <jack@...>
Date: Mon Apr 26, 2010 10:40 am
Subject: Re: link to wikipedia
 
>>> since checking links to Wikipedia seems to be a legitimate task for
>>> Xenu, shouldn't someone contact them and as for the removal of the
>>> robots.txt exclusion?. Or is there a reason that Xenu and Wikipedia
>>> don't work together smoothly, e.g because of the internal
>> redirects in Wikipedia?
>>>
>>> By the way,
>>>
>>> User-agent: Xenu
>>> Disallow: /
>>>
>>> is also contained in http://de.wikipedia.org/robots.txt.
>>> <http://de.wikipedia.org/robots.txt.>


There are a couple of thousand users using Xenu. If they all started
sending requests to the Wikipedia site, the server would soon get bogged down
trying to deliver the pages. It's the same as with people using website
copying software. I have had my photography gallery go very, very slow at
times just because someone was trying to hoover up the pictures.

What would be nice is to find out from Wikipedia what changes need to be
made to Xenu to make it nicer to their systems, e.g. some sort of delay
when getting pages from Wikipedia servers.


Jack Stringer
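
The kind of per-host delay suggested above could look roughly like this in Python. This is a sketch of the general technique, not a Xenu feature: the class name `HostThrottle` and the two-second delay in the usage note are hypothetical, and the clock and sleep functions are injectable only to make the behaviour easy to try out.

```python
import time
from typing import Callable, Dict
from urllib.parse import urlsplit


class HostThrottle:
    """Enforce a minimum pause between two requests to the same host."""

    def __init__(self, min_delay_seconds: float,
                 clock: Callable[[], float] = time.monotonic,
                 sleep: Callable[[float], None] = time.sleep) -> None:
        self.min_delay = min_delay_seconds
        self._clock = clock
        self._sleep = sleep
        self._last_request: Dict[str, float] = {}

    def wait(self, url: str) -> float:
        """Sleep if the host of `url` was contacted too recently; return the pause used."""
        host = (urlsplit(url).hostname or "").lower()
        now = self._clock()
        last = self._last_request.get(host)
        pause = 0.0
        if last is not None:
            pause = max(0.0, self.min_delay - (now - last))
        if pause > 0:
            self._sleep(pause)
        # Record when this request is actually sent (after any pause).
        self._last_request[host] = now + pause
        return pause
```

A checker would call `throttle.wait(url)` right before each request, e.g. with `throttle = HostThrottle(2.0)`; requests to different hosts are not delayed by each other.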

#1270 From: Tilman Hausherr <tilman@...>
Date: Mon Apr 26, 2010 4:17 pm
Subject: Re: link to wikipedia
 
On Mon, 26 Apr 2010 11:40:55 +0100, Jack Stringer wrote:

>>>> since checking links to Wikipedia seems to be a legitimate task for
>>>> Xenu, shouldn't someone contact them and as for the removal of the
>>>> robots.txt exclusion?. Or is there a reason that Xenu and Wikipedia
>>>> don't work together smoothly, e.g because of the internal
>>> redirects in Wikipedia?
>>>>
>>>> By the way,
>>>>
>>>> User-agent: Xenu
>>>> Disallow: /
>>>>
>>>> is also contained in http://de.wikipedia.org/robots.txt.
>>>> <http://de.wikipedia.org/robots.txt.>
>
>
>There are a couple of thousand users using Xenu if they all started
>sending requests to wikipedia site then the server soon gets bogged down
>trying to deliver the pages. Its the same as those people using website
>copying software. I have had my photography gallery go very very slow at
>times just because someone is trying to hoover up the pictures.
>
>What would be nice is to find out from wikipedia what changes need to be
>made to Xenu so make it nicer to their systems. E.g some sort of delay
>when getting pages from wikipedia servers.

Xenu is already "nice", i.e. it makes a HEAD request, not a GET request.
My opinion is that the wikipedia software is crappy. The organisation is
mostly concentrated on collecting money, enforcing censorship, altering
history, and being busy with itself (many of the admins are just very
intelligent kids with too much time), instead of delivering a high
quality product by running a Continuous Improvement Process.

Tilman (holder of a scarlet letter from the wikipedia arb board :-))
http://en.wikipedia.org/wiki/User:Tilman



#1271 From: "Ron Jones" <ron@...>
Date: Mon Apr 26, 2010 7:52 pm
Subject: Re: link to wikipedia
 
Tilman Hausherr wrote:
> On Mon, 26 Apr 2010 11:40:55 +0100, Jack Stringer wrote:
>
>>>>> since checking links to Wikipedia seems to be a legitimate task
>>>>> for Xenu, shouldn't someone contact them and as for the removal
>>>>> of the robots.txt exclusion?. Or is there a reason that Xenu and
>>>>> Wikipedia don't work together smoothly, e.g because of the
>>>>> internal
>>>> redirects in Wikipedia?
>>>>>
>>>>> By the way,
>>>>>
>>>>> User-agent: Xenu
>>>>> Disallow: /
>>>>>
>>>>> is also contained in http://de.wikipedia.org/robots.txt.
>>>>> <http://de.wikipedia.org/robots.txt.>
>>
>>
>> There are a couple of thousand users using Xenu if they all started
>> sending requests to wikipedia site then the server soon gets bogged
>> down trying to deliver the pages. Its the same as those people using
>> website copying software. I have had my photography gallery go very
>> very slow at times just because someone is trying to hoover up the
>> pictures.
>>
>> What would be nice is to find out from wikipedia what changes need
>> to be made to Xenu so make it nicer to their systems. E.g some sort
>> of delay when getting pages from wikipedia servers.
>
> Xenu is already "nice", i.e. it makes a HEAD request, not a GET
> request. My opinion is that the wikipedia software is crappy. The
> organisation is mostly concentrated on collecting money, enforcing
> censorship, altering history, and being busy with itself (many of the
> admins are just very intelligent kids with too much time), instead of
> delivering a high quality product by running a Continuous Improvement
> Process.
>
> Tilman (holder of a scarlet letter from the wikipedia arb board :-))
> http://en.wikipedia.org/wiki/User:Tilman
>

There are plenty of old admins, I can assure you :-)
The software is probably rough. It *is* still a charity, and due to the
mindless antics of loads of juvenile vandals, it needs a large team of
vandal fighters (not just admins; there are only 1000 regular ones) to keep
the pages more or less intact. English Wikipedia has around 150-200 page
changes per minute, and around 10% of those have to be reverted, so the
servers are already very busy, and I think allowing Xenu in would grind them to
a halt. If the Dutch mirrors go down and I have to connect directly (from
the UK) to the USA servers, then it can take 30 seconds plus for a medium page
to load.

Ron Jones
Process Safety & Development Specialist
Don't repeat history, unreported chemical lab/plant near misses at
http://www.crhf.org.uk Only two things are certain: The universe and
human stupidity; and I'm not certain about the universe. ~ Albert
Einstein

#1272 From: Tilman Hausherr <tilman@...>
Date: Thu May 6, 2010 4:25 pm
Subject: Re: new beta with milliseconds
 
Now it does save/export and restore the milliseconds value.
http://home.snafu.de/tilman/tmp/xenubeta.zip

Tilman


On Sun, 18 Apr 2010 13:03:46 +0200, Tilman Hausherr wrote:

>Although Xenu isn't a SEO tool, it is being "misused" as such. A guy
>asked to get the duration in milliseconds, and google has recently
>announced that loading time of websites would be taken into
>consideration.
>
>A new beta version is here:
>http://home.snafu.de/tilman/tmp/xenubeta.zip
>
>This is just a test so you see how it looks and give feedback. The
>milliseconds value isn't saved in the .XEN file, nor in the export file.
>(This will be done at a later time). If you need the milliseconds
>feature, please test it and give feedback about wether this is usable,
>or annoying.
>
>Below are all the changes since the last regular version. If you like to
>support me, please test it and give feedback.
>
>Tilman
>
>=====================
>
>Major improvements:
>24.2.2010: Check the domains of mail addresses (DNS lookup for MX
>record)
>
>Minor improvements:
>7.12.2009: Include PARSETEST4 section in general release (convert
>characters >80H to %XX, for "international" URLs)
>19.12.2009: For "international" characters in local files: Use Unicode
>for local directory search, URL launch in browser, read/check local
>files
>20.12.2009: But not for Windows 95/98/ME
>22.12.2009: add ".class" for applets if needed, replace "." with "/".
>            example:
>http://www.colorado.edu/physics/2000/applets/bec.html
>27.12.2009: updated to NSIS 2.46
>10.1.2010: use version 6 list column sort arrows on XP and higher
>14.1.2010: added Description column
>15.1.2010: added warning when settings overwritten by profile
>16.1.2010: attempt at decoding .jar files for APPLET ARCHIVE thanks to
>           http://www.codeguru.com/cpp/cpp/cpp_mfc/article.php/c4049/
>           However:
>           - only one .jar archive per applet
>           - no unicode in file names
>           - name of archive must end with .jar
>           - .jar file must be internal, or the class link will
>             remain broken
>           - .class "in Jar" property isn't saved in .XEN file
>             (which prevents standard access in favor of waiting for
>             .jar lookup)
>24.1.2010: added <video src=
>27.1.2010: improved list control divider double click (title is the
>minimum)
>26.2.2010: improved extra text in domain mail check
>13.3.2010: Get page body only if not redirection or redirection but no
>"Location:" in header
>           (should make PARSETEST3 fix superfluous)
>16.3.2010: ...
>30.3.2010: Abort box for ftp orphan search
>2.4.2010: [Options] Accept="*/*"  (default value)
>14.4.2010: milliseconds in duration
>           (in progress; missing: export, save/load)
>
>Bug fixes:
>15.12.2009: PARSETEST4 section: replaced "> 80X" with ">= 80X"
>20.12.2009: added version check for Unicode Clipboard and Sitemap for
>Windows 95/98/ME (like 27.1.2009)
>21.12.2009: corrected broken banner links
>22.12.2009: tell "anchor occurs multiple times" only once per URL
>4.1.2010: remove stuff after "?" in mailto: due to Microsoft error in
>AfxParseURLEx()
>10.1.2010: fixed list column sort arrows wrongly displayed in unsorted
>columns (on 7, but not on XP)
>12.1.2010: fixed "//" bug in applet codebase in local url
>15.1.2010: disabled and unchecked "Inactive" checkbox after loading new
>profile
>18.1.2010: fixed title line of tab export
>20.1.2010: Don't assume URLs to be UTF-8, use current charset instead
>           However: this solution isn't perfect, because the correct
>           charset of a URL would be that of the referring URL
>           But in most cases it will work, because URLs usually
>           have the same charset
>           Known bug: Root URL with exotic characters
>20.1.2010: Corrected exotic URLs in sitemap
>26.1.2010: Fixed % in file: URLs, only convert %XX
>27.1.2010: "Conversion to lowercase" option uses codepage for conversion
>31.1.2010: Fixed bug in report (max size + max size url), probably
>introduced on 15.1.2010
>15.3.2010: vNormalizeURL() with conversion to UTF8 prior to
>           AfxMyParseURL()
>           store URLs in UTF8, unless already ANSI or ISO-8859-1 (1252)
>           vRemovePercents for display only
>3.4.2010:  prevent reentrant calls to vDoIdle();
>           set fileNotFound status if tmp URL content file deleted by
>antivirus software
>10.4.2010: replaced "> 80X" with ">= 80X" in vAnsi2EntityEscaped()
>

#1273 From: "Ven. S. Upatissa (g)" <sadhu44@...>
Date: Fri May 7, 2010 2:29 am
Subject: Problem with orphan files report
sadhu44
 
On my local hard disk I have a folder containing hundreds of html files,
and an index.html file that contains links to all of them.

When I run xenu on the index file, it correctly reports no broken links,
but it also reports that all of the other files are orphans.

Why is this?  What am I doing wrong?

-Thanks

#1274 From: Tilman Hausherr <tilman@...>
Date: Fri May 7, 2010 5:26 am
Subject: Re: Problem with orphan files report
geo4497
 
Don't know... Send it to me in a ZIP, and send me a .XEN file in a ZIP
too, at tilman at snafu dot de.


Tilman
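
For readers unfamiliar with the term: an orphan report compares the files that exist in a folder with the files that the crawl actually reached through links. The sketch below shows that comparison as a plain set difference. It is only an illustration of the concept, not how Xenu implements it, and the function name find_orphans and the file names are made up. It does not explain the problem reported in this thread.

```python
def find_orphans(files_on_disk, linked_files):
    """Return files that exist on disk but were never reached via a link."""
    return sorted(set(files_on_disk) - set(linked_files))


# Hypothetical folder: index.html links to a.html only.
on_disk = ["index.html", "a.html", "b.html"]
reached = ["index.html", "a.html"]

orphans = find_orphans(on_disk, reached)
```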


#1275 From: "Fischer, Thomas" <fischer@...>
Date: Fri May 7, 2010 8:53 am
Subject: Very External Links Checked
thgb.fischer
 
Hello Tilman,

I checked links on our site
Root URL: http://www.mathguide.de/
with
Consider URLs beginning with this as 'internal':
http://www.sub.uni-goettingen.de/ssgfi/

(Don't try this: without the proper restrictions it will run into almost
infinite loops.)

But among the error results I received, I also have

http://purl.org/dstc/dc4
         http://www.dstc.edu.au/DC4
           \_____ error code: 12007 (no such host)

http://scout.cs.wisc.edu/research/index.html
         http://scout.wisc.edu/Projects/index.html
           \_____ error code: 404 (not found)

Can you explain why these links were checked, and is there a way to prevent
this?

All the best
Thomas

--
Dr. Thomas Fischer
Research and Development Department (RDD)
Georg-August-Universität Göttingen
Göttingen State and University Library
37073 Goettingen
Germany

Tel.: +49 551 393883
and   +43 662 621498
fischer@...
http://www.sub.uni-goettingen.de/

#1276 From: "Fischer, Thomas" <fischer@...>
Date: Fri May 7, 2010 9:10 am
Subject: Expressing Dublin Core metadata using HTML
thgb.fischer
 
Hi Tilman,

We are using Dublin Core metadata to present some of our data in a
machine-readable format, following the standard from
http://dublincore.org/documents/dc-html/.
This profile carries a definition
         <head profile="http://dublincore.org/documents/2008/08/04/dc-html/">
         <link rel="schema.DC" href="http://purl.org/dc/elements/1.1/">
where these URLs are meant less as links than as identifiers of the standard
used.
But in my link check I receive

http://purl.org/dc/elements/1.1/
redirected to: http://dublincore.org/2008/01/14/dcelements.rdf
status code: 302 (object temporarily moved)

with about 4500 links. This is not really helpful. Is there a way around this?

All the best
Thomas

#1277 From: Tilman Hausherr <tilman@...>
Date: Fri May 7, 2010 10:32 am
Subject: Re: Very External Links Checked
geo4497
 
Probably an external link that was redirected. One of "your" URLs was
linking to
http://purl.org/dstc/dc4
which redirects to
http://www.dstc.edu.au/DC4
but that host doesn't exist.

Tilman
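
The behaviour described above (an external link is checked, it redirects, and the error is reported for the redirect target) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not Xenu's implementation: check_link, fake_fetch and FAKE_WEB are hypothetical names, no network access is made, and the 302 status codes in the table are assumed, since the thread only gives the final errors.

```python
REDIRECT_CODES = (301, 302, 303, 307, 308)


def check_link(url, fetch, max_hops=5):
    """Follow redirects; return (chain of URLs visited, final result)."""
    chain = [url]
    for _ in range(max_hops):
        status, location = fetch(chain[-1])
        if status is None:  # the fetcher could not resolve the host
            return chain, "no such host"
        if status in REDIRECT_CODES and location:
            chain.append(location)  # keep checking at the redirect target
            continue
        return chain, str(status)
    return chain, "too many redirects"


# Stand-in for the network: URL -> (status, Location header).
# The redirect status codes are assumptions for this sketch.
FAKE_WEB = {
    "http://purl.org/dstc/dc4": (302, "http://www.dstc.edu.au/DC4"),
    "http://scout.cs.wisc.edu/research/index.html":
        (302, "http://scout.wisc.edu/Projects/index.html"),
    "http://scout.wisc.edu/Projects/index.html": (404, None),
}


def fake_fetch(url):
    """Unknown URLs behave like a host that does not exist."""
    return FAKE_WEB.get(url, (None, None))


chain, result = check_link("http://purl.org/dstc/dc4", fake_fetch)
chain2, result2 = check_link(
    "http://scout.cs.wisc.edu/research/index.html", fake_fetch)
```

In both cases the second URL in the chain was never linked from the site directly. It appears in the report only because the first URL redirected to it.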


#1278 From: Tilman Hausherr <tilman@...>
Date: Fri May 7, 2010 10:34 am
Subject: Re: Expressing Dublin Core metadata using HTML
geo4497
 
The 302 is probably reported because you have checked "treat redirections
as errors".

Tilman


#1279 From: "Fischer, Thomas" <fischer@...>
Date: Fri May 7, 2010 10:59 am
Subject: Re: Expressing Dublin Core metadata using HTML
thgb.fischer
 
Hello,

> The 302 probably comes because you have checked "treat
> redirections as errors".
>
> Tilman

The point is that URIs in the head section of the document are treated as
links here, although they are not links (and needn't be) but identifiers, as in
<link rel="schema.DC" href="http://purl.org/dc/elements/1.1/">.
For an example, take
http://www.MathGuide.de/cgi-bin/ssgfi/anzeige.pl?db=math&nr=005032&ew=SSGFI.

Since I don't recall these appearing earlier, I assume something has
changed inside Xenu so that these URIs are now checked and reported as
redirections. And since I want redirections to be reported, I can't switch
that off. Every single web page that uses
http://dublincore.org/documents/dc-html/ will use this or a similar
construct, so this should be handled differently. A long list of every
single page containing a redirection will help nobody.

Cheers
Thomas
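
To make the request concrete, here is a minimal Python sketch of a link collector that sets <link rel="schema.*"> hrefs aside as identifiers instead of queueing them for checking. It uses the standard html.parser module. The class name LinkCollector and the rule "rel starts with schema." are assumptions for illustration only. This is not how Xenu works, and it is not a feature Xenu is known to have.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect hrefs, keeping Dublin Core schema declarations separate."""

    def __init__(self):
        super().__init__()
        self.links = []        # URLs a link checker would fetch
        self.identifiers = []  # schema.* URIs, recorded but not fetched

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        href = attr.get("href")
        if not href:
            return  # e.g. <head profile="..."> has no href at all
        rel = (attr.get("rel") or "").lower()
        if tag == "link" and rel.startswith("schema."):
            self.identifiers.append(href)
        elif tag in ("a", "link"):
            self.links.append(href)


page = (
    '<head profile="http://dublincore.org/documents/2008/08/04/dc-html/">'
    '<link rel="schema.DC" href="http://purl.org/dc/elements/1.1/">'
    '<link rel="stylesheet" href="style.css">'
    '</head><body><a href="page.html">next</a></body>'
)

collector = LinkCollector()
collector.feed(page)
```

After feeding the page, collector.identifiers holds the purl.org URI and collector.links holds only style.css and page.html, so the schema declaration would not show up once per page in a redirection report.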

