Search the web
Sign In
New User? Sign Up
webanalytics · The Web Analytics Forum
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Want your group to be featured on the Yahoo! Groups website? Add a group photo to Flickr.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
AVG Linkscanner   Message List  
Reply | Forward Message #17858 of 24282 |
I'm a bit surprised that the firestorm over AVG Linkscanner hasn't come up
here yet (or did I miss it?)... is anybody else feeling the pain from it?
Or are there so few people here using log file analysis that it has gone
unnoticed by many?
Linkscanner is part of AVG's security package. It pre-fetches links on every
page the user views (or just on SERPs, if the user chooses) and scans them
for malware... and tracking cookies. It masquerades as a normal browser. A
user who normally would have a handful of page views suddenly has hundreds
or thousands (and yes, many users have complainted about what it does to
their browser performance).

On some sites, such as Google, it triggers rate limiters because it is
requesting pages so rapidly.

It is invisible to hosted analytics because (I can tell from my analytics)
99 percent of people using it block the cookies (or the script itself, I
suppose). Log file analytics see a huge increase in page views with no
corresponding increase in visitors. This means it also looks just like a
spambot address harvester.

I was able to identify some of its probable activity because an earlier
version omits a space that's normally in the user agent string (look for
;1813 with no space after the semicolon). That apparently is "fixed" now.

On the good side of things, an AVG executive posted the following on the
Register's site (see
http://www.theregister.co.uk/2008/06/13/avg_scanner_skews_web_traffic_numbers/co\
mments/
)
on
Saturday:

"Hi, folks. Pat Bitton from AVG here. This issue has clearly raised some
concerns that we had not anticipated, and we acknowledge that we need to do
something. Our primary purpose with LinkScanner, as Roger Thompson has
pointed out, is to protect users against web-based threats that they cannot
see. These threats are also usually invisible to web site operators, who
presumably also don't wish to be unwittingly passing infections on to their
visitors. This kind of problem can and does affect all types of web sites,
big or small, and is extremely transient - which is why we don't use the
static database approach cited by some as a viable alternative. Over the
next few days, we will be exploring ways in which we can continue to deliver
informed protection as unobtrusively as possible without adversely impacting
site analytics. Any webmaster reading this post who is interested in working
with us constructively to reach this goal is welcome to contact me at
pat.bitton(at)avg.com."

I'd welcome any thoughts on this specific issue or the more general one it
raises.

Has the WAA tried to articulate a position on the tradeoffs between security
and analytics? I don't recall ever seeing anything like that.

Nick


[Non-text portions of this message have been removed]




Mon Jun 16, 2008 4:07 pm

narnett
Offline Offline
Send Email Send Email

Forward
Message #17858 of 24282 |
Expand Messages Author Sort by Date

I'm a bit surprised that the firestorm over AVG Linkscanner hasn't come up here yet (or did I miss it?)... is anybody else feeling the pain from it? Or are...
Nick Arnett
narnett
Offline Send Email
Jun 17, 2008
6:59 am

Just came across the same article earlier today. Interesting. Our traffic increases are matched within all the other cross checks, so I'd suspect we haven't...
Steve McInerney
nuilvows
Offline Send Email
Jun 17, 2008
9:26 am

So if I'm to understand this all correctly-- this solution looks at X amount of search results and crawls them similarly to how Google or another search engine...
ALEX BRASIL
alexbrasil489
Offline Send Email
Jun 17, 2008
11:52 am

All of these issues are inherent in any kind of pre-fetch "protection". The best solution would be to "crawl" the page only after the user clicks the link....
nethab1
Offline Send Email
Jun 17, 2008
9:19 pm

'Morning Alex, ... That is my understanding from the reading/research I've done. Which is not quite the same thing as saying that it's correct. :-) I don't...
Steve McInerney
nuilvows
Offline Send Email
Jun 18, 2008
6:24 pm

On Tue, Jun 17, 2008 at 4:02 PM, Steve McInerney <steve.mcinerney@...> ... Small, but significant correction -- it can do only SERPs, but if it is ...
Nick Arnett
narnett
Offline Send Email
Jun 19, 2008
12:22 am

(Full disclosure: I work for Blue Coat Systems, which provides a family of intelligent appliances that give IT organizations the ability to effectively secure...
jenniferwhite
Offline Send Email
Jun 19, 2008
7:50 pm

I see your point, though from my experience the "caching" of pages / objects is (at least partially) what is driving the adoption of JavaScript "tag" based...
nethab1
Offline Send Email
Jun 20, 2008
8:54 am
Advanced

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help