Google using Search Logs for Quailty

28.06.08

I’M trying to workout why Google have come forward with this,

quote from Google :

Data from search logs is one tool we use to fight webspam and return cleaner and more relevant results. Logs data such as IP address and cookie information make it possible to create and use metrics that measure the different aspects of our search quality (such as index size and coverage, results “freshness,” and spam).

Whenever we create a new metric, it’s essential to be able to go over our logs data and compute new spam metrics using previous queries or results. We use our search logs to go “back in time” and see how well Google did on queries from months before. When we create a metric that measures a new type of spam more accurately, we not only start tracking our spam success going forward, but we also use logs data to see how we were doing on that type of spam in previous months and years.

The IP and cookie information is important for helping us apply this method only to searches that are from legitimate users as opposed to those that were generated by bots and other false searches. For example, if a bot sends the same queries to Google over and over again, those queries should really be discarded before we measure how much spam our users see. All of this–log data, IP addresses, and cookie information–makes your search results cleaner and more relevant.

So where are the pitfalls here, first off the big one that jumps out here is the amazing amounts of botnets and malware that’s out there in the wild that could be set to create fake traffic..

example: botnet users

search google for : keyword phrase you want to rank for,automate page click through keep on google don’t pass you brand, if brand found fake a click. If brand not found in first 2 -5 pages, random per user, search brand + keyword, brand.com + keyword

also

log spamming has been around for years, also if you thought you need protection before from Google, set up a caching proxy, take all your own data and look at you search patterns what sites you visit, a there are couple of things you will notice,

so you might want to do or at least think about.

a) Think about work search V personal searches
b) Install, foxy proxy or another firefox proxy plugin
c) Make sure you purge your cookie on closing your browse
d) Run a dictionary on google when your not searching
e) Pray that Sergy never wants to be the president of America

DaveN

12 Comments

  • 1

    I was wondering that as well. Seemed like a really odd thing to just suddenly share.

    DazzlinDonna
    http://www.dazzlindonna.com/blog/

    28th June 2008 @ 15:03

  • 2

    Thought provoking stuff Dave! I installed Foxy Proxy as per your suggestion.

    Could you explain what you mean by this in much more detail?

    ” Run a dictionary on google when your not searching

    Thanks!

    Dalka
    http://www.daviddalka.com/createvalue/

    29th June 2008 @ 02:26

  • 3

    This seems to be a hand Google did not want fully aired… the privacy issues when cookies are mentioned with deciding what is presented seems to have hit a nerve and could be getting some serious government and privacy advocate attention soon.

    Aussiewebmaster
    http://www.kangamurramedia.com

    29th June 2008 @ 06:49

  • 4

    ” Run a dictionary on google when your not searching” .. thats maybe a little harsh by me, but what I ment was by using a dictionary to could automate a screen saver or a firefox plugin to randomly get a keyword and search it, i originally thought about this when I got the AOL data .

    Dave

    DaveN

    29th June 2008 @ 09:43

  • 5

    Thanks for the infor Dave….I’m off to install Foxy Proxy.
    I’m a “newbie” and I think this information will be quite helpful. Looking to learn a lot more!

    Kathy

    Kathy
    http://shurlady.ws/blog

    29th June 2008 @ 17:50

  • 6

    Thanks Dave, as usual you pick up on the wtf stuff. Don’t worry Sergey doesn’t want to be president, he wants to rule the world!

    David Temple
    http://www.semscholar.com

    30th June 2008 @ 01:36

  • 7

    isnt this them regression testing there algo changes using past data which i hope to frack they do though sometimes i wonder

    Though i take the point about faking click throughs

    Sergey wasnt born in the USA so he cant stand unless they change the constitution.

    Maurice
    http://www.thuk.co.uk

    30th June 2008 @ 11:40

  • 8

    [...] using some of their products. Any of their products will place cookies on your computer and they log everything you’re doing. Since they came out and said they track our IP and cookie search logs, if you do anything remotely [...]

    Market For Google - Search Somewhere Else — LinkWorth - Be Found Online

    30th June 2008 @ 14:00

  • 9

    Foxy Proxy or Tor and Vidalia are a staple fo rme. Being in the UK and having international clients not just in the US and UK but Europe and Asia as well tend to require it. If I had a nickle for every time I got a skewed query because I was logged into to a Google account I’d have a few quid.

    GaryTheScubaGuy

    Gary Beal
    http://www.broadband.co.uk/provider.jsp?i=63&d=190

    30th June 2008 @ 22:05

  • 10

    The problem I have with tor and proxies is they tend to be shit. I have tried out foxyproxy and I like the idea but does anyone know of any decent proxies that perform well? I’d happily pay a small fee to use them.

    I use Xerobank quite a bit at the moment when I am blocked by Google or feel the need for a little anonymity though I could do with a decent UK based proxy.

    Yossarian
    http://yossarian.co.uk

    1st July 2008 @ 14:45

  • 11

    my problem with using proxies is the slow down, although I supposedly have a T1 connection i really have to wonder sometimes another thing some of these proxies just arent safe.

    $30,000 Cash in 30 Days
    http://www.30kcashin30days.com/

    4th July 2008 @ 17:13

  • 12

    @Yossarian - I too have used foxyproxy and it is okay but I am loking for a decent one as well…

    Nick Stamoulis
    http://www.searchengineoptimizationjournal.com

    28th July 2008 @ 16:48

Add a Comment

*

*

*

Come and work with David Naylor and the team Subscribe
to the David Naylor feed
Follow
David Naylor's Twitter feed
View Dave's Blog