I have seen a lot of sites get washed recently due to Googles dupe content filters, the most common one is the MOD_Rewrite, yer go figure.. the amount of times I have told people to use a mod_rewrite, was it bad advice ?? well it didn’t used to be, So now I have added this statement to my “use a Mod_Rewrite”..

Ok Mr Matt Cutts .. first what you need to do is get rid of those Urls that look like this:
http://www.mattcutts.com/blog/?p=16 and replace them with
http://www.mattcutts.com/blog/up-up-up-up-up/ .. but Matt.. what you must do is have a robots.txt .. in that robots. txt file add this little line in ..

User-agent: *
Disallow: /?p

I think that’s right … i have never really wanted to stop SE’s spiders getting in before.. also watch out for “PRINT Article” links if they go to the same content with a different CSS.. you could get into trouble with that too.. oh hum

DaveN

DaveN

5 Comments

  • 1

    I *want* to see how well engines handle it, and how to fix it when I notice a problem. :)

    Matt Cutts | http://www.mattcutts.com/

    7th October 2005 @ 16:47

  • 2

    IMHO, this is a huge problem right now, there are TONS of scrapper sites out there, targeting most of the times the top SERPs, so it may be that the real solution is to cloack pages and give GBot and yBot and msBot diffrent content then the one we give the user …

    BroadProspect | http://www.searchenginesmarketingblog.com

    8th October 2005 @ 18:47

  • 3

    > so it may be that the real solution is to cloack pages and give GBot and yBot and msBot diffrent content then the one we give the user …

    Thats called “personalization” and is very comon these days :)

    Mikkel deMib Svendsen | http://www.demib.com

    11th October 2005 @ 23:40

  • 4

    Matt! Throw me a bone on fixing OHWY.com which appears to be killed by (unfair!) dupe filtering or we’ll have to hire … DAVE!

    JoeDuck/Joe Hunkins

    Joeduck | http://ohwy.com

    19th October 2005 @ 04:35

  • 5

    Go get ‘em Dave ;)

    This continues to be an issue with blogging software, and needs to be addressed.

    brad | http://www.VisibilityGenie.com

    23rd October 2005 @ 14:38

Write a Comment

*

*

*

SES New YorkA4U Expo Munich
Subscribe
to the David Naylor feed
Follow
David Naylor's Twitter feed

View Dave's Blog