Creating A WordPress Robots.txt To Improve SEO


I tried to enter uncharted waters today as I’ve decided to create a robots.txt file, which I haven’t done before. Daniel explains best why this should be a sensible move:

The robots.txt file is used to instruct search engine robots about what pages on your website should be crawled and consequently indexed. Most websites have files and folders that are not relevant for search engines (like images or admin files) therefore creating a robots.txt file can actually improve your website indexation.

The problem I’m having, is I can’t find a definitive source that gives clear instructions on what should be included in a robots.txt so I thought I’d throw my problem out to the Connected Internet ‘team’ to see what should and shouldn’t be included. Here’s what I have so far, which is based mainly on this guide:


User-agent: Googlebot
Disallow: /*/feed/$
Disallow: /*/feed/rss/$
Disallow: /*/trackback/$
User-agent: *
Disallow: /wp-
Disallow: /feed/
Disallow: /trackback/
Disallow: /rss/
Disallow: /comments/feed/
Disallow: /page/
Disallow: /date/
Disallow: /comments/

What else do you think I should include or exclude?

Bookmark & Share

Related Posts

About the Author

author photo

Everton is based in London and has worked in the internet and mobile space for over ten years now, and before that worked in corporate strategy and consulting. He has a degree in Economics from Cambridge University, and currently runs the Portal and online operations for one of the largest ISPs in the UK. He also writes for Windows 7 News.

See All Posts by This Author

There Are 42 Responses So Far. »

Pages: « 5 4 3 [2] 1 » Show All

  1. #20

    [...] ich bereits vor einem Monat in meinem Blog nachfragte wie ich am Besten die robots.txt Datei gestalten sollte um meinen Traffic [...]

  2. #19

    Another area that I’ve never looked into before, I’ll have to read into this and enlightenment myself, but the idea seems like a sound one to me.

    I’m guessing it’s best to start with the defaults shown in the linked file on Daily Blog Tips and go from there adding misc pages I have such as pages full of bookmarks etc.

    Cheers!

  3. #18

    Good idea to disallow the bots from viewing the RSS feeds. My feeds usually always appear above the actual content in the SERPs which is good, but not so great for humans! Thanks, I’ll give this a go :)

  4. #17

    [...] I asked a month ago for advice on adding a robots.txt file to my WordPress blog to improve my search engine traffic, I didn’t actually get around to it until my blogging [...]

  5. #16

    if you want your site indexed, why should you exclude something?

    Because you don’t want the same posts appearing twice and Google thinking it’s duplicate content, or junk appearing in results as Google will then downgrade your real pages

  6. #15

    if you want your site indexed, why should you exclude something?

  7. #14

    There is a better article about this on the askapache blog:
    WordPress robots.txt optimized for SEO

  8. #13

    Why Disallow all extension? such as .PHP, .JS? can I get explain more about it?

    Thanks

  9. #12

    [...] the subject has come up again a couple of times over the last few days at both Wolf-Howl and Connected Internet and pricked my conscience so I thought it was time to revisit [...]

  10. #11

    One of the reasons why you’d want to use a robots.txt file is to prevent Google from indexing content twice and potentially marking it as duplicate content. If it’s listed in monthly archives, category folders and on your front page, for example, you might end up with pages in the supplemental index.

Pages: « 5 4 3 [2] 1 » Show All

Subscribe without commenting

Post a Response


Comment Policy: Any comments are permitted only because the site owner is letting you post, and any comments could be removed for any reason at the absolute discretion of the site owner.