SEO | Link Popularity | Search Engine Consulting | SEO Tutorial | SEO Tools | SEO Forum
  #1  
10-05-2006, 12:14 AM
Kate
SEO
Join Date: Sep 2006
Location: Bangkok, Thailand
Posts: 848
Does msnbot follow robots.txt?

Hi all,

I want to stop all search engine bots from crawling and indexing certain directories on one of my client's sites, so I have placed these rules in my robots.txt (to disallow all bots):
User-Agent: *
Disallow: /folder1/
Disallow: /folder2/
Disallow: /folder3/
Disallow: /folder5/

However, I still see fresh pages from these folders being cached on MSN. Those pages are not related to the site I am promoting, and that's why I don't want them to appear in any search engine index.

Do I need to add a group specifically for msnbot, like this?
User-agent: msnbot
Disallow: /folder1/
Disallow: /folder2/
Disallow: /folder3/
Disallow: /folder5/
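
A quick way to sanity-check rules like these offline is Python's standard `urllib.robotparser`. The sketch below is not from the thread: the helper name `allowed`, the shortened rule sets, and the sample URLs are made up for illustration. It shows how a standards-following parser reads the file, not what msnbot actually does in practice.

```python
from urllib.robotparser import RobotFileParser

# Rules modelled on the post above; folder names are placeholders.
STAR_ONLY = """\
User-agent: *
Disallow: /folder1/
Disallow: /folder2/
"""

# Same file plus an msnbot group that deliberately lists only one folder.
WITH_MSNBOT_GROUP = """\
User-agent: msnbot
Disallow: /folder1/

User-agent: *
Disallow: /folder1/
Disallow: /folder2/
"""


def allowed(robots_txt: str, agent: str, url: str) -> bool:
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    page1 = "http://www.example.com/folder1/index.htm"
    page2 = "http://www.example.com/folder2/index.htm"

    # With only the "*" group, msnbot is covered like every other bot.
    print(allowed(STAR_ONLY, "msnbot", page1))            # False

    # Once an msnbot group exists, the parser applies that group
    # instead of "*", so /folder2/ is no longer blocked for msnbot.
    print(allowed(WITH_MSNBOT_GROUP, "msnbot", page2))    # True
    print(allowed(WITH_MSNBOT_GROUP, "Googlebot", page2)) # False
```

Under this reading, the `*` group already applies to msnbot, so a separate msnbot group should not be required. If one is added anyway, every Disallow line has to be repeated in it, because a bot that finds a group naming it uses that group in place of `*`.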

All these folders reside in the site root, and when they are crawled the path looks like www.example.com/foldername/filename, which is not related to the example site.

I would like to know whether there is any other way to prevent those bots from crawling. If not, is there any solution or workaround to get rid of these pages?

Kindly advise me.

Thank you
  #2  
10-05-2006, 01:59 AM
Paz
SEO GUY Moderator
Join Date: Sep 2004
Location: Antalya, Turkey
Posts: 4,238
Hi Kate,

Yes, I've noticed that if you disallow pages with robots.txt, MSN indexes them as "link only". In other words, instead of showing the normal:

Page Title:
Here is a snippet usually taken from your meta description tag

they show just the URL, with no title or description:

http://www.example.com/folder1/index.htm
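
A URL-only listing like this fits the fact that a robots.txt Disallow stops a page from being fetched, but does not by itself stop its address from being listed. The usual companion for keeping a page out of an index is a robots meta tag with `noindex`. That is an addition to what this thread states, and whether a given engine honors it should be checked against that engine's own documentation. A crawler also has to be allowed to fetch the page in order to see the tag. A minimal sketch for checking whether a page carries the tag, with the class and function names being hypothetical:

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects the directives found in <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None for bare attributes; treat as empty.
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").strip().lower() != "robots":
            return
        for directive in attr.get("content", "").split(","):
            directive = directive.strip().lower()
            if directive:
                self.directives.add(directive)


def has_noindex(html: str) -> bool:
    """Return True if the HTML contains a robots meta tag with 'noindex'."""
    parser = RobotsMetaParser()
    parser.feed(html)
    parser.close()
    return "noindex" in parser.directives


if __name__ == "__main__":
    sample = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
    print(has_noindex(sample))
```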

Cheers,
Paz.
__________________
10.3 million entries for Hotels in Turkey but I'm still chipping away.
  #3  
10-05-2006, 03:14 AM
Kate
SEO
Join Date: Sep 2006
Location: Bangkok, Thailand
Posts: 848
Hi Paz,

Thank you for your kind reply. I still don't get this. If MSN indexes the pages as link only, won't search engine spiders get confused by these links? Will they crawl them even though there is no title or description?
And what about the other important pages that should show up first? Why are they buried so deep? The pages that are listed are mostly such links from the disallowed folders, and because of this the other pages have been pushed down in the index.
How can I get those pages to show up first? Can you advise, Paz?

Thank you once again
  #4  
10-05-2006, 09:31 AM
Paz
SEO GUY Moderator
Join Date: Sep 2004
Location: Antalya, Turkey
Posts: 4,238
Hi,

Other search engines don't normally index each other's indexes, so the URL-only links won't matter. Anyway, don't confuse the site: command results with rankings. I know a site that ranks very well in MSN despite the fact that the site: command returns a few URL-only links:
http://search.msn.com/results.aspx?...hitecompany.com

I think it's just a little warning from MSN.

Cheers,
Paz.
  #5  
12-06-2006, 05:56 AM
la cala
Banned
Join Date: Nov 2006
Posts: 30
I have been getting the same thing. I thought I was going mad; I have changed my tags 3 times to try to overcome this problem.
All times are GMT -8.

