Search Engine Optimization > Webmaster World > do unwelcome search-bots come back?
do unwelcome search-bots come back?
Posted by hug on January 29th, 2006

If you specify in your robots.txt file that such-and-so robot should
not look at anything on your site, will it note that and never return?

I'm assuming they'd rather not burn the disk space, and will come back
as often as before.

IOW if you want to temporarily make your site off-limits to all robots
because of testing/whatever, then later you open it up again, have you
forever disappeared yo'sef?

--
http://www.ren-prod-inc.com/hug_soft...action=contact

Posted by William Tasso on January 29th, 2006

Fleeing from the madness of the . jungle
hug <contact_info@sig_line.clickit> stumbled into news:alt.www.webmaster
and said:

The algo behind every bot is different, some won't even ack that you have
a robots.txt to start with.

--
William Tasso - I was looking back to see, if she was looking back to see,
if I was looking back at her.

How To Usenet: http://williamtasso.com/usenet/ - prove I'm wrong.

Posted by Bill on January 29th, 2006


The best way is to block the ip range serverwide

--
Bil

*Kind Regards
*Bill

'www.ukwebmasterforums.com
(http://www.ukwebmasterforums.com

Supported By 'Google Adsense
(http://adsense.ukwebmasterforums.com/), Drive Ads To Your Sit
-----------------------------------------------------------------------
Bill's Profile: http://www.ukwebmasterforums.com/member.php?userid=
View this thread: http://www.ukwebmasterforums.com/showthread.php?t=737

Posted by hug on January 29th, 2006

"William Tasso" <SpamBlocked@tbdata.com> wrote:

Obvious.

Assumed.

I'm concerned about the major search-bots; googlebot, msnbot, slurp,
etc. Do they continue checking robots.txt to see if they're allowed
in? I don't want to "temporarily" shut them out then find it's
permanent, that's my real concern.

Thanks. btw, good morning William.

--
http://www.ren-prod-inc.com/hug_soft...action=contact

Posted by Mark Goodge on January 29th, 2006

On Sun, 29 Jan 2006 05:27:06 -0700, hug put finger to keyboard and
typed:

Google will recheck every now and then, so removing robots.txt (or
removing an exclusion from it) will result in Google indexing the
areas that were previously forbidden to it.

I don't know about the other majors, but I'd guess that they're the
same as Google in this respect.

Mark
--
http://www.MotorwayServices.info - read and share comments and opinons
"Look at the stars; look how they shine for you"

Posted by John Bokma on January 29th, 2006

hug <contact_info@sig_line.clickit> wrote:

A lot of unwelcome bots don't fetch that file, or just fetch it and ignore
it.

No. Google for one follows links. It seems to be very hard to get pages
out of Google :-D

--
John Experienced (web) developer: http://castleamber.com/
Perl SEO tools: http://johnbokma.com/perl/
NEW ----> Textpad reference card (pdf): http://johnbokma.com/textpad/


Posted by John Bokma on January 29th, 2006

hug <contact_info@sig_line.clickit> wrote:

Yes, but they cache the file, so a change to this file doesn't show up
immediatly. Also, removing pages from Google seems to take ages.

Doubt if that's an easy thing if you have incoming links.

--
John Experienced (web) developer: http://castleamber.com/
Perl SEO tools: http://johnbokma.com/perl/
NEW ----> Textpad reference card (pdf): http://johnbokma.com/textpad/


Posted by John Bokma on January 29th, 2006

Bill <Bill.22ebu2@noreply.ukwebmasterforums.com> wrote:

Preferable on the firewall. But yup, that works for bots using a range and
sticking to it.

--
John Experienced (web) developer: http://castleamber.com/
Perl SEO tools: http://johnbokma.com/perl/
NEW ----> Textpad reference card (pdf): http://johnbokma.com/textpad/


Posted by Duende on January 30th, 2006

On 29 Jan 2006 John Bokma wrote in alt.www.webmaster

Google still lists stuff from when Charles was my host.

--
D?
http://yorkshirepete.com/

Posted by hug on January 30th, 2006

Thanks all who replied.

--
http://www.ren-prod-inc.com/hug_soft...action=contact

Funbolt.com - Entertainment portal, wallpapers, sexy celebs