Talk:Googlebot

Latest comment: 10 years ago by 71.105.105.145 in topic Comments

Comments

edit

"It visits websites that change frequently, according to how frequently they change."

To me this sentence is very confusing. How about something like:

"It visits websites that change frequently, starting with the most frequently changed pages."

Taladon 18:33, 19 September 2006 (UTC)Reply



why don't add IP range of google-bot?—Preceding unsigned comment added by 19:51, 4 May 2006 (talk) 87.1.53.187

Probably because it varies too much from place to place, perhaps?—Preceding unsigned comment added by 70.135.120.225 (talk) 09:37, 17 June 2006


the german external link is broken


Warning: GoogleBot will try to index FTP sites too - especially if there's a HTML link to an FTP resource. It is said that in FTP mode, it will respect a "/robots.txt" file, but such a resource is only defined for web servers. Server admins be warned.... 71.105.105.145 (talk) 00:43, 28 November 2013 (UTC)Reply

How to use google user agent:

edit

http://diorz.tuxfamily.org/index.php?id=ByPass%20Forum%20Signup —The preceding unsigned comment was added by 217.154.102.195 (talk) 12:39, 30 April 2007 (UTC).Reply

GoogleBot and IFrames

edit

There are discussions inquiring about how GoogleBots handle IFrames.

I've been told that IFrames cause errors in the Bots 'readings', but I've read (from uncertain sources) that the Bots are fine with IFrames. Does anyone have compelling evidence as to one or the other?

CertGuard 05:27, 21 July 2007 (UTC)Reply

Indexing limits

edit

I'm thinking of adding a sub topic Indexing Limits, to googlebot are there any references or research done by any good webmasters and also I guess there should be a relation between the indexing and the Google Pagerank. If there are any, kindly refer the same.Ganesh J. Acharya 06:06, 1 August 2007 (UTC)Reply

How often does Googlebot update?

edit

I've been curious to know how often Googlebot caches a website and once caches does it later on update previous versions. I ask this because I've found a few websites with personal information cached on older versions of the page, yet the most recent is without said information. Some people worry that their info will be made public and curiosity has me wondering how often Googlebot updates a cached page. Thanks 74.195.2.98 17:41, 10 September 2007 (UTC)Reply

Googlebot(s) Discovered

edit

I was browsing the net on the ole' Google.com search engine when I found a peculiar message stating: "Your IP is: 66.249.73.186" Obviously, this was not my IP. Of course, I had to look up the possessor of this IP; thus, I used Network-tools.com. From here, I simply copy and pasted the IP and voila! I had found the IP to a GoogleBot.

The following message was presented as the entire web domain and sub-domain (host name):

  IP address: 66.249.73.186
  Host name: crawl-66-249-73-186.googlebot.com
  66.249.73.186 is from United States(US) in region North America

I performed a WHOIS trace for further investigation:

  Domain Name: GOOGLEBOT.COM
  Registrar: MARKMONITOR INC.
  Whois Server: whois.markmonitor.com
  Referral URL: http://www.markmonitor.com
  Name Server: NS1.GOOGLE.COM
  Name Server: NS2.GOOGLE.COM
  Name Server: NS3.GOOGLE.COM
  Name Server: NS4.GOOGLE.COM
  Status: clientDeleteProhibited
  Status: clientTransferProhibited
  Status: clientUpdateProhibited
  Updated Date: 06-nov-2006
  Creation Date: 21-oct-1998
  Expiration Date: 20-oct-2011

--It was quite strange to find on MarkMonitor.com's "Strategic Alliances" page they had not listed Google.

Anyways, I was only opening this section to reveal an actual GoogleBot--perhaps you can find more? —Preceding unsigned comment added by 68.114.11.211 (talk) 16:07, 30 May 2008 (UTC)Reply

Amazon et al? — Preceding unsigned comment added by 189.10.157.96 (talk) 05:39, 8 October 2012 (UTC)Reply

Is 'upgrade' an order?

edit

Already have a modern browser (ItaliC, not Phoenician or syllabic), it's called NoScript, imHo it might be Dante, Kafka or Joyce (GuimarÃEs Rosa I-n P:-ortuguese-br).