Science Fair Project Encyclopedia
User:Alterego/Number of pages in Google
- See the ten million annnouncement
I'm keeping track of the number of Wikipedia pages in Google's index across all datacenters. The numbers vary widely across the datacenters at any given time so I record the highest instance. From March 21-23 I wasn't recording the datacenter.
3/21: 2710000 3/21: 4170000 3/22: 4550000 3/22: 5170000 3/22: 2740000 3/23: 5440000 3/23: 6490000 3/23: 7340000 3/24: 7350000 3/26: 7470000 (64.233.183.99) 3/26: 7530000 (64.233.183.99) 4/08: 9030000 (66.102.7.105) 4/09: 9350000 (64.233.189.104) 4/10: 9730000 (66.102.7.104) 4/10: 9850000 4/11: 10100000 (64.233.183.99) 4/15: 11300000 most datacenters 4/17: 11800000 half of all datacenters 4/17: 11900000 4/18: 12100000 216.239.63.104
I use AutoIt 3, and the following very basic code (it pastes the results to your clipboard):
#include <Misc.au3>
#include <Date.au3>
#include <file.au3>
#Include <string.au3>
#Include <Array.au3>
Global $results
$DCs = StringSplit('64.233.161.99,64.233.161.104,64.233.161.105,64.233.161.147,64.233.167.99,64.233.167.104,64.233.167.147,64.233.171.99,64.233.171.104,64.233.171.105,64.233.171.147,64.233.179.99,64.233.179.99,64.233.183.99,64.233.183.104,64.233.185.99,64.233.185.104,64.233.187.99,64.233.187.104,64.233.189.104,66.102.7.104,66.102.7.105,66.102.7.147,66.102.9.104,66.102.11.104,216.239.37.104,216.239.37.105,216.239.37.147,216.239.39.104,216.239.53.104,216.239.57.98,216.239.57.104,216.239.57.105,216.239.57.147,216.239.59.104,216.239.59.105,216.239.63.104',',')
For $loop = 1 to $DCs[0]
Global $pagesingoogle = _ScreenScrape ('http://' & $DCs[$loop] & '/search?hl=en&q=site%3Awikipedia.org&btnG=Google+Search', 't <b>', '</b> f')
$results = $results & @CRLF & $DCs[$loop] & ' : ' & $pagesingoogle
Next
ClipPut($results)
Incidentally, you will also need my custom _ScreenScrape function for this to work properly.
Last updated: 06-08-2005 14:54:40
10-26-2009 08:16:03
The contents of this article is licensed from www.wikipedia.org under the GNU Free Documentation License. Click here to see the transparent copy and copyright details
The contents of this article is licensed from www.wikipedia.org under the GNU Free Documentation License. Click here to see the transparent copy and copyright details


