t   a
s

 

Robot Logs

I record every host that requests my robots.txt file and assume that that host is a robot/spider (that is, a computer program which runs through a website, following all links), since most normal users do not request that file. This is a listing of those hosts.

I use this list to determine whether a revisiting host is a robot, so that i may alter my data to suit real people.

  1. localhost
  2. pd952f314.dip.t-dialin.net
  3. x1crawler3-1-0.x-echo.com
  4. spider007.net
  5. pd951dd17.dip.t-dialin.net
  6. cdm-68-15-247-92.amro.cox-internet.com
  7. x1crawler2-1-0.x-echo.com
  8. pd9e534fa.dip.t-dialin.net
  9. pd9e7d5d4.dip.t-dialin.net
  10. dsl-082-082-165-051.arcor-ip.net
  11. cblmdm63-127-62-16.buckeye-express.com
  12. x1crawler1-1-0.x-echo.com
  13. roadrunner.inf.hs-anhalt.de
  14. dsl-082-082-167-182.arcor-ip.net
  15. serveur.com
  16. vagabondo.wise-guys.nl
  17. 104.35.138.210.xn.2iij.net
  18. node-c-c478.a2000.nl
  19. co-colspgs-u1-c6b-196.clspco.adelphia.net
  20. ga-cmng-cuda2-c5a-95.atlaga.adelphia.net
  21. snapper.afspc.af.mil
  22. mako.afspc.af.mil
  23. crawl8-public.alexa.com
  24. crawl13-public.alexa.com
  25. crawl9-public.alexa.com
  26. crawl15-public.alexa.com
  27. crawl24-public.alexa.com
  28. crawl11-public.alexa.com
  29. crawl23-public.alexa.com
  30. crawl22-public.alexa.com
  31. crawl12-public.alexa.com
  32. crawl16-public.alexa.com
  33. crawl25-public.alexa.com
  34. ac939c0d.ipt.aol.com
  35. aca9ea30.ipt.aol.com
  36. ac9be394.ipt.aol.com
  37. ia11037.archive.org
  38. cgi6.archive.org
  39. news.assertive.ca
  40. 36.suba.sttl.sttlwane.dsl.att.net
  41. drone11.sv.av.com
  42. drone10.sv.av.com
  43. bigip1a-snat.sv.av.com
  44. drone8.sv.av.com
  45. drone9.sv.av.com
  46. buildrack23.sv.av.com
  47. drone4.sv.av.com
  48. drone2.sv.av.com
  49. drone7.sv.av.com
  50. 66-154-97-66.bchosting.com
  51. adsl-61-189-184.bhm.bellsouth.net
  52. adsl-80-101-53.bhm.bellsouth.net
  53. adsl-218-122-215.mia.bellsouth.net
  54. ps3.aic.bls.com
  55. 142.167.186.195.dial.bluewin.ch
  56. host-145.brainna.com
  57. ns50.eneserve.co.jp
  58. 82-41-144-89.cable.ubr04.glen.blueyonder.co.uk
  59. cache162.156ce.maxonline.com.sg
  60. pcp311612pcs.woodln01.md.comcast.net
  61. c-24-4-91-157.client.comcast.net
  62. pcp312338pcs.woodln01.md.comcast.net
  63. pcp02709216pcs.flrnc01.al.comcast.net
  64. pcp01543070pcs.abngtn01.va.comcast.net
  65. pcp03748376pcs.sarast01.fl.comcast.net
  66. bgp946159bgs.canton01.mi.comcast.net
  67. 74.39.108.194.contactel.net
  68. wsip-68-15-247-92.dl.dl.cox.net
  69. crawler1.crawler918.com
  70. pavlik.natur.cuni.cz
  71. pes.natur.cuni.cz
  72. boris.natur.cuni.cz
  73. cyber86.cybercity.fr
  74. cursed.data.ee
  75. zero.data.ee
  76. crawl09.dir.com
  77. ng1.exabot.com
  78. cr018r01-3.sac2.fastsearch.net
  79. mmscrm06-1.sac2.fastsearch.net
  80. cr012r01-3.sac2.fastsearch.net
  81. nircr002.sac2.fastsearch.net
  82. fixcr003.sac2.fastsearch.net
  83. d-mhslc-34x-170.fullerton.edu
  84. crawl16.googlebot.com
  85. crawler11.googlebot.com
  86. crawler14.googlebot.com
  87. crawl22.googlebot.com
  88. crawl24.googlebot.com
  89. crawl33.googlebot.com
  90. crawl23.googlebot.com
  91. crawl17.googlebot.com
  92. crawl13.googlebot.com
  93. crawler15.googlebot.com
  94. crawl18.googlebot.com
  95. crawler13.googlebot.com
  96. crawler3.googlebot.com
  97. crawler10.googlebot.com
  98. crawler12.googlebot.com
  99. crawl11.googlebot.com
  100. crawl32.googlebot.com
  101. crawl10.googlebot.com
  102. crawl14.googlebot.com
  103. crawler9.googlebot.com
  104. bdsl.66.14.38.223.gte.net
  105. bdsl.66.14.163.212.gte.net
  106. wfp2.almaden.ibm.com
  107. ingrid.ilse.nl
  108. server.imediabiz.com
  109. mint.inktomi.com
  110. brimstone-u6.inktomi.com
  111. idev19.inktomi.com
  112. g1009.inktomi.com
  113. idev20.inktomi.com
  114. brimstone-u7.inktomi.com
  115. lj1239.inktomisearch.com
  116. lj1219.inktomisearch.com
  117. lj1231.inktomisearch.com
  118. lj1230.inktomisearch.com
  119. lj1207.inktomisearch.com
  120. lj1068.inktomisearch.com
  121. si1003.inktomisearch.com
  122. lj1233.inktomisearch.com
  123. lj1076.inktomisearch.com
  124. lj1235.inktomisearch.com
  125. lj1240.inktomisearch.com
  126. lj1079.inktomisearch.com
  127. lj1236.inktomisearch.com
  128. lj1220.inktomisearch.com
  129. si1006.inktomisearch.com
  130. lj1241.inktomisearch.com
  131. lj1200.inktomisearch.com
  132. si1005.inktomisearch.com
  133. lj1077.inktomisearch.com
  134. si1000.inktomisearch.com
  135. si1001.inktomisearch.com
  136. 12-222-174-119.client.insightbb.com
  137. ua20d4hel.dial.kolumbus.fi
  138. tsubame09.crawler.kototoi.org
  139. hibari09.crawler.kototoi.org
  140. sv-fw.looksmart.com
  141. crawlers.looksmart.com
  142. umn-cache.r.state.mn.us
  143. fll-dsl181-cust181.mpowercom.net
  144. kt-technology.myftp.biz
  145. ntkngw071107.kngw.nt.ftth.ppp.infoweb.ne.jp
  146. deepindex.net1.nerim.net
  147. ns1.nerxs.com
  148. customer-148-233-67-228.uninet.net.mx
  149. cpc3-lutn1-6-0-cust26.lutn.cable.ntl.com
  150. host-sa275.res.openband.net
  151. ool-18bd4134.dyn.optonline.net
  152. j083107.ppp.asahi-net.or.jp
  153. inet-nc01-o.oracle.com
  154. inet-netcache2-o.oracle.com
  155. adsl-207-215-186-91.dsl.lsan03.pacbell.net
  156. adsl-63-207-206-130.dsl.snfc21.pacbell.net
  157. ppp-67-124-90-127.dsl.pltn13.pacbell.net
  158. adsl-66-124-227-164.dsl.snfc21.pacbell.net
  159. us-135.picsearch.com
  160. ip503c238f.speed.planet.nl
  161. qn-213-73-210-115.quicknet.nl
  162. qn-213-73-197-30.quicknet.nl
  163. qn-213-231-196-71.quicknet.nl
  164. www.readware.com
  165. ptd-24-198-85-236.maine.rr.com
  166. rdu57-64-082.nc.rr.com
  167. cs170121-176.sport.rr.com
  168. ptd-24-198-95-201.maine.rr.com
  169. ptd-24-198-89-182.maine.rr.com
  170. rrcs-west-66-27-55-14.biz.rr.com
  171. ctb-cache1-vif1.saix.net
  172. charon.schneebi.com
  173. mtl-hse-ppp185776.qc.sympatico.ca
  174. mtl-hse-ppp201513.qc.sympatico.ca
  175. mtl-hse-ppp206908.qc.sympatico.ca
  176. h136n2fls31o881.telia.com
  177. egspd427.teoma.com
  178. egspd402.teoma.com
  179. egspd403.teoma.com
  180. egspd400.teoma.com
  181. copilot.thunderstone.com
  182. proxy.tiscali.be
  183. anonproxy.tisnet.ch
  184. cr4.turnitin.com
  185. cr1.turnitin.com
  186. n128-227-97-112.xlate.ufl.edu
  187. ifweb.dimi.uniud.it
  188. amcip3655.amc.uva.nl
  189. pool-151-196-232-36.balt.east.verizon.net
  190. pool-141-157-6-21.balt.east.verizon.net
  191. pool-151-196-10-32.balt.east.verizon.net
  192. pool-138-88-224-68.res.east.verizon.net
  193. pool-141-156-175-86.esr.east.verizon.net
  194. www.whois.sc
  195. 66.237.60.86.ptr.us.xo.net
  196. 66.237.60.87.ptr.us.xo.net
  197. 66.237.60.90.ptr.us.xo.net
  198. 66.237.60.89.ptr.us.xo.net
  199. 66.237.60.88.ptr.us.xo.net
  200. 66.237.60.83.ptr.us.xo.net
  201. 66.237.60.85.ptr.us.xo.net
  202. 66.237.60.84.ptr.us.xo.net
  203. 12.175.0.35
  204. 12.36.129.131
  205. 131.107.137.37
  206. 131.107.137.47
  207. 137.148.229.111
  208. 151.196.184.172
  209. 158.110.147.81
  210. 193.29.77.220
  211. 194.108.39.74
  212. 195.5.50.60
  213. 198.134.101.8
  214. 198.235.178.10
  215. 200.24.111.100
  216. 202.108.249.184
  217. 202.108.249.185
  218. 202.214.69.131
  219. 204.95.98.252
  220. 204.95.98.253
  221. 208.187.37.97
  222. 208.45.145.70
  223. 209.167.50.22
  224. 209.226.39.19
  225. 209.226.39.23
  226. 210.150.10.42
  227. 211.95.222.247
  228. 212.142.138.169
  229. 213.215.133.19
  230. 213.252.152.11
  231. 216.39.50.104
  232. 216.39.50.114
  233. 216.39.50.13
  234. 216.39.50.154
  235. 216.39.50.24
  236. 216.39.50.33
  237. 216.39.50.4
  238. 216.39.50.44
  239. 216.39.50.54
  240. 216.39.50.64
  241. 216.39.50.74
  242. 216.39.50.84
  243. 216.39.50.94
  244. 217.17.233.136
  245. 217.207.136.162
  246. 217.23.251.177
  247. 218.145.25.111
  248. 220.73.165.142
  249. 220.73.165.206
  250. 220.73.165.78
  251. 62.110.67.166
  252. 64.210.196.195
  253. 64.210.196.197
  254. 64.210.196.198
  255. 65.115.5.162
  256. 65.170.137.46
  257. 66.46.234.254
  258. 66.77.73.89
  259. 69.28.130.222
  260. 69.28.130.229
  261. 69.28.130.230
  262. 69.28.130.231
  263. 69.31.79.226
  264. 69.59.139.156
  265. 81.205.39.64
  266. 81.205.49.2
  267. 81.208.60.201
  268. 81.5.184.25
 
[ Total by Pages ] [ Page-Month ] [ Total By Month ] [ Tracking ] [ Referer ]
[ Security ] [ 404 ] [ Maps ] [ Stats ] [ About Index ]
[ Art ] [ Code ] [ Personal ] [ Other ] [ Main Index ]
 
r   f