R O B O T Crawler Database Index
UNDER CONSTRUCTION AREA
This area is intended as an experimental robotic crawler index
location. It will contain text links to special pages that present each
database in a search-return-friendly format specially designed for
crawler optimization... HAHAHA, whatever that is supposed to mean.
I will work on this when I have time... ROBOTS START HERE.. ;-)
1.0.1 news - news columns robot index
1.0.3 photos - photo media engine robot index
1.0.4 forum - site forum robot index
1.0.5 videos - video database robot index
1.0.6 resweb - web resources robot index
1.0.8 products - product catalog robot index
MORE TO COME...
R O B O T Agent Tracking
NOTICE UNDER CONSTRUCTION
I am currently developing a complete ROBOT agent tracking
system and database frontend for all the spiders and robotic agents
the system tracks. However, it is still under construction.
In the interim I will keep a list of currently tracked agents and
show some automated statistics here:
CRAWLER ACTIVITY STATUS: HIGH
ROBOT CRAWLER SESSIONS/ACTIVITY IN PAST 12 HOURS
| CRAWLER        | SESSIONS | ACTIVITY |
| MSNbot         | 4        | 20       |
| Googlebot      | 3        | 449      |
| Yahoo Slurp    | 2        | 27       |
| Turnitin.com   | 0        | 0        |
| Inktomi (old)  | 0        | 0        |
| Other*         | 2        | 2        |
* Other: IBM/IA/Alexa/WISEnut/Netcraft/NextGenSearchBot/Jetbot/Gigabot/NaverBot/etc.
TRACKED SPIDER / CRAWLER AGENTS:
- msnbot - agent
- googlebot - agent
- turnitin.com - agent
- yahoo.com - inktomisearch - agent
- ia_archive - alexa agent
- WISEnutbot.com - agent
- Netcraft Web Server - agent
- almaden.ibm.com - agent
- NextGenSearchBot - agent
- Jetbot - agent
- Gigabot - agent
- NaverBot - agent
- picsearch.com - agent
- BecomeBot - agent
- Baiduspider - agent
- SpeedySpider - agent
- Ask Jeeves - agent
- Nutch - agent
- ShopWiki - agent
- Exabot - agent
- RedCarpet - agent
- MetaCrawler - fastsearch - agent
- grub-client - looksmart agent
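A rough sketch of how a tracker like this might match a request's User-Agent string against the tracked agents and tally the two columns in the statistics above. Everything here is an assumption for illustration: the substring tokens, the `classify_agent` and `tally` names, and the guess that "activity" means the number of requests while a "session" means one distinct client address per crawler. It is not the site's actual tracking code.

```python
from collections import defaultdict

# Hypothetical substring tokens for a few of the tracked agents.
# Real user-agent strings vary; these tokens are illustrative assumptions.
TRACKED_TOKENS = {
    "msnbot": "MSNbot",
    "googlebot": "Googlebot",
    "slurp": "Yahoo Slurp",
    "turnitin": "Turnitin.com",
}


def classify_agent(user_agent):
    """Return the tracked crawler name for a User-Agent string, or None."""
    lowered = user_agent.lower()
    for token, name in TRACKED_TOKENS.items():
        if token in lowered:
            return name
    return None


def tally(hits):
    """Count (sessions, activity) per crawler.

    `hits` is an iterable of (client_address, user_agent) pairs.
    Assumption: activity is the request count, and a session is one
    distinct client address seen for that crawler.
    """
    addresses = defaultdict(set)
    activity = defaultdict(int)
    for address, user_agent in hits:
        name = classify_agent(user_agent)
        if name is None:
            continue  # not one of the tracked crawlers
        addresses[name].add(address)
        activity[name] += 1
    return {name: (len(addresses[name]), activity[name]) for name in activity}
```

Feeding `tally` the (address, user-agent) pairs pulled from a server log for the past 12 hours would give one `(sessions, activity)` pair per crawler, which is the shape of the table above.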
The goodcoffeeonline.com R O B O T
NOTICE UNDER CONSTRUCTION
This site employs "robot" data gathering systems
which automatically retrieve data and images for various net
resources. At present they gather only specific resources
from target locations, e.g. webcam database cam images, site
status info, and weather data. We do not currently operate
a crawling robot and have no plans to do so in the near future.
Our robots are identifiable by the agent string
"www.goodcoffeeonline.com ROBOT". If you have any questions
about our robot use and policy, contact us at the goodcoffeeonline.com support center.
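For illustration, a robot that announces itself with the agent string quoted above could be sketched like this using Python's standard library. The `build_request` and `fetch` names and the timeout value are assumptions made for the example, not a description of the site's actual robot.

```python
import urllib.request

# Agent string quoted in the notice above.
ROBOT_AGENT = "www.goodcoffeeonline.com ROBOT"


def build_request(url):
    """Build a request that identifies itself with the robot's agent string."""
    return urllib.request.Request(url, headers={"User-Agent": ROBOT_AGENT})


def fetch(url, timeout=10):
    """Fetch one specific resource; no crawling and no link following."""
    with urllib.request.urlopen(build_request(url), timeout=timeout) as response:
        return response.read()
```

A site operator who sees this agent string in their logs can then tell the requests came from this robot and not from a crawler.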