|User Agent|| ||Verified||Date Added|
|CyberPatrol SiteCat Webbot||1/12/2008 1:15:05|
|Zook Knight||1/12/2008 0:49:16|
|LucidMedia ClickSense||1/12/2008 0:48:19|
|BebopBot is an experimental web crawler under development since January 2007. BebopBot is owned and operated by A Passion for Jazz!™ as part of an effort to develop a niche based searchable Web index. We are not attempting to steal any copyrighted information from your site and will not be re-distributing your content. We will only be allowing users to find your website more easily. The information gathered will be indexed and made accessible via one or more publicly accessible web sites in the near future.|
|A1 Sitemap Generator||1/12/2008 0:41:52|
|Create Text, HTML, RSS and XML sitemaps for your website.|
|CatchBot is the web crawler for Catch, the online division of Reed Business Information Australia. Reed Business Information is Australia’s leading and largest business to business publisher and information provider.|
CatchBot investigates websites for publicly available information about companies, such as a company’s name, address, telephone number and keyword data about a company’s products and services. CatchBot is not designed to access or index any personal information or any information about individuals.
Information gathered by CatchBot is stored on our password protected servers and the security of this information is of the highest importance to us.
Information gathered by CatchBot may be used for business activities that are undertaken by Catch. Examples of this include publishing and maintaining business directories in various countries around the world, industry specific websites and online portals
|fell for bad bot trap|
|NameOfAgent (CMS Spider)||20/08/2008 16:28:55|
|Searches for Wordpress wp-login.php and mod_login.xml|
|Onetime ShopFinder Program||8/08/2008 13:51:08|
|Use PostRank™ to score, filter and track performance of any RSS feed. Reclaim your time, boost your productivity, and stay on top of the news.|
|Die Semager-Bots sind Webcrawler unserer Suchmaschine. Dabei handelt es sich um Computerprogramme, die Texte im World Wide Web herunterladen und diese über die Web- und Wörtersuche von Semager auffindbar machen.|
|does a request with Content-Type: text/html; charset=utf-8|
|WebAlta Crawler||17/07/2008 0:25:55|
|fell for bad bot trap + url is not working + e-mail harvesting ip addresses|
|fell for bad bot trap + aggressive (2 requests/second)|
|MarkAny WebSafer||1/07/2008 15:37:02|
|MarkAny is a Korean rights management company.|
WebSafer is their web content protection system.
ActiveX client control requests "/MarkAny/Websafer/MaSiteInfo.ini".
A reverse proxy is installed in front of the web server with content that needs protection. When a web browser wants access to protected content, it will first be presented with an ActiveX control to be installed.
The ActiveX control:
- installs "automatically".
- switches between protected and unprotected mode automatically;
probably why it requests the file /MarkAny/Websafer/MaSiteInfo.ini
- removes view source, print, and clipboard actions from menus.
- removes visits to protected pages from the browser history.
- removes access to right-click context menu.
- tries to stop screen capture programs.