Solutions for educational platforms and publishers

The Wizenoze Bot

WizenozeBot is a robot for crawling web pages. It collects content to be included in the Wizenoze Solutions, where students go to explore, search and self-learn. Our solutions point students and teachers to information that is suitable and useful for them. We do not copy the information, but send our users to the source, just like a web search engine does.
 
How does WizenozeBot identify itself?

The robot uses the WizenozeBot user-agent with each request. The full user-agent string that we use is:

WizenozeBot/1.0 (https://wizenoze.com/bot; crawler@wizenoze.com

The robot operates from the following IP addresses:

  • 54.146.192.109
  • 18.214.57.246
  • 80.101.25.79
  • 34.229.92.93
  • 52.91.166.31
  • 35.174.173.39
  • 3.87.52.95
  • 54.225.58.54
  • 3.81.157.57

How much of my web server’s resources does the WizenozeBot consume?


The WizenozeBot is designed to be polite and not overload websites with requests. It will pause between subsequent requests to your server. However, if you feel our visits are hurting your site performance, you can use robots.txt to restrict the parts of your site that WizenozeBot has access to, or to specify a crawl-delay.


How do I allow WizenozeBot to visit my site?


WizenozeBot respects the robots.txt standard. You can add rules to tell the robot which parts it can or cannot visit. For example to tell us we can crawl your entire site, you can use:

User-agent: WizenozeBot
Disallow:
How do I prevent WizenozeBot from visiting parts of my site?
WizenozeBot respects the robots.txt standard. You can add rules to tell the robot which parts it can or cannot visit. For example to tell us to stop crawling anything under the /includes directory:

User-agent: WizenozeBot
Disallow: /includes/
to block WizenozeBot completely and stop us from including your site in our search engines, you can use:

User-agent: WizenozeBot
Disallow: *
Alternatively, you  use robots meta tags on individual pages to tell WizenozeBot if it should index a page and if it should follow the links. For example:

<meta name=”robots” CONTENT=”index, follow” />


How do I get my content included in the Wizenoze collection?


If your site is not included yet, and you feel it offers readable, reliable and relevant information for children, please let us know.

If your site is included already, you can help WizenozeBot to find the most important parts by following the following steps:

  • Use text rather than images for important content
  • Make sure your site also works with JavaScript, Java and CSS disabled
  • Use normal URLs, avoiding links that look like form queries (http://www.example.com/engine?id)
  • Provide a sitemap

How can I contact you?


If you have additional questions or concerns, please let us know. You can contact us at crawler@wizenoze.com.

Book a demo

Contact details

Other information