Learn how to leverage AI for building powerful content machines.
Some of Moonlit’s core functionalities require web scraping. That includes “Apps” that use functions such as the Sitemap Extractor, or Webpage Scraper, and also for the “Import from Website” option in the knowledge base. However, some websites are protected by anti-bot services, so in this guide we’ll show you how you can whitelist requests coming from Moonlit’s server.
A user agent is a short string identifier for an agent performing a web request. Moonlit’s User Agent string is:
Mozilla/5.0 (compatible; Moonlit/1.0; +https://moonlitplatform.com
Depending on your hosting provider or security software, you can add rules that allow this user agent to scrape your website.
Whitelisting in Robots.txt
If you have custom rules set in your site’s robots.txt file you can whitelist moonlit by adding the following lines:
User-agent: Moonlit
Allow:
If your hosting provider is not listed above, please contact us through the live chat widget on the bottom right and we’ll help you.