admin管理员组文章数量:1426052
I created a page on a Wordpress site that was for internal use only and triggers some backend code. Within a few days I started seeing hits on that page from "bingbot".
I'm not using any kind of sitemap plugin. How are crawlers finding this page?
I know the robots.txt file can block them but I want to make sure they don't show up for crawlers that don't respect this. I still want to have the page publicly accessible if someone types in the URL.
What needs to be done in Wordpress to make sure a page can't be discovered except by typing in the URL?
I created a page on a Wordpress site that was for internal use only and triggers some backend code. Within a few days I started seeing hits on that page from "bingbot".
I'm not using any kind of sitemap plugin. How are crawlers finding this page?
I know the robots.txt file can block them but I want to make sure they don't show up for crawlers that don't respect this. I still want to have the page publicly accessible if someone types in the URL.
What needs to be done in Wordpress to make sure a page can't be discovered except by typing in the URL?
Share Improve this question asked Jun 18, 2019 at 1:10 user83358user83358 101 8- " to make sure a page can't be discovered except by typing in the URL?" - but that is exactly what many crawlers are doing. The way to go is usually either a robots.txt or some restrictions like HTTP Basic Auth (can be a simple user/password) – kero Commented Jun 18, 2019 at 6:21
- Crawlers can't guess the URL, they have to get it from somewhere. That is what I would like to remove. – user83358 Commented Jun 18, 2019 at 22:33
- This is just not correct. I have had several websites in development that suddenly appeared in Google. Even if they do not guess URLs (which I presume they do), if you surf the site in Chrome, they know it exists. The only effective way of blocking them out is by blocking out all random visitors (eg via Basic Auth) – kero Commented Jun 19, 2019 at 6:45
- If you create the url yoursite/dfoimsfg9q743jroaiemf9w84mr09wqnmr09 will it show up in a crawler? Why do you think that is? The URL that I created was only for an internal function, can't use basic auth, and was only accessed through Safari for testing. Since Apple doesn't have a search engine that seems like an unlikely way for it to be discovered. – user83358 Commented Jun 20, 2019 at 15:42
- After creating a page with a completely random URL, and not accessing it at all, it took 4 days to start getting hits from bing bot. Since the server hasn't been hammered with trillions of requests for random URLs, this can only mean that crawlers (and potentially malicious attackers) are getting a list of all pages that exist in Wordpress, whether or not they are linked from anywhere else on the public site. That could be a concern to some people. – user83358 Commented Jun 28, 2019 at 0:51
1 Answer
Reset to default -1Yes, if they are public pages WordPress will let them be indexed unless you tell it not to.
Try adding this to you function.php
this:
function allow_no_robots() {
global $post;
if ( $post->ID == PAGE_ID_TO_NOT_INDEX ) {
echo '<meta name="robots" content="noindex, nofollow">';
} add_action( 'wp_head', 'allow_no_robots' );
That sould do it.
本文标签: Does Wordpress list all pages for crawlers
版权声明:本文标题:Does Wordpress list all pages for crawlers? 内容由网友自发贡献,该文观点仅代表作者本人, 转载请联系作者并注明出处:http://www.betaflare.com/web/1745392193a2656648.html, 本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容,一经查实,本站将立刻删除。
发表评论