The meta robots tag must be placed in the header area of the web page in question and must include two attributes: name and content to work properly.
The name attribute will indicate the trackers to which it applies.
The content attribute will indicate how it should behave based on the parameters used.
meta name="robots" content="noindex" />
However, not all browsers support all directives:
Value | Bing | Yandex | |
---|---|---|---|
index | ✔️ | ✔️ | ✔️ |
noindex | ✔️ | ✔️ | ✔️ |
none | ✔️ | ❌ | ✔️ |
noimageindex | ✔️ | ❌ | ❌ |
follow | ✔️ | ✔️ | ✔️ |
nofollow | ✔️ | ✔️ | ✔️ |
noarchive/nocache | ✔️ | ✔️ | ✔️ |
nosnippet | ✔️ | ❌ | ❌ |
notranslate | ✔️ | ❌ | ❌ |
unavailable_after | ✔️ | ❌ | ❌ |
The content attribute will indicate how it should behave based on the parameters used.
The most common parameters that we are going to find and useful for SEO are the index tag, noindex tag, follow tag and the no follow tag:
Remark: It is not necessary to use meta robots and the x-robots tag on the same page, as it is redundant.
These types of directives, together with the robots.txt file, the canonicals tags, and the X-robots-tag, allow you to control the indexability and crawlability of your website.
Thus, we can optimize the crawl budget and avoid duplicate content errors.
The main difference between them is that meta robots give indications about the indexing of pages, while robots.txt gives indications about crawling.
Important: We must remark that, as with robots.txt files, crawlers are not obliged to follow the instructions indicated by the metatags of your page. They act only as a suggestion.
Depending on the case, these are the 3 different ways to block the Googlebot:
Meta robots tags should be used whenever you want to control indexing at the individual page level. In other words, to ensure that a URL is not indexed, always use the robots meta tag or the X-Robots-Tag HTTP header.