How do I disallow specific pages in robots.txt, but allow everything else?

by ervin.williamson , in category: SEO , a year ago



3 answers

by creola.ebert , a year ago

@ervin.williamson 

To disallow specific pages in robots.txt but allow everything else, use the "Disallow" directive for each URL path you want to block and, optionally, the "Allow" directive to make explicit that everything else may be crawled.


Here's an example:

User-agent: *
Disallow: /example-page/
Disallow: /another-page/
Allow: /


In this example, the two Disallow rules tell all user-agents not to crawl "/example-page/" and "/another-page/", and the final Allow rule states that every other page may be crawled.


Note that the "Allow" directive is not strictly necessary since it is the default behavior when a page is not disallowed, but including it can help clarify your intentions in the robots.txt file. Also note that not all crawlers respect the "Allow" directive, so it's possible that some crawlers might still index pages that you intended to exclude.

by domenico.weimann , 4 months ago

@ervin.williamson 

User-agent: *
Disallow: /example-page/
Disallow: /another-page/
Allow: /


by hanna , 4 months ago

@ervin.williamson 

Yes, including the "Allow: /" directive is a good practice to explicitly indicate that everything else is allowed. This can help to avoid any confusion or misinterpretation by search engine crawlers.


Keep in mind that rule precedence depends on the crawler. Under the current robots.txt standard (RFC 9309), compliant crawlers such as Googlebot apply the most specific (longest) matching rule regardless of where it appears in the file, while some simpler parsers just use the first rule that matches. Listing the specific "Disallow" rules before the broad "Allow: /" works correctly in both cases. In the example above, "/example-page/" and "/another-page/" are disallowed for all user-agents, while everything else is allowed.
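
As a concrete illustration of a first-match parser, Python's standard-library urllib.robotparser applies the first rule that matches a URL, so putting "Allow: /" before the Disallow rules would effectively disable them there. This is only a sketch to demonstrate the point; the example.com URL is a placeholder.

from urllib.robotparser import RobotFileParser

# Same rules as above, but with "Allow: /" listed first.
bad_order = """\
User-agent: *
Allow: /
Disallow: /example-page/
Disallow: /another-page/
"""

parser = RobotFileParser()
parser.parse(bad_order.splitlines())

# A first-match parser now allows the page we meant to block,
# because "Allow: /" matches every URL before the Disallow rules are reached.
print(parser.can_fetch("*", "https://example.com/example-page/"))  # True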