The Comprehensive Guide to Excluded Pages in the Google Search Console Coverage Report

It’s important to keep track of pages Google has excluded from their index. Thanks to the new excluded pages section within the Search Console coverage report, you can see why a page as excluded. What follows is an explanation of all the reasons a page might be excluded from Google’s index. It will also explain what you can do about it- if you don’t want that page excluded.

What’s the problem with an excluded page in Google Search Console’s coverage report?

These page are probably not in Google. It doesn’t appear that you want them to be. Make sure Google has made the correct decision.

What are the reasons a page is excluded from Google’s index, according to the Google Search Console coverage report

Why was a page blocked by ‘noindex’ tag excluded according to Google Search Console’s coverage report?

There’s a robots meta tag on this page, telling Google not to index it. If you want this page in Google’s index, remove the robots meta tag.

Why was a page blocked by page removal tool excluded according to Google Search Console’s coverage report?

Someone has specifically told Google not to index this through Google Search Console. If you want this page in Google’s index, login to Google Search Console and remove it from the list of removed pages.

Why was a page blocked by robots.txt excluded according to Google Search Console’s coverage report?

There is a line in the robots.txt file that is excluding Google from adding this to its index. To get this page indexed, modify your robots.txt file to allow Google to index it. The Google Search Console robots.txt checker will help you do this.

Why was a page blocked due to unauthorized request (401) excluded according to Google Search Console’s coverage report?

Google does not have permission to view the page. For Google to index this page you must give it permission to view it by sending a 200 header code.

Why was a crawl anomaly page excluded according to Google Search Console’s coverage report?

This is a catch-all for other reasons Google excluded this page from its index. You’ll have to figure out the crawl anomaly if you want Google to index this page.

Why was a crawled – currently not indexed page excluded according to Google Search Console’s coverage report?

Google isn’t specific on why they’re not including it, but they’ve determined it’s not going to be indexed. There’s nothing you can do about this page, if you want Google to index it. Try submitting it through an XML sitemap or adding links to this page. It might help.

Why was a discovered – currently not indexed page excluded according to Google Search Console’s coverage report?

Google found this page but hasn’t indexed it yet. You’re going to have to wait for Google to index this page.

Why was an alternate page with proper canonical tag excluded according to Google Search Console’s coverage report?

This page is a duplicate of another page. It’s not in Google’s index because this page’s canonical tag is pointing to the original page. That’s the page in Google’s index. If you want Google to index this page instead, change your canonical tags to point to this page.

Why was a duplicate page without canonical tag excluded according to Google Search Console’s coverage report?

Google has determined that this page is a duplicate. Without a canonical tag, it has chosen another page to index instead. You’ll need to add a canonical tag to this page (and other, duplicate pages) if you want this page in the Google index.

Why was a duplicate non-HTML page excluded according to Google Search Console’s coverage report?

Google found an HTML page that duplicates the content of this page. It has decided not to add this page to its index. Make this page more unique if you want Google to add it to the index.

Why did Google chose different canonical than user according to Google Search Console’s coverage report?

Although this page is marked as the canonical version, Google thinks a different page is the correct canonical version of the page. Figure out why Google thinks the other page is better as the canonical version, and modify this page.

Why was a not found (404) page excluded according to Google Search Console’s coverage report?

This page serves a 404 (page not found) header code when served to Google. There’s nothing wrong with having a 404 page, but you might miss an opportunity. Setup a 301 redirect so Google (and users) get to the right page. Remove the 404 header code from this page, if you want it indexed.

Why was a page removed because of legal complaint excluded according to Google Search Console’s coverage report?

Someone has complained about this page legally, and Google has removed it from the index. This is usually a copyright violation. If you want this in Google’s index, your content will need to comply with local laws. You can appeal the submitted legal complaint here: https://support.google.com/legal/troubleshooter/1114905

Why was a page with redirect excluded according to Google Search Console’s coverage report?

Because this page has a redirect, it’s not included in Google’s index. Remove the redirect if you want this page in the index. Bear in mind that you might want this redirect in place, for SEO reasons.

Why was a queued for crawling page excluded according to Google Search Console’s coverage report?

Google is waiting to crawl this page before adding it to its index. You’re just going to have to wait a couple days on this one.

Why was a soft 404 page excluded according to Google Search Console’s coverage report?

Google thinks this page is meant to be a 404 page even though it’s not serving a 404 header code. If the page is not meant to be a 404 page, try adding more content to the page and Google might add it to its index. If it is meant to be a 404 page, send a 404 header code with this page.

Why was a submitted URL dropped page excluded according to Google Search Console’s coverage report?

Google has decided to remove this page from its index. Google isn’t specific on how to add this page back to its index. It might take some research to determine why it might have been removed.

Why was a submitted URL not selected as canonical page excluded according to Google Search Console’s coverage report?

You submitted this page to Google and it has a canonical tag, but Google thinks a different page should be the canonical version. If you want this page in Google’s index, treat it as if it is a “Google chose different canonical than user” page (above).

 

If you want to learn more, check-out Google’s explanation of these errors, here: https://support.google.com/webmasters/answer/7440203