Defining Index Bloat

Index bloat occurs when search engines index too many pages from a website. In other words, when your site “bloats” search engine indices, Google is indexing an excess of low-quality pages and spending its limited crawl resources on pages that you probably don’t care about.

Index bloat can lead to the following SEO issues:

  • Exhausts crawl budget *
  • Decreases the organic quality of the domain
  • Lowers the ranking potential of your other pages

Some types of websites are especially prone to having too many pages indexed:

  • Ecommerce websites that use URL parameters: Product filtering and re-ordering can add hundreds or thousands of possible URL variations.
  • Medium to large websites: Sites with large numbers of pages that do not need to be indexed, such as Thank You pages, PPC landing pages, Testimonial pages, and others.
  • Sites that have been blogging for a long time: Archive pages such as blog tag and date archive pages very often bloat search engine indices, especially when there is no defined blog category/tag system in place.
  • Site redesigns or migrations: It’s very common to find lots of dev or test pages left over after a site redesign or rebuild.
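To make the URL parameter scenario concrete, here is a minimal Python sketch that groups a list of crawled URLs by what they look like once filter and sort parameters are stripped. The parameter names in `FILTER_PARAMS` and the `example.com` URLs are hypothetical; swap in the ones your own site uses. It is an illustration of how variations pile up, not an audit tool.

```python
from collections import defaultdict
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical parameter names; replace with the ones your site actually uses.
FILTER_PARAMS = {"color", "size", "sort", "page"}


def strip_filter_params(url: str) -> str:
    """Return the URL with filter/sort parameters (and any fragment) removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in FILTER_PARAMS
    ]
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def group_variations(urls):
    """Map each stripped URL to the crawled variations that collapse into it."""
    groups = defaultdict(list)
    for url in urls:
        groups[strip_filter_params(url)].append(url)
    return dict(groups)


crawled = [
    "https://example.com/shoes",
    "https://example.com/shoes?color=red",
    "https://example.com/shoes?sort=price&color=blue",
    "https://example.com/hats?id=7&size=m",
]
for base, variations in group_variations(crawled).items():
    print(base, len(variations))
```

A base URL with a large number of variations is a candidate for a closer look, since each variation may be indexed as its own page.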

Search Engines’ Limited Crawl Budget *

Gary Illyes, a Google Webmaster Trends Analyst, said in 2017:

“Prioritizing what to crawl, when, and how many resources the server hosting the site can allocate to crawling is more important for bigger sites or those that auto-generate pages based on URL parameters.”

One of the main causes of index bloat is Google finding too many pages on your website that carry no instructions on how they should be treated. Very often, a large number of these pages end up being indexed.
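One common form of such an instruction is a robots meta tag in the page’s HTML. The sketch below, using only Python’s standard library, reports whether a page’s HTML carries a `noindex` directive, some other explicit directive, or nothing at all. The function name `indexing_instruction` and its three return values are our own naming for illustration, and the sketch looks only at meta tags, so it will miss instructions sent any other way.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots"> and <meta name="googlebot"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() in {"robots", "googlebot"}:
            for item in attr.get("content", "").split(","):
                if item.strip():
                    self.directives.add(item.strip().lower())


def indexing_instruction(html: str) -> str:
    """Return 'noindex', 'index', or 'unspecified' for a page's HTML."""
    parser = RobotsMetaParser()
    parser.feed(html)
    # "none" is shorthand for "noindex, nofollow".
    if parser.directives & {"noindex", "none"}:
        return "noindex"
    if parser.directives:
        return "index"
    return "unspecified"


thank_you_page = "<html><head><title>Thank you</title></head><body></body></html>"
print(indexing_instruction(thank_you_page))
```

Pages that come back as `unspecified` are the ones search engines are left to decide on by themselves.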

Taking control of how Googlebot and other search engine crawlers crawl and index your site is imperative if you want to reach your maximum ranking potential. Reaching it means that Google efficiently finds your pages, understands your content, and matches the searcher’s need for information to your pages.

How to Find Sources of Index Bloat

The ideal scenario is to have your website audited by SEO experts, so that you get a comprehensive and holistic view of your website, its history, and your business objectives.

The second-best scenario is to use the Index Bloat guide I wrote on Search Engine Watch for very specific checks that can be completed. However, keep in mind that this is focused on common issues we see, and they may not necessarily apply to you.

Index Bloat & SEO Technical Optimization

Delete your Pages and Rank Higher in Search – Search Engine Watch

Resolving Index Bloat

Removing pages from Google’s index can test your patience: depending on the severity of the bloat, it can be a slow and painful process.

The right approach depends heavily on the limitations of your site’s CMS and the SEO strategy you have in place. If you don’t currently have a keyword research strategy, building one is among the first things you need to do before you begin any form of on-page SEO or link-building campaign.

Be careful!

Do not implement recommendations you read online before carefully considering the implications they will have on your website. Fixing index bloat essentially involves manually asking search engines to de-index your pages. Doing so without proper guidance or without a strategy in place can lead directly to a considerable drop in rankings.

Be smart!

Get Your Website Audited

As always, every website is different. The specific methods to resolve index bloat that work for you must be carefully considered by an SEO, based on a comprehensive and thorough SEO audit of your website. Adding noindex meta tags to the wrong pages or disallowing the wrong subdirectories on your site could lead to a drastic drop in organic traffic and conversions.
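One way to reduce the risk of disallowing the wrong subdirectories is to test a proposed robots.txt against a list of URLs before publishing it. The sketch below uses Python’s standard `urllib.robotparser` for that; the rules, the `example.com` URLs, and the `blocked_urls` helper are hypothetical. Keep in mind that this parser does simple path-prefix matching, so its answers may differ from Googlebot’s for rules that rely on wildcards.

```python
from urllib.robotparser import RobotFileParser


def blocked_urls(robots_txt: str, urls, user_agent: str = "Googlebot"):
    """Return the URLs that the given robots.txt text would block for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


# Hypothetical rules and URLs, for illustration only.
proposed_rules = """\
User-agent: *
Disallow: /blog/tag/
Disallow: /dev/
"""

urls_to_check = [
    "https://example.com/blog/tag/seo",
    "https://example.com/blog/what-is-seo",
    "https://example.com/dev/home",
]

for url in blocked_urls(proposed_rules, urls_to_check):
    print("blocked:", url)
```

If a page you rely on for organic traffic shows up in the blocked list, the rule needs another look before it goes live.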

Are you ready to dominate the first page of Google? Our team of award-winning SEOs (and all-around awesome people) is ready to audit your website! Contact our SEO consultants today.


Author
Pablo Villalpando

Pablo holds a BA in Sociology and is a Bilingual SEO Strategist for Victorious. He was recently featured in "Real Research: Research Methods Sociology Students Can Use" by CSU, Chico Professor Liahna Gordon. He is fortunate to enjoy a career path through analytics, digital marketing, and social psychology. His passions include Mexican food, his 4-year-old lab Chia, and astronomy.
