Skip to content

Google Search’s Internal Engineering Documentation Leak: What it Means

Google has given us insight into their inner workings and how they may rank search engine results. Back in March, Google Search API documents were published on GitHub. While later removed in early May, this public Google documentation was automatically attached with the Apache 2.0 license, which meant those who had accessed it had the…
Reading Time: 4 minutes
Blog Post

Google has given us insight into their inner workings and how they may rank search engine results. Back in March, Google Search API documents were published on GitHub. While later removed in early May, this public Google documentation was automatically attached with the Apache 2.0 license, which meant those who had accessed it had the rights to distribute it. While Google already has published its own Content API Warehouse, these particular documents were never meant to be in the public eye.

Naturally, there has been a major stir in the SEO community. Google has been notoriously private about how they rank websites, but this leak has given us an unprecedented insight into their ranking systems and search algorithms – it sheds light on the type of data that matters to them.

What Happened?

The leaked documents were published on GitHub on 13th March. The document contained 2,596 modules and 14,014 attributes – tons of information to work through. The accidental publishing was originally spotted by the CEO of EA Digital Eagle, Erfan Azimi, who shared the information via email with Rand Fishkin, the co-founder of SparkToro. Since then, SEO experts have analysed the document, and what they have revealed is quite interesting. This one may be more impactful than the Yandex Search leak!

You might wonder, are these leaked documents legit? It looks as though they are, as the email to Rand Fishkin also stated the authenticity of the documents was backed up by ex-employees of Google. Google hasn’t yet responded to news of the leak.

What Does it Tell Us?

The documents tell us what kind of data Google stores and finds important. While it doesn’t go into specifics in terms of how ranking factors are weighted, the wealth of information can be very helpful for SEO companies and people who want their websites to rank higher. These documents also tell us that Google may not have been completely honest in their previous statements about how Google’s algorithm operates, as there are some clear contradictions between what they have said and what the documents show.

Want to learn more? Here’s an official Google guide to the Google Search Ranking Systems.

Now, let’s go into the most interesting takeaways from the document regarding search engine ranking.

  • Site Authority

Google has said that they do not have a website authority score. However, the leaked internal documents don’t coincide with this statement. That means that the strength of a website’s domain may play more of a role in ranking than many people had previously thought.

  • Clicks for Rankings

Despite Google previously stating that they do not use click-centric user signals, these documents show that: yes, clicks do matter. It’s not a massive surprise to us, but if you want to rank high on Google search, you’ll need to bring in successful clicks. Google ranks clicks under any of the following: goodClicks, badClicks, unsquashedClicks, and lastLongestClicks. So, it’s not just about having people click on a link – it’s about having it be successful.

  • Sandboxing

What about the sandbox – the idea that newer websites don’t rank as well? Again, Google had previously denied the presence of a sandbox, with the document stating otherwise, as it shows they use the attribute hostAge specifically for sandboxing purposes, which tells Google which sites are more trustworthy based on age and other trust signals.

  • E-E-A-T

E-E-A-T, standing for Experience, Expertise, Authoritativeness, and Trustworthiness may play a part in Google’s ranking factors. It wasn’t mentioned too often in the leak, but it’s worth highlighting that the leak showed that it identifies authors and stores that information.

  • Heading Tags and Keyword-driven Meta Titles

We’ve learned from the document that keywords in heading tags and meta titles matter. For example, if a title tag includes particular keywords, it may rank higher for search queries that match it.

  • Link Building

It’s no big surprise, but the leaked document shows that link building does matter when it comes to Google’s ranking system. Within the Google document, it showed that links were classed as either low, medium, or high-quality. So, it’s all about having successful links, which means link diversity is an important factor.

Want more insight into the power of link building? Check out our piece on link relevancy and authority and whether it still matters.

  • Fresh Content

The document has made it clear that Google cares about content freshness, which relates to how often a page updates with new content as well as the published dates. Essentially, the fresher the content, the higher quality it is deemed by Google.

  • YMYL Score

Part of the leaked document tells us that they keep a YMYL (Your Money Your Life) score, which means scoring any content that covers topics that may have an effect on the users in the real world. For example, that includes content concerning health or financial advice.

  • Demotions

Many people will be interested in potential demotions, and the leaks show that Google uses algorithmic demotions to rank content. The document highlighted demotions for anchor mismatches and exact match domains. So, if an anchor link does not match the site it’s referring to, the piece of content may get demoted on the ranking system.

The Takeaway

The leaked Google Search API Documents consist of thousands of pages. Thankfully, SEO experts have already sifted through to bring us valuable information concerning search engine rankings. Notably, link building, content freshness, clicks for rankings, and site authority all play a role in Google’s ranking factors. These documents tell us what Google is interested in and which information it stores, and that can help us navigate SEO going forward.

Are you ready to climb the Google ranks? We are SEO experts here at Click Intelligence, and we can help your site increase in traffic with a bespoke campaign. Book a free consultation with us today to get started!

James Owen, Co-Founder & Head Of Search

James has been involved in SEO and digital marketing projects since 2007. James has led many SEO projects for well-known brands in Travel, Gaming and Retail such as Jackpotjoy, Marriott, Intercontinental Hotels, Hotels.com, Expedia, Betway, Gumtree, 888, Ax Paris, Ebyuer, Ebay, Hotels combined, Smyths toys, love honey and Pearson to name a few. James has also been a speaker at SEO and digital marketing conferences and events such as Brighton SEO.

View all Downloads

Downloads

The cover image of our e-book titled 'Content Marketing How-To Guide', portraying an individual typing on a laptop.

Content Marketing – How-to Guide

What is Content Marketing? Download Our How-To Guide Today!

Download
download

The Ultimate Guide To Selling Products Online

We’re going to take a look at how you can turn your eCommerce dream into a reality.

Download
A blue section and the text "The Millionaire Guide On SEO." The right side is filled with scattered U.S. one-dollar bills.
View the Blog

You may also be interested in...

New Feature Launch: Click Insights AI Overviews Gap Analysis

Google's AI Overviews are changing the game when it comes to all things search. From…

How to Increase Digital Marketing Agency Profit Margins in 5 Steps

Healthy margins. For any agency, profitability goes beyond revenue and is about achieving (and retaining)…

Generative AI SEO: Threat or Opportunity?

Generative AI is rapidly reshaping the search landscape. It has left marketers asking a specific…

No Tricks, Just Links: Why Link Building Matters for AI Search

AI-driven search is altering SEO at lightning-fast speed. Google AI Overviews, Bing Copilot, ChatGPT –…

How New Websites Are Ranking Faster in 2025

SEO in 2025 looks very different from just a couple of years ago. Google’s algorithms…

How Many Backlinks Does It Take to Rule Your Niche’s SERPs?

SEO is undergoing a dramatic evolution. Despite this, when it comes to those trusty old…

Content Writing vs Copy Writing: What’s the Difference?

Content writing vs copywriting. It's a debate that often confuses marketers. However, don't underestimate knowing…

What You Should Know Before Buying Content

Content is the fuel that drives digital visibility. It’s the vehicle behind customer engagement and…

View all Guides

Online Guides

The Best Link Building Agencies Helping Investment Banking Firms Strengthen Their Digital Reputation
View guide
The Best Link Building Companies Helping Technology Brands Strengthen Their Online Authority
View guide
Mission: Visibility — The Best Link Building Companies Lifting the Aerospace Industry to New Heights
View guide
4 Game-Changing Link Building Agencies Helping Data Centre Companies Dominate Search Rankings
View guide
Top Link Building Companies Powering Success for Web Hosting Brands
View guide
The Best Link Building Agencies for IT Support Companies
View guide
How To Become One of the Great Advisory Companies: Work with these Link Building Agencies
View guide
Work with 5 of the Best Link Building Agencies for Web Development Companies
View guide
Back To Top