gateway

The Gateway Logo

Gateway - SEO Tools Technical Standards & Functional Specifications Draft

Abstract


This specification is intended for product managers, developers, and executives who are involved in the creation, maintenance, and enhancement of software used for search engine optimization (SEO) analysis, monitoring, and strategy. It serves as a comprehensive guide for those responsible for developing tools that assist in the collection, processing, and interpretation of SEO data. This includes, but is not limited to, software that performs tasks such as crawling websites, analyzing on-page and off-page factors, tracking keyword rankings, monitoring backlinks, and evaluating website performance in search engine results.

Furthermore, this specification is valuable for stakeholders who are interested in ensuring that the software adheres to best practices and industry standards, thereby improving the accuracy, reliability, interoperability, compatibability, and usability of SEO tools. It aims to provide clear and detailed guidelines that facilitate the correct implementation of features, functionality, and data reporting mechanisms.

This document is also intended for parties focused on enhancing the quality and correctness of the documentation associated with such tools, as well as those responsible for ensuring compliance with these requirements during the development lifecycle. Whether you’re building new SEO software from scratch or updating an existing product, adhering to this specification will help ensure that your tool meets the expectations of users and stakeholders by delivering accurate, actionable insights in a standardized manner.

In summary, this specification is a vital resource for anyone involved in the SEO software development ecosystem, from initial conceptualization and design to deployment and ongoing maintenance, aiming to elevate the overall quality and effectiveness of SEO tools within the industry.

Objective


The goal of this document is to standardize the technical capabilities of SEO software tools pursuant to the issues identified in the December 2016 article on Search Engine Land “How and Why the SEO Tools Industry Should Develop Technical Standards”. These standards will enhance data portability, consistency, and accuracy across SEO tools, reducing discrepancies in how data is stored, analyzed, and exported. By establishing a unified framework, it aims to improve interoperability and reliability, ultimately benefiting both users and developers in the SEO industry.

Scope


Currently, the scope of this specification is limited to providing schemas and functional requirements for all software that is used for monitoring, analyzing and/or improving Organic Search performance both native and software as a service (SaaS) applications.

Participate


We encourage all people actively involved in search engine optimization to contribute to this specification. You can https://github.com/ipullrank/gateway/issues/new or https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/proposing-changes-to-your-work-with-pull-requests/creating-a-pull-request. Open issues can be found here

Compliance


To comply with this document, search engine optimization solutions must implement all applicable features outlined within. The method for assessing compliance will be established at a later date. Those who are found to be in compliance may use the Gateway logo and will be added to list of certified tools.

Standard and Process


This section outlines the established guidelines and methodologies that must be adhered to in the development and maintenance of SEO software tools. It provides a structured approach for implementing features and functionaliti, ensuring consistency, accuracy, and quality across different tools and platforms.

Data Portability

Data portability is a critical component of modern SEO software, enabling users to seamlessly transfer data between different tools and platforms without loss of information or integrity. This section defines the requirements and protocols for ensuring that data can be exported, imported, and utilized across various systems in a standardized format. It covers the necessary schemas, data types, and formats that must be supported, as well as the mechanisms for handling data transformations and mappings. By adhering to these guidelines, SEO software developers can ensure that their tools provide users with the flexibility to integrate and analyze data from diverse sources, facilitating better decision-making and collaboration. This standard also aims to promote transparency and user control over data, aligning with broader industry trends and regulatory requirements around data ownership and accessibility.

Crawl Data File (.CDF)

When exporting data from a crawl, the minimum specification of data points and their order within the exported file is follows.

Where a data point is indicated as 1-n, this means that if there are multiple instances of the feature or data point, they should be included as subsequent columns in the exported file.

These files are specific comma separated value text files, but saved with an extension of .CDF and should be exported with a file name of [Data provider]-[Domain Name]-crawl-[date].cdf. An example filename might be “ScreamingFrog-Example_com-crawl-12072024.cdf.” Alternatively,

Alternatively, this can be delivered as a SQLite database, with a metadata table of metadata of the crawl where relevant metadata is defined as:

When exporting data link data for website or webpage the minimum specification of data points and their order within the exported file is follows.

Note: Many of these data points are not available in all link indices. This specification is intended to encourage providers of link data to expand their data points to a level that makes the data more useful for analysis.

These files are specific comma separated value text files, but saved with an extension of .LDF.

All link indices provide disparate measures of the perceived authority of domains and pages. Ahrefs provides URL Rating, Domain Rating. Majestic provides X, y, Z. Moz provides Page Authority, Domain Authority, MozRank, MozTrust.

While none of these metrics matches up exactly with the measures that Google is using internally, they are reasonable estimations. Unfortunately, since these tools crawl a different pocket of the web, it is difficult to combine the data. Therefore, what is required is a method to tie the datsets together and calculate the best review of a given link profile.

Link Indices require the following methods to ensure users have the ability to do so:

Organizations providing link indices are required to process the Common Crawl data monthly and publish these numbers as a means for users to calculate the translation of between metrics of across datasets.

Alternatively, all link indices are required to process and provide the following public link metrics:

TODO

Document Owner


Until another person or group of people has been identified, the owner of this document is Michael King (mike@ipullrank.com).

Document Approver


Until another person has been identified, the owner of this document is Michael King (mike@ipullrank.com).

Effective Date


This document is a draft, and therefore will not be enforced until its specifications are completed and are collectively agreed upon.

Last Reviewed Date


This document was last reviewed on September 26th, 2024.