Schema markup is the language of web crawlers. Also referred to as structured data, schema is based on its own semantic vocabulary, or coding language, that minimizes the guesswork for search engines in understanding the context and meaning of a site and what it's all about. And in the dynamic, ever-evolving world that is SEO, structured data remains steadfast in proving its worth in more ways than one.
Most SEOs today (even non-tech SEOs) admit they're keen on the importance of schema and the role it plays. Yet, despite an overwhelming nod in agreement among the SEO collective that "Yes. Schema is important. You should be using schema," it's still a topic that's hazed with mysticism.
That is, a lot of SEOs know that structured data is significant. Many have even thrown up some scripts on a few pages and perhaps seen some results. But only some have the tools and skills to leverage it correctly and cohesively on both a page level and more broadly on a site level.
As a team that's rooted in tech SEO, we're privy to the opportunities that schema offers in helping elevate the search appearance for brands, businesses, and blogs alike. This guide shares our processes on how we go about planning and implementing structured data while avoiding common mistakes.
Planning and outlining your structured data is one of the most widely overlooked yet fundamentally crucial aspects of schema. This is the initial strategic organizing of your structured data across the site from a global perspective. As a result of neglecting this stage, a site's schema "strategy" (or lack thereof) is typically not well thought out and mistakes are destined to occur. For instance, we may later discover:
Think of this initial process as an exercise in defining the "why" behind your structured data and what you hope to achieve when it's implemented properly. In short, you want to understand the potential gains from schema.
Google Product Manager, Aylin Altiok, is quoted as saying,
"By just adding a bit of markup, your content becomes eligible for rich results in search, speech recognition, and natural language understanding on the assistant and discovery on all these devices and surfaces."
via Watch Google IO 2019: Enhance Your Search and Assistant Presence with Structured Data
Before jumping into the planning phase, it's important we bring clarity to underlying myths and misconceptions surrounding schema.
Schema is just another tool at our disposal to help search engines understand our sites' contents. It should be taken at both a page level and a site level, thereby underscoring the importance of properly planning your approach and identifying templates that can be used.
A central theme in this planning phase is outlining how you want your business to look in search and determining which aspects to focus on. There are a few ways to approach this.
To start, explore what represents the business or brand via Google's Knowledge Graph – a system that understands facts and information about specific entities (typically the people, places, or things that are important enough to warrant a Wikipedia page) and displays that info in bite-sized pieces as knowledge panels in Google Search.
In simple terms, the Knowledge Graph draws from various sources and materials shared across the web; most commonly Wikipedia and Wikidata. The latter provides a sound launching pad for SEOs to explore and find what represents certain business concepts (e.g. check out the item for SEO for an obvious example).
Another good place to inspire your planning is to explore Google's search gallery of rich features based on structured data types. Depending on the nature of your content, you can identify the most relevant and applicable categories worth utilizing. This is a great source of insight, as you may stumble upon schema types that you may not know existed.
Perhaps the most important aspect of this planning exercise is pinpointing the actual objects that can be marked up with certain schema types. Knowing which forms of schema will work best requires knowing your content and identifying potential page templates that can be used.
To help articulate this process, here is a simple example using The Gray Dot Company site and which objects can be used to identify common page templates.
Based on the schema types mentioned above, let's take the example of blog posting a little deeper. We can state that there is a published blog on the Gray Dot site. Or, we can identify that:
Similarly, the concept of mapping entities together is a way to drive home all the different facets of your organization and offering. For instance, instead of constantly adding the SAME structured data, you can link to that entity's primary page, and easily make these relationship connections FOR Google.
Take for example a blog written by Tory. Instead of having to repeat all of the info about Tory for the author portion of the blog, we can declare the relationship and add the ID of the dedicated page about Tory.
Pro Tip: Use SchemaApp's Schema Paths Tool to identify such relationships and opportunities to bind the schema together.
Also worth noting when establishing connections between items, if you're building nested schema code, the error on the sub-schema affects the overall integrity of that entire schema block as a whole.
We usually see this with Product Rating schema: if there are no ratings or reviews but the schema is implemented, you'll see an error in Google Search Console (GSC).
When planning, you may want to create a visual map of the various entities and relationships that make up your brand and its website. A couple of visual mapping tools designed specifically for schema are JSON Crack and Classy Schema's Structured Data Viewer. Both tools enable you to map visual relationships based on different schema types.
You can also use more standard mind-mapping tools, like Lucid Charts, Miro, and Figma. There are tons of these tools available, some of which may be available as part of existing work management platforms (e.g. ClickUp.)
There are common mistakes we see that are rooted in the planning phase. At large, these mistakes involve choosing the improper schema type due to misinformation/lack of research or using structured data improperly. More specifically, the types of mistakes we see include
1) Wrong or unnecessary schema types being used, such as
2) Schema types implemented in the wrong places (site-wide vs. page-level) – e.g. Organization schema improperly deployed globally versus being implemented only on the homepage as it should be.
3) Not correctly nesting parent/child relationships – e.g. Reviews/Ratings not appropriately nested under Product schema or on actual product pages. Using Product schema on eComm category pages instead of properly using CollectionPage or ProductCollection schema.
4) General missed opportunities – e.g. parent schema types lacking specification (what type of LocalBusiness?) or not including adequate detail.
Like any core piece of a larger channel or strategic initiative like SEO, it's important to have a system in place for reviewing and assessing your structured data. This involves addressing errors in GSC, evaluating existing schema relationships, and looking for further improvements. For most brands, quarterly assessments are a good cadence.
Also, you don't have to feel limited to what's documented in Google Guidelines or simply doing what others are doing. There's a range of schema types that can uniquely support your brand and the content that represents it.
Aside from SEO and attaining certain SERP features, it can be a way to communicate the meaning and top information about your page content. Because structured data is a closed language set, it is easier to convey meaning and eliminate confusion/ambiguity.
Now that we've outlined what we envision our Schema strategy to look like, the next major step is implementation. This phase will take different shapes depending on the site and the current structured data that's been previously deployed, but the fundamentals remain consistent.
For sites with schema already in place, we'll want to first audit the existing markup. In doing so, we're looking to identify improvements that can be made, errors to fix, and missed opportunities.
Next is the process of mapping the schema types to be implemented and building the semantic relationships between types. This is where your visualization map can be very useful, as you can see the relationship or connections between types and where they exist on the site.
Auditing is a critical component to ensure your schema is functioning as it should be.
It's not all that difficult, either. Google's Rich Result Test tool makes it easy to assess specific URLs and syntax ("code") to see which types of structured data are detected, and which items may be valid or invalid. And if invalid, the tool helps in understanding why.
There are certain types of schema errors that tend to occur more frequently than others. These are largely due to improper formatting, missing fields, or general syntax errors. Below are some of the common schema types that encounter issues and why.
1) Parsing errors
2) Date/time not in ISO 8601 format in field "uploadDate"
3) X is not a known valid target type for the Property
5) Unknown Type
1) Provider property has to be identified as either a person or an organization
Incorrect:
Correct:
2) sameAs property can have multiple links and the syntax changes when multiplier URLs are added.
3) A value for the field is required – when a property field doesn't have value
Identify if that is due to manual or dynamic errors - We usually find relevant Wikidata or Wikipedia entries e.g. a single and multiple sameAs properties:
Single-source:
Multiple-sources:
Since this is rather a manual process that's implemented either manually or in bulk (dynamically), the errors should be revised accordingly.
4) Accidentally using double quotes inside a field (instead of single quotes), which will break the field and everything after it.
Make sure to replace any double quotes within a field with single quotes instead.
It happens fairly often that when you copy and paste code from a document (Google Docs or Microsoft Word), the special characters (" or ') are no longer the correct character causing parsing errors.
To fix, use a program that's made for code, such as Sublime Text or Microsoft Visual Studio Code (tons of extensions available for either to help with formatting too)
There are certain post-launch activities that help ensure your structured data is in place and working properly. Here's a rundown on what to assess after deploying your schema markup.
First, let's make sure Google has crawled and indexed the page(s) containing the newly implemented schema markup. We can do this by:
Once we can confirm that Google has in fact indexed our page(s), and schema scripts alongside it, we can ensure schema is properly implemented and monitor its impact.
You'll also want to periodically check in with any issues detected with your schema enhancements in GSC. There are error reports conveniently organized for you that flag any critical issues. Or you can click on each type of schema under the enhancements drop-down in GSC to see if any items are invalid.
If you've implemented schema at scale, you can oftentimes identify patterns in certain errors/warnings. Depending on the sheer size of your site and the extent of structured data markup involved, you'll want to keep up with this monitoring effort on a monthly or quarterly basis.
In addition to periodically monitoring your schema, you'll want to measure and convey the impact of your efforts. There are a lot of ways you can do this, several of which are shown in this handy resource from SchemaApp.com. We also break down a few examples of how to measure the impact of schema with our reporting processes.
In GSC, an overall report will include general search metrics across all active schema types, providing a high-level overview of how impactful the structured data is performing as a whole. This overall view might be enough for certain reporting needs, however, we can drill into the activity of specific types (more on this below).
For an overall report, we can consider comparing:
To filter your report based on certain schema types of interest, select type under the search appearance filter:
Here's an overview of search appearance metrics based on Recipe rich results:
Alternatively, we can also include search appearance metrics of Recipe Gallery:
Also, you can also cover the validity or "health" of the structured data and communicate an issue-free setup via the enhancement report:
Additional information is provided under each rich result report. These insights can point to opportunities where missing fields can be populated and item appearance can be improved. In short, this adds a layer of action or potential next steps to your report.
Or, in cases in which you have invalid structured data or critical issues present, such as with the Review snippet schema below, you can also call these out as errors to fix in your report.
Although these matters are highly technical, there's no shortage of creativity that can be employed when reporting on schema and articulating your findings. Distill what matters most to the teams and stakeholders involved, and always consider what action you can take next to further optimize search appearance and rich results.
As a company that's deeply rooted in all things tech SEO, our team is comprised of highly experienced specialists in the world of schema and we help brands leverage the search impact of rich results. If you're struggling to make schema work for your site, it's likely that we can help. Get in touch with us and let's get the ball rolling.