Last year, Google expanded the Knowledge Graph to use data extracted (*cough* scraped) from the index to create answer boxes. Back in October, I wrote about a failed experiment. One of my posts, an odd dive into Google's revenue, was being answer-fied for the query "How much does Google make?":
Objectively speaking, even I could concede that this wasn't a very good answer in 2014. I posted it on Twitter, and David Iwanow asked the inevitable question:
Enthusiasm may have gotten the best of us, a few more people got involved (like my former Moz colleague Ruth Burr Reedy), and suddenly we were going to fix this once and for all:
There Was Just One Problem
I updated the post, carefully rewriting the first paragraph to reflect the new reality of Google's revenue. I did my best to make the change user-friendly, adding valuable information but not disrupting the original post. I did, however, completely replace the old text that Google was scraping.
Within less than a day, Google had re-cached the content, and I just had to wait to see the new answer box. So, I waited, and waited… and waited. Two months later, still no change. Some days, the SERP showed no answer box at all (although I've since found these answer boxes are very dynamic), and I was starting to wonder if it was all a mistake.
Then, Something Happened
Last week, months after I had given up, I went to double-check this query for entirely different reasons, and I saw the following:
Google had finally updated the answer box with the new text, and they had even pulled an image from the post. It was a strange choice of images, but in fairness, it was a strange post.
Interestingly, Google also added the publication date of the post, perhaps recognizing that outdated answers aren't always useful. Unfortunately, this doesn't reflect the timing of the new content, but that's understandable – Google doesn't have easy access to that data.
It's interesting to note that sometimes Google shows the image, and sometimes they don't. This seems to be independent of whether the SERP is personalized or incognito. Here's a capture of the image-free version, along with the #1 organic ranking:
You'll notice that the #1 result is also my Moz post, and that result has an expanded meta description. So, the same URL is essentially double-dipping this SERP. This isn't always the case – answers can be extracted from URLs that appear lower on page 1 (although almost always page 1, in my experience). Anecdotally, it's also not always the case that these organic result ends up getting an expanded meta description.
However, it definitely seems that some of the quality signals driving organic ranking and expanded meta descriptions are also helping Google determine whether a query deserves a direct answer. Put simply, it's not an accident that this post was chosen to answer this question.
What Does This Mean for You?
Let's start with the obvious – Yes, the v2 answer boxes (driven by the index, not Freebase/WikiData) can be updated. However, the update cycle is independent of the index's refresh cycle. In other words, just because a post is re-cached, it doesn't mean the answer box will update. Presumably, Google is creating a second Knowledge Graph, based on the index, and this data is only periodically updated.
It's also entirely possible that updating could cause you to lose an answer box, if the new data weren't a strong match to the question or the quality of the content came into question. Here's an interesting question – on a query where a competitor has an answer box, could you change your own content enough to either replace them or knock out the answer box altogether? We are currently testing this question, but it may be a few more months before we have any answers.
Another question is what triggers this style of answer box in the first place? Eric Enge has an in-depth look at 850,000 queries that's well worth your time, and in many cases Google is still triggering on obvious questions ("how", "what", "where", etc.). Nouns that could be interpreted as ambiguous also can trigger the new answer boxes. For example, a search for "ruby" is interpreted by Google as roughly meaning "What is Ruby?":
This answer box also triggers "Related topics" that use content pulled from other sites but drive users to more Google searches. The small, gray links are the source sites. The much more visible, blue links are more Google searches.
Note that these also have to be questions (explicit or implied) that Google can't answer with their curated Knowledge Graph (based on sources like Freebase and WikiData). So, for example, the question "When is Mother's Day?" triggers an older-style answer:
Sites offering this data aren't going to have a chance to get attribution, because Google essentially already owns the answer to this question as part of their core Knowledge Graph.
Do You Want to Be An Answer?
This is where things get tricky. At this point, we have no clear data on how these answer boxes impact CTR, and it's likely that the impact depends a great deal on the context. I think we're facing a certain degree of inevitability – if Google is going to list an answer, better it's your answer then someone else's, IMO. On the other hand, what if that answer is so complete that it renders your URL irrelevant? Consider, for example, the SERP for "how to make grilled cheese":
Sorry, Food Network, but making a grilled cheese sandwich isn't really that hard, and this answer box doesn't leave much to the imagination. As these answers get more and more thorough, expect CTRs to fall.
For now, I'd argue that it's better to have your link in the box than someone else's, but that's cold comfort in many cases. These new answer boxes represent what I feel is a dramatic shift in the relationship between Google and webmasters, and they may be tipping the balance. For now, we can't do much but wait, see, and experiment.
Very interesting Pete. I've tested this with similar results. It is strange that Google maintain the "fact" even after re-crawling the answer page.
I also did an experiment last yeah on attempting to correct Freebase/Wikidata fueled knowledge panels: https://www.andrewisidoro.co.uk/blog/microsofts-new-ceo-breaking-news-knowledge-graph/
This appeared to adapt to change much more readily...
I suspect Google is creating a second database for these index-based answers, and maybe that data just isn't getting updated very often. Unlike your experiments with the core Knowledge Graph, it doesn't seem like Google should have to validate the new answer against other sources, but I suppose that's possible, too. They may have some time period or process for determining if the new answer is trustworthy.
Yes I agree, a second data corpus seems he most likely solution. Although I'm still interested in how they'll respond to conflicting data from different databases. I.e if the "freebase-index" still had George Bush as president which data would be prefered?
Such an interesting element of SEO at the moment.
Probably you are right !
It's interesting that they might not be updating these answer boxes as often as possible, as they are almost always the source of information for the user, when available. It would be beneficial to Google and Google users for the algorithms and equations for the answer boxes to be updated.
We have been working very hard in the last few months to get into the "Answers Box" and we have had some success, but its very hit and miss.
We have noticed though, that our rankings have increased when we get into the answers box - not always to position 1, but there has been an increase.
It has affected Organic traffic to them pages - but as you mentioned - its you or your competitors.
Andy, would it be the case that because of your efforts to produce helpful answers you have enhanced your content and therefore a natural increase in rankings rather than a directly attributed rise due to presence in the Answer box. The quest to get in the answer box pushes us in the direction of better content. Wins for everybody
Interesting - I don't have a lot of before/after data on rankings, unfortunately, but we are starting to dig into some of that.
Hey Dr. Pete! Just getting a refresher on the ins and outs of featured snippets and wanted to know if there is a specific way to mark up your website content at all to improve the chances of having Google pull your content as the featured snippet? Or, is it simply who answers the question the best, the most concise, and has authority/rankings already?
Thanks again for such a detailed post!
We're not seeing any connection to specific mark-up, such as schemas. Google seems to be looking for eligible text anywhere they can find it. The content-quality factors Google uses are already pretty complex (and they haven't revealed these publicly), but a couple of things I've found help:
(1) As STAT's research showed, try to match your format to the current snippet. If it's a list or table, for example, you may want to write toward that format.
(2) Consider using the "reverse pyramid" approach journalists use. Summarize the page at the top and then dive into details. Write a short paragraph that would work well in a Featured Snippet. I think this is also good for web users, who often skim and needed to get the "scent" of your content as quickly as possible.
That's fascinating but I'm wondering, did this happen because of MOZ's domain power? I'm pretty sure, if any other site did it, they might not clinched that sacred spot again.
So far, every 2nd-generation answer box I've seen comes from an organic result on page 1, so yes - it does take a fair amount of authority. After that, it seems to be a pretty crude matching process right now. Your title/text have to be well aligned with the question. I suspect that will get more sophisticated over time.
Well if that's going to be more sophisticated then I think it will be more cool. Imagine, sites are competing with each other in terms of producing great & relevant content that eventually gonna help the readers. :)
Very interesting post.!!!!
How about this answer box - img
Good one - yeah, they've still got some work to do :)
That is funny although to be honest I always find it funny when someone causes Google to look silly.
For those who know how to make a grilled cheese, there isn't much to it. Those who don't know however, will be stumped by step two. [What sandwich are they talking about!!!] There will always be room for fuller explanations.
Informative and Nice post to read on answer search results.
I wasn't even aware at first that the answers box that Google uses was supplied by what a specific author may have written. I had always assumed that it was taken from Google's index of data stored or even it's own knowledge base. I might be interested to see if there are any questions I can hope to answer which Google would post in that style.
So is Google just using the information from a normal page to create these answer boxes? I have seen them before but never really looked in to it too deeply. It's like a reward for being accurate and helpful :)
Yes, this style of answer box is coming directly from indexed pages. It's a combination of authority and matching the question well. At the moment, they don't just accuracy very well, but I think your "reward for being accurate" statement is very aligned with where Google wants to be over the next 2-3 years.
Hi Dr. Pete,
Is this the way Google is trying to generate answer boxes based also on user generated content authority signals? Maybe that could also play a major part in caching of a webpage. Recently Bill Slawaski wrote a post on recent Google patent on "How Google May Rank Users In Streams". What do you think?
Well I have been seeing these answer boxes for quite a while, the only thing that bothers me is Google is scraping content from other websites and asking webmasters not to do so, isn't it a bit diplomatic approach? What I have observed in this regard is that Google does not want users to leave their platform to fetch information from other websites but remain intact and find all the information right there.
Wow! $29.3 billion really surprised me. Its unbelievable man, anyway Google can because he grab first position in WWW...
It is interesting that having your answer in the box is better than having a competitor's answer. The likelihood being that in order to have your answer in the box you need to provide the most relevant and comprehensive answer for the user, which could in turn give them all the information they need and therefore they would not visit your site. It all depends on the complexity of the question and the inquisitiveness of the user...horses for courses! Quite the predicament.
You bring up an interesting point Dr. Pete! Since Google’s algorithm will begin ranking pages based on factual information, index-based answers could be a testing ground for fine-tuning the process. If so, that process is most likely being tweaked even as we speak, which would possibly account for the delay in posting the new content. Information contained in the knowledge graph will likely be determined by how an infinite number of users search and what they're searching for. That process may not always get me to my destination as quickly as I'd like though. Thanks for another insightful article.
Dr. Peter,
This is very interesting. I've seen Knowledge Graph results one day for a phrase, and then no Knowledge Graph the next. It seems to fluctuate quite a bit.
I agree - it's a good idea to try to show up in Google's Knowledge Graph vs. not show up there. It seems like more and more they're pushing people to stay within Google even for Local Search with the recent addition of new photo opportunities for My Business listings.
Did you use special schema markup in your original post to help the Knowledge Graph pull this answer (I only see schema.org/article)?
Hi Dr. Pete
It's an amazing post. Have you conducted any research to know how can we have our link in that box? If anyone in the industry can conduct research, you can do it. Having great answer on my site is not enough for having link in answer box, there would surely be other parameters invloved in this. I think someone with enough resources and PR should try things on tones of websites.
We're working on take a SERP where someone else has their URL in the answer box and trying to replace them - that seems like the logical next step. It's a bit hard to force an answer box to exist. Right now, Google has to decide that the query is the kind that can be "answered", in this sense. If they don't think the query can be answered, I doubt you can force that.
Thanks for your reply,
There are two cases.
The only answer to your first question is authority and Trust Rank, if your website qualifies for both of them you can be the part of this prime spot. As a matter of fact both these factors need time and lots of it.
I often feel that knowledgebase answers are not very satisfying - they work well for simple raw facts like what day is mothers day and the like but even for simple recipes like say 'how to make pancakes' whilst there is a knowledgebox I don't really see Google as an authority, I may want a video or some more information to expand upon the initial question / answer. The credit score example in Eric's post is a great example - Google's high level answer is just not cutting it for me and I would want way more and to know more about the source.
In my mind there will always be the need for subject matter experts so the 'jack of all trades master of none' phrase springs to mind which ironically pops up a knowledgebox definition of the phrase itself if searched in Google.
Very interesting information, I'm looking for the day when submissions are updated in real time. Perhaps updated could be based on your historical contributions. I know Google attempted this to some degree with authorship but maybe there's an alternative route they will pursue (or are already) in the future. I think it would be nice if the listed results in knowledge/answer box contained branding of the business providing the information deemed most relevant or useful. Hmmm, it will be interesting to see where this goes over the next couple years. It would be awesome if local businesses could tap into these listings to connect better with local customers.
Up until this point the Answer Box has been somewhat of an obscure portion of Google's page. It does, however offer value because it provides a centralized target to aim for--in terms of improving the relevancy of content and aiming for that search question. On the other hand, if you do manage to get into an 'answer box' and your CTR improves it's somewhat like hitting the search lottery since we're not entirely sure the criteria Google uses for selecting these so it's difficult to replicate your efforts.
To answer the question "do you want to be an answer?" Yes, because you don't want to not be an answer.
Very interesting post. i also tried the same by searching on various topic and i found the same..