"What is this page about?"
As marketers, helping search engines answer that basic question is one of our most important tasks. Search engines can't read pages like humans can, so we incorporate structure and clues as to what our content means. This helps provide the relevance element of search engine optimization that matches queries to useful results.
Understanding the techniques used to capture this meaning helps to provide better signals as to what our content relates to, and ultimately helps it to rank higher in search results. This post explores a series of on-page techniques that not only build upon one another, but can be combined in sophisticated ways.
While Google doesn't reveal the exact details of its algorithm, over the years we've collected evidence from interviews, research papers, US patent filings and observations from hundreds of search marketers to be able to explore these processes. Special thanks to Bill Slawski, whose posts on SEO By the Sea led to much of the research for this work.
As you read, keep in mind these are only some of the ways in which Google could determine on-page relevancy, and they aren't absolute law! Experimenting on your own is always the best policy.
We'll start with the simple, and move to the more advanced.
1. Keyword Usage
In the beginning, there were keywords. All over the page.
The concept was this: If your page focused on a certain topic, search engines would discover keywords in important areas. These locations included the title tag, headlines, alt attributes of images, and throughout in the text. SEOs helped their pages rank by placing keywords in these areas.
Even today, we start with keywords, and it remains the most basic form of on-page optimization.
Most on-page SEO tools still rely on keyword placement to grade pages, and while it remains a good place to start, research shows its influence has fallen.
While it's important to ensure your page at a bare minimum contains the keywords you want to rank for, it is unlikely that keyword placement by itself will have much of an influence on your page's ranking potential.
2. TF-IDF
It's not keyword density, it's term frequency–inverse document frequency (TF-IDF).
Google researchers recently described TF-IDF as "long used to index web pages" and variations of TF-IDF appear as a component in several well-known Google patents.
TF-IDF doesn't measure how often a keyword appears, but offers a measurement of importance by comparing how often a keyword appears compared to expectations gathered from a larger set of documents.
If we compare the phrases "basket" to "basketball player" in Google's Ngram viewer, we see that "basketball player" is a more rare, while "basket" is more common. Based on this frequency, we might conclude that "basketball player" is significant on a page that contains that term, while the threshold for "basket" remains much higher.
For SEO purposes, when we measure TF-IDF's correlation with higher rankings, it performs only moderately better than individual keyword usage. In other words, generating a high TF-IDF score by itself generally isn't enough to expect much of an SEO boost. Instead, we should think of TF-IDF as an important component of other more advanced on-page concepts.
3. Synonyms and Close Variants
With over 6 billion searches per day, Google has a wealth of information to determine what searchers actually mean when typing queries into a search box. Google's own research shows that synonyms actually play a role in up to 70% of searches.
To solve this problem, search engines possess vast corpuses of synonyms and close variants for billions of phrases, which allows them to match content to queries even when searchers use different words than your text. An example is the query dog pics, which can mean the same thing as:
• Dog Photos • Pictures of Dogs • Dog Pictures • Canine Photos • Dog Photographs
On the other hand, the query Dog Motion Picture means something else entirely, and it's important for search engines to know the difference.
From an SEO point of view, this means creating content using natural language and variations, instead of employing the same strict keywords over and over again.
Using variations of your main topics can also add deeper semantic meaning and help solve the problem of disambiguation, when the same keyword phrase can refer to more than one concept. Plant and factory together might refer to a manufacturing plant, whereas plant and shrub refer to vegetation.
Today, Google's Hummingbird algorithm also uses co-occurrence to identify synonyms for query replacement.
Under Hummingbird, co-occurrence is used to identify words that may be synonyms of each other in certain contexts while following certain rules according to which, the selection of a certain page in response to a query where such a substitution has taken place has a heightened probability.
Bill Slawski - SEO by the Sea
4. Page Segmentation
Where you place your words on a page is often as important as the words themselves.
Each web page is made up of different parts—headers, footers, sidebars, and more. Search engines have long worked to determine the most important part of a given page. Both Microsoft and Google hold several patents suggesting content in the more relevant sections of HTML carry more weight.
Content located in the main body text likely holds more importance than text placed in sidebars or alternative positions. Repeating text placed in boilerplate locations, or chrome, runs the risk of being discounted even more.
Page segmentation becomes significantly more important as we move toward mobile devices, which often hide portions of the page. Search engines want to serve users the portion of your pages that are visible and important, so text in these areas deserves the most focus.
To take it a step further, HTML5 offers addition semantic elements such as <article>, <aside>, and <nav>, which can clearly define sections of your webpage.
5. Semantic Distance and Term Relationships
When talking about on-page optimization, semantic distance refers to the relationships between different words and phrases in the text. This differs from the physical distance between phrases, and focuses on how terms connect within sentences, paragraphs, and other HTML elements.
How do search engines know that "Labrador" relates to "dog breeds" when the two phrases aren't in the same sentence?
Search engines solve this problem by measuring the distance between different words and phrases within different HTML elements. The closer the concepts are semantically, the closer the concepts may be related. Phrases located in the same paragraph are closer semantically than phrases separated by several blocks of text.
Additionally, HTML elements may shorten the semantic distance between concepts, pulling them closer together. For example, list items can be considered equally distant to one another, and "the title of a document may be considered to be close to every other term in document".
Now is a good time to mention Schema.org. Schema markup provides a way to semantically structure portions of your text in a manner that explicitly define relationship between terms.
The great advantage schema offers is that it leaves no guesswork for the search engines. Relationships are clearly defined. The challenge is it requires webmasters to employ special markup. So far, studies show low adoption. The rest of the concepts listed here can work on any page containing text.
6. Co-occurrence and Phrase-Based Indexing
Up to this point, we've discussed individual keywords and relationships between them. Search engines also employ methods of indexing pages based on complete phrases, and also ranking pages on the relevance of those phrases.
We know this process as phrase-based indexing.
What's most interesting about this process is not how Google determines the important phrases for a webpage, but how Google can use these phrases to rank a webpage based on how relevant they are.
Using the concept of co-occurrence, search engines know that certain phrases tend to predict other phrases. If your main topic targets "John Oliver," this phrase often co-occurs with other phrases like "late night comedian," "Daily Show," and "HBO." A page that contains these related terms is more likely to be about "John Oliver" than a page that doesn't contain related terms.
Add to this incoming links from pages with related, co-occurring phrases and you've given your page powerful contextual signals.
7. Entity Salience
Looking to the future, search engines are exploring ways of using relationships between entities, not just keywords, to determine topical relevance.
One technique, published as a Google research paper, describes assigning relevance through entity salience.
Entity salience goes beyond traditional keyword techniques, like TF-IDF, for finding relevant terms in a document by leveraging known relationships between entities. An entity is anything in the document that is distinct and well defined.
The stronger an entity's relationship to other entities on the page, the more significant that entity becomes.
In the diagram above, an article contains the topics Iron Man, Tony Stark, Pepper Potts and Science Fiction. The phrase "Marvel Comics" has a strong entity relationship to all these terms. Even it only appears once, it's likely significant in the document.
On the other hand, even though the phrase "Cinerama" appears multiple times (because the film showed there), this phrase has weaker entity relationships, and likely isn't as significant.
Practical tips for better on-page optimization
As we transition from keyword placement to more advanced practices of topic targeting, it's actually easy to incorporate these concepts into our content. While most of us don't have the means available to calculate semantic relationships and entity occurrences, there are a number of simple steps we can take when crafting optimized content:
- Keyword research forms your base. Even though individual keywords themselves are no longer enough to form the foundation of your content, everything begins with good keyword research. You want to know what terms you are targeting, the relative competition around those keywords, and the popularity of those terms. Ultimately, your goal is to connect your content with the very keywords people type and speak into the search box.
- Research around topics and themes. Resist researching single keywords, and instead move towards exploring your keyword themes. Examine the secondary keywords related to each keyword. When people talk about your topic, what words do they use to describe it? What are the properties of your subject? Use these supporting keyword phrases as cast members to build content around your central theme.
- When crafting your content, answer as many questions as you can. Good content answers questions, and semantically relevant content reflects this. A top ranking for any search query means the search engine believes your content answers the question best. As you structure your content around topics and themes, make sure you deserve the top ranking by answering the questions and offering a user experience better than the competition.
- Use natural language and variations. During your keyword research process, it's helpful to identify other common ways searchers refer to your topic, and include these in your content when appropriate. Semantic keyword research is often invaluable to this process.
- Place your important content in the most important sections. Avoid footers and sidebars for important content. Don't try to fool search engines with fancy CSS or JavaScript tricks. Your most important content should go in the places where it is most visible and accessible to readers.
- Structure your content appropriately. Headers, paragraphs, lists, and tables all provide structure to content so that search engines understand your topic targeting. A clear webpage contains structure similar to a good university paper. Employ proper introductions, conclusions, topics organized into paragraphs, spelling and grammar, and cite your sources properly.
At the end of the day, we don't need a super computer to make our content better, or easier to understand. If we write like humans for humans, our content goes a long way in becoming optimized for search engines. What are your best tips for on-page SEO and topic targeting?
Special thanks to Dawn Shepard, who provided the images for this post.
For those wishing to explore these topics on your own and more in-depth, here are few select sources for easy reference.
1. Keyword Usage
• A Visual Guide to Keyword Targeting and On-Page SEO
2. TF-IDF
• tf–idf
3. Synonyms and variants
• Helping computers understand language
• How Google May Substitute Query Terms with Co-Occurrence
4. Page Segmentation
• Systems and methods for analyzing boilerplate
• Document page segmentation in optical character recognition
5. Semantic Distance and Term Relationships
• Google Defines Semantic Closeness as a Ranking Signal
• Document ranking using word relationships
6. Co-occurrence and Phrase Based Indexing
• Phrase Based Indexing and Semantics
• Ranking Webpages Based upon Relationships Between Words
7. Entity Salience
• A New Entity Salience Task with Millions of Training Examples
• Teaching machines to read between the lines
All these in-depth posts are really helpful; Thank you so much for sharing and for writing this amazing post. :)
So cool! Thanks for sharing this... And all the great links too!!!
Cyrus,
I have developed a tool, that does (something similar to) a TF-IDF & close variants analysis of a given term and displays it in a more or less nice chart and enables you to compare your text to the already ranking competitors: https://www.text-tools.net/
Awesome article Cyrus, very usefull links! Thanks!
Thank you so much for your post. It is very usefull.
Thanks a ton!
thanks it was really helpful
Thanks for these resources Cyrus.
One way I've described the use of synonyms, close variants, and semantic distance to clients (especially if someone on their team is creating the copy) is with a made up story about Google being a person who knows only a little English or maybe ESL (English as a Second Language).
The idea came when I was thinking about my travels to non-English-speaking countries. Because I'm [sadly] only fluent in English, I really had to rely on a form of 'Spanglish' and hope to encounter people who knew English. When I did, it was a very broken conversation, but we could usually get our points across. The more I explained my question or whatever it was I was trying to share, the more likely I would say a related word/phrase that they understood. Maybe I said a bunch that they couldn't quite follow, but then I would say something that they understood & that even helped them understand the earlier part of the conversation. The more distance between key phrases, the less likely the listener can sort of connect all the pieces for a better understanding of what you're trying to say.
Obviously not a very scientific explanation, but very effective with clients. Maybe now I can simply share this article with them! ;) Great post, Cyrus!
Wonderful analogy! We often assume our content readers know much more than they actually do - and the same applies to search engines. It's as if we write for an audience that already knows what we're talking about, when we instead we need to produce content for a room of strangers.
In this case, Google is the stranger from a foreign land, looking for understanding and trying to piece as many clues together as possible. How do you talk to someone who only understands fragments of what you are saying, and can't speak your body language?
Talking to Google is a lot like talking to an intelligent alien from another world. They may not speak the language, but if you give them enough clues they will gain understanding.
This is the best analogy I've heard regarding how to structure content and keywords. Thanks for sharing!
Thank you, Cyrus. Very well done, and very enjoyable to read.
Posts like this would never happen without your original research. Thanks for making it possible Bill.
Thank you Cyrus for this excellent post. Bill thank you for your part of the research that went into it.
I would love to see more posts like this such high-quality and so relevant to what everyone building sites and doing work on sites needs today.
I love this!
Thanks again, Cyrus, for this post.
When I got into SEO a couple years ago, I quickly found myself ghostwriting for SEO companies— with specific instructions to, well, keyword stuff.
Getting away from keyphrase overuse and approaching the webpage from a user's perspective is more difficult than I thought... My brain is trained to put keywords on the page first, and write around them as best I can afterwards.
WRONG!
Besides, what good are keywords, when they're unknown to the layman?
I have spent way too much time on-page optimizing for phrases like "SEO Copywriter," when, if you stop and think about it, the client you want most— the layman— will probably never search for that.... "How to get my website ranking better" or "Expert writer for my website" is, in my opinion, of more value.
...Which begs the question— what is the best way to conduct keyword research?
I'm tired of free tools, with 2 or 3-word keyphrases that, without inbound links... Ha— GOOD LUCK!
Writing for the web (notice I didn't say SEO) is trickier than most may think. And I think that by immersing myself in "SEO," I may have taken a step backwards, losing focus on the original question you posed here:
~ What Is This Page About? ~
You, sir, have inspired me to revamp my own content; stop obsessing over keywords (and their usage); and go back to the basics of thinking like a client, and doing everything imaginable to answer his or her questions to the best of my ability.
Honest content is killer content. And that's what I'm all about.
Thanks for writing a great post that caught my attention, and my way of working...
Yes. Yes. Yes!
A copywriter with no knowledge of SEO can create a #1 page that ranks for 100s of phrases, as long as it targets user intent, covers the subject compellingly, uses good structure, and earns sufficient attention (links) from external sources.
Add to this a small amount of understanding as to how search engines work - and if you care to, a bit of research - and you have a powerful combination to pack a 1-2 punch.
Thank you Sean. Your comment is very inspirational to me.
They key is to keep it as natural as possible. Yes, keywords still play an important role but you should always write for your readers first. There's a good chance you'll be including your keywords and close variations naturally.
Totally! The ultimate goal is to create content and train others to create content without really thinking about it. We can teach others to write more completely, answer more questions, and organize their content in better ways.
Also, we can scale these solutions on larger websites. For example, the typical page on Rotten Tomatoes, even though it's mostly an aggregation from other sources, happens to leverage these concepts extremely well.
Thanks to Mr. Cyrus Shepard to share this very important information about on-page SEO, to use keywords in content or meta.
This is an awesome post. I've been looking a lot lately about topical relevance, term relationship, and variants in order to improve our content readability, yet maintain basic keyword principles. I've also used ConceptNet 5 (https://conceptnet5.media.mit.edu/) as a tool to generate those variants and identify the semantic distance between the two terms. Thanks for the thorough elaboration Cyrus...looking forward to implementing some of these practical tips as well.
Great platform! It's been challenging for the search industry to create tools that identify relationships between words/concepts. In the past, for quick and dirty work researching related concepts I've used:
For entity research, ConceptNet 5 like you mentioned works well.
Great post Cyrus, much appreciated.
One thing I have been interested in is the use of synonyms. I read with interest:
"From an SEO point of view, this means creating content using natural language and variations, instead of employing the same strict keywords over and over again."
My understanding of natural language would mean a person generally chooses a variation to describe an entity and for the most part sticks to it. e.g. you say 'car' I say 'automobile'. I've never considered LSI a ranking factor.
Obviously query replacement is important to rankings since Hummingbird but using variations in your content to describe the same entity doesn't seem natural to me.
Agreed about LSI not being a ranking factor - I didn't mean to imply otherwise - and you have to sort through a lot of noise to find relevant phrases. I do like using it to get a sense of other related phrases that often appear with the keyword I'm targeting.
So much of this process is simply trying to figure out how people search, tackling the subject robustly, all while not over-manipulating and keeping everything "natural."
Easy as pie, right? ;)
Thanks Cyrus, I completely agree.
There's a lot of science involved in attempting to write naturally :)
I thought this article nailed it personally. Thanks again.
Hi Cyrus,
One more tool for your Topical Authority-themed arsenal: check out MarketMuse.
We've built a topic modeling/topic expansion technology that analyzes ranking content and generates related keywords.
It utilizes semantic analysis, graph analysis and natural language processing to return the most relevant bigrams and trigrams (two-word and three-word keywords).
In particular, check out our new Site Audit tool that we've just released. This identifies topical gaps across your content, telling you about content gaps that you didn't even know you had.
You can use this to build high-quality content on related topics. All that remains then is to do the link building to support it.
Aki
Co-Founder, MarketMuse
ConceptNet 5 is a very interesting tool. Thanks Charles for sharing it.
I did give it a try by testing with general keywords and the results were very insightful.
It seems however that the tool is limited in term of keywords in the database.
I hope the Tech community can expend on ConceptNet 5 tool because it's really useful for SEO keyword research and planing.
Thank you Cyrus. This points are very important day by day. Great post!
For the Italian audience, I've posted the Italian translation here: https://www.ideawebitalia.it/seo/6557/
Thanks for the excellent translation Marco!
Grazie mille!
Wow. A brain-breaker, but as a content writer, I'm happy to re-read until I distill every part into my landing pages and blog posts. I tend to write long, use a good network of synonyms and answer questions as much as possible. Quite the balancing act, but it's a fun challenge!
You've broken the code to content success without having to know a lot of complex SEO theory:
The last point is key. I think we could form an entire school of SEO thought around answering questions.
Excellent article. Summarises whole page content creation planning.
Thanks Cyrus.
Now-a-days we should prepare himself to create more powerful content and when we will put keywords variations as you mentioned in this blog then no doubt in that it will work for site like a magic. So ready guys to do something extraordinary to show your strength in front of search engine. Beside that Giant search engine is continuously focus on ""Content is King"" for today's ranking factor.
Amazing stuff Cyrus - the TF-IDF stuff is particularly interesting.
Not something I've ever come across before.
Thanks for the post!
Here's what I find most interesting about TF-IDF:
When we study it as a ranking factor all by itself, it doesn't correlate to higher ranking much better than individual keyword usage. Many people would stop right there.
But TF-IDF can be incorporated by other methods of topic analysis to identify primary phrases. If your word and/or term has a high TF-IDF, it can become a candidate for further analysis, such as phrase-based indexing. In this way, TF-IDF acts as a filter that identifies the most likely candidates on your page for further promotion.
On the other hand, TF-IDF has it's limitations, which solutions like entity salience hopes to address.
Better description about on-page SEO, I have not read. Excellent work Cyrus.
Thank you Panagiotis. Back in 2010, this post by Rand hugely inspired me: All Links are Not Created Equal: 10 Illustrations on Search Engines' Valuation of Links
That piece was incredibly empowering and helpful to advancing my knowledge of SEO. Ever since then, I've looked for ways to make search engine concepts simpler through cool visualizations and clear explanations. Thank you for the kind words.
Same here Cyrus. Rand's "All links are not equal" post was a moment of clarity for me too, especially because of the simple illustrations.
From an outsider's perspective, you've created the equivalent resource for on-page SEO.
A thousand points to you.
The 100th like for a great article! Love the illustrations. Simple. Effective.
Very informative post, thanks a lot for sharing all your research. "write like humans for humans" this will go a long way.
Cyrus - I think the people have spoken, and they want more posts like this. Deep SEO concepts that are brought to light to deepen knowledge. I like the fact that you said these concepts build on each other, it kept me engaged throughout the reading and thinking about how all of the points relate.
Great to get a summary of the more technical aspects. You and Bill S have an awesome way of wording complex principles in language non-techs can understand. Thanks :)
Appreciate the visual explanation of the relationship between keywords and rankings! :)
Great post! Thanks Cyrus :)
Great article Cyrus. I managed to rank a couple of sites quite well by using the majority of the above techniques without even utilising any serious link building tactics (obviously this has to do with the keyword difficulty as well).
What is your opinion about (optimised) video content as on page element?
We love video! I think it's gotten a bit harder after Google removed most video snippets for smaller sites. I tend to think of video as nether a positive nor negative ranking factor, but in the right situation can hugely benefit user experience.
I do suggest using transcripts when possible and following video SEO best practices.
Excellent and precious post for me ..Thank you so much dear Cyrus...
Cyrus, do you possess some sort of mind reading power? I was looking for an article that highlighted exactly what you wrote here. This will serve as an excellent piece for some of the content teams I work with. The images are great for sharing too. Excellent piece and thank you.
Question: I've been thinking about turning this into a PDF. Would something like that be helpful to your team?
Wow! I think it's been a deep analysis of how SEO works.
I'm a rookie on this item so I'll have to read it twice or three times to fully get it.
Thanks for share it!
So everything leads to natural, high quality writing. Thank you Cyrus.
Great indepth post as it was always.
Great post. I have wondered if Entity Salience influences local rankings? I am sure Google understands the relationship between phone area codes, postcodes, towns, counties and maybe even business districts. In London for example the Temple contains Barristers chambers. Most sites places these details in the footer or sidebars, I am interested to see how placing them in other elements (headers, titles tags) might improve rankings. Any thoughts?
Thanks, Cyrus! Thoroughly enjoyed reading your post. I love how item number 5's semantic definitions, relationships and usefulness have become the most powerful measurement of content relevance in SEO.
Thanks Cyrus. Excellent article. I really appreciate the clarity you've provided on effective keyword practices. I see immediate opportunities to improve my work. Thank you!
This is a very important article - SEO's who really want to know how to succeed with on-page optimization should carefully study this article and its related pages (linked). Bravo Cyrus!... this is top-drawer info :-) Thanks for sharing, and explaining things so carefully.
Thanks Andy. Greatly appreciated!
Thanks for a well thought out outline, Cyrus. I will be passing this info along to my clients that write their own blog posts. This covers many of the questions they have on what the content should contain, and the structure of that content for optimum on-page SEO. This will hopefully save me some time in re-organizing (re-writing) their posts.
Wow, fantastic technical insight. Thank you for sharing this with all of us. I'm definitely bookmarking this one and will implement it into my arsenal of content writing!
Well, I'm sold on the concept of an entity salience discovery tool. First person who can build one gets my money.
Oh, and awesome post.
Perfect post Cyrus. I like the point number 3 because I think that close variants are the "natural" language that Google is seeking with the new algorithms as hummingbird. Thanks again!
Thanks for the information, I am just getting SEO and any tips or enlightening of some of the keyword basics is greatly appreciated!
This a great post Cyrus. I currently teach SEO at NYU and I'm going to steal some of these on-page optimization images for my presentation ;)
Please do! We meant for these images to be shared/stolen/reused. (just be sure to include my picture :)
Great post Cyrus. One of the better articles I've read here outlining advanced SEO techniques and concepts.
I completely agree with all of this - a very informative post on content writing/ On-page SEO for the Semantic web, and we use many if not all of these techniques ourselves. But I honestly think that anyone who can write high quality, well-communicated and original content will naturally cover off most of these points without having to worry too much about the technicalities of on-page SEO. And this I find is one of the great things about content writing today for the Semantic Web - you cant really fake it any more.
Hi Mike,
I totally agree! There's really 2 ways to take advantage of these concepts, both very different:
First of all I would like to Thank You! Cyrus for making it easier for everyone to understand how on page optimization can be much more effective. I love the views you have presented here and really caught some interesting insights. Well Done! I appreciate your hard working on gather this highly informative illustration for us.
A refreshing post about on-page SEO with new points (instead of the same old tips re-posted over and over). Thanks :)
As someone who writes about SEO for a living, it gets continually harder to come up with anything new to say. :)
Search engines are less volatile these past few years after Panda/Penguin, and fresh techniques have become more elusive.
On the other hand, many people still do SEO like it's 2006. There's such a wealth of knowledge about how search engines operate that we mostly ignore because either we don't understand it yet, or it's difficult to measure. The truth is there, it's simply a little harder to dig out!
The knowledge is at our fingertips, if we care to take advantage :)
Nice Cyrus thank you for this. Great look into some key elements and good talking and thinking points for those involved with content/search. The conclusion I'm coming to here is that it's getting more and more reasonable to push clients to focus on users (as they should have) and see them rewarded in search more and more now and in the near future. This post provides great points of discussion when pushback is received.
"push clients to focus on users (as they should have) and see them rewarded in search more and more now and in the near future."
Spot on conclusion.
Always love to hear that. It's been a long battle over the years as I'm sure you know getting clients to understand and view search holistically instead of as a 1-off tactic. I'm just happy Google's advances over the last 5 years (and earlier) are really coming into play now and changing the landscape so that spam is ACTUALLY significantly harder to do, and businesses must think long-term and holistic.
Thanks again, always enjoy your posts and thoughts.
Excellent illustration of how complicated on page optimization is, yet what the average webmaster needs to understand conten-wise is pretty simple:
If you know your topic and write about it well, then your keywords (and many other things people actually search) will already be there ready to be found.
Thank you Cyrus, a lot of interesting points to dig in :)
Hi Cyrus,
This article is just perfect for our new content. :)
Thank you!
-Lyke
Great article, very usefull tips, thanks for share¡¡¡
A great, enjoyable, insightful read by Cyrus. As always :)
Awesome post!
Great to finally read some fresh and logical SEO insights again and learn something new (TF-IDF).
Thanks very much, Cyrus.
These are great examples of how on-page can be done right and on the next level. So many time the same word appearing in content and then not ranking as well, but more than anything it takes away the natural flow. This is not only great for search but also the client that has to proof the content prior to putting it on the site. The less redundant you can make a term by putting in some variance the better.
The section of the page segmentation also at work I find developers and designers asking me does it really matter if this content shows up, and I am always battling with them explaining why keep the juicy stuff in and let the sidebars go, but they never listen.
Last the Entity Salience is another great stand point for any client that has a product that branches out into so many other realms. I plan on utilizing your marvel comic tree with my team to show how we can find keyword variations off of the main search topic which this project call for organic dog products.
As always thanks for the great post, knowledge, and overall support to the SEO community.
Interesting. When we were designing the graphics (with my wife, who loves doing this type of work and is amazingly good at it) our goal was to make images that would be borrowed, copied and shared all over the Internet and within workplaces.
Great article, as always from Cyrus.
Aw shucks. Thanks Jakub!
Wow. I get so sick of reading new content that pops up on my favorite sources because it's all people copying each other repeating the same content/ideas and every now and then I come across one article that makes it all worth it. This is that article. Great stuff!
Cyrus - thank you - this is oh-so-timely. This week I've been introducing SEO to our first bunch of MSc Digital Marketing students, and I actually used Moz' 'A Visual Guide to Keyword Targeting and On-Page SEO' as one of the resources. Your in-depth guide here is an excellent (albeit more advanced) companion!
I also love your (and Moz') scientific / research-based approach (very evident here). Too much anecdotal and / or simply wrong advice out there but in Moz I always trust. Practical tip 6 at the end - comparison of a webpage to a (good) University paper is absolutely spot on. I have the pleasure of marking many University dissertations and papers, and students who have the structure right with "proper introductions, conclusions, topics organized into paragraphs, spelling and grammar" pretty much tend to be the ones with the best content too - the most persuasive communicators overall, the 'A' students (with the best marks).
Thanks Simone. We love original research, experiments and analysis. Resources are always tight, but we're striving for Moz to deliver more research in the future.
And, I believe your observations about ranking potential and content quality are spot on.
This is an amazing resource I will be revisiting often. Thank you!
Nice Post very well written with images it gives more visual and explicit understanding how thinks are work under the hood. term frequency (tf)–inverse document frequency (idf) numericalstatistic show how important relevant and weighted documents are it helps Search engine to determine How some words are generally more common than others.
Thanks for that summary! I had no idea that the location of content on your page actually makes a difference. Really helpful stuff.
I'm a relatively new at SEO and have been reading so many articles about this stuff, and I like this one a lot. I've also just started my own blog where I've been discussing the basics of digital marketing, informing those looking for a digital marketing career that the University of Michigan - Dearborn's degree program is the best around. Check out my new blog for more info! https://mymichigandigitalmarketingdegree.blogspot.com/
Helpful Blog! But i am looking to know more about point 6th and 7th. Can you?
Wow, great article. Now, i know hot write a perfect optimized article for SEO actions. Thanks you very much.
Thank you so much, Cyrus Shepard - its depth on-page article and nothing left more than it. repindia.com
"From an SEO point of view, this means creating content using natural language and variations, instead of employing the same strict keywords over and over again."
I understand that this is the way Google works, for English. Is this also for other languages? When doing competitor research, pages that rank nr. 1 have text with the exact keyword phrase repeated lots of times. So for both longterm and userfriendly articles, we will do it with the natural language with the variatons.
Q: Are the variations and natural language use applicable for other languages?
Hi there, I'm a little late to comment on this post and I haven't read through all of the comments yet so forgive me if I repeat a question/idea but I have to type it out while it is fresh in my mind.
With search engines now beginning to recognize synonyms and variations of words, how important is it now to continue to track and target singular and plural versions of keywords? I understand that search volume is still different for both versions of the word, but in writing out natural, targeted content, would both versions of the keyword begin to be recognized by search engines for that page? Or could they build off of each other? It seems almost as if keywords are starting to become more like key topics with semantic connectivity (just watched that WBF), and if you can discuss the key topic correctly, then why still track both versions of the keyword?
Excellent post!
Another great post I have seen in MOZ blog. Learnet a lot of thing from here .
Lots of thanks for this great post.
Wow, this is a absolute must read for everyone who is new to SEO. Like the beautiful and very straight visualization of crutial information. They should make it a required reading for all the wannabe SEO writers out there
Excellent article, thanks Cyrus. It's got me re-examining how much I do without thinking. I have a lot of writing to do this year and there are some very valuable pointers here.
Love your stuff Cyrus! These advanced posts keep getting better and better. Dont be shy, keep them coming!
CYRUS SHEPARD GOES INTO SEO BEAST MODE! Great post!
I must say a very helping post for the new SEO people like me :)
Hi Cyrus,
thank you very much for this post, it is very illuminating and helpful!
I have some doubts about a phrase contained in the second graphic (TF-IDF). You say: "...measueres the importance of a keyword phrase by comparing it to the frequency of the term in a large set of document"
After studying the TF-IDF (wiki and other posts) I formulated that idea: given a collection of documents (C), a document (D) belonging to the collection C and a term (T), TF-IDF returns the relevance of D respect T ("by comparing how often a keyword [T] appears [in D] compared to expectations gathered from a larger set of documents [C]" as you said).
It's very different from your definition! I feel like I'm missing something...
Hi Cyrus, Your point about using "natural variants" of keywords in content is important to consider when grouping keywords into themes. Marketers need to collect these words of similar meaning into the same groups so they can be mapped to individual pages of a site. In the old days, SEOs would create individual pages for each of the natural variants, but today that is search suicide.
Just found this pinned on your Twitter today. Absolutely love reading your content, Cyrus.
Hii, Cyrus I was looking for some advanced seo concept and found your article, it's very helpful to me and add extra value to my seo knowledge database, specially for page segmentation section. I really love to sharing it on my social network.
Great post! Thank You! Cyrus
What about the future of SEO
Cyrus,
Great article. Really. I've been struggling with a client's rankings who doesnt have a large amount of content yet and so I've been forced to use my on page real estate very strategically. These tips helped me re think my plan of attack and I feel much more confident moving forward.
Any advice in regards to adding more content to a Client's site as an SEO?
I still remember when the pages that position in google were the ones that had the keyword throughout the text, keyword text, keyword text, keyword text, making the content irrelevant and only this positioned by the keyword
Good job! but i don't robots google intelligent to know or understand hiden keywords so fast. At least right now
For a small businessman with a subscription to Moz and doing their own SEO this stuff is gold dust. Thank you Cyrus!!
The final point is possibly the most poignant - write like a human for humans. Repeating the same keyword over and over again is not only an old school tactic but also pretty boring for the user. Using synonyms will read better and also define your subject more clearly for Google. Add value for the user and you will most likely add value for Google.
Hello Cyrus,
Thank you for this information about on-page SEO, As i have read so many article about on-page seo to optimize my website, In mostly article i have got information "Content is King". But still i have one question, what should length of standard Article for SEO point of view?
My Website is: https://www.1strankseo.com
How much keyword should we target on a page?
Make texts for web pages it is more difficult than it seems. First you have to search the best keywords, write in content, order the content for relevance, etc.
Cheers
Hi,
in Germany wie use WDF*IDF. Unfortunately, this blog article is only available in German. But with a good translation software, hopefully you understand the contents: https://www.onpagedoc.com/blog/wdfidf-analyse-best-practice/
where should be keywords diplayed ?? if i have large amount of keywords ?
Since Google is getting better at semantics and user intend, the days of keyword density are certainty over.
Great article.
Instead of using phrase variants I personally prefer co-occurrences, this can greatly enhance your organic exposure. Obviously it is about relevancy and variations.
Hey Cyrus, You did a good piece of work and provides us depth knowledge how to optimize our content thank you so much for guiding us in right directions
Useful blog.. for the new SEO's like me.. :)
Fantastic article. Great work!
What an awesome post. I learnt something new. Thank you.
Thanks Cyrus! The post is very usefull, and pictures help to understand better what you're saying. There are some things I knew, but others are very interesting like "term frecuency-inverse document frequency". I've always thought that the most important words for Google are those that appears more times or at the begining of the text, but It seems this is not the only thing.
Really good and easy tips! We musnt't forget that we are writing for people and not for crawlers! If we don't forget that, maybe all will be more natural and our site will get better results on the search.
Great article Cyrus! The TF-IDF especially interests me - are there any tools that you know of that measure how a page is doing in this area? It seems almost all SEO graders focus on keyword usage and page segmentation (lesser) and we don't really have good tools for the other five. Would you agree with that? Thanks again for the awesome post!
Hi David,
To be honest, I no of no tools. That said, TF-IDF on a practical level isn't much more complex than keyword usage. If you use your main terms 1-2 times predominately in your text (and especially near the beginning) you should be fine. The goal is to simply help search engines identify your keyword as a potentially salient phrase, so they can do the rest of the text analysis after they have isolated the phrase as a candidate.
Hope that helps!
Excellent article and extremely useful. Thank you Cyrus.
Newbie to this website. Hope I could learn something new everyday from this site :)
Very detailed research and a very good and helpful post for us. thank you
Very well written. I share it with my co-workers in my office.
Thanks Cyrus.
A gold mine. Brilliant!
Great article!! Lot of things to test!
Thank you
Nice report,thank you for sharing your experiences!
Thanks Cyrus, so much information to intake from this article. Points are noted. Thanks for sharing.
Great post man, but not much understand about
term frequency–inverse document frequency (TF-IDF)
Great article Cyrus. On page SEO is getting more sophisticated. I was just writing a new post about on page SEO so this certainly helped!!
An excellent research article, thanks for sharing.
Thanks, very informative.
thank you for your research and post
I learn a lot from your moz post.
Stellar piece, Cyrus! This will definitely change the way SEO's approach keyword research moving forward. The 6th point is a sweet reminiscence of Rand's WBF back in the 16th of November, 2012 (when he used to have a clean shave) in which he shared his observations on some websites that were ranking in the top three spots of Google for a specific search query even if their meta data were NOT optimized for that particular search query. His observations were true as was his prediction on the co-occurrence and co-citation being a critical component of Google's search algo in the times to come. At this point, it is safe to assume that the future of SEO is going to be dominated by Semantic Search and Social Citation Strengths among many other influential factors. Thanks for sharing your research again!
Absolutely a brilliant post!
I would add that you should be aware that text in your navigation links can sometimes throw your page "off" if you are not careful. You need to consider the page as a whole, not just the bit in the main body.
Yes, thank you for all of the research!
Loved this post. Some clear guides and tips. Have not heard the term "chrome" used in this connection before (above where you mention copying text to the boiler plater and in chrome)
On a side note:
Anyone notice the typo on Google's Patent in the link above?
Interesting read! There are two interesting features in Onpage.org (German - not sure if english version is ready yet) onpage tool and a similar one in SearchMetrics (Content Optimization tool) to check these document word frequency (WDF) and index word frequency (IDF). These tools might be of help in terms of optimizing a page with right keyword frequency and occurrences.
Looks like there's an English version. Here's the link: https://en.onpage.org/
Text Tools (www.text-tools.net) offer TF-IDF (WDF-IDF) in English as well ;-)
Thanks for another great blog Cyrus ! This will really help our websites to serve quality experience to users as well as search engines.
I learned a few new things like I always do on moz.
A good research indeed Cyrus. I never miss a post of yours and I discovered some of god info especially in terms if TF-IDF concept. I never cam across this fact and it's worth consideration. Also, the supportive links is a bookmark thing you shared in your comment section. It seem that entity relationship creation is the SEO now. Figuring out a perfect balance might be tough but worth trying to be the best in SERP's.
Oh it's a great post for On page SEO. I learned some new things from this article. It is really awesome.
Thanks really informative, so you mean still keywords in the title tags and description are having importance in SEO?
Thanks @Cyrus for the post always interesting, there's big controversy about TF-IDF which is a technique no more relevant as the web goes wide with Trillions of pages from all around the world.
Hello Cyrus, Really great insight..!! This TF-IDF stuff was completely new for me. I'wd always love to see such worthful post. :)
Thanks for sharing this post..good job...
I think all the statements techniques are aimed primarily one purpose: the search engine, Google occurrence, can understand the content of the best.
I say we have the reflex to tell Google how our content is more important than the others.
Nice outline. If you are new to SEO just follow and expand on the 1-7 outline provided by the author and you will have success.
1-6 nothing new
7 fairly new and extremely interesting
I would add look at google's patents and study them and see if you can guess what steps they will take in the future.
Plus chicks will be really impressed by your knowledge of Google patents. Don't be afraid to bring that knowledge up at parties to.... WOOT
I know you miss me.
Dammnit Rand, is this your alter ego!?