Discussing LDA and SEO - Whiteboard Friday

Comments 101

Please keep your comments TAGFEE by following the community etiquette.

E-mail me when new comments are posted

Sort by:

Comments are closed on posts more than 30 days old. Got a burning question? Head to our Q&A section to start a new conversation.

audilo

2010-09-10T01:25:49-07:00

Hi all and thanks for the brilliant work you guys do.Theres is however something that I find annoying with your Whiteboard Fridays: I find it sometimes very hard to hear/understand.This seems to be due to the poor quality of the microphone you use as well as the acoustics of the room !

The more technical the subject, the harder for me (am I the only one??) to understand what you say. As you know, we all listen to your presentations on our lappies and the speakers are not very good either, so all in all, from source to output, theres is quite a loss of quality.

So please, Rand: could you invest some of your had earned cash in a couple of higher spec lapel mikes maybe?

This would make it a much more enjoyable experience, specially for us foreigners :-)

Thanks and keep on the good work!

23 0

Hi all and thanks for the brilliant work you guys do.Theres is however something that I find annoying with your Whiteboard Fridays: I find it sometimes very hard to hear/understand.This seems to be due to the poor quality of the microphone you use as well as the acoustics of the room ! The more technical the subject, the harder for me (am I the only one??) to understand what you say. As you know, we all listen to your presentations on our lappies and the speakers are not very good either, so all in all, from source to output, theres is quite a loss of quality. So please, Rand: could you invest some of your had earned cash in a couple of higher spec lapel mikes maybe? This would make it a much more enjoyable experience, specially for us foreigners :-) Thanks and keep on the good work!
Cancel
- Jacob Eeckhout
 
 2010-09-10T01:32:49-07:00
 
 @Audilo; same here. But it helps a lot to read the transcript along with the video, unless you get (____)
 
 Jacobe edited 2010-09-10T01:48:52-07:00
 3 0
 
 @Audilo; same here. But it helps a lot to read the transcript along with the video, unless you get (____)
 Cancel
- Gareth James
 
 2010-09-10T01:42:50-07:00
 
 Same here too, quality used to be great until you changed it all. I just get loads of hiss now :(
 
 3 0
 
 Same here too, quality used to be great until you changed it all. I just get loads of hiss now :(
 Cancel
- Toby Mason
 
 2010-09-10T02:06:07-07:00
 
 That's why they've added transcripts :D it's indeed very helpful.
 
 3 5
 
 That's why they've added transcripts :D it's indeed very helpful.
 Cancel
 - RickValentine
 
 2010-09-10T03:04:18-07:00
 
 Yes, they have added transcript, but it is called White Board Friday I want to the and hear the video..with the whiteboard .. hell it might as well be called transcript friday in that case
 
 RickValentine edited 2010-09-10T03:05:45-07:00
 10 1
 
 Yes, they have added transcript, but it is called White Board Friday I want to the and hear the video..with the whiteboard .. hell it might as well be called transcript friday in that case
 Cancel
 - Gareth James
 
 2010-09-11T01:27:46-07:00
 
 lol very good point!
 
 1 0
 
 lol very good point!
 Cancel
- Alastair Kay
 
 2010-09-10T10:52:42-07:00
 
 I consistently have the same problem. Using my head mike / earphones helps. You could try that if you have one.
 
 2 0
 
 I consistently have the same problem. Using my head mike / earphones helps. You could try that if you have one.
 Cancel
- Marlo Schneider
 
 2010-09-10T14:21:51-07:00
 
 I fully agree with what Audilo said.
 
 The audio is very difficult to hear, I had mine turned up so loud that when the video ending techno hit me I almost fell out of my chair. The content is great as always but an investment in some lapel mics and maybe even some noise insulation to reduce echo in that room would make these videos much more pleasant for me to watch.
 
 MarloSchneider edited 2010-09-10T14:27:35-07:00
 5 0
 
 I fully agree with what Audilo said. The audio is very difficult to hear, I had mine turned up so loud that when the video ending techno hit me I almost fell out of my chair. The content is great as always but an investment in some lapel mics and maybe even some noise insulation to reduce echo in that room would make these videos much more pleasant for me to watch.
 Cancel
 - James Drake
 
 2010-09-10T16:04:51-07:00
 
 same here audio issues
 
 3 0
 
 same here audio issues
 Cancel
Jacob Eeckhout

2010-09-10T01:31:30-07:00

I like Ben's dance moves. If you put a rap-tune on it, I bet he pulls it off :)

9 0

I like Ben's dance moves. If you put a rap-tune on it, I bet he pulls it off :)
Cancel
Rand Fishkin

2010-09-10T00:41:03-07:00

BTW - For anyone with more interest in the technicalities of the LDA model, its advantages over LSI/pLSI/etc and some of the ways Google might be using it, this video - https://www.youtube.com/watch?v=vgqWMGT9haY - from a Google Tech Talk by Amit Gruber. Skip to 11:03 if you'd just like to hear about the LDA stuff.

Gruber worked on some research related to text/topic analysis that was posted about on the Official Google Research blog here.

randfish edited 2010-09-10T00:42:50-07:00
5 0

BTW - For anyone with more interest in the technicalities of the LDA model, its advantages over LSI/pLSI/etc and some of the ways Google might be using it, this video - <a href="https://www.youtube.com/watch?v=vgqWMGT9haY" rel="nofollow">https://www.youtube.com/watch?v=vgqWMGT9haY</a> - from a Google Tech Talk by Amit Gruber. Skip to 11:03 if you'd just like to hear about the LDA stuff. Gruber worked on some research related to text/topic analysis that was <a href="https://googleresearch.blogspot.com/2007_09_01_archive.html" rel="nofollow">posted about on the Official Google Research blog here</a>.
Cancel
Gianluca Fiorelli

2010-09-10T02:20:39-07:00

So... let see if I understood the message:
- On Page is important, but summing up all the"on page factors" (including LDA), Links Factor are confirmed to be still better correlated to high rankings...
- ... that means that On Page (LDA included) is essential for high rankings but - assuming concurring pages have equivalent On Page optimization - it's their link profile that makes the difference
Then:
- LDA is a "metric" that suggest that it's not the "mechanical" use of the keyword in the content that counts but the context where the keywords are used (semantics and semiotic)
- ... that means, on an actionable side: please, write well and be coherent in your writings and do not get crazy of H1, bold, italics...
Finally
- Context is king of the On Page Optimization feud, but is Vassal of the Almighty Link Profile kingdom.
On a user note... 25 minutes of WBF are quite long, but in this case very needed. Anyhow, when talking about LDA what I'd like to see next should be real life cases, as we are still almost on a theoretical phase (or not)?

5 0
So... let see if I understood the message: <ul><li>On Page is important, but summing up all the"on page factors" (including LDA), Links Factor are confirmed to be still better correlated to high rankings...</li> <li>... that means that On Page (LDA included) is essential for high rankings but - assuming concurring pages have equivalent On Page optimization - it's their link profile that makes the difference</li> </ul> Then: <ul><li>LDA is a "metric" that suggest that it's not the "mechanical" use of the keyword in the content that counts but the context where the keywords are used (semantics and semiotic)</li> <li>... that means, on an actionable side: please, write well and be coherent in your writings and do not get crazy of H1, bold, italics...</li> </ul> Finally <ul><li>Context is king of the On Page Optimization feud, but is Vassal of the Almighty Link Profile kingdom.</li> </ul> On a user note... 25 minutes of WBF are quite long, but in this case very needed. Anyhow, when talking about LDA what I'd like to see next should be real life cases, as we are still almost on a theoretical phase (or not)?
Cancel
- greymorsels
 
 2010-09-16T23:46:09-07:00
 
 That was a good summary as usual Gfiorelli.
 
 1 0
 
 That was a good summary as usual Gfiorelli.
 Cancel
RickValentine

2010-09-10T03:01:50-07:00

Bad sound quality this week, cant hear a thing at full volume on my headphones :(

6 1

Bad sound quality this week, cant hear a thing at full volume on my headphones :(
Cancel
- Matt Shoffner
 
 2010-09-10T10:35:28-07:00
 
 I didn't notice any change
 
 2 1
 
 I didn't notice any change
 Cancel
- Jamie Steven
 
 2010-09-10T10:38:16-07:00
 
 Hi Rick-- we're working to improve the audio for future whiteboard Friday's. Please stay tuned!
 
 3 0
 
 Hi Rick-- we're working to improve the audio for future whiteboard Friday's. Please stay tuned!
 Cancel
BastBakeneeme

2010-09-10T04:12:21-07:00

Bad sound quality this week, cant hear a thing at full volume on my headphones

4 0

Bad sound quality this week, cant hear a thing at full volume on my headphones
Cancel
michael_housebook

2010-09-10T03:54:52-07:00

Oh my goodness spam in SEOMOZ! Shocking!

6 2

Oh my goodness spam in SEOMOZ! Shocking!
Cancel
- Gianluca Fiorelli
 
 2010-09-10T12:05:46-07:00
 
 Just a quick note guys: this comment above was written just because there was an incredible spam comment before... not for other reason. So no reason to thumb down our innocent Michael :)
 
 1 0
 
 Just a quick note guys: this comment above was written just because there was an incredible spam comment before... not for other reason. So no reason to thumb down our innocent Michael :)
 Cancel
- michael_housebook
 
 2010-09-10T17:31:05-07:00
 
 Has been removed. I was surprised. :)
 
 2 0
 
 Has been removed. I was surprised. :)
 Cancel
- Casey Henry
 
 2010-09-10T18:30:59-07:00
 
 I wasn't quick enough to kill that SPAM, sorry for that!
 
 1 0
 
 I wasn't quick enough to kill that SPAM, sorry for that! 
 Cancel
 - michael_housebook
 
 2010-09-10T21:32:19-07:00
 
 I was just surprised because this blog is always clean, which I really like. You must be doing great job. It must be hard when you readers coming from different timezones.
 
 I did not mean to make big thing out of it.
 
 2 0
 
 I was just surprised because this blog is always clean, which I really like. You must be doing great job. It must be hard when you readers coming from different timezones. I did not mean to make big thing out of it. 
 Cancel
 - Casey Henry
 
 2010-09-11T08:42:29-07:00
 
 No big deal, we try to keep up on SPAM in the comments since it can really take away from the overall message of the post and comments. Feel free to PM me if you ever see on that I may have missed.
 
 Casey
 
 1 0
 
 No big deal, we try to keep up on SPAM in the comments since it can really take away from the overall message of the post and comments. Feel free to PM me if you ever see on that I may have missed. Casey
 Cancel
Alex Avery

2010-09-10T08:44:46-07:00

Great topic Rand and Ben. By far, the best Whiteboard Friday to date. It seems to boil down to what I believe Rand mentioned in an earlier post and that is Web pages, as far as search engines are concerned, are text based. So it doesn't surprise me that this concept of topic modeling and LDA is an important factor in showing that your Web page (or text document) is more relevant than another.

I look forward to more on this topic and thanks for all of the great insights!

3 0

Great topic Rand and Ben. By far, the best Whiteboard Friday to date. It seems to boil down to what I believe Rand mentioned in an earlier post and that is Web pages, as far as search engines are concerned, are text based. So it doesn't surprise me that this concept of topic modeling and LDA is an important factor in showing that your Web page (or text document) is more relevant than another. I look forward to more on this topic and thanks for all of the great insights!
Cancel
Glenn Crocker

2010-09-11T06:51:18-07:00

Ben, in your original presentation, you mention that your LDA corpus is 8 million pages from Wikipedia. I wonder if that might skew your results since wikipedia ranks so high for so many long-tail phrases. Have you tried removing them from the results and confirming that you still see relatively high correlation scores?

3 0

Ben, in your original presentation, you mention that your LDA corpus is 8 million pages from Wikipedia. I wonder if that might skew your results since wikipedia ranks so high for so many long-tail phrases. Have you tried removing them from the results and confirming that you still see relatively high correlation scores?
Cancel
- Ben Hendrickson
 
 2010-09-11T14:32:22-07:00
 
 That is a valid concern. I had not looked at that, but did so right now.
 
 On a dataset that is similarly constructed to what we posted but not identical, the mean coefficient is 0.326347. If I strip out every URL with wikipedia.org in it, I get 0.313532.
 
 So it doesn't make much difference, but there is a slight drop. Wikipedia articles will generally score quite well even if it was not the corpus, so it is not clear to what extent this drop is because of bias vs because we are removing some of the easier to identity good results.
 
 That was a very good question.
 
 3 0
 
 That is a valid concern. I had not looked at that, but did so right now. On a dataset that is similarly constructed to what we posted but not identical, the mean coefficient is 0.326347. If I strip out every URL with wikipedia.org in it, I get 0.313532. So it doesn't make much difference, but there is a slight drop. Wikipedia articles will generally score quite well even if it was not the corpus, so it is not clear to what extent this drop is because of bias vs because we are removing some of the easier to identity good results. That was a very good question.
 Cancel
CharlieB29

2010-09-10T09:01:58-07:00

Good topic. I still believe natural content is the way to go. Manipulating word selection is fine but will it always work?

3 0

Good topic. I still believe natural content is the way to go. Manipulating word selection is fine but will it always work?
Cancel
Kate1982

2010-09-10T07:04:38-07:00

As it's all about being honest and transparent at SEOmoz I would also like to join the ranks of those who were not too happy with this weeks whiteboard session. I found myself distracted from the actually very interesting content by dance moves and bad audio. The whiteboard friday is usualyl one of my 'it's almost weekend, let's watch some clips' treat - not this time..

Other than that, please keep up the good work guys. Don't know what I would do without you!

J

3 0

As it's all about being honest and transparent at SEOmoz I would also like to join the ranks of those who were not too happy with this weeks whiteboard session. I found myself distracted from the actually very interesting content by dance moves and bad audio. The whiteboard friday is usualyl one of my 'it's almost weekend, let's watch some clips' treat - not this time.. Other than that, please keep up the good work guys. Don't know what I would do without you! J
Cancel
Mrosa

2010-09-10T10:08:23-07:00

"the keyword startrek"? I think I'm on the holodeck right now, somebody turn this thing off

3 0

"the keyword startrek"? I think I'm on the holodeck right now, somebody turn this thing off
Cancel
Andy_Fletcher

2010-09-10T10:11:17-07:00

Hmmm, this page only get's an LDA score of 16% for "bad sound quality". Perhaps a few more people could comment on it?

Joking aside, definately get some decent microphones, and I'd suggest some soundproofing for the ceiling to improve the accoustics. Love the post regardless. Not many online videos would keep me watching, but your content wins out over sound quality ;)

3 0

Hmmm, this page only get's an LDA score of 16% for "bad sound quality". Perhaps a few more people could comment on it? Joking aside, definately get some decent microphones, and I'd suggest some soundproofing for the ceiling to improve the accoustics. Love the post regardless. Not many online videos would keep me watching, but your content wins out over sound quality ;)
Cancel
- Jamie Steven
 
 2010-09-10T10:47:33-07:00
 
 Thanks Andy. We are indeed investing in dampening one of our rooms, and improving the overall quality of the sound. Our new office is noisy!
 
 3 0
 
 Thanks Andy. We are indeed investing in dampening one of our rooms, and improving the overall quality of the sound. Our new office is noisy!
 Cancel
- Kyle Richey
 
 2010-09-10T11:25:23-07:00
 
 Agreed!
 
 Though LDA is not going to change a whole lot in terms of how we write, it is quite helpful for us to understand the concept.
 
 I could see the LDA tool being really cool in certain situations, especially if they added the ability to enter a keyword and check the LDA "score" for each of the top 10-20 Google results for the term. Similar to heatmap analytics, I feel like LDA analysis is more of a top-down view of site structure and on-page layout/elements in general.
 
 1 0
 
 Agreed! Though LDA is not going to change a whole lot in terms of how we write, it is quite helpful for us to understand the concept. I could see the LDA tool being really cool in certain situations, especially if they added the ability to enter a keyword and check the LDA "score" for each of the top 10-20 Google results for the term. Similar to heatmap analytics, I feel like LDA analysis is more of a top-down view of site structure and on-page layout/elements in general.
 Cancel
SDFF

2010-09-12T10:09:48-07:00

Not sure if this is the right place but I'll ask anyway.

I have a couple of customers who like to embellish on their product descriptions and use words that, while possibly resonating with their demos, are probably not so good for SEO. Although I've never really had a way to prove it. I'm wondering if the LDA tool can back this up. Here's an example:

"This ain't your Father's Barber Shop. This ain't your Mother's Salon. Don't come here looking for a shoeshine or a pedicure. We're not a chain store more interested in gimmicks than quality of service."

Humans know what the description above is talking about even with out the rest of the context, but when the search engines see terms like "shoeshines", "pedicures", and "chain stores", does that bring relevance to the keywords/phrase "barber shop" down? I claim they do but haven't had a way to prove it. Is that something LDA will tell me?

- Jeff Hancock

SDFF edited 2010-09-12T10:10:27-07:00
4 1

Not sure if this is the right place but I'll ask anyway. I have a couple of customers who like to embellish on their product descriptions and use words that, while possibly resonating with their demos, are probably not so good for SEO. Although I've never really had a way to prove it. I'm wondering if the LDA tool can back this up. Here's an example: "This ain't your Father's Barber Shop. This ain't your Mother's Salon. Don't come here looking for a shoeshine or a pedicure. We're not a chain store more interested in gimmicks than quality of service." Humans know what the description above is talking about even with out the rest of the context, but when the search engines see terms like "shoeshines", "pedicures", and "chain stores", does that bring relevance to the keywords/phrase "barber shop" down? I claim they do but haven't had a way to prove it. Is that something LDA will tell me? - Jeff Hancock
Cancel
- goodnewscowboy
 
 2010-09-12T11:36:19-07:00
 
 Man Jeff, you've got the best example of a perfect use of the LDA tool that I've seen yet. Your client issue would totally be solved using it. Compare the current copy of the site to optimised copy you create that strips out the shoeshine, etc. and I'd bet you'd be able to show your clients better LDA scores for your optimised copy.
 
 1 1
 
 Man Jeff, you've got the best example of a perfect use of the LDA tool that I've seen yet. Your client issue would totally be solved using it. Compare the current copy of the site to optimised copy you create that strips out the shoeshine, etc. and I'd bet you'd be able to show your clients better LDA scores for your optimised copy.
 Cancel
 - WeRASkitzzo
 
 2010-09-13T10:30:01-07:00
 
 See this is a perfect example of my problem with this study! If Jeff uses SEOmoz's tool to generate an LDA score and tells them improving it will help their SEO, it will be one more case of SEO's selling the latest trend that may or may not have any impact on their rankings at all.
 
 SEOmoz's conclusions to their data have been called into question in multiple places, but even if we accept their conclusions 100%, there's absolutely NO data to suggest an improved LDA will improve your rankings.
 
 Selling this kind of crap is exactly why our industry is viewed in such a skeptical light. This response is also why I wish SEOmoz were more responsible with their studies. If you're going to present yourselves as SEO scientists, follow the scientific policy of peer review, etc before making bombastic claims and turning people like Jeff into possible snake oil salesmen.
 
 1 3
 
 See this is a perfect example of my problem with this study! If Jeff uses SEOmoz's tool to generate an LDA score and tells them improving it will help their SEO, it will be one more case of SEO's selling the latest trend that may or may not have any impact on their rankings at all. SEOmoz's conclusions to their data have been called into question in multiple places, but even if we accept their conclusions 100%, there's absolutely NO data to suggest an improved LDA will improve your rankings. Selling this kind of crap is exactly why our industry is viewed in such a skeptical light. This response is also why I wish SEOmoz were more responsible with their studies. If you're going to present yourselves as SEO scientists, follow the scientific policy of peer review, etc before making bombastic claims and turning people like Jeff into possible snake oil salesmen.
 Cancel
 - Rand Fishkin
 
 2010-09-13T12:23:49-07:00
 
 Ben - in the spirit of TAGFEE, I'm going to say that I just don't agree with your perspective, but certainly am fine with providing you an outlet to voice it. Nothing I've seen so far suggests to me that we've done something irresponsible. We presented some research at our seminar, some people tweeted about it excitedly (though I think you'll far worse overhyping for nearly every story that appears in the political/entertainment/technology field), we wrote a blog post that explained what we'd done, invited others to repeat (several did so, even using other methodologies recommended by critics, and got similar or better correlation results) and provided a free tool.
 
 We're certainly planning to do lots more of this in the future. If this is a process you don't agree with or don't like, you are free not to participate or engage. There is nothing compulsory about our work and our suggestions around every facet of SEO have always been "try this, if it works for you, great, and if not, no worries."
 
 SEOmoz is a private software company. We're not a regulatory board or some officially sanctioned representative of the engines or the SEO field. We love providing material to those who enjoy, appreciate or find value in our work and we empathize with and harbor no ill will towards any who don't. I'd appreciate if you treated the situation proportionally in the future.
 
 3 0
 
 Ben - in the spirit of TAGFEE, I'm going to say that I just don't agree with your perspective, but certainly am fine with providing you an outlet to voice it. Nothing I've seen so far suggests to me that we've done something irresponsible. We presented some research at our seminar, some people tweeted about it excitedly (though I think you'll far worse overhyping for nearly every story that appears in the political/entertainment/technology field), we wrote a blog post that explained what we'd done, invited others to repeat (several did so, even using other methodologies recommended by critics, and got similar or better correlation results) and provided a free tool. We're certainly planning to do lots more of this in the future. If this is a process you don't agree with or don't like, you are free not to participate or engage. There is nothing compulsory about our work and our suggestions around every facet of SEO have always been "try this, if it works for you, great, and if not, no worries." SEOmoz is a private software company. We're not a regulatory board or some officially sanctioned representative of the engines or the SEO field. We love providing material to those who enjoy, appreciate or find value in our work and we empathize with and harbor no ill will towards any who don't. I'd appreciate if you treated the situation proportionally in the future.
 Cancel
 - WeRASkitzzo
 
 2010-09-14T12:15:51-07:00
 
 Rand, do you follow your mom on twitter? Or Joanna Lord? Your comment seems to suggest that your employees did nothing to hype this tool when in fact they were some of the most egrigious. (You can see my latest post on Skitzzo.com for my favorite examples.)
 
 Also, I would be very interested to see the othe replicated studies that you mentioned. Can you maybe include them as related links or something of the sort on one of your LDA posts?
 
 1 2
 
 Rand, do you follow your mom on twitter? Or Joanna Lord? Your comment seems to suggest that your employees did nothing to hype this tool when in fact they were some of the most egrigious. (You can see my latest post on Skitzzo.com for my favorite examples.) Also, I would be very interested to see the othe replicated studies that you mentioned. Can you maybe include them as related links or something of the sort on one of your LDA posts? 
 Cancel
 - oskarokupa
 
 2010-09-14T05:48:10-07:00
 
 The concept of peer reviews is very sound. But in order to have peer reviews there need to be "peers"; hence other researchers who have done a similar study that disproves the study thesis that is being scrutinised. So far the complaints about the experiment design and statistical methodology haven't been backed up by a disproving study. It is very good to criticise Seomoz about their lack of scientific rigor, but without citing a study that disproves theirs, it all sounds like a childish tantrum. Everybody is free to create their own model and prove Seomoz wrong and I would be very happy to read and hype the rebuttal if it shows me a more sound conclusion.
 
 1 0
 
 The concept of peer reviews is very sound. But in order to have peer reviews there need to be "peers"; hence other researchers who have done a similar study that disproves the study thesis that is being scrutinised. So far the complaints about the experiment design and statistical methodology haven't been backed up by a disproving study. It is very good to criticise Seomoz about their lack of scientific rigor, but without citing a study that disproves theirs, it all sounds like a childish tantrum. Everybody is free to create their own model and prove Seomoz wrong and I would be very happy to read and hype the rebuttal if it shows me a more sound conclusion.
 Cancel
 - Toby Mason
 
 2010-09-14T09:11:40-07:00
 
 I completely disagree with you WeareSkitzoo.
 
 Surely with this information at Jeffs disposal it would be irresponsible of him not to recommend to his client that they investigate the potential of changing the text. Of course he should not state that "this is fact", (after all not much in SEO is) and roll it out across 100's of pages but a couple of copy changes here and there may make a massive difference. Of course it may not.
 
 I for one will be looking at some of our clients pages and seeing for myself if it impacts on rankings.
 
 1 0
 
 I completely disagree with you WeareSkitzoo. Surely with this information at Jeffs disposal it would be irresponsible of him not to recommend to his client that they investigate the potential of changing the text. Of course he should not state that "this is fact", (after all not much in SEO is) and roll it out across 100's of pages but a couple of copy changes here and there may make a massive difference. Of course it may not. I for one will be looking at some of our clients pages and seeing for myself if it impacts on rankings.
 Cancel
- Rand Fishkin
 
 2010-09-12T16:09:48-07:00
 
 Certainly there could be extreme instances where a writer has gone overboard with irrelevant language and potentially confused a search engine about the topic/relevance of the page to the keyword. However, my suspicion (albeit untested given the tool/process' newness) is that most of the time, you'll get better value out of identifying topics and content you may not have covered that searchers are interested in.
 
 Still - great thinking in terms of application. I'd imagine that this could be particularly more likely in news/media pieces, where creative reporters like to use flourish and diversity in their works rather than speaking plainly to a topic. However, I'd hate to see that creativity lost - hopefully it can simply be channeled in a way that's both productive for searchers/engines and still enjoyable to read.
 
 1 0
 
 Certainly there could be extreme instances where a writer has gone overboard with irrelevant language and potentially confused a search engine about the topic/relevance of the page to the keyword. However, my suspicion (albeit untested given the tool/process' newness) is that most of the time, you'll get better value out of identifying topics and content you may not have covered that searchers are interested in. Still - great thinking in terms of application. I'd imagine that this could be particularly more likely in news/media pieces, where creative reporters like to use flourish and diversity in their works rather than speaking plainly to a topic. However, I'd hate to see that creativity lost - hopefully it can simply be channeled in a way that's both productive for searchers/engines and still enjoyable to read.
 Cancel
Josh Braaten

2010-09-10T06:50:05-07:00

This is great guys. I'm really excited to try LDA out on some of my content. It's a tough set of concepts but you've explained it all very well between Whiteboard Friday and the recent blog posts.

By the way, I love the SEO community. Only here will you hear "Megan Fox???? Is she related to Vanessa Fox?" I'd hang out with Vanessa over Megan any day too, fellas. If that means there's something wrong with us, then at least we're in good company.

3 0

This is great guys. I'm really excited to try LDA out on some of my content. It's a tough set of concepts but you've explained it all very well between Whiteboard Friday and the recent blog posts. By the way, I love the SEO community. Only here will you hear "Megan Fox???? Is she related to Vanessa Fox?" I'd hang out with Vanessa over Megan any day too, fellas. If that means there's something wrong with us, then at least we're in good company.
Cancel
- Norman Newsome
 
 2010-09-18T18:56:06-07:00
 
 Yeah Megan is hot, but what's up with those tumbs? (thumb's that look like big toes?) No thanks! lol
 
 1 0
 
 Yeah Megan is hot, but what's up with those tumbs? (thumb's that look like big toes?) No thanks! lol
 Cancel
Gary Bennion

2010-09-10T02:14:59-07:00

Is it me or is all this stuff a bit over the top? surely if you just write good content you shouldn't need to worry... or am I missing something?

I mean, it's interesting but I don't think it will change the way I write content.

3 0

Is it me or is all this stuff a bit over the top? surely if you just write good content you shouldn't need to worry... or am I missing something? I mean, it's interesting but I don't think it will change the way I write content.
Cancel
- Dennis Lees
 
 2010-09-10T14:03:43-07:00
 
 It might not change the way you write content, but might go a long way to explaining why it doesn't rank as highly.
 
 For example, this LDA finding suggests that similes are not a good idea. That is, the words that make up expressions like "like a hot knife through butter" and "as sweet as sweet as candy" are likely to make your language less topically relevant. (for most topics)
 
 Would you hang on to your similes depsite the fact they might be hurting your ranking?
 
 dennis.globalsign edited 2010-09-10T14:05:07-07:00
 2 0
 
 It might not change the way you write content, but might go a long way to explaining why it doesn't rank as highly. For example, this LDA finding suggests that similes are not a good idea. That is, the words that make up expressions like "like a hot knife through butter" and "as sweet as sweet as candy" are likely to make your language less topically relevant. (for most topics) Would you hang on to your similes depsite the fact they might be hurting your ranking? 
 Cancel
 - Alan Mosley
 
 2010-09-11T11:37:38-07:00
 
 I have made pages in the past that have ranked well before adding any real text, just a few headings images with alt tags and stuff. Then when added 500 to 1000 words they have dropped rankings. I though maybe un-natural text or keyword stuffing was to blame, but I now thing LDA may have had something to do with it.
 
 1 0
 
 I have made pages in the past that have ranked well before adding any real text, just a few headings images with alt tags and stuff. Then when added 500 to 1000 words they have dropped rankings. I though maybe un-natural text or keyword stuffing was to blame, but I now thing LDA may have had something to do with it.
 Cancel
 - Simon Dalley
 
 2011-12-07T03:09:23-08:00
 
 Yeah I have often found this and it's opposite in many ways to the things we normally say as seo's
 
 1 0
 
 Yeah I have often found this and it's opposite in many ways to the things we normally say as seo's
 Cancel
lsujoe

2010-09-10T06:04:24-07:00

Very nice video but I do agree video/audio quality was a little low. One question though, are the correlation stats like the .32 LDA and the link factors percentage posted anywhere? I think this would be a great thing to see.

3 0

Very nice video but I do agree video/audio quality was a little low. One question though, are the correlation stats like the .32 LDA and the link factors percentage posted anywhere? I think this would be a great thing to see.
Cancel
Ian Auld

2010-09-14T12:58:21-07:00

Very interesting video. It reminds of something one of my professors had told me about writing papers. She told me you should never imply anything, be very explicit and make sure the reader knows what you are talking about. We shouldn't just assume the engines know what we are talking about because we mentioned a keyword a couple of times. Since the same word can have several completely different meanings, as cited by the bark example in the video, it makes sense that the engines would try to look for other factors to ensure that the results they return are for the correct bark. By looking at the Bing commercials (In the commercials someone mentions a word and everyone around them begins spouting out "search results" that include the word but are non-related) we can see that this is a thought in the mind of search engines. I still believe that content should be written for users not engines, but this would make me think about some terms I would use. After all you can still have natural content but include content for the engines as well. I was also pretty excited to hear how dirichlet was pronounced.

2 0

Very interesting video. It reminds of something one of my professors had told me about writing papers. She told me you should never imply anything, be very explicit and make sure the reader knows what you are talking about. We shouldn't just assume the engines know what we are talking about because we mentioned a keyword a couple of times. Since the same word can have several completely different meanings, as cited by the bark example in the video, it makes sense that the engines would try to look for other factors to ensure that the results they return are for the correct bark. By looking at the Bing commercials (In the commercials someone mentions a word and everyone around them begins spouting out "search results" that include the word but are non-related) we can see that this is a thought in the mind of search engines. I still believe that content should be written for users not engines, but this would make me think about some terms I would use. After all you can still have natural content but include content for the engines as well. I was also pretty excited to hear how dirichlet was pronounced.
Cancel
James Curley

2010-09-10T12:15:19-07:00

Hey Rand and Ben,

I really like the idea of boiling down SEO to mathematical concepts when possible, as I'm a real logical and deductive person. I imagine the majority of people that really love this LDA tool are the ones that come from development or math backgrounds, where those that think it doesn't change anything are more of a journalist and designer background. I could be wrong though, I'm basing that off just a few opinions I've seen.

Anyway, there are a lot of people saying "this won't change how I write anything" and others claiming "backlinks are still the most important factor". I see two underlying questions that have yet to be really answered. Most likely they will be over time.

1) What makes this different than any other SEO "fad"?

2) How does this change what I am doing or need to do?

Only time and implementation will adequately answer the first question. Although it is doubtful that everyone will ever be satisfied by the answer, even if it shows some concrete evidence. The difference is this is based off mathematical analysis rather than "gut instinct" and "quaint observations". It's not that working off instinct and observations (some would call this experience) is a bad thing. It's just a different way of doing things. Some will prefer basing their actions on statistical principles and others on experience.

We have to wait to see if it's different than other fads. If people implementing it begin to show results, it might begin to calm down some skeptics. I think the bigger question is how we are supposed to use this information. Right now it does nothing other than reinforce what we've known all along - writing good quality content is an important aspect of SEO.

That's not game changing. If I write a well crafted article about Star Trek, it's most likely going to have a high LDA for "star trek". If I write a gimmicked article and don't really know what I'm talking about (e.g. who is Captain Kirk and Spock and the Gorn and Orion slave girls...), it probably won't.

There are probably uses for this tool that we have yet to discover or understand. The biggest potential use for it that I see is the inverse: I submit my document and you tell me what it's about based on your LDA model. Then I can check my Star Trek article, and your LDA tool might say, "hey, I see where you're getting at on the Star Trek front, but it sounds more like you're talking about sickly looking enslaved women, you should shy away from the Orion slave girls and toss in a few more references to Kirk and maybe the Enterprise." That's useful.

As of right now, yeah, it doesn't change how most people do things. Unless they were writing low quality articles. The tool is still in the Labs, after all.

I'm excited to see where it goes from here!

Regards,

Matthias

MountainMedia edited 2010-09-10T12:15:46-07:00
2 0

Hey Rand and Ben, I really like the idea of boiling down SEO to mathematical concepts when possible, as I'm a real logical and deductive person. I imagine the majority of people that really love this LDA tool are the ones that come from development or math backgrounds, where those that think it doesn't change anything are more of a journalist and designer background. I could be wrong though, I'm basing that off just a few opinions I've seen. Anyway, there are a lot of people saying "this won't change how I write anything" and others claiming "backlinks are still the most important factor". I see two underlying questions that have yet to be really answered. Most likely they will be over time. 1) What makes this different than any other SEO "fad"? 2) How does this change what I am doing or need to do? Only time and implementation will adequately answer the first question. Although it is doubtful that everyone will ever be satisfied by the answer, even if it shows some concrete evidence. The difference is this is based off mathematical analysis rather than "gut instinct" and "quaint observations". It's not that working off instinct and observations (some would call this experience) is a bad thing. It's just a different way of doing things. Some will prefer basing their actions on statistical principles and others on experience. We have to wait to see if it's different than other fads. If people implementing it begin to show results, it might begin to calm down some skeptics. I think the bigger question is how we are supposed to use this information. Right now it does nothing other than reinforce what we've known all along - writing good quality content is an important aspect of SEO. That's not game changing. If I write a well crafted article about Star Trek, it's most likely going to have a high LDA for "star trek". If I write a gimmicked article and don't really know what I'm talking about (e.g. who is Captain Kirk and Spock and the Gorn and Orion slave girls...), it probably won't. There are probably uses for this tool that we have yet to discover or understand. The biggest potential use for it that I see is the inverse: I submit my document and you tell me what it's about based on your LDA model. Then I can check my Star Trek article, and your LDA tool might say, "hey, I see where you're getting at on the Star Trek front, but it sounds more like you're talking about sickly looking enslaved women, you should shy away from the Orion slave girls and toss in a few more references to Kirk and maybe the Enterprise." That's useful. As of right now, yeah, it doesn't change how most people do things. Unless they were writing low quality articles. The tool is still in the Labs, after all. I'm excited to see where it goes from here! Regards, Matthias
Cancel
- Carlos del Rio
 
 2010-09-10T14:06:30-07:00
 
 1.) It isn't an "SEO fad" it is a mathematical means of valuing written context. We are just now openly discussing what application it has to the issue of search rankings.
 
 2.) It doesn't change anything. The need for clear comprehensive writing pre-dates search engines. If you were good at writing copy for search engines last week you will still be good at it today. LDA is just a perspective for looking at your writing vs. your intended meanings.
 
 LDA is a metric, not a tactic.
 
 2 0
 
 1.) It isn't an "SEO fad" it is a mathematical means of valuing written context. We are just now openly discussing what application it has to the issue of search rankings. 2.) It doesn't change anything. The need for clear comprehensive writing pre-dates search engines. If you were good at writing copy for search engines last week you will still be good at it today. LDA is just a perspective for looking at your writing vs. your intended meanings. LDA is a metric, not a tactic.
 Cancel
- Gianluca Fiorelli
 
 2010-09-10T14:17:13-07:00
 
 Re: I imagine the majority of people that really love this LDA tool are the ones that come from development or math backgrounds, where those that think it doesn't change anything are more of a journalist and designer background.
 
 I think I can recognize myself in the pattern you have "intuitively" guessed reading the comments, not being a math lover (or, better, it's math that disliked me since I was attending Primay School).
 
 But that doesn't mean I'm not intrigued by the concept of LDA and its possible use in On Page Optimization Tools, because any tool that can make my day more productive is very welcome.
 
 The fact is that - coming from an editorial world - that topics like "Context" "Semantics" "Signs" and so on were already there in my daily working baggage. If there are proves that can show us in a math/scientifical way that they are important and - since further experiments won't say the contrary - have a relatively high correlation with rankings, then they simply are confirming what everybody could infer from the well known Google suggestion "write relevant content for your readers". And if you are coming from the editorial world, you know that relevant means not only "important" or "unique", but also write content well written. That suggestion implies "not write for us", that means "not fill with keywords", that means "write naturally", that leads "write giving a context to what you want to express". Therefore, Context was already there and important, simply its importance was not explained with formulas.
 
 That is why LDA is finally not going to revolutionate my life, simply it confirms to me that I am doing right and help me in faster checking if my competitors are doing better or worse on that particular factor. Just for this, I welcome LDA and the future LDA Tools: they are going to make less "painfull" my job, not changing the way I do my job.
 
 gfiorelli1 edited 2010-09-10T14:45:06-07:00
 1 0
 
 Re: I imagine the majority of people that really love this LDA tool are the ones that come from development or math backgrounds, where those that think it doesn't change anything are more of a journalist and designer background. I think I can recognize myself in the pattern you have "intuitively" guessed reading the comments, not being a math lover (or, better, it's math that disliked me since I was attending Primay School). But that doesn't mean I'm not intrigued by the concept of LDA and its possible use in On Page Optimization Tools, because any tool that can make my day more productive is very welcome. The fact is that - coming from an editorial world - that topics like "Context" "Semantics" "Signs" and so on were already there in my daily working baggage. If there are proves that can show us in a math/scientifical way that they are important and - since further experiments won't say the contrary - have a relatively high correlation with rankings, then they simply are confirming what everybody could infer from the well known Google suggestion "write relevant content for your readers". And if you are coming from the editorial world, you know that relevant means not only "important" or "unique", but also write content well written. That suggestion implies "not write for us", that means "not fill with keywords", that means "write naturally", that leads "write giving a context to what you want to express". Therefore, Context was already there and important, simply its importance was not explained with formulas. That is why LDA is finally not going to revolutionate my life, simply it confirms to me that I am doing right and help me in faster checking if my competitors are doing better or worse on that particular factor. Just for this, I welcome LDA and the future LDA Tools: they are going to make less "painfull" my job, not changing the way I do my job.
 Cancel
UMoveFree

2010-09-12T14:09:15-07:00

Nice WBF. I agree that it would be much more helpful if the tool suggested related (topical) keywords that would improve your score.

BTW - You missed comment SPAM from "Dan Dees" who slid in a nice little keyword ("SEO") pointing to his site.

Keep up the great work!

2 0

Nice WBF. I agree that it would be much more helpful if the tool suggested related (topical) keywords that would improve your score. BTW - You missed comment SPAM from "Dan Dees" who slid in a nice little keyword ("SEO") pointing to his site. Keep up the great work!
Cancel
Carlos del Rio

2010-09-10T11:46:44-07:00

I think that on-page factors have been undervalued the past few years. At the end of the day making a comparison of the LDA of two documents is going to be about as useful as making a comparison of readablity scores. It is a good factor to use to measure your copy writer, but it isn't a siver bullet.

We are in an era where search engines really only have two types of factors: things a site can control (content, architecture, etc.), and things they can't (links, citations, competing sites, search behavior). Because Google is the site that spam built they will err on the side of things you don't control; hence penalties for link buyers.

One drawback to obsessing about LDA (just like with KW density) is that it will paint you into a corner of inhuman writing. And, inhuman writing, or overly complicated writing, will lose out to more human friendly (linkable?) writing.

2 0

I think that on-page factors have been undervalued the past few years. At the end of the day making a comparison of the LDA of two documents is going to be about as useful as making a comparison of readablity scores. It is a good factor to use to measure your copy writer, but it isn't a siver bullet. We are in an era where search engines really only have two types of factors: things a site can control (content, architecture, etc.), and things they can't (links, citations, competing sites, search behavior). Because Google is the site that spam built they will err on the side of things you don't control; hence penalties for link buyers. One drawback to obsessing about LDA (just like with KW density) is that it will paint you into a corner of inhuman writing. And, inhuman writing, or overly complicated writing, will lose out to more human friendly (linkable?) writing.
Cancel
- Rand Fishkin
 
 2010-09-10T22:44:19-07:00
 
 Carlos - we may have some substantive disagreements:
 
 1) Re: Readability scores vs. LDA scores - unless correlation with readability scores is quite high (certainly would be interesting to check, but I have doubts - though, to be fair, I didn't think LDA would be particularly high either), it would seem that, by definition, readability improvements won't have the same impact as LDA improvements. Of course, the correlation could be something else entirely, but I'm struggling to think what that could be if not some form of topic modeling.
 
 2) Re: Inhuman writing - I'd hope not! While density would suggest some very weird overuse of terms, LDA should, primarily, just be suggesting terms/phrases you may not be using that you probably should (or some that you are using that could be confusing engines/visitors about the topic). I suppose it's possible that some writers might go overboard with this, but hopefully, it's more like the example I gave in the previous post - you've been writing about the Rolling Stones, but forgot to include "Keith Richards" - I'd think that would be good for search engines AND visitors.
 
 3) My guess is that search engine don't just observe on-site factors like word usage/topics/keywords/etc and links, but that social signals, usage signals, searcher behavior patterns (perhaps branded searches) and maybe even manual quality ratings (internal and 3rd party - especially w/ local), are all making their way (or will) into ranking algorithms. Some of this stuff SEOs are good at observing, others are tougher.
 
 2 0
 
 Carlos - we may have some substantive disagreements: 1) Re: Readability scores vs. LDA scores - unless correlation with readability scores is quite high (certainly would be interesting to check, but I have doubts - though, to be fair, I didn't think LDA would be particularly high either), it would seem that, by definition, readability improvements won't have the same impact as LDA improvements. Of course, the correlation could be something else entirely, but I'm struggling to think what that could be if not some form of topic modeling. 2) Re: Inhuman writing - I'd hope not! While density would suggest some very weird overuse of terms, LDA should, primarily, just be suggesting terms/phrases you may not be using that you probably should (or some that you are using that could be confusing engines/visitors about the topic). I suppose it's possible that some writers might go overboard with this, but hopefully, it's more like the example I gave in the previous post - you've been writing about the Rolling Stones, but forgot to include "Keith Richards" - I'd think that would be good for search engines AND visitors. 3) My guess is that search engine don't just observe on-site factors like word usage/topics/keywords/etc and links, but that social signals, usage signals, searcher behavior patterns (perhaps branded searches) and maybe even manual quality ratings (internal and 3rd party - especially w/ local), are all making their way (or will) into ranking algorithms. Some of this stuff SEOs are good at observing, others are tougher.
 Cancel
Jonathan Goodman

2010-09-12T08:41:19-07:00

Ok I've played around with this labs tool a little and would like to suggest a reversal on how the tool works. Currently, the tool allows you to enter text or a URL to be compared against a specific word (similar to KW density) resulting in a percentage grade.

Instead I would like if the tool allowed the user to enter specific text or a URL and resolve with an observation on what words correlate to the document.

For example if I enter a CNN story on the trapped miners in Chile (https://www.cnn.com/2010/WORLD/americas/09/12/chile.miners/index.html?hpt=T1) I would get keyword three phrases according to the SEOmoz Term Extractor Tool like:
1. two packs daily
2. the men underground
3. the 33 men
4. take special precautions
5. oil drilling platform
6. minister laurence golborne
7. minister jaime manalich
8. miners' sleep patterns
9. miners' longstanding request
10. mine led officials
I’m proposing that the LDA tool extract these terms and quantify the result with percentages.

Now my question is this: Are the percentages currently coming out of the LDA a mix of Google results and computation or pure statistical computations? If it’s purely computational this would account for the growing differential when running the tool again and again for the same document. As more people use the LDA the base numeric is going to change. If LDA was computational plus referential against current Google results you would have proof against the percentages and a significant reason to use this tool. Think of it as a sandbox for testing your content for ranking.

2 0
Ok I've played around with this labs tool a little and would like to suggest a reversal on how the tool works. Currently, the tool allows you to enter text or a URL to be compared against a specific word (similar to KW density) resulting in a percentage grade. Instead I would like if the tool allowed the user to enter specific text or a URL and resolve with an observation on what words correlate to the document. For example if I enter a CNN story on the trapped miners in Chile (https://www.cnn.com/2010/WORLD/americas/09/12/chile.miners/index.html?hpt=T1) I would get keyword three phrases according to the SEOmoz Term Extractor Tool like: <ol><li>two packs daily</li> <li>the men underground</li> <li>the 33 men</li> <li>take special precautions</li> <li>oil drilling platform</li> <li>minister laurence golborne</li> <li>minister jaime manalich</li> <li>miners' sleep patterns</li> <li>miners' longstanding request</li> <li>mine led officials</li> </ol> I’m proposing that the LDA tool extract these terms and quantify the result with percentages. Now my question is this: Are the percentages currently coming out of the LDA a mix of Google results and computation or pure statistical computations? If it’s purely computational this would account for the growing differential when running the tool again and again for the same document. As more people use the LDA the base numeric is going to change. If LDA was computational plus referential against current Google results you would have proof against the percentages and a significant reason to use this tool. Think of it as a sandbox for testing your content for ranking.
Cancel
- Rand Fishkin
 
 2010-09-12T15:53:33-07:00
 
 This is definitely on the roadmap, but it's significantly more work to build. Hopefully we'll have something in the near future.
 
 2 0
 
 This is definitely on the roadmap, but it's significantly more work to build. Hopefully we'll have something in the near future.
 Cancel
Bharati Ahuja

2010-09-12T00:25:17-07:00

Though everyone says that inbound links are very important , it is the on page content and optimization which gives you the initial rankings and with this concept of LDA it gets proved that the content should have true quality with great contextual info. even if the keywords are not repeated.

2 0

Though everyone says that inbound links are very important , it is the on page content and optimization which gives you the initial rankings and with this concept of LDA it gets proved that the content should have true quality with great contextual info. even if the keywords are not repeated.
Cancel
Realize

2010-09-12T12:42:04-07:00

I think it's time you got a couple of wireless mics for the videos... it is a little tiring to listen to you guys because the mic is far from your mouths...

2 0

I think it's time you got a couple of wireless mics for the videos... it is a little tiring to listen to you guys because the mic is far from your mouths...
Cancel
- Jamie Steven
 
 2010-09-13T10:28:18-07:00
 
 We're working on it! Please expect improved quality for this Friday.
 
 2 0
 
 We're working on it! Please expect improved quality for this Friday.
 Cancel
Glenn Crocker

2010-09-10T09:14:00-07:00

I've been working with the LDA tool to see what wins I can find for clients, and was surprised to see what looks like a keyword density aspect to the tool. Specifically, if I take a block of text from a client site that yields (say) a 71% LDA score with 459 words, then sort the words in the sample set, I still see roughly 71% LDA with those 459 words. (So this affirms that phrases are ignored and only single words are being used.)

But if I test LDA with just the 252 unique words from the set, the LDA score drops to 53%.

This was a surprising result. What's the nature of keyword density as a factor for LDA?

2 0

I've been working with the LDA tool to see what wins I can find for clients, and was surprised to see what looks like a keyword density aspect to the tool. Specifically, if I take a block of text from a client site that yields (say) a 71% LDA score with 459 words, then sort the words in the sample set, I still see roughly 71% LDA with those 459 words. (So this affirms that phrases are ignored and only single words are being used.) But if I test LDA with just the 252 unique words from the set, the LDA score drops to 53%. This was a surprising result. What's the nature of keyword density as a factor for LDA?
Cancel
- indurain
 
 2010-09-10T10:02:11-07:00
 
 Any natural looking text (that can be read by humans) will need to have prepositions, articles, pronoums, verbs....a block of text based on "keywords" wouldn't hold much meaning at all...perhaps the tool addresses this fact and that's why your total LDA % goes down?
 
 2 0
 
 Any natural looking text (that can be read by humans) will need to have prepositions, articles, pronoums, verbs....a block of text based on "keywords" wouldn't hold much meaning at all...perhaps the tool addresses this fact and that's why your total LDA % goes down?
 Cancel
 - Glenn Crocker
 
 2010-09-10T10:31:50-07:00
 
 It looks like the LDA tool removes "stop words" already, leaving only "interesting" words. Run it against a URL and in the box you'll see the words they actually used for LDA. Here's a concrete example of what I'm seeing:
 
 Query: star trek
 
 19-20% with Document: kirk mccoy
 
 17-20% with Document: mccoy kirk
 
 So word order seems irrelevant.
 
 12-14% with Document: mccoy kirk mccoy
 
 Adding more mccoy seems irrelevant, even negative.
 
 53-56% with Document: kirk mccoy kirk
 
 But more kirk = higher LDA. This makes sense, but I don't see this aspect described in the LDA info from seoMoz so far.
 
 56-60% with Document: kirk kirk mccoy
 
 63-68% with Document: kirk kirk kirk mccoy
 
 3 kirks to one mccoy seems best for LDA %.
 
 34-36% with Document: mccoy kirk mccoy kirk
 
 38-48% with Document: kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy
 
 98% with Document: kirk kirk kirk kirk kirk kirk mccoy uhura chekov
 
 98% with Document: kirk mccoy uhura chekov
 
 It's like I need more words in the target LDA space, even if they're repeats, but unique words from the LDA space did best.
 
 In a separate test with real content from a web site, it seemed that having a "natural" keyword density helped. Sorted unique words ranked the lowest, doubling all unique words was higher, but a natural keyword density did better.
 
 gcrocker edited 2010-09-10T10:34:54-07:00
 3 0
 
 It looks like the LDA tool removes "stop words" already, leaving only "interesting" words. Run it against a URL and in the box you'll see the words they actually used for LDA. Here's a concrete example of what I'm seeing: Query: star trek 19-20% with Document: kirk mccoy 17-20% with Document: mccoy kirk So word order seems irrelevant. 12-14% with Document: mccoy kirk mccoy Adding more mccoy seems irrelevant, even negative. 53-56% with Document: kirk mccoy kirk But more kirk = higher LDA. This makes sense, but I don't see this aspect described in the LDA info from seoMoz so far. 56-60% with Document: kirk kirk mccoy 63-68% with Document: kirk kirk kirk mccoy 3 kirks to one mccoy seems best for LDA %. 34-36% with Document: mccoy kirk mccoy kirk 38-48% with Document: kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy kirk mccoy 98% with Document: kirk kirk kirk kirk kirk kirk mccoy uhura chekov 98% with Document: kirk mccoy uhura chekov It's like I need more words in the target LDA space, even if they're repeats, but unique words from the LDA space did best. In a separate test with real content from a web site, it seemed that having a "natural" keyword density helped. Sorted unique words ranked the lowest, doubling all unique words was higher, but a natural keyword density did better.
 Cancel
 - Alan Mosley
 
 2010-09-11T11:46:00-07:00
 
 I think ben stated that keyword stuffing may give you a higher score but doubts it will give you higher rankings, we have to use a bitt of common sence here
 
 1 0
 
 I think ben stated that keyword stuffing may give you a higher score but doubts it will give you higher rankings, we have to use a bitt of common sence here
 Cancel
IanDouglas

2010-09-10T06:01:17-07:00

Yet another fantastic post. I was lacking knowledge in this field. Thanks so much the information you covered. It enlightened =) me to say the least.

2 0

Yet another fantastic post. I was lacking knowledge in this field. Thanks so much the information you covered. It enlightened =) me to say the least.
Cancel
NueMD

2010-09-10T07:48:31-07:00

It sounds fine to me - a little quiet but just cranked it up and it's fine. Thanks for the video on a complicated subject - doesn't seem so complicated anymore.

2 0

It sounds fine to me - a little quiet but just cranked it up and it's fine. Thanks for the video on a complicated subject - doesn't seem so complicated anymore.
Cancel
michael_housebook

2010-09-10T04:24:15-07:00

Good WBF, I do understand the issue better now.

I just check pro tool section it look much more organized. Great work seomoz.

2 0

Good WBF, I do understand the issue better now. I just check pro tool section it look much more organized. Great work seomoz.
Cancel
Hibba

2010-09-10T03:14:44-07:00

Interesting theory, but as benniog said, i don't think i'm going to change the way i write soon. The hardest thing in my opinion is to figure out what words to use to improve your score. The Star Trek example is quit clear but in real live there are much more subjects to take into account.

In short, interesting theory which is hard to implement. But i keep following the research you guys are doing on this subject. Are there any plans to incluid foreign language in this tool?

2 0

Interesting theory, but as benniog said, i don't think i'm going to change the way i write soon. The hardest thing in my opinion is to figure out what words to use to improve your score. The Star Trek example is quit clear but in real live there are much more subjects to take into account. In short, interesting theory which is hard to implement. But i keep following the research you guys are doing on this subject. Are there any plans to incluid foreign language in this tool?
Cancel
Himanshu Sharma

2010-09-10T00:50:31-07:00

When i first read 25 minutes, i got scared. I thought i will be dealing with some advanced stats or something. But there was nothing like that. However i am still not sure how to improve LDA score. All i get is write for your audience and not for search engines. Should i copy the words of higher LDA scores pages and use them in my copy?

2 0

When i first read 25 minutes, i got scared. I thought i will be dealing with some advanced stats or something. But there was nothing like that. However i am still not sure how to improve LDA score. All i get is write for your audience and not for search engines. Should i copy the words of higher LDA scores pages and use them in my copy?
Cancel
- Gianluca Fiorelli
 
 2010-09-10T08:16:26-07:00
 
 Maybe simply be aware that the way the write contents is probably a factor why they rank better... but just one between many others.
 
 3 0
 
 Maybe simply be aware that the way the write contents is probably a factor why they rank better... but just one between many others.
 Cancel
 - Matt Shoffner
 
 2010-09-10T10:32:48-07:00
 
 I agree... I think the point is to realize that context matters, and it is harder to "fake" with keyword stuffing. I cross referenced word usage from a client's site and competitors about 4 months ago and found that my client was missing some very relevant keywords... we included them and found our long tail traffic increased. The long tail seems to be another element of this discussion that perhaps should get more airtime (especially with Google's more recent changes).
 
 2 0
 
 I agree... I think the point is to realize that context matters, and it is harder to "fake" with keyword stuffing. I cross referenced word usage from a client's site and competitors about 4 months ago and found that my client was missing some very relevant keywords... we included them and found our long tail traffic increased. The long tail seems to be another element of this discussion that perhaps should get more airtime (especially with Google's more recent changes).
 Cancel
Carter Cole

2010-09-10T08:39:53-07:00

Any chance your going to publish the keywords and weights (are there weights?) of the topics you find... it would be way cool to kinda reverse it and show how you can make the relevance stronger rather than trying to guess what the coinsurance and topic words are

It seems liek the problem with LDA is the same as LSI if you have a large enough sample set you can build more topics but its all based on sample size and the same way that you run out of data to compute recommendations for algorithms like facebook or netflix you can only get a limited number of words back to use and for most "topics" there may not be enough data in the sample to build out a good list

could content is king be causing some of this stuff to kick in b/c there are more words that google can look for these more advanced metrics that are built off the Google giant index of content so natural writing will show what the topic is on (thats the whole point right)

cartercole edited 2010-09-10T08:41:18-07:00
2 0

Any chance your going to publish the keywords and weights (are there weights?) of the topics you find... it would be way cool to kinda reverse it and show how you can make the relevance stronger rather than trying to guess what the coinsurance and topic words are It seems liek the problem with LDA is the same as LSI if you have a large enough sample set you can build more topics but its all based on sample size and the same way that you run out of data to compute recommendations for algorithms like facebook or netflix you can only get a limited number of words back to use and for most "topics" there may not be enough data in the sample to build out a good list could content is king be causing some of this stuff to kick in b/c there are more words that google can look for these more advanced metrics that are built off the Google giant index of content so natural writing will show what the topic is on (thats the whole point right)
Cancel
- Rand Fishkin
 
 2010-09-10T22:29:05-07:00
 
 We'll certainly try to do more publishing as we get more sophisticated with the tool, process and testing. At one time in the past, Google had a labs tool where you could plug in a site and it would give you the topic/category it felt was most applicable. That could be something for us to rebuild - I was always frustrated they took it away, but my understanding is that much like their link data, they felt there was too much potential for abuse/manipulation by spammers.
 
 2 0
 
 We'll certainly try to do more publishing as we get more sophisticated with the tool, process and testing. At one time in the past, Google had a labs tool where you could plug in a site and it would give you the topic/category it felt was most applicable. That could be something for us to rebuild - I was always frustrated they took it away, but my understanding is that much like their link data, they felt there was too much potential for abuse/manipulation by spammers.
 Cancel
 - Vanessa Fox
 
 2010-09-12T11:54:01-07:00
 
 Google provides this now as part of AdPlanner: www.google.com/adplanner
 
 4 0
 
 Google provides this now as part of AdPlanner: www.google.com/adplanner
 Cancel
 - Rand Fishkin
 
 2010-09-12T15:55:49-07:00
 
 Awesome! Hadn't really it was powered by the same backend, but it definitely looks more beefed up (deeper categories, etc). Here's an example for SEOmoz.org. Thanks V!
 
 1 0
 
 Awesome! Hadn't really it was powered by the same backend, but it definitely looks more beefed up (deeper categories, etc). Here's an <a href="https://www.google.com/adplanner/planning/site_profile#siteDetails?identifier=www.seomoz.org&lp=true" rel="nofollow">example for SEOmoz.org</a>. Thanks V!
 Cancel
Gareth Thomas

2010-09-10T07:05:59-07:00

Same here. Very difficult to follow the discussion on the video.

Transcript saves the day.

Ideally a good quality video sound and transcript would be best.

Lets not distract from the LDA issue though.

At least we get to hear/read the LDA discussion because of Seomoz Transparency

2 0

Same here. Very difficult to follow the discussion on the video. Transcript saves the day. Ideally a good quality video sound and transcript would be best. Lets not distract from the LDA issue though. At least we get to hear/read the LDA discussion because of Seomoz Transparency
Cancel
Glenn Crocker

2010-09-14T11:17:32-07:00

Ben, I'm confused about how keyword density in documents changes LDA %. Examples:

Keyword: star trek

LDA 20% with Document: kirk mccoy

LDA 67% with Document: kirk kirk kirk mccoy

I don't see this effect described in the LDA videos & papers I've looked at. Is this an artifact of your implementation, or an expected result with LDA?

1 0

Ben, I'm confused about how keyword density in documents changes LDA %. Examples: Keyword: star trek LDA 20% with Document: kirk mccoy LDA 67% with Document: kirk kirk kirk mccoy I don't see this effect described in the LDA videos & papers I've looked at. Is this an artifact of your implementation, or an expected result with LDA?
Cancel
- Rand Fishkin
 
 2010-09-14T11:48:13-07:00
 
 It's not an artifact per se - I'd think it would be expected. A document or piece of content that mentions the word "Kirk" many times is more likely to be about Star Trek than one that does so only once. The key isn't that the tool perfectly measures your exact usage, but rather that you can see strengths/weaknesses of content blocks from a topic modeling perspective.
 
 Keyword density isn't a metric the engines use and it's not one we use, either. It's just that having 7 words of content where a topically relevant term is employed multiple times is more likely to be "relevant" than a content block that's very small and contains fewer relevant words.
 
 I think this is why KW Density persists as a myth - some SEOs will "improve" their KW density, see their rankings rise, and assume cause and effect. It's a big part of why the myth of this metric is so hard to fight, because technically, sometimes improving it might actually help (but that doesn't mean it's a good way to measure things or an optimization tactic to pursue).
 
 1 0
 
 It's not an artifact per se - I'd think it would be expected. A document or piece of content that mentions the word "Kirk" many times is more likely to be about Star Trek than one that does so only once. The key isn't that the tool perfectly measures your exact usage, but rather that you can see strengths/weaknesses of content blocks from a topic modeling perspective. Keyword density isn't a metric the engines use and it's not one we use, either. It's just that having 7 words of content where a topically relevant term is employed multiple times is more likely to be "relevant" than a content block that's very small and contains fewer relevant words. I think this is why KW Density persists as a myth - some SEOs will "improve" their KW density, see their rankings rise, and assume cause and effect. It's a big part of why the myth of this metric is so hard to fight, because technically, sometimes improving it might actually help (but that doesn't mean it's a good way to measure things or an optimization tactic to pursue).
 Cancel
 - Glenn Crocker
 
 2010-09-14T12:16:19-07:00
 
 I think I'm trying to scratch the surface on the consine similarity process. My understanding is that the LDA Tool looks at the search phrase words to figure out what topics are related, then looks at the document to see what topics it covers, and then assigns a % relatedness.
 
 What's surprising to me is that if I have "kirk" in a document once, I should have the topics related to that word covered. So more kirks in the document shouldn't cause LDA to go up.
 
 So I'm missing something here. I'm trying to build a tool to help find keyword blind spots from looking at LDA % across sites for a given keyword, but I can't tell whether I can get away with using unique words or if I have to pass through repetitions in order to keep the keyword density natural.
 
 1 0
 
 I think I'm trying to scratch the surface on the consine similarity process. My understanding is that the LDA Tool looks at the search phrase words to figure out what topics are related, then looks at the document to see what topics it covers, and then assigns a % relatedness. What's surprising to me is that if I have "kirk" in a document once, I should have the topics related to that word covered. So more kirks in the document shouldn't cause LDA to go up. So I'm missing something here. I'm trying to build a tool to help find keyword blind spots from looking at LDA % across sites for a given keyword, but I can't tell whether I can get away with using unique words or if I have to pass through repetitions in order to keep the keyword density natural.
 Cancel
Associate

Will Critchlow
Associate

2010-09-10T12:28:54-07:00

Shout out to whoever made the transcript btw - it's like you just picked the hardest words you could find for them to decipher. Good job :)

1 0

Shout out to whoever made the transcript btw - it's like you just picked the hardest words you could find for them to decipher. Good job :)
Cancel
Dwight Santos

2010-09-20T22:31:38-07:00

Oh man. The best part was when Megan Fox came up and it took a couple of seconds to get her name - Ben could not have been less enthusiastic about her!

Just goes to show how hard these guys work and have no time for petty Transformers movies haha.

1 0

Oh man. The best part was when Megan Fox came up and it took a couple of seconds to get her name - Ben could not have been less enthusiastic about her! Just goes to show how hard these guys work and have no time for petty Transformers movies haha.
Cancel
Chris Horner

2010-09-24T02:12:28-07:00

LSI LDA it all sounds like some kind of new SEO drug :)

chrishorner edited 2010-09-24T02:12:56-07:00
1 0

LSI LDA it all sounds like some kind of new SEO drug :)
Cancel
Unmatched Solutions

2010-09-12T23:55:22-07:00

i have little bit confuse about how to improve LDA score?

Can u guys help me?

1 0

i have little bit confuse about how to improve LDA score? Can u guys help me?
Cancel
- Rand Fishkin
 
 2010-09-13T12:27:12-07:00
 
 The basic concept is to do a better job making the words and phrases describe the keyword/content/query you're targeting. For example, if you are trying to rank well for "The Rolling Stones" but describe them only in passing and focus primarily in the subject matter abou a Peanut Butter and Jelly sandwich you ate once at their concert, that may be less topically relevant than including information and words about the band's members, history, records, songs, concerts, style, etc.
 
 While the LDA tool we've built is certainly an imperfect and imprecise model, it may help you to measure the degree to which your content (or that of others) is "topically relevant" to a particular term/phrase.
 
 1 0
 
 The basic concept is to do a better job making the words and phrases describe the keyword/content/query you're targeting. For example, if you are trying to rank well for "The Rolling Stones" but describe them only in passing and focus primarily in the subject matter abou a Peanut Butter and Jelly sandwich you ate once at their concert, that may be less topically relevant than including information and words about the band's members, history, records, songs, concerts, style, etc. While the LDA tool we've built is certainly an imperfect and imprecise model, it may help you to measure the degree to which your content (or that of others) is "topically relevant" to a particular term/phrase.
 Cancel
- Matt Shoffner
 
 2010-09-14T07:56:16-07:00
 
 Firstly, I would say that you shouldn't focus on "improving the score" and should focus on being more topically relevant. There is no evidence to prove that a higher LDA score will increase your rankings. There is evidence that suggests that the score correlates well with the SERPs. EDIT: so in theory, a higher score will lead to higher rankings.
 
 Use the tool on your site, as well as on your competitor's. cross-reference relevant terms that appear often on your competitors' pages. Assemble a list of words you are not using on a specific topic and try to fluidly add them in. Blog posts may be good places to do this.
 
 This help?
 
 Inbounded edited 2010-09-14T07:58:41-07:00
 1 0
 
 Firstly, I would say that you shouldn't focus on "improving the score" and should focus on being more topically relevant. There is no evidence to prove that a higher LDA score will increase your rankings. There is evidence that suggests that the score correlates well with the SERPs. EDIT: so in theory, a higher score will lead to higher rankings. Use the tool on your site, as well as on your competitor's. cross-reference relevant terms that appear often on your competitors' pages. Assemble a list of words you are not using on a specific topic and try to fluidly add them in. Blog posts may be good places to do this. This help?
 Cancel
j.gabaix

2010-09-10T15:40:28-07:00

Tremendously helpful videos.

I understand that synonyms are irrelevant for LDA .

What about "related searches" and "wonderwheel"? They seem good tools for this.

1 0

Tremendously helpful videos. I understand that synonyms are irrelevant for LDA . What about "related searches" and "wonderwheel"? They seem good tools for this.
Cancel
- Alan Mosley
 
 2010-09-11T12:00:18-07:00
 
 Related searches can change context or can be ambiguous, How I am attacking this is, I take my keyword, make a list of terms that reinforce the context, then make a list of terms that may cause confusing or ambiguous. Then try to use the right list.
 
 When writing for humans we try not to confuse and state our context, well we need to do this for machines also, as they can be easily confused and manipulated.
 
 1 0
 
 Related searches can change context or can be ambiguous, How I am attacking this is, I take my keyword, make a list of terms that reinforce the context, then make a list of terms that may cause confusing or ambiguous. Then try to use the right list. When writing for humans we try not to confuse and state our context, well we need to do this for machines also, as they can be easily confused and manipulated.
 Cancel
BluebandMedia

2010-09-10T10:51:13-07:00

Interesting post about LDA and SEO.

I will be looking into this topic further.

1 0

Interesting post about LDA and SEO. I will be looking into this topic further. 
Cancel
al sefati

2010-09-10T17:00:24-07:00

with all do respect to seomoz staff...aren't you guys putting too much focus on this LDA stuff? It almost feels like you are trying to promote yourself\tool based on LDA...(sorry I am a bit candid)

Considering most SEOs are marketers not mathematicians, I think more focus on natural good quality content that is both user and search engine friendly maybe a better strategy to grab attentions...

other than that you guys rock :)

1 0

with all do respect to seomoz staff...aren't you guys putting too much focus on this LDA stuff? It almost feels like you are trying to promote yourself\tool based on LDA...(sorry I am a bit candid) Considering most SEOs are marketers not mathematicians, I think more focus on natural good quality content that is both user and search engine friendly maybe a better strategy to grab attentions... other than that you guys rock :)
Cancel
- Rand Fishkin
 
 2010-09-10T22:25:42-07:00
 
 Hmm... I guess we feel a bit differently about this. LDA seems like an interesting topic modeling system and something SEOs generally aren't familar with, so it requires a good deal of explanation (and I don't know that we've yet to do a comprehensive job, though we're trying).
 
 It's also a bit odd to say we're "selling" ourselves with it - it's not yet part of any of our paid tools. There's the free research and the free tool for testing, but we've yet to productize it in a monetized way. I think we'll be holding off on that until we feel confident that it really is valuable for improving rankings.
 
 If there's things we're doing that are creating this impression, let me know. I certainly don't want this being misconstrued.
 
 randfish edited 2010-09-11T12:58:21-07:00
 3 0
 
 Hmm... I guess we feel a bit differently about this. LDA seems like an interesting topic modeling system and something SEOs generally aren't familar with, so it requires a good deal of explanation (and I don't know that we've yet to do a comprehensive job, though we're trying). It's also a bit odd to say we're "selling" ourselves with it - it's not yet part of any of our paid tools. There's the free research and the free tool for testing, but we've yet to productize it in a monetized way. I think we'll be holding off on that until we feel confident that it really is valuable for improving rankings. If there's things we're doing that are creating this impression, let me know. I certainly don't want this being misconstrued.
 Cancel
 - WeRASkitzzo
 
 2010-09-11T15:53:31-07:00
 
 Rand, you know as well as I do that the attention being generated by this (as well as links of course) improve your bottom line. Just because this tool is free for now, doesn't mean you're not benefiting from it financially.
 
 1 3
 
 Rand, you know as well as I do that the attention being generated by this (as well as links of course) improve your bottom line. Just because this tool is free for now, doesn't mean you're not benefiting from it financially.
 Cancel
 - Rand Fishkin
 
 2010-09-12T16:04:14-07:00
 
 Ben - are you suggesting we should be a non-profit? Or act like one? My sense (which, admittedly, could be biased) is that across the SEO market, we provide more free resources (tools, guides, blog posts, research, presentations, etc.) than any other company our size. Granted, this has marketing benefits and is a marketing channel, but is it your opinion that it's an illogical, unprofitable or evil practice?
 
 1 0
 
 Ben - are you suggesting we should be a non-profit? Or act like one? My sense (which, admittedly, could be biased) is that across the SEO market, we provide more free resources (tools, guides, blog posts, research, presentations, etc.) than any other company our size. Granted, this has marketing benefits and is a marketing channel, but is it your opinion that it's an illogical, unprofitable or evil practice?
 Cancel
 - WeRASkitzzo
 
 2010-09-13T10:16:27-07:00
 
 Rand, I'm not at all saying you should be non-profit or that producing free tools is evil. My comment was in response to your statement "It's also a bit odd to say we're "selling" ourselves with it."
 
 Both you and Ben (in comments elsewhere) have acted like there's no monetary benefit to producing this tool and hyping up the results as a big breakthrough in SEO.
 
 That's simply not the case as all the buzz surrounding this and other studies increases brand awareness, generates inbound links, and generally improves your bottom line.
 
 I have no idea whether this incentive influenced your conclusions in any way (not even close to a stats guy) but I think we as a community should examine the study as we would any other that came from a company with a financial incentive built into the outcome.
 
 1 3
 
 Rand, I'm not at all saying you should be non-profit or that producing free tools is evil. My comment was in response to your statement "It's also a bit odd to say we're "selling" ourselves with it." Both you and Ben (in comments elsewhere) have acted like there's no monetary benefit to producing this tool and hyping up the results as a big breakthrough in SEO. That's simply not the case as all the buzz surrounding this and other studies increases brand awareness, generates inbound links, and generally improves your bottom line. I have no idea whether this incentive influenced your conclusions in any way (not even close to a stats guy) but I think we as a community should examine the study as we would any other that came from a company with a financial incentive built into the outcome. 
 Cancel
 
 Rand Fishkin
 
 2010-09-13T12:34:51-07:00
 
 That skepticism is warranted is a fair point, but I think you're backing down substantively from the confrontational and negative connotation of your earlier remarks. That's great if it's the case, but then it sounds as though your disagreement/issue is far less systemic (and one one which we agree).
 
 2 0
 
 That skepticism is warranted is a fair point, but I think you're backing down substantively from the confrontational and negative connotation of your earlier remarks. That's great if it's the case, but then it sounds as though your disagreement/issue is far less systemic (and one one which we agree).
 Cancel
 - Norman Newsome
 
 2010-09-18T19:25:18-07:00
 
 You just sound jealous. Fact is, this brings us one step closer to understanding a system that, each day, becomes more complex. Whether this is a huge step or a half step is yet to be seen. However, complaining that when SEOMOZ does their job they benefit from a job well done is just plain dumb.
 
 1 0
 
 You just sound jealous. Fact is, this brings us one step closer to understanding a system that, each day, becomes more complex. Whether this is a huge step or a half step is yet to be seen. However, complaining that when SEOMOZ does their job they benefit from a job well done is just plain dumb.
 Cancel
 - SDFF
 
 2010-09-12T09:50:10-07:00
 
 Rand - promote away, man. You guys spent the money, you spent the time, this is your frikkin' website for crying out loud. Asking for and receiving feedback on a tool you make available is one thing, but to be criticized for capitalizing on your own product, on your own website... that is where I personally would draw the line. Maybe you ought to change your .org to .com. SEOmoz is not a charity, people.
 
 4 0
 
 Rand - promote away, man. You guys spent the money, you spent the time, this is your frikkin' website for crying out loud. Asking for and receiving feedback on a tool you make available is one thing, but to be criticized for capitalizing on your own product, on your own website... that is where I personally would draw the line. Maybe you ought to change your .org to .com. SEOmoz is not a charity, people.
 Cancel
 - WeRASkitzzo
 
 2010-09-13T10:21:47-07:00
 
 The commenter that Rand responded to criticized them for selling themselves with it, not me. I have no problem with them selling themselves with everything they do, my issue (in this limited sense) is with Rand acting like they have no financial investment in this. They've spent time & money on this, and there's money to be made if they can brand themselves as the company that has tools that reverese engineer the Google algo.
 
 Just because it's not a part of the paid tools, doesn't mean they're not "selling themselves with it."
 
 1 4
 
 The commenter that Rand responded to criticized them for selling themselves with it, not me. I have no problem with them selling themselves with everything they do, my issue (in this limited sense) is with Rand acting like they have no financial investment in this. They've spent time & money on this, and there's money to be made if they can brand themselves as the company that has tools that reverese engineer the Google algo. Just because it's not a part of the paid tools, doesn't mean they're not "selling themselves with it."
 Cancel
 - Matt Shoffner
 
 2010-09-14T07:52:37-07:00
 
 WeRASkitzzo,
 
 Yes, free tools are types of promotions. Yes, SEOmoz is benefiting from the attention. That is obvious. Rand was very upfront with his plans for making this tool part of the pro package in the future last week in a post, In a post last week, "We're leaving the Labs LDA tool free for anyone to use for a while, as we'd love to hear what the community thinks of the process and want to get as broad input as possible. Future iterations may be PRO-only."
 
 I think you are saying that it is disingenuous to act as if SEOmoz has nothing to gain from the discussion, even if the tool is currently free. Fair point, but couldn't that be said for anything they write? They have a blog to promote themselves, by extension every post is promotional (albeit, their style is educational). I would like to think that this community is rational and will skeptically use this tool and determine its usefulness for themselves.
 
 1 0
 
 WeRASkitzzo, Yes, free tools are types of promotions. Yes, SEOmoz is benefiting from the attention. That is obvious. Rand was very upfront with his plans for making this tool part of the pro package in the future last week in a post, In a post last week, "We're leaving <a href="https://www.seomoz.org/labs/lda" rel="nofollow">the Labs LDA tool</a> free for anyone to use for a while, as we'd love to hear what the community thinks of the process and want to get as broad input as possible. Future iterations may be <a href="https://www.seomoz.org/gopro" rel="nofollow">PRO-only</a>." I think you are saying that it is disingenuous to act as if SEOmoz has nothing to gain from the discussion, even if the tool is currently free. Fair point, but couldn't that be said for anything they write? They have a blog to promote themselves, by extension every post is promotional (albeit, their style is educational). I would like to think that this community is rational and will skeptically use this tool and determine its usefulness for themselves.
 Cancel
 - Matt Shoffner
 
 2010-09-12T09:57:58-07:00
 
 If I were going to design a search engine I would include topic modeling in it. Who wouldn't? If you want the most relevant results you have to make sure the results are on the right topic, and with Google Instant, topicality is even more important. The tool isn't perfect, we know that. I for one am interested to see where the LDA tool goes in the future.
 
 I've used the LDA tool on my client's pages and have identified some phrases that are relveant that we were not using, including these terms in future blog posts seems like a good strategy for the long tail.
 
 Also, I have no illusions that SEOmoz is in business to make a profit. That is why they do everything they do. Offering a new tool for free to seek feedback is a wise business decision as it will improve the tool faster. The tool can be turned into a earning proposition later, once its value is established (if it proves valuable). It is valid to question the motives of anyone who sells solutions to problems you didn't know you had. I do think this discussion is very relevant to SEOs though. We all know that content matters (keywords, etc.), but I think it is relevant to note that CONTEXT also matters (topicality, etc.).
 
 4 0
 
 If I were going to design a search engine I would include topic modeling in it. Who wouldn't? If you want the most relevant results you have to make sure the results are on the right topic, and with Google Instant, topicality is even more important. The tool isn't perfect, we know that. I for one am interested to see where the LDA tool goes in the future. I've used the LDA tool on my client's pages and have identified some phrases that are relveant that we were not using, including these terms in future blog posts seems like a good strategy for the long tail. Also, I have no illusions that SEOmoz is in business to make a profit. That is why they do everything they do. Offering a new tool for free to seek feedback is a wise business decision as it will improve the tool faster. The tool can be turned into a earning proposition later, once its value is established (if it proves valuable). It is valid to question the motives of anyone who sells solutions to problems you didn't know you had. I do think this discussion is very relevant to SEOs though. We all know that content matters (keywords, etc.), but I think it is relevant to note that CONTEXT also matters (topicality, etc.).
 Cancel
- Alan Mosley
 
 2010-09-11T12:05:32-07:00
 
 I think the corrations warrent the hype, to asume that good well written content shows the same corraltion may be folly, but having said that maybe its something to test, do good content writers get good LDA scores and good rankings, I dont know how to test that as it would mean someone has to judge what iis well written content.
 
 1 0
 
 I think the corrations warrent the hype, to asume that good well written content shows the same corraltion may be folly, but having said that maybe its something to test, do good content writers get good LDA scores and good rankings, I dont know how to test that as it would mean someone has to judge what iis well written content.
 Cancel
Okan Kortan

2010-09-10T15:42:15-07:00

I'm trying to understand what they say, but I have to read the transcript to clearly understand what is going on.. But when I'm reading the transcript I miss the drawings and other useful stuff on whiteboard..

Subtitles maybe?

1 0

I'm trying to understand what they say, but I have to read the transcript to clearly understand what is going on.. But when I'm reading the transcript I miss the drawings and other useful stuff on whiteboard.. Subtitles maybe?
Cancel
Marco Williams

2010-09-11T01:14:32-07:00

I do agree with this post a lot. Thanks for posting. Although I've a little problem with the sound in the video still yeah the script helps a lot. I always believe that content is still the best in order to have a good reputation online that in return can bring you tons of visitors though it's not an over night success. On the other hand, the combination of SEO and etc. will be an added factor for your site to succeed but make sure that you are on the right track or else everything will be futile

caseyhen edited 2010-09-12T18:38:21-07:00
1 0

I do agree with this post a lot. Thanks for posting. Although I've a little problem with the sound in the video still yeah the script helps a lot. I always believe that content is still the best in order to have a good reputation online that in return can bring you tons of visitors though it's not an over night success. On the other hand, the combination of SEO and etc. will be an added factor for your site to succeed but make sure that you are on the right track or else everything will be futile
Cancel

Post Analytics

Video Transcription

Comments 101

Log in to Moz

Don't have an account?