Linkscape's September Update + Feedback

Comments 31

Please keep your comments TAGFEE by following the community etiquette.

E-mail me when new comments are posted

Sort by:

Comments are closed on posts more than 30 days old. Got a burning question? Head to our Q&A section to start a new conversation.

SuperlativB

2011-09-19T01:18:36-07:00

I just LOVE this, linkscape updates every 4 weeks. Randfish for president?

5 0

I just LOVE this, linkscape updates every 4 weeks. Randfish for president?
Cancel
Sha Menz

2011-09-18T02:28:45-07:00

Hi Rand,

Thanks for taking the time to keep the information flowing and for explaining the challenges that need to be overcome so we can all have some basic understanding of the way ahead.

I have to say that managing all the complexities in play seems just mind boggling to me!

It's great to hear another update is not too far away.

It would have been nice if things could be easier for the team as they put their collective shoulder to the wheel, but as people who work in the world of SEO, we all know that the easiest path is usually not the best. :-)

I'll be looking forward to being a part of this next phase of growth for Linkscape and OSE. Sending feedback brings with it an extra little rush of excitement now...how awesome would it be if it were my little comment that unlocks some piece of the puzzle along the way?! :-)

I'll borrow from something I said in a recent Q&A thread to send a little message both to the SEOmoz team and to the rest of this awesome community who I know are committed to helping in the process:

"Also - when the problem seems insurmountable, remember two things ...
- The heart of this community is TAGFEE and that means help is close by.
- Best way to eat an elephant - one mouthful at a time!!"
Sha

ShaMenz edited 2011-09-18T02:33:28-07:00
4 0
Hi Rand, Thanks for taking the time to keep the information flowing and for explaining the challenges that need to be overcome so we can all have some basic understanding of the way ahead. I have to say that managing all the complexities in play seems just mind boggling to me! It's great to hear another update is not too far away. It would have been nice if things could be easier for the team as they put their collective shoulder to the wheel, but as people who work in the world of SEO, we all know that the easiest path is usually not the best. :-) I'll be looking forward to being a part of this next phase of growth for Linkscape and OSE. Sending feedback brings with it an extra little rush of excitement now...how awesome would it be if it were my little comment that unlocks some piece of the puzzle along the way?! :-) I'll borrow from something I said in a recent Q&A thread to send a little message both to the SEOmoz team and to the rest of this awesome community who I know are committed to helping in the process: "Also - when the problem seems insurmountable, remember two things ... <ul><li>The heart of this community is <a href="../q/how-are-you-implementing-tagfee">TAGFEE</a> and that means help is close by.</li> <li>Best way to eat an elephant - one mouthful at a time!!"</li> </ul> Sha
Cancel
sean459

2011-09-18T11:19:30-07:00

Love the tool but i get frustrated when i see link with high authority but its not cached by google. Would be nice if there was easy way to toss those links out. For example the tool loves the spammy garden web directory and crawls it super deep and scores it high..wheras google bot cuts off crawl much shallower.

3 0

Love the tool but i get frustrated when i see link with high authority but its not cached by google. Would be nice if there was easy way to toss those links out. For example the tool loves the spammy garden web directory and crawls it super deep and scores it high..wheras google bot cuts off crawl much shallower.
Cancel
- Sha Menz
 
 2011-09-18T14:04:10-07:00
 
 Hi Sean,
 
 That sounds like exactly the type of feedback that might be of use to the engineering team!
 
 If you haven't already sent a detailed explanation of the issue to the help team, then I hope that you will soon.
 
 Who knows, maybe it will be your comment that helps fit another piece into the puzzle! :)
 
 1 0
 
 Hi Sean, That sounds like exactly the type of feedback that might be of use to the engineering team! If you haven't already sent a detailed explanation of the issue to the help team, then I hope that you will soon. Who knows, maybe it will be your comment that helps fit another piece into the puzzle! :)
 Cancel
- Rand Fishkin
 
 2011-09-18T15:13:54-07:00
 
 If you've got some examples, that would be awesome! Like Google, we use mozRank (well, technically, they use PageRank, but the two algos are quite similar) to prioritize our crawl of the web, but unlike Google, we don't have sophisticated spam scores, so we often get "fooled" by those who manipulate the web's link graph. Working on a good spam metric is a priority we'll focus on following the upgrade of depth/breadth.
 
 If you want to send the specifics over, the feedback tab of OpenSiteExplorer (https://www.opensiteexplorer.org/) on the left-hand side is a great place, or you can email the help team.
 
 1 0
 
 If you've got some examples, that would be awesome! Like Google, we use mozRank (well, technically, they use PageRank, but the two algos are quite similar) to prioritize our crawl of the web, but unlike Google, we don't have sophisticated spam scores, so we often get "fooled" by those who manipulate the web's link graph. Working on a good spam metric is a priority we'll focus on following the upgrade of depth/breadth. If you want to send the specifics over, the feedback tab of OpenSiteExplorer (<a href="https://www.opensiteexplorer.org/" rel="nofollow">https://www.opensiteexplorer.org/</a>) on the left-hand side is a great place, or you can email the help team.
 Cancel
Jeff Sliger

2011-09-18T13:30:19-07:00

Thanks Rand,

It's exciting to see the progress your team has been able to achieve with Linkscape and the direction you're heading.

Since links are such a big part of the search engine equation we are extremely fortunate to have this awesome tool at our disposal.

Just reading the numbers you have thrown out here makes my head spin trying to calculate the shear computer power it must take.

Thanks to you and your team for such great work and for making it available to us.

2 0

Thanks Rand, It's exciting to see the progress your team has been able to achieve with Linkscape and the direction you're heading. Since links are such a big part of the search engine equation we are extremely fortunate to have this awesome tool at our disposal. Just reading the numbers you have thrown out here makes my head spin trying to calculate the shear computer power it must take. Thanks to you and your team for such great work and for making it available to us.
Cancel
ommrudraksha1

2011-09-19T00:41:33-07:00

No follow links are not that bad i think. still it is 2.22 means ok.

2 0

No follow links are not that bad i think. still it is 2.22 means ok.
Cancel
algogmbh_petra

2011-09-18T02:35:43-07:00

I think we all understand that this comprehensive crawling of the Linkscape update is hard stuff. If there are some problems rising we all know that you do your best to solve them.

We should be glad that you provide us the data at all!

2 0

I think we all understand that this comprehensive crawling of the Linkscape update is hard stuff. If there are some problems rising we all know that you do your best to solve them. We should be glad that you provide us the data at all!
Cancel
Kieran Daly

2011-09-18T06:16:39-07:00

Can you explain what this brings to the party (aka us) as I don't have the fullest understanding of the whole Linkscape benefit to us. I know of course you use it for metrics and helping getting deeper pictures of certain URLs but I regularly see URLs/domains that haven't been visited despite being around for a bit.

This isn't a critique as your post does explain well the effort required and the fact that you are competing against the big players.

But is it an uphill battle with no end and one that you will alweays be behind when compared to the bigger players?

3 1

Can you explain what this brings to the party (aka us) as I don't have the fullest understanding of the whole Linkscape benefit to us. I know of course you use it for metrics and helping getting deeper pictures of certain URLs but I regularly see URLs/domains that haven't been visited despite being around for a bit. This isn't a critique as your post does explain well the effort required and the fact that you are competing against the big players. But is it an uphill battle with no end and one that you will alweays be behind when compared to the bigger players?
Cancel
- Rand Fishkin
 
 2011-09-18T09:37:00-07:00
 
 The latest index updates, as you can see from the charts above, are better in some ways and worse in others. We estimate that, in total, Bing + Google keep ~130 million root domains and ~150 billion pages in their main indices at any given time. Our current scale doesn't let us get there, but in the not-to-far-future, we hope to achieve 75%+ and possibly 90%+ of what search engines keep from the web.
 
 My goal with explaining the challenges we're facing and laying out the roadmap is to do exactly what Gyorgy described above - make sure folks are informed rather than surprised.
 
 We'll likely never be 100% of Google/Bing in terms of all three dimensions - size, breadth and freshness - but we do think we can get very close, and we've seen that the closer we get, the better and more useful our tools and metrics become.
 
 6 0
 
 The latest index updates, as you can see from the charts above, are better in some ways and worse in others. We estimate that, in total, Bing + Google keep ~130 million root domains and ~150 billion pages in their main indices at any given time. Our current scale doesn't let us get there, but in the not-to-far-future, we hope to achieve 75%+ and possibly 90%+ of what search engines keep from the web. My goal with explaining the challenges we're facing and laying out the roadmap is to do exactly what Gyorgy described above - make sure folks are informed rather than surprised. We'll likely never be 100% of Google/Bing in terms of all three dimensions - size, breadth and freshness - but we do think we can get very close, and we've seen that the closer we get, the better and more useful our tools and metrics become.
 Cancel
 - Pierre-Yves
 
 2011-09-18T11:25:49-07:00
 
 Rand, as a Sysadmin/Linux dude (with a strong interest in data caching, so I now what it means since I know you guys got a flat file architecture) before even being an SEO, I would be really excited to see what SEOmoz datacenter looks like...
 
 S-I-C-K update!!
 
 Keep up the great work guys!
 
 2 0
 
 Rand, as a Sysadmin/Linux dude (with a strong interest in data caching, so I now what it means since I know you guys got a flat file architecture) before even being an SEO, I would be really excited to see what SEOmoz datacenter looks like... S-I-C-K update!! Keep up the great work guys!
 Cancel
 - Raunaq Rayait (BlurbPoint.com)
 
 2011-09-19T01:50:41-07:00
 
 @Rand Sir these is the first time I come to know that SEOMOZ is working on such a great project. Well I am not that smart or intelligent to give any suggestion on such a big subject but still I am doing this comment is just because I like the way you shared your improvement statistic with us is really very respectable and noble. Really I loved this post very much and wish you and your team all the best for bright future with your project. Thanks.
 
 RaunaqRayait edited 2011-09-19T01:51:48-07:00
 1 0
 
 @Rand Sir these is the first time I come to know that SEOMOZ is working on such a great project. Well I am not that smart or intelligent to give any suggestion on such a big subject but still I am doing this comment is just because I like the way you shared your improvement statistic with us is really very respectable and noble. Really I loved this post very much and wish you and your team all the best for bright future with your project. Thanks.
 Cancel
PeterAlexLeigh

2011-09-19T07:18:10-07:00

Good stuff. I'll be happy to see some fresher links

1 0

Good stuff. I'll be happy to see some fresher links
Cancel
- irldonalb
 
 2011-09-20T03:56:10-07:00
 
 Same here. I can't wait.
 
 1 0
 
 Same here. I can't wait.
 Cancel
eyepaq

2011-09-19T08:38:35-07:00

Based on direct case study compare Majestic and Seo Moz data are more or less the same - I think that means that both are close to the actual true data.

I use SEO Moz because it looks much better :)

eyepaq edited 2011-09-19T08:39:11-07:00
1 0

Based on direct case study compare Majestic and Seo Moz data are more or less the same - I think that means that both are close to the actual true data. I use SEO Moz because it looks much better :)
Cancel
SEOBestTips

2011-09-19T20:50:31-07:00

Very interesting data from the Linkscape crawler, its amazing that so much data can be "farmed" of sorts from Google without any issues. Gotta be tough to get all those inner page URL's and I see it's mentioned that you've had some success fixing that, much props.

1 0

Very interesting data from the Linkscape crawler, its amazing that so much data can be "farmed" of sorts from Google without any issues. Gotta be tough to get all those inner page URL's and I see it's mentioned that you've had some success fixing that, much props. 
Cancel
Paul Martin

2011-09-19T05:33:29-07:00

Hi Rand,

Well it’s finally nice to see a public acknowledgement to this issue. We have been badgering you and your team constantly for over 2 months (since the back end of July) about this and didn’t really get a straight answer on the matter until now.

It’s a shame that your whole TAGFEE ethos slipped a little here as people are paying substantial sums of money to use Linkscape (and all tools based on Linkscape data) and to not be told that 75%+ of the data is effectively unworkable is a little sad.

I guess it also has further repercussions as many people (I guess foolishly) blindly take your data for granted and base SEO campaigns upon it; many of which will have been based on defunct data sets.

It’s especially a shame for ourselves as we had just built an awesome tool using your full API access which we haven’t been able to use since the summer because of the issue. (SEOmoz were kind enough to halt our payments for the API though until the matter is resolved).

Glad to see you know what the problem is though and that you have put the PR wheels in motion with this post to try and appease people like myself from wondering if you were trying to keep the whole issue hush-hush!

I look forward to the fix and using your API again!

Paul

PaulMartin edited 2011-09-19T05:36:44-07:00
3 2

Hi Rand, Well it’s finally nice to see a public acknowledgement to this issue. We have been badgering you and your team constantly for over 2 months (since the back end of July) about this and didn’t really get a straight answer on the matter until now. It’s a shame that your whole <a href="../q/how-are-you-implementing-tagfee">TAGFEE</a> ethos slipped a little here as people are paying substantial sums of money to use Linkscape (and all tools based on Linkscape data) and to not be told that 75%+ of the data is effectively unworkable is a little sad. I guess it also has further repercussions as many people (I guess foolishly) blindly take your data for granted and base SEO campaigns upon it; many of which will have been based on defunct data sets. It’s especially a shame for ourselves as we had just built an awesome tool using your full API access which we haven’t been able to use since the summer because of the issue. (SEOmoz were kind enough to halt our payments for the API though until the matter is resolved). Glad to see you know what the problem is though and that you have put the PR wheels in motion with this post to try and appease people like myself from wondering if you were trying to keep the whole issue hush-hush! I look forward to the fix and using your API again! Paul
Cancel
- Rand Fishkin
 
 2011-09-19T12:08:52-07:00
 
 Hi Paul - really appreciate your feedback and understand your frustration. Let me address a few specific points you brought up:
 - Re: challenges since July - we rolled out the first new index with these features at the end of July, and I noted some of the issues in the post along with the release of OSE V3.
 - As soon as folks brought this up in Q+A, we responded within 12 hours. We'd like to be even faster, but wanted to make sure we looped in the right people - you can see the SEOmoz Twitter account getting on top of this within the first few tweets after we launched this index.
 - If you ever feel like you're not getting a response fast enough or something isn't TAGFEE, please feel free to email me directly - [email protected]. Most of the time, I'm very fast on email (24 hours or less). I'm really sorry to hear that you felt ignored for 2 months! That's awful, and shouldn't ever happen.
 - Re: 75% of the data unworkable... I'm not sure where that number comes from, but I don't think it's accurate. 100% of the data that's in Linkscape is accurate (as far as links, linking root domains, nofollows, anchor text, etc). Some of it is not as fresh as we'd like (anchor text was ~1 month old until this update) and we're not getting all the data the engines are, but what's there is correct, and the metrics are among the best you can find for predicting rankings, doing competitive comparisons, etc.
 If you have any specific questions or issues, please feel free to email me direct - happy to follow up more. It sounds like you might be perceiving or feeling even more problems than the ones of scale/reach I described (and Kate noted) above.
 
 Cheers!
 
 2 0
 Hi Paul - really appreciate your feedback and understand your frustration. Let me address a few specific points you brought up: <ul><li>Re: challenges since July - we rolled out the first new index with these features at the end of July, and I noted some of the issues in the post along with the release of OSE V3.</li> <li>As soon as folks brought this up in Q+A, we responded within 12 hours. We'd like to be even faster, but wanted to make sure we looped in the right people - you can see the SEOmoz Twitter account getting on top of this within the first few tweets after we launched this index.</li> <li>If you ever feel like you're not getting a response fast enough or something isn't TAGFEE, please feel free to email me directly - rand@seomoz.org. Most of the time, I'm very fast on email (24 hours or less). I'm really sorry to hear that you felt ignored for 2 months! That's awful, and shouldn't ever happen.</li> <li>Re: 75% of the data unworkable... I'm not sure where that number comes from, but I don't think it's accurate. 100% of the data that's in Linkscape is accurate (as far as links, linking root domains, nofollows, anchor text, etc). Some of it is not as fresh as we'd like (anchor text was ~1 month old until this update) and we're not getting all the data the engines are, but what's there is correct, and the metrics are among the best you can find for predicting rankings, doing competitive comparisons, etc.</li> </ul> If you have any specific questions or issues, please feel free to email me direct - happy to follow up more. It sounds like you might be perceiving or feeling even more problems than the ones of scale/reach I described (and Kate noted) above. Cheers!
 Cancel
 - Paul Martin
 
 2011-09-20T05:54:37-07:00
 
 Hey Rand,
 
 Oh no, it’s not that we were being ignored, we were in fact getting responses from both Twitter and your helpdesk immediately. It was more that no-one was answering our questions! The cynic in me was sensing a reluctance to discuss or even acknowledge the issue at all.
 
 Your last bullet pointed statement here however leads me to believe that we still are not on the same page and that you are not picking up on the same issues we are.
 
 I’ll admit the 75% figure was plucked out of thin air for illustrative purposes; however I would expect it to be wholly accurate, if not understated.
 
 I will pull together an email for you that includes some screen grabs (of which we have provided for SEOmoz previously) so you can see exactly what we’re seeing and hopefully we can finally iron it out!
 
 Thanks,
 
 Paul
 
 2 0
 
 Hey Rand, Oh no, it’s not that we were being ignored, we were in fact getting responses from both Twitter and your helpdesk immediately. It was more that no-one was answering our questions! The cynic in me was sensing a reluctance to discuss or even acknowledge the issue at all. Your last bullet pointed statement here however leads me to believe that we still are not on the same page and that you are not picking up on the same issues we are. I’ll admit the 75% figure was plucked out of thin air for illustrative purposes; however I would expect it to be wholly accurate, if not understated. I will pull together an email for you that includes some screen grabs (of which we have provided for SEOmoz previously) so you can see exactly what we’re seeing and hopefully we can finally iron it out! Thanks, Paul
 Cancel
Mitch Monsen

2011-09-19T09:56:15-07:00

I'd love to know the nitty gritty behind HOW your OSE crawler works... What it's built on, how long it takes to crawl, the hardware/bandwidth requirements, why Google doesn't nuke you for it, and all of that stuff. I'm almost positive I don't have the chops to make a full one myself, but it would be interesting to see how it all works.

1 0

I'd love to know the nitty gritty behind HOW your OSE crawler works... What it's built on, how long it takes to crawl, the hardware/bandwidth requirements, why Google doesn't nuke you for it, and all of that stuff. I'm almost positive I don't have the chops to make a full one myself, but it would be interesting to see how it all works.
Cancel
Gyorgy Bolla

2011-09-18T01:53:32-07:00

Thanks for the updates, it's good hear the detailed information about the indexing issues. It's better to be informed rather than surprised. ;-)

1 0

Thanks for the updates, it's good hear the detailed information about the indexing issues. It's better to be informed rather than surprised. ;-)
Cancel
William Craig

2011-09-18T14:39:59-07:00

Very interesting to see that only 2.22% of the links are no-followed. I thought it would have been slightly higher than that.

1 0

Very interesting to see that only 2.22% of the links are no-followed. I thought it would have been slightly higher than that.
Cancel
Matt Beswick

2011-09-24T09:53:19-07:00

As difficult as this is, thanks for being so open. SEOMoz is just one source of data and, although SEO campaigns are going to be based on this, you'd be mad to use it as your only source. Good luck getting things fixed! :)

1 0

As difficult as this is, thanks for being so open. SEOMoz is just one source of data and, although SEO campaigns are going to be based on this, you'd be mad to use it as your only source. Good luck getting things fixed! :)
Cancel
LocalSeoService

2011-09-20T03:23:51-07:00

Domain diversity and overall size has completely inverse relationship..We can easily check it by the data...Thanks for producing such important data.

1 0

Domain diversity and overall size has completely inverse relationship..We can easily check it by the data...Thanks for producing such important data.
Cancel
yudz

2011-09-18T09:43:06-07:00

I think mergering with MajesticSEO is a good idea for building web index. lol

They seems have great index but still i am in love with SEOmoz because of your link metric.

btw, i have login problem with mozbar firefox 6.0.2, already adress this issue via email.

and, i noticed several link metric drop on my websites especially on PA and DA. Is this because of that change?

yudz edited 2011-09-18T09:47:48-07:00
1 0

I think mergering with MajesticSEO is a good idea for building web index. lol They seems have great index but still i am in love with SEOmoz because of your link metric. btw, i have login problem with mozbar firefox 6.0.2, already adress this issue via email. and, i noticed several link metric drop on my websites especially on PA and DA. Is this because of that change?
Cancel
- Rand Fishkin
 
 2011-09-18T10:05:20-07:00
 
 Majestic has built a very large index, thanks to their process - they actually do things very differently than us or the engines, but it's impressive to see their size and reach. We've talked on occasion w/ the Majestic folks (who are great people, BTW), but for now, we're going to try slogging out the next 6 months and see if we can maintain the quality, metrics, canonicalization and index structure we prefer AND reach the broad size of the engines.
 
 Re: link metrics drops; yeah, these are likely related to the link counts from dropping binary files as well as the shift in index focus (growing diversity and sacrificing some raw size). As I mentioned in the post, if you're an outlier (i.e. everyone else in your sphere - competitors/etc - has stayed at similar metrics but you've dropped or risen substantially) please send us feedback so we can check it out.
 
 Thanks!
 
 2 0
 
 Majestic has built a very large index, thanks to their process - they actually do things very differently than us or the engines, but it's impressive to see their size and reach. We've talked on occasion w/ the Majestic folks (who are great people, BTW), but for now, we're going to try slogging out the next 6 months and see if we can maintain the quality, metrics, canonicalization and index structure we prefer AND reach the broad size of the engines. Re: link metrics drops; yeah, these are likely related to the link counts from dropping binary files as well as the shift in index focus (growing diversity and sacrificing some raw size). As I mentioned in the post, if you're an outlier (i.e. everyone else in your sphere - competitors/etc - has stayed at similar metrics but you've dropped or risen substantially) please send us feedback so we can check it out. Thanks!
 Cancel
 - yudz
 
 2011-09-18T11:10:54-07:00
 
 my competitors's link metric also drop a little bit, not as much as what happened to my website.
 
 1 0
 
 my competitors's link metric also drop a little bit, not as much as what happened to my website.
 Cancel
 - yudz
 
 2011-09-18T11:23:14-07:00
 
 But seems that i doesnt correlate with my ranking.
 
 i got more long tail keyword from organic search and my ranking remain the same and some other goes higher.
 
 btw, thanks rand :D
 
 Yudhis
 
 yudz edited 2011-09-18T11:23:41-07:00
 1 0
 
 But seems that i doesnt correlate with my ranking. i got more long tail keyword from organic search and my ranking remain the same and some other goes higher. btw, thanks rand :D Yudhis
 Cancel
- Pashmina Lalchandani
 
 2011-10-04T20:52:34-07:00
 
 One has the index. The other has the great link valuation algorithm. Who is going to win the race? And more importantly is anyone going to win the race before the value of paying attention to Total Links gets diminished?
 
 1 0
 
 One has the index. The other has the great link valuation algorithm. Who is going to win the race? And more importantly is anyone going to win the race before the value of paying attention to Total Links gets diminished?
 Cancel
irldonalb

2011-09-19T04:22:23-07:00

Hi Rand,

I think Alex @ MajesticSEO would be interested in helping you if you come across major issues.

Thanks.

1 2

Hi Rand, I think Alex @ MajesticSEO would be interested in helping you if you come across major issues. Thanks.
Cancel
Piotr Cichosz

2011-09-17T23:26:05-07:00

Amaizing data!

Good to know that people started using nofollow and canonical.

1 3

Amaizing data! Good to know that people started using nofollow and canonical.
Cancel

Post Analytics

Comments 31

Log in to Moz

Don't have an account?