7 Reasons Why Search Engines Don't Return Relevant Results 100% of the Time

Comments 58

Please keep your comments TAGFEE by following the community etiquette.

E-mail me when new comments are posted

Sort by:

Comments are closed on posts more than 30 days old. Got a burning question? Head to our Q&A section to start a new conversation.

Lori Bourne

2007-07-20T19:10:44-07:00

This is a great post, Hamlet. When I first read it on Youmoz, I thought, "This deserves to be promoted!" Glad it was.

Some thoughts I had about this subject:

The English language is deeper and more complicated than most other languages. We have more synomyms, homonyms, and related terms than other languages. It's also much more elastic than other languages, which means that new words are created and assimilated to a greater degree than other languages.

It's often said that English is the best language for writing (especially poetry) because the incredibly rich vocabulary allows for varied word choice that gives depth and nuance that other languages might not.

Funny how these attributes - which make English such an expressive language - account for most of the problems you listed. In other words, search engines are fighting an uphill battle because of the very things that make English (and language in general) so wonderful. (I'm sure search isn't perfect in other countries either; it's just probably even more complicated in English).

I wonder though, as someone did earlier: as the search engines grow and change to better interpret language, will people do the same? Will the average person someday understand that if they want to search for a forest, it's better not to use "wood"? Will humans contribute to the search engine problem and meet Google halfway? We shouldn't expect them to do all the work, after all...

lorisa edited 2007-07-20T19:18:31-07:00
3 0

This is a great post, Hamlet. When I first read it on Youmoz, I thought, "This deserves to be promoted!" Glad it was. Some thoughts I had about this subject: The English language is deeper and more complicated than most other languages. We have more synomyms, homonyms, and related terms than other languages. It's also much more elastic than other languages, which means that new words are created and assimilated to a greater degree than other languages. It's often said that English is the best language for writing (especially poetry) because the incredibly rich vocabulary allows for varied word choice that gives depth and nuance that other languages might not. Funny how these attributes - which make English such an expressive language - account for most of the problems you listed. In other words, search engines are fighting an uphill battle because of the very things that make English (and language in general) so wonderful. (I'm sure search isn't perfect in other countries either; it's just probably even more complicated in English). I wonder though, as someone did earlier: as the search engines grow and change to better interpret language, will people do the same? Will the average person someday understand that if they want to search for a forest, it's better not to use "wood"? Will humans contribute to the search engine problem and meet Google halfway? We shouldn't expect them to do all the work, after all... 
Cancel
- Hamlet Batista
 
 2007-07-21T02:42:48-07:00
 
 lorisa - thanks for your kind words. I felt like I was reading poetry. Was that on purpose? ;-)
 
 Spanish is my first language and many things you said about English are also true for Spanish. I guess the same could be said about other languages. I have to agree with you that English is definitely better in the sense that you can say more with fewer words (which is something that I am still trying to learn as an ESL writer).
 
 Will the average person someday understand that if they want to search for a forest, it's better not to use "wood"? Will humans contribute to the search engine problem and meet Google halfway? We shouldn't expect them to do all the work, after all...
 
 I wouldn't rule out that possibility.
 
 2 0
 
 lorisa - thanks for your kind words. I felt like I was reading poetry. Was that on purpose? ;-) Spanish is my first language and many things you said about English are also true for Spanish. I guess the same could be said about other languages. I have to agree with you that English is definitely better in the sense that you can say more with fewer words (which is something that I am still trying to learn as an ESL writer). <blockquote>Will the average person someday understand that if they want to search for a forest, it's better not to use "wood"? Will humans contribute to the search engine problem and meet Google halfway? We shouldn't expect them to do all the work, after all...</blockquote> I wouldn't rule out that possibility. 
 Cancel
 - Lori Bourne
 
 2007-07-21T17:57:47-07:00
 
 Thanks for the compliment, Hamlet! You're a real prince!*
 
 It seems to me that search is contingent on three things: the search engine itself, the searcher, and the language used for search. Search engines can refine themselves for the next century, but I don't think they'll ever be able to completely predict the other two parts of the equation.
 
 *I've wanted to say that for awhile...I love Shakespeare, and your namesake is my favorite play =)
 
 1 0
 
 Thanks for the compliment, Hamlet! You're a real prince!* It seems to me that search is contingent on three things: the search engine itself, the searcher, and the language used for search. Search engines can refine themselves for the next century, but I don't think they'll ever be able to completely predict the other two parts of the equation. *I've wanted to say that for awhile...I love Shakespeare, and your namesake is my favorite play =) 
 Cancel
 - Hamlet Batista
 
 2007-07-22T09:37:33-07:00
 
 *I've wanted to say that for awhile...I love Shakespeare, and your namesake is my favorite play =)
 
 Thanks for the compliment too! I am ashamed to admit that I have not read the book yet :-(. I guess it is about time!
 
 I don't think they'll ever be able to completely predict the other two parts of the equation.
 
 That is why the Quality raters have become a very important part of modern search engines.
 
 I would not be surprised if the number of quality raters increased an order of magnitude (from 10,000 to 100,000) in the next few years.
 
 2 0
 
 <blockquote>*I've wanted to say that for awhile...I love Shakespeare, and your namesake is my favorite play =)</blockquote> Thanks for the compliment too! I am ashamed to admit that I have not read the book yet :-(. I guess it is about time! <blockquote>I don't think they'll ever be able to completely predict the other two parts of the equation.</blockquote> That is why the Quality raters have become a very important part of modern search engines. I would not be surprised if the number of quality raters increased an order of magnitude (from 10,000 to 100,000) in the next few years. 
 Cancel
 - Lori Bourne
 
 2007-07-22T11:47:16-07:00
 
 Oh, you must read it! But, as with all of Shakespeare's plays, it's better seen than read.
 
 Good to know that the human component can't ever be taken out of the search equation completely!
 
 lorisa edited 2007-07-22T11:47:38-07:00
 2 0
 
 Oh, you must read it! But, as with all of Shakespeare's plays, it's better seen than read. Good to know that the human component can't ever be taken out of the search equation completely! 
 Cancel
 
 Hamlet Batista
 
 2007-07-23T17:44:10-07:00
 
 I followed your advice and pre-ordered the two-disc DVD. I'm going to be or not to be watching it very soon :-)
 
 1 0
 
 I followed your advice and pre-ordered the two-disc DVD. I'm going to be or not to be watching it very soon :-)
 Cancel
Peter Newsome

2007-07-16T22:35:25-07:00

This post was timed really well - only yesterday a friend asked me why the search results they want are usually towards the bottom-end of the top 10 or in some cases buried much deeper in the serps.

I pretty-much explained the points you made, although there were a couple I missed so I'll forward them on...

For more commonly searched-for terms, the results often aren't too bad (and are improved further if you use Distilled's wiki removal plug-in)... but for more niche terms you can end-up having to wade-through a lot of long-tail, affiliate sites, random blogs etc. to find what you're after.

In the end, no search engine will ever be perfect (or at least not for a very long time) so it's great that objective posts like this pop-up from time-to-time to remind us of this.

Nice article Hamlet

2 0

This post was timed really well - only yesterday a friend asked me why the search results they want are usually towards the bottom-end of the top 10 or in some cases buried much deeper in the serps. I pretty-much explained the points you made, although there were a couple I missed so I'll forward them on... For more commonly searched-for terms, the results often aren't too bad (and are improved further if you use Distilled's wiki removal plug-in)... but for more niche terms you can end-up having to wade-through a lot of long-tail, affiliate sites, random blogs etc. to find what you're after. In the end, no search engine will ever be perfect (or at least not for a very long time) so it's great that objective posts like this pop-up from time-to-time to remind us of this. Nice article Hamlet 
Cancel
- Hamlet Batista
 
 2007-07-17T04:58:19-07:00
 
 SiteMost,
 
 Thanks, I am glad you enjoyed the post.
 
 Google doesn't agree with you. They want to build the perfect search engine by 2012.
 
 1 0
 
 SiteMost, Thanks, I am glad you enjoyed the post. Google doesn't agree with you. <a href="https://www.latimes.com/news/opinion/la-oe-keen12jul12,0,7307803.story?coll=la-home-commentary" rel="nofollow">They want to build the perfect search engine by 2012</a>. 
 Cancel
 - Peter Newsome
 
 2007-07-17T14:55:52-07:00
 
 Meh - who really wants Google to agree with them? If Google agreed with everything I said I'd be out of a job...
 
 If they do make the perfect search engine by 2012, I won't be complaining... although it reminds me of old movies and news clippings that would look into the future - by now we should be wearing silver Lycra body-suits and going to work in flying cars...
 
 A perfect search engine would be great - but I'd rather a jet pack.
 
 1 0
 
 Meh - who really wants Google to agree with them? If Google agreed with everything I said I'd be out of a job... If they do make the perfect search engine by 2012, I won't be complaining... although it reminds me of old movies and news clippings that would look into the future - by now we should be wearing silver Lycra body-suits and going to work in flying cars... A perfect search engine would be great - but I'd rather a jet pack. 
 Cancel
Rand Fishkin

2007-07-19T13:59:00-07:00

Hamlet - excellent post - just a quick piece - Dr. Garcia (an IR researcher in California) blogged about his disagreement with the mention of LSI in this piece.

1 0

Hamlet - excellent post - just a quick piece - Dr. Garcia (an IR researcher in California) blogged about <a href="https://irthoughts.wordpress.com/2007/07/19/seos-and-still-their-lsi-misconceptions/" rel="nofollow">his disagreement with the mention of LSI</a> in this piece.
Cancel
- Nick Gerner
 
 2007-07-19T14:23:35-07:00
 
 I apologize most deeply and humbly for any confusion I might have inadvertently helped to propagate.
 
 Reading Dr. Garcia's post, the disagreement seems to be the association between LSI and synonymy. Take a look at his tutorial on LSI, especially the section "Another SEO Myth..."
 
 If I'm reading this tutorial correctly (and please, by all means cut off my head that I stuck out earlier!), he sums his argument up by saying, "synomyns are a special class of tokens that do not tend to occur together, but tend to co-occur in similar contexts (neighboring terms)[...] The reverse is not necessarily true; not all terms with a second-order co-occurrence relationship are synonyms."
 
 So when I said "synonyms" earlier I was using the term incorrectly. I was trying (unsuccessfully!) to get at the idea that if you're looking for information on "primates", documents which never mention the term "primate", but do mention "chimpanzee", "ape", and "monkey" could be very useful. These terms are not synonyms for "primate".
 
 I've submitted such a correction to Dr. Garcia, and I shall report back to you gracious Mozzers if my correction is, in fact, correct.
 
 3 0
 
 I apologize most deeply and humbly for any confusion I might have inadvertently helped to propagate. Reading Dr. Garcia's post, the disagreement seems to be the association between LSI and synonymy. Take a look at his <a href="https://www.miislita.com/information-retrieval-tutorial/svd-lsi-tutorial-5-lsi-keyword-research-co-occurrence.html#seo-myth" rel="nofollow">tutorial on LSI</a>, especially the section "Another SEO Myth..." If I'm reading this tutorial correctly (and please, by all means cut off my head that I stuck out earlier!), he sums his argument up by saying, "synomyns are a special class of tokens that do not tend to occur together, but tend to co-occur in similar contexts (neighboring terms)[...] The reverse is not necessarily true; not all terms with a second-order co-occurrence relationship are synonyms." So when I said "synonyms" earlier I was using the term incorrectly. I was trying (unsuccessfully!) to get at the idea that if you're looking for information on "primates", documents which never mention the term "primate", but do mention "chimpanzee", "ape", and "monkey" could be very useful. These terms are not synonyms for "primate". I've submitted such a correction to Dr. Garcia, and I shall report back to you gracious Mozzers if my correction is, in fact, correct.
 Cancel
 - Nick Gerner
 
 2007-07-20T06:24:18-07:00
 
 After having sync'd up with Dr. Garcia here's the lowdown.
 
 The problem with my above comment is that I said "synonym" when I should have said "related terms, not necessarily synonyms". Moreover, by saying "synonym" I was helping to perpetuate the "Synonym Fallacy", under which people incorrectly conclude that LSI clusters terms because they are synonyms. This is not the case.
 
 Moral of the story: use language precisely or else people will draw the wrong conclusions about what you say.
 
 edited 2007-07-20T06:37:40-07:00
 2 0
 
 After having sync'd up with Dr. Garcia here's the lowdown. The problem with my above comment is that I said "synonym" when I should have said "related terms, not necessarily synonyms". Moreover, by saying "synonym" I was helping to perpetuate the "Synonym Fallacy", under which people incorrectly conclude that LSI clusters terms because they are synonyms. This is not the case. Moral of the story: use language precisely or else people will draw the wrong conclusions about what you say.
 Cancel
 - Hamlet Batista
 
 2007-07-20T13:19:04-07:00
 
 Nick - thanks for making it clearer.
 
 1 0
 
 Nick - thanks for making it clearer.
 Cancel
- Hamlet Batista
 
 2007-07-19T14:43:07-07:00
 
 Rand - Thanks for promoting my post. That was unexpected.
 
 Please note that as Nick acknowledges, I did not mention LSI on my post and I trusted Nick's assertions --he is the one with formal IR education.
 
 I am learning IR by reading books (I'm not done with the LSI one yet), so I have to respect and accept what Dr. Garcia and the IR experts say.
 
 I try to research carefully what I write to avoid spreading myths. I apologize for any confusion.
 
 BTW: Two out of three of my posts have been promoted. For an ESL blogger that is not bad. I think others ESL readers should be encouraged to post :-)
 
 A sharp editor helps a lot ;-)
 
 HamletBatista edited 2007-07-19T15:22:18-07:00
 1 0
 
 Rand - Thanks for promoting my post. That was unexpected. Please note that as Nick acknowledges, I did not mention LSI on my post and I trusted Nick's assertions --he is the one with formal IR education. I am learning IR by reading books (I'm not done with the LSI one yet), so I have to respect and accept what Dr. Garcia and the IR experts say. I try to research carefully what I write to avoid spreading myths. I apologize for any confusion. BTW: Two out of three of my posts have been promoted. For an ESL blogger that is not bad. I think others ESL readers should be encouraged to post :-) A sharp editor helps a lot ;-)
 Cancel
 - Rand Fishkin
 
 2007-07-19T15:22:37-07:00
 
 Hamlet - you've got Rebecca to thank here, not me. She's our official YOUmoz editor, promoter, etc. :)
 
 1 0
 
 Hamlet - you've got Rebecca to thank here, not me. She's our official YOUmoz editor, promoter, etc. :)
 Cancel
 - Hamlet Batista
 
 2007-07-19T15:58:54-07:00
 
 Gracias Rebecca!
 
 You never said if you liked the monkey pictures :-)
 
 1 0
 
 Gracias Rebecca! You never said if you liked the monkey pictures :-)
 Cancel
Brian Horn

2007-07-19T13:11:11-07:00

Very good points. Particularly polysemy. I've not run into it myself, but it is a great point.

I have had the issue with "automobile" and "car" and had to come up with multiple version of the content (still very different from each other) .

1 0

Very good points. Particularly polysemy. I've not run into it myself, but it is a great point. I have had the issue with "automobile" and "car" and had to come up with multiple version of the content (still very different from each other) .
Cancel
- Hamlet Batista
 
 2007-07-19T15:18:53-07:00
 
 Thanks, Brian. Search engines still face significant problems. We should have food on our tables for a while ;-)
 
 1 0
 
 Thanks, Brian. Search engines still face significant problems. We should have food on our tables for a while ;-)
 Cancel
Oatmeal

2007-07-19T12:54:42-07:00

I'm glad this was promoted, this is is an excellent post

1 0

I'm glad this was promoted, this is is an excellent post
Cancel
- Hamlet Batista
 
 2007-07-19T15:14:53-07:00
 
 thanks, Matt
 
 1 0
 
 thanks, Matt
 Cancel
Campy

2007-07-20T00:43:51-07:00

Nice post Hamlet. Especially the short section about spam. Really nice way of explaining it.

Also, thanks for the primate diagram and introducing me to the word "disambiguate". :-)

1 0

Nice post Hamlet. Especially the short section about spam. Really nice way of explaining it. Also, thanks for the primate diagram and introducing me to the word "disambiguate". :-)
Cancel
- Hamlet Batista
 
 2007-07-20T05:15:00-07:00
 
 Campy - Glad you liked it
 
 1 0
 
 Campy - Glad you liked it
 Cancel
Kimber Scott

2007-07-19T12:47:21-07:00

gotta love the monkey references!

1 0

gotta love the monkey references! 
Cancel
- Hamlet Batista
 
 2007-07-19T14:56:48-07:00
 
 Yes. I used the monkeys to make the post more entertaining :-)
 
 1 0
 
 Yes. I used the monkeys to make the post more entertaining :-)
 Cancel
podcomplex

2007-07-24T20:07:17-07:00

Good post Hamlet - also, the number (and quality) of comments provide almost as much value as the original article. I still find it quite remarkable that about 20% of queries at Google are being seen by them for the first time - but given the complexity of language, and the innate creativity and unpredictability of the human mind, there is likely to always be a significant percentage of queries that fall into this category.

1 0

Good post Hamlet - also, the number (and quality) of comments provide almost as much value as the original article. I still find it quite remarkable that about 20% of queries at Google are being seen by them for the first time - but given the complexity of language, and the innate creativity and unpredictability of the human mind, there is likely to always be a significant percentage of queries that fall into this category.
Cancel
- Hamlet Batista
 
 2007-07-25T02:32:27-07:00
 
 Thanks, podcomplex.
 
 given the complexity of language, and the innate creativity and unpredictability of the human mind, there is likely to always be a significant percentage of queries that fall into this category.
 
 I agree
 
 1 0
 
 Thanks, podcomplex. <blockquote>given the complexity of language, and the innate creativity and unpredictability of the human mind, there is likely to always be a significant percentage of queries that fall into this category.</blockquote> I agree 
 Cancel
KailaColbin

2007-07-22T23:09:02-07:00

Hi Hamlet,

Thanks for such a thought-provoking post; I used it as the basis for a post myself today. Your reader who comments about timeliness and freshness had an excellent point. I imagine that search engines face still more challenges we haven't discussed.

I also appreciate what you say about many of the problems coming from the searchers themselves rather than incomplete algorithms.

Search engines deserve a round of applause for accepting that it is unlikely for users to become universally educated in a consistent search syntax, and getting on with the business of attempting to disambiguate cryptic and minimalist queries.

Looking forward to your next post!

1 0

Hi Hamlet, Thanks for such a thought-provoking post; I used it as the basis for a post myself today. Your reader who comments about timeliness and freshness had an excellent point. I imagine that search engines face still more challenges we haven't discussed. I also appreciate what you say about many of the problems coming from the searchers themselves rather than incomplete algorithms. Search engines deserve a round of applause for accepting that it is unlikely for users to become universally educated in a consistent search syntax, and getting on with the business of attempting to disambiguate cryptic and minimalist queries. Looking forward to your next post!
Cancel
- Hamlet Batista
 
 2007-07-23T17:36:12-07:00
 
 I imagine that search engines face still more challenges we haven't discussed.
 
 ...
 
 Search engines deserve a round of applause for accepting that it is unlikely for users to become universally educated in a consistent search syntax, and getting on with the business of attempting to disambiguate cryptic and minimalist queries.
 
 I agree.
 
 Looking forward to your next post!
 
 Thanks, I'll try to make the next one even better.
 
 1 0
 
 <blockquote>I imagine that search engines face still more challenges we haven't discussed.</blockquote> <blockquote>...</blockquote> <blockquote>Search engines deserve a round of applause for accepting that it is unlikely for users to become universally educated in a consistent search syntax, and getting on with the business of attempting to disambiguate cryptic and minimalist queries. </blockquote> I agree. <blockquote>Looking forward to your next post!</blockquote> Thanks, I'll try to make the next one even better.
 Cancel
barry mitchell

2008-11-01T16:14:04-07:00

hi all - i am currently building a localised search engine for Dundee in Scotland. I am working on how the search terms will be relevant to the viewers search terms. I am just now resticted to the company name, address and the search term that i enter into the search column relating to the company. i have around 3k results just now but i am building the list hopfull up to around 20 k in the next month. my mini search engine is for dundee will be for people looking for businesses in dundee and i am trying to make the search rsults more relevant to the terms the people are typing. i am using sql tersm just now but sometimes the things that people get shon have nothing to do with the terms they type in simply because the data base is returning what it thinks is best accoring to the beasi nfo it has to work with .. this is pretty crappy just now but i have noticed that when people are using my search engine they are still getting mostly 50% of the results they would like... my next step is to make things more in tune with what people want. i am thinking about writing a scritp that will analize peoples web site to see if they have the keyphrases the viewer is tpying and returning results according to the content of their sites and shuving the peoples results who have no website to show to the bottom of the listings ... this in turn will give the people with webites an advantage over the listings that dont .. this might be good or bad .. but i would say that the companies in my are that have a website that can be analised by me and graded according to the content is better than a company that has no site at all.. time will tell

1 0

hi all - i am currently building a localised search engine for Dundee in Scotland. I am working on how the search terms will be relevant to the viewers search terms. I am just now resticted to the company name, address and the search term that i enter into the search column relating to the company. i have around 3k results just now but i am building the list hopfull up to around 20 k in the next month. my mini search engine is for dundee will be for people looking for businesses in dundee and i am trying to make the search rsults more relevant to the terms the people are typing. i am using sql tersm just now but sometimes the things that people get shon have nothing to do with the terms they type in simply because the data base is returning what it thinks is best accoring to the beasi nfo it has to work with .. this is pretty crappy just now but i have noticed that when people are using my search engine they are still getting mostly 50% of the results they would like... my next step is to make things more in tune with what people want. i am thinking about writing a scritp that will analize peoples web site to see if they have the keyphrases the viewer is tpying and returning results according to the content of their sites and shuving the peoples results who have no website to show to the bottom of the listings ... this in turn will give the people with webites an advantage over the listings that dont .. this might be good or bad .. but i would say that the companies in my are that have a website that can be analised by me and graded according to the content is better than a company that has no site at all.. time will tell
Cancel
Natewood

2007-07-20T04:01:02-07:00

I think there's an 8th here Hamlet: Timeliness and Freshness.

Trends in query use and meaning rise and fall all the time. What was perfectly relevant yesterday for a query can be absolutely irrelevant tomorrow due to a major event or change in the word meaning. But once that event is no longer "newsworthy" the original content can then become the more relevant content for a query.

Example: London Tube - in the events leading up to the tube bombings of a couple of years ago, the content you'd expect to see should be tube maps and timetables. Immediately after the event and for a few months afterwards the content should have been about the bombings. But then it reverts back again.

This is a fairly obvious example, but as an ex relevancy manager and spam cleaner for a search engine, I can say that there are much more subtle queries for which this switching in meaning is not so clear cut, and sometimes it's very difficult to determine the predominant meaning and react as quickly as that meaning changes in search queries.

Natewood edited 2007-07-20T04:11:37-07:00
1 0

I think there's an 8th here Hamlet: Timeliness and Freshness. Trends in query use and meaning rise and fall all the time. What was perfectly relevant yesterday for a query can be absolutely irrelevant tomorrow due to a major event or change in the word meaning. But once that event is no longer "newsworthy" the original content can then become the more relevant content for a query. Example: London Tube - in the events leading up to the tube bombings of a couple of years ago, the content you'd expect to see should be tube maps and timetables. Immediately after the event and for a few months afterwards the content should have been about the bombings. But then it reverts back again. This is a fairly obvious example, but as an ex relevancy manager and spam cleaner for a search engine, I can say that there are much more subtle queries for which this switching in meaning is not so clear cut, and sometimes it's very difficult to determine the predominant meaning and react as quickly as that meaning changes in search queries.
Cancel
- Hamlet Batista
 
 2007-07-20T05:24:44-07:00
 
 Natewood - That is an excellent addition.
 
 Google made that problem public on the New York Times article: Google Keeps Tweaking Its Search Engine
 
 They say they are addressing the problem with something they call QDF (Query Deserves Freshness)
 
 What I really like about the comments so far is that SEOs and search engine experts agree that search engines are still far from producing 100% relevant results.
 
 Do you think they will ever do?
 
 1 0
 
 Natewood - That is an excellent addition. Google made that problem public on the New York Times article: <a href="https://www.nytimes.com/2007/06/03/business/yourmoney/03google.html" rel="nofollow">Google Keeps Tweaking Its Search Engine</a> They say they are addressing the problem with something they call QDF (Query Deserves Freshness) What I really like about the comments so far is that SEOs and search engine experts agree that search engines are still far from producing 100% relevant results. Do you think they will ever do?
 Cancel
Igor Klajo

2007-07-18T04:19:08-07:00

Sometimes I spend more time to click throught all the links provided by the search engines than reading the actual informaiton I'm looking for which is frustrating. This happens only if I don't know for what to look, what keywords to use to get the desired results.

1 0

Sometimes I spend more time to click throught all the links provided by the search engines than reading the actual informaiton I'm looking for which is frustrating. This happens only if I don't know for what to look, what keywords to use to get the desired results.
Cancel
- Amit Bhawani
 
 2007-07-18T04:22:12-07:00
 
 May be you should then try using google suggest , though a old feature now, which can save your time when you are confused about the search keywords.
 
 https://www.google.com/webhp?complete=1&hl=en
 
 Other way you are giving all those websites extra referrars ;)
 
 1 0
 
 May be you should then try using google suggest , though a old feature now, which can save your time when you are confused about the search keywords. https://www.google.com/webhp?complete=1&hl=en Other way you are giving all those websites extra referrars ;) 
 Cancel
 - Igor Klajo
 
 2007-07-18T05:13:53-07:00
 
 Thanks . . . I'll keep that in mind for my upcoming researches where I'll be confused :D
 
 The referrals . . . I saw a very funny referral from ask.com the other day. Somebody entered "what was the music playlists in australia 2000" into ask.com and for ths my website's result number one, but for what . . . for mentioning the word "Australia" and "Music" and since this website is about playlists and theword playlist is in the website title I got hit, well I guess that's why my site's listed first for this request.
 
 I would say that ask.com didn't understand the question, like i didn't cos there's something missing. For what or from what was this playlist the person was looking for? I don't know, but I guess it was the Essential mix playlist from New Years eve at Bondi Beach, Sydney Australia, but this is what I would have searched for since I don't know any other music playlist for that time from down under. Anyway . . . this search shows somewow how search will show what they know and if the user doesn't know how to aks, then the result won't be that great aswell, but like I said in my previous post, thngs like that happen to me aswell, so we need to be specific.
 
 1 0
 
 Thanks . . . I'll keep that in mind for my upcoming researches where I'll be confused :D The referrals . . . I saw a very funny referral from ask.com the other day. Somebody entered "what was the music playlists in australia 2000" into ask.com and for ths my website's result number one, but for what . . . for mentioning the word "Australia" and "Music" and since this website is about playlists and theword playlist is in the website title I got hit, well I guess that's why my site's listed first for this request. I would say that ask.com didn't understand the question, like i didn't cos there's something missing. For what or from what was this playlist the person was looking for? I don't know, but I guess it was the Essential mix playlist from New Years eve at Bondi Beach, Sydney Australia, but this is what I would have searched for since I don't know any other music playlist for that time from down under. Anyway . . . this search shows somewow how search will show what they know and if the user doesn't know how to aks, then the result won't be that great aswell, but like I said in my previous post, thngs like that happen to me aswell, so we need to be specific. 
 Cancel
 - Amit Bhawani
 
 2007-07-18T08:46:25-07:00
 
 Thats just not with ask.com, google also shows results based on keyword combinations in a website when a long query is made instead of single keywords.
 
 This is the reason blogs get more traffic because of the combination of keywords from the blog posts and categories based keywords.
 
 1 0
 
 Thats just not with ask.com, google also shows results based on keyword combinations in a website when a long query is made instead of single keywords. This is the reason blogs get more traffic because of the combination of keywords from the blog posts and categories based keywords. 
 Cancel
Andrew Miller

2007-07-17T07:08:25-07:00

Thanks for the great points. You may find a book called "Ambient Findability" by Peter Morville interesting. After reading it you may find yourself convinced that search engines and search as we know it are only in the earliest stages of development and well on our way to "complete navigability".

Some sections are a little dry but all search engine marketers can benefit from knowing a little "information retrieval theory".

1 0

Thanks for the great points. You may find a book called "<a href="https://www.amazon.com/Ambient-Findability-What-Changes-Become/dp/0596007655/ref=pd_bbs_sr_1/002-9007845-6785662?ie=UTF8&s=books&qid=1184681593&sr=8-1" rel="nofollow">Ambient Findability</a>" by Peter Morville interesting. After reading it you may find yourself convinced that search engines and search as we know it are only in the earliest stages of development and well on our way to "complete navigability". Some sections are a little dry but all search engine marketers can benefit from knowing a little "information retrieval theory". 
Cancel
- Hamlet Batista
 
 2007-07-17T08:30:17-07:00
 
 acm.miller,
 
 Thanks a lot for the book. Looks like a interesting read. I just bought it.
 
 I am currently reading: Understanding Search Engines, Mathematical Modeling and Text Retrieval (Second Edition) by Michael W. Berry and Murray Browne. The book does an excellent coverage of the main concepts and the math is easy to follow if you have undergraduate math skills. I plan to post a review on my blog.
 
 Many SEOs would wonder why it is important to keep learning about the search engines. I think it is wise to keep up with their advances, if we want to remain comeptitive in the near future. Their tecnology is going to get far more complex.
 
 The good part is that the user is more and more becoming the center. Make your websites primarily for your visitors, without forgetting the search engines, and should win the long battle.
 
 2 0
 
 acm.miller, Thanks a lot for the book. Looks like a interesting read. I just bought it. I am currently reading: Understanding Search Engines, Mathematical Modeling and Text Retrieval (Second Edition) by Michael W. Berry and Murray Browne. The book does an excellent coverage of the main concepts and the math is easy to follow if you have undergraduate math skills. I plan to post a review on my blog. Many SEOs would wonder why it is important to keep learning about the search engines. I think it is wise to keep up with their advances, if we want to remain comeptitive in the near future. Their tecnology is going to get far more complex. The good part is that the user is more and more becoming the center. Make your websites primarily for your visitors, without forgetting the search engines, and should win the long battle.
 Cancel
mercutiom

2007-07-17T00:06:50-07:00

Wow, great post, lots of good information here. I find it interesting to note that most of what you're saying isn't about what the serarch engines are doing, but rather what the searcher is failing to do.

Should we really be making changes to how the search engines work? Or how the searcher looks? As you say searching for "primate" has its problems, but if the user has some (limited,basi) knowledge and adds those to the search parameters aren't they going to get beeter results?

Shouldn't we be teaching clients to search better as well as pushing for better results?

1 0

Wow, great post, lots of good information here. I find it interesting to note that most of what you're saying isn't about what the serarch engines are doing, but rather what the searcher is failing to do. Should we really be making changes to how the search engines work? Or how the searcher looks? As you say searching for "primate" has its problems, but if the user has some (limited,basi) knowledge and adds those to the search parameters aren't they going to get beeter results? Shouldn't we be teaching clients to search better as well as pushing for better results? 
Cancel
- Hamlet Batista
 
 2007-07-17T05:09:11-07:00
 
 mercutiom,
 
 Thanks for your comment. You hit the nail on the head. We ARE the main problem search engines face.
 
 Unfortunately it is a bigger task to teach the whole world how to properly use the search engines, than to adapt the search engines to our way of searching. Remember, relational databases are 'perfect' because the DBAs know precisely how to ask the questions. How likely it is that we could train millions to do the same thing?
 
 Google is already working to create the perfect search engine for a reason.
 
 1 0
 
 mercutiom, Thanks for your comment. You hit the nail on the head. We ARE the main problem search engines face. Unfortunately it is a bigger task to teach the whole world how to properly use the search engines, than to adapt the search engines to our way of searching. Remember, relational databases are 'perfect' because the DBAs know precisely how to ask the questions. How likely it is that we could train millions to do the same thing? Google is already working to create <a href="https://www.latimes.com/news/opinion/la-oe-keen12jul12,0,7307803.story?coll=la-home-commentary" rel="nofollow">the perfect search engine</a> for a reason. 
 Cancel
LindaBustos

2007-07-16T19:46:01-07:00

Here in Vancouver it's a real toss up between Tim Horton's and Starbucks, actually I think Starbucks has the edge, ;-)

1 0

Here in Vancouver it's a real toss up between Tim Horton's and Starbucks, actually I think Starbucks has the edge, ;-)
Cancel
- Hamlet Batista
 
 2007-07-17T04:55:43-07:00
 
 Linda,
 
 Glad to hear that. In my last visit to Toronto all I could see was Tim Horton's. No Starbucks in sight :-(
 
 1 0
 
 Linda, Glad to hear that. In my last visit to Toronto all I could see was Tim Horton's. No Starbucks in sight :-( 
 Cancel
Kimber Scott

2007-07-17T11:10:38-07:00

This is an excellent post, Hamlet.

"Many searchers don't know how to express what they want in the real world, and are even worse when attempting to ask a search engine"

So true. I get so frustrated when I see how some of my 'normal' (non-seo) friends search. i used to have races with one friend to see who could find some bit of info first. I'd usually find the answer within the first few results, while they were still sorting through irrelevant pages.

For someone who is an English as a second language blogger, your writing skills are better than a lot of English as a first language bloggers out there. Nice job!

1 0

This is an excellent post, Hamlet. <blockquote>"Many searchers don't know how to express what they want in the real world, and are even worse when attempting to ask a search engine"</blockquote> So true. I get so frustrated when I see how some of my 'normal' (non-seo) friends search. i used to have races with one friend to see who could find some bit of info first. I'd usually find the answer within the first few results, while they were still sorting through irrelevant pages. For someone who is an English as a second language blogger, your writing skills are better than a lot of English as a first language bloggers out there. Nice job!
Cancel
- Hamlet Batista
 
 2007-07-17T11:30:57-07:00
 
 kimber,
 
 Thanks for your kind words.
 
 For someone who is an English as a second language blogger, your writing skills are better than a lot of English as a first language bloggers out there. Nice job!
 
 I have to admit that I cheat. A professional technical writer edits most of my posts ;-) Unfortunately, I write as bad as you read on my comments.
 
 1 0
 
 kimber, Thanks for your kind words. <blockquote>For someone who is an English as a second language blogger, your writing skills are better than a lot of English as a first language bloggers out there. Nice job!</blockquote> I have to admit that I cheat. A professional technical writer edits most of my posts ;-) Unfortunately, I write as bad as you read on my comments.
 Cancel
Markb

2007-07-17T11:43:38-07:00

Great Post Hamlet. This is certainly the case in my situation, and also very frustrating. We do very well in Canada - therefore well in the SERP's as well. But we are pushing international and we're not relevant at all there. Most of our links come from pages from Canada, so were working on a link building campaign internationally.

1 0

Great Post Hamlet. This is certainly the case in my situation, and also very frustrating. We do very well in Canada - therefore well in the SERP's as well. But we are pushing international and we're not relevant at all there. Most of our links come from pages from Canada, so were working on a link building campaign internationally.
Cancel
- Hamlet Batista
 
 2007-07-17T12:54:19-07:00
 
 Thanks, Markb. As you correctly said, the key in your case is to get more links from international websites. I'd also create a separate website (.com instead of .ca) for your international efforts.
 
 1 0
 
 Thanks, Markb. As you correctly said, the key in your case is to get more links from international websites. I'd also create a separate website (.com instead of .ca) for your international efforts.
 Cancel
Amit Bhawani

2007-07-17T22:26:14-07:00

The clear reason is youtube is owned by google > more preferance

Wikipedia has lot of content leached from webmasters > more preferance even though its worth listing in top or not.

1 0

The clear reason is youtube is owned by google > more preferance Wikipedia has lot of content leached from webmasters > more preferance even though its worth listing in top or not. 
Cancel
Bud-Caddell

2007-07-17T14:13:48-07:00

I think some people do make dubious stabs at their search terms -- but I think some may argue that people don't make poor terms, SE's just can't yet handle the robust nature of human language. I hope our expression never becomes homogenized, even if it saves you 30 seconds or 30 minutes..

1 0

I think some people do make dubious stabs at their search terms -- but I think some may argue that people don't make poor terms, SE's just can't yet handle the robust nature of human language. I hope our expression never becomes homogenized, even if it saves you 30 seconds or 30 minutes..
Cancel
- Hamlet Batista
 
 2007-07-17T21:36:58-07:00
 
 I hope our expression never becomes homogenized, even if it saves you 30 seconds or 30 minutes..
 
 Bud,
 
 I hope the same. For the look of things, search engines will keep adapting to the searchers inefficiencies.
 
 1 0
 
 <blockquote> I hope our expression never becomes homogenized, even if it saves you 30 seconds or 30 minutes..</blockquote> Bud, I hope the same. For the look of things, search engines will keep adapting to the searchers inefficiencies.
 Cancel
Nick Gerner

2007-07-17T14:07:29-07:00

A couple of notes from theoretical IR (from many years ago, so Google et. al are doing it in some form):

When a site, any site, gets a request (e.g. a query) it knows the originating IP address. This exposes the country of origin, the owning organization, and even the latitude and longitude (I've seen a demo app from Google that uses this info in a fun way). So you don't need personalized search to get personalized service (whether you want it or not).

Latent Semantic Indexing is a relatively old technique to help with Synonymy. I've seen it (and implemented such systems) put to good use. Essentially it looks for how often pairs of words occur in the same documents. We could argue about why this works, but empirically it does.

1 0

A couple of notes from theoretical IR (from many years ago, so Google et. al are doing it in some form): When a site, any site, gets a request (e.g. a query) it knows the originating IP address. This exposes the country of origin, the owning organization, and even the latitude and longitude (I've seen a demo app from Google that uses this info in a <a href="https://209.85.163.132/papers/sawzall-sciprog.pdf" rel="nofollow">fun way</a>). So you don't need personalized search to get personalized service (whether you want it or not). <a href="https://www3.interscience.wiley.com/cgi-bin/issuetoc?ID=10049584" rel="nofollow">Latent Semantic Indexing</a> is a relatively old technique to help with Synonymy. I've seen it (and implemented such systems) put to good use. Essentially it looks for how often pairs of words occur in the same documents. We could argue about why this works, but empirically it does.
Cancel
- Hamlet Batista
 
 2007-07-17T21:26:07-07:00
 
 Nick,
 
 I am glad to have someone with a IR education contributing. Thanks for your comment.
 
 When a site, any site, gets a request (e.g. a query) it knows the originating IP address. This exposes the country of origin, the owning organization, and even the latitude and longitude (I've seen a demo app from Google that uses this info in a fun way). So you don't need personalized search to get personalized service (whether you want it or not).
 
 You can get a database that provides such information from ip2location.com. I am subscribed to their service as we use it for some of our web applications (primarily for fraud detection). Thanks for the interesting paper, though.
 
 Please note that physical location is only one of the examples I used to illustrate relevance is subjective. By no means I am implying that the problem is impossible to solve.
 
 Latent Semantic Indexing is a relatively old technique to help with Synonymy. I've seen it (and implemented such systems) put to good use. Essentially it looks for how often pairs of words occur in the same documents. We could argue about why this works, but empirically it does.
 
 Excellent! The book I am currently reading is about LSI. I have to agree with you that LSI can help with synonymy and polysemy, but the problem is that LSI is highly unlikely to be in use in the main search engines, due to their large indexes.
 
 In order to support my claim, let me quote a paragraph from the book: Understanding Search Engines: Mathematical Modeling and Text Retrieval by Michael W. Berry and Murray Browne, Chapter 7, page 77:
 
 The most dramatic change in search engine design in the past several years has been developing search engines that account for the Web's hyperlink structure. LSI, with its SVD of a term-by-document matrix, is an approach that works well for smaller document collections but has problems with scalability. The computation and storage of an SVD-based LSI model for the entire Web is not tractable [49].
 [49] A. N. LANGVILLE AND C. D. MEYER, A survey of eigenvector methods for Web information retrieval, SIAM Review, 47 (2005), pp. 135-161.
 
 Here is the PDF of the referenced paper
 
 HamletBatista edited 2007-07-17T21:32:47-07:00
 2 0
 
 Nick, I am glad to have someone with a IR education contributing. Thanks for your comment. <blockquote>When a site, any site, gets a request (e.g. a query) it knows the originating IP address. This exposes the country of origin, the owning organization, and even the latitude and longitude (I've seen a demo app from Google that uses this info in a <a href="https://209.85.163.132/papers/sawzall-sciprog.pdf" rel="nofollow">fun way</a>). So you don't need personalized search to get personalized service (whether you want it or not). </blockquote> You can get a database that provides such information from ip2location.com. I am subscribed to their service as we use it for some of our web applications (primarily for fraud detection). Thanks for the interesting paper, though. Please note that physical location is only one of the examples I used to illustrate relevance is subjective. By no means I am implying that the problem is impossible to solve. <blockquote><a href="https://www3.interscience.wiley.com/cgi-bin/issuetoc?ID=10049584" rel="nofollow">Latent Semantic Indexing</a> is a relatively old technique to help with Synonymy. I've seen it (and implemented such systems) put to good use. Essentially it looks for how often pairs of words occur in the same documents. We could argue about why this works, but empirically it does. </blockquote> Excellent! The book I am currently reading is about LSI. I have to agree with you that LSI can help with synonymy and polysemy, but the problem is that LSI is highly unlikely to be in use in the main search engines, due to their large indexes. In order to support my claim, let me quote a paragraph from the book: Understanding Search Engines: Mathematical Modeling and Text Retrieval by Michael W. Berry and Murray Browne, Chapter 7, page 77: <blockquote>The most dramatic change in search engine design in the past several years has been developing search engines that account for the Web's hyperlink structure. LSI, with its SVD of a term-by-document matrix, is an approach that works well for smaller document collections but has problems with scalability. The computation and storage of an SVD-based LSI model for the entire Web is not tractable [49].[49] A. N. LANGVILLE AND C. D. MEYER, A survey of eigenvector methods for Web information retrieval, SIAM Review, 47 (2005), pp. 135-161.</blockquote> Here is the <a href="https://meyer.math.ncsu.edu/Meyer/PS_Files/Survey.pdf" rel="nofollow">PDF</a> of the referenced paper 
 Cancel
 - Nick Gerner
 
 2007-07-18T10:18:33-07:00
 
 Thanks for the deep follow up :)
 
 I agree that a full SVD is not tractable for the web. However, as in a lot of IR , you could hack the problem, do approximations or sampling, etc. and get some of the advantage. Plus there are other advantages to a dimensionality reduction (lower dimension index = less space, better query performance, etc.). If I worked at Google and they weren't not already doing it, I'd by working on LSI for (parts of) their index as a 20% project.
 
 If you're really into IR theory and LSI, see this paper by Dr. Lillian Lee of Cornell Univserity on Iterative Rescaling for LSI. I saw a dumbed down talk on it, and got about 75% of the info. So the paper will hurt your head.
 
 Also, typically LSI won't help with polysemy (there's a good IR midterm question by the way!). In fact, you could make the argument (/me sticks neck out) that LSI hurts polysemy. However, I think (/me extends neck further) polysemy can be naturally addressed (to some extend) by multi word queries.
 
 For example, see the problem in this Google trends programming languages graph. At least when I view it, one of the key news stories is "pet python strangles man". Polysemy has caused us some problems. But the query I'm personally more likely to try is "python programming". I'd wager this won't bring up too many hits about pet pythons (because of the additional 1-word context). In this case the synonymy between "programming", "development", "web development", etc. is going to be the bigger problem.
 
 2 0
 
 Thanks for the deep follow up :) I agree that a full SVD is not tractable for the web. However, as in a lot of IR , you could hack the problem, do approximations or sampling, etc. and get some of the advantage. Plus there are other advantages to a dimensionality reduction (lower dimension index = less space, better query performance, etc.). If I worked at Google and they weren't not already doing it, I'd by working on LSI for (parts of) their index as a 20% project. If you're really into IR theory and LSI, see this paper by <a href="https://www.cs.cornell.edu/home/llee/" rel="nofollow">Dr. Lillian Lee</a> of Cornell Univserity on <a href="https://www.cs.cornell.edu/home/llee/papers/ando-lee-sigir01.home.html" rel="nofollow">Iterative Rescaling for LSI</a>. I saw a dumbed down talk on it, and got about 75% of the info. So the paper will hurt your head. Also, typically LSI won't help with polysemy (there's a good IR midterm question by the way!). In fact, you could make the argument (/me sticks neck out) that LSI hurts polysemy. However, I think (/me extends neck further) polysemy can be naturally addressed (to some extend) by multi word queries. For example, see the problem in this <a href="https://www.google.com/trends?q=php%2Casp+%7C+asp.net%2Cruby+on+rails%2Cjsp%2CPython&ctab=0&geo=all&date=all&sort=1" rel="nofollow">Google trends programming languages graph</a>. At least when I view it, one of the key news stories is "pet python strangles man". Polysemy has caused us some problems. But the query I'm personally more likely to try is "python programming". I'd wager this won't bring up too many hits about pet pythons (because of the additional 1-word context). In this case the synonymy between "programming", "development", "web development", etc. is going to be the bigger problem.
 Cancel
 - Hamlet Batista
 
 2007-07-18T14:40:24-07:00
 
 Hey Nick,
 
 That is some really cool stuff. You and I are going to be really good pals :-) Those papers look really scary. I am working to improve my linear algebra and graph theory skills, they are still undergraduate level.
 
 BTW: I like you post about Amazon EC2, I'd will like to play with it in the near future.
 
 1 0
 
 Hey Nick, That is some really cool stuff. You and I are going to be really good pals :-) Those papers look really scary. I am working to improve my linear algebra and graph theory skills, they are still undergraduate level. BTW: I like you post about Amazon EC2, I'd will like to play with it in the near future.
 Cancel
DavidLaFerney

2007-07-16T18:22:32-07:00

Good post. Sounds like if search engines are ever perfected SEO will be a memory. Might be a while though...

1 0

Good post. Sounds like if search engines are ever perfected SEO will be a memory. Might be a while though...
Cancel
- Hamlet Batista
 
 2007-07-16T19:27:28-07:00
 
 DrDave,
 
 I think it will be a looong while :-)
 
 1 0
 
 DrDave, I think it will be a looong while :-) 
 Cancel
- Amit Bhawani
 
 2007-07-17T22:27:59-07:00
 
 You are right we should be happy with these irrelevant results :p because thats when people need Seo's
 
 1 0
 
 You are right we should be happy with these irrelevant results :p because thats when people need Seo's
 Cancel
 - Hamlet Batista
 
 2007-07-18T19:17:58-07:00
 
 Amit,
 
 That is the point of the whole post. We are needed!
 
 1 0
 
 Amit, That is the point of the whole post. We are needed! 
 Cancel
 - Kwyjibo
 
 2007-07-19T12:29:48-07:00
 
 Imagine if we could all band together to use Robots.txt to keep Googlebot from indexing our sites. Then they'd have no content to show people.
 
 Google needs us as much as we need them.
 
 1 0
 
 Imagine if we could all band together to use Robots.txt to keep Googlebot from indexing our sites. Then they'd have no content to show people. Google needs us as much as we need them. 
 Cancel
 - Hamlet Batista
 
 2007-07-19T16:02:14-07:00
 
 Imagine if we could all band together to use Robots.txt to keep Googlebot from indexing our sites. Then they'd have no content to show people.
 
 That would be very interesting to see. I think that would be a shocking surprise for them.
 
 They know we need each other and that is why they are more forthcoming lately. More tools, more feedback, etc.
 
 1 0
 
 <blockquote>Imagine if we could all band together to use Robots.txt to keep Googlebot from indexing our sites. Then they'd have no content to show people.</blockquote> That would be very interesting to see. I think that would be a shocking surprise for them. They know we need each other and that is why they are more forthcoming lately. More tools, more feedback, etc.
 Cancel

Post Analytics

Comments 58

Log in to Moz

Don't have an account?