Last weekend at Scary SEO, Dave Snyder gave a very solid presentation on Information Environment Design…what the rest of us would call Information Architecture. Duncan Morris also wrote about information/site architecture on SEOmoz very recently. While absorbing both of these presentations, I couldn’t help but think once again of the analogy I repeatedly come back to when trying to explain Info/Site Architecture to people: the filing cabinet.
Many people, whether new to the web or not, seem to have a difficult time wrapping their heads around how to logically structure their site’s information for both users and engines, but I’ve found that comparing it to something everyone’s familiar with can make it very easy to understand even complex aspects of site architecture.
Let’s first look at how a standard filing cabinet is organized: You have the individual cabinet, drawers in that cabinet, folders within the drawers, files within the folders, and documents within the files.
There is only one copy of any individual document, and it’s located in a particular spot. There is a very clear navigation path to get to it.
If I want to find the January 2008 Invoice for a client (Amalgamated Glove & Spat), I will go to the cabinet, and perhaps open the drawer marked Client Accounts, find the Amalgamated Glove & Spat folder, look for the Invoices file, and then flip through the documents until I come to the January 2008 invoice I’m looking for (again, there is only one copy of this…I won’t find it anywhere else).
Now, before we get into details, let’s make a quick comparative reference between our filing cabinet’s information architecture, and that of a website. Let’s use Craigslist as an example. We’re not paying attention to their page design or navigation, just their information architecture.
This sort of information architecture provides a simple, straightforward way for your visitors to find individual items or content. It’s the way we all learned to organize information growing up, and the way we naturally want to navigate down through a site. It’s also an excellent architecture style for SEO purposes. It’s shallow, easily crawled, and if you remember to think of individual content items as documents in a filing cabinet, there will never be more than one copy and you’ll avoid duplicate content problems.
Let’s also look briefly at how this basic filing cabinet approach can work for some more complex IA issues.
Subdomains:
Subdomains should be thought of as completely separate filing cabinets within one big room. They may share similar architecture, but they shouldn't share the same content and, more importantly, if someone points you to one cabinet to find something, they’re indicating that cabinet is the authority…not the other cabinets in the room. Why is this important? It will help you remember that links (i.e., votes or references) to subdomains may not pass their authority to other subdomains within the Pay-Level Domain (e.g., *.craigslist.com, wherein * is a variable subdomain name).
Those cabinets, their contents, and their authority are isolated from each other and may not be considered in concert with each other. This is why, in most cases, it is best to have one large, well-organized, filing cabinet (may I suggest the www variety) than several that may prevent users and bots from finding what they want.
Redirects:
The most organized Administrative Assistant I’ve ever seen in my life was a woman named Corley at a film company I used to work for. She didn’t know it (neither did I at the time), but she used 301 redirects inside her literal, metal filing cabinet. If there was something she found herself looking for in the wrong place, she would stick a Post-It note in there reminding her of the correct location. Anytime you looked for something in her cabinets, you could always find it because if you navigated improperly, you would inevitably find a note pointing you in the right direction. One copy. One. Only. Ever.
Redirect irrelevant, outdated, or misplaced content to the proper spot in your filing cabinet and both your users and the engines will know what qualities and keywords you think it should be associated with.
Noindex and Nofollow:
It may be the case that you have certain files or documents that you only want your users to see, but not the search engines. In those cases, using noindex is a way to mark areas of your filing cabinet as Classified. While the engines may sneak a peek, they won’t share the information with anybody. The only way someone will find that information is if you they look through your cabinet and come across it, or if you provide a declassified copy for them to search for.
Nofollow, on the other hand, is a way to slap the engines' hands away when they’re going through your cabinet the wrong way. Users may jump around and need several different ways to find things in your cabinet. Or you may need to reference another cabinet that has relevant information, but you don’t advise people to look in it. In these cases you can use nofollow to tell engines that they should ignore certain links; they might be worthwhile to users, but you don’t advocate their use and would prefer they’d use other links to get places, or look at other sources that you do advocate.
URLs:
How easy would it be to find something in a filing cabinet if every time you went to look for it, it had a different name? What if that name resembled “jklhj25br3g452ikbr52k”? Static, keyword targeted URLs are best for users and best for bots. They can always be found in the same place, and give semantic clues as to the nature of the content.
These specifics aside, thinking of your site information architecture in terms of a filing cabinet is a good way to make sense of best practices. It’ll help keep you focused on simple, easily navigated, easily crawled, well-organized structure. It’s also a great way to explain an often complicated set of concepts to clients and coworkers.
If you can think of other ways the filing cabinet analogy can be applied, or if you have your own helpful analogies for explaining SEO concepts, feel free to share them.
The Filing Cabinet Theory of Site Architecture
Design
The author's views are entirely his or her own (excluding the unlikely event of hypnosis) and may not always reflect the views of Moz.
The filing cabinet -dont foeget the post it note which sticks out of a document that can be see upon opening the cabinet that gives a direct reference to relevant content from the sites home page(link juice goodness)
Subdomains - i thought G now treated these like folders in your site after the EBay debacle?
Also its been a while now since G started accurately following dynamic URI, in fact G even say to stop doing URL rewrites as its CONFUSING BOTS!
nofollow = get your hands our of my draws?
Great analogy. With large sites, I often find myself getting bogged down in too many sub-directories. Enforcing a structure like the filing cabinet will help greatly.
I'm confused. Are you arguing in favor of a "my-way-or-the-highway" approach to classification?
Imagine an ecommerce site with an AJAXian product hierarchy/drilldown, where you can find, say, a men's green woolen sweater in size large by navigating thus:
but not by *any other* path? I'd call that a crippled implementation. (Not that it's uncommon!)
The problem I wrestle with daily is avoiding SERP dilution and duplicate content in a more sophisticated, "faceted" schema. (I know Rand has addressed this, long and often. But it ain't easy.) Am I missing the point? (That's not uncommon either!)
great ideas and a very usefull info.
Great analogy. Password protected sites are like locked filing cabinets : )
This is a great simple way of explaining the importance of information architecture. Nice post!
good visuals...thank you!
this filing cabinet example reminds me a comic situation I happened to be into as I started a job for a public sector organisation where the library staff had been running the web for a few years. On the main website's top navigation I noticed an item that really drew my attention 'UK A to Z'
As you would click on it and navigated further down the site, the content had been modelled in a cataloguing sort of librarian style, so you had A, B, C, D, etc... and each of those subcategories was supported by a web page corresponding list of products, services and resources whose names would start by the letter assigned. It took me a while to try and argue convincingly that such taxonomy was not responding to the underlying principles of how people use websites.
While I find the post very interesting and the comparison very appropriate, I am not sure I would want to use that filing cabinet example in all situations.
Well done Scott! you just made it simpler
That is a simple and easy top to bottom understanding of web structure, but the hairy part comes in when your internally linking those documents and categories. That is really the arhcitecture art so to speak.
Simple is always better.
No Follow= "get your hands out of my bush!" -mom in transformers
Nice post Scott
Scott - Thank you for this great analogy. I work at a university, and everyone thinks they know what they want and how they want it.
This example structure shows pure usability, benefit for both the end user and our in-house offices!
Thanks, and thumbs up!
For https://www.domainname.com & https://domainname.com
Is the domain with "www" and without "www" filed into different whole room or same room ? For the existing structure which has been cached.
Query site:https://www.domainname.com and site:https://domainname.com dosent show different result but i am able to track the difference For https://www.domainname.com & https://domainname.com while running ranking report. Any way to track it first.
Edit : had removed comment frist as it seemed inrelative. But Added again but couldnt add it same to same.
Hmm. Correct me if I'm wrong but isn't this kind of missing the point?
Be organized is the point of the post.
https://domanname.com should forward to https://www.domainname.com
but if you have https://widgets.domainname.com and https://gagets.domainname.com they would be seperate cabinets since they are subdomains.
The idea of https://domainname.com would be the whole room. :D cause it would entail everything throughout each subdomain.
Yes, be organized is the point. Sorry meant room instead of cabinet. Removed my comments.
I was asking for the exisiting rooms/cabinet structure. If they were treated differently and how to track them.
Why removed? that was a legitimate question. :D
Put it back!!! :P
Scott. You lived up to your name. :D (Great Scott)
Fantastic analogy. I feel like I can expound on it more and more, as I think about it and I just might adopt it when I teach people SEO.
Outside of the filing cabinets you have a general (high volume) word outside. "Blue widgets" and then each folder is more specifc "Blue widgets for PC" "Blue widgets for iPhone" etc.
I like. I like a lot. :D
Drinks on me next time. =)
I love the simplicity of this analogy. Great post.
The post-it note analogy also explains deep internal linking.
If there's one particular document that I frequently use, then rather than having to try and find its folder in the cabinet, then try and find the correct document within that folder, I can just stick a note on it. This way, as soon as I open the cabinet, I can go straight to the document I need.
EDIT: Good job Mark! ;)
exactly my point ;)
Well done Scott! :)
A very interesting post and intuitive analogy for all users...
I'd like to see "all" the social-network websites pay attention to their IA!
great scott, according to your example of filing cabinet, a subdomain would be another filing cabinet? or a drawer within that filing cabinet? Thanks.
"Subdomains should be thought of as completely separate filing cabinets..." :)
I just learned a lot about filing cabinets! jk. great post!