A programming principle should only be followed when it provides more long-term benefit than any alternative. This may seem self-evident; it should be apparent from the fact that a principle that doesn’t provide long-term benefit obviously isn’t a good principle.
This is something that mediocre problem-solvers don’t always understand: deviation from generally accepted principle can be a sign of weakness or, contrarily, a sign of strength.
To take an analogy, in chess it is said that a rook is worth five, a bishop three, a pawn one, and so on. These facts aren’t anywhere in the rules of chess; they are just very good approximations, overall, that emerge from the game. About 90% of the time, you can use math based on these numbers to make good decisions.
However, 10% of the time other situational factors supersede this principle. Though there is an appeal to such simplicity, as a player gets closer to mastery of chess they must let go of these numbers and move to a more advanced situational calculus. The same applies to coding.
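To make the "math based on these numbers" concrete, here is a minimal sketch of the point-count heuristic. The rook, bishop, and pawn values come from the text; the knight and queen values are the conventional ones, and the function names are made up for illustration.

```python
# Conventional material values. The text names rook, bishop, and pawn;
# knight = 3 and queen = 9 are the usual companions, added for completeness.
PIECE_VALUES = {"P": 1, "N": 3, "B": 3, "R": 5, "Q": 9}


def material(pieces: str) -> int:
    """Sum the heuristic value of a string of piece letters, e.g. "RBP"."""
    return sum(PIECE_VALUES[piece] for piece in pieces)


def trade_looks_good(give_up: str, win: str) -> bool:
    """Heuristic only: True when the material won outweighs the material given up."""
    return material(win) > material(give_up)


if __name__ == "__main__":
    # Giving up a rook (5) for a bishop and a pawn (3 + 1) looks bad by the numbers.
    print(trade_looks_good("R", "BP"))
    # Giving up a bishop (3) for a rook (5) looks good by the numbers.
    print(trade_looks_good("B", "R"))
```

The function knows nothing about the position on the board, which is exactly why it is only right most of the time: a "bad" trade by the numbers can still be the winning move.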
Even the purest of coding principles have exceptions: though you’ve probably never considered it, every time you copy-paste you participate in a violation of DRY.
I’d like to reiterate: every single (non-trivial) coding principle necessarily has an exception. Why? Because every principle incurs a cost in exchange for a situational benefit. And for any such principle we can imagine a scenario, however unlikely, where the situation that produces that benefit never arises.
There is only one universal principle of problem-solving: the cost-benefit analysis. Every other principle only offers value as a heuristic approximation. This is the calculus by which an expert must weigh the relevance and generality of any other principle.
(All pictures were taken at the time of writing; no doubt the features and interface will change with time.)
For this case study I’ll pick on Chrome because it’s hands-down my favorite browser, and because I want to elaborate on the criticism I made in my UI manifesto. It’s excellent in many ways, particularly the developer tools. However, I periodically still learn about cool hidden features it has.
Case in point: http://www.igvita.com/slides/2012/devtools-tips-and-tricks/ . It’s a mixed compliment to have an article written about your software that is entitled, “Wait, [your software] can do that?” It’s great to impress with your feature set, but the fact that such an article needs to be written to unearth these capabilities is indicative of an underlying unintuitive UI.
Every single application I use fails UI/UX in simple, fixable ways.
Part 1: An Analogy
You travel 200 years into the future and walk into a bank. Fortunately everything is in English. You recall having some savings that should have accumulated some interest by now, so you want to withdraw. The first thing you do is look around the bank: nothing is familiar [1]. So you look for somebody to ask for help. Though there are people, you don’t know who is an employee and who is not, since there are no consistent uniforms and there is no “counter” that would designate an employee-only zone [2]. You walk up to a girl in grey and say, “Hi, can you help me? I’m quite new here and am totally lost!”
The attractive young girl looks back and says, “Open all bookmarked accounts or Open all bookmarked accounts in table view?” You look at her blankly and she returns the blank stare before repeating herself exactly.
You realize she is an android. “Help?” you ask her. “Can I speak to your supervisor?” Nothing. You realize she is unable to communicate with you and also unable to help you find anybody who can [3]. You decide you need to leave the bank and get help from outside. As you start to walk out the door, a guard blocks your path and asks, “Are you sure you want to terminate transaction initiation?” You have no idea what this is, but you’re uncomfortable, so you say yes and hustle away.
Perhaps you come away blaming yourself for not understanding. Even though you know that with time you could become an expert in such a place, you leave frustrated and preferring not to return.
Part 2: The Alternative
You walk into a futuristic bank. Their advanced technology recognizes you have never been here before, but doesn’t aggressively nag you about it. You see several clearly distinct counters, each with one person in a special red outfit; you infer those are the employees. You walk up to the nearest station and say, “Hi, I’m new here. I need to check if I have an account.” Though full language comprehension technology isn’t around, the android recognizes you are a new person and that what you said was very far from an appropriate action. “I didn’t understand that. I am an automated account-opener robot. I recognize the commands ‘open account’, ‘close account’, and ‘help’. You can also always find help at our help desk.” She gestures.
You see that one of the stations has a HUGE sign labeled HELP hanging over it, and that every desk has a sign over it [4]. You feel a bit silly for not noticing it sooner. You go to the help desk and soon are getting futuristic video tutorials.
Part 3: The Lessons
Follow a convention for design (preferably a common one).
This is actually a point I have seen championed before in UI, yet it just doesn’t seem to be followed. It’s like going into a house and not realizing there was a door somewhere because the knob was so elaborate (or out-of-place) it was unrecognizable.
I’m going to pick on Chrome because it’s one of my favorite applications. Look at the debug toolbar. Could you realistically expect somebody to know how to disable the cache with that? Do you even remember how (perhaps you didn’t know it’s possible)? Turns out the gear in the lower right (I don’t know why it’s separated from the other buttons) is effectively a menu (despite no visual indication of this).
Distinguish navigation from content.
It should always be apparent to the user whether something is system-wide navigation, app-wide navigation, or custom content. This is usually done quite well and isn’t often a problem. But look at the picture below:
Don’t you agree that uninstall button is weirdly placed? Look at it for a while, and you may realize that it’s simply a bookmark to a page entitled “Uninstall Chrome,” but there’s no systematic way to know this.
All errors should give you unambiguous directions to their solution.
Directing to an intuitive help interface is sufficient; the point is that somebody who’s never used your system should at the very least be able to systematically find their way to the information on how to use your system.
Case Study: DOS Prompt. I remember as a lad of 9 being at the magical black terminal that responded to secret passwords (called “commands”) consisting of unguessable combinations of characters. It was reminiscent of a haunted mansion in Scooby-Doo where the only way to get to a room would be by playing the secret piano keys. I would type almost anything except the commands I knew (qbasic, dir, cd) and would get back an ominous “Bad command or file name.” Until one day, in an act of annoyance, I typed help (I meant it as more of an imperative at the machine than anything). All of the “Bad command or file name” messages should have mentioned this command! There was an amazing help tutorial back then in DOS (not like now) that taught piping and everything, with enough examples for me to learn at that age with a limited vocabulary.
I want to take a second here and hammer this point. QBasic was one of those “magic passwords” that I learned only through my brother’s friend; my whole foray into coding was almost stymied by terrible UI.
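As a rough illustration of that lesson, here is a minimal sketch of a command prompt whose error message always points at the way out. The command table and the respond function are hypothetical, invented for this example; this is not how DOS itself was implemented.

```python
# Hypothetical command table for illustration; not the real DOS command set.
COMMANDS = {
    "dir": "List the files in the current directory.",
    "cd": "Change the current directory.",
    "help": "Show every available command.",
}


def respond(line: str) -> str:
    """Return the prompt's reply to one line of user input."""
    words = line.strip().lower().split()
    name = words[0] if words else ""
    if name == "help":
        # Enumerate every action the user can take, one per line.
        return "\n".join(f"{cmd:<6}{desc}" for cmd, desc in sorted(COMMANDS.items()))
    if name in COMMANDS:
        return f"(running {name})"
    # The important part: the error itself says how to get unstuck.
    return f"Bad command or file name: '{name}'. Type 'help' to list available commands."


if __name__ == "__main__":
    print(respond("xyzzy"))
    print(respond("help"))
```

One extra sentence in the error path is all it takes for a newcomer to find the help system systematically instead of by accident.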
Enumerate all available actions.
This may be the biggest failing of modern UI. Going back to the Chrome example above, would you believe I didn’t know about that lower right gear for months of using Chrome’s debug menu (and I certainly wasn’t the only one)? With almost any application, I routinely discover that there are hidden features available that I wish I had known existed sooner. Did you know you can drag a window to the left edge of your screen in Windows 7? Did you know you can drag tabs not only within one browser window but between browser windows/instances in Chrome? Did you know that you can move a window with the keyboard by pressing alt-space (then down, then enter, then the arrow keys)? Did you know you can see memory usage of tabs in Chrome by pressing shift-escape? And don’t get me started on Gmail, Office, or Mac. This point is so in-depth and I have so much to say on it that it will take an entire post of its own to cover. Stay tuned for more.
Ed Weissman posted a retelling of a scenario where his boss asked him for “10 minutes” to solve a problem. He reluctantly agreed but ultimately spent 3 hours working on the fix, and for most of that time he was browsing Hacker News while his boss and a coworker named Sue repeatedly fumbled with the software while trying to convey the bug to him.
Of course everybody has opinions on the internet, but what struck me was this: when one commenter going by the id Michael suggested that our protagonist Ed could have tried to be more helpful, the replies rebuked him by roughly 10 to 1, mostly along the lines of: it’s not the narrator’s fault or job description to help others who are incompetent with software.
I want to look at the larger theme here. Off the top of my head, I can come up with several ways to handle that situation better (e.g. multitask with something work-related, help out your co-workers, have bug reports get CC’d directly to you, come up with a better meeting strategy), but the root problem here is either attitude or a naive idealism that business should function mechanically.
The attitude portrayed is: I want to do my piece, and only my piece, and if you aren’t able to interface with me in a rigid predefined way, then I’ll silently resent you / the system. To look at it as a naive contractual system (programmer only does programming, manager only does meta-optimization, etc) is to miss the chance to improve things.
How would you feel if you got bounced around accounting over an issue with your check because several people said “It’s not my job”? This is the same situation with the roles reversed.
And moreover, how can you expect to go anywhere if you take no ownership of outcomes and can’t be liked by coworkers?
One term that comes up far too often in interviews is the highly applauded and highly misunderstood ReST.
I’m not sure why it comes up in interviews, but here are a few guesses:
As an indirect measure of how one stays up-to-date with the developer community.
The interviewer is actually caught up in the trend himself, and believes that this “paradigm shift” is a crucial practice that can’t be learned in a matter of hours.
They don’t know what else to ask, or they were asked it when they interviewed.
I think this particular question, and this type of question, are a bad idea (even though I now know ReST so well I nail it every time).
By sheer virtue of the fact that it comes up so much, a good answer can simply indicate that the candidate has done so many interviews that they looked up this particular idea after stumbling on it before.
Staying absolutely up to the minute with the developer community isn’t important. In fact, by lagging behind a little (not too much) you miss out on a lot of flash-trends. Even if these trends do stick around (RoR, node.js) early adoption comes at a steep price.
Knowing acronyms isn’t necessarily the best measure of being up-to-date. I think a question I hear too little, which is more important, is: “What’s your process for debugging an application?”
So I recommend ditching the ReST question, and really any question on specific acronyms or designs that could be learned in a matter of hours.
You may be shocked to hear, but back when I was young I used the now-hated Visual Basic and Access. In fact, moving these applications to PHP on the job is how I first learned PHP.
Anyways, once I was trying to return random results in an SQL query to an Access MDB from VB / VBA. I scoured the internet and found nothing. Then I devised this little number, which for all practical purposes probably does the trick:
ORDER BY right(right(now*10000000000, 4) ^ (1/ID),4)
where ID is the unique id for the given table. Right(now*10000000000, 4) basically gets you a pseudo-random 4-digit value taken from the fastest-changing digits of the current time. We combine this with the row ID (by taking the ID-th root and keeping the last four characters of the result) to give each row a different, constantly changing number.
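To see why this scatters the rows, here is a rough Python sketch of the same arithmetic. It is an approximation, not a port: the seed is passed in as a plain integer instead of being read from Now, the 15-significant-digit formatting is only my assumption about how Access turns a Double into text before Right() is applied, and the names sort_key and shuffled_ids are made up for the example.

```python
def sort_key(seed: int, row_id: int) -> str:
    """Mimic Right(seed ^ (1/ID), 4): the last four characters of the ID-th root.

    row_id must be a positive integer, like an autonumber ID.
    """
    root = seed ** (1.0 / row_id)
    # Assumption: approximate Access's number-to-text conversion with
    # 15 significant digits; the real conversion may differ in detail.
    text = f"{root:.15g}"
    return text[-4:]


def shuffled_ids(seed: int, ids: list[int]) -> list[int]:
    """Order row IDs by their time-seeded key, as the ORDER BY clause would."""
    return sorted(ids, key=lambda row_id: sort_key(seed, row_id))


if __name__ == "__main__":
    ids = list(range(1, 11))
    # Two different seeds stand in for two different moments in time.
    print(shuffled_ids(1234, ids))
    print(shuffled_ids(5678, ids))
```

The trailing digits of an irrational root have no obvious relationship to the row ID, so sorting on them gives an order that looks shuffled and changes whenever the seed does. It is a trick for casual use, not a statistically sound shuffle.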