The value-based pricing trap

Software pricing

Sometimes pricing is easy. For example, the price of oil is pretty straightforward – there is a market and it sets the price. In many cases prices are just a little higher than production costs – this is the case for PCs or for some consumer electronics. Software is different and much more complicated.

Two things make software pricing complicated:

- It often isn’t directly substitutable. If C&H overprices their sugar, I’ll buy generic. What should I do when Microsoft overprices Excel? There are substitutes, but there is a significant switching cost.

- The marginal cost per unit to produce is very near zero. If Toyota sells extra minivans, they have to pay a lot to build them. If Oracle sells extra databases, they may have to pay a lot of sales commissions but they don’t incur additional cost to produce them.

Put these together and you get a pricing situation where there is a huge gap between the marginal production cost of extra units and the value to the user. Since the price should in general be somewhere between the marginal unit production cost (near zero in the case of software) and the value to the user (because above that nobody would buy), the range of possible prices to consider is very wide.

Given a wide range of possible prices at which you can make a profit and customers would buy, where should you price your product? Assuming your price must be the same for all customers, an economist would say to price your product so as to maximize the total profit generated. For a given price x, you can sell d(x) units. if they cost c per unit to produce, you select x to maximize d(x) * (x-c). In the case of software, c is near zero so you are maximizing d(x) * x. This is your total profit, and it is the same as the area of the largest rectangle you can fit under the curve that plots demand against price.

Custom per-deal pricing

Sounds good. Now you’re beginning to see why software is a great business (if you’re not sure on this point you will be if you visit Larry Ellison’s house). In fact the software business is even better than what we described: you can make even more money than you would using the maximization approach in the last paragraph. How? Your price need not be the same for all customers. Classically, you do this by segmenting the market. Airlines, for example, want to make prices as high as possible for business travelers who are viewed as price insensitive, and much lower for leisure travelers who are much more price sensitive; their great insight is that leisure travelers often stay over a weekend and business travelers often do not. Presto, the Saturday-night-stay requirement for discounted airfares. Not bad, but software does even better. Lets see how.

Regardless of the list price, each high end enterprise software deal usually winds up being negotiated by a sales person. I’ve seen many cases where the list price produced absurd outcomes. One of my favorites was a chain of pizza restaurants buying an order management package from Oracle that was priced based on the number of order lines processed – at a list price of 85 cents per order line including the advanced pricing option if I remember correctly. Try telling a pizza restaurant that it has to pay 85 cents to its software supplier whenever a customer adds pepperoni to their small pizza at a cost of 75 cents. They won’t like it very much.

Classically, enterprise software companies solve this problem with sales people who can offer big discounts. If your list price is effectively infinite and always needs to be negotiated (which naturally your sales people do perfectly) you will not just capture the largest box under the demand/price curve but you will capture the whole area under it.

Congratulations, you have achieved pricing nirvana. You are capturing all the value created by your software. Your sales team learns to work with the user to understand the value the software will create; this is wonderful because it makes them a “business partner” not sleazy sales people. But when it comes time to price negotiation, they understand what the value is and you can hold the line. There might be some slight inefficiencies: commissions to pay, customers hiring analyst firms to help them negotiate against you, you hiring business practices people to cross check the sales managers you hire, but this is all minor compared to the massive revenue you are extracting from your customers.

Caught in the trap

It sounds wonderful, doesn’t it? You get to capture every last drop of value created by your technology. Your shareholders should be very happy. But it’s not quite that simple. If you price your software at a level where you capture almost all the value in a deal, you create two problems:

- Your customers resent you

- Because your customers are capturing minimal value, they have very little incentive to grow their usage of your software. In fact, they have a strong incentive to find an alternative situation – even an inferior one – if they can capture even a little more of the value it creates.

Think about it. Imagine that new manufacturing software will save me $5,000,000 and costs $4,999,000 [assume both numbers are discounted for cash flow and adjusted for risk]. Would you buy it? Yes, because it saves you $1000. Did the vendor maximize their revenue in the transaction? Yes. Will you tell your friends that they absolutely should buy it or make it a top priority to install the same software in all your other manufacturing plants? No, because the net savings is very low so the project is barely worth the hassle. Would I switch to an inferior product that saves me $3,000,000 and costs $1,500,000? In a second. Don’t even talk about how fast I’ll switch when a competitor has a superior offering that is priced reasonably!

This is the trap most enterprise software companies are in. My use of the word trap is specific; I am not saying it is a mistake, rather something they have gotten stuck in and something which is very hard to get out of. Leaving money on the table will shrink revenues and the market will punish the stock price. The benefit of customers being more excited about your product and talking it up more is great for your future, but doesn’t help the business in the short term.

Enter open source

Now we’ve arrived at my challenge. I don’t want to fall into the trap. I want to price MongoDB subscriptions at a price point that customers love. I want to leave money on the table. That money that I leave on the table is what makes economic buyers excited about their purchase. I want to save a customer millions of dollars and charge them a modest fee. Why? Because when that happens they’ll be aggressively looking for the next place to use MongoDB. They will tell their friends not just about how great the product is, but how easy 10gen is to deal with and what great value we provide. Short term revenues may be less, but if this makes the business grow faster over time revenues will be much higher.

This fits perfectly with open source. MongoDB users don’t have to pay 10gen. They only pay us if we deliver value that they find compelling. They talk to each other about whether they got value from what they paid us. If they stop getting value, they won’t renew. Our reduced pricing power is, in my opinion, a huge benefit because it helps us avoid the trap of pricing too high.

What else can we do to avoid keeping all the value for ourselves and not leaving enough with customers?  We start with a reasonable list price. If list price is actually a great deal for most customers, much of this problem goes away. Second we are standardizing volume discounting – with very aggressive discounts on large deals, so that even in volume the standard price is a great deal. And finally, we need to explain to our sales team and our customers what we are trying to do.

– Max

Fun with right triangles

Last year my son enjoyed learning the pythagorean theorem. He was fascinated by a picture containing a proof and being able to calculate the length of one of the sides from the other two.

On vacation a couple weeks ago, we found a fun (well, for geeks anyway) driving game, which we called “Iron Chef Triangles”. The game is very simple: given a number, construct a pythagorean triple containing it – that is, find a right triangle whose sides are whole numbers one of which matches the given number. For example, if I say “5″, you might respond “3, 4, 5″ (because 3 squared plus 4 squared equals 5 squared, so you can make a right triangle with sides of length 3, 4 and 5) or you might say “5, 12, 13″ (because 5 squared plus 12 squared is 13 squared). If I say 8 you might say “6, 8, 10″ or you might say “8, 15, 17″.

Try a few: how about 11? How about 20? See any patterns?

Once you’ve figured out how to easily do these in your head, here are two harder problems.

Hard: given right triangle whose sides aren’t necessarily integers, can you make a right triangle whose sides are integers with approximately the same angles? How close can you get?

Observation: the product of the three parts of a pythagorean triple is always a multiple of 60.

Unfairly hard problem: can you find two different pythagorean triples whose product is the same?

Have fun,

– Max

What our heroes tell us about ourselves

Yesterday I was at a dinner and I was asked by a smart business person whether today’s young software engineers are more tuned in to business issues than the software engineers of the past. It wasn’t a question I had thought much about, but fairly quickly – and happily – I concluded that they are. Why? What’s changed?

To start, today’s young software engineers have different heroes. In the 80′s, a software engineer might have idolized Ken Thomson or Dennis Ritchie. They earned advanced degrees and invented programming languages (C) and operating systems (Unix). They worked at places Bell Labs. They weren’t billionaires.

In the 90′s, Bill Gates was on top of the world. He dropped out of school to found Microsoft and made billions on an empire based on DOS and Windows. He was more commercial than the previous generation of heroes, but still made his fortune building technology.

Fast forward. Google rules the world, Larry and Sergey are the heroes. Then Facebook and Mark Zuckerberg. The billions are arriving younger and faster, but something else has changed. The companies are no longer selling technology. Many people think of Google and Facebook as technology companies. They certainly employ a lot of technical people and technology is critical to their business, but they don’t sell technology. The use technology. Their product is you, and their customer is a business that wants to sell something to you.

Nowadays, most young engineers are more interested in inventing the next Facebook than the next Unix. That’s a very different task. You’re building a product that appeals to your aunt or your cousin, not your programming buddy. Virality matters more than threading. The UI has to be appealing – and that doesn’t mean being able to set your command line prompt to include your username and the current directory.

Why the shift in focus? Maybe because much of the technology-related value being created today is not in improvements to the technology but in applications of the technology to how we interact with the world and with each other. This is a normal and healthy thing as the technology industry matures.

I am an old math geek who idolized among others Alan Turing and John Von Neumann, so I am excited to help 10gen revolutionize the database with MongoDB. But if I’m done with that in a decade or two, don’t be surprised if my next gig is more Facebook than Unix.

– Max

The 5 whys – and why do there need to be 5 of them?

An employee at my company (10gen, the company behind MongoDB) sent me this link to a video of Eric Ries talking about the “5 whys” and how technology problems are often people problems.

For those of you not familiar with the notion of “5 whys”, the idea is to ask why 5 times until you get to the root cause. While I think asking why and finding root cause can be important, a few issues jump out at me: first, why do you need to ask exactly 5 times?  Second and perhaps more importantly, when there are multiple answers to a given why, which one do you follow for further investigation.

In the example Eric gives, they start with a server crash and end with a manager who doesn’t believe in training. I would argue this is an attempt to reduce technical management to something that general managers reading Harvard Business Review can do, which is dangerous.

What really needs to be asked? The first why is straightforward.

1. Q: Why did the server crash?

A: There was a bad call to some new API

Now what’s the root cause? Is the problem with the API or the program that called it? Is it a problem of programmer competence, communication, or training? Or is it an architecture problem which is actually bigger than this specific API usage? Was the problem in design, implementation, testing, or documentation? Was it a process problem, a training problem, or a hiring/management problem?  I don’t think 3 or 4 more whys will mechanically answer that. One or two more whys might answer it, but the key is focusing on the right areas.

What technical manager at a startup really thinks that training will prevent a developer from calling an internally developed API incorrectly in a way that causes a crash? Its not impossible, but I think pretty unlikely. Better API docs could help, and better communications could definitely help. Or a more robust API, or more thought about whether that API needs to be public if it can’t be made more robust, or broader test coverage…

I think a better policy might be “2 or 3 whys, but the right ones”. I don’t think it will catch on as a management slogan, but I think it is more likely to yield useful answers.

– Max

Elections: just as irrational as human behavior in general

A friend of mine sent me a link to an article by Danny Hillis on elections. Danny is a smart guy who has done good work in many areas, but his analysis struck me as a) radically oversimplified and b) wrong in some important ways. But it got me thinking about how our changing understanding of human rationality (or lack thereof) in economics might spill over into political science. Let me start with how human behavior differs from classical economic models and move from there to how elections are effected.

Traditionally, economics was based on the notion of rational actors. Each individual makes decisions to maximize their utility functions (think about  a modern version of quantifying the notion of “greatest good” in Utilitarianism). Person A might value summers off more highly than a chance at millions of dollars and therefore go into teaching. Person B might dream of having their own private jet and be willing to work nights and weekends to keep even a small hope of their dream alive. But all of these are rational decisions based on utility functions which describe how much each economic actor values each benefit.

It seems reasonable (to a quantitative person such as myself) to think that human economic behavior works this way and the mysteries of the different choices people make is encapsulated in this unknown and in all practical respects unknowable utility function. Unfortunately for those of us who like simple mathematical explanations of human behavior, people don’t actually behave this way. In 1953 Maurice Allais wrote a paper (warning: it is mostly in French, 44 pages, and costs $10 to download) describing a paradox which showed that even if you look at simple financial wealth, people do not rationally follow a utility function. His paradox attacked the notion that if some percentage of the time (say 98%) two options produce the same outcome, we should base our choice between the options on the 2% of the time when they produce a different outcome (in economics-geek-speak, this is the Independence axiom in the Von Neumann-Morgenstern utility theorem). It seems rational (and official, now that it is part of a theorem), but in fact we care very much what happens in the 98% – we do not ignore the identical part of the two scenarios in evaluating them, irrational as that may seem.

Detailed example:

In scenario 1a, you receive $1 million 100% of the time. In scenario 1b, you take a 1% risk of losing your $1 million prize to gain a 1% chance of increasing its value to $3 million. 1% chance of nothing, 98% chance of $1 million, 1% chance of $3 million.

In scenario 2a, you have an 2% chance of receiving $1 million, in the other 98% you receive nothing. In scenario 2b, again you take a 1% chance of losing your $1 million prize to gain a 1% chance of increasing its value to $3 million. 1% chance of 3 million, 98% chance of nothing.

Some people (a reasonable number I would guess) would choose 1a over 1b but 2b over 2a. Mathematically speaking, both decisions are identical: you are giving up a 1% chance at $1 million to increase a $1 million prize to $3 million 1% of the time. The other 98% doesn’t matter. But psychologically, it matters a lot. When you are guaranteed $1 million dollars, risking it sounds irresponsible. And if that 1% chance happens, you will live forever with the knowledge that your greed to get $3 million cost you $1 million. In the other situation, if you don’t get anything, you aren’t surprised and you are fairly confident that it had nothing to do with your decision to increase the odds of getting nothing from 98% to 99%.

Now, on to politics.

Hillis lays out a simple left-right political spectrum. He puts two candidates on it and models voting behavior as people voting for whichever candidate is closer to their beliefs. From this, we can deduce all sorts of things, some of which – like elections being close – sometimes model reality pretty well. Then again, Herbert Hoover never participated win or lose in a close presidential election, and how many electoral votes did Walter Mondale get in 1984? 13. Which is 5 more electoral votes than Alf Landon received in 1936!

The problem is that elections are much more complicated than Hillis’s model. Lets put aside for a second that there are multiple issues involved so it is not a simple one-dimensional spectrum. Lets assume there are only two candidates in the election. Lets also put aside candidates personal charm or lack thereof and their potentially scandalous sexual, military, or substance consumption histories. This model is an still an oversimplification in two critical respects:

1. Voters do not have a simple choice between voting for the Democrat and voting for the Republican. They can stay home. Or they can volunteer or donate money or attempt to harangue their friends into voting for the candidate of their choice. Imagine an (exaggerated, to make a point) election where:

a) 60% of the electorate prefers Smith to Jones, but only by the narrowest of margins; they are generally dissatisfied with both candidates

b) 40% of the electorate strongly prefers Jones to Smith – in fact they thing Jones is close to their ideal candidate and Smith is so nightmarish a candidate they would consider leaving the country if he were elected

What happens in that election? Maybe a 48% turnout; 36% of population (90% of the Jones supporters) vote for Jones and 12% of the population (20% of the Smith supporters) vote for Smith. Jones wins in a landslide, with 75% of the vote. Smith, despite a 20 point margin in polls of registered voters, can’t even carry his home state.

Is this exaggerated? Yes, but intensity matters.

2. Just as the 98% that was unchanged effected the outcome in Allais’s paradox, candidates not on the ballot effect voters. Again, here’s a scenario:

The Yellow party and the Purple party each have 50% support. Both of them nominate a candidate who is free of scandal but not particularly personally charming. Each candidate represents the mainstream of their party – put them just slightly to the center of the middle of their party’s 50%, at say the 30th and 70th percentile respectively.

So far, it sounds like a close election.

However, in the Yellow Party primary Mr. Gold had to duke it out with Mr. Canary. Canary was just a few points further from center than Gold, so the primary was very close, and neither party had a majority of committed delegates at the convention. Mr. Gold did have a slight lead over Mr. Canary, so when the super delegates broke strongly for Gold (whom they felt was more electable) they felt they weren’t overturning the will of the people. But the Canary camp felt like if you included the Canary, Maize and Buff delegates (who endorsed Canary when they dropped out, but such endorsements aren’t binding on their delegates so they were counted separately) against the Gold and Ecru delegates (Ecru having thrown his support to Gold), Canary held a lead before the super delegates voted.

The Purple party primary, however, was a different story. Mrs. Plum was the only mainstream candidate; Mrs. Indigo and Mrs. Magenta were really fringe candidates who generated very little support. The party became unified behind Mrs. Plum in February.

How does this election turn out? Not close at all. About half the Yellow party voters are profoundly upset with the Gold campaign and their party. Many of those voters don’t have an ideological problem with Gold per se, but they are so disappointed with how the process played out for Canary (who wasn’t picked as VP) that they are not at all motivated to vote. Despite being weary from the primary, the half of the Yellow party that supported Gold turns out 60% of their voters giving gold 30 million votes, but only 40% of the Canary voters turn out at the general election. Those that do vote reluctantly for Gold, giving him another 20 million votes for a total of 50 million. Meanwhile, 60% of the Purple party voters show up, giving Mrs Plum 60 million votes and a 10 million vote margin.

If that seems farfetched, I can introduce you to some Hilary Clinton supporters who are strongly pro-choice, support gay marriage, and voted for John McCain.

Now, I constructed both of these scenarios while leaving in place many oversimplifications. Overlay multiple issues of varying intensity and varying degrees of organized lobbying with ethnic, religious, and gender identification and elections become very complex things. Certainly the practical modeling of individual factors is quite advanced; I am curious what the intellectual underpinnings are, and to what extent they have embraced the latest thinking in human economic behavior.

– Max

How the heck? A puzzle with a very surprising answer

A few weeks ago I posted a puzzle about two mathematicians guessing the results of coin tosses. Dwight asked me about generalizations to more than two mathematicians. While the problem may seem very different on the surface, in my opinion the underlying issue is actually quite similar and the changes are necessary to generalize from two mathematicians to N.

I posted one version a while ago here which I will repeat as a warmup for the harder version :)

16 mathematicians are in a room. They are each assigned a hat, either black or white. Each hat is assigned totally independently of all the other hats and has a 50% chance of being either color. Each mathematician can see everyone else’s hat but not his/her own hat. The mathematicians all have to independently and simultaneously guess the color of their own hat. They have an hour before the hats are assigned to make a plan, then one minute to view the hats, then they each go into a voting booth to vote for the color of their own hat. While they are viewing the hats they can not communicate or signal in any manner to the other mathematicians.

The success or failure of the mathematicians is judged as a team: if every single one guesses their own hat color correctly, the team wins. If even a single mathematician guesses incorrectly, the team loses. What are the odds of their success, and what strategy should they employ to achieve it? [Hint: the odds are much better than you might first think they are. Really. If you are sure you can't improve them I am happy to find a jurisdiction where we can play this for high stakes!]

Second version (if you thought the first version was too easy):

Everything is just like the first version, except that when the mathematicians enter the voting booth they can vote “White”, “Black”, or “Don’t know”. If anyone is wrong, they all lose. If everyone passes, they all lose. If at least one mathematician chooses a color and all mathematicians who choose colors are correct, they win. Again, what are the odds of their success, and what strategy should they employ to achieve it? Again, you can do better than you might first think!

– Max

Math puzzle with a surprising answer

16 mathematicians are imprisoned by the evil emperor. They are told that in the morning they will each be assigned a hat, either white or black. Each hat is assigned randomly (50/50) and independently. Each mathematician will be able to see everyone else’s hat but not their own. After one minute to observe the hats, they will simultaneously be sent back to their private cells where they have to guess the color of their own hat. If a single mathematician guesses incorrectly, they will all be executed. If all 16 guess correctly, they go free.

What is the probability that the mathematicians will be set free, and how should they guess?

If you’d like a hint, look at this problem. Thanks to Dwight for suggesting more complex problems in that spirit; if this isn’t hard enough I’ll add another twist next week.

– Max

Fun math puzzle: two mathematicians and a coin…

The king is very angry at his two best mathematicians, but rather than executing them immediately institutes the following process:

They are banished to opposite ends of the kingdom, kept under 24 hour guard. Every day  in both Westerville and Eastgate where the mathematicians are being kept one of the guards flips a coin and records the result. Each mathematician must then guess the result of the flip at the other end of the kingdom. A messenger is then dispatched from each location to the royal palace to share the results of the flip and their mathematician’s guess. As long as one of the two mathematicians guesses right, the lives of both mathematicians are spared until the next day. The first day that both mathematicians are wrong, they are both executed.

The king imagined this game would not last more than a week or so. The mathematicians, however, hatched a clever plan as they were being ushered out the door of the castle.

What do the mathematicians do to live as long a life as they can, and on average how long should the king have to wait to execute them?

– Max

How much would you pay to own an undisclosed percentage of my house?

I just got an email from a friend describing the interview offer process of a mutual friend joining a technology startup. The newly minted employee has an offer but they won’t tell him how many shares are outstanding. He was told what percentage of the “employee pool” he was getting, but not the overall percentage. What exactly is included in the “employee pool” is a question – is it just unissued options, is it akk issued and unissued options, is it all common shares including founders???

As I’ve said before when talking about stock options (see this post), it is impossible to evaluate your grant without knowing how many total shares are outstanding. Proof by analogy: how much would you pay for partial ownership of my house? I can create an “investor pool” which represents some ownership of my house. For $50,000, you can own 10% of the investor pool. Sorry, but I am unable to disclose how much of the ownership of my house will be assigned to the investor pool. Please contact me if you are interested! I don’t expect to sell much of my house this way; why should I expect employees to be excited about options offered in the same manner?

Not only does the lack of data make it hard to evaluate the stock grant, but the lack of transparency is concerning as well. One of the things that makes startups great is a team working together towards a common goal. For me, a big part of creating that is giving the team a lot of transparency about the company they are building. I view the employees in an early stage company as stakeholders, and as investors in a sense: they are investing their time, their passion, and their reputation in the company. To not share basic information, especially at an early stage, demeans that investment and undermines the sense of team.

Will my friend still join? Maybe. There’s a lot to like about the opportunity. But the lack of data and the lack of transparency is a red flag. At a minimum, it’s caused an extra round of digging/assessment. As an employer, why would you want to give a potential recruit a reason to question your offer?

– Max

HP/Autonomy and Oracle/Endeca: Is everything now “big data”?

The industry has a tendency to attach everything to the latest hot trend/buzzword, but I just can’t accept Autonomy and Endeca as “big data” vendors.

Autonomy is a search engine vendor that added various content technologies like Zantaz (email archiving) and Interwoven (web content management). Yes, it has “unstructured” in common with big data, but that’s about it.

Endeca started as an ecommerce search engine with aspirations of general-purpose BI tool. So it has “analytics” in it but again, I don’t see it as big data.

I’m not saying that either Autonomy or Endeca is a bad technology, if they are big data then maybe Sharepoint is big data too.

Does anyone out there really think these enterprise search-in-disguise vendors are big data?

– Max

Follow

Get every new post delivered to your Inbox.

Join 455 other followers