Thursday, July 13, 2006

measuring the kde user base

the current size of and growth rate of an open source project's user base is notoriously hard to measure. we can't rely on sales because the software is freely available with many independent distribution channels. in the desktop space, many products allow one to pick the desktop installed post-purchase anyways. our software doesn't phone home(though that would make for a nice slashdot article;) and participation levels on user forums gives us only general guidance that we are growing without actually revealing any numbers ... web stats are not specific enough for us (the browser often belies the desktop), but even if they were they are often skewed by usage patterns that vary between different user segments. to complicate it further, many kde systems are simply not on the internet at all. some have used cd handouts to imply user base numbers with, but that's an amazingly shakey (some might say meaningless) bridge to errect.

but there are some tantalizing hints as to the extent of our user base growth rate out there. everywhere i go it seems i run into kde users: from the streets of paris to print shops i patronize to cocktail party goers to random business people i meet in assorted american cities to users in small towns in norway. it happens so often it no longer really surprises me. this certainly wasn't the case when i first got involved with kde. interestingly, though, i have yet to see a single linux laptop on an airplane besides my own or those belonging to people going to linux conferences. the implication of that is interesting. but my experience is simply anecdotal and hard data would be nice.

which brings me to the numbers in a recent article in brazil's "easy linux" magazine titled "linux para todos" which covered a brazillian national program to make inexpensive computers running free software available to the public. they all happen to ship with kde running on linux distributions (5 different ones if i recall correctly).

the article notes that 200,000 of these machines have been sold in the 12 months spanning from april '05 to april '06. of course, not all of those systems remain kde desktops. one survey i've learned of has found that approximately 80% of those who bought one of these computers ends up installing microsoft windows on it when they get it home. this is not surprising as most people only know windows. i'm happy the retention rate is as high as 20%, to be honest, since the majority of those people are almost certainly trying something new for the first time and that's always the harder path to take.

20% leaves us with 40,000 users. that's over a 100 new kde systems per day due to just one program in one country. how many of those represent new kde users is harder to pin down, but it would be interesting if we could get some data stating that the majority of people who use those systems are new kde users. it's also probably safe to say that many if not most of those systems are used by more than one person.

then there is red flag linux in china. according to this interview with the red flag people they ship over one million copies a year, use kde for their desktop (and do a fair amount of custom kde development to that end) and currently hold 80% of the chinese linux desktop market. given the growing importance of the chinese market, these are pretty important numbers.

of course, of those one million units shipped most are almost certainly for servers assuming the linux usage patterns are similar in china as they are in the rest of the world. it's still nice to know that if a gui is brought up on those servers that it is kde, but how many "traditional" desktop users might there be in that one million units? they didn't say in the interview and i haven't yet asked, so i'll just do some wag'ing (wild-assed guessing) and say that 95% of those shipments are for servers. it could be higher or lower, but let's go with 5% as a number. that gives us 50,000 desktop units or nearly 137 new users per day.

we end up with some 240 new installations each and every day due to two projects in two countries. i'll wager that many of those reading this had not heard of one or even both of these projects before. how many other similar projects are out there, large and small, pushing kde out into the market place all around the world? if those 240 installations a day represent 5% (which i think is probably generous) of our user base growth rate that would give us 1.75 million new desktop (not server) installations per year. i wouldn't fall off my chair if the number was even higher.

the best news is that there is simply no way that 5 years ago we were seeing this kind of growth. which means adoption is accelerating. this implies that if we keep on track even better numbers await.

9 comments:

Quintesse said...

Well how about putting a "Let us know you are using KDE" icon on the desktop? :-)

The problem with phoning home is not telling the users about it and of course with this system you still wouldn't know how many _actual_ users you have, but you'd get a lower bound at least.

Heck, I'd click on it, that's one!

Anonymous said...

KDE does not only run on Linux. Keep the BSD's in mind, though they're a minority ;-)

Anonymous said...

I have to agree with quintesse. Putting a "Let us know you are using KDE" icon on the desktop is a great idea. Even after the firt log in and the welcome screen and basic config steps you could add to the final step a button to send a "Hello! I'm using kde" message over the internet...

Tim said...

But couldn't that cause there to be adds for say tech support on that same screen? Once you start adding things that are meaningless to the user companies feel that they are justified.

Anonymous said...

Well, I for one think adding such a button would be a bad idea. As you said, stick to doing the good better and numbers will grow.

If you want yet another kde-promoting project, ask canonical how many kubuntu cd's it has shipped since the release of 6.06. I bet that's a big lot. :)

Aaron J. Seigo said...

"ask canonical how many kubuntu cd's it has shipped"

i'm wary of numbers based on CD give aways. CDs are easy to give away but don't mean an installed (and used) desktop system.

that's what's interesting about the two projects in the blog entry: they are about installed systems that get put to use.

Anonymous said...

Hallo, I definitely think that your approach of makeing rough estimates based on some known values is quite feasable, as long no surveys are available. The lacke of data you are facing is called a Fermi-Problem and as such a more in depth estimate of the numbers might actually really get you some useful numbers. But you should make a few consistancy checks (like comparing the ratios of project contributing to kde user base and country size ... ).

For those suggesting a "Let us know you are using KDE" button somewhere. This will create it's own uncertainty, since you'll have to get an estimate how many people are actually using that button.

Getting a lower bound might seem interesting, but the effort put into this isn't really justified by the results.

Another approach might be to let KDE try to create a unique ID per desktop install. These are broadcasted when using KGetHotNewStuff or something similar. The ID could be the time of the epoch when KDE was configured first on that computer. This number is of course not unique - but unique enough. Then collect the ID's that have been seen somewhere and you get approximate installation rates.

Just from top of my head =) -
Klaus Blindert

Johan said...

How about asking some of the major distros for download statistics from their repositories (including their larger mirrors)? I think that would give a reasonable estimate of the number of internet-connected systems, at least.

Anonymous said...

"i'm wary of numbers based on CD give aways. CDs are easy to give away but don't mean an installed (and used) desktop system."

Well, it could be that it is never installed, but then again, the CD could be used to install more than one system, so counting one install per CD may not be entirely wrong. (My Kubuntu CD at work has been used for a couple of different machines, for instance.)