How many actual developers work on the Linux kernel?

It could be interesting to clone the git repository of linux and query it immediately.

Cloning the repo

Beware it's a large file! (~1.5G)

Install git and run the following (in a new directory):

git clone http://github.com/torvalds/linux.git

Querying the repo

Once you've cloned it, you can analyze the log of commits with git log. Since the log is so long, you may want to limit your research to a smaller period of time:

git log <since>..<to>

for instance

git log v3.4..v3.5

This has theoretically a lot of info you could use. For example, that command prints the 20 most prolific committers along with their number of commits and their email address.

$ git log v3.4..v3.5 | grep Author | cut -d ":" -f 2 | sort | uniq -c | sort -nr | head -n 20
417  Linus Torvalds <[email protected]>
257  Greg Kroah-Hartman <[email protected]>
196  Mark Brown <[email protected]>
191  Axel Lin <[email protected]>
172  David S. Miller <[email protected]>
138  Daniel Vetter <[email protected]>
132  H Hartley Sweeten <[email protected]>
128  Al Viro <[email protected]>
117  Stephen Warren <[email protected]>
113  Tejun Heo <[email protected]>
111  Paul Mundt <[email protected]>
104  Johannes Berg <[email protected]>
103  Shawn Guo <[email protected]>
101  Arnd Bergmann <[email protected]>
100  Thomas Gleixner <[email protected]>
 96  Eric Dumazet <[email protected]>
 94  Hans Verkuil <[email protected]>
 86  Chris Wilson <[email protected]>
 85  Sam Ravnborg <[email protected]>
 85  Dan Carpenter <[email protected]>

The email address can give you an idea about the employers of the developers (google.com, cisco.com, oracle.com).


Go to Kernel coverage at LWN.net and do a search for "Releases", and "Contributor statistics". Also do a search for "Who". There are various articles in that index with titles like (most recently) Who wrote 3.5.

While these articles may not directly answer your question, they are as detailed an answer as you are likely to find on the net, without trying to collect information first hand. In particular, they should provide at least a partial answer to 3.

The statistics gathering is done by gitdm (LWN article announcing it: gitdm v0.10 available). Thanks to vonbrand for pointing this out. The repository can currently (January 2015) be obtained with

git clone git://git.lwn.net/gitdm.git

As for 1 and 2, they are not so well defined. In the case of 1, I imagine the answer is almost certainly yes, some of the time. But it is not clear what you are looking for - anecdotal evidence, or some statistics. If statistics, in what form? In the case of 2, by "3rd party groups" it is not clear what you mean, and what kind of help you are referring to. Would people on an IRC channel count as a third party group, for example? Or are you talking about a more formal contractual relationship where money changes hands? Like an outside company retained for temporary consulting? In any case, such information would be hard to get without talking to kernel developers directly, and even then would likely be anecdotal. I suppose forums like the Linux kernel mailing list would be a possibility in that case.

Tags:

Linux Kernel