statistics
Drupal Statistics
Not everyone I know is on the Drupal development mailing list so I thought I'd pass along this interesting bit of research that webchick compiled. It's a number of Drupal statistics including number of active installations, most popular modules, number of Drupal downloads, etc.
Read it here.
How Many Contributors Were There for Drupal6
I've tabulated some data based upon the commit messages for Drupal6. Note, this is not a good judge of contributors. A better judge would be "people who have "commented on an issue and changed the status" or "uploaded an attachment" or even "posted a comment" but this is one way of gathering the data.
I wanted to capture my method and share the output data. The data is available here: http://spreadsheets.google.com/pub?key=pVuTBhrgLH93h7VQNyyynXw
Statistics?
There are lots of processed statistics in this group, but is the raw data available anywhere? Acquia is looking to grow Drupal's use by a magnitude of 10, which implies they must have some good data to base their projects off. Is that available to others? Many thanks!
OSS Census/Discovery
Not sure if others have noted the new open source census effort, which aims to measure open source software use both locally (via use of a new OSS Discovery tool) and globally through the aggregation of local discovery data.
http://www.osscensus.org/
http://ossdiscovery.opensource.collab.net/
SVG, PHP and AJAX references
Some of my bookmarks, for reference:
* detect SVG capabilities: http://blog.codedread.com/archives/2005/06/21/detecting-svg-viewer-capab...
* AJAX/JS and PHP SVG charting: http://cristian.nexcess.net/ajax/svg_chart/
* The lib I used (code from) in my proof of concept: http://www.phpclasses.org/browse/package/2184.html
I will collect more bookmarks at http://www.simpy.com/user/berkes/tag/svg
October Download Statistics
On November 15th Gerhard released the download statistics for all packages on Drupal.org (with formatting by Earl). Here are two charts that summarize the data and the accompanying Excel. Suggestions are welcome on how to improve them or on other ways to analyze and display the data.
1. Top 30 Packages (click thumbnail to enlarge)

These top packages are comprised of 3 versions of Drupal, 20 modules, 5 themes, and 2 videos.
2. Overall Distribution (click thumbnail to enlarge)

When looking at the distribution of downloads we see noticeable breaking points at 16, 36, and about 590, which segment packages into four classes: Tier 1 (critical), Tier 2 (very popular), Tier 3 (moderately popular), and Tier 4 (unpopular).
Group activity data analysis
There has been some recent work analyzing the growth on drupal.org, and I think we should do something similar for groups.drupal.org.
I would be interested in charts/histograms showing the distribution of:
- Groups by number of subscribers
- Groups by posts/week in the past three months
- Users by number of subscriptions (without identifying information)
- Users by number of posts (without identifying information)
- Users by number of posts (without identifying information)
- Total posts over time
- Total posts per week over time
- Total groups over time
- New groups per week over time
- Median subscriptions per user over time
Did I forget anything or should some of these be removed/tweaked? I am willing to generate the charts if someone can run the queries, and I can figure out the exact sql queries if needed.
Next generation of Drupal data collection and analysis tools, featuring extensibility
I recently began sketching out a module that will ultimately allow quick generation of custom reports and graphs based on arbitrary tables (Drupal or other). It's called Reports.
After beginning work on this module, I was contacted by several others who are working on related modules.
There are currently several modules available that augment Drupal's internal reports/statistics, however, none of them, AFAIK, were designed to be extensible.
I believe Drupal would benefit greatly from an extensible set of tools for collecting data and creating reports and graphs related to a specific Drupal website. These tools would be flexible and should be able to utilize arbitrary tables and perhaps new tables whose purpose is solely for reporting purposes (ie "cooked" data such as user sessions).
Collecting data
Is there anywhere a way to get data:
- Hown often are modules be downloaded?
- Which modules are installed?
Like from:
- http://drupal.org/node/46109
- http://drupal.org/node/25704
- admin/settings/drupal (Send system information)
Ajax statistics
How useful is Ajax? I am working on a module to record who allows Javascript, who has xmlhttp available for Ajax and to give users control over Ajax. Part of the project is described in Statistics visitor enhancement. The rest is described here.









