Statistics?

We encourage users to post events happening in the community to the community events group on https://www.drupal.org.
thatashok's picture

There are lots of processed statistics in this group, but is the raw data available anywhere? Acquia is looking to grow Drupal's use by a magnitude of 10, which implies they must have some good data to base their projects off. Is that available to others? Many thanks!

Comments

as close as it gets

greggles's picture

afaik, this is as close as it gets: http://groups.drupal.org/node/6551 That page lists things to track and also possible sources (currently open or places where we might theoretically get it).

If you help me to decide the best statistics and we can get some consensus then we can work on exposing the data.

--
Knaddisons Denver Life | mmm Chipotle Log | The Big Spanish Tour

You can help make statistics more available

Amazon's picture

Greg and I both are working to expose more measurements of community activity.

Some of the data is currently available to members of the infrastructure team, who have privileges to run custom PHP code. We are trying to post that data. There are good threads of ideas which need code written.

Here's an important thread for exposing the Drupal downloads: http://drupal.org/node/188993

There are other issues, on quality metrics, CVS commits, etc.

Mostly, we need people who can write code to help on the infrastructure team.

Kieran

To seek, to strive, to find, and not to yield

New Drupal career! Drupal profile builders.
Try pre-configured and updatable profiles on CivicSpaceOnDemand

Thanks for the pointers.

thatashok's picture

Thanks for the pointers. I've been following the slogan thread, and realise consensus is challenging :) I was personally interested in the Drupal usage statistics:

Metric: sites in the wild that appear to be running Drupal
Metric: downloads - downloads of drupal core tarfile
and maybe Metric: Module downloads over time

Out of these, only some module download data is available. There are so many opportunities for value added services to Drupal, and it would help justify the effort required to implement them if there was a better idea as Drupal's actual usage/market potential. With Acquia looking to increase this, its good news for all service providers. But I'd still like to know what data they are basing their estimates/projections on, and whether that could be made available to others/everyone.

Is coding required to get this information? Wouldn't publishing awstats/other statistics go a long way? My day job involves Java, and I haven't coded in PHP, but if its required point me in the right direction.

Developers needed

Amazon's picture

Hi, all the data is being published to the marketing group as it becomes available.

We just need more developers to help write infrastructure so instead of manually compiling the data and posting to the marketing group, it's published live for the community. We need volunteers!

AWSTATs raw log file posting is a project. Update status processing is another.

Don't try to read too much into download numbers. It's a bit like reading tea leaves. Many hosting companies provide Drupal pre-installed. Many of the consulting companies have their own repositories which they use to deploy Drupal. Drupal is multi-site so one copy could mean 1 or a 1000 Drupal sites. It's better to look at a broader range of community activity.

Here's a good summary of Drupal community activity: http://association.drupal.org/membership . A lot of those statistics are determined by looking at obvious numbers on Drupal.org itself. There are some pages with statistics that the 90+ site maintainers on Drupal.org can see. They are very intensive pages so we haven't made them public, but the results are routinely published here.

It's really a matter of joining the infrastructure team and helping to get the stats for the whole community.

Cheers,
Kieran

To seek, to strive, to find, and not to yield

jwhatcott-gdo's picture

Hi. As Kieran has said above, Acquia is working from the same publicly available data as everyone else. I think that we need to work as a community to more actively quantify and report our progress and this thread is a good place for people to discuss that and organize to go get the data. I've been working on my own wish list and possible sources and will post back here when I've gathered my thoughts.

Acquia's stated goal of growing Drupal usage by 10X is an aspirational goal based on our belief in the total opportunity in the marketplace and the power of Drupal and the Drupal community. We didn't do lots of fancy calculations - we just looked at the market based on our years of experience and asked "Why not?" Today the VAST majority of publishing, collaboration, and community interaction on the web is not yet done with Drupal, and we see no good reason why that can't be fixed if we all do the right things to advance the technology, tear down the barriers to adoption, and tell our story in all the right places.

Regards,

Jeff Whatcott
VP Marketing, Acquia

Most of the data greggeles

thatashok's picture

Most of the data @greggles and @amazon have provided pointers for are already-processed stats on usage of Drupal.org.

Acquia must be using something to base its projections on usage of Drupal itself. It would be great if you could share that (or even better, the raw data) with the Drupal community.

Yes, I would like this for commercial reasons (to validate whether my Drupal concept is worth commercialising). But, I'm sure this information would be valuable if transparently shared with everyone.

needs infrastructure, code, help

greggles's picture

@thatashok - I take everyone at face value that they are telling the truth and that's going to include jwhatcott. I don't think they're sitting around on piles of data hording it from you ;) When you say "the must be using something" what evidence do you have for that? None, really...let's not assume too much here.

The data points I describe are all available somewhere, it just needs someone to
1) decide on a way to present the data (xstatistics module? graphstat?)
2) Write the queries/code necessary to expose the data into that module
3) Get the code reviewed and performance tested and make a case for installing it on D.o

For item 3, you can be sure that I will help as much as I can. I just haven't had time to do 1 & 2 lately.

--
Knaddisons Denver Life | mmm Chipotle Log | The Big Spanish Tour

You're correct, I'm was

thatashok's picture

You're correct, I'm was assuming that it was either (1) Acquia convinced the dispensers of $7million with some good stats and trend projections or (2) its way easy to acquire $7million these days ;) It could be something else, and it would have been nice to have some data, but I guess I'll have to decide for myself whether my concept and Drupal's growth are worth further effort ...

The Marketing of Drupal

Group categories

Group notifications

This group offers an RSS feed. Or subscribe to these personalized, sitewide feeds:

Hot content this week