data mining

Events happening in the community are now at Drupal community events on www.drupal.org.
mikhailian's picture

Data-mining users in a screenful of code

I recently coded a system to suggest like-minded users and adversaries using voting_api. It's not yet a module, but if there's enough interest, I'll make a module out of it. See the details here.

Read more
Anonymous's picture

Import/Export API Module Version 2

As I'm debugging the code for the Drupal 6 port of importexportapi I am thinking of ideas for the version 2 rendition of the module. The original version of the module was written before there was a Schema API to use and with the Schema API the module could take on a whole different form. Also, currently the module is written to import/export Drupal tables only based on the modules name, not bad, but this doesn't give any room for external data sources that are not tied to a module enabled by Drupal. Therefore an API for the external data needs to be created.

Read more
merilainen's picture

Collecting and exporting user data

Hello all

We are researching social networks and online communities here at the Tampere University of Technology in Finland. In addition to developing them, we need also to study the behavior of the users in these networks. I guess the best term is social network analysis.

Read more
stdbrouw@groups.drupal.org's picture

Keeping content fresh: what's your strategy?

Hi, I'm currently in the planning stage of creating an online presence of a student guide to the university and the city of Ghent. There's one question that keeps popping up: how can you create a guide that is as accurate as possible, and how can you spot most easily what needs an update, a fact check or a rewrite?

I'm exploring a few routes, and I'd like some input and/or hear about your experiences. (This isn't strictly newspaper-related content, but since local newspapers often also try their best to be a guide to the city they cover, I thought I'd post it over here anyway.) Here goes:

Read more
ChrisKennedy's picture

October Download Statistics

On November 15th Gerhard released the download statistics for all packages on Drupal.org (with formatting by Earl). Here are two charts that summarize the data and the accompanying Excel. Suggestions are welcome on how to improve them or on other ways to analyze and display the data.

1. Top 30 Packages (click thumbnail to enlarge)

These top packages are comprised of 3 versions of Drupal, 20 modules, 5 themes, and 2 videos.

2. Overall Distribution (click thumbnail to enlarge)

When looking at the distribution of downloads we see noticeable breaking points at 16, 36, and about 590, which segment packages into four classes: Tier 1 (critical), Tier 2 (very popular), Tier 3 (moderately popular), and Tier 4 (unpopular).

Read more
ChrisKennedy's picture

Group activity data analysis

There has been some recent work analyzing the growth on drupal.org, and I think we should do something similar for groups.drupal.org.

I would be interested in charts/histograms showing the distribution of:

  1. Groups by number of subscribers
  2. Groups by posts/week in the past three months
  3. Users by number of subscriptions (without identifying information)
  4. Users by number of posts (without identifying information)
  5. Users by number of posts (without identifying information)
  6. Total posts over time
  7. Total posts per week over time
  8. Total groups over time
  9. New groups per week over time
  10. Median subscriptions per user over time

Did I forget anything or should some of these be removed/tweaked? I am willing to generate the charts if someone can run the queries, and I can figure out the exact sql queries if needed.

Read more
harry slaughter's picture

Next generation of Drupal data collection and analysis tools, featuring extensibility

I recently began sketching out a module that will ultimately allow quick generation of custom reports and graphs based on arbitrary tables (Drupal or other). It's called Reports.

After beginning work on this module, I was contacted by several others who are working on related modules.

There are currently several modules available that augment Drupal's internal reports/statistics, however, none of them, AFAIK, were designed to be extensible.

I believe Drupal would benefit greatly from an extensible set of tools for collecting data and creating reports and graphs related to a specific Drupal website. These tools would be flexible and should be able to utilize arbitrary tables and perhaps new tables whose purpose is solely for reporting purposes (ie "cooked" data such as user sessions).

Read more
Amazon's picture

Summer of Code proposal: User experience analysis with implicit meta-data

http://drupal.org/node/62120

Please review this proposal and comment, it's not to late to get students to write up a proposal.

Read more
Subscribe with RSS Syndicate content