Statistics/analytics data strategy for (multilanguage) site?

Events happening in the community are now at Drupal community events on www.drupal.org.
playfulwolf's picture

Hello everyone,

The site, which contains from ~100k to ~1M nodes needs web analytics strategy. It may seem trivial but will describe the site and requirements I am talking about:
-3 different languages, nodes are localized: the same content but 3 different paths. 3 XML sitemap are used, so it is 3x more content for search engines
-Different content authors have to see the summary of their authored content: how much every node was visited total (on all 3 languages combined!!!), daily, weekly, monthly, some other minor stats. Ideally all the data is kept until node is published.
-Data has to be accessible by Drupal, so that some basic visualization, graphs, charts could be drawn, calculations made and Views are build
-Some social networks stats

Google Analytics Reports seems the way to go, are there any other modules or tested external/internal solutions, which may be up to the task? Also, what about social stats? There is a lone module, which can show how many times node was shared on social networks, but Drupal community is surprisingly silent on these topics.

The question is: how will that work for multilanguage site? How to solve 3 different paths and actually 3 different contents in 1 node problem?

Comments

you might want to look at

Andre-B's picture

you might want to look at https://www.drupal.org/project/piwik as well.

"The question is: how will that work for multilanguage site? How to solve 3 different paths and actually 3 different contents in 1 node problem?"

since they all have the same nid I don't see the problem, just set your logging up to log against node id, and include the language/ tld as well as custom data. 3 different paths for the same node should be configurable in drupal7 core already.

maybe look at: http://hojtsy.hu/multilingual-drupal7

Andre, English is not my

playfulwolf's picture

Andre, English is not my primary language... I have described my problem not precisely enough.
The site is pretty much in final development stages, and Drupal localization techniques are totally clear.

For example, the site url structure for the node is this:
site.com/en/123456-beautiful-girl
site.com/de/123456-schones-fraulein
site.com/ru/123456-krasivaja-devushka

Google Analytics and Google search engine sees that as 3 different content pieces - I want that to stay as it is, because of SEO, general usability, and because content is a bit different for all 3 languages.

The exact questions are:
1. How to combine stats from 1 node with 3 different urls into one? Google Analytics or Piwik or any other web stats service do not know that this is the same node, or is it somehow possible to mark that those services understand?
2. Are there any other similar and proven alternatives to Google Analytics Reports module to get the data from external sources to Drupal? Without coding custom module, of course.
3. The same about social stats... There is just a
4. Is internal statistics modules up for the task?

drupal+me: jeweler portfolio

you might want to use

Andre-B's picture
  1. you might want to use canonical urls, the setup to get everything up and running the correct way though this might take some work.
  2. I don't know of any, custom module coding was always the way to go till now - let me know if you find something
  3. don't know
  4. depends on your hardware and traffic, I guess the ajax counter should work here (and still have the usual page cache available)

Analytics

Group organizers

Group notifications

This group offers an RSS feed. Or subscribe to these personalized, sitewide feeds: