Database

Big Data Drupal: Cloudera Hadoop, MapReduce, Nutch, Solr, Aegir BOA, Drupal 7 ApacheSolr Views

niccolox — Fri, 04 Oct 2013 17:37:37 +0000

I am giving a talk at Badcamp on Big Data Drupal: Cloudera Hadoop, MapReduce, Nutch, Solr, Aegir BOA, Drupal 7 ApacheSolr Views

http://2013.badcamp.net/sessions/big-data-drupal-cloudera-hadoop-mapredu...

I am trying to gather some other experts (e.g. Cloudera / Hadoop / MapReduce + HyperDrupal + Twig, etc.) to come and handle the bigger and deeper questions

https://drupal.org/node/2104503
https://groups.google.com/a/cloudera.org/forum/#!topic/cdh-user/Uwuj1q7bWBY

I am fishing for experts with tech and stories who can share in an interactive session

I hope to do lightning talks by 3 or 4 panelists, and then have most of the session be Q & A

this is a big area and the NEXT BIG THING

UPDATE:

www.BigDataDrupal.com is the marketing minisite for the commercial open source toolchain

www.BigDataDrupal.org is the prototype search site and directory using above toolchain

Howto Make Big Data Drupal Search http://www.bigdatadrupal.com/howto-big-data-drupal-search

Future Directions for Big Data Drupal http://www.bigdatadrupal.com/content/future-directions

contact me
Nicholas Roberts
510 684 8264
or via the contact form on d.o or email

cheers

Forms, schemas and DB transactions

ken205726 — Mon, 08 Oct 2012 19:36:11 +0000

I have gradually built up my knowledge of how to do stuff in Drupal and have virtually always started the wrong way. Eventually a light bulb goes on and I get an idea of the power of the Drupal framework and APIs.

I have just created my first db table in a module and am about to build forms to handle adding, modifying and deleting database records, and then it struck me.

I have just defined my table in a schema. I can write a piece of code to create basic forms from my schema with basic add/modify/delete transactions, and then I can tweak them if necessary.

Has anyone already done this? Is it once again all there, hidden behind the wealth of documentation?

As I might have to do this fairly often, since I am writing form-based DB applications, I prefer to create a tool rather than type carefully for days, basically changing the format of the same data.
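A minimal sketch of that idea, assuming a table definition in the format returned by hook_schema(). The function name mymodule_form_from_schema() and the type-to-element mapping are hypothetical, not an existing Drupal API, and the generated array would still need validation and submit handlers.

```php
<?php
/**
 * Builds a basic Form API array from one hook_schema() table definition.
 *
 * Hypothetical helper, not part of Drupal; the type mapping is an assumption.
 */
function mymodule_form_from_schema(array $table, array $record = array()) {
  // Assumed mapping from schema column types to Form API element types.
  $type_map = array(
    'varchar' => 'textfield',
    'int' => 'textfield',
    'float' => 'textfield',
    'numeric' => 'textfield',
    'text' => 'textarea',
  );
  $form = array();
  foreach ($table['fields'] as $name => $spec) {
    // Auto-increment keys are filled in by the database, not by the user.
    if ($spec['type'] == 'serial') {
      continue;
    }
    $default = isset($spec['default']) ? $spec['default'] : '';
    $form[$name] = array(
      '#type' => isset($type_map[$spec['type']]) ? $type_map[$spec['type']] : 'textfield',
      '#title' => isset($spec['description']) ? $spec['description'] : $name,
      '#required' => !empty($spec['not null']),
      '#default_value' => isset($record[$name]) ? $record[$name] : $default,
    );
    if (isset($spec['length'])) {
      $form[$name]['#maxlength'] = $spec['length'];
    }
  }
  // Inside Drupal the label would normally be wrapped in t().
  $form['submit'] = array('#type' => 'submit', '#value' => 'Save');
  return $form;
}
```

In a module, the submit handler could then hand the submitted values to drupal_write_record(), which reads the same schema to build the INSERT or UPDATE.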

Thanks for any feedback.

Connect the SQL or Oracle database

vbose — Wed, 30 May 2012 11:05:07 +0000

Hi Team,

I want to know whether I can connect my site to a different database, such as Oracle or SQL. If so, what is the procedure, and how do I connect it?
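A rough sketch of one way to do this, assuming Drupal 7. Core ships drivers for MySQL, PostgreSQL and SQLite; Oracle or SQL Server would need a contributed driver installed first. The connection key 'external', the credentials, and the customers table are placeholders.

```php
<?php
// settings.php: keep the site's own 'default' connection and add a second key.
// The key name 'external' and every value below are placeholders.
$databases['external']['default'] = array(
  'driver' => 'mysql', // Or the driver name of an installed contributed driver.
  'database' => 'legacy_data',
  'username' => 'dbuser',
  'password' => 'secret',
  'host' => 'db.example.com',
);

// Module code: switch to the extra connection, query it, then switch back.
db_set_active('external');
$rows = db_query('SELECT id, name FROM {customers}')->fetchAll();
// Calling db_set_active() without arguments returns to the default connection.
db_set_active();
```

Switching back matters: while 'external' is active, every Drupal query, including core's own, goes to that database.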

Calendar with footer, using data from another content type.

mtg.nampa.net — Thu, 16 Feb 2012 01:13:54 +0000

Making a calendar of events. In the footer section, adding in a view of another content type which uses the 'Inherit contextual filters'.

I can 'Preview with contextual filters' and everything works great.
In the real window, the view in the footer never changes. It's like the contextual information never gets passed to the footer.

Any ideas would be greatly appreciated.

Thanks, MTG

Include date formatting and conversion routines into Database Abstraction Layer

d.novikov — Fri, 25 Nov 2011 15:37:48 +0000

In Drupal 7, because it supports different databases, modules that run queries containing date conversions and comparisons (Views, Date and others) run into bugs caused by differing SQL syntax (in particular, I see a number of problems with MSSQL). Date functions are also a part of SQL, and I think they should not be handled by modules. We need a kind of date abstraction layer within the database drivers, which would contain date formatting, conversion, and extraction routines. I don't know whether the current DB layer can be changed that much, but this idea could become a roadmap item for Drupal 8. Please share your thoughts.

In case it grows into something practical (hopefully :)), I created a sandbox project where you can post issues: http://drupal.org/sandbox/d.novikov/1762750
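To illustrate the kind of routine such a layer would hold, here is a small sketch that returns the SQL for "year of a Unix timestamp column" per driver. The function name is hypothetical and the driver list is an assumption ('sqlsrv' stands for a contributed SQL Server driver); time zone handling, which also differs between engines, is ignored.

```php
<?php
/**
 * Returns an SQL expression extracting the year from a Unix timestamp column.
 *
 * Hypothetical helper for illustration; not part of the Drupal database layer.
 */
function example_sql_year_from_timestamp($driver, $field) {
  switch ($driver) {
    case 'mysql':
      return "YEAR(FROM_UNIXTIME($field))";

    case 'pgsql':
      return "EXTRACT(YEAR FROM TO_TIMESTAMP($field))";

    case 'sqlite':
      // strftime() returns text, so cast to get a number like the others.
      return "CAST(strftime('%Y', $field, 'unixepoch') AS INTEGER)";

    case 'sqlsrv':
      return "YEAR(DATEADD(second, $field, '19700101'))";

    default:
      throw new InvalidArgumentException("No date mapping for driver: $driver");
  }
}
```

A module could then write $query->addExpression(example_sql_year_from_timestamp(db_driver(), 'n.created'), 'year') instead of hard-coding one dialect.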

Database development prior to MySQL

aaron1948 — Wed, 28 Sep 2011 07:20:21 +0000

I am seeking advice about what might be the best database program to use to create a large pre-MySQL database, from which I will select the content to export/import/enter into a Drupal MySQL database.

I am developing my first Drupal site -- a web portal where information will be organized by relevance to various generations. The site will launch with a Baby Boomer orientation, with information organized for various baby boomer niches, and then expand to serve more generational niches.

Starting from my pre-Internet youth, when my hobby was "collecting information," for several decades I have collected a colossal amount of information--and now must develop and enter it into a "master database" from which I will then select portions to serve as content for the website(s).

Because of the amount of information, I don't think it makes sense to initially enter all of it into the site-based Drupal MySQL database; the most efficient process seems to be to build a "master database," and then, from that, select the portions I will use for the site and export/import them into the Drupal site's MySQL database. I will use other portions of the database for other sites and purposes.

So . . . I am seeking advice about what might be the best database program to use to create the large pre-MySQL database.

I have used Access in the past, but wonder if using BibTeX or another open source program might make the database building and eventual exporting/importing easier. Because using Access would present me with the smallest learning curve, my instinct is to use it.

But I first want to check in with those whose database and Drupal knowledge is far greater than mine. (I am using a Windows-based PC for my database and local site development and testing framework; but, when developed, the site will be ported to a Unix-based host server.)

Thanks for any advice.

Aaron

How to use SELECT in a SELECT?

MrHaroldA — Tue, 16 Aug 2011 18:42:13 +0000

I have a tested and working query that I need to convert to the new Drupal 7 database layer, but I can't seem to find how to use a nested SELECT...

I need to join the MAX() value of last_totalcount to all rows in node_counter and this should be the way to do that:

SELECT nc.nid AS nid, nc.totalcount AS totalcount, last_totalcount AS last_totalcount

FROM node_counter nc, (
  SELECT max(last_totalcount) AS last_totalcount, nid 
  FROM node_counter_history GROUP BY nid
) AS tbl

WHERE nc.timestamp >= 1309471200 AND tbl.nid = nc.nid

D.o/Google only shows me how to use a subquery in condition(), but I don't need the subquery/subselect in the WHERE-clause, I need it nested in a SELECT.

What function can't I find?

<?php
  // Fetch statistics totalcounts
  $query = db_select('node_counter', 'nc')
    ->fields('nc', array('nid', 'totalcount'))
    ->condition('nc.timestamp', $from, '>=');

  // Add the latest total counter
  $subquery = db_select('node_counter_history', 'nch')
    ->fields('nch', array('nid'))
    ->addExpression('MAX(last_totalcount)', 'last_totalcount');

  $query->condition($subquery); // WRONG!!! @TODO: find function!!! ;)
?>
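One possible way to write this, assuming Drupal 7, where join() is documented to accept another select query object in place of a table name and then renders it as a subselect. Note also that addExpression() returns the generated alias rather than the query, so it cannot be the last call in a chain that is assigned to $subquery. The value of $from is taken from the SQL above.

```php
<?php
$from = 1309471200;

// Subquery: the highest last_totalcount per node.
// Build it step by step, because addExpression() returns the alias string.
$subquery = db_select('node_counter_history', 'nch');
$subquery->addField('nch', 'nid');
$subquery->addExpression('MAX(nch.last_totalcount)', 'last_totalcount');
$subquery->groupBy('nch.nid');

// Outer query: join the subquery as if it were a table aliased 'tbl'.
$query = db_select('node_counter', 'nc');
$query->join($subquery, 'tbl', 'tbl.nid = nc.nid');
$query->fields('nc', array('nid', 'totalcount'));
$query->addField('tbl', 'last_totalcount');
$query->condition('nc.timestamp', $from, '>=');

foreach ($query->execute() as $row) {
  // $row->nid, $row->totalcount and $row->last_totalcount are available here.
}
```

This needs a bootstrapped Drupal 7 site and the custom node_counter_history table, so treat it as an untested sketch; casting $query to a string is a quick way to check the SQL it generates.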

New to Drupal, just want to create and use a database (table) on various pages with a single page to edit data

mtg.nampa.net — Mon, 15 Aug 2011 23:11:12 +0000

I'm new to Drupal. Sorry, if I'm in the wrong group, but I'm looking for the best direction.

I want to create a database table, edit from a single list, but display results on various pages. I've done various database programming before. Drupal has so many modules to choose from, and many do similar things.

I'm just a bit overwhelmed by the variety of directions I could proceed. I'm looking for opinions about modules suited to my needs.

Thanks in advance, MTG

merge and select from, incomplete or incompetent?

mixel — Sat, 07 May 2011 14:29:13 +0000

While converting my exercises from Drupal 6 to 7 I've tried to stay loyal to the new DB-abstraction layer. One of the exercises has some advanced MySQL. In an attempt to make this query work, I've encountered several issues that make me question whether I've discovered some incompleteness in the DB-abstraction layer or whether I'm incompetent and do not understand the DB-abstraction layer.

Let me start with the query, which is used for data analysis between users. The query will populate a table based on how often a user has replied to another user. The key of user_interaction is (sid, rid), which allows me to write one query to populate the table:

INSERT INTO user_interaction (sid, rid, count)
SELECT s.uid as sid, r.uid as rid, 1 as count
FROM comments s
INNER JOIN comments r ON s.pid = r.cid
ON DUPLICATE KEY UPDATE count = count + 1

From the DB documentation I understand that the above code is not exactly according to the SQL standard. The "INSERT ... ON DUPLICATE KEY UPDATE" means we are combining an insert and an update query and so need to use db_merge. However, this query is also an "INSERT ... FROM", which is demonstrated here. How I think the query should look:

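  $query = db_select('comment', 's');
  // A reply's pid points at the cid of the comment it answers, as in the SQL above.
  $query->join('comment', 'r', 's.pid = r.cid');
  $query->addField('s', 'uid', 'sid');
  $query->addField('r', 'uid', 'rid');
  // A constant column needs addExpression(); addField() expects a table alias.
  $query->addExpression('1', 'count');
  $tmp = db_merge('user_interaction')
    ->from($query)
    ->expression('count', 'count + :inc', array(':inc' => 1))
    ->execute();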

Normally the db_merge needs a "key" and "fields", but we expect this should be replaceable by "from".
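For comparison, a sketch of the same job written with the documented key/fields form of db_merge, assuming Drupal 7 and the user_interaction table described above. It lets the database count the replies per (sid, rid) pair and then merges one row per pair; this is a workaround that costs one merge query per pair, not a replacement for a working from().

```php
<?php
// Count replies per (replying user, replied-to user) pair in one SELECT.
$select = db_select('comment', 's');
$select->join('comment', 'r', 's.pid = r.cid');
$select->addField('s', 'uid', 'sid');
$select->addField('r', 'uid', 'rid');
$select->addExpression('COUNT(*)', 'count');
$select->groupBy('s.uid');
$select->groupBy('r.uid');

foreach ($select->execute() as $row) {
  db_merge('user_interaction')
    ->key(array('sid' => $row->sid, 'rid' => $row->rid))
    // Used as-is when the row is inserted.
    ->fields(array('count' => $row->count))
    // Used instead when the row already exists: add to the stored total.
    ->expression('count', 'count + :inc', array(':inc' => $row->count))
    ->execute();
}
```

The other option is to send the MySQL-specific statement through db_query() unchanged, at the price of giving up portability.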

So time to dig into the DB abstraction layer to understand what is happening.

We see that both InsertQuery and MergeQuery inherit from Query, and of course the from method is located at "InsertQuery::from". That is logical, as from is only relevant for insert and should not be part of Query.

The evaluation of the "from" method can be found in "execute", where the actual object is asked to transform the query to a string:

if (!empty($this->fromQuery)) {
  $sql = (string) $this;
  // The SelectQuery may contain arguments, load and pass them through.
  return $this->connection->query($sql, $this->fromQuery->getArguments(), $this->queryOptions);
}

With Query being an abstract class and the string transformation being an abstract method (abstract public Query::__toString();), we see how (string) $this only becomes executable in the concrete InsertQuery (e.g. InsertQuery_mysql, InsertQuery_pgsql, ...).

So I got a warm and fuzzy feeling about the design and tried debugging my code by calling (string) $tmp, but MergeQuery does not implement the string transformation properly. It just does: public function __toString() { } ... bye-bye warmth, hello cold shower ...

MergeQuery does not seem to be implemented properly, but to suggest a solution, I need to have a look at some software architecture patterns in the DB abstraction layer. By putting some code behind db_insert together we get:

$class = $this->getDriverClass('InsertQuery', array('query.inc'));
  return new $class($this, $table, $options);
//and inside getDriverClass($class, $driver);
$this->driverClasses[$class] = $class . '_' . $driver;

As a small remark, I'm not sure why the class creation needs to be outside getDriverClass, as the separate method calls (insert, update, delete and merge) all create the class. More importantly, this is a creational pattern (I'm guessing the builder). The breaking of the "abstract class/method" structure seems a validation of the architectural pattern. Two possible changes could be made to MergeQuery:

1) If we do not need unique syntax for MergeQuery, we may use a structural pattern (something like the wrapper). This would make MergeQuery a simple object and have all default method calls go to InsertQuery, so that we only need to regulate the update case ... I'm not sure if this is possible
2) If we cannot wrap the object, then we need MergeQuery to follow the creational structure (e.g. MergeQuery_mysql, MergeQuery_pgsql, ...)
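Option 2 could look roughly like the sketch below. All class and function names are hypothetical stand-ins prefixed with Example, not Drupal's real classes; the lookup falls back to the generic class when no driver-specific one exists.

```php
<?php
// Hypothetical stand-ins for the abstraction layer's classes.
abstract class ExampleQuery {
  abstract public function __toString();
}

class ExampleMergeQuery extends ExampleQuery {
  protected $table;
  protected $fields;

  public function __construct($table, array $fields) {
    $this->table = $table;
    $this->fields = $fields;
  }

  // Generic fallback: no single-statement form, so nothing to print.
  public function __toString() {
    return '';
  }
}

// MySQL can express the merge as a single statement.
class ExampleMergeQuery_mysql extends ExampleMergeQuery {
  public function __toString() {
    $columns = array_keys($this->fields);
    $placeholders = array();
    $updates = array();
    foreach ($columns as $column) {
      $placeholders[] = ':' . $column;
      $updates[] = $column . ' = VALUES(' . $column . ')';
    }
    return 'INSERT INTO {' . $this->table . '} (' . implode(', ', $columns) . ')'
      . ' VALUES (' . implode(', ', $placeholders) . ')'
      . ' ON DUPLICATE KEY UPDATE ' . implode(', ', $updates);
  }
}

// Resolve "Class_driver" when it exists, otherwise fall back to "Class".
function example_driver_class($class, $driver) {
  $candidate = $class . '_' . $driver;
  return class_exists($candidate) ? $candidate : $class;
}

function example_merge($table, array $fields, $driver) {
  $class = example_driver_class('ExampleMergeQuery', $driver);
  return new $class($table, $fields);
}
```

With this shape, (string) example_merge('user_interaction', array('sid' => 1, 'rid' => 2), 'mysql') yields a real statement, while drivers without their own class keep the generic behaviour.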

Considering that I'm bringing this problem up, I'm more than happy to try and solve it. I would actually prefer to solve it, as this would be a good exercise for me to contribute code. Still, I could use some support from experienced DB abstraction layer architects. It is also possible that I misunderstood the whole problem; in that case I hope you will educate me.

Slave support in D7

ctoomey — Sat, 12 Mar 2011 20:56:58 +0000

I'm an architect working on a project to rebuild our current consumer site, possibly using Drupal, and am new to Drupal and learning as much as I can as fast as I can. I see that one of the things that's been added in D7 is support for master/slave DB configurations, so I wanted to get more info. about that.

I've found in the DB documentation that 1) slave DBs can be specified in the DB configuration (http://drupal.org/node/310071), and 2) that module writers can pass an optional 3rd argument to db_query() to mark queries that are safe to send to slaves (http://drupal.org/node/310072). Were there any other slave-related changes made in D7 and if so, what were they?
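For reference, the two pieces look roughly like this in Drupal 7. Host names and credentials are placeholders; as documented, a query targeted at 'slave' falls back to the default connection when no slave is configured.

```php
<?php
// settings.php: one master ('default' target) plus any number of slaves.
$databases['default']['default'] = array(
  'driver' => 'mysql',
  'database' => 'drupal',
  'username' => 'dbuser',
  'password' => 'secret',
  'host' => 'master.example.com',
);
// Each appended entry is one slave server.
$databases['default']['slave'][] = array(
  'driver' => 'mysql',
  'database' => 'drupal',
  'username' => 'dbuser',
  'password' => 'secret',
  'host' => 'slave1.example.com',
);

// Module code: the third argument of db_query() marks a slave-safe query.
$result = db_query(
  'SELECT nid, title FROM {node} WHERE status = :status',
  array(':status' => 1),
  array('target' => 'slave')
);

// The query builder takes the same option as its third argument.
$query = db_select('node', 'n', array('target' => 'slave'));
$query->fields('n', array('nid', 'title'));
$nodes = $query->execute()->fetchAll();
```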

Since 2) requires code changes to modules, I'm curious how much progress has been made in updating code to mark slave-safe queries. Can anyone speak to the extent to which that's been done in core modules and contrib modules?

And can anyone post sample stats from slave-using installations as to what percentage of queries are seen going to slaves?

thanks,
Chris