Comparison of Automatic or Suggested Tagging Tools (OpenCalais, Yahoo Term Extractor, Autotagging, Extractor, etc.)

You are viewing a wiki page. You are welcome to join the group and then edit it. Be bold!

There are quite a few modules that make it easier to tag the content on a site.

Some of these are completely internal, some rely on free (or "freemium") third party services, some rely on fee-based third party services. Some provide UI enhancements and some are transparent to the end user.

Name | Releases | Description | Usage (Aug 1st)
OpenCalais | 6.x stable, 5.x stable, 7.x alpha | Uses the free but rate-limited third-party Calais term extractor to automatically tag content. | 4,739
Yahoo Terms | 6.x stable, 7.x dev | Uses the free but rate-limited Yahoo term extraction service to automatically tag content. | 46
Inform | 6.x dev | Uses the commercial Inform service to automatically tag content. | 0
Amplify | 6.x stable | Uses the commercial Amplify service to analyze content (doesn't store the results as terms yet). Developer use is free; production use is not. | 63
Auto Tagging | 6.x beta, 7.x dev | A generalized system to leverage multiple backends for automatic tagging. Currently supports OpenCalais, Yahoo! term extraction, and TagThe.net. | 131
Tagging | 6.2.4 stable | Advanced UI, plus an API to implement suggestions (OpenCalais, Alchemy, or Extractor for semantic content suggestions already implemented). | 905
Extractor | 6.x alpha | Uses a vocabulary to look up terms in any given text. Integrates with Feeds, where it's used to extract terms from imported content. Node independent. | 808
Taxonomy Autotagger | 6.x stable | A UI improvement which also suggests terms based on the content of the node and an existing vocabulary. | 219
Autocategorise | 6.x stable, 7.x dev | Automatically tags based on a defined vocabulary and the content of the title/body of a node. | 92
HILCC Taxonomy Autotag | 6.x dev | Auto-generates HILCC (Hierarchical Interface to the Library of Congress Classification) taxonomy terms for nodes that have LCC call numbers (library-assigned numbers like "QA 76.73 .P224 2005"). | 12
Keyword Analysis | 6.x | Performs both Yahoo! term extraction and frequency-based analysis. | 75
Alchemy | 6.x | Integrates with the free (limited to 30,000 calls/day) Alchemy service for extraction of keywords, entities and concepts. Integrates with the Content Analysis, Tagging and Auto Tagging modules. | 75
Suggested Terms | 6.x stable, 5.x stable | Provides "suggested terms" for free-tagging Taxonomy fields based on terms already submitted. Adds suggested terms that can be clicked to add to the default autocomplete field. | 1,005
Active Tags | 6.x stable, 7.x dev | While the primary module provides a widget for adding tags to a freetagging vocabulary, a sub-module adds suggested terms based on the most-used existing terms in the given vocabulary. | 2,958
Rules Autotag | 7.x stable | Takes a lightweight approach to autotagging full-text content by matching taxonomy terms (and optional synonyms). | 105
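
As a rough illustration of the vocabulary-based approach that several entries above describe (matching existing taxonomy terms, and optionally their synonyms, against the text of a node), here is a minimal Python sketch. It is not the code of any listed module; the function name `match_terms`, the dictionary layout of the vocabulary, and the sample data are all assumptions made for the example.

```python
import re


def match_terms(text, vocabulary):
    """Return the vocabulary terms whose name or synonym occurs in the text.

    `vocabulary` maps a term name to a list of synonyms. This layout is a
    hypothetical stand-in for a real taxonomy vocabulary.
    """
    lowered = text.lower()
    matched = []
    for term, synonyms in vocabulary.items():
        for candidate in [term, *synonyms]:
            # Whole-word, case-insensitive match, so "art" does not hit "party".
            pattern = r"\b" + re.escape(candidate.lower()) + r"\b"
            if re.search(pattern, lowered):
                matched.append(term)
                break  # one hit is enough to apply the term
    return matched


# Made-up vocabulary and node body, for illustration only.
vocabulary = {
    "Neuroscience": ["neurotransmitter", "neurotransmitters"],
    "Drupal": ["drupal cms"],
    "Art": [],
}
body = "A party of researchers studied neurotransmitters."
tags = match_terms(body, vocabulary)
```

A real module would also need to handle stemming, multiple languages, and HTML in the body, which this sketch deliberately ignores.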

Please update descriptions and/or add more modules as you see fit.


I'm running OpenCalais on my

JasPar - Fri, 2009-12-04 17:06

I'm running OpenCalais on my site. If anyone is using any of the other options listed, I think it may be insightful if we could coordinate some standard articles to run through each respective tool on our sites and share the output to get an apples to apples comparison of their performance. I'd be willing to help.
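
One possible way to make such an apples-to-apples comparison concrete is to run the same article through each tool and measure how much the resulting tag sets overlap. The Python sketch below uses Jaccard similarity for that; the choice of metric, the function names, and the sample tags are assumptions for illustration, not output from any of the services.

```python
def normalize(tags):
    """Lower-case and trim tags so trivial differences do not count."""
    return {tag.strip().lower() for tag in tags if tag.strip()}


def jaccard(tags_a, tags_b):
    """Size of the intersection divided by the size of the union."""
    a, b = normalize(tags_a), normalize(tags_b)
    if not a and not b:
        return 1.0  # two empty tag sets are treated as identical
    return len(a & b) / len(a | b)


def compare_tools(results):
    """Map each pair of tool names to the overlap of their tag sets.

    `results` maps a tool name to the tags it produced for one article.
    """
    names = sorted(results)
    return {
        (first, second): jaccard(results[first], results[second])
        for index, first in enumerate(names)
        for second in names[index + 1:]
    }


# Made-up tags for a single shared test article.
results = {
    "OpenCalais": ["Drupal", "Semantic Web", "Tagging"],
    "Yahoo Terms": ["drupal", "tagging", "modules"],
}
overlap = compare_tools(results)
```

Overlap alone says nothing about which tool is more relevant, so human review of the differing tags would still be needed.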


Clarification of OpenCalais limitations

febbraro - Fri, 2009-12-04 20:51

Thanks for pulling this list together. It is very helpful to have many of these in the same place so people can make evaluation decisions. Now, before I make any comments I must clarify that I'm the maintainer of the OpenCalais modules for Drupal. :)

The OpenCalais semantic tagging for Drupal is in no way limited in functionality. You get full Entity, Event and Fact recognition, Document Categorization and (soon) Social Tags, as well as Linked Data goodies. It is a very compelling and easy foray into the Semantic Web for the beginner as well as the seasoned vet. The only way in which you are limited is the number of transactions in any given 24 hour period. You are limited to 40,000 transactions in that period (one Node save for an enabled content type is one transaction) so for most folks that limitation will never come into play.

Just wanted to make the clarification, looking forward to the discussion.


re: Clarification of OpenCalais limitations

mcantelon - Sat, 2009-12-05 01:22

I agree that calling OpenCalais "limited" without qualifying it may be misleading.

The great thing about OpenCalais data is the semantic relevance. We use OpenCalais for relating content and it's much more useful for this than Yahoo Term Extractor, which doesn't care about semantic relevance.


be bold ;)

greggles - Sat, 2009-12-05 19:42

This is a wiki, please "be bold" and edit to use better language.


One minor correction

fran.sansalone - Mon, 2009-12-07 15:20

Frank,
Thanks for the comments and clarification. One small correction: We now allow users 50,000 transactions per day.
Cheers,
Fran
(OpenCalais Community Manager)


Hi Fran, can I ask if there

valthebald - Mon, 2009-12-07 17:16

Hi Fran,
can I ask if there are any plans to support Russian language in Calais?
Thank you.


Russian language support

fran.sansalone - Mon, 2009-12-07 17:52

Hi Val,
We're on the brink of Release 4.3, which will include some additional features and capabilities as well as bug fixes, but support for additional languages is not on the calendar right now. Sorry to disappoint.
When we do release any new capabilities, we'll communicate via the website forums, blogs, and news.
Regards,

Fran


OpenCalais Limits

alexmc - Sat, 2010-11-27 10:57

Current limits for OpenCalais seem to be 50,000 requests per 24-hour period, and no more than 4 per second.
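
A client that wants to stay inside limits like these could track its own request timestamps before calling the service. The Python sketch below is one way to do that with a sliding window; the class name `RequestBudget` and its methods are hypothetical and unrelated to the OpenCalais module, and the default numbers are just the figures quoted above.

```python
import time
from collections import deque


class RequestBudget:
    """Sliding-window check for a per-second and a per-24-hour request limit."""

    def __init__(self, per_second=4, per_day=50_000, clock=time.monotonic):
        self.per_second = per_second
        self.per_day = per_day
        self.clock = clock    # injectable so the logic can be tested
        self.sent = deque()   # timestamps of requests in the last 24 hours

    def try_acquire(self):
        """Record a request and return True if both limits allow it."""
        now = self.clock()
        # Forget requests that are 24 hours old or older.
        while self.sent and now - self.sent[0] >= 86_400:
            self.sent.popleft()
        if len(self.sent) >= self.per_day:
            return False
        # Count requests made within the last second (newest are on the right).
        recent = 0
        for stamp in reversed(self.sent):
            if now - stamp >= 1.0:
                break
            recent += 1
        if recent >= self.per_second:
            return False
        self.sent.append(now)
        return True


# Usage with a fake clock: the fifth call in the same second is refused.
fake_now = [0.0]
budget = RequestBudget(clock=lambda: fake_now[0])
burst = [budget.try_acquire() for _ in range(5)]
```

When `try_acquire()` returns False, the caller could queue the node for later processing instead of dropping it.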


Vocabulary limitations

crose - Tue, 2011-05-03 19:57

I don't see any way to use OpenCalais with any vocabularies other than the ones provided by OpenCalais. So it is 'rich' in vocabularies that are totally irrelevant to my needs. In that respect I would call it limited. I use it to tag an article about Neurotransmitters and 'Industry entertainment event' doesn't come up with anything at all!! If I can't direct its usage to a targeted focus on my needs then it is going to hit limitations as soon as it gets beyond square one.


HILCC Autotag module?

janusman - Fri, 2010-01-08 14:48

This module scans nodes for LCC (Library of Congress Classification) and assigns a term from the HILCC categorization (which is created as a taxonomy vocabulary on install). Does it qualify for the list?

[EDIT: Forgot the URL... http://drupal.org/project/hilcc ]


Be Bold =)

janusman - Fri, 2010-01-08 14:53

Never mind, I was bold enough to add that to the list myself =)


Updated Tagging

EugenMayer - Fri, 2010-01-22 12:46

Updated Tagging (added OpenCalais support)


No Arabic language support

Remon - Sat, 2010-01-30 11:17

Unfortunately, none of them support the Arabic language.


@rgeorgy

EugenMayer - Sat, 2010-01-30 11:44

What is missing for Arabic support? If it's the translation, I would invite you to help translate those 30 strings (in ~5-10 minutes).
If you have other problems, just open an issue at http://drupal.org/project/tagging and I will try to help you out.


Concerned about the module called 'autotag'

siliconmeadow - Fri, 2010-03-12 10:25

I am working on implementing autotagging for our editors and journalists, and we run our own taxonomy (won't be using a 3rd party service at the moment).

I've been trying to get the autotag module to work, and failing. Looking at the issue queue doesn't fill me with confidence either. From what I can tell, Extractor is the way of the future, but now I'm struggling to figure out how to make Feeds and Extractor work together to tag our content on the fly at creation time.

The autotag module's intention would appear to be what we need:

  • Content creators can put their data in without having to concern themselves with how to tag it;
  • After saving, it would appear that the content creator/editor can go back to a handy jQuery-esque widget to add/remove tags (see screenshot below - I've not been able to get this kind of display in the edit view of nodes, btw).

Autocategorise looked like it could be useful, but it has a definite end-of-life statement: it won't be ported to D7.

Any suggestions?


Autotagging with Tagging Module

EugenMayer - Fri, 2010-03-12 11:27

Hello, well, you can use the Tagging module. It's more than a replacement for the Drupal core "free tagging taxonomy". It also has an API for "suggestions", which can be used in various ways.

It already works with Extractor, OpenCalais, or just a static taxonomy, as you wish (all three are submodules which you can enable).

In the case of Extractor, it will suggest terms based on the content of the current node, which the editor can then apply in a nice GUI. You can also automate this process if you like, while still letting the author influence the process and customize some tags.

There is a screencast of tagging, you might want to have a look.


Tagging module

siliconmeadow - Mon, 2010-03-22 15:26

Thanks, Eugen! Tagging and Extractor seem to do about 90% of what we need to do.


Tagging Module screencast

rj.seward - Fri, 2010-03-12 15:19

In case anyone wants to see the aforementioned screencast: http://www.youtube.com/watch?v=XyvE6Lwsfxs


Add keywords to nodes previously created without any keywords?

wwwoliondorcom - Wed, 2010-09-22 05:56

Hi,

Do you know which module is best for adding keywords to nodes that were previously created without any keywords?

Thanks for your help.


For a broad group of users or just the node author?

greggles - Wed, 2010-09-22 12:54

http://drupal.org/project/community_tags does it for a broad group of users (whoever has the permission to use it).

Works pretty well...


Related presentation at Drupalcon CPH

jhedstrom - Wed, 2010-09-22 15:45

There was a presentation by johsw at Drupalcon CPH about a home-rolled automated tagging/semantic taxonomy solution. I think he is planning on releasing some of the modules that they've developed along the way.

http://www.archive.org/details/SemanticTaxonomies

http://cph2010.drupal.org/sessions/semantic-taxonomies


Proposed API spec

nedjo - Mon, 2010-10-18 17:10

Posted a draft API spec to enable interoperability among tag suggestion and autotagging modules. Please review, improve, and comment!
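
To give a feel for what interoperability among suggestion backends might look like, here is a small Python sketch in which every backend exposes the same `suggest()` method and the results are merged by vote. This is not the draft spec itself; the `TagSuggester` interface, the two toy backends, and `merge_suggestions` are all hypothetical names invented for the example.

```python
from collections import Counter
from typing import Protocol


class TagSuggester(Protocol):
    """Hypothetical common interface that every backend would implement."""

    name: str

    def suggest(self, text: str) -> list[str]: ...


class VocabularySuggester:
    """Toy backend: suggests known terms that occur in the text."""

    name = "vocabulary"

    def __init__(self, terms):
        self.terms = list(terms)

    def suggest(self, text):
        lowered = text.lower()
        return [term for term in self.terms if term.lower() in lowered]


class FrequencySuggester:
    """Toy backend: suggests the most frequent longer words."""

    name = "frequency"

    def __init__(self, min_length=5, limit=2):
        self.min_length = min_length
        self.limit = limit

    def suggest(self, text):
        words = [word.strip(".,;:!?").lower() for word in text.split()]
        counts = Counter(w for w in words if len(w) >= self.min_length)
        return [word for word, _ in counts.most_common(self.limit)]


def merge_suggestions(backends, text):
    """Return (tag, votes) pairs, most-agreed-upon tags first."""
    votes = Counter()
    for backend in backends:
        # Each backend contributes at most one vote per tag.
        for tag in {tag.lower() for tag in backend.suggest(text)}:
            votes[tag] += 1
    return sorted(votes.items(), key=lambda item: (-item[1], item[0]))


text = "Taxonomy modules make tagging easier. Tagging with a taxonomy helps editors."
backends = [VocabularySuggester(["Taxonomy", "Drupal"]), FrequencySuggester()]
merged = merge_suggestions(backends, text)
```

With a shared interface like this, a UI module would only need to call `merge_suggestions()` and would not have to know which services are enabled.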


Human choice still critical

shareumentarian - Sun, 2011-09-25 20:45

These auto-tagging tools are great for generating semantic content but I have yet to see a mashup of all of them that then allows for collaborative editing of the content that results in machine learning. Still hoping