Comparison of Automatic or Suggested Tagging Tools (OpenCalais, Yahoo Term Extractor, Autotagging, Extractor, etc.)

You are viewing a wiki page. You are welcome to join the group and then edit it. Be bold!

Quite a few modules make it easier to tag the content on a site.

Some of these are completely internal, some rely on free (or "freemium") third-party services, and some rely on fee-based third-party services. Some provide UI enhancements and some are transparent to the end user.

| Name | Releases | Description | Usage (Jan 2018) |
| --- | --- | --- | --- |
| OpenCalais | 7.x, 6.x, 5.x | Uses the free but rate-limited third-party Calais term extractor to automatically tag content. | 216 |
| Yahoo Terms | 7.x dev, 6.x | Uses the free but rate-limited Yahoo term extraction service to automatically tag content. | 2 |
| Inform | 6.x | No further development. Uses the commercial Inform service to automatically tag content. | 0 |
| Amplify | 7.x dev, 6.x | Obsolete and unsupported. Uses the commercial Amplify service to analyze content (doesn't store them as terms yet; developer use is free, production use is not). | 4 |
| Auto Tagging | 7.x dev, 6.x beta | A generalized system to leverage multiple backends for automatic tagging. Currently supports OpenCalais, Yahoo! term extraction, and TagThe.net. | 70 |
| Tagging | 7.x, 6.x | Unsupported, no further development. Advanced UI and an API to implement suggestions (OpenCalais, Alchemy, or Extractor for semantic content suggestions already implemented). | 234 |
| Extractor | 6.x alpha | Uses a vocabulary to look up terms in any given text. Integrates with Feeds, where it's used to extract terms from imported content. Node independent. | 102 |
| Taxonomy Autotagger | 8.x dev, 7.x, 6.x | A UI improvement which also suggests terms based on the content of the node and an existing vocabulary. The 7.x branch lacks documentation and was released directly as a 3.0 "stable branch" without any -dev branch. | 1,182 |
| Autocategorise | 7.x beta, 6.x | Maintenance fixes only. Automatically tags based on a defined vocabulary and the content of the title/body of a node. | 22 |
| HILCC Taxonomy Autotag | 6.x dev, 5.x dev | Auto-generates HILCC (Hierarchical Interface to the Library of Congress Classification) taxonomy terms for nodes that have LCC call numbers (library-assigned numbers like "QA 76.73 .P224 2005"). | 1 |
| Keyword Analysis | 7.x beta, 6.x | Does both Yahoo! term extraction and frequency-based analysis. | 427 |
| Alchemy | 7.x beta, 6.x rc | Integrates with the free (limited to 30,000 calls/day) Alchemy service for extraction of keywords, entities and concepts. Integrates with the Content Analysis, Tagging and Auto Tagging modules. | 5,577 |
| Suggested Terms | 8.x dev, 7.x, 6.x, 5.x | Provides "suggested terms" for free-tagging Taxonomy fields based on terms already submitted. Adds suggested terms that can be clicked to add to the default autocomplete field. | 338 |
| Active Tags | 7.x alpha, 6.x | No further development. While the primary module provides a widget for adding tags to a freetagging vocabulary, a sub-module adds suggested terms based on the most-used existing terms in the given vocabulary. | 2,314 |
| Rules Autotag | 7.x | Takes a lightweight approach to autotagging full-text content by matching taxonomy terms (and optional synonyms). | 489 |
| Zemanta | 7.x, 6.x, 5.x | Maintenance fixes only. As you write your content, Zemanta will analyze the text and recommend related content that you can insert into your post. Easily insert recommended images, articles, links and tags that will help your post stand out! | 30 |
| Taxonomy Freetag Detection | 7.x alpha | Automatically populates your taxonomy term freetag field based on terms that exist in another field (such as body) and terms that exist in the vocabulary of that field. | 44 |
| MLTag | 7.x | Provides tag suggestions by reading node content using text analytics and machine learning algorithms which also take into account a user's previous posts and tagging patterns. An ML model trains in the background and keeps on learning. | 17 |
| PowerTagging | 8.x alpha, 7.x | Analyzes content from Drupal nodes, including file attachments, and annotates it automatically with concepts from a thesaurus or taxonomy by using all their names (incl. synonyms). | 53 |

Please update descriptions and/or add more modules as you see fit.

Comments

JasPar's picture

I'm running OpenCalais on my site. If anyone is using any of the other options listed, I think it may be insightful if we could coordinate some standard articles to run through each respective tool on our sites and share the output to get an apples to apples comparison of their performance. I'd be willing to help.
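
One way such an apples-to-apples comparison could be scored, as a rough sketch: run the same article through each tool, collect the tags, and compute the pairwise overlap of the tag sets. The tool names and tags below are made-up placeholders, not real output from any of the services, and Jaccard similarity is just one possible measure.

```python
from itertools import combinations


def normalize(tags):
    """Lowercase and strip tags so trivial differences do not count."""
    return {t.strip().lower() for t in tags if t.strip()}


def jaccard(a, b):
    """Jaccard similarity of two tag lists: |A & B| / |A | B|."""
    a, b = normalize(a), normalize(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def compare_tools(results):
    """Return pairwise Jaccard scores for a {tool_name: tags} mapping."""
    return {
        (x, y): jaccard(results[x], results[y])
        for x, y in combinations(sorted(results), 2)
    }


# Hypothetical output for one shared test article (not real tool output).
sample = {
    "opencalais": ["Drupal", "Open Source", "Belgium"],
    "yahoo_terms": ["drupal", "open source", "cms", "modules"],
}
scores = compare_tools(sample)
```

A score near 1 means two tools mostly agree on an article; a score near 0 means they tag it very differently. It says nothing about which tool is more relevant, so a human review of the tags would still be needed.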

Clarification of OpenCalais limitations

febbraro's picture

Thanks for pulling this list together. It is very helpful to have many of these in the same place so people can make evaluation decisions. Now, before I make any comments I must clarify that I'm the maintainer of the OpenCalais modules for Drupal. :)

The OpenCalais semantic tagging for Drupal is in no way limited in functionality. You get full Entity, Event and Fact recognition, Document Categorization and (soon) Social Tags, as well as Linked Data goodies. It is a very compelling and easy foray into the Semantic Web for the beginner as well as the seasoned vet. The only way in which you are limited is the number of transactions in any given 24 hour period. You are limited to 40,000 transactions in that period (one Node save for an enabled content type is one transaction) so for most folks that limitation will never come into play.

Just wanted to make the clarification, looking forward to the discussion.

re: Clarification of OpenCalais limitations

mcantelon's picture

I agree that calling OpenCalais "limited" without qualifying it may be misleading.

The great thing about OpenCalais data is the semantic relevance. We use OpenCalais for relating content and it's much more useful for this than Yahoo Term Extractor, which doesn't care about semantic relevance.

be bold ;)

greggles's picture

This is a wiki, please "be bold" and edit to use better language.

One minor correction

fran.sansalone's picture

Frank,
Thanks for the comments and clarification. One small correction: We now allow users 50,000 transactions per day.
Cheers,
Fran
(OpenCalais Community Manager)

valthebald's picture

Hi Fran,
can I ask if there are any plans to support Russian language in Calais?
Thank you.

Russian language support

fran.sansalone's picture

Hi Val,
We're on the brink of Release 4.3, which will include some additional features and capabilities as well as bug fixes, but support for additional languages is not on the calendar right now. Sorry to disappoint.
When we do release any new capabilities, we'll communicate via the website forums, blogs, and news.
Regards,

Fran

OpenCalais Limits

alexmc's picture

Current limits for OpenCalais seem to be 50,000 requests per 24-hour period, and no more than 4 per second.
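
For anyone batch-tagging old content, a client-side guard for these two limits might look like the sketch below. It assumes both limits are sliding windows (the service may well count per calendar day instead), and the class and method names are invented for illustration; this is not part of the OpenCalais module.

```python
from collections import deque
import time


class DualRateLimiter:
    """Sliding-window limiter enforcing a per-second and a per-day cap."""

    def __init__(self, per_second=4, per_day=50_000, clock=time.monotonic):
        self.per_second = per_second
        self.per_day = per_day
        self.clock = clock  # injectable so the limiter can be tested
        self.sent = deque()  # timestamps of requests in the last 24 hours

    def try_acquire(self):
        """Record a request and return True if both limits allow it."""
        now = self.clock()
        # Drop timestamps older than 24 hours.
        while self.sent and now - self.sent[0] >= 86_400:
            self.sent.popleft()
        if len(self.sent) >= self.per_day:
            return False
        # Count requests made within the last second, newest first.
        recent = 0
        for ts in reversed(self.sent):
            if now - ts >= 1.0:
                break
            recent += 1
        if recent >= self.per_second:
            return False
        self.sent.append(now)
        return True
```

A caller would check `try_acquire()` before each request and sleep briefly or queue the node for later when it returns False.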

Vocabulary limitations

crose's picture

I don't see any way to use OpenCalais with any vocabularies other than the ones provided by OpenCalais. So it is 'rich' in vocabularies that are totally irrelevant to my needs. In that respect I would call it limited. I use it to tag an article about Neurotransmitters and 'Industry entertainment event' doesn't come up with anything at all!! If I can't direct its usage to a targeted focus on my needs then it is going to hit limitations as soon as it gets beyond square one.

HILCC Autotag module?

janusman's picture

This module scans nodes for LCC (Library of Congress Classification) and assigns a term from the HILCC categorization (which is created as a taxonomy vocabulary on install). Does it qualify for the list?

[EDIT: Forgot the URL... http://drupal.org/project/hilcc ]

Be Bold =)

janusman's picture

Never mind, I was bold enough to add that to the list myself =)

Updated Tagging

EugenMayer's picture

Updated Tagging (added OpenCalais support)

No Arabic language support

Remon's picture

Unfortunately, none of them supports the Arabic language.

@rgeorgy

EugenMayer's picture

What is missing for Arabic support? If it's the translation, I would invite you to help translate those 30 strings (in ~5-10 minutes).
If you have other problems, just open an issue at http://drupal.org/project/tagging and I will try to help you out.

Concerned about the module called 'autotag'

siliconmeadow's picture

I am working on implementing autotagging for our editors and journalists, and we run our own taxonomy (won't be using a 3rd party service at the moment).

I've been trying to get the autotag module to work, and failing. Looking at the issue queue doesn't fill me with confidence either. From what I can tell, Extractor is the way of the future, but now I'm struggling to figure out how to make Feeds and Extractor work together to tag our content on the fly at creation time.

The autotag module's intention would appear to be what we need:

  • Content creators can put their data in without having to concern themselves with how to tag it;
  • After saving, it would appear that the content creator/editor can go back to a handy jQuery-esque widget to add/remove tags (see screenshot below - I've not been able to get this kind of display in the edit view of nodes, btw).

[Screenshot missing: only local images are allowed.]

Autocategorise looked like it could be useful, but it's got a definite end-of-life statement - it won't be ported to D7.

Any suggestions?

Autotagging with Tagging Module

EugenMayer's picture

Hello, well, you can use the Tagging module. It's more than a replacement for the Drupal core "free tagging taxonomy". It also has an API for "suggestions", which can be used in various ways.

So far it works with Extractor, OpenCalais, or just a static taxonomy, as you wish (all 3 are submodules which you can enable).

In the case of Extractor, it will suggest terms based on the content of the current node, which the editor can then apply in a nice GUI. You can also automate this process if you like, while still letting the author influence the process and customize some tags.
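
To make the idea of vocabulary-based suggestions concrete, here is a minimal sketch that looks up the terms of a fixed vocabulary in a node body and ranks them by how often they occur. It is only an illustration in Python, not the Extractor or Tagging code (those are Drupal/PHP modules); the function name and sample data are made up.

```python
import re


def suggest_terms(text, vocabulary):
    """Return vocabulary terms found in text as whole words, ranked by count."""
    lowered = text.lower()
    counts = {}
    for term in vocabulary:
        # \b anchors keep "art" from matching inside "article".
        pattern = r"\b" + re.escape(term.lower()) + r"\b"
        hits = len(re.findall(pattern, lowered))
        if hits:
            counts[term] = hits
    # Most frequent first; ties broken alphabetically for stable output.
    return sorted(counts, key=lambda t: (-counts[t], t.lower()))


vocabulary = ["Drupal", "taxonomy", "art", "semantic web"]
body = "Drupal taxonomy terms power this article. Drupal also supports the Semantic Web."
suggestions = suggest_terms(body, vocabulary)
```

In a suggestion workflow the ranked list would be shown to the editor; in a fully automatic workflow the top terms would be applied on save.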

There is a screencast of tagging, you might want to have a look.

Tagging module

siliconmeadow's picture

Thanks, Eugen! Tagging and Extractor seem to do about 90% of what we need to do.

Tagging Module screencast

rj.seward's picture

In case anyone wants to see the aforementioned screencast: http://www.youtube.com/watch?v=XyvE6Lwsfxs

wwwoliondorcom's picture

Hi,

Do you know which module is best for adding keywords to nodes that were previously created without any?

Thanks for your help.

For a broad group of users or just the node author?

greggles's picture

http://drupal.org/project/community_tags does it for a broad group of users (whoever has the permission to use it).

Works pretty well...

Related presentation at Drupalcon CPH

jhedstrom's picture

There was a presentation by johsw at Drupalcon CPH about a home-rolled automated tagging/semantic taxonomy solution. I think he is planning on releasing some of the modules that they've developed along the way.

http://www.archive.org/details/SemanticTaxonomies

http://cph2010.drupal.org/sessions/semantic-taxonomies

Proposed API spec

nedjo's picture

Posted a draft API spec to enable interoperability among tag suggestion and autotagging modules. Please review, improve, and comment!

Human choice still critical

shareumentarian's picture

These auto-tagging tools are great for generating semantic content, but I have yet to see a mashup of all of them that then allows for collaborative editing of the content that results in machine learning. Still hoping.

dgtlmoon's picture

yeah, you AND Google :D

dgtlmoon's picture

tried to update but the format seems to be broken, ideas?

Just needed some whitespace

greggles's picture

Just needed some whitespace at the bottom of the table before the next non-table-formatted text. http://groups.drupal.org/node/38290/revisions/view/553043/553133

dgtlmoon's picture

Thanks a bunch!

Suggestion hook for Active Tags

bgilhome's picture

I've submitted a patch for Active Tags to implement a suggestions UI/hook here: https://drupal.org/node/840966#comment-7681001

Hooks can be defined by other modules to return an array of suggested tag names with/without a score. I hope people find it useful!
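
The general shape of such a suggestion hook, sketched in Python purely for illustration: each provider returns a list of tag names with an optional score, and a collector merges them. This is not the signature used by the Active Tags patch (which is a Drupal hook in PHP); the registry, function names and default score are all assumptions.

```python
from typing import Callable, Dict, List, Optional, Tuple

Suggestion = Tuple[str, Optional[float]]
Provider = Callable[[str], List[Suggestion]]

_providers: List[Provider] = []


def register_provider(provider: Provider) -> Provider:
    """Register a suggestion provider; usable as a decorator."""
    _providers.append(provider)
    return provider


def collect_suggestions(text: str, default_score: float = 1.0) -> List[Tuple[str, float]]:
    """Call every provider and merge results, summing scores per tag."""
    totals: Dict[str, float] = {}
    for provider in _providers:
        for name, score in provider(text):
            key = name.strip().lower()
            # Unscored suggestions fall back to the default score.
            totals[key] = totals.get(key, 0.0) + (default_score if score is None else score)
    return sorted(totals.items(), key=lambda item: (-item[1], item[0]))


@register_provider
def most_used_terms(text):
    # Hypothetical provider returning unscored tag names.
    return [("drupal", None), ("tagging", None)]


@register_provider
def scored_terms(text):
    # Hypothetical provider returning scored tag names.
    return [("Drupal", 2.5), ("semantic web", 0.5)]


merged = collect_suggestions("any node body")
```

Summing scores across providers is one possible merge rule; taking the maximum or weighting each provider differently would be equally valid choices.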

Calais Major update, d6 completely non-functional

AntiNSA's picture

I am utterly speechless... I have worked a long time and really appreciate Calais and what it did for my site... now I am left without legs to stand on :(

hmmm...

MM10's picture

Nice table.

It seems like there was a window of time when a variety of these modules were actively being developed and maintained.

Are there good D8 versions out there? What is the recommended autotagging/suggested tagging module? Why does there appear to be less activity in this space (i.e., over the last 5 years)?
