Posted by janusman on December 17, 2010 at 4:52pm
http://drupal.org/project/feeds_oai_pmh
Yes, yet another OAI-PMH-related module =) It fetches and parses oai_dc (Dublin Core) metadata records from OAI-PMH services, as defined by http://www.openarchives.org/. It's built as an add-on to the Feeds module in order to inherit all of its awesomeness and keep the codebase simple.
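For anyone who hasn't looked inside an OAI-PMH response before, here is a rough sketch (in Python, not the module's actual PHP code) of what "parsing an oai_dc record" boils down to. The sample record and the function name parse_oai_dc_record are made up for illustration; only the two XML namespaces come from the OAI-PMH and Dublin Core specs.

```python
import xml.etree.ElementTree as ET

OAI_NS = "http://www.openarchives.org/OAI/2.0/"
DC_NS = "http://purl.org/dc/elements/1.1/"

# Hypothetical record, shaped like one <record> element of a ListRecords response.
SAMPLE = """<record xmlns="http://www.openarchives.org/OAI/2.0/">
  <header>
    <identifier>oai:example.org:1</identifier>
    <datestamp>2010-12-17</datestamp>
  </header>
  <metadata>
    <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
               xmlns:dc="http://purl.org/dc/elements/1.1/">
      <dc:title>Sample record</dc:title>
      <dc:creator>Doe, Jane</dc:creator>
      <dc:subject>Libraries</dc:subject>
      <dc:subject>Metadata</dc:subject>
    </oai_dc:dc>
  </metadata>
</record>"""


def parse_oai_dc_record(xml_text):
    """Return (identifier, fields); fields maps each DC element name to a list of values."""
    record = ET.fromstring(xml_text)
    identifier = record.findtext(f"{{{OAI_NS}}}header/{{{OAI_NS}}}identifier")
    fields = {}
    for el in record.iter():
        # Keep only elements in the Dublin Core namespace that carry text.
        if el.tag.startswith(f"{{{DC_NS}}}") and el.text and el.text.strip():
            name = el.tag.split("}", 1)[1]
            # DC elements are repeatable, so every field is stored as a list.
            fields.setdefault(name, []).append(el.text.strip())
    return identifier, fields


identifier, fields = parse_oai_dc_record(SAMPLE)
```

Each key in fields (title, creator, subject, ...) is the kind of source that would then be mapped onto a Feeds target.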
Features:
- Harvests from OAI-PMH repositories, honoring resumptionTokens and compression (deleted records are not supported yet).
- Can map OAI_DC metadata onto Feeds targets (CCK, taxonomy, etc.). This or other modules could plug in support for other metadata schemas in the future, or you can roll your own parser using existing modules.
- Can harvest from the entire repository or a single set. Sets are loaded in via AHAH when creating the importer.
- You can set up multiple harvesting rules/mappings per set and/or repository, as you desire.
- Record storage handled by Feeds: nodes (CCK support), raw database, etc. Extensible with other modules.
- Cron scheduling handled by Feeds: anywhere from every cron run to as long as a month between harvests.
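To make the resumptionToken and single-set points above concrete, here is a minimal sketch of a harvesting loop, again in Python rather than the module's own PHP. The names harvest_identifiers and http_fetch are hypothetical, and compression and error handling are left out. Per the OAI-PMH spec, a follow-up request carries only the verb and the resumptionToken, and an empty or missing token means the list is complete.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

OAI = "{http://www.openarchives.org/OAI/2.0/}"


def http_fetch(url):
    """Default fetcher: return the response body as text (no compression handling)."""
    with urllib.request.urlopen(url) as resp:
        return resp.read().decode("utf-8")


def harvest_identifiers(base_url, set_spec=None, fetch=http_fetch):
    """Yield record identifiers from a repository, following resumptionTokens."""
    params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    if set_spec:
        # Restrict the harvest to a single set; omit to harvest everything.
        params["set"] = set_spec
    while True:
        url = base_url + "?" + urllib.parse.urlencode(params)
        root = ET.fromstring(fetch(url))
        for header in root.iter(OAI + "header"):
            yield header.findtext(OAI + "identifier")
        token = root.findtext(f".//{OAI}resumptionToken")
        if not token or not token.strip():
            break  # no token, or an empty one: this was the last page
        # resumptionToken is an exclusive argument: send it with the verb only.
        params = {"verb": "ListRecords", "resumptionToken": token.strip()}
```

Passing your own fetch function (anything that takes a URL and returns XML text) makes it easy to try the loop against canned responses before pointing it at a real repository.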
As I mentioned, I know there are several similar modules available; see the "Similar modules" entry on the project page for my thoughts/comparison.