Feed Scraper

Categories

Component ID

459770

Component name

Feed Scraper

Component type

module

Maintenance status

Development status

Component security advisory coverage

not-covered

Downloads

2348

Component created

Component changed

Component body

This project has been abandoned because the maintainers of Feed Element Mapper launched a successor project, Feeds. Read more about the future of FeedAPI and Feed Element Mapper in "Good bye FeedAPI, hello Feeds".

Add-on module for Feed Element Mapper that extracts (scrapes) content from HTML encoded in syndication feed items and allows it to be mapped to CCK fields. To extract HTML content, it ships with XPath and regular expression parsers out of the box; the module can be extended by providing custom parsers.
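The pluggable-parser idea can be illustrated with a minimal sketch. It is written in Python for illustration only and is not the module's actual (PHP) API: the registry `PARSERS`, the decorator `register_parser`, and the `after_label` custom parser are all hypothetical names. The XPath parser relies on the limited XPath subset in Python's standard library and assumes the item HTML is well formed.

```python
import re
import xml.etree.ElementTree as ET
from typing import Callable, Dict, List

# Hypothetical registry: parser name -> function(html, expression) -> matches.
Parser = Callable[[str, str], List[str]]
PARSERS: Dict[str, Parser] = {}


def register_parser(name: str) -> Callable[[Parser], Parser]:
    """Register a parser function under the given name."""
    def decorator(func: Parser) -> Parser:
        PARSERS[name] = func
        return func
    return decorator


@register_parser("regex")
def regex_parser(html: str, expression: str) -> List[str]:
    # The expression is assumed to contain exactly one capture group.
    return [m.group(1) for m in re.finditer(expression, html)]


@register_parser("xpath")
def xpath_parser(html: str, expression: str) -> List[str]:
    # ElementTree supports only a subset of XPath and needs well-formed
    # markup, so the fragment is wrapped in a single root element.
    root = ET.fromstring(f"<root>{html}</root>")
    return [el.text or "" for el in root.findall(expression)]


@register_parser("after_label")
def after_label_parser(html: str, expression: str) -> List[str]:
    # Hypothetical custom parser: text following "<label>:" up to the next tag.
    pattern = re.escape(expression) + r":\s*([^<]+)"
    return [m.group(1).strip() for m in re.finditer(pattern, html)]


item_html = "<p>Price: <b>42</b> EUR</p><p>Author: Jane Doe</p>"
price_by_regex = PARSERS["regex"](item_html, r"<b>(\d+)</b>")
price_by_xpath = PARSERS["xpath"](item_html, ".//b")
author = PARSERS["after_label"](item_html, "Author")
```

Adding a parser in this sketch only requires registering another function with the same signature, which mirrors the extension point described above.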

Usage Example

The module can be used, for example, to extract an image URL from raw HTML and map it to a FileField image field.
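The image URL case can be sketched with both parser styles. This is an illustrative Python snippet, not the module's own code: the sample markup, the example.com URL, and the function names are assumptions, and the XPath variant only works when the item HTML is well formed.

```python
import re
import xml.etree.ElementTree as ET
from typing import Optional

# Made-up body of a feed item, used only for this example.
ITEM_HTML = (
    '<div><p>Photo of the day</p>'
    '<img src="http://example.com/photo.jpg" alt="Photo" /></div>'
)


def image_url_xpath(html: str) -> Optional[str]:
    """Return the src of the first <img> element, using an XPath-style query."""
    root = ET.fromstring(f"<root>{html}</root>")
    img = root.find(".//img[@src]")
    return img.get("src") if img is not None else None


def image_url_regex(html: str) -> Optional[str]:
    """Return the src of the first <img> tag, using a regular expression."""
    match = re.search(
        r'<img\b[^>]*\bsrc=["\']([^"\']+)["\']', html, re.IGNORECASE
    )
    return match.group(1) if match else None


url_from_xpath = image_url_xpath(ITEM_HTML)
url_from_regex = image_url_regex(ITEM_HTML)
```

The regular expression variant tolerates malformed markup but is easier to fool with unusual attribute ordering or quoting; the XPath variant is stricter about input but less fragile once the markup parses. The extracted URL is the value that would then be mapped to the image field.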

Module Dependencies

The module depends on:

Credits

This project has been sponsored by Nuvole and Youth Agora.