Extracting entire articles

jitalku006's picture

When I use Aggregator, I can't pull the full article; I am only shown a teaser. Here is an example of what I mean: http://feeds.washingtonpost.com/rss/politics. If I click on the link to an article, the full article with its image shows up. Is there any way I can pull the full articles?

Comments


budda's picture

The source RSS feed you referenced only contains teasers. So your source RSS feed is the problem.
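One way to check this for any feed is to look at what each item actually carries. The sketch below is only an illustration, not how Aggregator works internally: it parses RSS text with Python's standard library and reports whether an item has the optional `content:encoded` element (the usual place for a full body) or only a `description` teaser. The `summarize_feed` name and the sample feed are made up for the example.

```python
import xml.etree.ElementTree as ET

# Qualified name of the optional <content:encoded> element (RSS content module).
CONTENT_ENCODED = "{http://purl.org/rss/1.0/modules/content/}encoded"


def summarize_feed(xml_text):
    """Return one dict per <item>, noting whether a full body is present."""
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iter("item"):
        description = item.findtext("description", default="")
        full_body = item.findtext(CONTENT_ENCODED)
        items.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "teaser_length": len(description),
            "has_full_body": bool(full_body and full_body.strip()),
        })
    return items


# Hypothetical feed: the first item is teaser-only, the second has a full body.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Example feed</title>
    <item>
      <title>Teaser only</title>
      <link>http://example.com/a</link>
      <description>Short summary.</description>
    </item>
    <item>
      <title>Full text</title>
      <link>http://example.com/b</link>
      <description>Short summary.</description>
      <content:encoded><![CDATA[<p>Whole article body.</p>]]></content:encoded>
    </item>
  </channel>
</rss>"""

for entry in summarize_feed(SAMPLE_FEED):
    status = "full body" if entry["has_full_body"] else "teaser only"
    print(entry["title"], "->", status)
```

If every item in a feed comes back as teaser only, no aggregator setting will produce the full text, because the text simply isn't in the feed.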

With some coding, you can get into page scraping to retrieve the full articles. But it's possibly frowned upon by the source site, and it's tricky to keep working if the source website changes its design at all.
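To give a rough idea of what that coding involves, here is a minimal sketch using only Python's standard library. It assumes the story text sits in `<p>` tags inside an `<article>` element, which is purely a guess about the markup; a real site may use something else entirely, and that is exactly the part that breaks when the design changes. The class and function names are hypothetical, and you should check the source site's terms before scraping it.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class ArticleTextParser(HTMLParser):
    """Collect the text of every <p> found inside an <article> element."""

    def __init__(self):
        super().__init__()
        self.article_depth = 0      # greater than 0 while inside <article>
        self.in_paragraph = False
        self.paragraphs = []

    def handle_starttag(self, tag, attrs):
        if tag == "article":
            self.article_depth += 1
        elif tag == "p" and self.article_depth:
            self.in_paragraph = True
            self.paragraphs.append("")

    def handle_endtag(self, tag):
        if tag == "article" and self.article_depth:
            self.article_depth -= 1
            if self.article_depth == 0:
                self.in_paragraph = False
        elif tag == "p":
            self.in_paragraph = False

    def handle_data(self, data):
        if self.in_paragraph:
            self.paragraphs[-1] += data


def extract_article_text(page_html):
    """Return the article's paragraphs as a list of whitespace-normalized strings."""
    parser = ArticleTextParser()
    parser.feed(page_html)
    parser.close()
    return [" ".join(p.split()) for p in parser.paragraphs if p.strip()]


def fetch_article_text(url):
    """Download one article page and extract its paragraphs (needs network access)."""
    with urlopen(url, timeout=10) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        page_html = response.read().decode(charset, errors="replace")
    return extract_article_text(page_html)


# Hypothetical page markup, used so the sketch runs without hitting a real site.
SAMPLE_PAGE = """<html><body>
<p>Site navigation, not part of the story.</p>
<article>
  <h1>Headline</h1>
  <p>First <b>paragraph</b> of the story.</p>
  <p>Second paragraph.</p>
</article>
<p>Footer text.</p>
</body></html>"""

print(extract_article_text(SAMPLE_PAGE))
```

In practice you would call `fetch_article_text` on each item's link from the feed, and rewrite the parser rules whenever the source site's markup changes.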

Thank you for your response.

khushma's picture

I didn't realize that the aggregator only looks at the feed page and not the full articles. I read an earlier post about using Yahoo Pipes to scrape RSS feeds. I want my feeds to look like those on Yahoo, LinkedIn, or Newser, where the article's image is shown and clicking that image takes you to the actual article. I'm guessing that Pipes will be able to follow the link to each article and pull what I need. Hopefully it will work.
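For anyone who would rather code the image-links-to-article layout than rely on Pipes, here is a rough sketch of one approach. It assumes the article page advertises a lead image through an `og:image` meta tag (Open Graph), which many news sites do but which nothing in this thread confirms for this feed. The function names and sample markup are hypothetical.

```python
from html import escape
from html.parser import HTMLParser


class OgImageParser(HTMLParser):
    """Remember the first <meta property="og:image" content="..."> on a page."""

    def __init__(self):
        super().__init__()
        self.image_url = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.image_url:
            return
        attributes = dict(attrs)
        if attributes.get("property") == "og:image":
            self.image_url = attributes.get("content")


def find_og_image(page_html):
    """Return the og:image URL from the page, or None if there is none."""
    parser = OgImageParser()
    parser.feed(page_html)
    parser.close()
    return parser.image_url


def linked_image(article_url, image_url, title):
    """Build an <a><img></a> snippet where the image links to the article."""
    return '<a href="{0}"><img src="{1}" alt="{2}"></a>'.format(
        escape(article_url, quote=True),
        escape(image_url, quote=True),
        escape(title, quote=True),
    )


# Hypothetical article page head, standing in for a downloaded page.
SAMPLE_HEAD = """<html><head>
<title>Example story</title>
<meta property="og:title" content="Example story">
<meta property="og:image" content="http://example.com/photo.jpg">
</head><body></body></html>"""

image = find_og_image(SAMPLE_HEAD)
if image:
    print(linked_image("http://example.com/story", image, "Example story"))
```

The feed item supplies the article link and title, the article page supplies the image, and the snippet combines them. When a page has no `og:image`, `find_og_image` returns None and you would fall back to a plain text link.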

Keep an eye on Yahoo!

budda's picture

Yes, Yahoo Pipes is a great solution for non-developers. Good research!

However, your site will then depend on Yahoo keeping the Pipes service running in order to function. Given how many services Yahoo has shut down over the past 12 months, I'd keep a close eye on them...

RSS & Aggregation
