Import Articles from 'Europresse' Using the 'tm' Text Mining Framework
Provides a 'tm' Source to create corpora from articles exported from the 'Europresse' content provider as HTML files. It is able to read both text content and meta-data information (including source, date, title, author and pages).
Useful links