Hattai's Fortune: a script to be used by a bot, that gives news (from Público)
This is the behaviour of the script, as extracted from the previous implementation.
When run the script will:
- Read the RSS feed of the Público newspaper (http://http://feeds.feedburner.com/publicoRSS)
- Exclude the articles which contain any word from a given list of "bad words" (the match is case insensitive)
- Exclude articles which have an empty title
- Add new articles to the article list
- Trim the article list up to a given maximum size
- Pick the "best" article from the article list and:
- Store the title in a file called title
- Store the URL in a file called link
- Print out the title to STDOUT
- Save the current article list for the next run
The way the "best article" is chosen is roughly like this:
- Look for the articles which have been used the least (ideally never)
- Pick the most recent one
We assume the feed delivers the articles ordered by date, so no check is done on the script.