- the Twitter interface interface takes the top few trending topics from Twitter, cleans them, and sends them to Web Scraping module
- the Web Scraping module searches the web for articles related to that topic that are most recent and are from trusted sources and then forwards them to the text summarization module
- the text summarization module uses a modified word frequency method to get the gist of the news article in fewer than 240 characters and then sends it to the Twitter interface module
- a shortened hyperlink to the article is appended to the short gist, and the final result is posted on Twitter
For selecting the most appropriate extractive summarization method, the following algorithms were compared:
- Word Frequency
- Term Frequency - Inverse Document Frequency
- TextRank
- Latent Semantic Analysis
- Improved Word Frequency (we modified the word frequency algorithm)