Aggregate news stories using RSS feeds and rank news stories according to relevance based on content of RSS feeds

Encerrado Postado Mar 10, 2016 Pago na entrega
Encerrado

The goal of my project is to aggregate news from different sites and display the news headlines and links ranked by relevance (defined below). The news relates to stocks and other business related topics. My initial thought is to use RSS feeds but I’m open to other methods of collecting news stories. See this example of Yahoo news for news aggregator. I would like to have a similar format but with ranked results. [url removed, login to view]

Relevance can be defined several way and I’m not set on how to define it but here are some thoughts. If a user wants to see the news for Microsoft (ticker MSFT), I’d like the RSS feeds from various sites to be ranked according to the following methodology:

Keywords in this example = “Microsoft” (company name) and “MSFT” (ticker)

1 point if story is from Wall Street Journal or New York Times or Financial Times

1 point if keywords shows up in the Title of the news story

Or

1 Point if keywords shows up in the first paragraph of the story

¼ point for each time keywords show up in story after the first paragraph

1 point if no other company is mentioned in the story

-0.5 points if three or more companies are mentioned in the story

***Is there a way to rank RSS feed based on social media mentions?

***what other relevance mechanisms do you suggest?

***are there programs that quantify emotions in text? For example if words like announced, disappointed, beat, exceeded or missed are used, this denotes a strong event that we want to rank higher than others.

In the final ranking of the RSS feeds, user will have the ability to filter and rank through dropdowns

Sort: Relevance, Newest, Oldest, Longest, Shortest

Filter: Last 1 day, Last 5 days, Last 30 days, Last 3 months, Last 1 Year, All, Custom Dates

We are using a front end environment with React.js and this news feed will be one of many components. The ranking calculation will need to be done on the server and given to the web browser.

ID do Projeto: #9904931

Sobre o projeto

Projeto remoto Ativo em há 7 anos