Our MongoDB Atlas hackathon idea: news headlines and NLP

The idea: a news headlines search engine with sentiment analysis

After scrolling through possible subjects and MongoDB technologies we might use for the hackathon, we (me and 2 friends) came up with an idea that would leverage the text searc…


This content originally appeared on DEV Community and was authored by yactouat

The idea: a news headlines search engine with sentiment analysis

After scrolling through possible subjects and MongoDB technologies we might use for the hackathon, we (me and 2 friends) came up with an idea that would leverage the text search capabilities of Atlas Search with a massive amount of news headlines data.

We want our news headlines search engine to be able to do these kinds of searches:

  • "War in Iraq" => would give all headlines related to that subject even if the title does not exactly match
  • "sentiment about war in Iraq in news headlines from date 1 to date 2" => would output a main sentiment related to that subject using NLP

More query filters and capabilities could later be added to the app', but if we make a text box work that outputs relevant results related showing that the aforementioned features work, we would be very pleased :)

We have no prior formal knowledge of Data Science Atlas Search or NLP, so I guess it's gonna be a hell of a ride ^^

initial steps

We derived a few major steps to create our app':

  1. get a maximum of data related to news headlines as CSV's or JSON's from various sources
  2. define common data structure of the news headlines entity(ies) we'll use in the app
  3. I/O algorithm to format all data from various sources into one or multiple files with same format
  4. fill mongo DB with formatted data
  5. implement full text search with Atlas Search
  6. add sentiment analysis to headlines text search feature

These are all the vague steps we thought about, I guess these will be split into multiple sub todos as we go along.

If you want to see how our project moves on, check out => https://github.com/yactouat/dev.to_mongodbatlas_hackathon_2022/projects/2

Stay Tuned !


This content originally appeared on DEV Community and was authored by yactouat


Print Share Comment Cite Upload Translate Updates
APA

yactouat | Sciencx (2021-12-21T21:04:45+00:00) Our MongoDB Atlas hackathon idea: news headlines and NLP. Retrieved from https://www.scien.cx/2021/12/21/our-mongodb-atlas-hackathon-idea-news-headlines-and-nlp/

MLA
" » Our MongoDB Atlas hackathon idea: news headlines and NLP." yactouat | Sciencx - Tuesday December 21, 2021, https://www.scien.cx/2021/12/21/our-mongodb-atlas-hackathon-idea-news-headlines-and-nlp/
HARVARD
yactouat | Sciencx Tuesday December 21, 2021 » Our MongoDB Atlas hackathon idea: news headlines and NLP., viewed ,<https://www.scien.cx/2021/12/21/our-mongodb-atlas-hackathon-idea-news-headlines-and-nlp/>
VANCOUVER
yactouat | Sciencx - » Our MongoDB Atlas hackathon idea: news headlines and NLP. [Internet]. [Accessed ]. Available from: https://www.scien.cx/2021/12/21/our-mongodb-atlas-hackathon-idea-news-headlines-and-nlp/
CHICAGO
" » Our MongoDB Atlas hackathon idea: news headlines and NLP." yactouat | Sciencx - Accessed . https://www.scien.cx/2021/12/21/our-mongodb-atlas-hackathon-idea-news-headlines-and-nlp/
IEEE
" » Our MongoDB Atlas hackathon idea: news headlines and NLP." yactouat | Sciencx [Online]. Available: https://www.scien.cx/2021/12/21/our-mongodb-atlas-hackathon-idea-news-headlines-and-nlp/. [Accessed: ]
rf:citation
» Our MongoDB Atlas hackathon idea: news headlines and NLP | yactouat | Sciencx | https://www.scien.cx/2021/12/21/our-mongodb-atlas-hackathon-idea-news-headlines-and-nlp/ |

Please log in to upload a file.




There are no updates yet.
Click the Upload button above to add an update.

You must be logged in to translate posts. Please log in or register.