This project contains several application for consuming and processing Twitter streams for the Supply Chain use case in the GeoKnow.eu project.
This projects contains three Spark applications for:
- streaming-twitter: Twitter streaming extrtaction
- fox-extractor: Name Entity Recognition extraction that consumes tweets extracted in the streaming-twitter that uses FOX
- rdf-serializer: a persistance component for saving results form the fox-extractor
To use these applications you need to have installed and running:
You need to configure Spark to have at least three workers available (one for each application). Submit each of the applications to spark using spark-submit (check the README on each of the directories).
The GeoKnowSocialAPI, a work at https://github.com/GeoKnow/SocialAPI by OntosAG is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.