Applied Flair word and document embeddings to a small subset of the provided mission literature dataset, then computed cosine similarity between the resulting embedding vectors. The top 'k' entries of the similarity vector are mapped to their content IDs and returned as 'Similar Content' through a REST API.
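The ranking step can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it assumes the embeddings have already been computed (for example with Flair) and stacked into a NumPy matrix with one row per content item. The function name `top_k_similar`, its parameters, and the sample IDs are hypothetical.

```python
import numpy as np


def top_k_similar(query_vec, embedding_matrix, content_ids, k=5):
    """Return (content_id, score) pairs for the k rows most similar to query_vec.

    Hypothetical helper: embedding_matrix holds one precomputed embedding per
    row, and content_ids[i] is the ID of the item embedded in row i.
    """
    query = np.asarray(query_vec, dtype=float)
    matrix = np.asarray(embedding_matrix, dtype=float)

    # Cosine similarity is the dot product of L2-normalised vectors.
    # The small epsilon avoids division by zero for all-zero vectors.
    query_unit = query / (np.linalg.norm(query) + 1e-12)
    matrix_unit = matrix / (np.linalg.norm(matrix, axis=1, keepdims=True) + 1e-12)
    scores = matrix_unit @ query_unit

    # Sort by descending score and keep the first k indices.
    k = min(k, len(content_ids))
    top_idx = np.argsort(-scores)[:k]
    return [(content_ids[i], float(scores[i])) for i in top_idx]


if __name__ == "__main__":
    # Toy 2-dimensional "embeddings" purely for illustration.
    ids = ["a", "b", "c", "d"]
    vectors = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.0]]
    print(top_k_similar([1.0, 0.0], vectors, ids, k=2))
```

In a service, the returned list of ID and score pairs would typically be serialised to JSON as the 'Similar Content' response; if the query item is itself in the matrix, it will rank first and may need to be filtered out.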
The tech stack is Python (PyTorch, Flair, pandas) plus Azure Machine Learning Service, used both for training in the cloud and for deploying the model as a web service. Training on the full dataset is in progress.
REST API documentation (Postman): https://documenter.getpostman.com/view/5756089/SVfGzCVu?version=latest