An application that scrapes text data from subreddits and visualise sentiments and text patterns
-
Updated
Aug 4, 2024 - Python
An application that scrapes text data from subreddits and visualise sentiments and text patterns
Kirli veri çekildiğinde ön işleme adımlarına gerek kalmadan model eğitimi için hazır hale getirmek amacıyla yapılan uygulamadır.
Finding of ACL2023: Clustering-Aware Negative Sampling for Unsupervised Sentence Representation
🚀 EASE-ReD: Ethnicity Analysis and Sentence Embedding from Restaurant Distribution. Predicting ethnicity distribution in an area based on its restaurants data. Cleaning the data using sentence embeddings!
Scripts, data, and results from the "Through time with BERT" project, which evaluated and examined the extent to which English tenses are represented in BERT's raw sentence embeddings.
Official implementation for paper "Learning Discrete Sentence Representations via Construction & Decomposition".
C++ implementation of the paper "Segmentation-free compositional n-gram embedding". NAACL-HLT2019.
A project aiming to leverage text embeddings and Milvus, a high-performance vector search engine, to detect duplicate job postings.
CRLT: A Unified Contrastive Learning Toolkit for Unsupervised Text Representation Learning
HASHET (HAshtag recommendation using Sentence-to-Hashtag Embedding Translation) is a model aimed at suggesting a relevant set of hashtags for a given post.
[NAACL(2019)] Generating Knowledge Graph Paths from Textual Definitions using Sequence-to-Sequence Models
EmbedRank implemented in Python.
Run sentence-transformers (SBERT) compatible models in Node.js or browser.
Difference-based Contrastive Learning for Korean Sentence Embeddings
All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers
Paraphrase Generation model using pair-wise discriminator loss
Benchmark for Thai sentence representation
This repository contains various ways to calculate sentence vector similarity using NLP models
Add a description, image, and links to the sentence-embedding topic page so that developers can more easily learn about it.
To associate your repository with the sentence-embedding topic, visit your repo's landing page and select "manage topics."