GitHub - Abhishekmamidi123/ABSA-Codemix: Research - The aim of this project is to find the aspect from a given code-mix sentence. The traditional sequence tagging methods are compared with Deep learning methods. The concept of Question and Answering model is used to achieve this task.

Aspect Based Sentimental Analysis Codemix.

Data:

We have collected twitter codemix data. After preprocessing the data, we are left with 3500 tweets with Aspects annotated.

Progress:

We have divided our project into two parts - Aspect Identification and Sentimental Analysis.
Working on Aspect Identification -- Ongoing work.

Parts of speech plays an important role in Aspect Identication.
We have 1000 codemix tweets with annoatated parts-of-speech. We used this data for training.
We have trained the data using different Machine learning methods and achieved the following accuracies:
- Conditional Random Fields(CRF): 77.185%
- Trigram + Bigram + Unigram: 68.172%
- Hidden Markov Method(HMM): 18.014%

Aspect Identification:

Till now, we applied CRF and SVM methods for identifying Aspects in a sentence.
CRF: We have used nearly 65 features(considering window length of 5) and achieved an accuracy of 44.73%.
SVM: We represented each word in the form of a vector and trained these vectors.

Contributors:

M R Abhishek and K Vagdevi

Name		Name	Last commit message	Last commit date
Latest commit History 78 Commits
Aspect_Identification		Aspect_Identification
CustomPOSTagger		CustomPOSTagger
data		data
refinedData		refinedData
README.md		README.md
getData_EN.py		getData_EN.py
getData_HI.py		getData_HI.py
preprocess.py		preprocess.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Aspect Based Sentimental Analysis Codemix.

Data:

Progress:

Contents:

Parts of speech tagger for codemix:

Aspect Identification:

Contributors:

About

Releases

Packages

Contributors 2

Languages

Abhishekmamidi123/ABSA-Codemix

Folders and files

Latest commit

History

Repository files navigation

Aspect Based Sentimental Analysis Codemix.

Data:

Progress:

Contents:

Parts of speech tagger for codemix:

Aspect Identification:

Contributors:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages