Skip to content

Latest commit

 

History

History
69 lines (49 loc) · 1.82 KB

README.md

File metadata and controls

69 lines (49 loc) · 1.82 KB

False Friends

This project is about distinguishing true and false friends between Spanish and Portuguese. To run the code, just execute the following (and follow the instructions from there):

./falsefriends.py --help

See A High Coverage Method for Automatic False Friends Detection for Spanish and Portuguese for more information.

Docker

To open a bash shell in the context of the project, use this docker image:

docker run -ti pln-fing-udelar/false-friends /bin/bash

To map a local directory with resources with the container, for example run:

docker run -ti -v $PWD/resources/big:/usr/src/app/resources/big pln-fing-udelar/false-friends /bin/bash

If you want to build the image:

docker build -t false-friends .

Non-Docker way

To install dependencies:

pip install Cython # Needed to **install** "word2vec" package. 
pip install -r requirements.txt

Similar words in Wikipedia's

Similar words in Wikipedia's can be studied also running:

ipython -i scripts/comparesimilar.py

With the -i flag, a interactive iPython shell is ready to be used after the script execution.

Citation

If you use this work in your research, please cite us:

@InProceedings{W18-3903,
  author = 	"Castro, Santiago
		and Bonanata, Jairo
		and Ros{\'a}, Aiala",
  title = 	"A High Coverage Method for Automatic False Friends Detection for Spanish and Portuguese",
  booktitle = 	"Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018)",
  year = 	"2018",
  publisher = 	"Association for Computational Linguistics",
  pages = 	"29--36",
  location = 	"Santa Fe, New Mexico, USA",
  url = 	"http://aclweb.org/anthology/W18-3903"
}