Repository for Yowlumne data, models, and ancillary files for WIELD
This repository contains the data, models, and ancillary files for my WIELD internship project.
The repository contains four folders:
- raw_data : the raw data in text form derived from the Smithsonian Institution's John Peabody Harrington papers collection
- tesseract : a folder to contain the Tesseract model file and related ancillary files
- nlp : a folder to contain the models and ancillary files for POS and grammatical form tagging and named entity recognition
- website : a folder to contain the website files for WIELD