wikiHowToImprove: A Resource and Analyses on Edits in Instructional Texts

irshadbhat/wikiHowToImprove

wikiHowToImprove

Dependencies

Train models from scratch

  • python baseline_bow_clf.py wikiHow_revisions_corpus.txt
  • python lstm_binary_clf.py --data_file wikiHow_revisions_corpus.txt --test_files data/test_files.txt --dev_files data/dev_files.txt --pre_word_vec cc.en.300.vec --bin_vec 0 --save models/lstm_clf_model --batch_size 256 --dynet-devices CPU,GPU:0 --iter 25
  • python lstm_pairwise_ranking.py --data_file wikiHow_revisions_corpus.txt --test_files data/test_files.txt --dev_files data/dev_files.txt --pre_word_vec cc.en.300.vec --bin_vec 0 --save models/lstm_ranking_model --batch_size 256 --dynet-devices CPU,GPU:0 --iter 25
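Both LSTM commands take pre-trained embeddings through --pre_word_vec cc.en.300.vec with --bin_vec 0, which presumably selects the plain-text vector format rather than a binary one. As a minimal sketch of what that text layout looks like (a header line with vocabulary size and dimension, then one word per line followed by its values), the hypothetical helper below parses such lines. The name read_text_vectors and its limit parameter are illustrative assumptions, not code from this repository, and the scripts may load the file differently.

```python
from typing import Dict, Iterable, List, Optional, Tuple


def read_text_vectors(
    lines: Iterable[str], limit: Optional[int] = None
) -> Tuple[int, Dict[str, List[float]]]:
    """Parse word vectors stored in the plain-text .vec layout.

    Assumed layout (hypothetical helper, not part of the repository):
    a header line "<vocab_size> <dim>", then one line per word of the
    form "<word> <v1> ... <v_dim>".
    """
    it = iter(lines)
    header = next(it).split()
    dim = int(header[1])
    vectors: Dict[str, List[float]] = {}
    for line in it:
        if limit is not None and len(vectors) >= limit:
            break  # stop early when only the first `limit` words are wanted
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip lines that do not match the declared dimension
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return dim, vectors


# Tiny in-memory sample standing in for the real embeddings file.
sample = ["2 3", "the 0.1 0.2 0.3", "of -0.4 0.5 0.6"]
dim, vecs = read_text_vectors(sample)
```

For the real file, the same function can be given an open file handle, for example `open("cc.en.300.vec", encoding="utf-8")`, with a small limit to inspect the first few entries without loading everything into memory.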

Reproduce results with pre-trained models

  • python baseline_bow_clf.py wikiHow_revisions_corpus.txt
  • python lstm_binary_clf.py --data_file wikiHow_revisions_corpus.txt --test_files data/test_files.txt --dev_files data/dev_files.txt --pre_word_vec cc.en.300.vec --bin_vec 0 --load models/lstm_clf_model --batch_size 256 --dynet-devices CPU,GPU:0
  • python lstm_pairwise_ranking.py --data_file wikiHow_revisions_corpus.txt --test_files data/test_files.txt --dev_files data/dev_files.txt --pre_word_vec cc.en.300.vec --bin_vec 0 --load models/lstm_ranking_model --batch_size 256 --dynet-devices CPU,GPU:0

Crawl wikiHow articles with their revision histories

  • python crawl_wikiHow.py
