integrity-data

This repo contains the data files used to verify the integrity of Common Crawl's crawl data. Source code is in https://github.com/commoncrawl/cc-integrity/