This week's data is all about passwords. Data is sourced from Information is Beautiful, and the accompanying graphic comes from the same group.
There's lots of additional information about password quality and strength in the source doc. Please note that the "strength" column in this dataset is relative to these common (aka "bad") passwords, and YOU SHOULDN'T USE ANY OF THEM!
Wikipedia has a nice article on password strength as well.
# Get the Data
```r
passwords <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2020/2020-01-14/passwords.csv')
# Or read in with tidytuesdayR package (https://github.com/dslc-io/tidytuesdayR)
# PLEASE NOTE TO USE 2020 DATA YOU NEED TO UPDATE tidytuesdayR from GitHub
# Either ISO-8601 date or year/week works!
# Install via pak::pak("dslc-io/tidytuesdayR")
tuesdata <- tidytuesdayR::tt_load('2020-01-14')
tuesdata <- tidytuesdayR::tt_load(2020, week = 3)
passwords <- tuesdata$passwords
```
variable | class | description |
---|---|---|
rank | double | popularity in their database of released passwords |
password | character | Actual text of the password |
category | character | What category does the password fall into? |
value | double | Time to crack by online guessing |
time_unit | character | Time unit to match with value |
offline_crack_sec | double | Time to crack offline in seconds |
rank_alt | double | Rank 2 |
strength | double | Quality of the password, where 10 is highest and 1 is lowest. Please note that these values are relative to these generally bad passwords |
font_size | double | Used to create the graphic for KIB |
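
Because `value` only makes sense together with `time_unit`, it can help to convert the online-guessing time into seconds so it is comparable with `offline_crack_sec`. The following is a minimal sketch rather than part of the original dataset documentation. The toy rows are made up for illustration, the unit labels (`seconds`, `minutes`, and so on) are assumed to match what appears in `time_unit`, and the month and year lengths are assumed conventions. The column name `online_crack_sec` is hypothetical.

```r
# Hypothetical rows shaped like the `value` / `time_unit` columns;
# the numbers are illustrative, not taken from the dataset.
toy <- data.frame(
  password  = c("example_a", "example_b", "example_c"),
  value     = c(2, 3, 1.5),
  time_unit = c("minutes", "hours", "days"),
  stringsAsFactors = FALSE
)

# Assumed conversion factors to seconds (a month is taken as 30 days,
# a year as 365 days); adjust to whatever convention you prefer.
seconds_per_unit <- c(
  seconds = 1,
  minutes = 60,
  hours   = 60 * 60,
  days    = 60 * 60 * 24,
  weeks   = 60 * 60 * 24 * 7,
  months  = 60 * 60 * 24 * 30,
  years   = 60 * 60 * 24 * 365
)

# Look up each row's factor by its unit label; unknown labels become NA.
toy$online_crack_sec <- toy$value * unname(seconds_per_unit[toy$time_unit])
toy
```

To apply the same idea to the real data, swap `toy` for `passwords`. It is worth checking `unique(passwords$time_unit)` first, since any label missing from `seconds_per_unit` yields `NA`.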