Data Quality Analyzer

Data Quality

  • 111,502,1 111 characters have only a given name or nickname listed
  • 60,690,0 60 incidents are unverified
  • 36,367,0 36 verified stories lack summaries
  • 27,367,0 27 unverified stories lack summaries
  • 8,502,3 8 character records don't cross-reference their sources
  • 7,690,0 7 incidents list unnamed characters
  • 6,502,0 6 characters may be duplicates
  • 5,367,0 5 stories have an uneven number of body swaps
  • 4,367,0 4 stories lack creator citations
  • 2,367,0 2 story entries may be duplicates
  • 1,367,18 1 story lacks incident records
  • 0,690,0 0 body swap records lack alter ego names
  • 0,16,0 0 categories are empty
  • 0,502,0 0 character records are unused
  • 0,395,0 0 creator records are unused
  • 0,3,0 0 gender classifications are unused
  • 0,17,0 0 incident types are unused
  • 0,3,0 0 modifier tags are unused
  • 0,4,0 0 significance levels are unused
  • 0,690,0 0 verified incidents lack significance data

Link Coverage

  • 95,1072,0 95 stories are missing links they really should have
  • 27,1018,30 27 URLs would fail link type autodetection with no results
  • 18,971,0 18 links are of a type potentially inappropriate to their stories
  • 17,367,67 17 story entries have only one link
  • 17,1018,10 17 URLs would have their link type incorrectly autodetected
  • 15,1018,0 15 links have names which may be worth converting to new link types
  • 6,28,0 6 link types have affinities but no equivalence roles
  • 5,28,0 5 link types have no category affinities
  • 4,1018,16 4 links are not unique to a single story
  • 1,367,0 1 online fiction entry lacks a "read it" link
  • 0,367,0 0 entries have no links
  • 0,119,0 0 link type-identification regexes lack ^ or $
  • 0,28,0 0 link types are unused
  • 0,1018,0 0 links in Wayback-like archives are unmarked
  • 0,1018,0 0 links need better names so each tooltip+icon combo will be unique for a given story
  • 0,1018,0 0 links should be marked Not Safe For Work but aren't
  • 0,367,0 0 stories have their only read_free links in Wayback-style archives
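
Two of the Link Coverage checks above (regexes lacking ^ or $, and URLs that fail link type autodetection) can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the pattern names, the patterns themselves, and the helper functions are hypothetical and are not the site's actual regexes or code. The anchoring test is also deliberately naive (it would misread an escaped `\$` at the end of a pattern).

```python
import re

# Hypothetical link-type patterns, invented for this example.
LINK_TYPE_PATTERNS = {
    "wayback": r"^https?://web\.archive\.org/web/\d+/.+$",
    "wikipedia": r"^https?://[a-z]+\.wikipedia\.org/wiki/.+$",
    "unanchored_example": r"example\.com/story/",
}


def unanchored(patterns):
    """Return the names of patterns missing a leading ^ or a trailing $."""
    return sorted(
        name
        for name, pattern in patterns.items()
        if not (pattern.startswith("^") and pattern.endswith("$"))
    )


def detect_link_types(url, patterns):
    """Return the names of every link type whose pattern matches the URL."""
    return sorted(
        name for name, pattern in patterns.items() if re.search(pattern, url)
    )


archived = "https://web.archive.org/web/2010/http://example.com/story/1"
print(unanchored(LINK_TYPE_PATTERNS))
print(detect_link_types(archived, LINK_TYPE_PATTERNS))
print(detect_link_types("https://unknown.test/page", LINK_TYPE_PATTERNS))
```

In this sketch the unanchored pattern also matches the archived URL, because its text appears in the middle of the address. That is the kind of incorrect autodetection anchoring is meant to prevent. A URL that matches no pattern returns an empty list, which corresponds to autodetection with no results.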

Entries ending with an asterisk (*) are unavoidably at risk of false positives.