henchman.diagnostics.profile

henchman.diagnostics.profile(data, corr_thresh=0.9, missing_thresh=0.1, card_thresh=50)[source]

Profile dataset. Gives a dataset overview, writes the warnings and reports on all columns.

Parameters:
  • data (pd.DataFrame) – The dataframe to profile.
  • corr_thresh (float) – Warn above this threshold (Default .9)
  • missing_thresh (float) – Warn above this threshold (Default .1)
  • card_thresh (int) – Warn above this threshold (Default 50)

Example

>>> from henchman.diagnostics import profile
>>> profile(df, missing_thresh=.3, card_thresh=10)