numeric_summary

numeric_summary(data: DataFrame, column: str, fig_height: int = 600, fig_width: int = 1200, bins: int = 0, transform: str = 'identity', lower_quantile: float = 0, upper_quantile: float = 1, display_figure: bool = False) Figure

Creates a univariate EDA summary for a high cardinality numeric data column in a pandas DataFrame.

Parameters
  • data – pandas DataFrame to perform EDA on

  • column – A string matching a column in the data to visualize

  • fig_height – Height of the plot in pixels

  • fig_width – Width of the plot in pixels

  • bins – Number of bins to use for the histogram. Default (0) is to determines # of bins from the data

  • transform

    Transformation to apply to the data for plotting:

    • ’identity’: no transformation

    • ’log’: apply a logarithmic transformation (zero and negative values will be filtered out)

    • ’sqrt’: apply a square root transformation

  • lower_quantile – Lower quantile to filter data above

  • upper_quantile – Upper quantile to filter data below

  • display_figure – Whether to display the figure in addition to returning it