numeric_summary
- numeric_summary(data: DataFrame, column: str, fig_height: int = 600, fig_width: int = 1200, bins: int = 0, transform: str = 'identity', lower_quantile: float = 0, upper_quantile: float = 1, display_figure: bool = False) Figure
Creates a univariate EDA summary for a high cardinality numeric data column in a pandas DataFrame.
- Parameters
data – pandas DataFrame to perform EDA on
column – A string matching a column in the data to visualize
fig_height – Height of the plot in pixels
fig_width – Width of the plot in pixels
bins – Number of bins to use for the histogram. Default (0) is to determines # of bins from the data
transform –
Transformation to apply to the data for plotting:
’identity’: no transformation
’log’: apply a logarithmic transformation (zero and negative values will be filtered out)
’sqrt’: apply a square root transformation
lower_quantile – Lower quantile to filter data above
upper_quantile – Upper quantile to filter data below
display_figure – Whether to display the figure in addition to returning it