For a while I have now been thinking about what a good plotting modality would be for fully distributional predictions, i.e., the output of predict_proba in sktime or skpro.
The challnge is that you have a (marginal) distribution for each entry in a pandas-like table, which seems hard to visualize. I've experimented with panels (matplotlib.subplots) but I wasn't quit happy with the result.
Now, by accident (just curious clicking), I've discovered ridgeplot.
What would you think of using the look & feel of ridgeplot as a plotting function in BaseDistribution? Where rows are the rows of the data-frame like stucture, and mayb there are also columns (but I am happy with the single-variable case too)
The main difference is that the distribution does not need to be estimated via KDE, you already have it in a form where you can access pdf, cdf, etc, completely, and you have the quantile function too which helps with selecting x-axis range.
Plotting cdf and other distribution defining functions would also be neat, of course pdf (if exists), or cdf (for survival) are already great.
Imagined usage, sth like
fcst = BuildSth(Complex(params), more_params)
fcst.fit(y, fh=range(1, 10)
y_dist = fcst.predict_proba()
y_dist.plot() # default is pdf for continuous distirbutions
y_dist.plot("cdf")
Dependencies-wise, one could imagine ridgeplot as a plotting softdep like matplotlib or seaborn, of skpro and therefore indirectly of sktime.
What do you think?
For a while I have now been thinking about what a good plotting modality would be for fully distributional predictions, i.e., the output of
predict_probainsktimeorskpro.The challnge is that you have a (marginal) distribution for each entry in a
pandas-like table, which seems hard to visualize. I've experimented with panels (matplotlib.subplots) but I wasn't quit happy with the result.Now, by accident (just curious clicking), I've discovered
ridgeplot.What would you think of using the look & feel of
ridgeplotas a plotting function inBaseDistribution? Where rows are the rows of the data-frame like stucture, and mayb there are also columns (but I am happy with the single-variable case too)The main difference is that the distribution does not need to be estimated via KDE, you already have it in a form where you can access
pdf,cdf, etc, completely, and you have the quantile function too which helps with selecting x-axis range.Plotting
cdfand other distribution defining functions would also be neat, of coursepdf(if exists), orcdf(for survival) are already great.Imagined usage, sth like
Dependencies-wise, one could imagine
ridgeplotas a plotting softdep likematplotliborseaborn, ofskproand therefore indirectly ofsktime.What do you think?