.. DO NOT EDIT. .. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY. .. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE: .. "auto_examples\4_testing\plot_7_segmented_cls.py" .. LINE NUMBERS ARE GIVEN BELOW. .. only:: html .. note:: :class: sphx-glr-download-link-note :ref:`Go to the end ` to download the full example code or to run this example in your browser via Binder .. rst-class:: sphx-glr-example-title .. _sphx_glr_auto_examples_4_testing_plot_7_segmented_cls.py: Segmented Diagnose (Classification) ======================================================== .. GENERATED FROM PYTHON SOURCE LINES 8-9 Experiment initialization and data preparation .. GENERATED FROM PYTHON SOURCE LINES 9-17 .. code-block:: default from piml import Experiment from piml.models import XGB2Classifier exp = Experiment() exp.data_loader("SimuCredit", silent=True) exp.data_summary(feature_exclude=["Race", "Gender"], silent=True) exp.data_prepare(target="Approved", task_type="classification", silent=True) .. GENERATED FROM PYTHON SOURCE LINES 18-19 Train Model .. GENERATED FROM PYTHON SOURCE LINES 19-21 .. code-block:: default exp.model_train(XGB2Classifier(), name="XGB2") .. GENERATED FROM PYTHON SOURCE LINES 22-23 Summary of all segments (top 10 with the worst performance) .. GENERATED FROM PYTHON SOURCE LINES 23-26 .. code-block:: default result = exp.segmented_diagnose(model="XGB2", show="segment_table", segment_method="auto", return_data=True) result.data.head(10) .. rst-class:: sphx-glr-script-out .. code-block:: none .. raw:: html
Segment ID Feature Segment Size ACC
0 0 Balance [0.1831, 0.2088) 63 0.539683
1 1 Mortgage [0.0447, 0.064) 487 0.597536
2 2 Mortgage [0.072, 0.0761) 87 0.597701
3 3 Balance [-inf, 0.011) 575 0.603478
4 4 Mortgage [-inf, 0.0234) 176 0.619318
5 5 Mortgage [0.0404, 0.0447) 101 0.623762
6 6 Utilization [0.2221, 0.309) 538 0.624535
7 7 Mortgage [0.2859, 0.316) 56 0.625000
8 8 Mortgage [0.0234, 0.0404) 384 0.635417
9 9 Utilization [-inf, 0.2221) 999 0.644645


.. GENERATED FROM PYTHON SOURCE LINES 27-28 Summary of all segments of a given feature (top 10 with the worst performance) .. GENERATED FROM PYTHON SOURCE LINES 28-32 .. code-block:: default result = exp.segmented_diagnose(model="XGB2", show="segment_table", segment_method="auto", segment_feature="Balance", return_data=True) result.data .. rst-class:: sphx-glr-script-out .. code-block:: none .. raw:: html
Segment ID Feature Segment Size ACC
0 0 Balance [0.1831, 0.2088) 63 0.539683
1 1 Balance [-inf, 0.011) 575 0.603478
2 2 Balance [0.011, 0.0184) 446 0.650224
3 3 Balance [0.0184, 0.0211) 158 0.658228
4 4 Balance [0.0211, 0.0469) 1119 0.675603
5 5 Balance [0.2804, inf] 45 0.688889
6 6 Balance [0.0469, 0.1831) 1506 0.724436
7 7 Balance [0.2088, 0.2804) 88 0.795455


.. GENERATED FROM PYTHON SOURCE LINES 33-34 Accuracy talbe of the samples in that segment .. GENERATED FROM PYTHON SOURCE LINES 34-37 .. code-block:: default exp.segmented_diagnose(model="XGB2", show="accuracy_table", segment_id=0, segment_method="auto", segment_feature="Balance") .. rst-class:: sphx-glr-script-out .. code-block:: none ACC AUC F1 LogLoss Brier Train 0.7563 0.8512 0.7622 0.4899 0.1604 Test 0.5397 0.5777 0.5538 0.7566 0.2760 Gap -0.2166 -0.2735 -0.2084 0.2667 0.1156 .. GENERATED FROM PYTHON SOURCE LINES 38-39 Residual analysis of the samples in that segment .. GENERATED FROM PYTHON SOURCE LINES 39-43 .. code-block:: default exp.segmented_diagnose(model="XGB2", show="accuracy_residual", segment_id=0, segment_method="auto", segment_feature="Balance", show_feature="Mortgage", figsize=(5, 4)) .. image-sg:: /auto_examples/4_testing/images/sphx_glr_plot_7_segmented_cls_001.png :alt: Residual Plot :srcset: /auto_examples/4_testing/images/sphx_glr_plot_7_segmented_cls_001.png :class: sphx-glr-single-img .. GENERATED FROM PYTHON SOURCE LINES 44-45 Weakspot analysis of the samples in that segment .. GENERATED FROM PYTHON SOURCE LINES 45-49 .. code-block:: default exp.segmented_diagnose(model="XGB2", show="weakspot", segment_id=0, segment_method="auto", segment_feature="Balance", slice_features=["Mortgage"], metric="AUC", figsize=(5, 4)) .. image-sg:: /auto_examples/4_testing/images/sphx_glr_plot_7_segmented_cls_002.png :alt: Weak Regions :srcset: /auto_examples/4_testing/images/sphx_glr_plot_7_segmented_cls_002.png :class: sphx-glr-single-img .. GENERATED FROM PYTHON SOURCE LINES 50-51 Distributional distance comparison between the specificed segment and the remaining (feature-by-feature) .. GENERATED FROM PYTHON SOURCE LINES 51-55 .. code-block:: default res = exp.segmented_diagnose(model="XGB2", show="distribution_shift", segment_id=0, segment_method="auto", segment_feature="Balance", figsize=(5, 4), return_data=True) .. image-sg:: /auto_examples/4_testing/images/sphx_glr_plot_7_segmented_cls_003.png :alt: Data distance(In segment vs. out of segment) :srcset: /auto_examples/4_testing/images/sphx_glr_plot_7_segmented_cls_003.png :class: sphx-glr-single-img .. GENERATED FROM PYTHON SOURCE LINES 56-57 Distributional distance comparison between the specificed segment and the remaining (density of one selected feature) .. GENERATED FROM PYTHON SOURCE LINES 57-60 .. code-block:: default res = exp.segmented_diagnose(model="XGB2", show="distribution_shift", segment_id=0, segment_method="auto", segment_feature="Balance", show_feature="Mortgage", figsize=(5, 4), return_data=True) .. image-sg:: /auto_examples/4_testing/images/sphx_glr_plot_7_segmented_cls_004.png :alt: Distribution plot :srcset: /auto_examples/4_testing/images/sphx_glr_plot_7_segmented_cls_004.png :class: sphx-glr-single-img .. rst-class:: sphx-glr-timing **Total running time of the script:** ( 0 minutes 51.680 seconds) **Estimated memory usage:** 41 MB .. _sphx_glr_download_auto_examples_4_testing_plot_7_segmented_cls.py: .. only:: html .. container:: sphx-glr-footer sphx-glr-footer-example .. container:: binder-badge .. image:: images/binder_badge_logo.svg :target: https://mybinder.org/v2/gh/selfexplainml/piml-toolbox/main?urlpath=lab/tree/./docs/_build/html/notebooks/auto_examples/4_testing/plot_7_segmented_cls.ipynb :alt: Launch binder :width: 150 px .. container:: sphx-glr-download sphx-glr-download-python :download:`Download Python source code: plot_7_segmented_cls.py ` .. container:: sphx-glr-download sphx-glr-download-jupyter :download:`Download Jupyter notebook: plot_7_segmented_cls.ipynb ` .. only:: html .. rst-class:: sphx-glr-signature `Gallery generated by Sphinx-Gallery `_