Alternativer Identifier:
-
Verwandter Identifier:
Ersteller/in:
Bach, Jakob https://orcid.org/0000-0003-0301-2798 [Bach, Jakob]
Beitragende:
-
Titel:
Experimental Data for the Dissertation "Leveraging Constraints for User-Centric Feature Selection"
Weitere Titel:
-
Beschreibung:
(Abstract) These are the experimental data for the dissertation> Bach, Jakob. "Leveraging Constraints for User-Centric Feature Selection" at the [Department of Informatics](https://www.informatik.kit.edu/english/index.php) of the [Karlsruhe Institute of Technology](https://www.kit.edu/english/). See the `README` for details. Many input datasets (which we also provide here) either - originate from [OpenML](https://www.openml.org) and are CC-BY-licensed or - originate from [PMLB](https://epistasislab.github.io/pmlb/) and are MIT-licensed. Please see the `LICENSE` files in the corresponding `datasets/` subfolders for details.
(Technical Remarks) # Experimental Data for the Dissertation "Leveraging Constraints for User-Centric Feature Selection" These are the experimental data for the dissertation> Bach, Jakob. "Leveraging Constraints for User-Centric Feature Selection" at the [Department of Informatics](https://www.informatik.kit.edu/english/index.php) of the [Karlsruhe Institute of Technology](https://www.kit.edu/english/). The subfolders correspond to individual chapters of the dissertation: - `chap4-syn`: Chapter 4 - "Evaluating the Impact of Constraints on Feature-Selection Results" - `chap5-ms`: Chapter 5 - "Formulating Scientific Hypotheses as Constraints - A Case Study" - `chap6-afs`: Chapter 6 - "Finding Alternative Feature Sets" - `chap7-csd`: Chapter 7 - "Discovering Sparse and Alternative Subgroup Descriptions" See the corresponding `README` files in the subfolders for more information. We already published prior versions of the experimental data, as the dissertation bases on prior papers: - Chapters 4 and 5: [Data](https://doi.org/10.35097/1345) for the [paper](https://doi.org/10.1007/s42979-022-01338-z) "An Empirical Evaluation of Constrained Feature Selection" - Chapter 6: [Data](https://doi.org/10.35097/1920) for the [paper](https://doi.org/10.48550/arXiv.2307.11607) "Finding Optimal Diverse Feature Sets with Alternative Feature Selection" (Version 2) - Chapter 7: [Data](https://doi.org/10.35097/caKKJCtoKqgxyvqG) for the [paper](https://doi.org/10.48550/arXiv.2406.01411) "Using Constraints to Discover Sparse and Alternative Subgroup Descriptions" (Version 1) For Chapters 4, 5, and 7, we mainly consolidate the existing data. In particular, all `*.csv` files (datasets and results) remain unchanged compared to the data linked above. For Chapter 6, we reran the experimental pipeline to integrate a change for the feature-selection method "Greedy Wrapper". The other feature-selection methods have not changed, but experimental data may slightly differ regarding runtimes and for results affected by solver timeouts. For all four chapters, the following files (in each subfolder) differ from prior versions: - `Evaluation_console_output.txt`: The dissertation's evaluation partly differs from the papers' evaluations (e.g., some analyses added, adapted, or removed). - `README.md`: We adapted these files to the context of the dissertation, added some explanations, and proofread them.
Schlagworte:
feature selection
subgroup discovery
constraints
alternatives
explainability
interpretability
XAI
Zugehörige Informationen:
-
Sprache:
-
Erstellungsjahr:
Fachgebiet:
Computer Science
Objekttyp:
Dataset
Datenquelle:
-
Verwendete Software:
-
Datenverarbeitung:
-
Erscheinungsjahr:
Rechteinhaber/in:
Förderung:
-
Name Speichervolumen Metadaten Upload Aktion
Status:
Publiziert
Eingestellt von:
kitopen
Erstellt am:
Archivierungsdatum:
2024-11-08
Archivgröße:
307,1 MB
Archiversteller:
kitopen
Archiv-Prüfsumme:
29870ce49cee60860560c52513b31ac8 (MD5)
Embargo-Zeitraum:
-