Alternativer Identifier:

(KITopen-DOI) 10.5445/IR/1000146001

Verwandter Identifier:

(Is Identical To) https://publikationen.bibliothek.kit.edu/1000146001 - URL

Ersteller/in:

Schmid, Larissa https://orcid.org/0000-0002-3600-6899 [Karlsruher Institut für Technologie (KIT), Karlsruher Institut für Technologie (KIT)]

Beitragende:

Titel:

Performance-Detective: Automatic Deduction of Cheap and Accurate Performance Models - Supplementary Material

Weitere Titel:

Beschreibung:

(Technical Remarks) Supplementary material for "Performance-Detective: Automatic Deduction of Cheap and Accurate Performance Models" (DOI 10.1145/3524059.3532391) # Step 1: System analysis We provide the processed JSON output of Perf-Taint for the Pace3D and Kripke case studies, as well as the bitcode of Kripke used as... Supplementary material for "Performance-Detective: Automatic Deduction of Cheap and Accurate Performance Models" (DOI 10.1145/3524059.3532391) # Step 1: System analysis We provide the processed JSON output of Perf-Taint for the Pace3D and Kripke case studies, as well as the bitcode of Kripke used as input of Perf-Taint. Because Pace3D is closed source, we cannot provide the source code to reproduce the analysis results. # Step 2: Experiment design We provide scripts that: * check the output of Perf-Taint for parameters not interacting with each other, * detect iteration parameters potentially linearly influencing the runtime of the calculation * check the coefficient of variation of single iterations to confirm the linear influence # Step 3: Instrumented experiments We provide the profiles of all measurements conducted for the training points. For Pace3D, we had to add one function to the instrumentation filter manually because of a bug in Score-P: Functions declared as static inline inside a header file which are then called from different translation units are not instrumented correctly by Score-P. Thus, the resulting profiles are incorrect. However, we can work around this bug by including the function calling the static inline function. ## Training Data Pace3D ### Training Data Performance-Detective Performance-Detective derived a minimal experiment design with 25 configurations to be measured once, resulting in 25 measurements. The data for Extra-P is splitted into two folders that contain the same measurements that are labeled differently. As we did not implement modeling for the optimized subset, we later create two models (one using procs and vol, one using procs and cubes) and extract the model of each function from the respective model (depending on whether the functions relies on vol OR on cubes). The data for PIM is the same as in the Extra-P folder, just labeled with all the values measured. ### Training Data Full-Factorial 625 measurements for all possible combinations of the five values of procs, vol, and cubes. Each measurement was repeated 5 times. ### Training Data Plackett-Burman 49 samples selected using a random seed and 5 levels -- effectively a subset of the full-factorial measurements. Each measurement was repeated 5 times. ## Training Data Kripke ### Training Data Performance-Detective Performance-Detective derived a minimal experiment design with 5 configurations to be measured once, resulting in 5 measurements. The data for Extra-P is splitted into two folders that contain the same measurements that are labeled differently. As we did not implement modeling for the optimized subset, we later create two models (one for procs, one for dirsets) and extract the model of each function from the respective model (depending on whether the functions relies on procs OR on dirsets). However, for the single-parameter modeler, modeling based on dependencies from Perf-Taint is not yet implemented. Therefore, the measurements are labeled as two parameter-experiments following the policy detailed in the respective folder. The data for PIM is the same as in the Extra-P folder, just labeled with all the values measured. ### Training Data Full-Factorial 125 measurements for all possible combinations of the five values of procs and dirsets. Each measurements was repeated 5 times. ### Training Data Plackett-Burman 10 samples selected using a random seed and 5 levels -- effectively a subset of the full-factorial measurements. Each measurement was repeated 5 times. # Step 4: Modeling and Evaluation Data of each case study is in the respective folder. ## Evaluation Measurements Extra- and interpolated measurement data in the respective folder. ## Extra-P Contains script for calculating the model errors regarding extra- and interpolated evaluation measurements. ### Models Contains Extra-P models as well as a converter from Extra-P format to text (creates PerformanceModel.py), that is used in `calculate_model_errors.py`. It requires an installation of Extra-P to run. ## Performance-Influence Models We use the scripts by [Weber et al.](https://github.com/AI-4-SE/White-Box-Performance-Influence-Models/tree/main/supplementary-website/code/modeling), partly modified to account for the configurations of the case studies and modeling based on known dependencies to functions. ### Data Contains the data of the cubex files in csv format. Data is parsed with `create_csv_from_cubex.py`. ### Model errors Contains the csv files with the model errors of the respective models. Files are generated by `learn_method_level_model_with_deps.py` and `learn_method_level_without_deps.py`, respectively. # License The files `calculate_model_errors.py`, `extrap_to_text.py`, and `create_csv_from_cubex.py` are modified parts of the Extra-P Software (cf. [cube_file_reader2.py](https://github.com/extra-p/extrap/blob/master/extrap/fileio/cube_file_reader2.py) and the license file `LICENSE-BSD3-EXTRAP`). The files `learn_method_level_model_with_deps.py` and `learn_method_level_model_without_deps.py` are modified parts of the supplementary material of [Weber et al.](https://github.com/AI-4-SE/White-Box-Performance-Influence-Models/tree/main/supplementary-website/code/modeling). The license file `LICENSE-GPL-PIM` applies to these files.

Supplementary material for "Performance-Detective: Automatic Deduction of Cheap and Accurate Performance Models" (DOI 10.1145/3524059.3532391)

Step 1: System analysis

We provide the processed JSON output of Perf-Taint for the Pace3D and Kripke case studies, as well as the bitcode of Kripke used as input of Perf-Taint. Because Pace3D is closed source, we cannot provide the source code to reproduce the analysis results.

Step 2: Experiment design

We provide scripts that:

check the output of Perf-Taint for parameters not interacting with each other,
detect iteration parameters potentially linearly influencing the runtime of the calculation
check the coefficient of variation of single iterations to confirm the linear influence

Step 3: Instrumented experiments

We provide the profiles of all measurements conducted for the training points.
For Pace3D, we had to add one function to the instrumentation filter manually because of a bug in Score-P: Functions declared as static inline inside a header file which are then called from different translation units are not instrumented correctly by Score-P. Thus, the resulting profiles are incorrect. However, we can work around this bug by including the function calling the static inline function.

Training Data Pace3D

Training Data Performance-Detective

Performance-Detective derived a minimal experiment design with 25 configurations to be measured once, resulting in 25 measurements.
The data for Extra-P is splitted into two folders that contain the same measurements that are labeled differently. As we did not implement modeling for the optimized subset, we later create two models (one using procs and vol, one using procs and cubes) and extract the model of each function from the respective model (depending on whether the functions relies on vol OR on cubes).
The data for PIM is the same as in the Extra-P folder, just labeled with all the values measured.

Training Data Full-Factorial

625 measurements for all possible combinations of the five values of procs, vol, and cubes. Each measurement was repeated 5 times.

Training Data Plackett-Burman

49 samples selected using a random seed and 5 levels -- effectively a subset of the full-factorial measurements. Each measurement was repeated 5 times.

Training Data Kripke

Training Data Performance-Detective

Performance-Detective derived a minimal experiment design with 5 configurations to be measured once, resulting in 5 measurements.
The data for Extra-P is splitted into two folders that contain the same measurements that are labeled differently. As we did not implement modeling for the optimized subset, we later create two models (one for procs, one for dirsets) and extract the model of each function from the respective model (depending on whether the functions relies on procs OR on dirsets). However, for the single-parameter modeler, modeling based on dependencies from Perf-Taint is not yet implemented. Therefore, the measurements are labeled as two parameter-experiments following the policy detailed in the respective folder.
The data for PIM is the same as in the Extra-P folder, just labeled with all the values measured.

Training Data Full-Factorial

125 measurements for all possible combinations of the five values of procs and dirsets. Each measurements was repeated 5 times.

Training Data Plackett-Burman

10 samples selected using a random seed and 5 levels -- effectively a subset of the full-factorial measurements. Each measurement was repeated 5 times.

Step 4: Modeling and Evaluation

Data of each case study is in the respective folder.

Evaluation Measurements

Extra- and interpolated measurement data in the respective folder.

Extra-P

Contains script for calculating the model errors regarding extra- and interpolated evaluation measurements.

Models

Contains Extra-P models as well as a converter from Extra-P format to text (creates PerformanceModel.py), that is used in calculate_model_errors.py. It requires an installation of Extra-P to run.

Performance-Influence Models

We use the scripts by Weber et al., partly modified to account for the configurations of the case studies and modeling based on known dependencies to functions.

Data

Contains the data of the cubex files in csv format. Data is parsed with create_csv_from_cubex.py.

Model errors

Contains the csv files with the model errors of the respective models. Files are generated by learn_method_level_model_with_deps.py and learn_method_level_without_deps.py, respectively.

License

The files calculate_model_errors.py, extrap_to_text.py, and create_csv_from_cubex.py are modified parts of the Extra-P Software (cf. cube_file_reader2.py and the license file LICENSE-BSD3-EXTRAP).
The files learn_method_level_model_with_deps.py and learn_method_level_model_without_deps.py are modified parts of the supplementary material of Weber et al.. The license file LICENSE-GPL-PIM applies to these files.

Schlagworte:

Zugehörige Informationen:

Sprache:

Herausgeber/in:

Karlsruhe Institute of Technology

Erstellungsjahr:

2022

Fachgebiet:

Computer Science

Objekttyp:

Dataset

Datenquelle:

Verwendete Software:

Datenverarbeitung:

Erscheinungsjahr:

2023

Rechteinhaber/in:

Schmid, Larissa https://orcid.org/0000-0002-3600-6899

Förderung:

Mehr anzeigen Weniger anzeigen

Name	Speichervolumen	Metadaten	Upload	Aktion

Status:

Publiziert

Eingestellt von:

kitopen

Erstellt am:

2023-04-20

Archivierungsdatum:

2023-06-21

Archivgröße:

943,9 MB

Archiversteller:

kitopen

Archiv-Prüfsumme:

50d8d872695c61740342ac3ae3076896 (MD5)

Embargo-Zeitraum:

Die Metadaten wurden nachträglich korrigiert. Die ursprünglichen Metadaten sind nach Download des Datenpakets verfügbar.

DOI: 10.35097/1330

Publikationsdatum: 2023-06-21

Datenpaket herunterladen

Herunterladen (943,9 MB)

Metadaten herunterladen

Statistik

0
Views

0
Downloads

Lizenz für das Datenpaket

Dieses Werk ist lizenziert unter
CC BY 4.0

Datenpaket zitieren

Schmid, Larissa (2023): Performance-Detective: Automatic Deduction of Cheap and Accurate Performance Models - Supplementary Material. Karlsruhe Institute of Technology. DOI: 10.35097/1330

Datenpaket: Performance-Detective: Automatic Deduction of Cheap and Accurate Performance Models - Supplementary Material

Step 1: System analysis

Step 2: Experiment design

Step 3: Instrumented experiments

Training Data Pace3D

Training Data Performance-Detective

Training Data Full-Factorial

Training Data Plackett-Burman

Training Data Kripke

Training Data Performance-Detective

Training Data Full-Factorial

Training Data Plackett-Burman

Step 4: Modeling and Evaluation

Evaluation Measurements

Extra-P

Models

Performance-Influence Models

Data

Model errors

License