This dataset contains 7 converted XRay datasets, part of the OmniMedSeg superset. All datasets are converted to a standardized structure with binary masks for each segmentation target.
ARCADE: Creative Commons Zero v1.0 Universal
BTXRD: CC-BY 4.0
COVID19_CXR: Each image has license specified here: https://github.com/GeneralBlockchain/covid-19-chest-xray-segmentations-dataset
HIPBONE: CC-BY 4.0
PANDENTAL: CC-BY 4.0
PTX_498: CC-BY 4.0
TEETH_SEG: CC0 1.0
================================================================================
DETAILED INFORMATION BY DATASET
[1] ARCADE
License: Creative Commons Zero v1.0 Universal
Dataset link: https://zenodo.org/records/10390295
Metadata file: XRay/ARCADE/metadata.json
Citation (bibtex):
@article{popov2024dataset,
title={Dataset for automatic region-based coronary artery disease diagnostics using X-ray angiography images},
author={Popov, Maxim and Amanturdieva, Akmaral and Zhaksylyk, Nuren and Alkanov, Alsabir and Saniyazbekov, Adilbek and Aimyshev, Temirgali and Ismailov, Eldar and Bulegenov, Ablay and Kuzhukeyev, Arystan and Kulanbayeva, Aizhan and others},
journal={Scientific data},
volume={11},
number={1},
pages={20},
year={2024},
publisher={Nature Publishing Group UK London}
}
[2] BTXRD
License: CC-BY 4.0
Dataset link: https://figshare.com/articles/dataset/A_Radiograph_Dataset_for_the_Classification_Localization_and_Segmentation_of_Primary_Bone_Tumors/27865398?file=50653575
Metadata file: XRay/BTXRD/metadata.json
Citation (bibtex):
@article{Yao2024,
author = "Shunhan Yao and Yuanxiang Huang and Xiaoyu Wang and Yiwen Zhang and Ian Costa Paixao and Zhikang Wang and Charla Lu Chai and Hongtao Wang and Dinggui Lu and Geoffrey I Webb and ShanShan Li and Yuming Guo and Qingfeng Chen and Jiangning Song",
title = "{A Radiograph Dataset for the Classification, Localization, and Segmentation of Primary Bone Tumors}",
year = "2024",
month = "11",
url = "https://figshare.com/articles/dataset/A_Radiograph_Dataset_for_the_Classification_Localization_and_Segmentation_of_Primary_Bone_Tumors/27865398",
doi = "10.6084/m9.figshare.27865398.v1"
}
[3] COVID19_CXR
License: Each image has license specified here: https://github.com/GeneralBlockchain/covid-19-chest-xray-segmentations-dataset/blob/master/metadata.csv . Including Apache 2.0, CC BY-NC-SA 4.0, CC BY 4.0.
Dataset link: https://github.com/GeneralBlockchain/covid-19-chest-xray-segmentations-dataset
Metadata file: XRay/COVID19_CXR/metadata.json
Citation (bibtex):
@article{cohen2020covidProspective,
title={COVID-19 Image Data Collection: Prospective Predictions Are the Future},
author={Joseph Paul Cohen and Paul Morrison and Lan Dao and Karsten Roth and Tim Q Duong and Marzyeh Ghassemi},
journal={arXiv 2006.11988},
url={https://github.com/ieee8023/covid-chestxray-dataset},
year={2020}
}
[5] HIPBONE
License: CC-BY 4.0
Dataset link: https://data.mendeley.com/datasets/zm6bxzhmfz/1
Metadata file: XRay/HIPBONE/metadata.json
Citation (bibtex):
@misc{Gut_2021,
author = {Daniel Gut},
title = {X-ray images of the hip joints},
year = {2021},
publisher = {Mendeley Data},
version = {V1},
doi = {10.17632/zm6bxzhmfz.1},
url = {https://data.mendeley.com/datasets/zm6bxzhmfz/1}
}
[6] PANDENTAL
License: CC-BY 4.0
Dataset link: https://data.mendeley.com/datasets/hxt48yk462/1
Metadata file: XRay/PANDENTAL/metadata.json
Citation (bibtex):
@article{abdi2015automatic,
title={Automatic segmentation of mandible in panoramic x-ray},
author={Abdi, Amir Hossein and Kasaei, Shohreh and Mehdizadeh, Mojdeh},
journal={Journal of Medical Imaging},
volume={2},
number={4},
pages={044003--044003},
year={2015},
publisher={Society of Photo-Optical Instrumentation Engineers}
}
[7] PTX_498
License: CC-BY 4.0
Dataset link: https://zenodo.org/records/8266529
Metadata file: XRay/PTX_498/metadata.json
Citation (bibtex):
@article{wang2021deepsdm,
title={DeepSDM: Boundary-aware pneumothorax segmentation in chest X-ray images},
author={Wang, Yunpeng and Wang, Kang and Peng, Xueqing and Shi, Lili and Sun, Jing and Zheng, Shibao and Shan, Fei and Shi, Weiya and Liu, Lei},
journal={Neurocomputing},
volume={454},
pages={201--211},
year={2021},
publisher={Elsevier}
}
[8] TEETH_SEG
License: CC0 1.0
Dataset link: https://www.kaggle.com/datasets/humansintheloop/teeth-segmentation-on-dental-x-ray-images
Metadata file: XRay/TEETH_SEG/metadata.json
Citation (bibtex):
@misc{humans_in_the_loop_2023,
title={Teeth Segmentation on dental X-ray images},
url={https://www.kaggle.com/dsv/5884500},
DOI={10.34740/KAGGLE/DSV/5884500},
publisher={Kaggle},
author={Humans In The Loop},
year={2023}
}
================================================================================
IMPORTANT NOTES
- All datasets listed are publicly available
- Full metadata is stored in each dataset's metadata.json file
- For CC0-licensed datasets, attribution is appreciated but not required