| Policy number | Policy name | Policy date | Sunset date |
|---|---|---|---|
| PUB 3-C | Instructions to Authors and Reviewers: Dataset and Software Article | 5/16/2026 | 12/31/2030 |
| Section No section assigned |
|||
| Policy source | |||
| Policy text | |||
A. SCOPE AND SUBMISSION GUIDELINESDataset & Software Articles (DS) describe scientifically or clinically valuable open-access datasets and/or software (from now on referred to as material) with high potential for contributing to the research of medical physicists and other investigators working on related problems. In contrast to Research Articles, DS should not include hypothesis testing or data analyses supporting generalizable conclusions. DS should provide detailed descriptions of high impact and re-usable research/clinical practice material, including the value; scope; conditions of acquisition/use; known limitations; curation and quality assurance processes. For datasets, the amount and format of the data should be specified. For software, information on the format of the input and output should be specified. Comprehensive descriptive analysis is required and graphical visualizations are allowed and even encouraged to better understand the material. DS should focus on helping others reuse data and/or software rather than presenting new interpretations, methods, or in-depth analyses. Example of use cases beyond the original application are also encouraged. As a condition of submission and acceptance of an DS, the authors must place the dataset or software (optionally including the corresponding code) files in a recognized and stable data archive prior to submission that makes the material available to other investigators with limited restrictions for commercial use but none for scientific purposes. As with all AAPM Journal articles, submitted DS manuscripts will be peer-reviewed for possible publication. Editorial evaluation and peer review of DSs will consider their novelty, importance to the field, completeness quality of the dataset or software, reusability, and accessibility. The material must be made publicly available without restriction in the event that the DS is accepted for publication. DSs describing dataset collections of images derived from obsolete imaging devices; images or data acquired under poorly controlled conditions; or datasets lacking sufficient annotations, e.g., physician-drawn contours, segmentation landmark, or clinical diagnoses or clinical outcomes, to support hypothesis-driven research will likely be rejected. Software that is only applicable to obsolete technologies or whose validation has not been sufficiently documented will likely be rejected. It is essential that the authors make a convincing case for the value and the potential reusability of the material. A.1 Data acquisition policies and selection of a repositoryIt is AAPM Journals’ policy that all key datasets DS-including computational data, curated data, and data acquired via an experimental or observational procedure – and/or all software -including compiled executables and/or software code and any and all necessary input or accessory files- described by an DSDS manuscript should be placed in an appropriate external repository prior to submission of the manuscript. We believe that this is the best means of making these materials discoverable, reproducible and reusable, and we will work with our authors to identify the most appropriate location(s) for them. For datasets, authors should provide the data in datasets in the ’rawest’ form that will permit substantial reuse. It may be advantageous to release some types of data at multiple levels to enable their broadest reuse, for example, CT sinogram data may best be released as ’raw’ readings, including only detector response and background corrections, as well as more processed sinograms including flood fielding and water-equivalent beam hardening corrections. Authors may also submit supplementary information files – including code, models, workflows and summary tables – via secondary repositories or as supplementary files linked below the Conclusion section to be excluded from the PDF version of the article. However, primary data should not be submitted as supplementary information. Data, including headers and metadata, derived from patient studies must be fully anonymized and free of protected health information (PHI). All human or animal research studies from which the dataset was derived must have adhered to human health services (HHS) and all other applicable regulations and ethical guidelines with any necessary institutional review board (IRB) permissions. For software, authors are encouraged but not necessarily need to provide the underlying code when posting compiled executables. However, the software should be accompanied by any and all data/parameter/input files required for use of the software, in addition to sufficient guides or "read me" files to clarify how the software is used, the computing environment(s) required, and the format of the expected output. All code and information needed for the Referees to access and manipulate the material must be provided at submission. Finding appropriate digital repositories for large and complex datasets and software files is challenging. Currently, neither the American Association of Physicists in Medicine (AAPM) nor our publisher are able to provide archiving and curation for large datasets. A trusted archive must
Currently, we cannot review DS until the material has been archived in a final form prior to submission. If the authors would prefer another repository or foresee some delays before finalizing the submission to the designated repository, this needs to be conferred with the Editor-in-Chief and the managing Deputy Editor. Repository fees are the author’s responsibility. The NIH provides a resource list of repository options by file type and indexing/versioning criteria: (NIH Repository Chart). A generalist repository will meet most authors’ needs. B. DATA ARTICLE FORMATDSs should be limited to 10 published pages. At the Editor’s discretion, additional pages may be published provided the authors are willing to assume excess page charges. B.1 Structured Abstract (<300 words)
B.2 Manuscript structureB.2.1 Introduction:Summary of the scientific and clinical background of the material, including a succinct summary of its novelty and expected clinical and/or scientific reuse and impact. The use of the data or description of the algorithm and/or underlying scientific methods used by the software in any prior publications should be described and cited here. Include a list of possible use cases for the material. In the case of software, describe the typical applications it is suited for, whether for research, clinical practice, and/or educational purposes. Include a link to the material and relevant documentation at the end of the introduction to guide the reader toward accessing and using it. B.2.2 Acquisition and Validation Methods:B.2.2.1: DatasetsThe Methods should include a detailed description of the experimental and/or computational procedures used in producing the data, including full descriptions of the experimental design, data acquisition assays, and any computational processing (e.g. normalization, image feature extraction). If the data are derived from observations of animal or human subjects, appropriate regulatory approvals should be cited and the protocol for subject selection and study design summarized, including anonymization procedure. Related methods should be grouped under corresponding subheadings where possible, and the methods should be described in enough detail so that other researchers can interpret and repeat, if required, the full study. Specific data outputs should be explicitly referenced via data citation (see Data Records and Data Citations, below). For studies using computational tools in the generation or processing of datasets, a statement must be included in the Methods section, under the subheading "computational tools", indicating whether and how the associated code can be accessed, including any restrictions to access. This section should also include information on the versions of any software used, if relevant, and any specific variables or parameters used to generate, test, or process the current dataset. For example, if a condensed history code is used to generate particle tracks, the transport model used must be fully identified, along with all relevant parameter settings (step size, energy cutoffs, etc.) and cross section libraries used. While model and algorithm details maybe referenced, a general overview should be given. Reference to any previous publications using dataset should be highlighted. The Data Validation subsection should present any experiments or analyses that are needed to support the technical quality of the dataset. This section may be supported by figures and tables, as needed. This is a required section; authors must provide information to justify the reliability of their data. Possible content may include:
Generally, this should not include:
B.2.2.2: SoftwareFor software-describing manuscripts, this section should be broken down into three components: algorithm, implementation, and validation. General algorithm/underlying theory This section should provide a brief overview of the underlying theory or model used by the software. Assuming the algorithm/theory itself is already well-established and published elsewhere, this section should focus on briefly referencing the relevant work and explaining the core principles on which the software is based. Implementation This section should provide an in-depth explanation of the software architecture and functionality, so that the end user can:
The specific contents will depend on the architecture and whether source code is included. For software designed with an object-oriented architecture, this section should describe the key classes, methods, and modules, explaining how they interact to achieve the software’s intended functionality. A README file should be included in the software repository, to deal, at a minimum, with software installation and user I/O. Therefore, this Implementation section is not to become an "user’s manual". This section should focus on the inner workings and implementation details of the software. Validation The validation section should describe the tests and measurements performed to ensure the accuracy and reliability of the software. If experimental validation has been carried out, provide details on the experimental setup and how the software results were compared to real-world data. If the software has been validated against pre-existing models or benchmarks (e.g., previously validated software), this section should clearly explain the validation methodology and the criteria for success. The focus here is to demonstrate that the software’s results are trustworthy and reliable within its intended scope. Additional recommendations:
B.2.3 (for datasets only) Data Format and Usage Notes:The Data Format section should open with an overview of the data file structures and their format including the repository where this information is stored and the methodology for accessing the data. The authors should strive to make the data presentation to follow the FAIR (Findable, Accessible, Interoperable, Re-usable) guidelines, which publishing as an DS will aid this process. A detailed description should be given of the format of each data record associated with this work and the physical identity and units associated with each data field. For large, complex datasets, tables should be used to succinctly describe data record format and content and should clearly indicate the samples and subjects, their provenance, and the experimental manipulations performed on each with clear references to the methods described in the previous section. This section should contain brief instructions to assist other researchers with access and manipulation of the data. This may include discussion of software packages that are suitable for analyzing the data, suggested downstream processing steps (e.g. normalization, etc.), or tips for integrating or comparing the data records with other datasets. Authors are highly encouraged to provide code, programs or data-processing workflows if they may help others understand or reuse the data. Any specialized software tools needed to access the data set or transform it into an accessible data format, such as binary data readers, interpolation code, or recovery of data from linear combinations of basis functions, should be provided in the data repository or linked to it using open-source. See the software guidance in Section B.2.2.2. B.2.4 (for software only) Results:Step-by-step example This section should provide a guided example, featuring detailed explanations. The example should illustrate the main use case for the software, ensuring that users can follow along and understand how the software operates in practice. All corresponding files (inputs, outputs, and relevant code) should be included in the software repository, with clear links or references to these files provided within the article. Validation results Present and discuss validation results. This section can also include figures or tables to help visualize the comparison between the measured data and the model’s outputs, allowing readers to clearly see the software’s performance. Other resources and use cases If additional files or examples are provided with the software, describe them here. These examples can be presented as proofs of concept that showcase the potential versatility of the software, even if they are not fully validated or outside the initial scope. For instance, describe particular cases or possible future applications of the software, or discuss scenarios where users might adapt the software beyond the core functionality (if applicable). B.2.4 DiscussionThis section should discuss future applications, analyses, hypotheses; or potential dataset/software extensions enabled by making the authors’ material available. Limitations of the proposed material with respect to future scientific uses and comparison to competing material of the same type should be addressed in this section. For software, highlight any significant findings or observations from the validation process and how they support the intended applications. Briefly discuss the next steps, such as future improvements or extensions that could be made, or how the software could be adapted to different use cases in the future. B.2.5 ConclusionA succinct summary description of the material, its contribution to the literature, and potential applications Appendices Additional descriptions of files or parameters. Derivation of mathematical expressions used in the software, if no references exist. B.2.6 AcknowledgementsIn addition to acknowledging funding sources and contributions from non-authors, any potential conflicts of interest should be specified B.2.7 ReferencesFollow the formatting requirements outlined in the full Instructions to Authors B.3 Author Checklist
B.4 Instructions to Referees:
|
|||
| Policy version history | ||||
| Policy number | Policy name | Policy date | Sunset date | Active? |
|---|---|---|---|---|
| PUB 3-A | Instructions to Authors and Reviewers: Medical Physics Dataset Article | 3/20/2018 | 3/20/2023 | Inactive |
| PUB 3-B | Instructions to Authors and Reviewers: Medical Physics Dataset Article | 3/21/2023 | 3/20/2028 | Inactive |
| PUB 3-C | Instructions to Authors and Reviewers: Dataset and Software Article | 5/16/2026 | 12/31/2030 | Active |



















