DBM MRMC 2.1 BETA VERSION 2 WINDOWS™ Operating System (September 2006) Kevin S. Berbaum, Ph.D. Medical Image Perception Laboratory Department of Radiology The University of Iowa e-mail: [email protected] Charles E. Metz, Ph.D. Kurt Rossmann Laboratories For Radiologic Image Research Department of Radiology The University of Chicago e-mail: [email protected] Lorenzo L. Pesce, Ph.D. Kurt Rossmann Laboratories For Radiologic Image Research Department of Radiology The University of Chicago Kevin M. Schartz, Ph.D., M.C.S. Medical Image Perception Laboratory Department of Radiology The University of Iowa DBM MRMC User's Guide (Windows OS) Page 1 of 45 Contents Contents ............................................. 2 System Requirements & Notes .......................... 4 Disclaimer & licensing agreement ............................4 System Requirements .........................................4 Where to find this software .................................5 How to install (and uninstall) the software .................5 Request for feedback ........................................6 An Overview of DBM MRMC2.X ............................ 7 Purposes of DBM MRMC.........................................7 Warnings and notes about the DBM MRMC algorithm..............8 Statistical models.....................................................8 ROC curve models and indices...........................................8 Sketch of the various steps of the DBM MRMC algorithm..................9 Acknowledgments ............................................10 To Run DBM MRMC ..................................... 11 To start the program .......................................11 Figure 1. Program main window. ............................12 Options available at the start .............................12 Execution of an MRMC analysis ..............................13 Open and read the input file..........................................13 Select the analysis options...........................................14 Recategorization options..............................................15 Running the analysis..................................................15 Reading result of analysis............................................16 Input file description .............................. 18 Input data .................................................18 Input file types ...........................................18 LABMRMC input file format description ......................18 To create an input file using Microsoft Word™ ........................21 To create an input file using Microsoft Excel™ .......................23 Legacy MRMC input file format description ..................25 Modern MRMC input file format description ..................30 Output files description ............................ 31 List of output files with description of their content .....31 Description of program output ..............................32 DBM MRMC User's Guide (Windows OS) Page 2 of 45 Definition of ROC Terms ............................. 33 Actually-positive and actually-negative ....................33 TPF, FPF, TP, FP, TN, FN ...................................33 Confidence ratings and continuous ratings and categorical data...........................................................33 Area, Area under the curve, Az ..............................34 Leave-None-Out, Leave-One-Out and pseudovalues .............34 Conventional binormal model (RSCORE model) .................34 Proper binormal model (PROPROC or PROPROC2 model) ..........35 LABROC4/LABROC5 categorization .............................36 APPENDIX ............................................ 38 Using the Recategorization menu option................................38 References .......................................... 45 DBM MRMC User's Guide (Windows OS) Page 3 of 45 System Requirements & Notes This version of DBM MRMC has been tested on the 32-bit versions of Microsoft Windows 2000™ and XP™. The software has not been tested on Windows 95, 98, NT, or Vista. We plan to release versions for other operating systems in the future. In this document we will be using the expression “DBM MRMC” to refer to the version of DBM MRMC downloaded with this user’s guide. Occasionally we will specify which version of this software this guide actually refers to. Most comments apply to all versions. Disclaimer & licensing agreement The use of DBM MRMC is constrained by the licensing agreement that users are required to acknowledge before downloading the software. Using the software assumes acknowledgement of that agreement. We ask that you do not distribute this software to others. However, any copying or distribution of this software implies that the person distributing it assumes full responsibility for informing the recipients of the licensing agreement with The University of Chicago and University of Iowa. Modification of the software should be indicated in any modified version of this software. Failure to acknowledge the source of this software is breach of copyright law. By using this software to analyze data, you agree that any publications based on your analysis will cite the appropriate DBM MRMC references listed at the bottom of the ANOVA output that is produced. Also reference the Medical Image Perception Laboratory website (http://perception.radiology.uiowa.edu) and the Kurt Rossmann Laboratories for Radiologic Image Research website (http://xray.bsd.uchicago.edu/krl/), but do NOT reference this user’s manual. Although this software has been carefully tested, neither The University of Chicago, The University of Iowa nor any of the individuals (present or past) who participated in the development or testing of this software and user’s guide are responsible for any errors or for any damages that may result from use of the software. Inquiries or comments concerning this program should be directed to the addresses on the cover page System Requirements • • • A PC equipped with a Pentium or later processor. Windows 2000™ or Windows XP™ The program requires the Microsoft .NET Framework Version 1.1 to be installed on the system. You can visit DBM MRMC User's Guide (Windows OS) Page 4 of 45 http://msdn.microsoft.com/netframework/downloads/howtoget.aspx to check whether you have the .NET framework installed and to view instructions for downloading via Windows Update. Where to find this software DBM MRMC 2.X is available from the following web sites: • http://perception.radiology.uiowa.edu/ Follow the link under the “software” tab on top of the page • http://xray.bsd.uchicago.edu/krl/ Follow the link at the “software” tab on the right of the page. In case of downloading problems, use the contacts provided on the cover page. How to install (and uninstall) the software • • • • • • • • Connect to either of the web sites mentioned in the previous section Follow the online instructions to download the DBM MRMC zip archive to your computer. When downloading the file, the user should select the “Save” option rather than the “Open” option because it is necessary to save the zip file to the user’s hard drive for proper extraction. Unzip the DBM MRMC archive (select option to preserve folder structure). Select the “DBM” folder from the location where you unzipped the archive Select the “DBM MRMC 2.1 beta2 Installed” folder The zip file contains three files: setup.exe, setup.ini, and DBM_MRMC_Beta_Setup.msi. Double click the icon “setup.exe” Follow the instructions of the installation wizard The installer will install the program to the Program Files folder on the user’s machine and create a Start menu entry and desktop shortcut for the program. An uninstaller for the program is also created in the Add/Remove program applet in the Control Panel. Note1: We recommend installing the program in the installation folders suggested during the installation process itself. In case other installation schemes should be followed we won’t be able to help you in case the installation should fail. Note2: The standard installation will create an icon on the desktop as well as a Start menu entry. The program can be uninstalled using the “Add or Remove Programs” applet in the Control Panel. Note3: The standard installation is to C:\Program Files\DBM Software\DBM MRMC 2.1 beta (from now on called the DBM MRMC folder), in this directory the DLLs (dynamically linked library) for all the subunits of DBM MRMC are also available. The DLLs can be called by external programs such as SAS and used independently. This topic is addressed in more detail on the Medical Image Perception Laboratory website (http://perception.radiology.uiowa.edu). DBM MRMC User's Guide (Windows OS) Page 5 of 45 Disclaimer: We cannot assume any responsibility for problems created during the installation or un-installation process Request for feedback DBM MRMC 2.1 BETA2 is a Beta version of this software. Although we have tested this software extensively and fixed all of the bugs discovered, there is no guarantee that we have found every bug. If you discover any possible errors or anomalies in the software, please contact us immediately so that we may fix the problem. Our address appears on the cover page of this guide. DBM MRMC User's Guide (Windows OS) Page 6 of 45 An Overview of DBM MRMC2.X DBM MRMC is designed to perform an analysis of variance when both reader and case variation are relevant to calculate the statistical significance of the differences between different treatments (diagnostic tests, or modalities). It will also perform an analysis with the readers or the cases as fixed factors, if the user requires it. It should be noted that the user must determine which ANOVA option is appropriate for her or his experimental design—all options are not appropriate for all experimental designs. It uses conventional ROC analysis (1) on both continuously-distributed and ordinal category data (e.g., “confidence-rating”) diagnostic test results. There are no predetermined limits on the number of readers, treatments/modalities, or cases. Purposes of DBM MRMC DBM MRMC is designed to determine the statistical significance of difference between ROC indices when the performance of a diagnostic device is affected both by the cases analyzed (e.g., patient, specimen or sample) and by the observer (e.g., technician or physician) The required data are paired test results from actually-negative (e.g., “normal” or “noise”) cases and actually-positive (e.g., “diseased” or “signal”) cases. This means that every reader will read every case using every modality. This is a fully-crossed design. The description of the ANOVA model can be found in Dorfman et al. (2). However changes have been introduced to the original model by Hillis et al. (3, 4) and those changes are implemented in the current version of the code. A familiarity with these three articles is assumed in the remainder of this document. DBM MRMC can compute the ANOVA analysis under the following models: A. Treating both readers and cases as random samples B. Treating only cases as random samples C. Treating only readers as random samples It should be remembered that only option A allows the results to be generalized to the population of readers and cases from which the used cases and readers are sampled. Option B assumes the user only wishes to generalize to the population of cases, and option C assumes the user only wishes to generalize to the population of readers. DBM MRMC User's Guide (Windows OS) Page 7 of 45 Warnings and notes about the DBM MRMC algorithm Note: The input data must not include more than one test result value from each readertreatment-case combination. If multiple test results from any reader-treatment-case combination are included in the input, the program will overestimate the statistical significance of any apparent difference between the ROC curves, thereby invalidating the statistical test. When multiple test results are available from each reader-treatmentcase combination (for example, from replication in each condition) please contact the people on the cover of this document or use a different analysis. Statistical models The available statistical models were already discussed in the previous section and references therein. ROC curve models and indices DBM MRMC computes the ROC indices used in the statistical analysis using the following models for the ROC data: Two Proper models (to be defined below): • Proper binormal model (from now on called PROPROC2 because it is the second version of the proper binormal model implemented at The University of Chicago) • Contaminated binormal model (from now on called CBM2, because it is the second version of the proper binormal model implemented at The University of Iowa) One “conventional” model: • Conventional binormal model in the RSCORE 4.7 implementation (referred to as RSCORE in this documentation) One empirical or non-parametric method: • Trapezoidal/Wilcoxon, non-parametric estimation of ROC indices The trapezoidal or Wilcoxon model does not make any distributional assumption with regards to the positive and negative cases--this is why it is non-parametric. Details about this model can be found in Delong et al. (5). The other three models are semi-parametric, in the sense monotonic transformations of the test result values (the quantity measured in the experiment) are assumed to have normal distributions, one for the negative and one for the positive cases (actually two for the contaminated model), even if the decisions about the state of cases are not made in the same way. PROPROC2 assumes that decisions are made using likelihood ratios, i.e., the decisions about an unknown case are based upon the relative likelihood of being positive for that specific value (as an ideal observer would do) (6). CBM2, assumes that part of the positive cases are indistinguishable from the negative cases (the contamination), and part is separated (7). RSCORE assumes that the true ROC curve for each reader-treatment combination plots as a DBM MRMC User's Guide (Windows OS) Page 8 of 45 straight line on "normal-deviate" axes, or equivalently, that the input data follow normal distributions for both the actually negative and actually positive fractions of the population after some unknown monotonic transformation. ROC curves measured in a broad variety of fields have shapes compatible with these assumptions. The assumptions are often acceptable even when the raw data have multimodal and/or skewed distributions because it is equivalent to assume that some variable functionally related to the one measured in the experiment (the test result value) is normal for both conditional distributions (actually negative and actually positive), respectively. The proper models have the great advantage of not producing curves that have change in convexity, which is considered incompatible with medical evidence (8). A number of indices are available for the analysis: • Area under the curve (AUC) • Partial area under the curve between two value of FPF or two values of TPF (to be defined) • Sensitivity for a value of specificity • Specificity for a value of sensitivity For a description of the meaning of the different indices the users are referred to the book of Pepe (1) or other papers or books on medical diagnostics. Note that not all indices are available for all models for different versions of the software. Refer to the copy of the software you have or to the web site for the most complete version. In general more recent versions have more features. Note: any of the indices can be used for the analysis. If another index is desired, the analysis needs to be repeated. It should be noted that repeating the analysis for multiple indices and “looking” for the best “statistical significance” technically violates the assumptions of the statistical analysis and invalidates the statistical tests. Sketch of the various steps of the DBM MRMC algorithm Data Categorization step. In a preliminary step, DBM MRMC automatically categorizes continuously-distributed input data for each reader-treatment combination in an attempt to produce a useful spread of operating points on each ROC curve. The program either follows the LABROC4 algorithm that produces no loss of ROC information (9) or a binning algorithm. If the data is categorical, empty categories are removed and identical categories are collapsed. See section “Definition of ROC terms” for details. This again does not affect the estimation of any ROC index, whether done non-parametrically or using maximum likelihood estimation (9). This collapsing is done to render the computation more stable, and, again, does not affect the results. Pseudovalue creation step. Each of the marginal categorical data sets created by the categorization for each reader-treatment combination is then used independently to estimate the desired ROC index. These parameter estimates are called the Leave-None-Out estimates. The program then leaves each case out of the reader-treatment sample in turn and computes new estimates. These estimates are called the Leave-One-Out estimates [one for each case]. Next, the Leave-None-Out and the Leave-One-Out estimates of the ROC index are combined to create “pseudovalue” estimates for each reader-treatment-case combination (2, 4). This step created a matrix, with data entries (pseudovalues) for each reader-treatment-case combination. DBM MRMC User's Guide (Windows OS) Page 9 of 45 ANOVA step. DBM MRMC takes these “pseudovalue” estimates and computes, using ANOVA methods (2-4) the statistical significance of the differences among treatments, etc. ⇒ Note: After each of the cases is removed to compute the Leave-One-Out, the categorization procedure may be repeated to insure the numerical stability of the algorithm. The comments about categorization made before apply also to this step. Acknowledgments This guide was compiled by Lorenzo Pesce and Kevin Schartz. The algorithm employed by DBM MRMC was designed by Donald Dorfman, Kevin Berbaum, Charles Metz and Steve Hillis. The program itself was written by Lorenzo Pesce and Kevin Schartz, who also designed part of the algorithm and tested it. The code is an almost completely rewritten version of parts originally written by Benjamin A. Herman and Hatem Abu-Dagga, with some code taken from earlier programs written by Jong-Her Shen, Helen Kronman and Xiaochuan Pan. Development of this software was supported by the National Institute of Health under Grant RO1 EB000863 (Kevin S. Berbaum, P.I.). By using this software to analyze data, you agree that any publications based on your analysis will cite the appropriate DBM MRMC references listed at the bottom of the ANOVA output that is produced and also reference the Medical Image Perception Laboratory website (http://perception.radiology.uiowa.edu) and the Kurt Rossmann Laboratories for Radiologic Image Research website (http://xray.bsd.uchicago.edu/krl/). DBM MRMC User's Guide (Windows OS) Page 10 of 45 To Run DBM MRMC To start the program ⇒ ⇒ Start the DBM MRMC application by double-clicking on its icon (Be sure to have open the folder in which the icon is located. Standard installations have the icon on the Desktop.). The icon is called “DBM MRMC 2.1 beta.” Alternatively it can be started from the “Start” menu shortcut A brief splash screen will appear to indicate the release number of the edition of the program installed. The program main window (see Figure 1) is made of a menu bar and program page, like most Windows programs. The main program form contains a list of the limitations of the version installed. They should be read and understood carefully. Note: Throughout the program execution only the tabs or options that are colored in black are available, the other ones cannot be selected. Note that this can change during the different phases of execution to serve as a guide as to what options are appropriate during the current phase of execution. This can also change in different versions of the program as more options become available. Note: After having concluded the analysis of a dataset, in the beta version it is necessary to restart the program to analyze another dataset . DBM MRMC User's Guide (Windows OS) Page 11 of 45 Figure 1. Program main window. Options available at the start At the beginning the only available tabs on the menu bar are the “File menu and the “Help” menu. The “Help” tab has two available options: • “About” option which describes the current version of the program, and • “Users Guide” option that opens the user’s guide (this one or a run time user’s guide, depending upon the version). The “File” tab, has three available options: • “Open” is the standard Windows™ pop up window that allows to select the input file • “User Options” selects a window that allow the user to select: o A default data folder where the program will look for data files o A preferred data file type (data file type will be defined in the following sections) • “Exit” to exit the program DBM MRMC User's Guide (Windows OS) Page 12 of 45 Execution of an MRMC analysis Once the program has started these are the steps to be followed to run an MRMC analysis: Open and read the input file. Select the “Open” option of the “File” tab, and use the pop up window to select the desired input file. If a test run of the program is desired, we recommend the user to click the “File” tab and select the DBM MRMC folder (it is the one selected by default in a standard installation). In the folder an input file can be selected, for example “SAMPLE1.lrc”, which has the lrc format (the old LABMRMC format, described below). Description on how to make and name input files are provided in the following sections. The program does not accept keyboard input. After this stage from the “View” tab the input file can be viewed in another window. DBM MRMC User's Guide (Windows OS) Page 13 of 45 Select the analysis options Select the “Analysis options” option from the “Run” tab (which was activated by reading an input file”). The following screen will appear: On the left side, on top, the different options available for the ROC model are listed. Note that some options might be set in gray. This means that those models are unavailable in this software version. The models have been described in the Overview section of this user’s guide, where the appropriate references are reported. Only one model can and must be selected. On right side, on top, the list of the available indices is described. The indices available for the specific ROC models will appear in black, the other ones will be gray. As already stated, indices might no be available for all ROC models at the current time. • The “Area” is the area under the curve (AUC), also known as Az for the conventional binormal model, here implemented as RSCORE. It is an average index and corresponds DBM MRMC User's Guide (Windows OS) Page 14 of 45 • • • • to the probability of actually-positive cases to have a signal larger than actually-negative cases. The “Partial area” corresponds to the area under the curve computed between either two values of FP (or FPF) or two value of TP (or TPF). Any two values between 0 and 1 can be chosen, as long as the first value is smaller than the second. Note that the partial area between 0 and 1 is the same as the area under the curve. “Sensitivity at specificity of” represents the estimate value of sensitivity when the specificity is set to a specific value. All curves will have all possible values of sensitivity or specificity, but they will map to each other differently. Note that FP or FPF = 1 – specificity and TP or TPF = sensitivity. “Specificity at sensitivity of” represents the estimate value of specificity when the sensitivity is set to a specific value. The cutpoints or threshold analysis is not fully developed yet and is not available. The bottom part of the window allows the user to select the type of ANOVA model desired: A. Analysis treating both readers and cases as random samples B. Analysis treating only cases as random samples C. Analysis treating only readers as random samples The first option (A) is the traditional MRMC analysis, the other two assumes that either the readers used are the only readers relevant to this analysis (B) or the cases used are the only cases relevant for this analysis (C); the latter two options do not allow inferences to be drawn for the whole population of readers and cases. Recategorization options Some models (CBM2, RSCORE) require recategorization if the data set contains more than 20 categories, but other models don’t (PROPROC2, WILCOXON). However, for challenging datasets it might happen that the algorithms do not converge unless the number of categories is reduced. Moreover, when extremely large datasets with many categories are used, it can be prohibitively time-consuming to run an analysis without reducing the number of categories, i.e., recategorizing , because the computation time is affected mostly by the number of categories used. There are various options of recategorization, See the “Definition of ROC terms section for details”. Recategorization is unnecessary in many situations. See a description of recategorization in the Appendix for further details. Running the analysis The algorithm needs first to generate the pseudovalues to run the ANOVA. At this stage there are two options from the “Run” tab: A. “Run All”, will automatically compute the pseudovalues and do the ANOVA calculations without further user interaction. If recategorization is necessary for the data, the Recategorization dialog will be displayed (see Appendix). The Run All option is appropriate for most users. B. “Generate Pseudovalues”. If this option is selected, the pseudovalues are computed, but the ANOVA is not performed. This option would be appropriate if the user only wished to generate pseudovalues to be analyzed with separate software. Note that after the DBM MRMC User's Guide (Windows OS) Page 15 of 45 “Generate Pseudovalues” option a number of output files will be available for inspection or analysis. This is useful if there are problems in the calculation or if a different analysis is desired and so only the pseudovalues values are required. After this stage the “Run ANOVA” option of the “Run” tab needs to be selected to finish the analysis. The ANOVA output file will be written after the completion of the ANOVA computations. Dialogs will appear to display progress of the computations. Reading result of analysis Under the tab “View” a number of options are listed, those are the different output files produced by MRMC. Selecting a file will prompt the program to open a specific application that will display the file. The files are: • “Input”, shows in Notepad™ the input file used for this calculation. This can be used to verify if indeed the correct input file was selected. • “Pooled” displays the results of Leave-None-Out calculations (see overview section and references therein), opened in Excel™. Description: o First column, the treatment (number) o Second column, the reader (number) o Third column: the number of effective categories used (that is if the user used 4 categories out of 5, this will show a 4) o Fourth column, the value of the index chosen o Fifth and sixth column, the parameters of the model (if semi-parametric) otherwise they will not be present. • “Pseudovalues” the list of the pseudovalues in list form, by column, opened in Excel™: “R” = Reader number “T” = Treatment number “C” = Case number “P” = Value of pseudovalue “IC” = Initial Category (prior to any recategorization) to which this pseudovalue corresponds (all pseudovalues from the same category have the same value) o “RC” = Recategorized Category (after any recategorization) to which this pseudovalue corresponds (all pseudovalues from the same cateogory have the same value) o “CC” = Collapsed Category (after application of LABROC4 or other algorithm to eliminate empty/unnecessary categories) to which this pseudovalue corresponds (all pseudovalues from the same category have the same value) o o o o o • “ROC Plot Info”, information about the ROC plot. Opened in Excel™. This can be used to plot the curves for each reader treatment combination. The file is divided in blocks that follow one another vertically. Each block is defined by the header “Reader x Treatment y” and tells that the following data regards readers x treatment y. o The first vertical sequence of numbers (under the columns FPF and OBS) represents the empirical operating points as determined by the input data. o The second vertical sequence of numbers (under the columns FPF and THEOR) represents the theoretical operating points. The different operating points correspond to “cutoffs” on the ROC curve. If the cutoffs estimated for the empirical operating points are used on the estimated curve one obtains the DBM MRMC User's Guide (Windows OS) Page 16 of 45 theoretical operating points. They are the place where the operating points would be for an infinitely large dataset if the population curve is the estimated curve. It is absent for the non-parametric methods. o The third vertical sequence of numbers (under the columns FPF and TPF) represents a sequence of 200 points on the estimated ROC curve. They can be used to plot the estimated ROC curve. It is absent for the non-parametric methods. • “ANOVA”, Notepad™ opens a file that contains the information about the ANOVA calculations. The file itself is amply commented, and the terms that appear in the file are described in the references (2-4), which should be referenced (together with the web sites) when using this program for research publications. DBM MRMC User's Guide (Windows OS) Page 17 of 45 Input file description Here we describe the different input files formats accepted by DBM MRMC and then shortly describe how to prepare input files using conventional text editing or spreadsheet programs. These last instructions are not meant by any means to be exhaustive or even appropriate for the specific kind of data and data collection used by all possible users, but only to provide some guidance and help determine the source of potential I/O problems. Input data Data is usually defined as continuous, categorical or ordinal data and confidence data. Continuous data, and all data that is numeric, is entered in the input file as it is: the data can be entered directly following the instructions of the following sections. Confidence data, if expressed in terms of language, e.g., “signal possibly present” (see “Definition of ROC terms” section), has first to be transformed into numerical values, usually integers. Of course care must be put to give larger values to categories that express higher probability of containing the signal. Input file types The different types of files available in the version installed can be viewed by selecting File -> Open and clicking the “Files of type:” drop-down filter. Here is a description of those input files: • LABMRMC files should have a file extension of .lrc or .txt in order to be recognized by the program. • Files in the legacy MRMC file format (e.g., for MRMC 1.70) should have the .in or .inp extension. • The extension of .fsp is used to indicate a file in a revised MRMC format where all fields are four characters wide (I4 format for Fortran programmers). • The extension of .csv is used to indicate a file in a revised MRMC format where all fields are separated by commas rather than appearing in particular columns (commadelimited rather than a fixed format) If a file does not have one of these extensions or has the wrong extension for its type, it will not be properly read, and you will receive an error message (if the selected file is a binary rather than text file, the program may crash). If you encounter a problem reading a file in the beta version of the program, you will need to restart the program before attempting to read another file because the File -> Open option will become disabled. Some sample data files are available for download from the websites LABMRMC input file format description (Please see example on page on page 22 for input files created using Microsoft Word, or example on page 23 for input files created using Microsoft Excel) Numbers represent input lines or groups of input lines, whereas bullets represent the description of those input lines. An example of a part of a LABMRMC file is provided in figure 2 of this section. An example input file is also provided with the installation of DBM MRMC, it is the file SAMPLE1.lrc. 1. A free-text description for the file (up to 60 characters, including any leading blanks). DBM MRMC User's Guide (Windows OS) Page 18 of 45 This description allows you to identify easily the type of data stored in the file. For example, if the current file contains information on a 10 observer study of 4 different mammographic CAD techniques, then this line might be: “2006 CAD comparison in mammo, 10 readers, 4 treatments” 2. The name of the particular reader whose data you are about to enter (starting with the first one). 3. On one line, enter the names of all the treatments Each treatment name must be enclosed in quotes (“”) and be no more than 12 characters long. DBM-MRMC 2.1 and later does not have limitations on the number of treatments used. 4. On one line enter an alphabetic code word (“small” or “large” or simply “s” or “l”) for each treatment, separated by one or more blank spaces, to indicate that smaller or larger test results, respectively, are associated with stronger evidence of positivity (e.g., “signal”, “disease” or “abnormality”). DBM-MRMC 2.1 reads only the first character of each of these code words. 5. A sequence of test-result values for actually-negative (e.g., “noise” or “normal”) cases. On a line for each actually-negative case, enter the test result for treatment 1, one or more blank spaces, the test result for treatment 2, one or more blank spaces, the test result for treatment 3, etc. Optionally, these test results can be followed by one or more blank spaces and then a brief free-text description of the case. The description must follow the values of all modalities (i.e., be at the end of the line). It is absolutely essential that the cases appear in the input data in exactly the same order for each reader. The program currently requires all test results values for each condition Each test-result value can include up to 6 digits to the left of the decimal point (in addition to a “+” or “-” sign) and up to 6 digits to the right. DBM MRMC 2.1 and later does not have limits on the number of actuallynegative cases that can be entered. Input of actually-negative cases must be terminated by a final line containing an asterisk (*) as its first character. 6. A sequence of test-result values for actually-positive (e.g., “signal”, “diseased” or “abnormal”) cases. The input format is the same as that for actually-negative cases. DBM MRMC User's Guide (Windows OS) Page 19 of 45 DBM MRMC 2.1 and later does not have limits on the number of actuallypositive cases that can be entered. Again, the input sequence must be terminated by a final line of free text containing an asterisk (*) as its first character and a “Enter” (“Return”) as its last character. 7. Repeat steps 2, 5 and 6 for the remaining readers. DBM MRMC 2.1 and later does not have limits on the number of readers in a data set. End the dataset with a pound sign (#) and a “Enter” (“Return”). Acceptable file name extensions for the LABMRMC format are “.txt” and “.lrc”. We recommend to use “.lrc” because the “.txt” might be restricted in future versions. When making the input files with other programs that will save them as “.txt”, it is enough to change their names after the file is finished. DBM MRMC User's Guide (Windows OS) Page 20 of 45 Using other programs to create input files Many programs can be used to create the input files as described above. However, it is essential that these programs save the input file in a text-only (word processor) or formatted-text (spreadsheet) format. Although we shall mention only Microsoft Word™ and Excel™ in the examples below, most other equivalent programs will work in a similar fashion. To create an input file using Microsoft Word™ Open Microsoft Word™. Open a new document by selecting “New...” from the “File” Menu. Enter your data into the file according to the one of the file formats described above. Save the file: — select “Save As...” from the “File” Menu; — In the “Save File as Type” pop-up menu choose “MS-DOS TEXT with Layout” or “Text Only” (the names will vary depending on the version of Windows™ and Word™ that you are using; in the worst case keep trying until one works); — type the name you want to save the file as; and — click on the “Save” button or hit the “Enter” (“Return”) key to save the file. Figure 1. Save for input file created with Microsoft Windows™. Here the name of the folder is “rockit”, and the name of the file is “sample in.txt” DBM MRMC User's Guide (Windows OS) Page 21 of 45 Figure 2. . Example of Microsoft Word™ input file (LABMRMC input file) DBM MRMC User's Guide (Windows OS) Page 22 of 45 To create an input file using Microsoft Excel™ Open Microsoft Excel. Open a new document by selecting “New...” from the “File” Menu. Enter your data into the file according to one of the file formats described above: — Enter each number in its own column. ⇒ Note for users using the LABMRMC format: Excel does not automatically enclose text in quotes. Therefore you must enclose the condition descriptions in quotes and always save as “formatted text” or “TEXT(MS-DOS)” or some similar name, depending upon the version of Windows™ and Excel™ that you are using. — Enter each line in its own row. Save the file: — select “Save As...” from the “File” Menu; — choose “Formatted Text” or “TEXT(MS-DOS)” in the “Save File as Type” pop-up menu (see comment at the previous point); — type the name you want to save the file as; and — click on the “Save” button or hit the “Enter” (“Return”) key to save the file. Figure 3. Save for input file created with Microsoft Excel™ DBM MRMC User's Guide (Windows OS) Page 23 of 45 Figure 4. Microsoft Excel™ input file (LABMRMC input file) DBM MRMC User's Guide (Windows OS) Page 24 of 45 Legacy MRMC input file format description I. The original space-delimited MRMC input file format(*.in & *.inp extensions in DBM MRMC): line # -----1 2 3 4 5 6 7 columns ------1-2 type -----integer 3-4 1-4 1-2 1-2 1-4 1-2 2-4 5-6 7-8 . . . 1-2 3-4 5-6 7-8 integer integer integer integer integer integer integer integer integer take two values 0 or 1 (0..1). 0 means high value of the confidence rating is high category. 1 means low value of the confidence rating is high category. the value of the highest category (3..20). number of normal trials. number of treatments (2 ..). number of readers (2 ..). number of cases (2 ..). conf. rating of reader 1, treatment 1, case 1. conf. rating of reader 1, treatment 2, case 1. conf. rating of reader 1, treatment 3, case 1. conf. rating of reader 1, treatment 4, case 1. integer integer integer integer conf. conf. conf. conf. rating rating rating rating of of of of reader reader reader reader 1, 1, 1, 1, treatment treatment treatment treatment 1, 2, 3, 4, case case case case 2. 2. 2. 2. . . . And so on until the end of Reader 1 data. The same format is used for all the readers. DBM MRMC requires at least 2 treatments. If you have one treatment, use one of the curvefitting routines such as PROPROC, CBM, or RSCORE. The following is fragment of input file (the complete file is in separate file named SAMPLE1.IN): •1•5 ••33 •2 •4 •100 •3•3 •5•3 •4•2 •4•3 •4•5 •4•5 (The "•" do not actually appear in the data) DBM MRMC User's Guide (Windows OS) Page 25 of 45 •4•1 •2•2 •5•5 . . . . This file contains the rating data for two treatments, four readers, and 100 cases for each reader. Line 1: The "1" in the first line of the data file indicates that higher rating values indicate higher confidence. The "5" indicates that there are five category ratings in the data (1 through 5). Line 2: The "33" in the second line indicates that the first 33 cases for each reader are normal (noise), and the last 67 are abnormal (noise + signal). Line 3: The "2" indicates that there are two treatments in the data set. Line 4: The "4" indicates that there are four treatments in the data set. Line 5: The "100" indicates that there are 100 cases per reader. Remaining Lines: These are the data lines. Note that the columns correspond to treatments, rows to cases, and every 100 rows to a reader. The data for Reader 1 data comes before the data for Reader 2, etc. II. Another legacy space-delimited version of the original MRMC input file format is as follows (*.fsp extension in DBM MRMC): This version of the format was created to make it easier to use a spreadsheet program such as Excel to create the data files by saving them as space-delimited (*.prn) files. line # -----1 2 3 4 5 6 columns ------1-4 type -----integer 5-8 1-4 1-4 1-4 1-4 1-4 5-8 9-12 integer integer integer integer integer integer integer integer take two values 0 or 1 (0..1). 0 means high value of the confidence rating is high category. 1 means low value of the confidence rating is high category. the value of the highest category (3..20). number of normal trials. number of treatments (2 ..). number of readers (2 ..). number of cases (2 ..). conf. rating of reader 1, treatment 1, case 1. conf. rating of reader 1, treatment 2, case 1. conf. rating of reader 1, treatment 3, case 1. DBM MRMC User's Guide (Windows OS) Page 26 of 45 7 13-16 . . . 1-4 5-8 9-12 13-16 integer conf. rating of reader 1, treatment 4, case 1. integer integer integer integer conf. conf. conf. conf. rating rating rating rating of of of of reader reader reader reader 1, 1, 1, 1, treatment treatment treatment treatment 1, 2, 3, 4, case 2. case 2. case.2 case 2. . . . And so on until the end of Reader 1 data. The same format is used for all the readers. DBM MRMC requires at least 2 treatments. If you have one treatment, use one of the curvefitting routines such as PROPROC, CBM, or RSCORE. The following is fragment of input file (the complete file is in separate file named SAMPLE1.FSP): •••1•••5 ••33 •••2 •••4 •100 •••3•••3 •••5•••3 •••4•••2 •••4•••3 •••4•••5 •••4•••5 •••4•••1 •••2•••2 •••5•••5 (The "•" do not actually appear in the data) . . . . This file contains the rating data for two treatments, four readers, and 100 cases for each reader. Line 1: The "1" in the first line of the data file indicates that higher rating values indicate higher confidence. The "5" indicates that there are five category ratings in the data (1 through 5). Line 2: The "33" in the second line indicates that the first 33 cases for each reader are normal (noise), and the last 67 are abnormal (noise + signal). Line 3: The "2" indicates that there are two treatments in the data set. Line 4: DBM MRMC User's Guide (Windows OS) Page 27 of 45 The "4" indicates that there are four treatments in the data set. Line 5: The "100" indicates that there are 100 cases per reader. Remaining Lines: These are the data lines. Note that the columns correspond to treatments, rows to cases, and every 100 rows to a reader. The data for Reader 1 data comes before the data for Reader 2, etc. III. A comma-delimited version of the original MRMC input file format is as follows (*.csv extension in DBM MRMC): This version of the format was created to make it easier to use a spreadsheet program such as Excel to create the data files by saving them as comma-delimited (*.csv) files. Unlike the previous MRMC format descriptions, there is not mention of columns because the data does not need to be in particular columns. line # -----1 2 3 4 5 6 7 type -----integer integer integer integer integer integer integer integer integer integer take two values 0 or 1 (0..1). 0 means high value of the confidence rating is high category. 1 means low value of the confidence rating is high category. the value of the highest category (3..20). number of normal trials. number of treatments (2 ..). number of readers (2 ..). number of cases (2 ..). conf. rating of reader 1, treatment 1, case 1. conf. rating of reader 1, treatment 2, case 1. conf. rating of reader 1, treatment 3, case 1. conf. rating of reader 1, treatment 4, case 1. integer integer integer integer conf. conf. conf. conf. rating rating rating rating of of of of reader reader reader reader 1, 1, 1, 1, treatment treatment treatment treatment 1, 2, 3, 4, case 2. case 2. case.2 case 2. . . . And so on until the end of Reader 1 data. The same format is used for all the readers. DBM MRMC requires at least 2 treatments. If you have one treatment, use one of the curvefitting routines such as PROPROC, CBM, or RSCORE. The following is fragment of input file (the complete file is in separate file named SAMPLE1.CSV): DBM MRMC User's Guide (Windows OS) Page 28 of 45 1,5 33 2 4 100 3,3 5,3 4,2 4,3 4,5 4,5 4,1 2,2 5,5 . . . . This file contains the rating data for two treatments, four readers, and 100 cases for each reader. Line 1: The "1" in the first line of the data file indicates that higher rating values indicate higher confidence. The "5" indicates that there are five category ratings in the data (1 through 5). Line 2: The "33" in the second line indicates that the first 33 cases for each reader are normal (noise), and the last 67 are abnormal (noise + signal). Line 3: The "2" indicates that there are two treatments in the data set. Line 4: The "4" indicates that there are four treatments in the data set. Line 5: The "100" indicates that there are 100 cases per reader. Remaining Lines: These are the data lines. Note that the columns correspond to treatments, rows to cases, and every 100 rows to a reader. The data for Reader 1 data comes before the data for Reader 2, etc. DBM MRMC User's Guide (Windows OS) Page 29 of 45 Modern MRMC input file format description Beginning with DBM MRMC 2.1 beta2 and later, the DBM MRMC software permits the use of comments within MRMC-format data files (.in, .fsp, or .csv). Comment lines MUST begin with an exclamation point (!) as the first character on the line. These comment lines can appear anywhere within the data file and are ignored by the data parser (note that the entire line following the “!” will be ignored”). Comment lines are shown in green in the augmented example below. Also beginning with DBM MRMC 2.1 beta2, the DBM MRMC program also permits the user to specify labels for readers and treatments. These label lines MUST begin with a dollar sign ($). Following the dollar sign is the tag TITLE to refer to the title of the data set, the tag TRT to refer to treatment labels, and the tag RDR to refer to reader labels. In later releases of the software, these labels will be used in the output files produced. For beta2, the labels can be present in the data file, but they will not appear in the output. Label lines are shown in blue in the augmented example below. The sample file below is available from the website as file “SAMPLE1_augmented.fsp”. $TITLE: "Sample data set" ! Low # = High Confidence; 5 categories 1 5 ! Normal Cases 33 ! Treatments 2 $TRT: "Treatment 1" "Treatment 2" ! Readers 4 $RDR: "Reader 1" "Reader 2" "Reader 3" "Reader 4" ! Cases 100 ! Tx1 Tx2 3 3 5 3 4 2 4 3 4 5 4 5 4 1 2 2 5 5 These comments and labels can appear in all three MRMC file formats. DBM MRMC User's Guide (Windows OS) Page 30 of 45 Output files description The output of DBM-MRMC is organized in several types of files. These files are written to the directory containing the input file. In some circumstances, DBM MRMC itself or some of the DLLs it is calling may create additional log files. These files are there solely for debugging purposes and can be ignored by most users. List of output files with description of their content The expression “Input_file_name” will be used to refer to the name of the input file without extension (that is what comes after the “.”). In the following example the input file is called SAMPLE1.lrc (and it is one of the example files available with the release), so Input_file_name = SAMPLE1. Model_name refers to the ROC model used, in the example PROPROC2 shortened to proproc. “Norm” refers to the fact that pseudovalues are normalized. Only normalized pseudovalues are used by MRMC2.1B2. See Hillis and Berbaum (4) for a description of normalized pseudovalues. Index_used refers to the ROC index used. In the example “area” for area under the curve Naming scheme Input_file_name model_name norm Index_used anova.txt Example name SAMPLE1 proproc norm area anova.txt Input_file_name model_name norm Index_used plot.csv SAMPLE1 proproc norm area plot.csv Input_file_name model_name norm Index_used pooled.csv SAMPLE1 proproc norm area pooled.csv Input_file_name model_name norm Index_used pseudo.csv SAMPLE1 proproc norm area pseudo.csv DBM MRMC User's Guide (Windows OS) Description Text file. Output of the ANOVA routines such as inference results, confidence intervals, components of variance CSV file made to be read with a spreadsheet program like Excel™. Contains Leave-None-Out plotting information. Used to plot ROC curves. CSV file made to be read with a spreadsheet program like Excel™. Contains Leave-None-Out fitting information, such as curve parameters and area under the curve CSV file made to be read with a spreadsheet program like Excel™. Contains the pseudovalue used in the ANOVA calculations. To be used as a check or for additional calculations Page 31 of 45 Description of program output The output files were previously described in the "To Run DBM MRMC" section. The “View” menu can be used to open the output files, or they can be opened from within Windows Explorer (file manager). DBM MRMC User's Guide (Windows OS) Page 32 of 45 Definition of ROC Terms Actually-positive and actually-negative Actually-negative cases are the cases that have been verified to be negative. This means that the researcher is very confident, or as confident as possible, that the cases in fact do not contain the signal that is being looked after (e.g., cancer screening it means that the patient either is healthy or has a disease so small that the medical community does not consider it a disease). The verification is usually done using a different, very accurate, diagnostic tool, usually called gold standard (in the case of cancer screening this is usually a pathology report with some follow up period to insure that the diagnosis was in fact correct). Actually-positive cases are the cases that have been verified to be positive. This means that the researcher is very confident, or as confident as possible, that the cases in fact contain the signal that is being looked after (e.g., cancer screening it means that the patient has the disease in a form that the medical community considers clear). The verification is usually done using a different, very accurate, diagnostic tool, usually called gold standard (in the case of cancer screening this is usually a pathology report). TPF, FPF, TP, FP, TN, FN False positive fraction (FPF) or false positive (FP) or false positive rate (FPR) corresponds to the number of actually-negative cases incorrectly diagnosed as positive by the modality under investigation divided by the total number of actually-negative cases. Note that this value, unless the test is binary (Yes/No) refers to a specific threshold, see Pepe’s book for details (1). The same applies to all the following definitions. True positive fraction (TPF) or true positive (TP) or True positive rate (TPR) corresponds to the number of actually-positive cases correctly diagnosed as positive by the modality under investigation divided by the total number of actually-positive cases. True Negative (TN) is an actually-negative case correctly classified as negative. False Negative (FN) is an actually-positive case incorrectly classified as negative. Confidence ratings and continuous ratings and categorical data With the terms “confidence ratings”, we mean inherently ordinal data labeled using expressions that represent the confidence the rater has in the presence or absence of a signal, for example ratings could be any of these categories: “signal definitely absent”, “signal probably absent”, “signal possibly present, equivocal”, “signal probably present”, “signal definitely present". Another example of categorical data is the BI-RADS score, used in mammography. These data are often called categorical or discrete because there are many fewer categories that data points and usually the difference between categories is rather sharp (even if this does not preclude the existence of borderline cases). When analyzed usually the category labels are replaced with integers, to replicate the ordering of the data, e.g.: Signal definitely absent 1 Signal probably absent 2 Signal possibly absent, equivocal 3 Signal probably present 4 DBM MRMC User's Guide (Windows OS) Page 33 of 45 Signal definitely present 5 “Continuous ratings” are for example lab tests or serum concentrations. They are expressed as decimal figures (e.g., 3.456). Usually there are many more possible values than datapoints. In general empirical datasets range from being defined with very few categories (2 in the case of a sensitivity and specificity analysis) to subjective probability scales with many hundreds of categories, to the quasi-continuous scales of lab tests or neural network outputs. It should be noticed that apart from numerical and statistical issues (variance and bias of estimates) these data are equivalent from the ROC point of view as long as they can be ranked unequivocally. Area, Area under the curve, Az The by far most commonly used index in ROC analysis is the are under the ROC curve (1), it represents the average TPF value provided by the test. Usually it is called area under the curve (AUC). This last term applies to any ROC curve, defined using any model, whether parametric, semi-parametric or non-parametric. Sometimes this terms is simply replaced by “area” when the context makes the interpretation unequivocal. The term Az technically refers only to the AUC for the conventional binormal model (RSCORE here). Leave-None-Out, Leave-One-Out and pseudovalues The statistican inference made by this program is based on ANOVA. However, the data for the ANOVA computation is produced using Tukey-Quenouille jackknife (10). Data is considered to be fully crossed, i.e., every reader will read every case using every modality. If you select a reader and a modality then there will be a sequence of actually-negative and actually-positive cases, typical ROC data, from now on this will be called the reader-modality dataset. The first step is to compute the ROC index estimate for the reader-modality dataset. This is called the Leave-None-out, because no cases are left out of the sample, let’s call it θ. Since the ROC index cannot be computed for a single case, one way to compute the effect of the case contribution to the index is to compute the index taking that case out, θ(k), this is the Leave-OneOut. The contribution of that case can be estimated by “its negative” using the pseudovalue: Yk = nθ − (n −1)θ(k ) Conventional binormal model (RSCORE model) The term “conventional binormal model” is often used in comparisons with the “proper models” - otherwise it might be simply called “binormal model”. Each possible case is assumed to be associated with a value x. Usually, the variable x is not what is measured in the experiment being analyzed, but a monotonic transformation of that value and it is called a latent variable (i.e., not observed). In the model, x is normally distributed both for the actually-negative cases (the distribution of x conditional to being negative is normal) and for the actually-positive cases (the distribution of x conditional to being positive is normal). The two normal distributions have in general different means and standard deviations: μn, σn for the actually-negative cases and μs, σs for the actually-positive cases. The usual convention that positive-cases are more likely to have a larger value implies that μn < μs. Calling φ the normal distribution function, φ(μn, σn, -xt) is the fraction of actually-negative cases with value above xt (i.e., the false positive fraction or FPF) and φ(μs, σs, -xt) the fraction of actually-positive cases with value above xt (the true positive fraction or TPF). (it should be DBM MRMC User's Guide (Windows OS) Page 34 of 45 remembered that the fraction of cases above x is 1 - φ(μ, σ, x), which is the same as φ(μ, σ, -x) because of symmetry.) The ROC curve is produced by sweeping the value of xt and plotting φ(μs, σs, -xt) against φ(μn, σn, -xt) in the ROC plot (TPF vs FPF) (1). The latent variable x can be assumed to have produced the measurements (or test result values), which we are using to make the MRMC study, after it had been subjected to a monotonic transformation and possibly some discretization (to produce discrete values from the continuous values of the normal distribution). It is binormal, because we are assuming that this random variable is normal both for actually-positive cases and for actually-negative cases. For most distributions, the two normal densities will intersect two times, which means that there are either large values for which it is more likely to have an actually-negative case than to have an actually-positive (because the actually-negative conditional distribution is larger there after they intersected) or that there are smaller values for which it is more likely to have an actuallypositive case than to have an actually-negative (because the actually-positive conditional distribution is larger there because the value is before the intersection). This will create a socalled hook. (Note that this behavior is sometimes not negligible.) A “proper model” is a correction to this. The conventional binormal model is usually defined using two parameters: “a” that represents the vertical intercept and “b” that represents the slope of the fitted ROC curve when it is plotted as a straight line on “normal-deviate” axes. The first parameter is related to the difference between the means of the two normal distributions, while the second parameter is the ratio of their standard deviations. Proper binormal model (PROPROC or PROPROC2 model) The term “proper binormal model”, often used as opposed to “conventional binormal model”, means that the ROC curve is produced by sweeping the value of likelihood ratios as determined by a binormal distribution (6). The likelihood ratio (or better a monotonically increasing function of it) is the latent variable of this model (see previous section for a definition of a latent variable). Accordingly, the coordinates of the point on the ROC curve corresponding to each threshold value has abscissa, the False Positive Fraction (FPF or FP), equal to the number of actually negative cases with likelihood ratio above the threshold divided by the total the number of actually negative cases, and ordinate, the True Positive Fraction (TPF or TP), equal to the number of actually positive cases with likelihood ratio above the threshold divided by the total the number of actually positive cases. The latent variable is assumed to have produced the measurements (or test result values), which we are using to make the MRMC study, after it had been subjected to a monotonic transformation, plus possibly a discretization. It is binormal because we are assuming that the actually-negative and actually-positive densities from which the likelihood ratio is computed are normal. The use of the likelihood ratio prevents the formation of the so-called hooks. Other solutions to the “hooks” problem have been suggested and some are included in this software package (CBM2). The exact meaning of the parameters of the proper binormal model, da and c, is not intuitive. In general da is proportional (nonlinearly) to the value of the area under the curve, and c is proportional (also non-linearly) to the DBM MRMC User's Guide (Windows OS) Page 35 of 45 shape of the curve, i.e., how skewed it is. Refer to Metz et al. (6) for details about this models. A paper by Pesce and Metz describing the algorithm used in DBM MRMC, PROPROC2, was submitted to Academic Radiology (June 2006). LABROC4/LABROC5 categorization LABROC4 algorithm. Before maximum-likelihood estimation of the ROC parameters is attempted, continuously-distributed input data are rank-ordered and then collapsed into truthstate runs as per the LABROC4 algorithm (9). Since ROC analysis is concerned with the ability of an algorithm to separate actually-negative from actually-positive cases, once the set is rank ordered the actual values of the test results can be replaced with their ranks with no loss of information (i.e., the resulting ROC curve will be identical). Let’s make a simple example with a 12 case dataset. The dataset has 6 actually-negative (green) and 6 actually-positive (red) cases. The following steps will be involved: • Let us suppose that on input we have the actually-negative cases first, followed by the actually-positive (1st row) • The cases are ranked (2nd row) • The values are replaced with their ranks (3rd row) • Contiguous ranks of the same truth (either all actually-positive or all actually-negative) are collapsed (4th row). • The cases with the same number are counted and those counts are the new categorical data (following table). This table produces exactly the same ROC curve as the original data, apart from possible numerical issues. 5.0 7.0 3.0 1.0 1.0 3.0 6.0 7.0 8.0 10.0 1.0 7.0 1.0 1.0 1.0 3.0 3.0 5.0 6.0 7.0 7.0 7.0 8.0 10.0 1 1 1 2 2 3 4 5 6 6 7 8 1 1 1 2 2 2 3 4 5 5 5 5 DBM MRMC User's Guide (Windows OS) Page 36 of 45 LABROC4 categories (for example above) Category index I II III IV V # actually-negative cases 2 3 0 1 0 # actually-positive cases 1 0 1 0 4 LABROC5 algorithm. The runs obtained from the LABROC4 algorithm (let us supposed that it generated L categories, in previous example 5) can then reduced to K categories in an ad hoc but empirically useful way, in the sense that it was empirically shown to produce good fits (9). Here we describe a simplified version of the algorithm, the exact one can be found in Metz et al. (9). Steps of the LABROC5 algorithm (starting from the previous LABROC4 points): • Take the boundaries between points (in the example above they are between 0 and 1, 1 and 2, 2 and 3, 3 and 4). Since we move from the more positive to the less positive, the boundaries are ordered from the largest value to the smallest (1st row) • The boundaries are transformed to their operating points or positive and negative fractions (2nd row) • The two fractions for each point are summed, also called city block distance (3rd row) • The city block distances are centered to 1 (which corresponds to a point on the -45 degrees diagonal) (4th row) • These values are transformed into their normal distribution values (5th row) • The “most uniform” values are selected and the categories in between are collapsed 4,3 3,2 2,1 1,0 (0, 4/6) (1/6, 4/6) (1/6, 5/6) (4/6, 5/6) 4/6 5/6 1 9/6 -1/3 -1/6 0 ½ +.37 +.43 .5 .69 OK Collapse OK OK LABROC5 categories (for example above) Category index I II III IV # actually-negative cases 2 3 1 0 # actually-positive cases 1 0 1 4 The LABROC5 points do not preserve the information of the original dataset. DBM MRMC User's Guide (Windows OS) Page 37 of 45 APPENDIX Using the Recategorization menu option Recategorization is necessary if the input data contains more than 20 categories and the CBM2 or RSCORE 4.7 curve-fitting options were selected (PROPROC2 and the Wilcoxon/trapezoidal approach do not have the 20-category limitation). The user can manually invoke recategorization in other situations if desired by selecting Recategorization from the Run menu. After the Recategorization step, the Generate Pseudovalue step can be performed. The recategorization dialog appears as follows: DBM MRMC User's Guide (Windows OS) Page 38 of 45 At the top, the columns to the left display the minimum value, the maximum value, and the number of categories in the current data set. The columns to the right display the minimum value, maximum value, and the number of categories permitted by the selected curve-fitting algorithm. This example displays the values for a 101-category data set used with CBM2. A message in red typeface indicates that recategorization is necessary for this data set. If recategorization had not been necessary, a message in black typeface would indicate that recategorization was not necessary. In the situation shown, the user would need to check the box for “Recategorize the data” in order to proceed with analysis. The various recategorization options would then be enabled. The options available to the user will depend on the number of categories in the data set (e.g., if the data set has only 10 categories, the options to recategorize to 20 or 10 categories will not be enabled). A brief description of each option follows: DBM MRMC User's Guide (Windows OS) Page 39 of 45 Recategorize to 20 categories (fine ends, coarse middle) Assuming 101 categories (0 to 100 or subjective probability data), the following algorithm is used Original Data Range Recategorized Value 0 1-4 5 6-10 11-15 16-20 21-25 26-30 31-40 41-50 51-60 61-70 71-75 76-80 81-85 86-90 91-94 95 96-99 100 DBM MRMC User's Guide (Windows OS) 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Page 40 of 45 Recategorize to 20 categories (evenly distributed) Assuming 101 categories (0 to 100 or subjective probability data), the following algorithm is used Original Data Range 0-5 6-10 11-15 16-20 21-25 26-30 31-35 36-40 41-45 46-50 51-55 56-60 61-65 66-70 71-75 76-80 81-85 86-90 91-95 96-100 DBM MRMC User's Guide (Windows OS) Recategorized Value 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Page 41 of 45 Recategorize to 10 categories (evenly distributed) Assuming 101 categories (0 to 100 or subjective probability data), the following algorithm is used Original Data Range Recategorized Value 0-10 11-20 21-30 31-40 41-50 51-60 61-70 71-80 81-90 91-100 1 2 3 4 5 6 7 8 9 10 Recategorize to 5 categories (evenly distributed) Assuming 101 categories (0 to 100 or subjective probability data), the following algorithm is used Original Data Range 0-20 21-40 41-60 61-80 81-100 Recategorized Value 1 2 3 4 5 Use LABROC4 algorithm to reduce categories This algorithm is described in the “Definition of ROC Terms” section. NOTE: This algorithm does not reduce the data to a predetermined number of categories, so if there are a large number of categories in the data, it may not reduce the categories sufficiently for RSCORE or CBM2. DBM MRMC User's Guide (Windows OS) Page 42 of 45 Use the Dorfman/MRMC legacy algorithm to reduce categories Like the LABROC4 algorithm, this algorithm does not reduce the data to a predetermined number of categories. As such, it may not be appropriate for data recategorization for RSCORE and CBM2 if the number of categories in the data set greatly exceeds 20 (e.g., 101category or subjective probability data). A description of the algorithm follows. The algorithm makes two passes over the categorical data matrix that is created based on the input data: 1. The first pass deletes columns that contain a zero (0) in BOTH the N and S+N rows. 2. The second pass through the matrix collapses columns with a zero (0) for N and a one (1) for S+N; or one (1) for N and zero (0) for S+N. Columns are collapsed from right to left except for the first column,which is collapsed from left to right. Example: Original categorical data matrix: 1 2 3 4 5 _____ _____ _____ _____ _____ | | | | | | | 1 | 1 | 1 | 0 | 0 | |_____|_____|_____|_____|_____| | | | | | | | 0 | 0 | 0 | 0 | 1 | |_____|_____|_____|_____|_____| After the first pass: Deleting columns 1 2 3 5 _____ _____ _____ _____ | | | | | | 1 | 1 | 1 | 0 | |_____|_____|_____|_____| | | | | | | 0 | 0 | 0 | 1 | |_____|_____|_____|_____| Second pass: Collapsing columns, first collapse 1 2 3 _____ _____ _____ | | | | | 1 | 1 | 1 | |_____|_____|_____| | | | | | 0 | 0 | 1 | |_____|_____|_____| DBM MRMC User's Guide (Windows OS) Page 43 of 45 Second pass: Collapsing columns, second collapse 1 3 _____ _____ | | | | 2 | 1 | |_____|_____| | | | | 0 | 1 | |_____|_____| This is the final form of the matrix. SPECIAL NOTE: All of the recategorization options operate only on the data in program memory. The original data file remains as it was. The current version of the software does not permit you to save the recategorized data. This feature may be added to a future release. DBM MRMC User's Guide (Windows OS) Page 44 of 45 References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. Pepe MS. The statistical evaluation of medical tests for classification and prediction. Oxford ; New York: Oxford University Press, 2004. Dorfman DD, Berbaum KS, Metz CE. Receiver operating characteristic rating analysis. Generalization to the population of readers and patients with the jackknife method. Invest Radiol 1992; 27:723-731. Hillis SL, Obuchowski NA, Schartz KM, Berbaum KS. A comparison of the DorfmanBerbaum-Metz and Obuchowski-Rockette methods for receiver operating characteristic (ROC) data. Stat Med 2005; 24:1579-1607. Hillis SL, Berbaum KS. Monte Carlo validation of the Dorfman-Berbaum-Metz method using normalized pseudovalues and less data-based model simplification. Academic Radiology 2005; 12:1534-1541. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 1988; 44:837-845. Metz CE, Pan X. "Proper" Binormal ROC Curves: Theory and Maximum-Likelihood Estimation. J Math Psychol 1999; 43:1-33. Dorfman DD, Berbaum KS. A contaminated binormal model for ROC data: Part II. A formal model. Acad Radiol 2000; 7:427-437. Metz CE. Some practical issues of experimental design and data analysis in radiological ROC studies. Invest Radiol 1989; 24:234-245. Metz CE, Herman BA, Shen JH. Maximum likelihood estimation of receiver operating characteristic (ROC) curves from continuously-distributed data. Stat Med 1998; 17:10331053. Tukey JW. Bias and Confidence in Not-Quite Large Samples. Annals of Mathematical Statistics 1958; 29:614-614. DBM MRMC User's Guide (Windows OS) Page 45 of 45
© Copyright 2026 Paperzz