Abstract
In this paper, we propose a novel pipeline for conducting disease quantification in positron emission tomography/computed tomography (PET/CT) images on anatomically pre-defined objects. The pipeline is composed of standardized uptake value (SUV) standardization, object segmentation, and disease quantification (DQ). DQ is conducted on non-linearly standardized PET images and masks of target objects derived from CT images. Total lesion burden (TLB) is quantified by estimating normal metabolic activity (TMAn) in the object and subtracting this entity from total metabolic activity (TMA) of the object, thereby measuring the overall disease quantity of the region of interest without the necessity of explicitly segmenting individual lesions. TMAn is calculated with object-specific SUV distribution models. In the modeling stage, SUV models are constructed from a set of PET/CT images obtained from normal subjects with manually delineated masks of target objects. Two ways of SUV modeling are explored, where the mean of mean values of the modeling samples is utilized as a consistent normality value in the hard strategy, and the likelihood representing normal tissue is determined from the SUV distribution (histogram) for each SUV value in the fuzzy strategy. The evaluation experiments are conducted on a separate clinical dataset of normal subjects and a phantom dataset with lesions. The ratio of absolute TLB to TMA is taken as the metric, alleviating the individual difference of volume sizes and uptake levels. The results show that the ratios in normal objects are close to 0 and the ratios for lesions approach 1, demonstrating that contributions on TLB are minimal from the normal tissue and mainly from the lesion tissue.
Keywords: PET/CT, standardized uptake value, disease quantification, metabolic activity, total lesion burden
1. INTRODUCTION
Positron emission tomography/computed tomography (PET/CT) images contain metabolic and anatomical information jointly, providing quantitative evidence for lesion differentiation, therapy planning, and prognosis assessment [1, 2]. Disease quantification (DQ) is commonly conducted by first segmenting the target lesions and then investigating the standardized uptake value (SUV) within the segmented volumes. However, the acquisition of PET images is affected by numerous technical and biological factors [3], leading to the inconsistent numeric meaning of image values even after transforming to SUVs. Furthermore, the low image resolution and partial volume effect increase the difficulty in segmenting lesions in PET images [4], making DQ at the single-lesion level a challenging task.
Several repeatability and multicenter studies were conducted to deal with the difficulties on DQ derived from image acquisition. Effects of image reconstruction, resolution, and ROI definition in multicenter trials were explored in [5]. Harmonization strategies are proposed in [6] to unify the process of patient preparation and parameter setting of PET/CT systems. To deal with the difficulty in segmenting lesions, automatic segmentation methods on paired PET/CT images are proposed to either delineating tumors [7] or target organs [8] and neighboring confounding organs [9]. Burger et al. proposed a method of calculating background subtraction lesion activity via histogram of a local volume around the tumor, where a Gaussian fit to the peak of the histogram is utilized to estimate normal background activity [10]. However, determining a proper region of interest may become a new challenge in such an approach.
As inconspicuous and conspicuous lesions may exist simultaneously in the PET image of a patient with tumor within an organ or tissue region, the total lesion burden (TLB) inside anatomically pre-defined regions (objects), compared with single lesions, may provide more instructive and stable information in clinical practice. In this paper, we propose a method of regional disease quantification in PET/CT images by estimating what the normal metabolic activity would be for the object in the given patient image and subtracting this entity from total metabolic activity. The idea is to retain only the disease portions of the metabolic activity and to remove the background normal activity without explicitly segmenting the lesions.
2. METHODS
2.1. DQ pipeline
A schematic representation of the proposed DQ pipeline via PET/CT images is shown in Figure 1. The pipeline consists of three key components: SUV standardization, object segmentation, and calculation of disease quantity within the object. Metabolic and anatomical information required in the DQ process is separately derived from the matched pair of PET and CT images.
To deal with the problem of inconsistent numeric pixel meaning in PET images, SUV images are usually generated by normalizing the PET values with injected radiotracer dose and patient body weight. Our previous work [11] further proposed a non-linear method of standardizing the overall scale of (near) whole-body PET images to alleviate the influence on image acquisition from other unquantifiable factors.
As most segmentation algorithms are explored in specific body regions with consideration for both accuracy and time-efficiency, body region recognition should be first conducted on the (near) whole-body CT images to extract the slice stack of interest [12]. Then, masks of the target objects can be obtained from the CT images by various segmentation algorithms, such as one of our previously proposed methods which combines fuzzy anatomy modeling for recognition and deep learning networks for delineation [13].
2.2. Calculating the total lesion burden
Compared with commonly used quantitative measures such as SUVmax, SUVmean, and SUVpeak, which show stable results under the repeatability problem of PET image acquisition and challenges in lesion segmentation [6], total lesion burden (TLB) may show better comparability and overall clinical meaning in practice. TLB(O) for an object O is calculated by estimating the background normal metabolic activity (TMAn) and subtracting it from the total metabolic activity (TMA) within the target object in the standardized SUV image, as shown in Equation (1).
(1) |
For any image I and for any object O, we define total metabolic activity TMA(O) of O as in Equation (2).
(2) |
where IS stands for the standardized SUV image, v stands for each voxel in the image, and FMO stands for the segmentation map of the target object O which can be either a fuzzy map (0 to 1, representing the probability or membership of the voxel belonging to the object) or a binary map (0 or 1, representing background and foreground separately).
TMAn is calculated with SUV models generated from a modeling dataset obtained from disease-free subjects with no obvious lesion having abnormal radiotracer uptake in the PET images. As the patterns of radiotracer uptake may be different among organs under consideration, SUV model or normality distribution, denoted by NDO(s) for SUV value as a random variable s, is generated for each of the organs separately. TMAn is explored in hard and fuzzy ways, correspondingly denoted by and .
Estimation of
In the hard approach, the normality distribution NDO(s) of O defined for the normal population is taken to be an impulse function with an impulse at s = μs which is taken to be the mean of mean SUV in O over its normal population. This means that μs is taken as a hard value which is multiplied by the volume of mask FMO to obtain
(3) |
In other words, this approach assumes that the normal portion at every voxel within FMO(O) has the same SUV, namely μs, to estimate the normal activity.
Estimation of
One issue with the hard approach expressed in (3) is that the resulting TLB(O) derived from (1) may not be an insignificant (meaning 0) value when estimated on normal datasets or patient images with very minimal disease burden in O. This may be due to several factors such as imprecision in standardization of PET images, errors in delineation of O, the SUV of normal tissue of O not being a fixed constant value, and partial volume effects, amongst others. In other words, the accuracy of estimating TLB(O) may be compromised in the hard approach. In the fuzzy approach, the idea is to estimate TMAn(O) by considering the full histogram HO(s) instead of using just the fixed mean SUV μs.
In this approach, for an SUV value (random variable value) s = IS(v) at a voxel v in the SUV image IS of a normal organ O, we will devise a normality map NMO(s) based on HO(s) such that NMO(s) expresses the likelihood of s denoting normal tissue of O at voxel v. We will employ this likelihood as a weight (fuzzy membership) value to modulate the contribution of v to in a patient image, as expressed in Equation (4).
(4) |
Three hyperparameters, a, b, and w, are involved in NMO(s), and empirical values are utilized in this work, where a = 1, b = 1.5, and w = 1. For normal datasets, we expect and TLB(O) estimated from (1) to be close to 0. When IS is an SUV image of a patient, we expect high uptake regions at some voxels in O in IS, where those voxels will contribute maximally to TMA(O) but very little to . Thus, if standardization is accurate, in the case of normal subjects and patients with very mild disease, we expect TLB(O) to be close to 0 on the negative or positive side.
3. EXPERIMENTS, RESULTS, AND DISCUSSION
3.1. Experiments
The proposed DQ method is evaluated on clinical and phantom datasets separately. The retrospective study was conducted following approval from the Institutional Review Board at the Hospital of the University of Pennsylvania along with a Health Insurance Portability and Accountability Act waiver. The thoracic regions of 34 18F-fluorodeoxyglucose (FDG)-PET/CT scans from normal subjects are utilized in the evaluation experiments. Masks of 4 objects are manually delineated on the CT images, including left lung (LLg), right lung (RLg), thoracic esophagus (TEs), and thoracic skeleton (TSk), as in the example shown in Figure 2(a) where the RLg mask is overlaid on the PET image. DQ on lesions is evaluated on 20 samples of the SNM Germanium Phantom datasets, where the phantom was filled with 68Ge (271-day half-life) in an epoxy matrix and the data were acquired using a GE Discovery STE-16 PET/CT scanner (with 16-row MDCT) [14]. Lesions are represented by spheres of 6 different sizes, respectively with diameters of 10 mm, 13 mm, 17 mm, 22 mm, 28 mm, and 37 mm. Masks of normal background and lesions are generated by thresholds as shown in Figure 2(b). When conducting DQ on a single lesion, a box extended by 4 mm beyond the bounding box of the binary lesion is generated for each lesion, representing a local region of interest containing both normal background and lesion, as shown in Figure 2(c). In the stage of model building, 20 out of 34 samples and 10 out of 20 samples are selected as the modeling datasets, where the remaining 14 and 10 samples are taken as the testing datasets. Histograms of the 4 thoracic organs and background normal tissue of phantom are created for generating SUV models as in (4).
3.2. Results and discussions
The results are summarized in Table 1. The ratio of absolute TLB to TMA is utilized as the evaluation metric, which alleviates the individual influence of both volume size and uptake level of the sample. We also compare the ratios of the results derived from hard and fuzzy strategies for estimating TMAn.
Table 1.
LLg | RLg | TEs | TSk | Background | Lesion (10 mm) | Lesion (13 mm) | Lesion (17 mm) | Lesion (22 mm) | Lesion (28 mm) | Lesion (37 mm) | |
---|---|---|---|---|---|---|---|---|---|---|---|
0.172 | 0.165 | 0.14 | 0.12 | 0.002 | 0.874 | 0.719 | 0.596 | 0.603 | 0.631 | 0.667 | |
0.178 | 0.161 | 0.125 | 0.088 | 0.002 | 0.15 | 0.037 | 0.027 | 0.014 | 0.006 | 0.003 | |
0.032 | 0.052 | 0.017 | 0.036 | 0.003 | 0.946 | 0.968 | 0.978 | 0.985 | 0.989 | 0.992 | |
0.029 | 0.03 | 0.053 | 0.048 | 0 | 0.027 | 0.008 | 0.002 | 0.001 | 0.001 | 0.001 |
1st values represent the mean of the ratio, and 2nd values represent the standard deviation of the ratio.
The fuzzy strategy shows better performance in both actual datasets and phantoms. For normal organs in the clinical PET image datasets, as well as the background tissue in the phantom PET image datasets, the ratios of absolute TLB to TMA are close to 0 representing disease-free regions. The ratios estimated by the fuzzy strategy show overall lower mean values than the hard strategy, except in the case of phantom normal background region while the fuzzy strategy reaches a standard deviation value lower than 0.001. For phantom lesions, TLB values are calculated within the extended boxes and TMA values are determined in the lesion region only. The ratios are close to 1, representing that the contribution to TLB within the boxes is minimal from the background normal tissue and mainly from the lesion region. Moreover, for the lesions with larger diameters, the ratio also reaches closer to 1, showing degrading accuracy with decreased ability to account for partial volume effect in smaller lesions.
4. CONCLUSIONS
In this work, we introduce a novel pipeline of disease quantification from scanned (near) whole-body PET/CT images to estimate disease quantity from anatomically pre-defined objects of interest. The SUV models are constructed based on a dataset of disease-free subjects. The normality potion can be estimated and subtracted from the total metabolic activity, enabling the estimation of disease quantity without the requirement for precise segmentation of lesions. Two ways of estimating TMAn are explored, including the hard strategy of using a consistent normality value and the fuzzy strategy of determining normality maps to account for various random and fuzzy phenomena. The fuzzy strategy demonstrated better performance on both normal objects and lesions due its ability to account for inaccuracies in various processes starting from the image acquisition stage.
ACKNOWLEDGMENTS
The research reported in this paper is supported by an NIH grant R01CA255748. J. Li is supported by China Postdoctoral Science Foundation via fellowship 2022M711849.
References
- [1].Hutchings M, and Barrington SF, “PET/CT for therapy response assessment in lymphoma,” Journal of nuclear medicine, 50(Suppl 1), 21S–30S, (2009). [DOI] [PubMed] [Google Scholar]
- [2].Brito AE, Santos A, Sasse AD, Cabello C, Oliveira P, Mosci C, Souza T, Amorim B, Lima M, Ramos CD, and Etchebehere E, “18F-fluoride PET/CT tumor burden quantification predicts survival in breast cancer,” Oncotarget, 8(22), 36001, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [3].Lodge MA, “Repeatability of SUV in oncologic 18F-FDG PET,” Journal of Nuclear Medicine, 58(4), 523–532 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].Zaidi H, and El Naqa I, “PET-guided delineation of radiation therapy treatment volumes: a survey of image segmentation techniques, “ European journal of nuclear medicine and molecular imaging, 37(11), 2165–2187, (2010). [DOI] [PubMed] [Google Scholar]
- [5].Westerterp M, Pruim J, Oyen W, Hoekstra O, Paans A, Visser E, van Lanschot J, Sloof G, and Boellaard R, “Quantification of FDG PET studies using standardised uptake values in multi-centre trials: effects of image reconstruction, resolution and ROI definition parameters, “ European journal of nuclear medicine and molecular imaging, 34(3), 392–404, (2007). [DOI] [PubMed] [Google Scholar]
- [6].Aide N, Lasnon C, Veit-Haibach P, Sera T, Sattler B, and Boellaard R, “EANM/EARL harmonization strategies in PET quantification: from daily practice to multicentre oncological studies,” European journal of nuclear medicine and molecular imaging, 44(1), 17–31, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Zhao X, Li L, Lu W, and Tan S, “Tumor co-segmentation in PET/CT using multi-modality fully convolutional neural network,” Physics in Medicine & Biology, 64(1), 015011, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [8].Xiong X, Smith BJ, Graves SA, Sunderland JJ, Graham MM, Gross BA, Buatti JM, and Beichel RR, “Quantification of uptake in pelvis F-18 FLT PET-CT images using a 3D localization and segmentation CNN,” Med. Phys, 49(3), 1585–1598, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Seifert R, Herrmann K, Kleesiek J, Schäfers M, Shah V, Xu Z, Chabin G, Grbic S, Spottiswoode B, and Rahbar K, “Semiautomatically quantified tumor volume using 68Ga-PSMA-11 PET as a biomarker for survival in patients with advanced prostate cancer,” Journal of Nuclear Medicine, 61(12), 1786–1792, (2020). [DOI] [PubMed] [Google Scholar]
- [10].Burger IA, Vargas HA, Apte A, Beattie BJ, Humm JL, Gonen M, Larson SM, and Schmidtlein CR, “PET quantification with a histogram derived total activity metric: superior quantitative consistency compared to total lesion glycolysis with absolute or relative SUV thresholds in phantoms and lung cancer patients, “ Nuclear medicine and biology, 41(5), 410–418, (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Mortazi A, Udupa JK, Tong Y, and Torigian DA, “A post-acquisition standardization method for positron emission tomography images, “ Medical Imaging 2020: Computer-Aided Diagnosis. SPIE, 11314: 907–912 (2020). [Google Scholar]
- [12].Agrawal V, Udupa JK, Tong Y, and Torigian DA, “BRR-Net: A tandem architectural CNN–RNN for automatic body region localization in CT images, “ Med. Phys, 47(10), 5020–5031, (2020). [DOI] [PubMed] [Google Scholar]
- [13].Udupa JK, Liu T, Jin C, Zhao L, Odhner D, Tong Y, Agrawal V, Pednekar G, Nag S, Kotia T, Goodman M, Wileyto EP, Mihailidis D, Lukens JN, Berman AT, Stambaugh J, Lim T, Chowdary R, Jalluri D, Jabbour SK, Kim S, Reyhan M, Robinson CG, Thorstad WL, Choi JI, Press R, Simone CB 2nd, Camaratta J, Owens S and Torigian DA, “Combining natural and artificial intelligence for robust automatic anatomy segmentation: Application in neck and thorax auto-contouring,” Med. Phys, 49(11): 7118–7149, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Doot R, and Kinahan P, “SNM lesion phantom report. Technical report, University of Washington, “ (2007). [Google Scholar]