A statistical method for evaluation quality of medical images: a case study in bit discarding and image compression

doi:10.1016/j.compmedimag.2004.01.003

Computerized Medical Imaging and Graphics

Volume 28, Issue 4, June 2004, Pages 167-175

https://doi.org/10.1016/j.compmedimag.2004.01.003 Get rights and content

Abstract

Many studies have been performed on quality evaluation for subtle differences in medical images. However, only limited success has been achieved. In this paper, medical images were prior manipulated by denoising, lossy compression and filtering. The Moran statistics is then applied to extract spatial information of images and using Kolmogorov–Smirnov (KS) test to determine whether the manipulated and original images differ significantly. Results show that on average discarding 1–2 bits in T1 and CR images or 2–3 bits in T2 and body CT images are indistinguishable. This method is also applied to a reconstructed MR, body CT image and an electronic SMPTE (Society of Motion Picture and Television Engineer) phantom from lossy image compression software. Compression ratios of 16:1 for a MR image, 8–9:1 for a cropped body CT image, 7:1 and 5:1 for high- and low-resolution regions in electronic phantom is proved undifferentiated from original. The proposed method is useful for complementing the human visual system, to optimize the performance of image compression technique.

Introduction

Medical images are often deteriorated by noise due to various interference sources and other phenomena that affect the measurement process in imaging acquisition systems. If the small difference between normal and abnormal tissues is contaminated by noise, direct analysis of the acquired images is often more difficult [1]. Noise in medical images also degrades the correlation between pixels. As a result, the reversible image compression ratio can only achieve a maximum compression ratio of about 2–4 [2].

Steward et al. [3] suggested that 5–6 least significant bits (LSBs) are spectrally white in an image by applying the power spectrum method to verify each bit plane in a magnetic resonance (MR) image. They concluded that the gray level dynamic range of a MR image spanned only six or seven bits at most. Chan et al. [4] showed that the detection accuracy was independent of pixel depth for discarded up to three LSB planes from a digitized mammography with 12 bits depth.

Olsen [5] estimated the standard deviation (STD) for Gaussian and uniform distributed white additive noise in images. He found that the use of STD to estimate noise in the residual images was most reliable. The residual image was obtained by subtracting the pre-filtered image from the original image, where smoothed image is obtained by average or median filters. Chuang et al. [6], [7], [8] reported that 8–9 information bits exist in medical images and 4–5 bits contain only noise. Discarding these noisy bits would not visually deteriorate image quality, as indicated in their study. However, the true effect on the image has not been verified.

With the evolution of teleradiology, the use of lossy compression technique is a requirement for reducing the image volume and speeding up communication [9]. However, greater compression is obtained at the expense of image quality. Efforts have been devoted to evaluate the subtle differences between images using visual or numerical methods. Many studies had tried to build a visual threshold or acceptance level of lossy image compression. However, the radiological tasks is widely varied and the building of visual threshold is task dependent and time consuming [2], [10], [11], [12], [13]. The normalized root mean square error (NMSE) or mean square error (MSE) for numerical estimation was most commonly used to measure the degradation in reconstructed images. The guidance document for Picture Archiving and Communication System (PACS) also requires that manufacturers report to FDA, the NMSE of their lossy compression techniques [14]. Nevertheless, NMSE and MSE are found to neither correlate nor provide any quality information [2], [10], [13], [15], [16].

The objective image quality metrics, visual discrimination model (VDM) and visible differences predictor (VDP), has been developed in past years. These metrics combined perceptual quality measurement by taking into account human visual system (HVS) features [16], [17], [18], [19], [20], [21], [22]. Since a human observer is the end user of image quality measurement, image quality model based on HVS seemed to be more appropriate for user perceptual. Recently, the detection threshold of artifact in medical image subject to JPEG and JPEG 2000 compression was reported [23]. They mentioned that the VDM predictions of artifact visibility were highly correlated with observer performance. However, it was reported [21], [22], [24] that none of the objective quality model outperformed others by comparing nine proponent objective quality models and all of them were statistically indistinguishable with the peak signal to noise ratio (PSNR).

Recently, Wang and Bovic [25] suggested a Q index to estimate image quality. The image spatial information is estimated from a local region with a 8×8 window size instead of from pixel, as indicated by their study. They showed that the variation of the Q index for a manipulated image was accordant with subjective image quality metrics. Chen et al. [26], [27] applied the Moran statistics in a local region to extract image spatial information. The sharpness or smoothness in image is indicated well by this statistics [27]. In this study we propose to evaluate spatial information in image by using Moran statistics and then a statistical method is applied to determine if this subtle difference between images is due to chance.

Section snippets

Method and material

The image was first manipulated to generate subtle differences images from the original. The image manipulation includes discarding LSBs as denoising, lossy image compression at low compression ratio and average filter blurring. The Z values on images were then estimated pixel-by-pixel using the Moran statistics with an 11×11 window [7], [8] and centered on it. Two cumulative Z value distributions were produced by collecting all Z values of image and then integrated. The Kolmogorov–Smirnov (KS)

LSBs manipulation

The LSBs in the image were considered as noisy bits and discarded using the whole-bit plane or adaptive methods. With the adaptive method, pixels of original image were discarded n_b(x,y) or a conservative level n_b(x,y)−1 bits. The mean discarding bits $n ̄_{b}$ and $n_{b} −1$ of the image could be obtained by averaging n_b(x,y) and n_b(x,y)−1 on images, as listed in Table 1, Table 2, Table 3, Table 4 for the T1, T2, CT and CR images. On average, the highest noising level is the CR images followed by the CT,

Denoising

On average the order of STD estimation is CR>CT>T2>T1, but the KS test shows different aspects. In spite of highest STD, the CR image hardly passed the KS test even at 1 LSB plane manipulation. However, CT and T2 images accepted the null hypothesis with 2-bit planes manipulation. The D value of T1 images fell in the critical region with $n_{b} −1$ (with mean=1.58 bits) bits discarding. The order of discarding level of noisy bits should be CT>T2>T1>CR. Although the result contradicts the STD value of

Summary

In this report, we proposed a method to prove indistinguishableness statistically for subtle differences between images. The proposed method has shown that the discrepancy between an original and bits-manipulated image could be attributed to chance, meaning statistically that the LSBs could be discarded without any image degradation. With lossy image compression, this method has shown to be consistent with the results of human observer and model observer. The Moran Z values were very sensitive

Acknowledgements

The medical images and SMPTE test patterns used in this work were obtained with the help of Jim-Chao Chuang from the Department of Medical Imaging and Alex Hsu from the Department of Radiation Oncology, Chung Shan Medical and Dental College Hospital, Tai Chung, Taiwan.

Tzong-Jer Chen currently is a Postdoctoral Researcher at Department of Nuclear Medicine, National PET/Cyclotron Center, Taipei Veterans General Hospital, Taipei, Taiwan. He received his PhD in Medical Physics and MS in Health Physics both from the National Tsing-Hua University, Taiwan. His research interests include image analysis, medical image quality evaluation and inverse planning for radiation therapy.

References (31)

S.I. Olsen
Estimation of noise in images: an evaluation
CVGIP
(1993)
H.K. Huang
Teleradiology technologies and some service models
Comput Med Imaging Graphics
(1996)
I.N. Bankmyan
Handbook of medical imaging processing and analysis
(2000)
B.J. Erickson
Irreversible compression of medical images
(2000)
B.K. Steward et al.
Gray level dynamic range in magnetic resonance imaging
SPIE Proc
(1986)
H.P. Chan et al.
Digitization requirements in mammography: effects on computer-aided detection of microcalcifications
Med Phys
(1994)
K.S. Chuang et al.
Comparison of chi-square and joint-count methods for evaluating digital image data
IEEE Trans Med Imaging
(1992)
K.S. Chuang et al.
Assessment of noise in a digital image using the join-count statistic and Moran test
Phys Med Biol
(1992)
K.S. Chuang et al.
Noise content analysis in clinical digital images
Radiographics
(1994)
R. Burgul et al.
Methods of measurement of image quality in teleultrasound
Br J Radiol
(2000)

A. Kalyanpur et al.

Evaluation of JPEG and wavelet compression of body CT images for direct digital teleradiologic transmission

Radiology

(2000)

R.M. Slone et al.

Assessment of visually lossless irreversible image compression: comparison of three methods by using an image-comparison workstation

Radiology

(2000)

S. Wong et al.

Radiologic image compression—a review

Proc IEEE

(1995)

Center for Devices and Radiological Health, The US Food and Drug Agency. Guidance for the submission of premarket...

B. Girod

What's wrong with mean square error?

Cited by (18)

A neural network based framework for effective laparoscopic video quality assessment
2022, Computerized Medical Imaging and Graphics
Citation Excerpt :
Monitoring and assurance of a good video quality is a very critical task in various applications like video streaming, video surveillance, medical procedures and underwater exploration (Wang, 2011). Any loss of useful information in these videos, resulting from some kinds of video distortion, may not only affect the visual experience but could also cause fatal consequences like in the fields of video surveillance (Beghdadi et al., 2018) and medical imaging (Amirrashedi et al., 2021; Chen et al., 2004). For instance, one of the most common medical procedures, where having a good video quality becomes highly desirable, is the laparoscopic/endoscopic surgery.
Video quality assessment is a challenging problem having a critical significance in the context of medical imaging. For instance, in laparoscopic surgery, the acquired video data suffers from different kinds of distortion that not only hinder surgery performance but also affect the execution of subsequent tasks in surgical navigation and robotic surgeries. For this reason, we propose in this paper neural network-based approaches for distortion classification as well as quality prediction. More precisely, a Residual Network (ResNet) based approach is firstly developed for simultaneous ranking and classification task. Then, this architecture is extended to make it appropriate for the quality prediction task by using an additional Fully Connected Neural Network (FCNN). To train the overall architecture (ResNet and FCNN models), transfer learning and end-to-end learning approaches are investigated. Experimental results, carried out on a new laparoscopic video quality database, have shown the efficiency of the proposed methods compared to recent conventional and deep learning based approaches.
Characterisation of heterogeneity and spatial autocorrelation in phase separating mixtures using Moran's I
2018, Journal of Colloid and Interface Science
Citation Excerpt :
As shown in the images in Fig. 1, increased clustering leads to higher values of Moran’s I. Where pixel intensities are randomly distributed, Moran’s I is equal to 0, and more alternating features lead to lower, more negative, values of Moran’s I [34,36]. In the MRI studies, Moran’s I has been used to assess noise levels [30,31,35], study neural networks [16] and distribution of fat in muscles [34]. In the study by Derado et al. [16], Moran’s I was used to investigate neural networks, which were identified using segmentation techniques.
In complex colloidal systems, particle-poor regions can develop within particle-rich phases during sedimentation or creaming. These particle-poor regions are overlooked by 1D profiles, which are typically used to assess particle distributions in a sample. Alternative methods to visualise and quantify these regions are required to better understand phase separation, which is the focus of this paper. Magnetic resonance imaging has been used to monitor the development of compositional heterogeneity in a vesicle-polymer mixture undergoing creaming. T₂ relaxation time maps were used to identify the distribution of vesicles, with vesicle-poor regions exhibiting higher T₂ relaxation times than regions richer in vesicles. Phase separated structures displayed a range of different morphologies and a variety of image analysis methods, including first-order statistics, Fourier transformation, grey level co-occurrence matrices and Moran’s I spatial autocorrelation, were used to characterise these structures, and quantify their heterogeneity. Of the image analysis techniques used, Moran’s I was found to be the most effective at quantifying the degree and morphology of phase separation, providing a robust, quantitative measure by which comparisons can be made between a diverse range of systems undergoing phase separation. The sensitivity of Moran’s I can be enhanced by the choice of weight matrices used.
Protecting patient privacy against unauthorized release of medical images in a group communication environment
2005, Computerized Medical Imaging and Graphics
In this paper, we identify and study an important patient privacy protection problem related to medical images. Following Health Insurance Portability and Accountability Act (HIPAA) mandate on privacy protection of patients' medical records, efforts have been devoted to guaranteeing the confidentiality of data and medical images during storage and transmission via an untrustworthy channel. However, to our knowledge, there has not been any effort towards protecting against unauthorized release of images by an authorized recipient. In this paper, we study the problem of tracing illegally distributed medical images in a group communication environment and identify a set of design requirements that must be met. We propose a fingerprint model suitable for many-to-many multicast, that is computationally efficient and scalable in user storage and key update communication. Simulation results also show that our scheme is highly robust to typical medical image processing and collusion attacks, while yielding high quality watermarked images.
An FCM-Based Image De-Noising with Spatial Statistics Pilot Study
2023, Applied Sciences (Switzerland)
A FCM-based image de-noising with spatial statistics pilot study
2022, Research Square
Compressed medical image quality determination using the kolmogorov- smirnov test
2017, Current Medical Imaging Reviews

View all citing articles on Scopus

Keh-Shih Chuang received both his BS in Nuclear Engineering and MS in Health Physics from the National Tsing-Hua University, Taiwan, and his PhD in Medical Physics from UCLA. He is currently an Associate Professor at the Department of Nuclear Science in the National Tsing-Hua University, Taiwan. His research interests are quality evaluation of medical image, scatter correction in PET image, and inverse planning for radiation therapy.

Yuang-Chin Chiang received his PhD degree in Statistics in 1988 from Florida State University, USA. He is currently an Associate Professor in the Institute of Statistics of the National Tsing-Hua University (Taiwan). His major research interest at present is in speech recognition, especially that of the Taiwanese language.

Jen-Hao Chang received his BS in Radiological Science and Technology from the National Yang-Ming University, Taiwan. Then he received his MS in Medical Physics from Department of Nuclear Science, National Tsing-Hua University, Taiwan. He is currently an Associate Researcher of the Biomedical Engineering Center, Industrial Technology Research Institute (ITRI), Taiwan. His interests are medical imaging, medical engineering, bio-photonics, radiation applications in biomedical technology.

Ren-Shyan Liu received his MD degree in medicine from National Defense Medical Center in Taiwan in 1997. From September 1997, he works at the Department of Nuclear Medicine, Taipei Veterans General Hospital and also was a faculty at National Defense Medical Center and National Yang-Ming University Medical School. He is now a staff physician at the Department of Nuclear Medicine, Taipei Veterans General Hospital, and the Associate Professor and Chief of Division of Nuclear Medicine at National University Medical School. He is also in charge of PET Gene Probe Core for National Research Program of Genomic Medicine in Taiwan. He is on the Editorial Board of Journal of Nuclear Medicine, European Journal of Nuclear Medicine, and Annals of Nuclear Medicine. His current research interests include molecular image, gene expression imaging, and application of PET in oncology, especially tumor hypoxia and C-11 acetate tumor imaging.

View full text

A statistical method for evaluation quality of medical images: a case study in bit discarding and image compression

Abstract

Introduction

Section snippets

Method and material

LSBs manipulation

Denoising

Summary

Acknowledgements

CVGIP

Comput Med Imaging Graphics

Handbook of medical imaging processing and analysis

Irreversible compression of medical images

Gray level dynamic range in magnetic resonance imaging

SPIE Proc

Digitization requirements in mammography: effects on computer-aided detection of microcalcifications

Med Phys

Comparison of chi-square and joint-count methods for evaluating digital image data

IEEE Trans Med Imaging

Assessment of noise in a digital image using the join-count statistic and Moran test

Phys Med Biol

Noise content analysis in clinical digital images

Radiographics

Methods of measurement of image quality in teleultrasound

Br J Radiol

Evaluation of JPEG and wavelet compression of body CT images for direct digital teleradiologic transmission

Radiology

Assessment of visually lossless irreversible image compression: comparison of three methods by using an image-comparison workstation

Radiology

Radiologic image compression—a review

Proc IEEE

What's wrong with mean square error?