Skip to main content

Advertisement

Log in

A real-time temporal Bayesian architecture for event surveillance and its application to patient-specific multiple disease outbreak detection

  • Published:
Data Mining and Knowledge Discovery Aims and scope Submit manuscript

Abstract

Reliable and accurate detection of disease outbreaks remains an important research topic in disease outbreak surveillance. A temporal surveillance system bases its analysis on data not only from the most recent time period, but also on data from previous time periods. A non-temporal system only looks at data from the most recent time period. There are two difficulties with a non-temporal system when it is used to monitor real data which often contain noise. First, it is prone to produce false positive signals during non-outbreak time periods. Second, during an outbreak, it tends to release false negative signals early in the outbreak, which can adversely affect the decision making process of the user of the system. We conjecture that by converting a non-temporal system to a temporal one, we may attenuate these difficulties inherent in a non-temporal system. In this paper, we propose a Bayesian network architecture for a class of temporal event surveillance models called BayesNet-T. Using this Bayesian network architecture, we can convert certain non-temporal surveillance systems to temporal ones. We apply this architecture to a previously developed non-temporal multiple-disease outbreak detection system called PC and create a temporal system called PCT. PCT takes Emergency Department (ED) patient chief complaint data as its input. The PCT system was constructed using both data (non-outbreak diseases) and expert assessments (outbreak diseases). We compare PCT to PC using a real influenza outbreak. Furthermore, we compare PCT to both PC and the classic statistical methods CUSUM and EWMA using a total of 240 influenza and Cryptosporidium disease outbreaks created by injecting stochastically simulated outbreak cases into real ED admission data. Our results indicate that PCT has a smaller mean time to detection than PC at low false alarm rates, and that PCT is more stable than PC in that once an outbreak is detected, PCT is better at maintaining the detection signal on future days.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Baron MI et al (2002) Bayes and asymptotically pointwise stopping rules for the detection of influenza outbreaks. In: Gastonis C, Kass RE, Carriquiry A (eds) Case studies in Bayesian statistics. Springer–Verlag, New York

    Google Scholar 

  • Bos T, Fetherston TA (1992) Market model nonstationarity in the Korean stock market. In: Rhee SG, Chang RP (eds) Pacific-Basin capital markets research, 3rd edn. Elsevier, North-Holland, Amsterdam

    Google Scholar 

  • Box G, Jenkins G, Reinsel R (1994) Time series analysis: forecasting and control. Prentice Hall, Englewood Cliffs

    MATH  Google Scholar 

  • Burges C (1998) A tutorial on support vector machines for pattern recognition. Data Min Knowl Disc 2(2): 121–167

    Article  Google Scholar 

  • Cooper GF, Dash DH, Levander JD, Wong WK, Hogan WR, Wagner MM (2004) Bayesian biosurveillance of disease outbreaks. In: Proceedings of the 20th conference on uncertainty in artificial intelligence, AUAI Press, Arlington, Virginia, pp 94–103

  • Cooper GF, Dowling JN, Lavender JD, Sutovsky P (2007) A Bayesian algorithm for detecting CDC category A outbreak diseases from emergency department chief complaints. Adv Disease Surveill 2: 45

    Google Scholar 

  • Fawcett T, Provost F (1999) Activity monitoring: noticing interesting changes in behavior. In: Proceedings of the fifth SIGKDD conference on knowledge discovery and data mining, ACM Press, San Diego, California, pp 53–62

  • Hamilton J (1994) Time series analysis. Princeton University Press, Princeton

    MATH  Google Scholar 

  • Hogan W, Cooper GF, Wallstrom G, Wagner M (2007) The Bayesian aerosol release detector: an algorithm for detecting and characterizing outbreaks caused by an atmospheric release of bacillus anthracis. Stat Med 26(29): 5225–5252

    Article  MathSciNet  Google Scholar 

  • Jiang X (2007) A Bayesian network for predicting an epicurve. Adv Disease Surveill 2: 15

    Google Scholar 

  • Jiang X (2008) A Bayesian network model for spatio-temporal event surveillance, Ph.D. Thesis, Department of Biomedical Informatics, University of Pittsburgh

  • Jiang X, Cooper GF, Neill DB (2009) A Bayesian network model for spatial event surveillance. Int J Approx Reason. doi:10.1016/j.ijar.2009.01.001

  • Jiang X, Wallstrom GL (2006) A Bayesian network for outbreak detection and prediction. In: Proceedings of AAAI-06, Boston, Massachusetts, pp 1166–1160

  • Kulldorff M (1997) A spatial scan statistic. Commun Stat Theory Methods 26(6): 1481–1496

    Article  MATH  MathSciNet  Google Scholar 

  • Kulldorff M (2004) Satscan v. 4.0: software for the spatial and space-time scan statistics, Technical Report, Information Management Services, Inc.

  • Kulldorff M, Heffernan R, Hartman J, Assunco R, Mostashari F (2005) Space-time permutation scan statistic for disease outbreak detection. PLoS Med 2: 216–224

    Article  Google Scholar 

  • Kulldorff M, Mostashari F, Luiz D, Yih K, Kleinman K, Platt R (2007) Multivariate scan statistics for disease surveillance. Stat Med 26: 1824–1833

    Article  MathSciNet  Google Scholar 

  • Montgomery DC (2001) Introduction to statistical quality control. Wiley, New York

    Google Scholar 

  • Moore A (2001a) A powerpoint tutorial on hidden Markov models, available at http://www.cs.cmu.edu/~awm/781/timetable.html

  • Moore A (2001b) A powerpoint tutorial on support vector machines, available at http://www.cs.cmu.edu/~awm/781/timetable.html

  • Moore A, Anderson B, Das K, Wong WK (2006) Combining multiple signals for biosurveillance. In: Wagner M (eds) Handbook of biosurveillance. Elsevier, New York

    Google Scholar 

  • Neill DB, Moore AW, Cooper GF (2005a) A Bayesian spatial scan statistic. Adv Neural Inform Process Syst (NIPS) 18: 1003–1010

    Google Scholar 

  • Neill DB, Moore AW, Sabnani M, Daniel K (2005b) Detection of emerging space-time clusters. In: Proceedings of 11th ACM SIGKDD international conference on knowledge discovery and mining, Chicago, Illinois, pp 218–227

  • Rabiner LR (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proc IEEE 77(2): 257–286

    Article  Google Scholar 

  • Reis BR, Mandl KD (2003) Time series modeling for syndromic surveillance. BMC Med Inform Dec Making 3(2)

  • Reis BY, Pagano M, Mandl KD (2003) Using temporal context to improve biosurveillance. PNAS 100(4): 1961–1965

    Article  Google Scholar 

  • Serfling RE (1963) Methods for current statistical analysis of pneumonia-influenza deaths. Public Health Rep 78(6): 494–506

    Google Scholar 

  • Shmueli G, Fienberg S (2006) Current and potential statistical methods for monitoring multiple data streams for biosurveillance. In: Wilson A, Wilson GD, Olwell D (eds) Statistical methods in counterterrorism. Springer, New York

    Google Scholar 

  • Stirling R, Aramini J, Ellis A, Gillien L, Meyers R, Flevry M, Werker D (2001) Waterborne cryptosporidiosis outbreak, North Battleford, Saskatchewan, spring 2001. Can Commun Disease Rep 27(22): 185–192

    Google Scholar 

  • Soneson C, Bock D (2003) A review and discussion of prospective statistical surveillance in public health. JR Stat Soc A 166(1): 5–21

    Article  Google Scholar 

  • Sun L, Shenoy P (2007) Using Bayesian networks for bankruptcy prediction: some methodological issues. Eur J Oper Res 180(2): 738–753

    Article  MATH  Google Scholar 

  • Tsui FC, Wagner MM, Dato V, Chang HC (2001) Value of ICD-9-coded chief complaints for detection of epidemics. Symp J Am Med Inform Assoc 9: 4–47

    Google Scholar 

  • Wong WK, Moore A (2006) Classical time series methods for biosurveillance. In: Wagner M (eds) Handbook of biosurveillance. Elsevier, New York

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xia Jiang.

Additional information

Responsible editor: R. Bharat Rao and Romer Rosales.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jiang, X., Cooper, G.F. A real-time temporal Bayesian architecture for event surveillance and its application to patient-specific multiple disease outbreak detection. Data Min Knowl Disc 20, 328–360 (2010). https://doi.org/10.1007/s10618-009-0151-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10618-009-0151-4

Keywords

Navigation