Citation: Sudip Bhattacharya. Predicting emerging and re-emerging disease outbreaks through internet search trends: An analysis from India[J]. AIMS Public Health, 2019, 6(1): 1-3. doi: 10.3934/publichealth.2019.1.1
[1] | Testimony J Olumade, Oluwafolajimi A Adesanya, Iyanuoluwa J Fred-Akintunwa, David O Babalola, Judith U Oguzie, Olusola A Ogunsanya, Uwem E George, Oluwawapelumi D Akin-Ajani, Damilola G Osasona . Infectious disease outbreak preparedness and response in Nigeria: history, limitations and recommendations for global health policy and practice. AIMS Public Health, 2020, 7(4): 736-757. doi: 10.3934/publichealth.2020057 |
[2] | Mehreen Tariq, Margaret Haworth-Brockman, Seyed M Moghadas . Ten years of Pan-InfORM: modelling research for public health in Canada. AIMS Public Health, 2021, 8(2): 265-274. doi: 10.3934/publichealth.2021020 |
[3] | Mohammed A. Soghaier, Khwaja M.I. Saeed, Khushhal K. Zaman . Public Health Emergency of International Concern (PHEIC) has Declared Twice in 2014; Polio and Ebola at the Top. AIMS Public Health, 2015, 2(2): 218-222. doi: 10.3934/publichealth.2015.2.218 |
[4] | Rosemary Mamka Anyona, Maximilian de Courten . An Analysis of the Policy Environment Surrounding Noncommunicable Diseases Risk Factor Surveillance in Kenya. AIMS Public Health, 2014, 1(4): 256-274. doi: 10.3934/publichealth.2014.4.256 |
[5] | Soufiane Bentout, Abdennasser Chekroun, Toshikazu Kuniya . Parameter estimation and prediction for coronavirus disease outbreak 2019 (COVID-19) in Algeria. AIMS Public Health, 2020, 7(2): 306-318. doi: 10.3934/publichealth.2020026 |
[6] | Ana Gama, Maria O. Martins, Sónia Dias . HIV Research with Men who Have Sex with Men (MSM): Advantages and Challenges of Different Methods for Most Appropriately Targeting a Key Population. AIMS Public Health, 2017, 4(3): 221-239. doi: 10.3934/publichealth.2017.3.221 |
[7] | Abdulqadir J. Nashwan, Rejo G. Mathew, Reni Anil, Nabeel F. Allobaney, Sindhumole Krishnan Nair, Ahmed S. Mohamed, Ahmad A. Abujaber, Abbas Balouchi, Evangelos C. Fradelos . The safety, health, and well-being of healthcare workers during COVID-19: A scoping review. AIMS Public Health, 2023, 10(3): 593-609. doi: 10.3934/publichealth.2023042 |
[8] | Joseph Inungu, Kechi Iheduru-Anderson, Ossam J Odio . Recurrent Ebolavirus disease in the Democratic Republic of Congo: update and challenges. AIMS Public Health, 2019, 6(4): 502-513. doi: 10.3934/publichealth.2019.4.502 |
[9] | Gabriel Owusu, Han Yu, Hong Huang . Temporal dynamics for areal unit-based co-occurrence COVID-19 trajectories. AIMS Public Health, 2022, 9(4): 703-717. doi: 10.3934/publichealth.2022049 |
[10] | Terri Rebmann, Kristin D. Wilson, Travis Loux, Ayesha Z. Iqbal, Eleanor B. Peters, Olivia Peavler . Outcomes, Approaches, and Challenges to Developing and Passing a Countywide Mandatory Vaccination Policy: St. Louis County’s Experience with Hepatitis A Vaccine for Food Service Personnel. AIMS Public Health, 2016, 3(1): 116-130. doi: 10.3934/publichealth.2016.1.116 |
Early detection is the key for the prevention and control of any communicable diseases. In India the Integrated Disease Surveillance Programme (IDSP) was launched in November 2004 for surveillance purpose. In this portal, most of the Indian districts report data for 22 epidemic prone notifiable diseases.
This one stop portal can also respond to the outbreaks through skilled and well equipped Quick Response Teams (QRTs) [1]. Presently under IDSP, the information is collected on three specified reporting formats, namely “S” (suspected cases/syndromic), “P” (presumptive cases) and “L” (laboratory confirmed cases). “S” form is filled by the health workers to report data on suspected cases/syndromes. “P” form is filled up by Medical Officers to report data on probable/clinically suspected cases. “L” form is designed to collect data on lab confirmed cases.
The obtained data flows from bottom to top via S, P, and L forms. i.e., community to national level. An outbreak is detected by this surveillance mechanism takes about 7 to 10 days, which is dangerous for rapidly spreading diseases like swine flu, bird flu, nipah virus etc. Therefore, supplementary system to this IDSP will play a great role in collecting timely data on epidemic prone communicable diseases which may minimize the influence of unprecedented outbreaks. An innovative surveillance mechanism for predicting disease outbreaks embedded with internet search behaviour of the population has currently emerged as a promising technique [2]. Now a days, internet use via cell phone in India is rapidly increasing [3],[4]. Recently, a large proportion of medical or health-related information are flowing to the cell phones via internet [5]. Recent data indicates that primary sources of information flows through the Internet in most of the communities [6]–[9].
Presently, big multinational companies are using search data (after data mining) for monitoring purposes as well as for marketing purpose. Among all, e-news, social network data, and blog data are used as the main source [2]. This whole procedure is commonly known as “nowcasting” [10]. This “nowcasting” process can help us to estimate the magnitude of outbreaks by near real-time in early stages of an outbreak. Additionally, this process can be embedded with the existing healthcare setup. Hence, this approach is quite useful in resource-poor countries where their health systems are already overburdened. It is established from various studies that “Google Trends” may be a useful instrument for disease surveillance [11],[12]. This instrument can act as an add on measure to the existing IDSP system for timely disease surveillance. A study conducted by Verma M et al. in India with an objective to evaluate the temporal correlation between Google Trends and conventional IDSP data in Haryana and Chandigarh (U.T.) [13].
Google search was conducted for febrile illness like malaria, dengue fever, chikungunya, and enteric fever for the abovementioned area in 2016. Those extracted data were compared with the IDSP data (only P form). Google search trends and IDSP reporting showed temporal correlation. A lag of −2 to −3 weeks was observed between google search trends and IDSP data.
That study had some limitations. 1) The study used only the ‘P’ form data of febrile cases as it is difficult to differentiate the fever cases reported in “S” form. 2) Also, “L” form data also was excluded due to delay in lab reporting. 3) External validity was also emerged as an important concern. 4) Due to cultural diversity in India there are language differences. As Indians, we also do use social media/transfer of e-data in our vernacular language. In that study only, English language was used for data retrieval, which underestimates the actual value/burden and thus error was occurred in the calculation of correlation. 5) As Google Trends does not provide data at intra-province level, the established correlation may not help to identify the hotspot of an outbreak or epidemic. 6) Varied strength of positive correlation was observed with all febrile illnesses which may not have much significance. 7) Besides Google search, people may communicate/retrieve data via various search engines like, SIRI, Bing, E- Explorer or they may use social media like Facebook, Instagram or Twitter etc. 8) Lastly, seasonal variation could not be identified as the data was collected for short period of time (1 year only).
Despite of limitations, this google based surveillance system can reinforce the existing IDSP system. This kind of outbreak predicting intelligence system may be experimented at the ground level covering the larger communities in resources constraint areas like India. Mathematical modelling techniques can be incorporated in the future studies for accurate forecasting of epidemics and outbreaks for better adjustment of confounders. Not only “Google search”, other portal sites like “E-explorer”, “SIRI”, “Bing” and other search engines can be used for data mining and disease surveillance in the future.
In spite of the massive potential of this application, this technique should be used as an add-on to the existing surveillance systems. As we are now used to work manually (filling IDSP forms), training and retraining (for filling digital forms) of the peripheral health workers are the key for successful implementation of this intelligence system. However, the results of this study hold promise in Indian scenario for forecasting of emerging and re-emerging diseases outbreaks in near future.
[1] | Ministry of Health & Family Welfare, Government of India. Integrated Disease Surveillance Programme [Internet] New Delhi: Ministry of Health & Family Welfare; c2018 [cited at 2018 Jul 4]. Available from: http://www.idsp.nic.in/. |
[2] |
Milinovich GJ, Williams GM, Clements AC, et al. (2014) Internet-based surveillance systems for monitoring emerging infectious diseases. Lancet Infect Dis 14: 160–168. doi: 10.1016/S1473-3099(13)70244-5
![]() |
[3] | Hoffman DL, Novak TP, Venkatesh A (2004) Has the Internet become indispensable? Commun ACM 47: 37–42. |
[4] | Telecom Regulatory Authority of India. The Indian Telecom Services Performance Indicators, April–June 2017 [Internet] New Delhi: Telecom Regulatory Authority of India; 2017 [cited at 2018 Jul 4]. Available from: http://trai.gov.in/sites/default/files/Performance_Indicator_Reports_28092017.pdf. |
[5] |
Hellawell GO, Turner KJ, Le Monnier KJ, et al. (2000) Urology and the Internet: An evaluation of internet use by urology patients and of information available on urological topics. BJU Int 86: 191–194. doi: 10.1046/j.1464-410x.2000.00831.x
![]() |
[6] | Tonsaker T, Bartlett G, Trpkov C (2014) Health information on the Internet: Goldmine or minefield? Can Fam Physician 60: 407–408. |
[7] |
Hesse BW, Moser RP, Rutten LJ (2010) Surveys of physicians and electronic health information. N Engl J Med 362: 859–860. doi: 10.1056/NEJMc0909595
![]() |
[8] |
Diaz JA, Griffith RA, Ng JJ, et al. (2002) Patients' use of the Internet for medical information. J Gen Intern Med 17: 180–185. doi: 10.1046/j.1525-1497.2002.10603.x
![]() |
[9] |
Moretti FA, Oliveira VE, Silva EM (2012) Access to health information on the Internet: A public health issue? Rev Assoc Med Bras 58: 650–658. doi: 10.1590/S0104-42302012000600008
![]() |
[10] |
Choi H, Varian H (2012) Predicting the present with Google Trends. Econ Rec 88: 2–9. doi: 10.1111/j.1475-4932.2012.00809.x
![]() |
[11] | Bragazzi NL (2013) A Google Trends-based approach for monitoring NSSI. Psychol Res Behav Manage 7: 1–8. |
[12] |
Kang M, Zhong H, He J, et al. (2013) Using Google Trends for influenza surveillance in South China. PLoS One 8: e55205. doi: 10.1371/journal.pone.0055205
![]() |
[13] |
Verma M, Kishore K, Kumar M, et al. (2018) Google search trends predicting disease outbreaks: An analysis from India. Healthc Inform Res 24: 300–308. doi: 10.4258/hir.2018.24.4.300
![]() |
1. | Abhishek Royal, MarceloAmaral Mali, Vaibhav Kumar, IndraniAlhad Wagh, Shashi Bhushan, AvishkarNitin Mokal, Kedar Mehta, Sudip Bhattacharya, Harnessing the potential of the primary healthcare facilities in India to respond COVID-19 pandemic: A scoping evidence-based research synthesis, 2021, 10, 2249-4863, 116, 10.4103/jfmpc.jfmpc_1609_20 | |
2. | Loick Pradel Kojom, Vineeta Singh, A Review on Emerging Infectious Diseases Prioritized Under the 2018 WHO Research and Development Blueprint: Lessons from the Indian Context, 2021, 21, 1530-3667, 149, 10.1089/vbz.2020.2661 |