| Review | Open Access |
|---|
The COVID-19 pandemic has irreversibly changed our approach to public health surveillance, piloting the widespread use of digital epidemiological tools and exposing new possibilities along with ongoing deficits in our surveillance architecture. The current study looked at recent developments in digital epidemiology during the post-pandemic period in terms of evolution, opportunities, and challenges. The study presented important opportunities, such as the ability to provide real-time surveillance, democratized access to epidemiological data, and integrated data sources. Furthermore, it also identified common challenges surrounding data privacy rules, digital divide, technical constraints, and governance systems. It demonstrated that although digital epidemiology holds great promise to improve public health surveillance, to fulfil it, the community must grapple with such key challenges as interoperability, equity, trust, and governance. These findings underscore the need for the post-pandemic period (2020–2025) to be a policy window of opportunities where it would be possible to build strong digital surveillance systems that are sustainable, equitable, and can effectively meet the needs of future public health emergencies whilst protecting individual rights and promoting health equity.
In the context of COVID-19 pandemic, several existing and novel surveillance tools were deployed to aid the public health response. States, public health authorities, and private sectors often defended the use of digital tools (e.g., cell phone geolocation, mobile contact tracing applications, closed-circuit cameras as well as drones). These digital tools were used, in compliance with the public health measures of COVID-19 to limit its spread, in order to survey the population and collect data [1]. Lyon (2007) defines surveillance as “the concentrated, structured, and systematic attention to personal details for purposes of influence, management protection or direction” (p. 14), Moreover, the World Health Organization (WHO) provides a broad-based definition of public health surveillance as sustained watchfulness and the observation of human events related to intervention [2]. Public health surveillance includes both passive (reporting data on routine diseases and programs) and active (a conscious search for information about a disease or condition) surveillance. Active surveillance has been established and sustained both for pandemic and non-pandemic reasons using a mix of passive and active measures that utilize sophisticated data analytics and innovative technological tools. Surveillance data can improve the efficiency and effectiveness of healthcare services by targeting interventions and reporting their effects on the population [3].
Need for this development is the quest to mitigate the implications of pandemic (e.g., illness-related morbidity/mortality, health services burden, social and economic consequences) which has led to a surge in digital innovation supporting public health efforts. Yet, concerns remain that while the pace of this response may be characteristically quick and effective, it also risks itself to backfire in unintended ways. Examples include threats to privacy with incursions on civil liberties and differential surveillance of vulnerable groups or potential erosion of human rights [4]. However, the rapid embrace also laid bare problems: privacy risks, algorithmic bias, and unequal access to technology, inadequate system compatibility, and fragile governance. Now, lessons learned from those experiences guide our thinking as to how stronger and more equitable systems can be created in the future [5]. This study sought to offer an overview of the digital epidemiology landscape, opportunities, and challenges that would shape this field in the future and ongoing gaps that may limit public health preparedness and response.
The current study offered a wide scope of different dimensions of digital epidemiology, covering technological breakthrough, methodological development, implementation issues, ethical considerations, and policy implications. Through aggregating knowledge from recent studies across these areas, this study aimed to support evidence-based decision-making around future digital surveillance infrastructure investments. Moreover, it also provided a roadmap for the prioritization of further research, policy formulation, and capacity building in the post-pandemic era.
Observing the epidemic trend is a basic and necessary part of epidemic prevention and control [6]. During the COVID-19 pandemic, cloud computing, Artificial Intelligence (AI), and big data technologies promoted the transformation of passive observation to active monitoring and early warning for public health big data [7, 8]. The latest information technologies, such as AI and big data make a great contribution to public health surveillance, as some experts pointed out [9]. In particular, AI algorithms can be applied to forecast infection rates based on big data analysis and statistics as well as confirmed case numbers to estimate the overall scale of epidemic. Crucially, trend prediction could inform on the course of the epidemic at large, whether an inflection point would occur and when that might be (if at all) [10]. These projections are critical for lifting the lockdown and reopening production.
Prediction of gene expression patterns can be a fundamental image-analysis factor in transition from passive prevention to active prevention; it also works as a prerequisite for decision-making [11]. Using the tool of mathematical modelling, big data analysis, and Machine Learning (ML) to display realistic epidemic dynamics is conducive to help the management departments take an initiative in controlling the outbreak [12]. Many provinces in China used big data, AI, cloud computing, and other digital technologies to continuously improve the monitoring measures of the epidemic [13].
Europe employ satellites to oversee and reduce the effects of the COVID-19 pandemic. Their satellite navigation and earth observation systems are open to the public, and the data gathered could help inform decisions to combat the epidemic [14].
Smart analysis and predictive decisions are helpful for government’s decision-making, crisis control, and resource deployment. These help managers understand the dynamic development and context of emergencies in that area from useful information through continuous changes, reaching a conclusion quickly [15]. In addition, fast and accurate information exchange is not only an important means to stabilize public mood and social order during COVID-19 epidemic period but also a necessary way for government to improve epidemic control. It depicts the epidemic situation in real-time, and involves official information guidelines, improving the trustworthiness and transparency of data further to prevent misunderstandings from spreading [1].
Table 1. Pre-pandemic vs Post-pandemic Epidemiology
|
Pre-pandemic |
Post-pandemic |
|---|---|
|
Traditional Surveillance Systems |
Digital & hybrid surveillance |
|
Manual Data Collection |
Automated Real-time Data Capture |
|
Hospital and Lab-based Reporting |
EHR, IoT, Social Media Integration |
|
Delayed Reporting (Days-Weeks) |
Rapid Reporting (Minutes-Hours) |
|
Limited Geographic Coverage |
Global Reach and Fine Granularity |
|
Aggregate |
Individual-level +Geospatial Insight |
The concept of AI was first launched in 1956 by McCarthy [1]. AI is capable of miming human decision-making and reasoning with the potential to enhance and augment human intelligence through continuous ML [16]. It combines computer science, mathematics, cognitive science, neurophysiology, and information theory among other fields including expert systems, ML, Natural Language Processing (NLP), automatic planning, and image processing [17]. Medical staff is also important in controlling the spread of a pandemic [18]. AI can support them with additional diagnosis and therapy as well as contribute to reduce their workload, increasing efficiency and quality of healthcare.
Smart AI systems are applied in a range of medical fields. The most commonly used and considered method for these intelligent systems is image inspection. Many medical institutions apply AI functional image recognition technology to analyse large amounts of images and diagnostic data in order to extract features from medical images [19]. The characteristics in the images are exploited to identify whether there is a lesion inside human body, and further deterioration of lesions inside human body, so as to increase the efficiency and accuracy of the diagnosis [20]. An AI-based medical robot is a major breakthrough in the current clinical practice. Surgical robot ‘The Da Vinci’ surgical robot system represents the most advanced surgical robots available to date. It can capture the surgical environment images through a 3-D vision system and complete some tasks which cannot be performed by hand [21].
3.2. Internet of Things (IoT) and Sensor NetworksThe Internet of Things (IOT) devices have the ability to decrease healthcare cost [22]. As the age pyramid is becoming an inverted triangle in society, the number of patients suffering from chronic diseases increases. The families of chronically-ill patients are burdened both financially and time wise with long-term care. Hence, IoT makes the automatic monitoring and tracking of patients as well as medical staff more feasible [23, 24].
Data from personal wearable devices, in particular, offers insights into population-level health monitoring by capturing trends including heart rate variability, sleep patterns, and indicators of activity levels or other physiological measures. These may serve as proxies for early signs of illnesses or changes in health statuses. However, the application of surveillance data from wearable devices raises important issues regarding privacy and consent that must be carefully considered to ensure proper use and protection of personal health information [25]. Environmental monitoring systems have been extended to monitor wastewater for viral RNA and air quality for surveillance of human respiratory health, as well as climate monitoring for the assessment of risks to environmental health. These systems are valuable complements to clinical surveillance systems and some may have early warning potential for a range of health threats [26].
3.3. Blockchain and other Distributed Ledger TechnologiesBlockchain technologies have been proposed as an enabling resource for data sharing and privacy challenges in digital epidemiology, enabling secure, transparent, and tamper-proof records of epidemiological data that upholds individual privacy through the use of cryptographic methods. Decentralized applications have been made for privacy-protection, contact tracing, electronic vaccination or health credentials. These could help overcome contested privacy risks that prevent mass adoption of centralized digital surveillance tools in pandemics [27].
Novel concepts, such as smart contracts also known as automated data sharing agreements, implemented using blockchain technology, may support more efficient and transparent data sharing in epidemiology while maintaining adequate governance oversight. Such systems could potentially facilitate automated requirements for data sharing agreements, privacy protection mechanisms, governance, and other nomenclature along with simultaneously reducing administrative burden and enhancing cooperation/efficiency of inter-watching surveillance [28].
The transformation of epidemiology has extended digital democratization as the global community from researchers to the public health workforce and NGOs throughout the world, can now engage in surveillance techniques and epidemiological research. Open-source platforms, cloud-based computing solutions, and user-friendly analytical tools have also minimised barriers of entry into epidemiological research and surveillance as a consequence helping to democratize their participation in public health [29, 30].
Citizen science projects have effectively captured public enthusiasm for participation in collecting and analysing the data, fostering emerging surveillance networks that do not compete with rather supplement the existing public health assets. Applications that permit self-report of symptoms, vaccination status, exposure history, and other health information represent real-time disease surveillance networks, complementing classical clinical reporting systems. Such participatory surveillance strategies have shown particular utility for monitoring seasonal diseases, vaccination coverage, and identifying health disparities in different populations [31]. Academic institutions have invested in web-based portals and educational curricula that facilitate access to specialized training in digital epidemiology methods for the global public health workforce. Large online open courses (MOOCs), webinars, and other educational technologies have extended the reach of training in data science, epidemiological methods, and digital health technologies for the development of capacity SOAs/epidemiological public health MDSSes. These are helpful to create skilled workforces with competencies necessary to implement and sustain digital surveillance systems [1, 32].
4.2. Integration of Different Data SourcesBased on the linkage of clinical information derived from electronic health records and public health surveillance, for instance, community-level exposure, interventions can lead to opportunities to understand the interface between individual health status and population level exposures and interventions. Such integration may help inform decisions about clinical care as well as population health strategies, building bridges between individual patient care and public health practice [33].
The power of ML algorithms lies in their ability to detect complex patterns and associations among multiple different data streams. These may provide insights on disease causation, transmission dynamics, and intervention effectiveness that are not evident through single-source surveillance [34]. Geospatial analysis has evolved with the application of satellite imagery, mobility data, environment monitoring systems, and ground level surveillance data. They allow tracking of environmental forces affecting risk of diseases, analysing the spread dynamics in a population by using mobility data and gauging specific geographic locations that need strategic interventions. Integration of remote sensing data with conventional epidemiological data continued to be particularly beneficial for surveillance of vector-borne diseases, environmental health risks, and the health impacts of climate change [35].
Table 2. Opportunity vs Challenges in Post-pandemic Digital Epidemiology
|
Opportunities |
Challenges |
|---|---|
|
Real-time Outbreak Detection |
Data Privacy and Governance |
|
Integration of Diverse Data Sources |
Interoperability |
|
AI/ML for Predictive Modelling |
Algorithmic Bias and Validity |
|
Democratized Data Access |
Digital Divides and Inequity |
|
Genomic + Digital Surveillance |
Limited Sequencing Capacity |
|
One Health Approaches |
Sustainability and Scalability |
Now, even when tremendous achievements of the technology and utility during the pandemic have been shown, privacy and security issues are still core challenges in digital epidemiology for public acceptance and system effectiveness. The sensitive nature of health data creates high privacy risks that need to be carefully weighed against public health benefits. Furthermore, consumer trust in digital surveillance systems was severely damaged by experiences during the pandemic, particularly with contact tracing applications and other surveillance technologies [5].
Privacy-preserving methods, such as differential privacy, homomorphic encryption, federated learning, and secure multi-party computation may serve as potential tools for enabling epidemiological analysis with protection of individual privacy. Secondly, such technologies have yet to be well-proven at the scale necessary for population-level surveillance, and trade-offs between preservation of privacy while preserving epidemiological utility need to be carefully considered and communicated openly with affected groups. The technical sophistication required to implement privacy-preserving mechanisms also serves as a bottleneck for uptake, especially in low-resource settings where technical capabilities may be limited [36].
Laws and other regulations have lagged technological capacities, with uncertainty about how to govern digital epidemiologic data contributing to public mistrust in surveillance capacity. The absence of transparent legal benches for emergency surveillance powers, data retention strategies, and algorithmic accountability has made the public sceptical with respect to digital surveillance systems [37]. A lack of open governance models, accountability mechanisms, and public involvement in the design and oversight of surveillance systems have further alienated the public from digital surveillance technologies [38].
5.2. Digital Divide and Equity of HealthThe existing health disparities are at a risk of being further exacerbated by digital surveillance systems that systematically exclude those with low levels of digital access, literacy or trust in technology. Rural, elderly, and low-income communities as well as various minority populations may be underrepresented in digital surveillance efforts. This leads towards blind spots that threaten the effectiveness of the system or maintain inequalities [39].
Owning a mobile phone and the ability to access internet are unequally distributed globally and within countries. Hence, this reduces the coverage and representativeness of smartphone-based surveillance systems. Even within the developed countries, there are huge differences in digital connectivity and use, especially among the elderly, low-income people, and minorities who may be at a higher risk of poor health. These differences imply that digital surveillance could systematically underreport the populations most in need of public health action [40]. Translation algorithms would not likely be able to grasp the nuance of local health concepts and terminology. Furthermore, cultural distinctions in health-seeking behaviours, privacy considerations, and technology use may impact participation in digital surveillance systems. The lack of culturally appropriate design and community engagement in surveillance system development may perpetuate the exclusion of marginalized communities [41].
5.3. Technical Issues and System LimitationsChallenges include the reliability, validity, and scalability of digital epidemiological systems. The paradox of integrating the data from heterogeneous sources with multiple collection procedures, quality criteria, and temporal scales presumes new challenges for data quality and analytic accuracy [42].
Algorithmic bias is a major concern, especially when ML models are trained on patterns lacking complete representation of the target populations. In the absence of diverse training data, poor validation practices, and a lack of attention to fairness in algorithmic development, surveillance systems may perform poorly for marginalized communities [43]. Challenges with signal-to-noise ratio are profound, especially in systems monitoring social media and the Internet in which true epidemiological signals must be separated from background noise and distractions, such as the media coverage of diseases, or other biases. Volume and velocity of digital data streams may surpass analytics ability, also the presence of disinformation, coordinated manipulation, and other types of data pollution could threaten surveillance validity [44]. Interoperability has been a challenge even after much progress has been made in standards development, especially at the semantic and organizational levels. Public bodies have their own definitions around data, how they collect it, how it can be analysed, and what to report — all of this adds complexity when trying to integrate into the wider industry. While technical standards for interoperability may help solve part of the problem, organizational, legal, and cultural barriers to data sharing frequently impede system integration effectively [45].
A lot of digital surveillance does experience problems with scalability of its systems in general, particularly when using manual data processing, complex analytical workflows or high-volume resource-intensive (often computational) analysis. A second issue is that funding charges for high-tech digital surveillance infrastructure could potentially be cost-prohibitive for some smaller public health agencies or resource-poor health jurisdictions, leading towards inequities in surveillance and response capacity [46].
5.4. Governance, Regulation, and AccountabilityThe accelerated development of digital surveillance technologies in response to the COVID-19 pandemic transcended the established regulatory guidelines, leaving unavoidable legal and ethical uncertainties that shall loom into the post-pandemic era. Concreate governance frameworks were called for, to determine data ownership rights, informed consent practices, algorithmic responsibility, transparency standards, and sunset processes of emergency surveillance measures [5, 47]. There is a need to modernize ethical oversight frameworks for digital epidemiological research and surveillance, including those involving big data analytics, AI applications, real-time surveillance systems, and population-level monitoring technologies. Conventional research ethics paradigms may not fully capture the distinct ethical challenges of population-level digital surveillance, such as collective consent, community benefit, algorithmic fairness, and long-term data use [48].
There is little public oversight of surveillance systems, including inadequate community input, grievance procedures, and system modification in response to user complaints [49]. The dynamic nature of digital technologies means that it requires continuous effort to keep knowledge and skills up-to-date, coupled to interdisciplinary competencies across a wide range of public health, data science, technology, ethics, and policy domains [50].
In the post-pandemic era, digital epidemiology requires strong policy frameworks that promote ethical, secure, and efficient use of digital health data. Policymakers should establish clear regulations for data privacy, cross-platform data sharing, and the responsible use of AI-driven surveillance tools. National health systems must integrate digital reporting systems, strengthen digital literacy among healthcare workers, and ensure equitable access to digital technologies across regions. Investing in robust digital infrastructure and transparent governance will enable faster outbreak detection, improved public health decision-making, and enhanced preparedness for future health emergencies.
6.1. Strengthening Public Health InfrastructureThe post-pandemic period calls for significant and long-term investment in public health infrastructure to support digital epidemiology capacities, namely the technological (systems) complemented by the human resources, institutional capacity, and governance mechanisms required for surveillance effectiveness. This investment should be based on the fact that digital surveillance systems need ongoing maintenance, updating, and evaluation to remain effective (and it not just a one-time technology deployment) [42]. Budgeting should take account of the entire cost lifecycle of digital surveillance (including start-up, overheads, system upgrading, workforce training; data storage and resources in terms IT processing power/support) as well as routine monitoring/review and improvements to systems. The attributing of digital surveillance capacity within routine public health practice involves reorganization, workflow adjustment, and staff development that necessitate dedicated efforts and resources [51].
6.2. RecommendationsWorkforce development efforts should concentrate on training public health professionals in digital epidemiologic methods, data science skills, as well as ethical and governance issues related to digital surveillance. Interdisciplinary training that cuts across public health, data science, technology, ethics, and policy is required for digital epidemiology. Public health organizations require capacity to manage the data, analyse it, integrate systems that apply statistical quality assurance standards, and evaluate trends. Engagement with academic institutions, technology entities, community-based organizations, and other partners would help secure enhanced expertise and resources with proper oversight and accountability [52].
6.3. Establishing Ethical and Legal NormsEthical standards for digital epidemiology should consider foundational issues around the proper extent of public health surveillance, when individual privacy can be justifiably constrained in service to collective goals, as well as how to prevent the abuse of surveillance powers [5]. Data governance within government data ownership, consent processes, retention policies, sharing agreements, and accountability for digital epidemiological systems should be covered in policies. Policies need to be made clear as to when the data is (a) encompassed by the principle of data minimization, (b) subject explicitly to purpose limitation requirements, or (c) require being removed or anonymized once surveillance purposes have been fulfilled [53].
Laws need to be updated to respond to emerging technologies, such as digital surveillance, cross-border data flows, algorithmic decision-making processes, the use of AI, and automated processing of personal health information. International coordination is necessary to design harmonized legal frameworks, allowing legitimate public health surveillance while respecting privacy rights and preventing misuse of surveillance capabilities [54]. Publication of surveillance system performance, value, and adherence to ethical standards can foster public confidence and promote ongoing improvement in surveillance [55].
6.4. Addressing Health Equity and Digital InclusionDigital surveillance systems should be developed with explicit consideration of the ethical principles associated with health equity to minimize exclusion of marginalized groups, and directly and actively promote equitable access to the benefits generated by surveillance. This demands purposeful design decisions that take into account the needs, capacities, preferences, and concerns of a wide range of populations about surveillance technologies [56].
Equity-focused impact assessments for proposed digital surveillance systems would help identify harmful impacts on specific subgroups of the population and allow deployment strategies to be adjusted to minimize unintended consequences. These evaluations should involve community stakeholders, including individuals from the affected populations, in evaluative activities. Furthermore, these evaluations should also consider how surveillance systems not only directly impact those who are under surveillance but also indirectly impact access to services, stigma, and other social determinants of health. These efforts should consider that digital surveillance systems are most effective when they are representative of all population groups, especially those with the highest risks of adverse health outcomes [57].
The future role of AI and ML for digital epidemiology would focus on accuracy, interpretability, fairness, and validity. Explainable AI can reduce concerns about opaque algorithms while maintaining predictive performance, allowing practitioners to validate recommendations and apply insights responsibly [29]. Federated learning can support joint model trainings across institutions sharing no central data, to mitigate privacy risks and provide more robust models. The development of NLP would facilitate the utilization of multiple text resources, including social media data, news, and clinical reports to enable the early extraction of epidemiological information. Computer vision and the mining of data from various sources would continue to revolutionize fields, such as medical imaging, environmental monitoring, and surveillance of behaviour while maintaining an emphasis on privacy and consent [58].
Better merging of digital surveillance with clinical systems is good for both public health and anti-inflation. Electronic health records (EHRs) would play a role in both population surveillance and clinical decision-making. Moreover, precision public health initiatives may also integrate genomic, environmental, and behavioural data into personalized recommendations. Continuous monitoring and early sickness detection are possible through wearable devices, though successful integration demands meticulous focus on data quality and privacy [59, 60]. CDSS based 6 detailed environmental data for inclusion in health surveillance, particularly where resources were lacking. Some of these other emerging areas may be more concerned with responses to specific challenges (e.g., environmental monitoring, health and transport planning, urban design) but there is a need to deal with the integration across systems and also engage communities [61].
Epidemic preparedness and global health security should also be digital. Prepared by early warning systems, genomic surveillance for rapid pathogen characterization as well as integrated modelling and simulation would provide real-time assessment of interventions and resources allocation [42]. Enhanced global surveillance systems would require investment in capacity, infrastructure, and governance to secure equitable participation and balance sovereignty with the imperatives of global health security.
Digital epidemiology has been making rapid progress as a result of the COVID-19 pandemic. Furthermore, it also enables real-time data collection from different sources for better public health surveillance. This development offers the possibility of responding rapidly to health risks and provides surveillance services to a larger proportion of the population. However, it also presents challenges including issues related to data privacy and the digital divide that could exacerbate health discrepancies. All governance systems and ethical considerations are important to make sure that surveillance systems are implemented fairly. In the future, efforts must concentrate on implementing AI into surveillance. This promotes international collaboration and establishes systems for health equity which, in turn, protects trust and accountability. Long-term digital epidemiology investments are essential to respond to new health threats.
The authors of the manuscript have no financial or non-financial conflict of interest in the subject matter or materials discussed in this manuscript.
Data availability is not applicable as no new datasets were generated or analysed.
No funding has been received for this research.