Managing Data in Screening Programs: Challenges and Solutions
DOI:
https://doi.org/10.20344/amp.23363Keywords:
Big Data, Data Management, Diagnostic Screening Programs, Public HealthAbstract
Population-based screening programs are vital public health initiatives that enable the early detection of diseases, significantly reducing both morbidity and healthcare costs. As these programs expand, the management of the extensive data they generate becomes increasingly complex, highlighting the need for structured digital solutions. This narrative review article presents a pragmatic framework aimed at clarifying big data analytics tailored to the needs and practices of healthcare professionals and administrators, focusing on effective integration into routine screening workflows. To achieve effective data utilization, the process begins with systematic archiving, which involves cloud-based storage solutions capable of securely maintaining various data formats in compliance with regulatory standards, thus ensuring long-term accessibility and continuity. Subsequent real-time processing of screening data facilitates rapid decision-making and patient management by providing immediate validation and analysis, essential for maintaining the responsiveness of screening services. Transformation processes play a critical role in converting diverse data inputs into standardized, consistent formats, enabling seamless communication and exchange among multiple healthcare systems. Integration further builds upon this standardization, merging data from different healthcare providers and diagnostic centers into centralized analytical platforms. This unified approach enables comprehensive patient monitoring and supports predictive modeling for early identification of at-risk individuals. Advanced analytics, particularly process mining and predictive techniques, reveal inefficiencies within screening workflows, highlighting areas needing improvement. These methods help healthcare managers to streamline operations, optimize resources, and enhance overall program performance. Real-time visualization tools provide administrators with continuous, practical insights into operational dynamics, despite existing challenges related to data governance and system interoperability. This article illustrates these concepts through concrete examples from the colorectal cancer screening program in Northern Portugal and the response to the COVID-19 pandemic. The colorectal cancer screening scenario demonstrates how structured data management significantly boosts operational efficiency and healthcare accessibility. Meanwhile, the COVID-19 experience highlights the importance of having flexible digital infrastructures capable of quickly adapting to unexpected crises. Finally, ongoing investments in digital infrastructure, professional training, and comprehensive data governance are crucial for sustaining these improvements. This review provides clear, actionable knowledge to support healthcare professionals in adopting big data analytics effectively within preventive healthcare programs.
Downloads
References
World Health Organization. Cancer prevention and control in the context of an integrated approach. Geneva: WHO; 2017.
World Health Organization Regional Office for Europe. Screening programmes: a short guide. Increase effectiveness, maximize benefits and minimize harm. Copenhagen: WHO/Europe; 2020.
Sweeney SM, Hamadeh HK, Abrams N, Adam SJ, Brenner S, Connors DE, et al. Case studies for overcoming challenges in using big data in cancer. Cancer Res. 2023;83:1183-90.
Mazzucco W, Stracci F, Gatta G, D’Argenzio A, Bidoli E, Carone S, et al. Cancer registries and data protection in the age of health digital interoperability in Europe: the perspective of the Italian Network of Cancer Registries (AIRTUM). Front Oncol. 2022;12:1052057.
American Association for Cancer Research. Screening for early detection. 2024. [cited 2025 Apr 24]. Available from: https://cancerprogressreport.aacr.org/progress/cpr24-contents/cpr24-screening-for-early-detection/.
World Health Organization. World Health Organization report on health and wellness. 2024. [cited 2025 Apr 24]. Available from: https://www.who.int/data/gho/data/major-themes/health-and-well-being.
Li L, Novillo-Ortiz D, Azzopardi-Muscat N, Kostkova P. Digital data sources and their impact on people’s health: a systematic review of systematic reviews. Front Public Health. 2021;9:645260.
Fragala MS, Shiffman D, Birse CE. Population health screenings for the prevention of chronic disease progression. Am J Manag Care. 2019;25:548-53.
Awrahman BJ, Aziz Fatah C, Hamaamin MY. A review of the role and challenges of big data in healthcare informatics and analytics. Comput Intell Neurosci. 2022;2022:5317760.
Akyüz K, Cano Abadía M, Goisauf M, Mayrhofer MT. Unlocking the potential of big data and AI in medicine: insights from biobanking. Front Med. 2024;11:1336588.
Batko K, Ślęzak A. The use of big data analytics in healthcare. J Big Data. 2022;9:3.
Aversano L, Iammarino M, Madau A, Pirlo G, Semeraro G. Process mining applications in healthcare: a systematic literature review. PeerJ Comput Sci. 2025;11:e2613.
Olawade DB, Wada OJ, David-Olawade AC, Kunonga E, Abaire O, Ling J. Using artificial intelligence to improve public health: a narrative review. Front Public Health. 2023;11:1196397.
Pastorino R, De Vito C, Migliara G, Glocker K, Binenbaum I, Ricciardi W, et al. Benefits and challenges of big data in healthcare: an overview of the European initiatives. Eur J Public Health. 2019;29:S23-7.
Mensah E, Goderre JL. Data sources and data tools: preparing for the open data ecosystem. Public Health Inf Inform Syst. 2020:105-27.
Borges do Nascimento IJ, Marcolino MS, Abdulazeem HM, Weerasekara I, Azzopardi-Muscat N, Gonçalves MA, et al. Impact of big data analytics on people’s health: overview of systematic reviews and recommendations for future studies. J Med Internet Res. 2021;23:e27275.
Alexiuk M, Elgubtan H, Tangri N. Clinical decision support tools in the electronic medical record. Kidney Int Rep. 2024;9:29-38.
Abdalkareem ZA, Amir A, Al-Betar MA, Ekhan P, Hammouri AI. Healthcare scheduling in optimization context: a review. Health Technol. 2021;11:445-69.
Pingili R. How workflow optimization improves patient care. Int J Res Comput Appl Inf Technol. 2024;7:1192-206.
Bernardi FA, Alves D, Crepaldi N, Yamada DB, Lima VC, Rijo R. Data quality in health research: integrative literature review. J Med Internet Res. 2023;25:e41446.
Cleverley WO, Cleverley JO, Parks AV. Essentials of health care finance. Massachusetts: Jones & Bartlett Learning; 2023.
Mendes D, Figueiredo D, Alves C, Penedones A, Costa B, Batel-Marques F. Impact of the COVID-19 pandemic on cancer screenings in Portugal. Cancer Epidemiol. 2024;88:102496.
Mehrtak M, SeyedAlinaghi S, MohsseniPour M, Noori T, Karimi A, Shamsabadi A, et al. Security challenges and solutions using healthcare cloud computing. J Med Life. 2021;14:448.
England PH. Retention, storage and disposal of mammograms and screening records. 2018. [cited 2025 Apr 24]. Available from: https://www.gov.uk/government/publications/breast-screening-manage-mammograms-and-records/retention-storage-and-disposal-of-mammograms-and-screening-records.
Schulz WL, Durant TJ, Torre Jr CJ, Hsiao AL, Krumholz HM. Agile health care analytics: enabling real-time disease surveillance with a computational health platform. J Med Internet Res. 2020;22:e18707.
Health Level Seven International. Introduction to HL7 standards. 2025. [cited 2025 Apr 24]. Available from: https://www.hl7.org/implement/standards.
Fast Healthcare Interoperability Resources. Welcome to FHIR®. 2025. [cited 2025 Apr 24]. Available from: https://build.fhir.org/.
Centers for Disease Control and Prevention. Implementing public health interoperability. 2025. [cited 2025 Apr 24]. Available from: https://www.cdc.gov/data-interoperability/php/public-health.
Williams E, Kienast M, Medawar E, Reinelt J, Merola A, Klopfenstein SA, et al. A standardized clinical data harmonization pipeline for scalable AI application deployment (FHIR-DHP): validation and usability study. JMIR Med Inform. 2023;11:e43847.
Van Der Aalst W. Data science in action. New Mexico: Springer; 2016.
Microsoft. Microsoft power bi. 2025. [cited 2025 Apr 24]. Available from: https://powerbi.microsoft.com/en-us.
Strategy. MicroStrategy. 2025. [cited 2025 Apr 24]. Available from: https://www.strategysoftware.com/.
Dalgaard P. R Development Core Team (2010). R: a language and environment for statistical computing. 2010. [cited 2025 Apr 24]. Available from: https://research.cbs.dk/en/publications/r-development-core-team-2010-r-a-language-and-environment-for-stats.
Rossum V. Python 3 reference manual. Scotts Valley: CreateSpace; 2009.
Union CotE. Council recommendation on strengthening prevention through early detection: a new EU approach on cancer screening replacing Council Recommendation 2003/878/EC. Off J Eur Union. 2022;100:1-10.
Santos MY, Ramos I. Business intelligence-da informação ao conhecimento. Lisboa: FCA–-Livros de Informática; 2017.
Brossard PY, Minvielle E, Sicotte C. The path from big data analytics capabilities to value in hospitals: a scoping review. BMC Health Serv Res. 2022;22:134.
Attah RU, Gil-Ozoudeh I, Garba B, Iwuanyanwu O. Leveraging geographic information systems and data analytics for enhanced public sector decision-making and urban planning. Magna Sci Adv Res Rev. 2024;12:152-63.
Munoz-Gama J, Martin N, Fernandez-Llatas C, Johnson OA, Sepúlveda M, Helm E, et al. Process mining for healthcare: characteristics and challenges. J Biomed Inform. 2022;127:103994.
Monteiro H, Oliveira M, Reis J, Tavares F. Optimizing colorectal screening in Portugal with process mining. Eur J Public Health. 2024;34:ckae144.2110.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Acta Médica Portuguesa

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
All the articles published in the AMP are open access and comply with the requirements of funding agencies or academic institutions. The AMP is governed by the terms of the Creative Commons ‘Attribution – Non-Commercial Use - (CC-BY-NC)’ license, regarding the use by third parties.
It is the author’s responsibility to obtain approval for the reproduction of figures, tables, etc. from other publications.
Upon acceptance of an article for publication, the authors will be asked to complete the ICMJE “Copyright Liability and Copyright Sharing Statement “(http://www.actamedicaportuguesa.com/info/AMP-NormasPublicacao.pdf) and the “Declaration of Potential Conflicts of Interest” (http:// www.icmje.org/conflicts-of-interest). An e-mail will be sent to the corresponding author to acknowledge receipt of the manuscript.
After publication, the authors are authorised to make their articles available in repositories of their institutions of origin, as long as they always mention where they were published and according to the Creative Commons license.

