Treffer: Optimization of testing protocols to screen for COVID-19: a multi-objective model.

Title:

Optimization of testing protocols to screen for COVID-19: a multi-objective model.

Authors:

Moheb-Alizadeh H; Graduate Program in Operations Research, North Carolina State University, Raleigh, NC, 27695, USA.; Nike Corp., Portland, OR, USA., Warsing DP Jr; Poole College of Management, North Carolina State University, Raleigh, NC, 27695, USA. don_warsing@ncsu.edu., Kouri RE; Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, 27695, USA., Taghiyeh S; Fitts Department of Industrial and Systems Engineering, North Carolina State University, Raleigh, NC, 27695, USA.; Discover Financial Services, Riverwoods, IL, 60015, USA., Handfield RB; Poole College of Management, North Carolina State University, Raleigh, NC, 27695, USA.

Source:

Health care management science [Health Care Manag Sci] 2024 Dec; Vol. 27 (4), pp. 580-603. Date of Electronic Publication: 2024 Oct 11.

Publication Type:

Journal Article

Language:

English

Journal Info:

Publisher: Baltzer Science Publishers Country of Publication: Netherlands NLM ID: 9815649 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1572-9389 (Electronic) Linking ISSN: 13869620 NLM ISO Abbreviation: Health Care Manag Sci Subsets: MEDLINE

Imprint Name(s):

Original Publication: Bussum, Netherlands : Baltzer Science Publishers, c1998-

MeSH Terms:

COVID-19*/epidemiology , COVID-19*/diagnosis , Algorithms* , COVID-19 Testing*/methods , SARS-CoV-2* , Mass Screening*/methods, Humans ; North Carolina/epidemiology ; Pandemics ; Epidemiological Models

References:

Mina MJ, Parker R, Larremore DB (2020) Rethinking COVID-19 test sensitivity — a strategy for containment. N Engl J Med 383. https://doi.org/10.1056/NEJMp2025631.
Mina MJ, Andersen KG (2021) COVID-19 testing: one size does not fit all. Science 371:126–127. https://www.science.org/doi/abs/10.1126/science.abe9187.
Kouri R, Warsing D, Singh N, Thomas B, Handfield RB (2022) An analytical tool for constructing and evaluating testing strategic for COVID-19. J Infect Dis Ther 10. https://www.omicsonline.org/open-access/an-analytic-tool-for-constructing-and-evaluating-testing-strategies-for-covid19.pdf.
PBS (2022) British government rushing COVID tests to schools so classes can reopen. https://www.pbs.org/newshour/world/british-government-rushing-covid-tests-to-schools-so-classes-can-reopen . [Posted 03 Jan 2022].
Larremore DB et al (2021) Test sensitivity is secondary to frequency and turnaround time for COVID-19 screening. Sci Adv 7. https://www.science.org/doi/abs/10.1126/sciadv.abd5393.
Gottlieb S (2021) Uncontrolled spread: Why COVID-19 crushed us and how we can defeat the next pandemic. Harper, New York.
Paltiel AD, Zheng A, Walensky RP (2020) Assessment of SARS-CoV-2 screening strategies to permit the safe reopening of college campuses in the United States. JAMA Netw Open 3.
Caetano MA, de Souza JF, Yoneyama T (2008) Optimal medication in HIV seropositive patient treatment using fuzzy cost function, pp 2227–2232. IEEE.
Persi PI, Gayathri P, Jaisankar N (2013) A fuzzy optimization technique for the prediction of coronary heart disease using decision tree. Int J Eng Technol 5:2506–2514.
Grosan C, Abraham A, Tigan S (2008) Multicriteria programming in medical diagnosis and treatments. Appl Soft Comput 8:1407–1417. (PMID: 10.1016/j.asoc.2007.10.014)
Denton BT, Kurt M, Shah ND, Bryant SC, Smith SA (2009) Optimizing the start time of statin therapy for patients with diabetes. Med Dec Making 29:351–367. (PMID: 10.1177/0272989X08329462)
Jingwei Z, Zujun M (2010) Fuzzy multi-objective location-routing-inventory problem in recycling infectious medical waste, pp 4069–4073. IEEE.
Piguillem F, Shi L (2022) Optimal Covid-19 quarantine and testing policies. Econ J 132:2534–2562. (PMID: 10.1093/ej/ueac026)
Hoertel N et al (2020) Facing the COVID-19 epidemic in NYC: a stochastic agent-based model of various intervention strategies. MedRxiv.
Charpentier A, Elie R, Laurière M, Tran VC (2020) COVID-19 pandemic control: balancing detection policy and lockdown intervention under ICU sustainability. Math Model Nat Phenom 15:57. (PMID: 10.1051/mmnp/2020045)
Carcione JM, Santos JE, Bagaini C, Ba J (2020) A simulation of a COVID-19 epidemic based on a deterministic SEIR model. Front Publ Health 230.
Kuniya T (2020) Prediction of the epidemic peak of coronavirus disease in Japan, 2020. J Clin Med 9:789. (PMID: 10.3390/jcm9030789)
Dandekar R, Barbastathis G (2020) Neural network aided quarantine control model estimation of global COVID-19 spread. arXiv:2004.02752.
Rezapour S, Mohammadi H, Samei ME (2020) SEIR epidemic model for covid-19 transmission by caputo derivative of fractional order. Adv Differ Equ 2020:1–19. (PMID: 10.1186/s13662-020-02952-y)
Anderez DO et al (2020) A COVID-19-based modified epidemiological model and technological approaches to help vulnerable individuals emerge from the lockdown in the UK. Sensors 20:4967. (PMID: 10.3390/s20174967)
Huang Y, Yang L, Dai H, Tian F, Chen K (2020) Epidemic situation and forecasting of COVID-19 in and outside China. https://www.researchgate.net/publication/339988990_Epidemic_situation_and_forecasting_of_COVID-19_in_and_outside_China.
Owusu-Mensah I, Akinyemi L, Oduro B, Iyiola OS (2020) A fractional order approach to modeling and simulations of the novel COVID-19. Adv Differ Equ 2020:1–21. (PMID: 10.1186/s13662-020-03141-7)
Kheirallah KA et al (2020) The effect of strict state measures on the epidemiologic curve of COVID-19 infection in the context of a developing country: a simulation from Jordan. Int J Environ Res Publ Health 17:6530. (PMID: 10.3390/ijerph17186530)
Kochańczyk M, Grabowski F, Lipniacki T (2020) Super-spreading events initiated the exponential growth phase of COVID-19 with R0 higher than initially estimated. R Soc Open Sci 7:200786. (PMID: 10.1098/rsos.200786)
Sun D, Duan L, Xiong J, Wang D (2020) Modeling and forecasting the spread tendency of the COVID-19 in china. Adv Differ Equ 2020:1–16. (PMID: 10.1186/s13662-020-02940-2)
Tang B et al (2020) Estimation of the transmission risk of the 2019-ncov and its implication for public health interventions. J Clin Med 9:462. (PMID: 10.3390/jcm9020462)
Wang H et al (2020) Phase-adjusted estimation of the number of coronavirus disease 2019 cases in Wuhan, China. Cell Discov 6:1–8. (PMID: 10.1038/s41421-020-0148-0)
Kuznetsov YA, Piccardi C (1994) Bifurcation analysis of periodic SEIR and SIR epidemic models. J Math Biol 32:109–121. (PMID: 10.1007/BF00163027)
He S, Banerjee S (2018) Epidemic outbreaks and its control using a fractional order model with seasonality and stochastic infection. Phys A Stat Mech Appl 501:408–417. (PMID: 10.1016/j.physa.2018.02.045)
Grassly NC et al (2020) Comparison of molecular testing strategies for COVID-19 control: a mathematical modelling study. Lancet Infect Dis 20:1381–1389. (PMID: 10.1016/S1473-3099(20)30630-7)
Abdin AF et al (2023) An optimization model for planning testing and control strategies to limit the spread of a pandemic-the case of COVID-19. Eur J Oper Res 304:308–324. (PMID: 10.1016/j.ejor.2021.10.062)
Farahani RZ, Ruiz R, Van Wassenhove LN (2023) Introduction to the special issue on the role of operational research in future epidemics/pandemics. Eur J Oper Res 304:1–8. (PMID: 10.1016/j.ejor.2022.07.019)
Zhang Y, Mayorga ME, Ivy J, Lich KH, Swann JL (2022) Modeling the impact of nonpharmaceutical interventions on COVID-19 transmission in K-12 schools. MDM Policy Pract 7.
Centers for Diseae Control and Prevention (2024) Preventing Spread of Respiratory Viruses When You’re Sick. https://www.cdc.gov/respiratory-viruses/prevention/precautions-when-sick.html.
Kirkpatrick S, Gelatt CD Jr, Vecchi MP (1983) Optimization by simulated annealing. Science 220:671–680. (PMID: 10.1126/science.220.4598.671)
Johnson DS, Aragon CR, McGeoch LA, Schevon C (1989) Optimization by simulated annealing: an experimental evaluation; Part I, Graph partitioning. Oper Res 37:865–892. (PMID: 10.1287/opre.37.6.865)
Johnson DS, Aragon CR, McGeoch LA, Schevon C (1991) Optimization by simulated annealing: an experimental evaluation; Part II, Graph coloring and number partitioning. Oper Res 39:378–406. (PMID: 10.1287/opre.39.3.378)
Hajek B (1988) Cooling schedules for optimal annealing. Math Oper Res 13:311–329. (PMID: 10.1287/moor.13.2.311)
Serafini P (1985) Mathematics of multi objective optimization, vol 289. Springer.
Serafini P (1994) in Simulated annealing for multi objective optimization problems, pp 283–292. Springer.
Engrand P (1998) A multi-objective optimization approach based on simulated annealing and its application to nuclear fuel management. https://www.osti.gov/etdeweb/biblio/316961.
Czyzżak P, Jaszkiewicz A (1998) Pareto simulated annealing—a metaheuristic technique for multiple-objective combinatorial optimization. J Multi-Criteria Decis Anal 7:34–47. (PMID: 10.1002/(SICI)1099-1360(199801)7:1<34::AID-MCDA161>3.0.CO;2-6)
Hapke M, Jaszkiewicz A, Słowiński R (2000) Pareto simulated annealing for fuzzy multi-objective combinatorial optimization. J Heuristics 6:329–345. (PMID: 10.1023/A:1009678314795)
Burke E, Silva JL (2002) Improving the performance of trajectory-based multiobjective optimisers by using relaxed dominance, vol 1, 203–207. Nanyang Technical University Orchid Country Club, Singapore.
Nam D, Park CH (2002) Pareto-based cost simulated annealing for multiobjective optimization, vol 2, pp 522–526. Citeseer.
Sánchez L, Villar JR (2008) Obtaining transparent models of chaotic systems with multi-objective simulated annealing algorithms. Inf Sci 178:952–970. (PMID: 10.1016/j.ins.2007.09.029)
Bandyopadhyay S, Saha S, Maulik U, Deb K (2008) A simulated annealing-based multiobjective optimization algorithm: Amosa. IEEE Trans Evol Comput 12:269–283. (PMID: 10.1109/TEVC.2007.900837)
Suman B, Kumar P (2006) A survey of simulated annealing as a tool for single and multiobjective optimization. J Oper Res Soc 57:1143–1160. (PMID: 10.1057/palgrave.jors.2602068)
Jain AK, Dubes RC (1988) Algorithms for clustering data. Prentice-Hall, Inc.
Global Epidemics Data (2021) Brown University School of Public Health. https://globalepidemics.org/testing-targets.
NEA (2020) 2020 NEA Policy Playbook for Congress and the Biden-Harris Administration. https://www.nea.org/resource-library/2020-nea-policy-playbook-congress-and-biden-harris-administration.
Taghiyeh S et al (2023) A multi-method approach to determining relative pandemic disease intensity across geographic regions. NC State University working paper.
Hwang C-L, Yoon K (1981) in Methods for multiple attribute decision making, pp 58–191. Springer.
Bradley PS, Bennett KP, Demiriz A (2000) Constrained k-means clustering. Microsoft Research, Redmond 20.
U.S. Census Bureau (2022) County population totals: 2020-2021. https://www.census.gov/data/tables/time-series/demo/popest/2020s-counties-total.html . Accessed 02 Jun 2022.
Gunderson A, Woskie L (2020) Understanding Predictions: What is R-Naught? https://globalhealth.harvard.edu/understanding-predictions-what-is-r-naught/.
Li T, Liu Y, Li M, Qian X, Dai SY (2020) Mask or no mask for COVID-19: a public health and market study. PLoS One 15:1–17. https://doi.org/10.1371/journal.pone.0237691.
Saxena SK et al (2022) Characterization of the novel SARS-CoV-2 Omicron (b.1.1.529) variant of concern and its global perspective. J Med Virol 94:1738–1744.
Fisher J (2022) Over \$100 million: adding up the cost of ‘free’ COVID testing in Wake County. https://www.wral.com/coronavirus/-100-million-adding-up-the-cost-of-free-covid-testing-in-wake-county/20102553/ . [Posted 27 Jan 2022].
Cutler DM, Summers LH (2020) The COVID-19 pandemic and the $16 trillion virus. JAMA 324:1495–1496. (PMID: 10.1001/jama.2020.19759)
Cutler DM (2022) The economic cost of long COVID: an update. https://scholar.harvard.edu/files/cutler/files/long_covid_update_7-22.pdf.
Jiang S, Ong Y-S, Zhang J, Feng L (2014) Consistencies and contradictions of performance metrics in multiobjective optimization. IEEE Trans Cybern 44:2391–2404. https://doi.org/10.1109/TCYB.2014.2307319.

Grant Information:

NIIMBL COVID-19-1.03 National Institute for Innovation in Manufacturing Biopharmaceuticals

Contributed Indexing:

Keywords: COVID-19; Multiple objective programming; OR in health services; SEIR model; Simulated annealing

Entry Date(s):

Date Created: 20241011 Date Completed: 20241213 Latest Revision: 20241213

Update Code:

20250114

DOI:

10.1007/s10729-024-09688-1

PMID:

39392585

Database:

MEDLINE

Weitere Informationen

In this paper we develop a new multi-objective simulated annealing (MOSA) algorithm to generate optimal testing protocols for infectious diseases, using the COVID-19 pandemic as our context. A SEIR (susceptible-exposed-infected-recovered) epidemiological model is embedded as the computational platform for our MOSA algorithm to optimize testing protocols for screening across three joint objectives: minimum cost of test materials, minimum total infections over the testing horizon, and minimum number of false negatives over the horizon. We demonstrate the application of this optimization tool to recommend screening protocols for K-12 school districts in the U.S. State of North Carolina. Our approach is scalable by population coverage and can be employed at the level of individual school districts or regional collections of districts, individual schools or collections of schools across a district, business sites, or nursing homes, among other congregate settings where individuals may be screened prior to gaining entry to the site. The algorithm can be solved two ways, generating either independent optimal protocols across individual testing locations, or a common protocol covering all locations in the collection of testing sites. Our findings can be used to inform policy decisions to guide the development of effective testing strategies for controlling the spread of COVID-19 or other pandemic diseases in a wide range of congregate settings across various geographic regions.
(© 2024. The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.)

Declarations. Conflicts of interest: The authors declare that they have no known competing personal relationships or other conflicts of interest that could have appeared to influence the work reported in this paper

AN0181643333;bse01dec.24;2024Dec17.04:08;v2.2.500

Optimization of testing protocols to screen for COVID-19: a multi-objective model: Optimization of testing protocols to screen for COVID-19: a multi-objective model: H. Moheb-Alizadeh et al

Keywords: OR in health services; Multiple objective programming; Simulated annealing; COVID-19; SEIR model

Copyright comment Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Highlights

Utilize SEIR epidemiological model as platform for multi-objective algorithm

Optimize testing protocols for COVID screening across three joint objectives

Method applies to pandemic diseases in congregate settings across regions

Simulated annealing solution for both independent and common optimal protocols

Informs policy decisions for effective testing strategies to control virus spread

Introduction

Of the many lessons that government healthcare policy-makers learned from the COVID-19 pandemic, two are particularly notable. The first is that a substantial increase in broad viral testing across any population has the potential to mitigate the impact and re-emergence of the pandemic. The second is that any given nation's response to the pandemic will be more effective when coordinated by a small set of centralized organizations providing directives to a wide array of local agencies and public entities. COVID taught us that viral testing efforts by the public health community are best focused on screening — i.e., identifying infected individuals who are pre-symptomatic or asymptomatic including those people who do not have known, suspected or reported exposure to the virus [[1]]. Identifying and quarantining infected individuals by routine testing is a highly effective way to ensure that congregate settings (e.g., schools, work facilities, hospitals) are kept open and operational, as in [[3]] and [[4]]. The most effective tests to deploy in such instances are those that detect viral antigens, as they are inexpensive, easily administered and produce rapid results that are specific and sufficiently sensitive when routinely deployed [[5]]. We note that the approach we take in this paper differs from the one that was employed in the early stages of the COVID-19 pandemic in the U.S., when testing focused only on determining whether symptomatic individuals were infected with the SARS-CoV2 virus. As [[1]] and [[2]], among others, have pointed out, a symptoms-based (i.e., diagnostic) approach to testing is not as effective in containing the spread of a novel virus, as compared to screening all individuals in certain congregate settings, since symptomatic individuals may have already exposed many others to the virus between the point at which they were infected and the point at which they sought diagnostic testing.

By the summer of 2021, a variety of rapid and inexpensive, antigen-based COVID tests were available. One of the biggest problems was the lack of a single, coordinating public/private entity responsible not only for acquiring and distributing these tests, but also for educating the medical community and the public on their use. In addition, no standard methodology emerged for collecting and analyzing the resultant test data to render test distribution more effective. Although test kits were distributed initially in the U.S. by the federal government, it was left up to the state and local departments of health to determine how to allocate the limited supply of testing resources to the most hard-hit communities. These health departments were often at a loss as to how to proceed, given that they had no prior experience with this type of widespread pandemic scenario. This inability to quickly and efficiently test the population obscured the true scope of the pandemic, prevented an effective coordinated response, significantly impacted economic activity, and resulted in tremendous loss of life. This scenario was described in detail by Scott Gottlieb, who served as U.S. Food and Drug Administration Commissioner prior to the start of the COVID pandemic [[6]].

In this paper, we seek to address these problems by developing and demonstrating the use of an optimization tool that captures key tradeoffs related to the deployment of screening tests for COVID-19. Specifically, we present a multi-objective simulated annealing (MOSA) algorithm for protocol optimization, based on three joint objectives: (1) minimizing the cost of procuring test materials, (2) minimizing the total number of infections that ultimately develop over the testing horizon, and (3) minimizing the number of false negatives, given concerns related to test sensitivity for certain kinds of test kits. In addition, our MOSA code is flexible with regard to incorporating various constraints, such as limiting the maximum number of individuals in isolation (i.e., positive test outcomes) across a given testing horizon. Utilizing a SEIR (Susceptible-Exposed-Infected-Recovered) model similarly employed by Paltiel et al. [[7]] as the computational engine embedded in our algorithm, we recommend a selective set of possible screening protocols for a given population in a congregate setting. For instance, a common population of concern — and one that we explore in a computational example — might be all students and staff in K-12 schools in a given U.S. county, along the lines of Kouri et al. [[3]]. Our approach in this context can scale to various levels of population coverage, such that it can be employed at the level of, say, the school population across multiple school districts in an individual county, or for a regional collection of counties that covers all constituent schools in that region. In a multi-county setting, the optimization of the testing protocol can be solved in two ways: generating either independent optimal protocols across the individual counties, or constrained to generate a single optimal protocol covering all counties in the region. We describe the advantages of each solution method in our description of the algorithm and in our example application.

In the sections that follow, we begin by reviewing the relevant literature covering efficient allocation of medical supplies, the use of epidemiology models in establishing medical materials requirements, and the literature on relevant multi-objective optimization tools and methods. We then present our model and illustrate its application in the State of North Carolina. We conclude with a discussion of the results and how this approach may be extended in the future.

Literature review

Optimization techniques have been widely used to solve medical and health care delivery problems and span a number of healthcare policy issues. Recent work in this area includes [[8]], who developed a fuzzy cost function to find the optimal medication dosage for HIV seropositive patient treatment. A fuzzy set approach was also employed by Persi et al. [[9]], who use a decision tree fuzzy optimization method to predict the possible incidence of coronary heart disease. In a setting that involves both diagnosis and treatment of a variety of diseases, Grosan et al. [[10]] presented a multi-criteria programming model to optimize patient outcomes. Similarly, Denton et al. [[11]] developed an optimization model to find the optimal start time for statin therapy of diabetic patients. This type of work also extends to problems in areas supporting medical optimization, such as the routing inventory problem in recycling infectious medical waste investigated by Jingwei and Zujun [[12]] using fuzzy multi-objective approach.

Researchers in the field of epidemiology make wide use of optimization approaches. A welfare maximization problem from the perspective of public health planning was used by [[13]] to explore optimal responses to containing the spread of COVID-19. Measures such as mandatory quarantines were concluded to be optimal for different curvatures of the welfare function. These authors also discovered that testing can be a close substitute for quarantine, which can reduce the number of indiscriminate quarantines. A microsimulation model was used by Hoertel et al. [[14]] to evaluate the potential effect of quarantine duration, quarantine lifting policies, post-quarantine screening and use of an effective COVID-19 treatment. Similarly, a tractable quantitative analysis was performed by [[15]] to measure the effect of lockdowns and detection interventions. Using an extended Susceptible, Infectious, and Recovered compartment model (SIR) for COVID-19, the authors identified specific interventions for successive phases as an optimal lockdown policy.

SEIR is one of the prevalent mathematical algorithms used in epidemiology to describe the transmission of an epidemic disease, compartmentalizing the population of interest into categories of susceptible, exposed, infected, and recovered about the infectious disease of interest. Carcione et al. [[16]] present an excellent example of the use of the SEIR model. A more targeted SEIR model was applied by Kuniya [[17]] to predict the peak of the COVID-19 epidemic in Japan. The authors calculated the peak value using several infection rates and visualized the effect of an intervention on the peak value. However, their proposed model was limited in terms of its application only to specific regions of the country. In a broader study, Dandekar and Barbastathis [[18]] analyzed data from the U.S., China, South Korea, and Italy and applied a SEIR model using neural networks to calculate the effectiveness of quarantine policies. These authors also present a comparison between the original SEIR model and the neural network counterpart. In an extensive, multi-method approach, Rezapour et. al. [[19]] propose a SEIR model for the COVID-19 pandemic and use Caputo fractional derivatives and fractional Euler methods to approximate the steady-state solution for the system. Additionally, these authors propose a simulation model to predict the transmission of COVID-19. Data from United Kingdom was used to investigate the relationship between the mortality and number of vulnerable patients using a modified SEIR model [[20]]. A time-varying SEIR model was used by Huang et al. [[21]] to predict the severeness of COVID-19 pandemic. A new testing rate parameter was proposed by Owusu-Mensah et al. [[22]] for a fractional SEIR model. Their results demonstrated the effectiveness of tracing and testing in reducing the spread of COVID-19.

In other research utilizing SEIR, a modified model was proposed by [[23]] to evaluate the effect of non-pharmaceutical interventions during COVID-19 pandemic. They divided the infected segment of the model into three different sub-segments, namely pre-symptomatic, mild symptoms, and moderate-to-severe symptoms. The authors used a simulation framework to assess the performance of their model. SEIR models are also used to estimate the basic reproduction rate, <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>R</mi><mn>0</mn></msub></math> , of a given infectious disease — e.g., Kochańczyk et al. [[24]]. Based on the available data from China, [[25]] developed a forecasting framework to predict the spread of COVID-19 and introduced an improved SEIR model. They then extended the SEIR model to incorporate two additional parameters (deaths and number of cured patients), which significantly improved the prediction accuracy of the model. Using quarantine, isolation and treatment, Tang et al. [[26]] introduced a general SEIR epidemiological model. In other research, Wang et al. [[27]] applied a phase-adjustment to a SEIR model to estimate the number of COVID-19 cases in Wuhan, China. The bifurcations of the periodic SEIR solutions with sinusoidally varying contact rate was investigated by Kuznetsov and Piccardi [[28]]. He and Banerjee [[29]] also studied modified SEIR models by applying a hard limited controller to a model was designed to epidemic spread. Kouri et al. [[3]] present a mathematical tool to evaluate alternative testing cadences for COVID-19, which is applicable to well-contained settings, such as long-term care facilities and public-school systems. In other research, a compartmental epidemic model was adapted by Paltiel et al. [[7]] to capture the essential features of the situation facing university decision-makers related to COVID-19. Grassly et al. [[30]] utilize infectiousness and PCR test sensitivity over time to develop a mathematical model for COVID-19 transmission. The authors used percentage reduction in the value of the transmission rate R as the key metric to evaluate the performance of each testing strategy.

Recently, [[31]], in a special journal issue focused on the role of operations research in future epidemics and pandemics [[32]], proposed a non-linear programming model to address the non-linearity of the disease progression and showed that increased levels of testing resulted in significant reductions of both hospitalizations and deaths. Finally, Zhang et al. [[33]] have shown using the SEIR model that a combination of PCR testing and rapid antigen testing can significantly reduce disease occurrence. While their work shows that the use of masks plus social distancing reduced infection rates alone, those practices in combination with testing suppressed infection rates significantly more.

Graph: Fig. 1 Closed testing network, adapted from [[7]]

Table 1 SEIR system base parameters

<table frame="hsides" rules="groups"><tbody><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><msub xmlns=""><mi>n</mi><mrow><mi mathvariant="italic">cyc</mi></mrow></msub></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq2.gif" /></td><td align="left">Number of testing cycles per day</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><msub xmlns=""><mi>n</mi><mrow><mi mathvariant="italic">btw</mi></mrow></msub></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq3.gif" /></td><td align="left">Number of cycles between tests</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><msub xmlns=""><mi>t</mi><mrow><mi mathvariant="italic">inc</mi></mrow></msub></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq4.gif" /></td><td align="left">Number of days for virus to incubate to infectious state</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><msub xmlns=""><mi>t</mi><mrow><mi mathvariant="italic">rec</mi></mrow></msub></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq5.gif" /></td><td align="left">Number of days to recover from infection</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><msub xmlns=""><mi>t</mi><mrow><mi mathvariant="italic">FPr</mi></mrow></msub></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq6.gif" /></td><td align="left">Number of days for false positive individual to return to testing pool (<italic>see note)</italic></td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><msub xmlns=""><mi>p</mi><mrow><mi mathvariant="italic">AS</mi></mrow></msub></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq7.gif" /></td><td align="left">Fraction of asymptomatic individuals that advance to the symptomatic state</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><msub xmlns=""><mi>p</mi><mrow><mi mathvariant="italic">SD</mi></mrow></msub></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq8.gif" /></td><td align="left">Fraction of symptomatic cases resulting in fatality</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><msub xmlns=""><mi>S</mi><mi>e</mi></msub></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq9.gif" /></td><td align="left">Test sensitivity</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><msub xmlns=""><mi>S</mi><mi>p</mi></msub></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq10.gif" /></td><td align="left">Test specificity</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><msub xmlns=""><mi>R</mi><mn>0</mn></msub></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq11.gif" /></td><td align="left">Basic reproductive number of the virus</td></tr></tbody></table>

Note: <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>t</mi><mrow><mi mathvariant="italic">FPr</mi></mrow></msub><mo>≥</mo><mn>1</mn></mrow></math> based on the structure of the SEIR model computations in [[7]]

There is clearly a broad body of research work that addresses public health optimization modeling, the use of epidemiological modeling like SEIR, and the development and deployment of various interventions to control the spread of epidemic diseases. Examining the literature reveals, however, that none of the previous studies offers a concrete optimization method to efficiently design specific testing protocols for infectious diseases, such as COVID-19. To the best of our knowledge, our work is the first to propose a multi-objective optimization solution for infectious disease testing protocol design.

Testing model

In this section, we describe the elements of our COVID testing model that serves as the basis for defining the screening test protocols that are deployed in congregate setting locations that span a given geographic region. Our screening model is adapted from the SEIR model employed by Paltiel et al. [[7]] in their study of proposed policies for opening college campuses for the Fall 2020 semester during the COVID-19 pandemic in the U.S. The model is shown in Fig. 1. The transitions between the system states that appear in Fig. 1 occur between successive testing cycles. Accordingly, in our modeling we denote the index for testing cycles by c, such that transitions take place as cycles transpire, from <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>c</mi><mo>=</mo><mn>1</mn><mo>,</mo><mo>...</mo><mo>,</mo><mi>C</mi></mrow></math> .

Table 2 SEIR system computed parameters

<table frame="hsides" rules="groups"><tbody><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><mrow xmlns=""><mi>τ</mi><mo>=</mo><mn>1</mn><mo stretchy="false">/</mo><msub><mi>n</mi><mrow><mi mathvariant="italic">btw</mi></mrow></msub></mrow></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq14.gif" /></td><td align="left">Fraction of testing pool that is tested in each cycle</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><mrow xmlns=""><mi>β</mi><mo>=</mo><msub><mi>R</mi><mn>0</mn></msub><mo>·</mo><mrow><mo stretchy="false">(</mo><mi>ρ</mi><mo>+</mo><mi>σ</mi><mo stretchy="false">)</mo></mrow></mrow></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq15.gif" /></td><td align="left">Fraction of susceptible individuals contacted and infected by infectious individuals in any given cycle</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><mrow xmlns=""><mi>θ</mi><mo>=</mo><mn>1</mn><mo stretchy="false">/</mo><mfenced close=")" open="("><msub><mi>t</mi><mrow><mi mathvariant="italic">inc</mi></mrow></msub><mo>·</mo><msub><mi>n</mi><mrow><mi mathvariant="italic">cyc</mi></mrow></msub></mfenced></mrow></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq16.gif" /></td><td align="left">Fraction of exposed individuals that advance to infectious stage in any given cycle</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><mrow xmlns=""><mi>ρ</mi><mo>=</mo><mn>1</mn><mo stretchy="false">/</mo><mfenced close=")" open="("><msub><mi>t</mi><mrow><mi mathvariant="italic">rec</mi></mrow></msub><mo>·</mo><msub><mi>n</mi><mrow><mi mathvariant="italic">cyc</mi></mrow></msub></mfenced></mrow></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq17.gif" /></td><td align="left">Fraction of infected individuals that recover and are removed from testing in any given cycle</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><mrow xmlns=""><mi>σ</mi><mo>=</mo><mi>ρ</mi><mo>·</mo><msub><mi>p</mi><mrow><mi mathvariant="italic">AS</mi></mrow></msub><mo stretchy="false">/</mo><mfenced close=")" open="("><mn>1</mn><mo>-</mo><msub><mi>p</mi><mrow><mi mathvariant="italic">AS</mi></mrow></msub></mfenced></mrow></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq18.gif" /></td><td align="left">Fraction of asymptomatic individuals that become symptomatic in any given cycle</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><mrow xmlns=""><mi>μ</mi><mo>=</mo><mn>1</mn><mo stretchy="false">/</mo><mfenced close=")" open="("><msub><mi>t</mi><mrow><mi mathvariant="italic">FPr</mi></mrow></msub><mo>·</mo><msub><mi>n</mi><mrow><mi mathvariant="italic">cyc</mi></mrow></msub></mfenced></mrow></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq19.gif" /></td><td align="left">Fraction of false positive individuals who are identified and return to the testing pool in any given cycle</td></tr><tr><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><mrow xmlns=""><mi>δ</mi><mo>=</mo><mi>ρ</mi><mo>·</mo><msub><mi>p</mi><mrow><mi mathvariant="italic">SD</mi></mrow></msub><mo stretchy="false">/</mo><mfenced close=")" open="("><mn>1</mn><mo>-</mo><msub><mi>p</mi><mrow><mi mathvariant="italic">SD</mi></mrow></msub></mfenced></mrow></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq20.gif" /></td><td align="left">Fraction of symptomatic individuals who die from the disease in any given cycle</td></tr></tbody></table>

Testing demand model

In Tables 1 and 2, we define all of the parameters needed to compute the flows between the SEIR system states from period to period in Fig. 1. Since the SEIR system transitions take place over testing cycles, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>c</mi><mo>=</mo><mn>1</mn><mo>,</mo><mo>...</mo><mi>C</mi></mrow></math> , we denote the population size of this closed system at any given test cycle c by

1 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><mi>N</mi><mo>=</mo><msub><mi>U</mi><mi>c</mi></msub><mo>+</mo><msub><mi>E</mi><mi>c</mi></msub><mo>+</mo><msub><mi>A</mi><mi>c</mi></msub><mo>+</mo><msub><mi>P</mi><mi>c</mi></msub><mo>+</mo><msub><mrow><mi mathvariant="italic">TP</mi></mrow><mi>c</mi></msub><mo>+</mo><msub><mrow><mi mathvariant="italic">FP</mi></mrow><mi>c</mi></msub><mo>+</mo><msub><mi>V</mi><mi>c</mi></msub><mo>+</mo><msub><mi>D</mi><mi>c</mi></msub><mo>,</mo></mrow></mtd></mtr></mtable></mrow></math>

Graph

which is constant across the testing horizon, i.e., for all <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>c</mi><mo>=</mo><mn>1</mn><mo>,</mo><mn>2</mn><mo>,</mo><mo>...</mo><mo>,</mo><mi>C</mi></mrow></math> . As one can observe from Fig. 1, individuals in this system flow between the system states that are represented by the boxes in the figure and labeled as sets. In the initial state, only sets U and A are populated, starting with values <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>U</mi><mn>0</mn></msub></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>A</mi><mn>0</mn></msub></math> , such that <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>N</mi><mo>=</mo><msub><mi>U</mi><mn>0</mn></msub><mo>+</mo><msub><mi>A</mi><mn>0</mn></msub></mrow></math> . From that point forward, transitions between states are governed by the various parameters that appear on the arrows connecting the states in Fig. 1. We also note that, as in Paltiel et al. [[7]], these flows are treated as deterministic and not probabilistic flows — effectively as expected-value transitions — and therefore, many of the flow-related, computed parameters below are defined as "fractions" and not probabilities. Below, we describe a subset of the state transitions represented by the flows across the arrows in Fig. 1

Table 3 Relation of test performance parameters and test results

<table frame="hsides" rules="groups"><thead><tr><th align="left" /><th align="left">Test negative</th><th align="left">Test positive</th></tr></thead><tbody><tr><td align="left">Patient negative</td><td align="left">True negative</td><td align="left">False positive</td></tr><tr><td align="left" /><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><mrow xmlns=""><mo>Pr</mo><mo>=</mo><msub><mi>S</mi><mi>p</mi></msub></mrow></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq26.gif" /></td><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><mrow xmlns=""><mo>Pr</mo><mo>=</mo><mn>1</mn><mo>-</mo><msub><mi>S</mi><mi>p</mi></msub></mrow></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq27.gif" /></td></tr><tr><td align="left">Patient positive</td><td align="left">False negative</td><td align="left">True positive</td></tr><tr><td align="left" /><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><mrow xmlns=""><mo>Pr</mo><mo>=</mo><mn>1</mn><mo>-</mo><msub><mi>S</mi><mi>e</mi></msub></mrow></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq28.gif" /></td><td align="left"><math xmlns="http://www.w3.org/1998/Math/MathML"><mrow xmlns=""><mo>Pr</mo><mo>=</mo><msub><mi>S</mi><mi>e</mi></msub></mrow></math><inline-graphic mime-subtype="GIF" href="10729_2024_9688_Article_IEq29.gif" /></td></tr></tbody></table>

The computations detailed in Appendix A incorporate two important parameters used in epidemiology to describe the accuracy of tests for the presence or absence of disease in a patient specimen. Those parameters are test sensitivity and specificity, which we denote by <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>S</mi><mi>e</mi></msub></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>S</mi><mi>p</mi></msub></math> , respectively. They are defined as follows:

Sensitivity is the ability of the test to correctly designate a specimen infected with the disease — i.e., a positive specimen — as a positive test result.

Specificity is the ability of the test to correctly designate a specimen not infected with the disease — i.e., a negative specimen — as a negative test result.

Mathematically, those parameters relate test specimens to test outcomes as described in Table 3.

To aid in understanding the flows along the arrows in the system described in Fig. 1, we walk through the details related to some of the key initial transitions in the system, particularly the transitions of individuals into and out of states U and A, as those are the key starting-point transitions in the model. Thus, we note that, over time, individuals who began in states U and A can flow out of or into these states via several mechanisms:

Exposure to the virus from within the system causes a fraction of individuals, governed by parameter <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>β</mi></math> , to flow from U to E. (Recall that detailed computations of the period-to-period flow quantities are provided in Appendix A.)

"Exogenous shocks" of viral exposure from outside the system can cause previously uninfected individuals to carry the infection into the system, as represented by the dashed flow from U to E, governed by parameter X, which is specified, for relative simplicity, as a deterministic number of infections per unit time.

Some uninfected individuals may falsely test positive, moving temporarily to the Quarantine pool — specifically, in the FP set — at a rate governed by the specificity of the test being used for screening.

Uninfected individuals who falsely tested positive return from FP to U after administration of a confirmatory test, governed by parameter <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>μ</mi></math> . Individuals in TP whose confirmatory test is positive remain in set TP, in physical isolation, until they either develop symptoms or recover after an asymptomatic infection.

For details on the computations that govern the entire set of cycle-to-cycle transitions, we refer the reader to Appendix A. We observe here, however, that based on the flows shown in Fig. 1, states V and D are absorbing states. This means that given a sufficiently long period of time, all individuals in the system will end up as either recovered from an infection or deceased as the result of an infection.

Demand for test kits used in screening for the virus is defined as the number of screening tests administered in testing cycle c. This is determined by the sum of the number of individuals in states U, A, and E, as reflected in Fig. 1. In the computational modeling from [[7]], it is assumed that tests are administered at the beginning of each cycle according to the updates to all states in the system from the prior cycle, and therefore,

2 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><mtext>Screening tests in cycle</mtext><mspace width="0.333333em" /><mi>c</mi><mo>=</mo><msub><mi>U</mi><mrow><mi>c</mi><mo>-</mo><mn>1</mn></mrow></msub><mo>+</mo><msub><mi>A</mi><mrow><mi>c</mi><mo>-</mo><mn>1</mn></mrow></msub><mo>+</mo><msub><mi>E</mi><mrow><mi>c</mi><mo>-</mo><mn>1</mn></mrow></msub><mo>.</mo></mrow></mtd></mtr></mtable></mrow></math>

Graph

As discussed above, the number of confirmatory tests to be administered is determined by the the number of individuals in states FP and TP, which is defined as:

3 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><mtext>Confirmatory tests in cycle</mtext><mspace width="0.333333em" /><mi>c</mi><mo>=</mo><mi>F</mi><msub><mi>P</mi><mi>c</mi></msub><mo>+</mo><mi>T</mi><msub><mi>P</mi><mi>c</mi></msub><mo>.</mo></mrow></mtd></mtr></mtable></mrow></math>

Graph

Consistent with SEIR models in the field of epidemiology [[26], [27]–[28]], there is no "within-system" testing of symptomatic individuals — i.e., individuals in set P are not included in the testing, as they are assumed to be in isolation. Instead, we assume that symptomatic individuals will behave responsibly, not attempt to enter the congregate setting, and seek a diagnostic test outside the bounds of this testing system. Attempting to capture "bad behavior" would require additional assumptions regarding the fraction of symptomatic individuals that attempt to enter the congregate setting, and we have chosen to avoid such complications in our model. We note, however, that, in Section 5, we conduct a robustness analysis on the exogenous infections parameter in the model, which can be viewed a proxy for various "bad behaviors" that expose the congregate setting to more infections than it would otherwise generate endogenously. Moreover, as of this writing, the guidelines of the U.S. Centers for Disease Control (CDC) [[34]] preventing the spread of respiratory viruses, including SARS-CoV-2, are dramatically relaxed from earlier guidelines, and could result in honest mistakes by parents (e.g., lingering symptoms or an incorrectly executed home test) which lead to similar "leaks" in the system that generate additional infections to be captured in the screening. Finally, our analysis assumes no confirmatory antibody testing is required for recovered individuals (set V), consistent with public health practice throughout the course of the COVID-19 pandemic in the U.S. [[27], [30]]. Finally, this particular model assumes that infected individuals are truly "recovered," and do not require further testing. This was the assumed behavior of COVID-19 in the earlier period of the pandemic and it is the basis of the computations in this paper.

Decision variables

We denote the length of the testing horizon by T, which may be measured in a different time scale than the testing cycles, denoted earlier by c. This simply requires a conversion factor that sets the number of testing cycles, for example, per day or per week. For a given screening horizon length, T, the variables that define a screening testing protocol are as follows:

<bold> The _B__I_initial screening frequency_i_</bold>, denoted as <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>1</mn></msub></math> . This is the screening frequency in the initial portion of the testing horizon, measured as the number of cycles that transpire between successive rounds of screening. For example, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>F</mi><mn>1</mn></msub><mo>=</mo><mn>1</mn></mrow></math> indicates that screening tests are run a rate of one testing cycle per day (or every day), while <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>F</mi><mn>1</mn></msub><mo>=</mo><mn>10</mn></mrow></math> indicates screening every other week in a 5-day "business" or "school" week (or every tenth day).

<bold> The _B__I_secondary screening frequency_i_</bold>, denoted as <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>2</mn></msub></math> , which is the screening frequency in the secondary portion of the testing horizon.

<bold> The _B__I_initial horizon length_i_</bold>, denoted as <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>T</mi><mn>1</mn></msub></math> , which is the length of the overall horizon devoted to screening frequency <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>1</mn></msub></math> , meaning that the length of the secondary screening horizon is <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>T</mi><mn>2</mn></msub><mo>=</mo><mi>T</mi><mo>-</mo><msub><mi>T</mi><mn>1</mn></msub></mrow></math> .

There are many reasons to split the testing horizon into two portions. Two of the main reasons are as follows. First, per the nature of epidemiological system, infections tend to "bloom" in the early part of the overall testing horizon. It takes some time for the testing and isolation procedure to succeed at identifying infections and isolating people to reduce the rate of spread for the remainder of the horizon (provided testing occurs at a sufficient frequency to achieve this outcome). In that case, testing can ease off after some initial period. Second, a less frequent rate of testing means fewer tests being administered which results in a lower testing cost. Thus, splitting the testing horizon provides flexibility to public officials in charge of testing to create overall testing protocols that are most effective in terms of the trade-off between infections and testing costs.

Objectives and optimization structure

As we discuss above, two important criteria for assessing any screening protocol across a given testing horizon are the number of resulting infections and the cost of testing. We include an additional criterion in our multi-objective approach, namely the number of false negatives. We include this objective in order to address criticism of antigen-based COVID-19 tests (also known as "point-of-care" tests or "at-home" tests in the U.S.) particularly early in the pandemic in the U.S. Antigen tests have lower sensitivity than lab-based testing methods (e.g., PCR tests), but also have the advantages of being not only dramatically less expensive but also, as their name suggests, rapid to administer and generate results within 30 minutes. These issues are discussed in [[1]] and [[2]]. Since lower sensitivity will tend to generate a higher number of false negatives, we include this as the third of our objectives.

Therefore, the objectives for our multi-objective optimization problem are defined as follows:

<bold> The _B__I_number of infections_i_</bold> across the testing horizon that result from screening protocol <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></math> , denoted as <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>I</mi><msub><mi>N</mi><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></msub></mrow></math> .

<bold> The _B__I_number of false negative tests_i_</bold> across the testing horizon that result from screening protocol <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></math> , denoted as <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>F</mi><msub><mi>N</mi><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></msub></mrow></math> .

<bold> The _B__I_total cost of testing_i_</bold> across the horizon for screening protocol <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></math> , denoted as <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>T</mi><msub><mi>C</mi><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></msub></mrow></math> .

Computationally, total cost, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>T</mi><msub><mi>C</mi><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></msub></mrow></math> , is the easiest of these to compute, as it is given simply by the product of the number of tests administered and the cost per test. In contrast, computing the other objectives is more complicated, but still straightforward. Specifically, the expression for the number of infections in cycle <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></math> (see Appendix A) is

4 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><mi>I</mi><msub><mi>N</mi><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>=</mo><munder><munder accentunder="true"><mrow><mi>β</mi><mo>·</mo><mfrac><mrow><msub><mi>A</mi><mi>c</mi></msub><mo>·</mo><msub><mi>U</mi><mi>c</mi></msub></mrow><mrow><msub><mi>U</mi><mi>c</mi></msub><mo>+</mo><msub><mi>A</mi><mi>c</mi></msub><mo>+</mo><msub><mi>E</mi><mi>c</mi></msub></mrow></mfrac></mrow><mo>⏟</mo></munder><mrow><mtext>New infections</mtext></mrow></munder><mo>+</mo><munder><munder accentunder="true"><mrow><mi>X</mi><mo>·</mo><msub><mi>I</mi><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>,</mo></mrow><mo>⏟</mo></munder><mrow><mtext>Exogenous shocks</mtext></mrow></munder></mrow></mtd></mtr></mtable></mrow></math>

Graph

where the decision triple <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></math> on this decision variable is suppressed for brevity, and the expression for false negatives in testing cycle <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></math> is:

5 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><mi>F</mi><msub><mi>N</mi><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>=</mo><mfenced close="]" open="["><msub><mi>E</mi><mrow><mi>c</mi><mo>-</mo><mi>k</mi></mrow></msub><mo>+</mo><msub><mi>A</mi><mrow><mi>c</mi><mo>-</mo><mi>k</mi></mrow></msub><mo>·</mo><mfenced close=")" open="("><mn>1</mn><mo>-</mo><msub><mi>S</mi><mi>e</mi></msub></mfenced></mfenced><mo>·</mo><mi>τ</mi><mo>,</mo></mrow></mtd></mtr></mtable></mrow></math>

Graph

where, as we define in Appendix A, k is the number of cycles required to obtain test results. Note that false negatives include all individuals in set E — i.e., exposed to the virus but currently undetectable due to the relative insensitivity of the test, based on <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>S</mi><mi>e</mi></msub></math> — in addition to a fraction of set A (Asymptomatic), specifically <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">(</mo><mn>1</mn><mo>-</mo><msub><mi>S</mi><mi>e</mi></msub><mo stretchy="false">)</mo></mrow></math> , who test negative due to the test sensitivity error level.

Therefore, we can use these defined objectives to formulate the following multi-objective programming model in order to design the most efficient testing protocol that optimizes simultaneously our performance criteria:

6 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><munder><mo movablelimits="true">min</mo><mrow><msub><mi>F</mi><mn>1</mn></msub><mo>∈</mo><msup><mi>S</mi><msub><mi>F</mi><mn>1</mn></msub></msup><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>∈</mo><msup><mi>S</mi><msub><mi>F</mi><mn>2</mn></msub></msup><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo>∈</mo><msup><mi>S</mi><msub><mi>T</mi><mn>1</mn></msub></msup></mrow></munder><mrow><mo stretchy="false">{</mo><mi>I</mi><msub><mi>N</mi><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></msub><mo>,</mo><mi>F</mi><msub><mi>N</mi><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></msub><mo>,</mo><mi>T</mi><msub><mi>C</mi><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></msub><mo stretchy="false">}</mo></mrow><mo>,</mo></mrow></mtd></mtr></mtable></mrow></math>

Graph

where <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>1</mn></msub></math> , <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>2</mn></msub></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>T</mi><mn>1</mn></msub></math> are the decision variables, as defined above, and where <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>S</mi><msub><mi>F</mi><mn>1</mn></msub></msup></math> , <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>S</mi><msub><mi>F</mi><mn>2</mn></msub></msup></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>S</mi><msub><mi>T</mi><mn>1</mn></msub></msup></math> denote the sets of feasible quantities for decision variables <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>1</mn></msub></math> , <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>2</mn></msub></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>T</mi><mn>1</mn></msub></math> , respectively. For example, a possible choice for <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>S</mi><msub><mi>F</mi><mn>1</mn></msub></msup></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>S</mi><msub><mi>F</mi><mn>2</mn></msub></msup></math> might be <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">{</mo><mtext mathvariant="italic">twice</mtext><mspace width="0.333333em" /><mtext mathvariant="italic">per</mtext><mspace width="0.333333em" /><mtext mathvariant="italic">day</mtext><mo>,</mo><mtext mathvariant="italic">daily</mtext><mo>,</mo><mtext mathvariant="italic">every</mtext><mspace width="0.333333em" /><mtext mathvariant="italic">two</mtext><mspace width="0.333333em" /><mtext mathvariant="italic">days</mtext><mo stretchy="false">}</mo></mrow></math> , whereas <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>S</mi><msub><mi>T</mi><mn>1</mn></msub></msup></math> might be defined as <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">{</mo><mi>t</mi><mo>∈</mo><mi mathvariant="script">P</mi><mo stretchy="false">|</mo><mn>0</mn><mo>≤</mo><mi>t</mi><mo>≤</mo><mi>T</mi><mo stretchy="false">}</mo></mrow></math> , where <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="script">P</mi></math> is the set of non-negative integers and T is the maximum length of time defined in the study. Thus, the optimization defined in expression (6) specifies the optimal values for the initial screening frequency, the secondary screening frequency, and the length of initial screening horizon in such a way that total infections, total false negatives and total cost are jointly minimized. We introduce a multi-objective simulated annealing algorithm in the next section in order to derive the Pareto optimal front for this multi-objective optimization problem.

Optimization algorithm

Simulated annealing, initially proposed by Kirkpatrick et al. [[35]], Johnson et al. [[36]], is a local search algorithm in which non-improving moves are probabilistically accepted in an attempt to avoid becoming trapped in the region of a low-quality, locally-optimal solution and prematurely terminating the algorithm. The algorithm is a heuristic technique drawing on an analogy from physical annealing processes, such as those used in fabricating steel components. It was originally designed for solving single objective optimization problems and was proven to be robust and convergent if the annealing rate that governs the search progress is sufficiently slow [[38]]. The first multi-objective version of simulated annealing was proposed by Serafini [[39]], with a modification of the acceptance criteria of solutions from the original algorithm developed in Serafini [[40]]. Engrand [[41]] proposed maintaining an external archive populated by all non-dominated solutions found at the current point in time in the search process. Subsequently, several Multi-Objective Simulated Annealing (MOSA) methods were developed that incorporated a dynamically updated Pareto set (e.g. Czyzżak and Jaszkiewicz [[42]] and Hapke et al. [[43]]). The solution acceptance criteria of these methods are all derived from the difference between the new and current solutions. However, in the presence of a Pareto set spanning multiple objectives, solely comparing the new solution to the current solution is not necessarily straightforward. That is why there have been a few techniques proposed that use a Pareto domination-based acceptance criterion in MOSA [[44], [46]–[47]], the salient feature of which is that the domination status of a given solution is considered not only with respect to the current solution but also to the archive of non-dominated solutions found up to that point in time. It has been widely demonstrated that simulated annealing algorithms are capable of finding multiple Pareto-optimal solutions in a single run. A detailed survey on single-objective and multi-objective simulated annealing can be found in Suman and Kumar [[48]].

Solution process

As we mention in Section 1, our solution approach is scalable in terms of testing population coverage, and can be employed at the level of a small, focused population, or across several populations that might represent, for example, a collection of school districts or multiple sites of a business firm. In the computational example that we present in Section 5, we provide the details related to identifying the appropriate population values that represent testing in public schools in the U.S. state of North Carolina, in which school districts are organized almost exclusively at the level of counties within the state. Thus, one issue that must be addressed in our solution process is whether a solution that is being applied to a given collection of counties should be solved independently across those counties — searching for the independently optimal solution in each county — or to seek the single, common set of policy variables that provides the best trade-offs of the multiple objectives when implemented across all counties. Given the nature of our policy variables, we refer to a given solution as a testing protocol, which we will, from here forward, refer to simply as a protocol.

Algorithm formulation

In this section, we present a formulation of the MOSA algorithm that allows us to address the two distinct scenarios we describe above — a common protocol across all population sets versus independent protocols across those population sets. The whole formulation is based on an archive-based MOSA initially proposed by Bandyopadhyay et al. [[47]] that uses a dominance measure to compute the probability of acceptance of a new solution. In the first scenario, the original archive-based MOSA [[47]] is adapted in such a way that a common protocol is implemented over all the population centers in the chosen cluster of separate populations. However, the original archive-based MOSA algorithm is independently executed for each population to derive its corresponding Pareto optimal solutions. In other words, while a common protocol is enforced over all populations in the first scenario, each population follows its individually designed protocol in the second scenario. We investigate the difference between the computational outcomes generated by these two scenarios in further detail in Section 5.

We define the following parameters to be used in specifying our archive-based MOSA algorithm:

HL – The target number of non-dominated solutions in the archive

SL – The maximum number of non-dominated solutions in the archive before performing clustering to reduce archive's size to HL

<math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>T</mi><mo movablelimits="true">max</mo></msup></math> – The initial temperature

<math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>T</mi><mo movablelimits="true">min</mo></msup></math> – The final temperature

iter – The number of iterations at each temperature

<math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msup><mi>α</mi><mrow><mi mathvariant="italic">lower</mi></mrow></msup><mo>,</mo><msup><mi>α</mi><mrow><mi mathvariant="italic">upper</mi></mrow></msup></mrow></math> – The lower and upper bounds the temperature cooling rate distribution

Before elaborating the steps of the algorithm, we describe and/or formally define some terms common to archive-based MOSA algorithms.

Definition 1

Domination - In the context of a minimization pro-blem, given the set of objective functions as <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">{</mo><msub><mi>f</mi><mn>1</mn></msub><mrow><mo stretchy="false">(</mo><mtext>x</mtext><mo stretchy="false">)</mo></mrow><mo>,</mo><mo>.</mo><mo>.</mo><mo>.</mo><mo>,</mo><msub><mi>f</mi><mi>m</mi></msub><mrow><mo stretchy="false">(</mo><mtext>x</mtext><mo stretchy="false">)</mo></mrow><mo stretchy="false">}</mo></mrow></math> , a solution X dominates solution <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>X</mi><mo>′</mo></msup></math> , denoted as <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>X</mi><mo>≻</mo><msup><mi>X</mi><mo>′</mo></msup></mrow></math> in this study, if <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo>∀</mo><mi>k</mi><mo>∈</mo><mrow><mo stretchy="false">{</mo><mn>1</mn><mo>,</mo><mo>.</mo><mo>.</mo><mo>.</mo><mo>,</mo><mi>m</mi><mo stretchy="false">}</mo></mrow><mo>,</mo><msub><mi>f</mi><mi>k</mi></msub><mrow><mo stretchy="false">(</mo><mtext>x</mtext><mo stretchy="false">)</mo></mrow><mo>≤</mo><msub><mi>f</mi><mi>k</mi></msub><mrow><mo stretchy="false">(</mo><msup><mi mathvariant="normal">x</mi><mo>′</mo></msup><mo stretchy="false">)</mo></mrow></mrow></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo>∃</mo><mi>k</mi><mo>∈</mo><mo stretchy="false">{</mo><mn>1</mn><mo>,</mo><mo>.</mo><mo>.</mo><mo>.</mo><mo>,</mo><mi>m</mi><mo stretchy="false">}</mo></mrow></math> for which <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>f</mi><mi>k</mi></msub><mrow><mo stretchy="false">(</mo><mtext>x</mtext><mo stretchy="false">)</mo></mrow><mo><</mo><msub><mi>f</mi><mi>k</mi></msub><mrow><mo stretchy="false">(</mo><msup><mi mathvariant="normal">x</mi><mo>′</mo></msup><mo stretchy="false">)</mo></mrow></mrow></math> .

For example, if we have three minimizing objective functions whose values for solutions X and Y are (12.47, 3.18, 0.94) and (12.47, 3.64, 1.07), respectively, then according to the aforementioned definition, solution X dominates solution Y, i.e. <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>X</mi><mo>≻</mo><mi>Y</mi></mrow></math> .

A solution that is not dominated by other existing solutions is referred to as a non-dominated solution. Continuing the example above, if we identify another solution, Z, whose solution values are (12.21, 3.43, 0.92), then solutions X and Z are non-dominated, but solution Y remains dominated by the other two solutions. Two non-dominated solutions X and <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>X</mi><mo>′</mo></msup></math> are denoted as <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>X</mi><mo>≈</mo><msup><mi>X</mi><mo>′</mo></msup></mrow></math> in this study. The set of non-dominated solutions constitutes the Pareto optimal solution set or Pareto optimal frontier of the problem. The amount of domination between two solutions X and <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>X</mi><mo>′</mo></msup></math> , which will be used in computing the acceptance probability of a new solution, is calculated as follows [[47]]:

7 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><mi mathvariant="normal">Δ</mi><mi>d</mi><mi>o</mi><msub><mi>m</mi><mrow><mi>X</mi><mo>,</mo><msup><mi>X</mi><mo>′</mo></msup></mrow></msub><mo>=</mo><munderover><mo>∏</mo><mrow><mi>i</mi><mo>=</mo><mn>1</mn></mrow><mi>m</mi></munderover><mfrac><mrow><mrow><mo stretchy="false">|</mo></mrow><msub><mi>f</mi><mi>i</mi></msub><mrow><mo stretchy="false">(</mo><mtext>x</mtext><mo stretchy="false">)</mo></mrow><mo>-</mo><msub><mi>f</mi><mi>i</mi></msub><mrow><mo stretchy="false">(</mo><msup><mi mathvariant="normal">x</mi><mo>′</mo></msup><mo stretchy="false">)</mo></mrow><mrow><mo stretchy="false">|</mo></mrow></mrow><msub><mi>R</mi><mi>i</mi></msub></mfrac></mrow></mtd></mtr></mtable></mrow></math>

Graph

where <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>f</mi><mi>i</mi></msub><mrow><mo stretchy="false">(</mo><mtext>x</mtext><mo stretchy="false">)</mo></mrow><mo>≠</mo><msub><mi>f</mi><mi>i</mi></msub><mrow><mo stretchy="false">(</mo><msup><mi mathvariant="normal">x</mi><mo>′</mo></msup><mo stretchy="false">)</mo></mrow></mrow></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>R</mi><mi>i</mi></msub></math> is the range of the i-th objective function. As <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>R</mi><mi>i</mi></msub></math> might be unknown a priori, all the non-dominated solutions in the archive along with the current and new solutions are employed to calculate it. In other words, given <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="double-struck">X</mi></math> , <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>X</mi><mo>′</mo></msup></math> , and X denote the the existing archive, new solution and current solution, respectively, then <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>R</mi><mi>i</mi></msub><mo>=</mo><mo movablelimits="true">max</mo><mspace width="0.166667em" /><msub><mi>f</mi><mi>i</mi></msub><mrow><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><mo>-</mo><mo movablelimits="true">min</mo><mspace width="0.166667em" /><msub><mi>f</mi><mi>i</mi></msub><mrow><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow></mrow></math> , where <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>x</mi><mo>∈</mo><mi mathvariant="double-struck">X</mi><mo>∪</mo><mi>X</mi><mo>∪</mo><msup><mi>X</mi><mo>′</mo></msup></mrow></math> .

Clustering - Clustering is a mechanism that is used to build the MOSA archive in a way that results in a diverse set of non-dominated solutions. Given HL and SL parameters, the size of the archive potentially rises to SL, at which point a clustering procedure is executed to reduce the archive's size to HL. Bandyopadhyay et al. [[47]] recommended using the single linkage algorithm [[49]] as the clustering procedure in their archive MOSA algorithm, where the distance between any two clusters corresponds to the length of the shortest Euclidean distance from any solution in the first cluster to any solution in the second cluster. Given an archive and HL parameter, the clustering operator is denoted by Cluster(archive, HL) throughout this study.

In the following sections, we describe our general MOSA algorithm and its specific implementation with respect to the multiple objective values for our testing protocol optimization problem. With only a slight change in the execution of the algorithm, it can be used to generate a set of Pareto frontiers for either the common-protocol or independent-protocol settings, as we discussed in Section 4.1.

Scenario 1 – Implementing a common protocol

In this section, we step through each of the major components of the MOSA algorithm — initialization, evaluation and perturbation, transition, and temperature updating — in the context of the first of our two solution scenarios, the one for which a common protocol is implemented by all populations in the set being considered. These steps are elaborated more formally in the pseudocode provided in Algorithm 1. Throughout the discussion below, note that we make reference to specific lines that can be found in Algorithm 1.

Graph: Algorithm 1 Multi-objective simulated annealing - scenario 1.

<bold>Initialization</bold> – The algorithm begins with the initialization of a number of non-dominated solutions in line 3. Algorithm 2 elaborates how to initialize an archive for Scenario 1. In this algorithm, a random solution for each decision variable <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>1</mn></msub></math> , <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>2</mn></msub></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>T</mi><mn>1</mn></msub></math> is generated from the corresponding feasible sets <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>S</mi><msub><mi>F</mi><mn>1</mn></msub></msup></math> , <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>S</mi><msub><mi>F</mi><mn>2</mn></msub></msup></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi>S</mi><msub><mi>T</mi><mn>1</mn></msub></msup></math> , respectively, using the random operator in lines 4 to 6. Since such a solution is implemented across all populations in Scenario 1, the total value of objective functions for all populations is calculated in lines 8 to 10. Based on the total value of objective functions and Definition 1, the dominance status of the current random solution is assessed with respect to all the previous solutions in the archive. If the current random solution is non-dominant compared to all other solutions in the archive, then it is added to the archive. This operation is executed in lines 13 to 20 in Algorithm 2. Finally, the archive's size is reduced to HL in line 25 by means of clustering if it goes beyond SL.

Graph: Algorithm 2 InitScen1 - initialization of an archive in scenario 1.

<bold>Evaluation and perturbation</bold> – After picking a random non-dominated solution from the archive as the current solution (line 4), the values of objective functions associated with the current solution are obtained (line 5) using the ObjEval procedure outlined in Algorithm 3. During this stage, it is crucial to highlight that the entire SEIR procedure, as depicted in Fig. 1, is executed to acquire the objective function values pertaining to a randomly selected non-dominated solution. Given a solution and a set of populations, Algorithm 3 generates the solution for each population, p, sums up the corresponding values of each objective function over all populations, and ultimately returns a tuple of the total values of objective functions, i.e. <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">{</mo><mi>o</mi><mi>b</mi><msup><mi>j</mi><mrow><mi mathvariant="italic">TC</mi></mrow></msup><mo>,</mo><mi>o</mi><mi>b</mi><msup><mi>j</mi><mrow><mi mathvariant="italic">FN</mi></mrow></msup><mo>,</mo><mi>o</mi><mi>b</mi><msup><mi>j</mi><mrow><mi mathvariant="italic">IN</mi></mrow></msup><mo stretchy="false">}</mo></mrow></math> . Although Algorithm 3 seems relatively straightforward, it is the basis for distinguishing between the Scenario 1 and Scenario 2 solutions, as we describe in Section 4.2.2.

Graph: Algorithm 3 ObjEval - evaluating the objective functions.

Next, a new solution is generated (line 8 of Algorithm 1) by perturbing the current solution. Recall that any solution stored in the archive is a 3-tuple of decision variables that specifies the initial screening frequency, <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>1</mn></msub></math> , the secondary screening frequency, <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>2</mn></msub></math> , and the length of the initial screening frequency, <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>T</mi><mn>1</mn></msub></math> . Thus, seven perturbation mechanisms may be applied to perturb the current solution: (1-3) perturbing any of <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>T</mi><mn>1</mn></msub></math> , <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>1</mn></msub></math> , or <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>F</mi><mn>2</mn></msub></math> individually; (4-6) jointly perturbing any pair of these decision variables, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">(</mo><msub><mi>T</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>1</mn></msub><mo stretchy="false">)</mo></mrow></math> , <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">(</mo><msub><mi>T</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo stretchy="false">)</mo></mrow></math> , or <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">(</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo stretchy="false">)</mo></mrow></math> ; (7) jointly perturbing all three decision variables, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">(</mo><msub><mi>T</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo stretchy="false">)</mo></mrow></math> .

The MOSA algorithm randomly selects one of these mechanisms to be executed, and the variable(s) in question are perturbed within their corresponding feasible domains. Then, the values of objective functions related to the newly generated solution are obtained by applying Algorithm 3 (line 9).

<bold>Transition</bold> – Given current and new solutions with their corresponding objective function values, i.e., <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>O</mi><mi>b</mi><msub><mi>j</mi><mi>c</mi></msub></mrow></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>O</mi><mi>b</mi><msub><mi>j</mi><mi>n</mi></msub></mrow></math> , three different outcomes are possible in terms of the domination status between them:

Graph: Algorithm 4 DomStatus - specifying the domination status of a new solution against an archive.

Case I: The current solution dominates the newly generated solution.

Case II: The current and new solutions are non-dominated with each other.

Case III: The new solution dominates the current solution.

Before describing the actions of the MOSA algorithm in each of these cases, we first note that, prior to the evaluation of how the current and new solutions compare, Algorithm 1 at line 10 derives K and <math xmlns="http://www.w3.org/1998/Math/MathML"><mover accent="true"><mrow><mi>K</mi></mrow><mrow><mo stretchy="false">¯</mo></mrow></mover></math> by means of Algorithm 4, where K and <math xmlns="http://www.w3.org/1998/Math/MathML"><mover accent="true"><mrow><mi>K</mi></mrow><mrow><mo stretchy="false">¯</mo></mrow></mover></math> are defined as follows:

K – the set of solutions in the archive that dominate the new solution.

<math xmlns="http://www.w3.org/1998/Math/MathML"><mover accent="true"><mrow><mi>K</mi></mrow><mrow><mo stretchy="false">¯</mo></mrow></mover></math> – the set of solutions in the archive that are dominated by the new solution.

Now, we can describe the details of the three comparison outcomes:

Case I: First, the average amount of domination of the new solution by <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">|</mo><mi>K</mi><mo stretchy="false">|</mo><mo>+</mo><mn>1</mn></mrow></math> solutions is calculated as <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi mathvariant="normal">Δ</mi><mi>d</mi><mi>o</mi><msub><mi>m</mi><mrow><mi mathvariant="italic">avg</mi></mrow></msub></mrow></math> (line 12), where |K| denotes the cardinality of set K. The new solution is then set to be the current solution with a probability of pr (line 13), by comparison to a randomly generated U(0, 1) value (line 15).

Case II: In this case (line 17), three different sub-conditions are assessed:

The total amount of domination of the |K| solutions in the archive over the new solution is computed (line 19), and the new solution is accepted to be the current solution with a probability of pr (line 20).

If the newly generated solution is non-dominated with respect to all the solutions in the archive (i.e., <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">|</mo><mi>K</mi><mo stretchy="false">|</mo><mo>=</mo><mo stretchy="false">|</mo><mover accent="true"><mrow><mi>K</mi></mrow><mrow><mo stretchy="false">¯</mo></mrow></mover><mo stretchy="false">|</mo><mo>=</mo><mn>0</mn></mrow></math> ), it is set as the current solution and added to the archive (lines 25 and 26, respectively). In this circumstance, if the archive's size exceeds SL, then clustering is performed to reduce its size (line 28).

If the new solution dominates <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">|</mo><mover accent="true"><mrow><mi>K</mi></mrow><mrow><mo stretchy="false">¯</mo></mrow></mover><mo stretchy="false">|</mo></mrow></math> solutions in the archive, then the new solution is accepted as the current solution and added to the archive (lines 31 and 32, respectively), and the solutions in set <math xmlns="http://www.w3.org/1998/Math/MathML"><mover accent="true"><mrow><mi>K</mi></mrow><mrow><mo stretchy="false">¯</mo></mrow></mover></math> are eliminated from the archive (line 33).

Case III: If the new solution dominates the current solution, the domination status of new solution and the archive solutions is assessed, and three different sub-conditions are again evaluated:

If the new solution is dominated by |K| solutions in the archive, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi mathvariant="normal">Δ</mi><mi>d</mi><mi>o</mi><msub><mi>m</mi><mo movablelimits="true">min</mo></msub></mrow></math> is computed in line 36 as the minimum of the domination difference between the new solution and the solutions in K. With a probability of pr (37), the archive solution associated with <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi mathvariant="normal">Δ</mi><mi>d</mi><mi>o</mi><msub><mi>m</mi><mo movablelimits="true">min</mo></msub></mrow></math> is set as the current solution, or the new solution is set to the current solution with a probability of <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mn>1</mn><mo>-</mo><mi>p</mi><mi>r</mi></mrow></math> .

If the new solution is non-dominating with respect to all solutions in the archive, then it is set as the current solution and added to the archive (lines 44 and 45, respectively). If the current solution is already in the archive, it is dropped (line 47). Otherwise, the size of the augmented archive is diminished to HL using the clustering if its size becomes greater than SL (line 50).

If neither of the above sub-conditions are found, then the new solution is set to be the current solution, it is added to the archive, and all dominated solutions in the archive are removed.

The internal for statement iterates iter times, such that, at a fixed temperature, iter different perturbed solutions in the neighborhood of the current solution are investigated.

<bold>Updating temperature</bold> – After examining the neighborhood of the current solution, the temperature is reduced in line 59, where a multiplier <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>r</mi><mi>a</mi><mi>n</mi><msub><mi>d</mi><mrow><mi>U</mi><mo stretchy="false">(</mo><msup><mi>α</mi><mrow><mi mathvariant="italic">lower</mi></mrow></msup><mo>,</mo><msup><mi>α</mi><mrow><mi mathvariant="italic">upper</mi></mrow></msup><mo stretchy="false">)</mo></mrow></msub></mrow></math> is used to smoothly reduce the annealing temperature. Our experiments showed that this cooling schedule works sufficiently well when deriving the Pareto optimal front.

At the end of the algorithm, if archive's size is greater than the SL, the clustering procedure reduces its size to HL (line 62).

Scenario 2 – Implementing independent protocols

As mentioned earlier, Scenario 2 describes a situation where each population can have its own protocol designed independently from other counties. In this circumstance, we need only implement the original archive MOSA algorithm on each population to derive its respective Pareto optimal solution. In other words, if the set POP in Algorithms 1 and 3 contains only a single population, then running Algorithm 1 n times, once for each of the n populations in the collection of all populations, yields the independently Pareto optimal solutions for each population.

Computational application and results

To demonstrate our implementation of the SEIR model and its use in developing optimal screening protocols for infectious diseases in congregate settings, we have built a realistic case scenario based on screening for COVID-19 in public schools. Here, we use the State of North Carolina as our case setting. Estimates of the student and staff population of schools in the United States were obtained from the "Global Epidemics" website, maintained by the Brown University School of Public Health [[50]].

Graph: Fig. 2 Prioritized clusters of counties in North Carolina using data from Aug 2020 - Jul 2021 (panel (a)) and Aug 2020 - Feb 2022 (panel (b)) — darker shading represents higher intensity of COVID-19 pandemic in that area

Population data for public school screening in North Carolina

Although the "Global Epidemics" site provides data on school populations by state (actually through estimates provided by the National Education Association—see [[51]]), we chose to narrow the focus to the county level within the state, as policy decisions in schools are made at local levels in the U.S., and in North Carolina most school systems are organized at the county level. This leaves us with a number of modeling and solution decisions to consider, namely:

How many of North Carolina's 100 counties can be included in a single solution of our optimization algorithm while still allowing it to be solved within a reasonably well-contained computational time?

Should individual counties covered by a given solution be constrained to use a common optimal protocol, or should we allow for individually-optimal county protocols?

Is it possible that solving for a common optimal protocol among a group of counties generates outcomes that are sufficiently similar to those generated by independently optimal protocols?

In order to identify clusters of counties to serve as target solution areas, we employ COVID-19 prediction tools developed by [[52]]. These tools generate a prioritization of counties within a given state in the U.S. via a weighted average of two computational methods for assessing the relative intensity of the COVID-19 pandemic across these counties. One computational method employs machine learning algorithms to predict COVID-19 hospitalizations and deaths based on a large number of county-level statistics regarding various public health measures, demographic factors, and local weather data. The other method utilizes the Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS), initially introduced by [[53]], to generate an overall priority weighting among multiple measures, in this case, data on COVID-19 hospitalizations and deaths in the weeks leading up to the week for which relative severity of the COVID-19 pandemic is being assessed.

Applying these prioritization methods to all 100 counties in the State of North Carolina, we generated the weighted average priority ranking of all 100 counties. Then, we employed the constrained clustering algorithm [[54]] to create 10 regional clusters of counties, each of which represents a geographical area that is sized in a way that allows us to answer the questions we pose above. Prioritized clusters are shown in Fig. 2 for data covering August 2020 to July 2021 (panel (a)) and August 2020 to February 2022 (panel (b)). The cluster that serves as the basis for our computations is highlighted in panel (a) of the figure, where we also provide the names of the individual counties. We note that this cluster — which includes Wake County, containing the state's capital and second-largest city, Raleigh — was the highest priority cluster in the state (i.e., highest relative intensity of the pandemic) in the period around July 2021, and the second highest priority cluster around February 2022.

In Table 4, we provide estimates of the testing population in the counties shown in Fig. 2. The source of the estimates of the overall school population for the State of North Carolina — which includes both students and staff — come from [[50]]. We allocate the statewide school population across counties by assuming that the school population follows the proportion of the total population in each county relative to the overall population of the state. State and county population statistics were obtained from [[55]]. We assume that each county's school population starts with 5% of students and staff infected with COVID-19 at the outset of the screening. In the context of the SEIR model (Fig. 1), this means that 95% of the population starts the horizon in state U and 5% in state A.

Table 4 School population estimates by county for high-priority cluster in North Carolina

<table frame="hsides" rules="groups"><thead><tr><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left">Initial</th><th align="left">Initial</th></tr><tr><th align="left" /><th align="left">County</th><th align="left">Proportion of</th><th align="left">Estimated</th><th align="left">Estimated</th><th align="left">Estimated</th><th align="left">susceptible</th><th align="left">infected</th></tr><tr><th align="left">Entity</th><th align="left">population</th><th align="left">NC population</th><th align="left">Grades K-8</th><th align="left">Grades 9-12</th><th align="left">total</th><th align="left">(95% of total)</th><th align="left">(5% of total)</th></tr></thead><tbody><tr><td align="left">State of North Carolina</td><td align="left">–</td><td align="left">–</td><td align="left">1,168,436</td><td align="left">362,635</td><td align="left">1,531,071</td><td align="left">–</td><td align="left">–</td></tr><tr><td align="left">Alamance county</td><td align="left">169,509</td><td align="left">0.0162</td><td align="left">18,884</td><td align="left">5,861</td><td align="left">24,745</td><td align="left">23,508</td><td align="left">1,237</td></tr><tr><td align="left">Caswell county</td><td align="left">22,604</td><td align="left">0.0022</td><td align="left">2,518</td><td align="left">782</td><td align="left">3,300</td><td align="left">3,135</td><td align="left">165</td></tr><tr><td align="left">Chatham county</td><td align="left">74,470</td><td align="left">0.0071</td><td align="left">8,296</td><td align="left">2,575</td><td align="left">10,871</td><td align="left">10,327</td><td align="left">544</td></tr><tr><td align="left">Durham county</td><td align="left">321,488</td><td align="left">0.0307</td><td align="left">35,816</td><td align="left">11,116</td><td align="left">46,931</td><td align="left">44,584</td><td align="left">2,347</td></tr><tr><td align="left">Granville county</td><td align="left">60,443</td><td align="left">0.0058</td><td align="left">6,734</td><td align="left">2,090</td><td align="left">8,824</td><td align="left">8,383</td><td align="left">441</td></tr><tr><td align="left">Harnett county</td><td align="left">135,976</td><td align="left">0.0130</td><td align="left">15,149</td><td align="left">4,701</td><td align="left">19,850</td><td align="left">18,858</td><td align="left">993</td></tr><tr><td align="left">Lee county</td><td align="left">61,779</td><td align="left">0.0059</td><td align="left">6,883</td><td align="left">2,136</td><td align="left">9,019</td><td align="left">8,568</td><td align="left">451</td></tr><tr><td align="left">Orange county</td><td align="left">148,476</td><td align="left">0.0142</td><td align="left">16,541</td><td align="left">5,134</td><td align="left">21,675</td><td align="left">20,591</td><td align="left">1,084</td></tr><tr><td align="left">Person county</td><td align="left">39,490</td><td align="left">0.0038</td><td align="left">4,399</td><td align="left">1,365</td><td align="left">5,765</td><td align="left">5,477</td><td align="left">288</td></tr><tr><td align="left">Wake county</td><td align="left">1,111,761</td><td align="left">0.1060</td><td align="left">123,857</td><td align="left">38,440</td><td align="left">162,297</td><td align="left">154,182</td><td align="left">8,115</td></tr></tbody></table>

Case parameters for public school screening

Below, we provide details on the case parameters that we explore in our computational experiments. The parameter values chosen are based on a combination knowledge of the tests employed (e.g., sensitivity and specificity of antigen tests and their costs when purchased in bulk), on the nature of the SEIR system described in Fig. 1, or by using virus behavior parameters that match those in [[7]].

<uline>Test parameters</uline>

Test sensitivity and specificity, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>S</mi><mi>e</mi></msub><mo>=</mo><mn>0.80</mn></mrow></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>S</mi><mi>p</mi></msub><mo>=</mo><mn>0.98</mn></mrow></math>

Test cost = Confirmatory test cost = $5 per test

Number of cycles to obtain test result, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>k</mi><mo>=</mo><mn>0</mn></mrow></math> (see Appendix A)

<uline>Testing environment parameters</uline>

Number of test cycles per day = 1; number of days per week = 5

Testing horizon = 80 days (16 weeks)

Exogenous infection shock frequency = 5 days

New infections per shock = <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>S</mi><mi>h</mi><mi>o</mi><mi>c</mi><msub><mi>k</mi><mrow><mi mathvariant="italic">pct</mi></mrow></msub><mo>·</mo><mi>P</mi><mi>o</mi><mi>p</mi></mrow></math> , where Pop is the size of the population over which testing is being applied (e.g., school system population) and <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>S</mi><mi>h</mi><mi>o</mi><mi>c</mi><msub><mi>k</mi><mrow><mi mathvariant="italic">pct</mi></mrow></msub></mrow></math> is a scaling factor that sets the extent of exogenous infections (see "Robustness parameters" below)

<uline>Virus parameters</uline>

Virus incubation period = 3 days; time to recovery from infection = 10 days

Fraction of asymptomatic individuals advancing to symptomatic infection = 0.3

Symptomatic case fatality ratio = 0.0001

Time to return FP from isolation = 1 day

<uline>Decision variable sets</uline>

<math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>F</mi><mn>1</mn></msub><mo>∈</mo></mrow></math> {Daily, Every 2 Days, Every 3 Days, Weekly, Every 2 Weeks, Every 3 Weeks, Every 4 Weeks}

<math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>F</mi><mn>2</mn></msub><mo>∈</mo></mrow></math> {Daily, Every 2 Days, Every 3 Days, Weekly, Every 2 Weeks, Every 3 Weeks, Every 4 Weeks}

<math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>T</mi><mn>1</mn></msub><mo>∈</mo><mrow><mo stretchy="false">{</mo><mn>1</mn><mo>,</mo><mo>...</mo><mo>,</mo><mn>16</mn><mo stretchy="false">}</mo></mrow></mrow></math> [weeks]

Graph: Fig. 3 Independent-protocol versus common-protocol solutions for Caswell County, NC

Graph: Fig. 4 Independent-protocol versus common-protocol solutions for Wake County, NC

Finally, we perform robustness tests on parameters that reflect the environment in which the screening policy is being employed. Those key parameters are the number of infections introduced into the congregate setting in question from outside the system and the level of infectiousness of the virus, as measured by <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>R</mi><mn>0</mn></msub></math> , which epidemiologists call the "basic reproductive number" of a given virus (see [[56]]) The first robustness parameter that we explore stems from the structure of the underlying SEIR model of virus transmission. This parameter, which is denoted by X in Fig. 1, is the fixed number of new infections that enter the system every q testing cycles. We use the same terminology as [[7]], referring to these as exogenous shocks of new infections. In exploring the sensitivity of our 10-county solutions to the level of exogenous shocks, we base the computation of X on the underlying population of the testing pool. Specifically, we consider the base level to be 1% of the testing pool, and explore two levels less than this, 0.5% and 0.1%, and two larger levels, 2% and 6.25%, the last of which would represent a level of introduced infections that equals the total population (100%) over a 16-week school semester. In addition, we consider three levels of <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>R</mi><mn>0</mn></msub></math> . The first, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>R</mi><mn>0</mn></msub><mo>=</mo><mn>2.3</mn></mrow></math> , reflects epidemiologists' estimates of the infectiousness of the SARS-CoV-2 virus in the first year of the COVID-19 pandemic, during which the original, "wild-type" variant of the virus was circulating [[57]]. The lower level, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>R</mi><mn>0</mn></msub><mo>=</mo><mn>1.5</mn></mrow></math> , is intended to represent a level that could be expected in a congregate setting in which most individuals follow good hygiene, social distancing, and masking [[57]]. The higher level, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>R</mi><mn>0</mn></msub><mo>=</mo><mn>5.0</mn></mrow></math> , is intended to represent virus spread at a level that was seen later in the COVID-19 pandemic, as variants of the virus mutated in ways that helped it spread much more rapidly [[58]].

<uline>Robustness parameters</uline> (base case value in bold)

<math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>S</mi><mi>h</mi><mi>o</mi><mi>c</mi><msub><mi>k</mi><mrow><mi mathvariant="italic">pct</mi></mrow></msub><mo>∈</mo><mrow><mo stretchy="false">{</mo><mn>0.1</mn><mo>%</mo><mo>,</mo><mn>0.5</mn><mo>%</mo><mo>,</mo><mrow><mn mathvariant="bold">1.0</mn><mo>%</mo></mrow><mo>,</mo><mn>2.0</mn><mo>%</mo><mo>,</mo><mn>6.25</mn><mo>%</mo><mo stretchy="false">}</mo></mrow></mrow></math> (noting that 6.25% implies a number of infections covering 100% of the population over 16 weeks)

Generating Pareto-optimal protocols for public school screening in North Carolina

We employ the MOSA algorithms, described in Section 4, to search for non-dominated protocols for screening, based on the joint objectives of minimizing testing costs, minimizing infections, and minimizing false negative tests. We employ two solution methods, one that searches for the optimal protocol in each county, independently of the protocols in any other county in the overall target set of counties, and one that constrains the counties to have a common protocol.

The common-protocol approach has practical benefits, both from the standpoint of more easily solving the problem and from the standpoint of implementing the protocol from a public health perspective. On the latter point, we can assume that the public officials in the State of North Carolina would prefer to see more consistent testing protocols employed across counties, as opposed to having protocols that differ across counties, particularly between adjoining counties. Public health officials would clearly prefer to explain a commonly-deployed protocol for screening tests in county schools, as opposed to having to explain why County A's protocol differs from that of adjoining County B.

Table 5 Comparative solutions for common and independent protocols – Caswell County, NC

<table frame="hsides" rules="groups"><thead><tr><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left">Cost per</th></tr><tr><th align="left" /><th align="left">Initial</th><th align="left">Weeks of</th><th align="left">Secondary</th><th align="left" /><th align="left" /><th align="left">Number of</th><th align="left">Maximum</th><th align="left">Cost per</th><th align="left">reduction in</th></tr><tr><th align="left" /><th align="left">screening</th><th align="left">initial</th><th align="left">screening</th><th align="left">Testing</th><th align="left">Number of</th><th align="left">false</th><th align="left">number in</th><th align="left">case</th><th align="left">false</th></tr><tr><th align="left">Solution</th><th align="left">frequency</th><th align="left">screening</th><th align="left">frequency</th><th align="left">cost</th><th align="left">infections</th><th align="left">negatives</th><th align="left">isolation</th><th align="left">averted</th><th align="left">negatives</th></tr></thead><tbody><tr><td align="left">Common</td><td align="left">Every 2 Days</td><td align="left">11</td><td align="left">Weekly</td><td align="left">$423,105</td><td align="left">1,802</td><td align="left">66</td><td align="left">200</td><td align="left">$317</td><td align="left">$6,899</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">13</td><td align="left">Every 3 Weeks</td><td align="left">$435,852</td><td align="left">1,720</td><td align="left">62</td><td align="left">200</td><td align="left">$308</td><td align="left">$6,643</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">12</td><td align="left">Every 3 Days</td><td align="left">$465,751</td><td align="left">1,551</td><td align="left">59</td><td align="left">200</td><td align="left">$294</td><td align="left">$6,748</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">16</td><td align="left">Every 2 Weeks</td><td align="left">$498,249</td><td align="left">1,460</td><td align="left">56</td><td align="left">200</td><td align="left">$297</td><td align="left">$6,950</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">15</td><td align="left">Daily</td><td align="left">$520,607</td><td align="left">1,443</td><td align="left">56</td><td align="left">200</td><td align="left">$308</td><td align="left">$7,244</td></tr><tr><td align="left">Independent</td><td align="left">Weekly</td><td align="left">2</td><td align="left">Every 2 Days</td><td align="left">$405,831</td><td align="left">1,755</td><td align="left">67</td><td align="left">287</td><td align="left">$294</td><td align="left">$6,785</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">13</td><td align="left">Weekly</td><td align="left">$455,370</td><td align="left">1,594</td><td align="left">59</td><td align="left">200</td><td align="left">$295</td><td align="left">$6,702</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">3</td><td align="left">Every 2 Days</td><td align="left">$498,249</td><td align="left">1,460</td><td align="left">56</td><td align="left">200</td><td align="left">$297</td><td align="left">$6,991</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">15</td><td align="left">Daily</td><td align="left">$520,607</td><td align="left">1,443</td><td align="left">56</td><td align="left">200</td><td align="left">$308</td><td align="left">$7,287</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">14</td><td align="left">Daily</td><td align="left">$543,923</td><td align="left">1,418</td><td align="left">55</td><td align="left">200</td><td align="left">$317</td><td align="left">$7,530</td></tr></tbody></table>

Table 6 Comparative solutions for common and independent protocols – Wake County, NC

<table frame="hsides" rules="groups"><thead><tr><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left" /><th align="left">Cost per</th></tr><tr><th align="left" /><th align="left">Initial</th><th align="left">Weeks of</th><th align="left">Secondary</th><th align="left" /><th align="left" /><th align="left">Number of</th><th align="left">Maximum</th><th align="left">Cost per</th><th align="left">reduction in</th></tr><tr><th align="left" /><th align="left">screening</th><th align="left">initial</th><th align="left">screening</th><th align="left">Testing</th><th align="left">Number of</th><th align="left">false</th><th align="left">number in</th><th align="left">case</th><th align="left">false</th></tr><tr><th align="left">Solution</th><th align="left">frequency</th><th align="left">screening</th><th align="left">frequency</th><th align="left">cost</th><th align="left">infections</th><th align="left">negatives</th><th align="left">isolation</th><th align="left">averted</th><th align="left">negatives</th></tr></thead><tbody><tr><td align="left">Common</td><td align="left">Every 2 Days</td><td align="left">11</td><td align="left">Weekly</td><td align="left">$20,808,657</td><td align="left">88,620</td><td align="left">3,263</td><td align="left">9,851</td><td align="left">$317</td><td align="left">$6,899</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">13</td><td align="left">Every 3 Weeks</td><td align="left">$21,435,561</td><td align="left">84,592</td><td align="left">3,052</td><td align="left">9,851</td><td align="left">$308</td><td align="left">$6,643</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">12</td><td align="left">Every 3 Days</td><td align="left">$22,905,993</td><td align="left">76,273</td><td align="left">2,884</td><td align="left">9,851</td><td align="left">$294</td><td align="left">$6,747</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">16</td><td align="left">Every 2 Weeks</td><td align="left">$24,504,259</td><td align="left">71,787</td><td align="left">2,753</td><td align="left">9,851</td><td align="left">$297</td><td align="left">$6,950</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">15</td><td align="left">Daily</td><td align="left">$25,603,874</td><td align="left">70,979</td><td align="left">2,744</td><td align="left">9,851</td><td align="left">$308</td><td align="left">$7,243</td></tr><tr><td align="left">Independent</td><td align="left">Every 2 Days</td><td align="left">7</td><td align="left">Every 3 Days</td><td align="left">$20,279,654</td><td align="left">87,146</td><td align="left">3,294</td><td align="left">9,851</td><td align="left">$303</td><td align="left">$6,844</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">12</td><td align="left">Weekly</td><td align="left">$21,624,808</td><td align="left">83,013</td><td align="left">3,068</td><td align="left">9,851</td><td align="left">$304</td><td align="left">$6,780</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">13</td><td align="left">Every 3 Days</td><td align="left">$23,337,088</td><td align="left">74,590</td><td align="left">2,826</td><td align="left">9,851</td><td align="left">$293</td><td align="left">$6,800</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">15</td><td align="left">Every 3 Days</td><td align="left">$24,130,456</td><td align="left">72,203</td><td align="left">2,756</td><td align="left">9,851</td><td align="left">$294</td><td align="left">$6,891</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">10</td><td align="left">Every 2 Days</td><td align="left">$24,504,259</td><td align="left">71,787</td><td align="left">2,753</td><td align="left">9,851</td><td align="left">$297</td><td align="left">$6,991</td></tr></tbody></table>

An important question, however, regarding the independent-protocol versus common-protocol solutions is whether they generate outcomes that are consistent in terms of the multi-objective trade-offs. In terms of an "eye test," Figs. 3 and 4 show the cost-versus-infections trade-off in the MOSA solutions for the least populous and most populous counties in our 10-county solution set, Caswell County, NC and Wake County, NC, respectively. One can observe from these figures that the common-protocol trade-off curve and that of the independent protocol solutions are nearly coincident with one another. We also note here that these cost-versus-infections plots are perhaps not as perfectly smooth as one might expect, containing some points that appear to be dominated by other points on the graph. Our MOSA solution, however, is applied to three objectives, so those points that might appear to be dominated on the cost-versus-infections trade-off must exhibit a preferable trade-off on either or both of the infections-versus-false negatives and cost-versus-false negatives trade-offs. We note that the false negative-vs-cost curves are not shown, as they are essentially identical to the infections-vs-cost curves. Thus, while our solutions limit the number of false negatives, we focus only on the outcome of limiting the spread of the viral infections.

Graph: Fig. 5 Sensitivity of infections-vs-cost trade-off to exogenous shock level for 10-county cluster in NC

Rather than depending solely on an "eye test", we offer a more rigorous method is detailed in Appendix B to compare the independent- and common-protocol solutions for each county in the solution set. This approach utilizes two post-hoc statistics that can be computed quickly and analyzed to assess whether there is a statistically-significant difference between the sets of Pareto-optimal solutions from each solution method for each county in our 10-county cluster. The two statistics are what we denote as the cost per case averted (CCA) and the cost per reduction in false negatives (CRFN). The definitions of those statistics may be found in Appendix B. From the formal quantitative comparisons and statistical tests detailed in Appendix B, we conclude that, for our example solution set of 10 counties, the common-protocol and independent-protocol solutions reflect consistent outcomes. Thus, we conclude that a common-protocol solution — which can not only be computed more quickly (approximately 20 minutes, as compared to approximately 35 minutes to generate the set of independent protocols for all 10 counties), but also is arguably easier for public health officials to explain, implement, and coordinate — can be employed for all 10 counties. In Tables 5 and 6 we provide some details on Pareto-frontier solutions from Figs. 3 and 4 that appear in what we call the "elbow" of the infections-cost tradeoff — i.e., the inflection point where the marginal decline in infections with increased testing cost levels off. In Fig. 4, for example, this occurs at around $20M of testing cost. In our view, this is a "balanced" protocol, the point at which greater investment in testing begins to generate less return in terms of reduced levels of infection.

Table 7 Selected solutions across different levels of infection prevalence – NC 10-county cluster

<table frame="hsides" rules="groups"><thead><tr><th align="left" /><th align="left" /><th align="left">Weeks of</th><th align="left">Secondary</th><th align="left" /><th align="left" /><th align="left" /><th align="left">Maximum</th><th align="left">Cost per</th><th align="left">Cost per</th></tr><tr><th align="left" /><th align="left">Initial screening</th><th align="left">initial</th><th align="left">screening</th><th align="left" /><th align="left">Number of</th><th align="left">Number of</th><th align="left">number in</th><th align="left">case</th><th align="left">reduction in</th></tr><tr><th align="left">Shock level</th><th align="left">frequency</th><th align="left">screening</th><th align="left">frequency</th><th align="left">Testing cost</th><th align="left">infections</th><th align="left">false negatives</th><th align="left">isolation</th><th align="left">averted</th><th align="left">false negatives</th></tr></thead><tbody><tr><td align="left">0.10%</td><td align="left">Every 2 Days</td><td align="left">11</td><td align="left">Every 2 Weeks</td><td align="left">$41,433,980</td><td align="left">51,360</td><td align="left">196</td><td align="left">16,951</td><td align="left">$168.26</td><td align="left">$168</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">12</td><td align="left">Every 2 Weeks</td><td align="left">$44,062,551</td><td align="left">47,317</td><td align="left">183</td><td align="left">16,951</td><td align="left">$176.05</td><td align="left">$176</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">13</td><td align="left">Every 3 Weeks</td><td align="left">$46,023,223</td><td align="left">45,192</td><td align="left">177</td><td align="left">16,951</td><td align="left">$182.33</td><td align="left">$182</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">13</td><td align="left">Every 2 Weeks</td><td align="left">$46,679,672</td><td align="left">44,674</td><td align="left">176</td><td align="left">16,951</td><td align="left">$184.56</td><td align="left">$185</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">15</td><td align="left">Every 2 Weeks</td><td align="left">$51,886,847</td><td align="left">42,193</td><td align="left">169</td><td align="left">16,951</td><td align="left">$203.15</td><td align="left">$203</td></tr><tr><td align="left">0.50%</td><td align="left">Every 2 Days</td><td align="left">11</td><td align="left">Weekly</td><td align="left">$42,662,884</td><td align="left">105,247</td><td align="left">392</td><td align="left">17,567</td><td align="left">$221.79</td><td align="left">$221</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">10</td><td align="left">Every 3 Days</td><td align="left">$45,443,719</td><td align="left">95,211</td><td align="left">364</td><td align="left">17,567</td><td align="left">$224.53</td><td align="left">$224</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">12</td><td align="left">Every 3 Days</td><td align="left">$47,479,028</td><td align="left">90,670</td><td align="left">348</td><td align="left">17,567</td><td align="left">$229.44</td><td align="left">$229</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">15</td><td align="left">Every 3 Days</td><td align="left">$50,326,600</td><td align="left">86,236</td><td align="left">334</td><td align="left">17,567</td><td align="left">$238.10</td><td align="left">$238</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">15</td><td align="left">Daily</td><td align="left">$53,949,202</td><td align="left">84,968</td><td align="left">333</td><td align="left">17,567</td><td align="left">$253.72</td><td align="left">$253</td></tr><tr><td align="left">1.00%</td><td align="left">Every 2 Days</td><td align="left">8</td><td align="left">Every 3 Days</td><td align="left">$40,294,719</td><td align="left">163,598</td><td align="left">618</td><td align="left">19,016</td><td align="left">$300.70</td><td align="left">$299</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">14</td><td align="left">Every 3 Weeks</td><td align="left">$43,428,779</td><td align="left">149,143</td><td align="left">550</td><td align="left">19,016</td><td align="left">$292.53</td><td align="left">$291</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">15</td><td align="left">Every 3 Weeks</td><td align="left">$45,401,054</td><td align="left">141,080</td><td align="left">533</td><td align="left">19,016</td><td align="left">$290.06</td><td align="left">$289</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">16</td><td align="left">Every 4 Weeks</td><td align="left">$47,299,600</td><td align="left">138,570</td><td align="left">531</td><td align="left">19,016</td><td align="left">$297.42</td><td align="left">$296</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">15</td><td align="left">Daily</td><td align="left">$49,422,135</td><td align="left">137,010</td><td align="left">530</td><td align="left">19,016</td><td align="left">$307.75</td><td align="left">$307</td></tr><tr><td align="left">2.00%</td><td align="left">Every 2 Days</td><td align="left">15</td><td align="left">Daily</td><td align="left">$40,873,986</td><td align="left">232,096</td><td align="left">890</td><td align="left">26,413</td><td align="left">$623.96</td><td align="left">$611</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">14</td><td align="left">Daily</td><td align="left">$42,092,565</td><td align="left">228,425</td><td align="left">878</td><td align="left">26,980</td><td align="left">$608.47</td><td align="left">$597</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">13</td><td align="left">Daily</td><td align="left">$43,560,548</td><td align="left">224,340</td><td align="left">863</td><td align="left">27,504</td><td align="left">$594.58</td><td align="left">$584</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">12</td><td align="left">Daily</td><td align="left">$45,304,128</td><td align="left">220,025</td><td align="left">847</td><td align="left">27,992</td><td align="left">$583.98</td><td align="left">$574</td></tr><tr><td align="left" /><td align="left">Every 2 Days</td><td align="left">11</td><td align="left">Daily</td><td align="left">$47,336,781</td><td align="left">215,552</td><td align="left">829</td><td align="left">28,446</td><td align="left">$576.92</td><td align="left">$568</td></tr><tr><td align="left">6.25%</td><td align="left">Daily</td><td align="left">8</td><td align="left">Every 4 Weeks</td><td align="left">$44,632,459</td><td align="left">297,642</td><td align="left">1,543</td><td align="left">52,005</td><td align="left" /><td align="left">$11,696</td></tr><tr><td align="left" /><td align="left">Daily</td><td align="left">9</td><td align="left">Every 4 Weeks</td><td align="left">$46,857,795</td><td align="left">299,003</td><td align="left">1,528</td><td align="left">52,005</td><td align="left"><italic>No cases</italic></td><td align="left">$19,081</td></tr><tr><td align="left" /><td align="left">Daily</td><td align="left">11</td><td align="left">Every 4 Weeks</td><td align="left">$49,365,290</td><td align="left">298,324</td><td align="left">1,501</td><td align="left">52,005</td><td align="left"><italic>averted</italic></td><td align="left">$15,749</td></tr><tr><td align="left" /><td align="left">Daily</td><td align="left">12</td><td align="left">Every 3 Weeks</td><td align="left">$49,904,653</td><td align="left">298,306</td><td align="left">1,506</td><td align="left">52,005</td><td align="left" /><td align="left">$15,828</td></tr><tr><td align="left" /><td align="left">Daily</td><td align="left">12</td><td align="left">Weekly</td><td align="left">$50,287,597</td><td align="left">298,306</td><td align="left">1,513</td><td align="left">52,005</td><td align="left" /><td align="left">$15,949</td></tr></tbody></table>

Graph: Fig. 6 Sensitivity of infections-vs-cost trade-off to R0 level for 10-county cluster in NC

Table 8 Selected solutions across different transmission rates – NC 10-county cluster

Graph: Fig. 7 Wake county cumulative infections with 0% exogenous infections (except where indicated)

The data in Figs. 3 and 4, as well as the data in Tables 5 and 6, suggest an interesting similarity in the optimal testing protocols. For Wake County, the common protocol-generated solutions centered around initial testing of Every 2 Days for 11 to 16 weeks, followed by weekly secondary screening for one to 3 weeks. In one case, the optimal protocol was an initial screening Every 2 Days for 15 weeks followed by daily testing for one week. The results from the independent protocols varied only slightly from these screening cadences, and they were quite similar for both the least populous (Caswell) and the most populous (Wake) counties in our example solution set. Of importance was the fact that these optimal protocols resulted in similar levels of cost per case averted and similar levels of cost per reduction in false negatives between the two counties.

Some discussion of the results in Tables 5 and 6 is warranted here. Regarding the reduction in false negatives with more testing, this is due to the fact that the additional testing serves to capture more true positives, thus removing more individuals from the Asymptomatic (A) stage in the model, as they would otherwise be a source of false negatives. (Specifically, false negatives are computed as <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">|</mo><mi>E</mi><mo stretchy="false">|</mo><mo>+</mo><mo stretchy="false">|</mo><mi>A</mi><mo stretchy="false">|</mo><mo>·</mo><mo stretchy="false">(</mo><mn>1</mn><mo>-</mo><msub><mi>S</mi><mi>e</mi></msub><mo stretchy="false">)</mo></mrow></math> , where |X| is the count of the number of individuals in set X and <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>S</mi><mi>e</mi></msub></math> is the sensitivity of the test.) Regarding the maximum number in isolation (i.e., <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo stretchy="false">|</mo><mi>F</mi><mi>P</mi><mo stretchy="false">|</mo><mo>+</mo><mo stretchy="false">|</mo><mi>T</mi><mi>P</mi><mo stretchy="false">|</mo><mo>+</mo><mo stretchy="false">|</mo><mi>P</mi><mo stretchy="false">|</mo></mrow></math> ), this will be the same value for testing protocols that use the same initial screening frequency for a sufficiently large number of weeks. This result stems from the fact that more frequent screening will tend to root out infections relatively early in the horizon. Computational details (not shown here) demonstrate that protocols with an initial screening frequency of every 2 days hit their peak number in isolation (e.g., 9,851 for Wake County, as in Table 4) before the end of week 3 of the testing horizon. By contrast, a much less aggressive, weekly screening (again, not shown here) reflects growth in the number in isolation through around 9 weeks into the horizon to a peak well past 9,851 (up to 17,282, to be exact) due to allowing asymptomatic individuals to circulate in the population.

Robustness analysis of Pareto-optimal protocols for public school screening in North Carolina

In this section, we graphically explore the robustness of Pareto optimal front for common-protocol solutions for two key parameters, as discussed in Section 5.2. Those key parameters are the "basic reproductive number" of the virus, <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>R</mi><mn>0</mn></msub></math> , and the number of infections introduced into the congregate setting in question from outside the system. Exploring differing levels of outside infections (Fig. 5), we note that if the infection rates outside these school districts remain at 1% or lower, testing protocols can be found that control the spread of the pandemic. However at infection rates approaching 6.5%, no amount of testing will control the infection spread. Table 7 provides detailed comparisons of Pareto-optimal protocols in the "balanced" area of the plot in Fig. 5, around the midpoint of the ranges of both cost and infections. Turning to differing levels of <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>R</mi><mn>0</mn></msub></math> , our results (Fig. 6) show that at <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>R</mi><mn>0</mn></msub></math> values of 1.5 to 2.3, testing protocols can be found that control the spread of infection. However, at <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>R</mi><mn>0</mn></msub><mo>=</mo><mn>5</mn></mrow></math> , no amount of testing will control the spread of the virus. Table 8 provides a comparison of the details for balanced protocols from Fig. 6.

Finally, a plot of cumulative infections over time for various protocols, applied to the Wake County school population with 0% exogenous infections (Fig. 7), except where noted in the figure, shows that an absence of testing yields almost 100% infectivity within 40 days. The MOSA-generated protocol that resulted in the lowest cost dulled the accumulation of infections somewhat, but still reached almost 100% by 55 days. Both the highest cost protocol and what we described earlier as a "balanced" protocol (i.e., at the inflection point of the infections-vs-cost plot of Pareto-optimal protocols), however, controlled the spread of infection. For either of those MOSA-generated protocols, cumulative numbers of infections decreased by almost 64% by 10 days and almost 84% by 20 days. Even if we allow approximately 1,600 new infections per week (1% exogenous rate), the balanced protocol still slows the number of cumulative infections by almost 72% within 20 days.

Discussion and conclusion

In this paper we used a quantitative methodology to generate optimal testing protocols for infectious diseases, using the COVID-19 pandemic as our context. The protocols were focused on populations that reside in congregate settings such as schools, businesses, hospitals, or nursing homes. For purposes of this paper, we focused on testing of student and staff populations within school districts that are found within counties within the State of North Carolina. The test format chosen was the rapid viral antigen test. These types of tests are inexpensive (approximately $5 in bulk), can be administered quickly ( <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo><</mo><mn>30</mn></mrow></math> minutes), and have reasonably high sensitivity ( <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo>></mo><mn>80</mn><mo>%</mo></mrow></math> ) and specificity ( <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo>></mo><mn>98</mn><mo>%</mo></mrow></math> ) levels.

The basis for our modeling is the SEIR epidemiological framework. Using this model, we compared various test frequencies (e.g. Daily, Every 2 Days, Every 3 Days, Weekly, Every 2 Weeks, Every 3 Weeks, Every 4 Weeks), varying them independently over a primary test period followed by a secondary test period. We developed and deployed a Multi-Objective Simulated Annealing (MOSA) algorithm to find Pareto-optimal sets of solutions that jointly minimize three objectives: 1) the number of infections over the testing horizon, 2) the number of false negative tests observed, and 3) the total cost of testing. Moreover, we employed our MOSA algorithm to solve for protocols that are independently optimal across all geographic regions being considered, and for protocols that are jointly optimal. We believe this comparison of independently solved versus jointly solved collections and our method of demonstrating their equivalence in our solution application are interesting and novel contributions. Moreover, we note that our approach can be applied generally to any public health decision making problem that poses a trade-off between investment and public health outcomes. Thus, we strike a balance in this paper between technical and practical application.

The potential impact of the use of such optimally-generated testing protocols is as follows:

A common testing protocol can be used to manage the spread of an infectious disease across both small populations (see Fig. 3 and Table 5) and large populations (see Fig. 4 and Table 6). The use of a single protocol should simplify decision making for health officials.

The optimal testing protocols consistently suggest an initial testing frequency of Every 2 Days, for 11 to 16 weeks, followed by secondary testing that ranged from Daily to Weekly for the remainder of the testing horizon (see Tables 5 and 6). Interestingly, the Cost per Case Averted is nearly identical across optimal protocols in the ranges of outcomes that balance overall infections with overall testing cost.

The number of false negative tests is quite low relative to the number of infections observed (see Tables 5 and 6). Thus, we can state that the relatively low sensitivity of approximately 80% for the antigen test (as compared to much more sensitive, lab-based tests) has only a small impact on the calculation of optimal protocols.

The number of exogenous infections brought into the school every week has a considerable impact on the the recommended testing protocol (see Fig. 5 and Table 7). If exogenous infections enter the system at the level of <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo>></mo><mn>6</mn><mo>%</mo></mrow></math> of the overall school testing population across the region of interest, then no level of testing can avoid infecting the entire school population.

A similar result is seen for high levels of infectiousness, <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>R</mi><mn>0</mn></msub></math> (see Table 8 and Fig. 6). At a level of virus infectiousness of <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>R</mi><mn>0</mn></msub><mo>=</mo><mn>5</mn></mrow></math> , even daily testing cannot control the spread of infections in the testing population. Clinical studies have shown, however, that good hygiene, mask wearing, and social distancing can decrease the effective <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>R</mi><mn>0</mn></msub></math> by 50% [[57]].

These optimal protocols have a profound effect on the progress of the COVID-19 pandemic (see Fig. 7). Within 10 weeks of testing, the cumulative number of infections decrease by 64% compared to untested individuals, and by 20 weeks this decrease is 84%, indicating that disease spread in this population is effectively under control. Even if we allow for the introduction of <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo>∼</mo><mn>1</mn><mo>,</mo><mn>600</mn></mrow></math> new infections per week to this population of 162,000 students and staff (a 1% exogenous infection rate), the testing protocol will suppress the cumulative level of infections by <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mo>></mo><mn>70</mn><mo>%</mo></mrow></math> over the testing horizon, as compared to no testing, allowing schools to remain open.

The cost of screening the public school system of North Carolina would be significant. For a 16-week semester, we estimate a testing cost for Wake County, with 192 schools and 160,000 students and staff, of approximately $20 million (see Table 6). Testing costs for the 10-county cluster in our computational example, surrounding Wake County, would be approximately $40 million (see Tables 8 and 7). It is interesting to note, however, that Wake County alone spent over $100 million in public funds to provide COVID testing free-of-charge to the public from June 1, 2020 to Nov 30, 2021 [[59]]. These were almost all PCR-based tests, which were not deployed strategically for screening in school systems or work settings. Thus, while the proposed test protocols can seem expensive, they are likely less expensive and more effective in terms of opening schools and work areas than the previously-used approaches. We also note that our MOSA algorithm could easily be adjusted to address possible constraints on the supply of antigen test kits by either adding a penalty to the objective function for solutions that violate the supply constraint, or by simply excluding constraint-violating solutions. For the sake of brevity, we did not include those results here.

Acquiring and distributing the proper number of test kits to serve specific congregate settings is a significant supply-chain logistics issue. Our experience in advising the State of North Carolina suggests that logistics management might best be directed at the state level. The tools developed in this paper should help decision-makers in determining the number of test kits required for any given geographical area. As discussed in Section 5, we employed machine-learning and TOPSIS modeling to generate a weighted average priority ranking for all 100 counties in NC, and then created prioritized clusters of counties based on the point-in-time state of the pandemic (Fig. 2). We note that the cluster that served as our computational example — which includes Wake County and contains the state's capital and second-largest city, Raleigh — was the highest priority cluster in the state in the period around July 2021, and the second highest priority cluster in the period around February 2022. For this high-priority cluster, we estimate that an average of approximately 500,000 test kits per week would be required for this 10-county cluster over a 16-week school semester. Distribution of these kits to the approximately 400 school districts in these 10 counties is a significant supply-chain challenge, but the tools developed in this paper would serve as significant aids to decision makers in state and local government.

Finally, we note that the availability of COVID-19 vaccines has mitigated, but not eliminated, the need for large scale testing in the U.S. The problems of vaccine hesitancy, the advent of novel viral strains, and the natural waning of immunological competence post vaccine, all suggest an ongoing need for viral screening. Importantly, the issue of long COVID is a huge potential health care burden. Cutler and Summers [[60]] and Cutler [[61]] estimate that the ongoing health care costs could total \$528 billion and the overall economic costs could amount to \$3.7 trillion. Obviously, it is critical to keep the infection rate as low as possible, and tools like the ones developed in this paper should be at the forefront of that effort.

In conclusion, this study provides a holistic and data-based approach for conducting testing of congregate populations, such as schools, hospitals, and public service agencies. Our work is an example of the kind of parametric analysis that policymakers can deploy in future pandemic situations, when there is a limited number of tests available and no pre-determined set of policies on how to allocate them to different regions within a state. This work fills an impotant gap in research on pandemic planning policy, employing well-known optimization methods to recommend allocation of potentially scarce resources based on logical input parameters. In this way, we believe this research provides useful input for current and future practice in healthcare policy-making.

Acknowledgements

This work was supported in part by a grant from the National Institute for Innovation in Manufacturing Biopharmaceuticals (NIIMBL) and the National Institute of Standards and Technology (NIST) under grant number NIIMBL COVID-19-1.03.

Declarations

Conflicts of interest

The authors declare that they have no known competing personal relationships or other conflicts of interest that could have appeared to influence the work reported in this paper

Appendix A. SEIR computations

In this appendix, we specify the computations of the flow quantities between states in the SEIR system (see Fig. 1), following the SEIR model of [[7]]. Extending from their work, we introduce a new parameter, k, which is the number of cycles required to obtain test results. Use of lab-based testing, for example, may require <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>k</mi><mo>≥</mo><mn>1</mn></mrow></math>

Graph

, meaning that the results of the test will not be known until k cycles in the future. For the case we consider in Section 5, we assume use of rapid-results antigen tests for confirmation, and therefore, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>k</mi><mo>=</mo><mn>0</mn></mrow></math>

Graph

<h31 id="AN0181643333-25"> <uline>Uninfected</uline> </h31>

A.1 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd /><mtd columnalign="left"><mrow><msub><mi>U</mi><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>=</mo><munder><munder accentunder="true"><mrow><msub><mi>U</mi><mi>c</mi></msub><mo>·</mo><mfenced close="]" open="["><mn>1</mn><mo>-</mo><mi>β</mi><mo>·</mo><mfrac><msub><mi>A</mi><mi>c</mi></msub><mrow><msub><mi>U</mi><mi>c</mi></msub><mo>+</mo><msub><mi>A</mi><mi>c</mi></msub><mo>+</mo><msub><mi>E</mi><mi>c</mi></msub></mrow></mfrac></mfenced></mrow><mo>⏟</mo></munder><mrow><mtext>Uninfected minus internal infections</mtext></mrow></munder><mo>-</mo><munder><munder accentunder="true"><mrow><msub><mi>U</mi><mrow><mi>c</mi><mo>-</mo><mi>k</mi></mrow></msub><mo>·</mo><mi>τ</mi><mo>·</mo><mfenced close=")" open="("><mn>1</mn><mo>-</mo><msub><mi>S</mi><mi>p</mi></msub></mfenced></mrow><mo>⏟</mo></munder><mrow><mtext>False positive results</mtext></mrow></munder></mrow></mtd></mtr><mtr><mtd columnalign="right"><mrow /></mtd><mtd columnalign="left"><mrow><mspace width="1em" /><mo>+</mo><munder><munder accentunder="true"><mrow><mi>μ</mi><mo>·</mo><msub><mrow><mi mathvariant="italic">FP</mi></mrow><mi>c</mi></msub></mrow><mo>⏟</mo></munder><mrow><mtext>Returning false positives</mtext></mrow></munder><mspace width="1em" /><mo>-</mo><munder><munder accentunder="true"><mrow><mi>X</mi><mo>·</mo><msub><mi>I</mi><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>,</mo></mrow><mo>⏟</mo></munder><mrow><mtext>Exogenous shock infections</mtext></mrow></munder></mrow></mtd></mtr></mtable></mrow></math>

Graph

where the additional parameters are as follows:

Graph

is the rate at which infectious individuals both contact and infect susceptible individuals, computed as <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>β</mi><mo>=</mo><msub><mi>R</mi><mn>0</mn></msub><mo>·</mo><mrow><mo stretchy="false">(</mo><mi>σ</mi><mo>+</mo><mi>ρ</mi><mo stretchy="false">)</mo></mrow></mrow></math>

Graph

, where <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>R</mi><mn>0</mn></msub></math>

Graph

is the viral reproductive number, <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>ρ</mi></math>

Graph

is the rate at which infected individuals recover from disease, and <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>σ</mi></math>

Graph

is the rate of symptom onset for infected individuals.

Graph

is the rate at which an individuals in the testing pool are screened for infection, which is driven by the testing frequency (e.g., daily, every 2 days, etc.) and computed as the reciprocal of this frequency (i.e., at an "every 3 days" frequency, 1/3 of the individuals in the testing pool are assumed to be tested each cycle).

Graph

is the specificity of the test.

Graph

is rate at which false positives are identified and returned to the uninfected pool, which is computed by <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>μ</mi><mo>=</mo><mn>1</mn><mo stretchy="false">/</mo><mo stretchy="false">(</mo><msub><mi>n</mi><mrow><mi mathvariant="italic">cyc</mi></mrow></msub><mo>·</mo><msub><mi>t</mi><mrow><mi mathvariant="italic">FPr</mi></mrow></msub><mo stretchy="false">)</mo></mrow></math>

Graph

, where <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>n</mi><mrow><mi mathvariant="italic">cyc</mi></mrow></msub></math>

Graph

is the number of testing cycles per day and <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>t</mi><mrow><mi mathvariant="italic">FPr</mi></mrow></msub></math>

Graph

is the number of days required for false positives to return to the testing pool, where <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>t</mi><mrow><mi mathvariant="italic">FPret</mi></mrow></msub><mo>≥</mo><mn>1</mn></mrow></math>

Graph

Graph

is an indicator function taking the value 1 if there is a shock of new infections in cycle <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></math>

Graph

and 0 otherwise.

Thus, the "uninfected update" portion of the computation is given by period-starting uninfected ( <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>U</mi><mi>t</mi></msub></math>

Graph

) minus new infections ( <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>β</mi><mo>·</mo><msub><mi>U</mi><mi>t</mi></msub><mo>·</mo><msub><mi>A</mi><mi>t</mi></msub><mo stretchy="false">/</mo><mfenced close=")" open="("><msub><mi>U</mi><mi>t</mi></msub><mo>+</mo><msub><mi>A</mi><mi>t</mi></msub><mo>+</mo><msub><mi>E</mi><mi>t</mi></msub></mfenced></mrow></math>

Graph

). From this, false positives (from cycle <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>c</mi><mo>-</mo><mi>k</mi></mrow></math>

Graph

) are subtracted, confirmed and returning false positives (from the current cycle, c) are added, and infections contracted by outside "shocks" are subtracted. We note that [[7]] assume that the new infections portion of the "Uninfected update" stems from applying the transmission rate <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>β</mi></math>

Graph

to the uninfected group <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>U</mi><mi>t</mi></msub></math>

Graph

, but only to those who have incurred probabilistic contact with the asymptomatic and infected group, <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>A</mi><mi>t</mi></msub></math>

Graph

. Thus, it also assumes that the probability that such contact occurs is given by the ratio of <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>A</mi><mi>t</mi></msub></math>

Graph

to the "active testing pool," <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><msub><mi>U</mi><mi>t</mi></msub><mo>+</mo><msub><mi>A</mi><mi>t</mi></msub><mo>+</mo><msub><mi>E</mi><mi>t</mi></msub></mrow></math>

Graph

<h31 id="AN0181643333-26"> <uline>Exposed</uline> </h31>

A.2 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><msub><mi>E</mi><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>=</mo><munder><munder accentunder="true"><mrow><msub><mi>E</mi><mi>c</mi></msub><mo>·</mo><mfenced close=")" open="("><mn>1</mn><mo>-</mo><mi>θ</mi></mfenced></mrow><mo>⏟</mo></munder><mrow><mtext>Exposed update</mtext></mrow></munder><mo>+</mo><munder><munder accentunder="true"><mrow><mi>β</mi><mo>·</mo><mfrac><mrow><msub><mi>A</mi><mi>c</mi></msub><mo>·</mo><msub><mi>U</mi><mi>c</mi></msub></mrow><mrow><msub><mi>U</mi><mi>c</mi></msub><mo>+</mo><msub><mi>A</mi><mi>c</mi></msub><mo>+</mo><msub><mi>E</mi><mi>c</mi></msub></mrow></mfrac></mrow><mo>⏟</mo></munder><mrow><mtext>New infections</mtext></mrow></munder><mo>+</mo><munder><munder accentunder="true"><mrow><mi>X</mi><mo>·</mo><msub><mi>I</mi><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>,</mo></mrow><mo>⏟</mo></munder><mrow><mtext>Exogenous shocks</mtext></mrow></munder></mrow></mtd></mtr></mtable></mrow></math>

Graph

where the additional parameter is:

Graph

, which is the rate at which exposed individuals advance to being infectious, but still asymptomatic.

Thus, the "Exposed update" above reflects the flow of <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>θ</mi><mo>·</mo><msub><mi>E</mi><mi>c</mi></msub></mrow></math>

Graph

individuals from state E to state A in Fig. 1. The inflows to state E come from infections flowing out of state U, both from internal contact among those individuals and from outside shocks.

<h31 id="AN0181643333-27"> <uline>Asymptomatic</uline> </h31>

A.3 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><msub><mi>A</mi><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>=</mo><munder><munder accentunder="true"><mrow><msub><mi>A</mi><mi>c</mi></msub><mo>·</mo><mfenced close=")" open="("><mn>1</mn><mo>-</mo><mi>σ</mi><mo>-</mo><mi>ρ</mi></mfenced></mrow><mo>⏟</mo></munder><mrow><mtext>Asymptomatic update</mtext></mrow></munder><mo>-</mo><munder><munder accentunder="true"><mrow><msub><mi>A</mi><mrow><mi>c</mi><mo>-</mo><mi>k</mi></mrow></msub><mo>·</mo><mi>τ</mi><mo>·</mo><msub><mi>S</mi><mi>e</mi></msub></mrow><mo>⏟</mo></munder><mrow><mtext>True positive results</mtext></mrow></munder><mo>+</mo><munder><munder accentunder="true"><mrow><mi>θ</mi><mo>·</mo><msub><mi>E</mi><mi>c</mi></msub><mo>,</mo></mrow><mo>⏟</mo></munder><mrow><mtext>Infection transitions</mtext></mrow></munder></mrow></mtd></mtr></mtable></mrow></math>

Graph

where the additional parameter is:

Graph

, which is the sensitivity of the test.

Therefore, the inflows to state A come only from infections transitioning from the undetectable state E to detectable state A. The outflows are given by (i) recoveries, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>ρ</mi><mo>·</mo><msub><mi>A</mi><mi>c</mi></msub></mrow></math>

Graph

, from asymptomatic infections; (ii) transitions, <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>σ</mi><mo>·</mo><msub><mi>A</mi><mi>c</mi></msub></mrow></math>

Graph

, from asymptomatic to symptomatic state P; and (iii) confirmed true positives results from tests in cycle <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>c</mi><mo>-</mo><mi>k</mi></mrow></math>

Graph

, based on the test sensitivity rate, <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>S</mi><mi>e</mi></msub></math>

Graph

<h31 id="AN0181643333-28"> <uline>False Positives and True Positives</uline> </h31>

A.4 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><msub><mrow><mi mathvariant="italic">FP</mi></mrow><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>=</mo><munder><munder accentunder="true"><mrow><msub><mrow><mi mathvariant="italic">FP</mi></mrow><mi>c</mi></msub><mo>·</mo><mfenced close=")" open="("><mn>1</mn><mo>-</mo><mi>μ</mi></mfenced></mrow><mo>⏟</mo></munder><mrow><mtext>False positive update</mtext></mrow></munder><mo>+</mo><munder><munder accentunder="true"><mrow><msub><mi>U</mi><mrow><mi>c</mi><mo>-</mo><mi>k</mi></mrow></msub><mo>·</mo><mi>τ</mi><mo>·</mo><mfenced close=")" open="("><mn>1</mn><mo>-</mo><msub><mi>S</mi><mi>p</mi></msub></mfenced></mrow><mo>⏟</mo></munder><mrow><mtext>False positive results</mtext></mrow></munder></mrow></mtd></mtr></mtable></mrow></math>

Graph

and

A.5 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><msub><mrow><mi mathvariant="italic">TP</mi></mrow><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>=</mo><munder><munder accentunder="true"><mrow><msub><mrow><mi mathvariant="italic">TP</mi></mrow><mi>c</mi></msub><mo>·</mo><mfenced close=")" open="("><mn>1</mn><mo>-</mo><mi>σ</mi><mo>-</mo><mi>ρ</mi></mfenced></mrow><mo>⏟</mo></munder><mrow><mtext>True positive update</mtext></mrow></munder><mo>+</mo><munder><munder accentunder="true"><mrow><msub><mi>A</mi><mrow><mi>c</mi><mo>-</mo><mi>k</mi></mrow></msub><mo>·</mo><mi>τ</mi><mo>·</mo><msub><mi>S</mi><mi>e</mi></msub><mo>,</mo></mrow><mo>⏟</mo></munder><mrow><mtext>True positive results</mtext></mrow></munder></mrow></mtd></mtr></mtable></mrow></math>

Graph

both of which are based on parameters and expressions that have been defined above.

<h31 id="AN0181643333-29"> <uline>Symptomatic, Recovered, and Deceased</uline> </h31>

The cycle transitions in for the symptomatic set, P, and the recovered set, V, are given by

A.6 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><msub><mi>P</mi><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>=</mo><munder><munder accentunder="true"><mrow><msub><mi>P</mi><mi>c</mi></msub><mo>·</mo><mfenced close=")" open="("><mn>1</mn><mo>-</mo><mi>ρ</mi><mo>-</mo><mi>δ</mi></mfenced></mrow><mo>⏟</mo></munder><mrow><mtext>Symptomatic update</mtext></mrow></munder><mo>+</mo><munder><munder accentunder="true"><mrow><mi>σ</mi><mo>·</mo><mfenced close=")" open="("><msub><mrow><mi mathvariant="italic">TP</mi></mrow><mi>c</mi></msub><mo>+</mo><msub><mi>A</mi><mi>c</mi></msub></mfenced></mrow><mo>⏟</mo></munder><mrow><mtext>Transitions to symptomatic state</mtext></mrow></munder></mrow></mtd></mtr></mtable></mrow></math>

Graph

and

A.7 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><msub><mi>V</mi><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>=</mo><msub><mi>V</mi><mi>c</mi></msub><mo>+</mo><munder><munder accentunder="true"><mrow><mi>ρ</mi><mo>·</mo><mfenced close=")" open="("><msub><mrow><mi mathvariant="italic">TP</mi></mrow><mi>c</mi></msub><mo>+</mo><msub><mi>A</mi><mi>c</mi></msub><mo>+</mo><msub><mi>P</mi><mi>c</mi></msub></mfenced><mo>.</mo></mrow><mo>⏟</mo></munder><mrow><mtext>Transitions to recovered state</mtext></mrow></munder></mrow></mtd></mtr></mtable></mrow></math>

Graph

The cycle transitions to the deceased state, D, are given by

A.8 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><msub><mi>D</mi><mrow><mi>c</mi><mo>+</mo><mn>1</mn></mrow></msub><mo>=</mo><msub><mi>D</mi><mi>c</mi></msub><mo>+</mo><munder><munder accentunder="true"><mrow><mi>δ</mi><mo>·</mo><msub><mi>P</mi><mi>c</mi></msub></mrow><mo>⏟</mo></munder><mtext>Deaths</mtext></munder><mo>,</mo></mrow></mtd></mtr></mtable></mrow></math>

Graph

where the additional parameter is:

Graph

, which is the rate at which infected, symptomatic individuals die from the disease.

Appendix B. Non-parametric test for independent- versus common-protocol solution consistency

In this appendix, we offer a statistical analysis to assess whether the independent- and common-protocol solutions generate outcomes that are consistent in terms of the multi-objective trade-offs. We base our analysis on two statistics that we denote as the cost per case averted (CCA) and the cost per reduction in false negatives (CRFN), which are computed using the following inputs and outcomes from the set <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mrow><mi mathvariant="italic">PO</mi></mrow><mi>i</mi></msub></math>

Graph

of solutions <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>j</mi><mo>∈</mo><msub><mrow><mi mathvariant="italic">PO</mi></mrow><mi>i</mi></msub></mrow></math>

Graph

for county i:

The input value <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mrow><mi mathvariant="italic">IS</mi></mrow><mi>i</mi></msub></math>

Graph

, which is the number of individuals in the testing population in county i that are initially susceptible to infection (i.e., the number in box U of Fig. 1).

The output of the three objectives defined in Section 3.3, which now include an additional subscript i, denoting the county — i.e., <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mrow><mi mathvariant="italic">IN</mi></mrow><mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow></msub></math>

Graph

, <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mrow><mi mathvariant="italic">FN</mi></mrow><mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow></msub></math>

Graph

, and <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mrow><mi mathvariant="italic">TC</mi></mrow><mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow></msub></math>

Graph

, which are the objective values of solution j, reflecting the <math xmlns="http://www.w3.org/1998/Math/MathML"><mfenced close=")" open="("><msub><mi>F</mi><mn>1</mn></msub><mo>,</mo><msub><mi>F</mi><mn>2</mn></msub><mo>,</mo><msub><mi>T</mi><mn>1</mn></msub></mfenced></math>

Graph

protocol values specific to the county i solution.

Then, the CCA and CRFN values for each solution j in county i are computed as follows:

B.1 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><mi>C</mi><mi>C</mi><msub><mi>A</mi><mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow></msub><mo>=</mo><mfrac><mrow><mi>T</mi><msub><mi>C</mi><mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow></msub></mrow><mrow><msub><mrow><mi mathvariant="italic">IS</mi></mrow><mi>i</mi></msub><mo>-</mo><msub><mrow><mi mathvariant="italic">IN</mi></mrow><mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow></msub></mrow></mfrac></mrow></mtd></mtr></mtable></mrow></math>

Graph

and

B.2 <math display="block" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mtable><mtr><mtd columnalign="right"><mrow><mi>C</mi><mi>R</mi><mi>F</mi><msub><mi>N</mi><mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow></msub><mo>=</mo><mfrac><mrow><mi>T</mi><msub><mi>C</mi><mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow></msub></mrow><mrow><munder><mo movablelimits="false">max</mo><mrow><mi>k</mi><mo>∈</mo><msub><mrow><mi mathvariant="italic">PO</mi></mrow><mi>i</mi></msub><mo>,</mo><mi>k</mi><mo>≠</mo><mi>j</mi></mrow></munder><mrow><mo stretchy="false">{</mo><msub><mrow><mi mathvariant="italic">FN</mi></mrow><mrow><mi>i</mi><mo>,</mo><mi>k</mi></mrow></msub><mo stretchy="false">}</mo></mrow><mo>-</mo><msub><mrow><mi mathvariant="italic">FN</mi></mrow><mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow></msub></mrow></mfrac><mo>.</mo></mrow></mtd></mtr></mtable></mrow></math>

Graph

Our graphical analysis of these statistics for various solutions that we generated indicated that they did not appear to follow any particular statistical distribution, so we employed non-parametric testing on the CCA and CRFN statistics for each solution method — independent-protocol versus common — for each county in our 10-county solution set, and specifically based the analysis on Mann Whitney U Test (also known as Wilcoxon Rank Sum Test). Results of the non-parametric tests for statistically significant differences in the solutions for each county are shown in Table 9. Since the p-values for each test for every county in the 10-county set all exceed any reasonable confidence threshold, we fail to reject the hypothesis that the cost measures are statistically equal for each county.

Table 9 p-values for non-parametric test of independent versus common solutions for NC 10-county cluster

<table frame="hsides" rules="groups"><thead><tr><th align="left">County</th><th align="left">Cost per case averted</th><th align="left">Cost per reduction in false negatives</th></tr></thead><tbody><tr><td align="left">Alamance county</td><td align="left">0.549</td><td align="left">0.691</td></tr><tr><td align="left">Caswell county</td><td align="left">0.956</td><td align="left">0.605</td></tr><tr><td align="left">Chatham county</td><td align="left">0.639</td><td align="left">0.651</td></tr><tr><td align="left">Durham county</td><td align="left">0.307</td><td align="left">0.740</td></tr><tr><td align="left">Granville county</td><td align="left">0.676</td><td align="left">0.573</td></tr><tr><td align="left">Harnett county</td><td align="left">0.652</td><td align="left">0.803</td></tr><tr><td align="left">Lee county</td><td align="left">0.842</td><td align="left">0.803</td></tr><tr><td align="left">Orange county</td><td align="left">0.475</td><td align="left">0.963</td></tr><tr><td align="left">Person county</td><td align="left">0.244</td><td align="left">0.631</td></tr><tr><td align="left">Wake county</td><td align="left">0.933</td><td align="left">0.926</td></tr></tbody></table>

Table 10 Generalized spread measure for common and independent solutions for NC 10-county cluster

<table frame="hsides" rules="groups"><thead><tr><th align="left" /><th align="left">Common</th><th align="left">Independent</th></tr></thead><tbody><tr><td align="center">Alamance county</td><td align="left">0.992958</td><td align="left">0.991463</td></tr><tr><td align="center">Caswell county</td><td align="left">0.992957</td><td align="left">0.991461</td></tr><tr><td align="center">Chatham county</td><td align="left">0.992952</td><td align="left">0.991706</td></tr><tr><td align="center">Durham county</td><td align="left">0.992956</td><td align="left">0.990785</td></tr><tr><td align="center">Granville county</td><td align="left">0.992960</td><td align="left">0.990890</td></tr><tr><td align="center">Harnett county</td><td align="left">0.992954</td><td align="left">0.991274</td></tr><tr><td align="center">Lee county</td><td align="left">0.992956</td><td align="left">0.991688</td></tr><tr><td align="center">Orange county</td><td align="left">0.992956</td><td align="left">0.992925</td></tr><tr><td align="center">Person county</td><td align="left">0.992962</td><td align="left">0.991041</td></tr><tr><td align="center">Wake county</td><td align="left">0.992957</td><td align="left">0.991528</td></tr></tbody></table>

One additional comparison we can make between the sets of common and independent solutions is to consider the level of diversity in their collections of Pareto-dominant solutions. The measure we use is from [[62]], specifically the measure they call generalized spread, which accounts for both the coverage of the extreme regions of the objectives and the distribution of solutions across the objective regions generally. Measures closer to 1.0 reflect higher, and therefore more desirable, levels of solution spread. Graphically, a high (close to 1.0) value of generalized spread reflects solutions that are spread throughout the range of objective values forming the Pareto frontier. This is reflected by Figs. 3 and 4 in that there are independent and common solutions spread fairly evenly across a wide range of combinations of pairs of cost and infection levels. Generalized spread allows us to assess this property without explicitly plotting a trade-off plot for every county. Computed values of generalized spread for our common and independent solutions across all 10 counties in our example are shown in Table 10, from which we can see that the generalized spread measures for all the counties our example cluster are quite similar between common and independent solutions.

Thus, we conclude from Table 9 that the common-protocol and independent-protocol solutions have the same statistical centrality measured by median statistic in the Mann Whitney U Test. In addition, they spread in the same fashion in the objective functions space based on the generalized spread measures in Table 10. Therefore, we reliably conclude that common- and independent-protocol solutions generate consistent outcomes.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1 Mina MJ, Parker R, Larremore DB (2020) Rethinking COVID-19 test sensitivity — a strategy for containment. N Engl J Med 383. https://doi.org/10.1056/NEJMp2025631

2 Mina MJ, Andersen KG (2021) COVID-19 testing: one size does not fit all. Science 371:126–127. https://www.science.org/doi/abs/10.1126/science.abe9187

3 Kouri R, Warsing D, Singh N, Thomas B, Handfield RB (2022) An analytical tool for constructing and evaluating testing strategic for COVID-19. J Infect Dis Ther 10. https://www.omicsonline.org/open-access/an-analytic-tool-for-constructing-and-evaluating-testing-strategies-for-covid19.pdf

4 PBS (2022) British government rushing COVID tests to schools so classes can reopen. https://www.pbs.org/newshour/world/british-government-rushing-covid-tests-to-schools-so-classes-can-reopen. [Posted 03 Jan 2022]

5 Larremore DB et al (2021) Test sensitivity is secondary to frequency and turnaround time for COVID-19 screening. Sci Adv 7. https://www.science.org/doi/abs/10.1126/sciadv.abd5393

6 Gottlieb S. Uncontrolled spread: Why COVID-19 crushed us and how we can defeat the next pandemic. 2021: New York; Harper

7 Paltiel AD, Zheng A, Walensky RP (2020) Assessment of SARS-CoV-2 screening strategies to permit the safe reopening of college campuses in the United States. JAMA Netw Open 3

8 Caetano MA, de Souza JF, Yoneyama T (2008) Optimal medication in HIV seropositive patient treatment using fuzzy cost function, pp 2227–2232. IEEE

9 Persi PI, Gayathri P, Jaisankar N. A fuzzy optimization technique for the prediction of coronary heart disease using decision tree. Int J Eng Technol. 2013; 5: 2506-2514

Grosan C, Abraham A, Tigan S. Multicriteria programming in medical diagnosis and treatments. Appl Soft Comput. 2008; 8: 1407-1417. 10.1016/j.asoc.2007.10.014

Denton BT, Kurt M, Shah ND, Bryant SC, Smith SA. Optimizing the start time of statin therapy for patients with diabetes. Med Dec Making. 2009; 29: 351-367. 10.1177/0272989X08329462

Jingwei Z, Zujun M (2010) Fuzzy multi-objective location-routing-inventory problem in recycling infectious medical waste, pp 4069–4073. IEEE

Piguillem F, Shi L. Optimal Covid-19 quarantine and testing policies. Econ J. 2022; 132: 2534-2562. 10.1093/ej/ueac026

Hoertel N et al (2020) Facing the COVID-19 epidemic in NYC: a stochastic agent-based model of various intervention strategies. MedRxiv

Charpentier A, Elie R, Laurière M, Tran VC. COVID-19 pandemic control: balancing detection policy and lockdown intervention under ICU sustainability. Math Model Nat Phenom. 2020; 15: 57. 10.1051/mmnp/2020045

Carcione JM, Santos JE, Bagaini C, Ba J (2020) A simulation of a COVID-19 epidemic based on a deterministic SEIR model. Front Publ Health 230

Kuniya T. Prediction of the epidemic peak of coronavirus disease in Japan, 2020. J Clin Med. 2020; 9: 789. 10.3390/jcm9030789

Dandekar R, Barbastathis G (2020) Neural network aided quarantine control model estimation of global COVID-19 spread. arXiv:2004.02752

Rezapour S, Mohammadi H, Samei ME. SEIR epidemic model for covid-19 transmission by caputo derivative of fractional order. Adv Differ Equ. 2020; 2020: 1-19. 10.1186/s13662-020-02952-y

Anderez DO. A COVID-19-based modified epidemiological model and technological approaches to help vulnerable individuals emerge from the lockdown in the UK. Sensors. 2020; 20: 4967. 10.3390/s20174967

Huang Y, Yang L, Dai H, Tian F, Chen K (2020) Epidemic situation and forecasting of COVID-19 in and outside China. https://www.researchgate.net/publication/339988990%5fEpidemic%5fsituation%5fand%5fforecasting%5fof%5fCOVID-19%5fin%5fand%5foutside%5fChina

Owusu-Mensah I, Akinyemi L, Oduro B, Iyiola OS. A fractional order approach to modeling and simulations of the novel COVID-19. Adv Differ Equ. 2020; 2020: 1-21. 10.1186/s13662-020-03141-7

Kheirallah KA. The effect of strict state measures on the epidemiologic curve of COVID-19 infection in the context of a developing country: a simulation from Jordan. Int J Environ Res Publ Health. 2020; 17: 6530. 10.3390/ijerph17186530

Kochańczyk M, Grabowski F, Lipniacki T. Super-spreading events initiated the exponential growth phase of COVID-19 with R0 higher than initially estimated. R Soc Open Sci. 2020; 7: 200786. 10.1098/rsos.200786

Sun D, Duan L, Xiong J, Wang D. Modeling and forecasting the spread tendency of the COVID-19 in china. Adv Differ Equ. 2020; 2020: 1-16. 10.1186/s13662-020-02940-2

Tang B. Estimation of the transmission risk of the 2019-ncov and its implication for public health interventions. J Clin Med. 2020; 9: 462. 10.3390/jcm9020462

Wang H. Phase-adjusted estimation of the number of coronavirus disease 2019 cases in Wuhan, China. Cell Discov. 2020; 6: 1-8. 10.1038/s41421-020-0148-0

Kuznetsov YA, Piccardi C. Bifurcation analysis of periodic SEIR and SIR epidemic models. J Math Biol. 1994; 32: 109-121. 10.1007/BF00163027

He S, Banerjee S. Epidemic outbreaks and its control using a fractional order model with seasonality and stochastic infection. Phys A Stat Mech Appl. 2018; 501: 408-417. 10.1016/j.physa.2018.02.045

Grassly NC. Comparison of molecular testing strategies for COVID-19 control: a mathematical modelling study. Lancet Infect Dis. 2020; 20: 1381-1389. 10.1016/S1473-3099(20)30630-7

Abdin AF. An optimization model for planning testing and control strategies to limit the spread of a pandemic-the case of COVID-19. Eur J Oper Res. 2023; 304: 308-324. 10.1016/j.ejor.2021.10.062

Farahani RZ, Ruiz R, Van Wassenhove LN. Introduction to the special issue on the role of operational research in future epidemics/pandemics. Eur J Oper Res. 2023; 304: 1-8. 10.1016/j.ejor.2022.07.019

Zhang Y, Mayorga ME, Ivy J, Lich KH, Swann JL (2022) Modeling the impact of nonpharmaceutical interventions on COVID-19 transmission in K-12 schools. MDM Policy Pract 7

Centers for Diseae Control and Prevention (2024) Preventing Spread of Respiratory Viruses When You're Sick. https://www.cdc.gov/respiratory-viruses/prevention/precautions-when-sick.html

Kirkpatrick S, Gelatt CD Jr, Vecchi MP. Optimization by simulated annealing. Science. 1983; 220: 671-680. 10.1126/science.220.4598.671

Johnson DS, Aragon CR, McGeoch LA, Schevon C. Optimization by simulated annealing: an experimental evaluation; Part I, Graph partitioning. Oper Res. 1989; 37: 865-892. 10.1287/opre.37.6.865

Johnson DS, Aragon CR, McGeoch LA, Schevon C. Optimization by simulated annealing: an experimental evaluation; Part II, Graph coloring and number partitioning. Oper Res. 1991; 39: 378-406. 10.1287/opre.39.3.378

Hajek B. Cooling schedules for optimal annealing. Math Oper Res. 1988; 13: 311-329. 10.1287/moor.13.2.311

Serafini P (1985) Mathematics of multi objective optimization, vol 289. Springer

Serafini P (1994) in Simulated annealing for multi objective optimization problems, pp 283–292. Springer

Engrand P (1998) A multi-objective optimization approach based on simulated annealing and its application to nuclear fuel management. https://www.osti.gov/etdeweb/biblio/316961

Czyzżak P, Jaszkiewicz A. Pareto simulated annealing—a metaheuristic technique for multiple-objective combinatorial optimization. J Multi-Criteria Decis Anal. 1998; 7: 34-47. 10.1002/(SICI)1099-1360(199801)7:1<34:AID-MCDA161>3.0.CO;2-6

Hapke M, Jaszkiewicz A, Słowiński R. Pareto simulated annealing for fuzzy multi-objective combinatorial optimization. J Heuristics. 2000; 6: 329-345. 10.1023/A:1009678314795

Burke E, Silva JL (2002) Improving the performance of trajectory-based multiobjective optimisers by using relaxed dominance, vol 1, 203–207. Nanyang Technical University Orchid Country Club, Singapore

Nam D, Park CH (2002) Pareto-based cost simulated annealing for multiobjective optimization, vol 2, pp 522–526. Citeseer

Sánchez L, Villar JR. Obtaining transparent models of chaotic systems with multi-objective simulated annealing algorithms. Inf Sci. 2008; 178: 952-970. 10.1016/j.ins.2007.09.029

Bandyopadhyay S, Saha S, Maulik U, Deb K. A simulated annealing-based multiobjective optimization algorithm: Amosa. IEEE Trans Evol Comput. 2008; 12: 269-283. 10.1109/TEVC.2007.900837

Suman B, Kumar P. A survey of simulated annealing as a tool for single and multiobjective optimization. J Oper Res Soc. 2006; 57: 1143-1160. 10.1057/palgrave.jors.2602068

Jain AK, Dubes RC (1988) Algorithms for clustering data. Prentice-Hall, Inc

Global Epidemics Data (2021) Brown University School of Public Health. https://globalepidemics.org/testing-targets

NEA (2020) 2020 NEA Policy Playbook for Congress and the Biden-Harris Administration. https://www.nea.org/resource-library/2020-nea-policy-playbook-congress-and-biden-harris-administration

Taghiyeh S et al (2023) A multi-method approach to determining relative pandemic disease intensity across geographic regions. NC State University working paper

Hwang C-L, Yoon K (1981) in Methods for multiple attribute decision making, pp 58–191. Springer

Bradley PS, Bennett KP, Demiriz A (2000) Constrained k-means clustering. Microsoft Research, Redmond 20

U.S. Census Bureau (2022) County population totals: 2020-2021. https://www.census.gov/data/tables/time-series/demo/popest/2020s-counties-total.html. Accessed 02 Jun 2022

Gunderson A, Woskie L (2020) Understanding Predictions: What is R-Naught? https://globalhealth.harvard.edu/understanding-predictions-what-is-r-naught/

Li T, Liu Y, Li M, Qian X, Dai SY (2020) Mask or no mask for COVID-19: a public health and market study. PLoS One 15:1–17. https://doi.org/10.1371/journal.pone.0237691

Saxena SK et al (2022) Characterization of the novel SARS-CoV-2 Omicron (b.1.1.529) variant of concern and its global perspective. J Med Virol 94:1738–1744

Fisher J (2022) Over \$100 million: adding up the cost of 'free' COVID testing in Wake County. https://www.wral.com/coronavirus/-100-million-adding-up-the-cost-of-free-covid-testing-in-wake-county/20102553/. [Posted 27 Jan 2022]

Cutler DM, Summers LH. The COVID-19 pandemic and the $16 trillion virus. JAMA. 2020; 324: 1495-1496. 10.1001/jama.2020.19759

Cutler DM (2022) The economic cost of long COVID: an update. https://scholar.harvard.edu/files/cutler/files/long_covid_update_7-22.pdf

Jiang S, Ong Y-S, Zhang J, Feng L (2014) Consistencies and contradictions of performance metrics in multiobjective optimization. IEEE Trans Cybern 44:2391–2404. https://doi.org/10.1109/TCYB.2014.2307319

By Hadi Moheb-Alizadeh; Donald P. Warsing Jr.; Richard E. Kouri; Sajjad Taghiyeh and Robert B. Handfield

Reported by Author; Author; Author; Author; Author

Treffer: Optimization of testing protocols to screen for COVID-19: a multi-objective model.

Weitere Informationen

Optimization of testing protocols to screen for COVID-19: a multi-objective model: Optimization of testing protocols to screen for COVID-19: a multi-objective model: H. Moheb-Alizadeh et al

Highlights

Introduction

Literature review

Testing model

Testing demand model

Decision variables

Objectives and optimization structure

Optimization algorithm

Solution process

Algorithm formulation

Definition 1

Scenario 1 – Implementing a common protocol

Scenario 2 – Implementing independent protocols

Computational application and results

Population data for public school screening in North Carolina

Case parameters for public school screening

Generating Pareto-optimal protocols for public school screening in North Carolina

Robustness analysis of Pareto-optimal protocols for public school screening in North Carolina

Discussion and conclusion

Acknowledgements

Declarations

Conflicts of interest

Appendix A. SEIR computations

Appendix B. Non-parametric test for independent- versus common-protocol solution consistency

Publisher's Note

References

Links

Zusatz-Funktionen