Introduction

The COVID-19 pandemic, engendered by the SARS-CoV-2 virus, has impacted the global population, accounting for more than 609 million confirmed infections and 6.5 million mortalities1. In India, the situation has been particularly severe, with information indicating over 44.5 million cases and approximately 0.52 million fatalities1. This information positions India as a preeminent focus of the pandemic, superseded only by the United States1,2.

The urgency and scale of the pandemic have necessitated the development of robust mathematical models to inform public health strategies. Traditional compartmental models, such as the susceptible-infectious-recovered (SIR) model formulated in 19273, have been instrumental in describing the dynamics of infectious diseases like dengue4, malaria5, and tuberculosis6. However, the unique epidemiological characteristics of COVID-19 have required modifications to these models7, including the integration of wavelet theory and machine learning algorithms8.

A critical feature of the COVID-19 pandemic is the important role of asymptomatic carriers in virus transmission. Asymptomatic cases, who do not exhibit symptoms, often remain undetected and have more contacts compared to symptomatic cases, facilitating extensive community spread. The challenge of identifying and testing asymptomatic individuals leads to underreporting, which exacerbates the spread of the virus and can result in superinfection incidents. Short-term predictions are crucial for supporting rapid decision-making in prevention and mitigation efforts. Effective prediction is essential for healthcare resource allocation and controlling the virus spread. Mathematical models have been central in optimizing control strategies and conducting cost-effectiveness analyses of various intervention measures, which are critical in managing infectious diseases like COVID-199,10,11,12,13,14.

Several mathematical models have been developed to address the complexities of COVID-19 transmission, particularly incorporating asymptomatic and symptomatic cases. For instance, in15, asymptomatic, undetected, and total cases were estimated during the outbreak in Wuhan, highlighting the impact of nonpharmaceutical interventions on controlling the outbreak. In11, COVID-19 epidemics in multiple countries were modeled, incorporating both reported and unreported cases and emphasizing the timing and magnitude of public policies. Additionally, in16, the impact of unreported cases in North African countries was investigated using mobility reports to assess movement restrictions’ effects. In17, the dynamics of unreported cases in Morocco was analyzed, demonstrating the importance of accurate parameter estimation. In18, both asymptomatic and symptomatic compartments were considered in a model for COVID-19 transmission dynamics in India, emphasizing the role of undetected cases in the transmission dynamics. Further models, such as9,10,14,19,20,21, explored various intervention strategies and their effectiveness in controlling the spread of COVID-19. Studies in different countries also highlighted various approaches to understanding COVID-19 impacts and outcomes22,23,24,25.

Other models have also been proposed to analyze the spread of COVID-19. For example, in26, a model incorporating contact tracing and hospitalization procedures was developed to study the dissemination of COVID-19 in India, revealing exponential growth. In12, the roles of asymptomatic, isolation, and quarantine compartments were explored, suggesting that quarantine or isolation measures were effective in reducing virus transmission. In19, a model to predict the dissemination of COVID-19 and to assess the impact of quarantine and hospitalization was presented, demonstrating the sensitivity of model parameters in preventing the spread of the virus through intervention measures. Control problems were solved in 27 to evaluate quarantine, isolation, and social distancing, along with surface disinfection and emergency provision, mitigating the dissemination of COVID-19. A model incorporating age was constructed in 28 to assess the efficacy of nonpharmaceutical interventions in mitigating COVID-19 spread and reducing the strain on healthcare infrastructure. Additionally, in.20,29,30, further insights into optimal intervention strategies and the impact of various non-pharmaceutical interventions on the dynamics of COVID-19 were provided.

The advancement in epidemiological modeling has been enriched by the development of fractional pandemic models31 and other innovations32. The exploration of approaches in differential systems33, combined with machine learning24 and metaheuristic solutions for system stability34, has opened new pathways in computational epidemiology. Moreover, the application of statistical methods in epidemic modeling32,35 highlights the importance of statistical inference for disease forecasting. Furthermore, studies like11,27,28,29,30 contributed to the understanding of COVID-19 transmission dynamics and the effectiveness of various intervention strategies.

The objectives of this study are twofold: (i) to introduce a novel six-compartment epidemiological model that integrates both reported and unreported symptomatic cases to enhance prediction precision across various epidemiological scenarios, particularly by incorporating a higher isolation rate for unreported cases once detected, addressing the challenge of initial underreporting; and (ii) to apply this model to COVID-19 data from India to evaluate the cost-effectiveness of different intervention strategies using optimal control theory. This model uniquely captures both reported and unreported symptomatic cases, integrating sensitivity analyses and cost-effectiveness evaluations to offer a comprehensive view of disease dynamics and provide valuable insights for public health policy and strategy formulation. This article is organized as follows. “Methods” delineates the approaches used in this study related to the mathematical structure of the new model as well as an analysis of its stability and inherent properties. In “Results”, our findings are provided which are associated with the model fitting to empirical data and its predictive prowess, as well as the application of optimal control methods. “Discussion and conclusions” establishes a discussion of the obtained results synthesizing our findings and providing our conclusions about the present study.

Methods

This section introduces the methods considered in our study, focusing on the formulation of the new model and the underlying principles guiding its structure. We also analyze the model, examining its mathematical properties and epidemiological implications.

Formulation of the model

Within the range of the varied mathematical models for epidemiological transmission, the new model proposed in this study addresses nonlinear dynamics in the spread of COVID-19. We segment the human population of size \(N\) into six main compartments: \(S\), \(A\), \(I\), \(U\), \(G\), and \(R\), where \(S\) indicates the susceptible individuals, \(A\) the asymptomatic, \(I\) the infectious (reported symptomatic), \(U\) the unreported symptomatic, \(G\) the isolated, and \(R\) the recovered ones. This segmentation forms the basis of our nonlinear differential system and captures the disease transmission dynamics. The total population at any given time is expressed as \(N = S + A + I + U + G + R\), encompassing all the compartments. This model, designated as SAIUGR, allows analysis of the disease spread and effectiveness of various intervention strategies.

The SAIUGR model is formulated for the spread of COVID-19 by means of a system of nonlinear ordinary differential equations as

$$\begin{aligned} \frac{\text{d}S}{\text{d}t}&= \Lambda -\beta (\zeta _{A}\,A+\zeta _{I}\,I+\zeta _{G}\,G) \frac{S}{N}-\mu S,\nonumber \\ \frac{\text{d}A}{\text{d}t}&=\beta (\zeta _{A}\,A+\zeta _{I}\,I+\zeta _{G}\,G) \frac{S}{N}- (\omega + \mu )A,\nonumber \\ \frac{\text{d}I}{\text{d}t}&=\theta \,\omega \,A-(\eta _{I}+\gamma _{I}+\mu _{I}+\mu )I,\nonumber \\ \frac{\text{d}U}{\text{d}t}&= (1-\theta )\omega \,A - (\eta _{U}+\gamma _{U}+\mu _{U}+\mu )U,\nonumber \\ \frac{\text{d}G}{\text{d}t}&=\eta _{I}\,I+\eta \,\omega \,A a_{U}\,U-(\gamma _{G}+\mu _{G}+\mu )G,\nonumber \\ \frac{\text{d}R}{\text{d}t}&= \gamma _{I}\,I + \gamma _{U} U + \gamma _{G}\,G- \mu R. \end{aligned}$$
(1)

The SAIUGR model stated from (1) includes several parameters that describe the dynamics of disease transmission and progression. The parameter \(\Lambda\) represents the consistent influx of susceptibles due to births and migrations, while \(\mu\) denotes the natural mortality rate. Transmissibility is embodied by \(\beta\), which connects susceptible and infected cases, with transmission coefficients \(\beta \zeta _{A}\), \(\beta \zeta _{I}\), and \(\beta \zeta _{G}\) for asymptomatic, reported symptomatic, and isolated cases, respectively. These coefficients are influenced by measures such as face masks, lockdowns, and social distancing, contributing to dampening the infection rate. Transition of asymptomatic individuals to either reported symptomatic (\(I\)) or unreported symptomatic (\(U\)) categories at a rate \(\omega\), with \(\theta\) representing the proportion of symptomatic cases detected and reported, and \(1-\theta\) the fraction that remains unreported. Recovery rates are given by \(\gamma _{I}\), \(\gamma _{U}\), and \(\gamma _{G}\) for reported symptomatic, unreported symptomatic, and isolated individuals, respectively, with their corresponding mortality rates being denoted by \(\mu _{I}\), \(\mu _{U}\), and \(\mu _{G}\), respectively. These parameters account for the varying outcomes among different groups, which are crucial for understanding the disease dynamics that can be visualized in Fig. 1.

Fig. 1
figure 1

Flowchart of the dynamics in the SAIUGR model.

Analysis of parameters and their implications

To understand the working and mechanism of our SAIUGR model, one must examine each compartment and parameter involved, as they determine the disease spread and its eventual outcomes. Table 1 offers a systematic breakdown, describing each key parameter, along with their dvalues. Individuals, whether detected (compartment \(I\)) or undetected (compartment \(U\)), transition to the isolated compartment at rates \(\eta _{I}\) and \(\eta _{U}\) respectively.

Table 1 Description and value of the indicated parameter in the SAIUGR model.

The temporal dynamics of the SAIUGR model expressed in (1) include the mentioned six compartments: \(S\), \(A\), \(I\), \(U\), \(G\), and \(R\). Each compartment represents different stages in the disease progression. We initialize the model with non-negative values \(S(0)=S_{0}\), \(A(0)=A_{0}\), \(I(0)=I_{0}\), \(U(0)=U_{0}\), \(G(0)=G_{0}\), and \(R(0)=R_{0}\), where the numbers in parentheses denote the initial time. These initial values, representing the state of each compartment at the start of the formulation, are crucial for the integrity and realism of the model predictions.

The isolation rate for unreported symptomatic individuals (\(\eta _{U}\)) is set higher than that for reported symptomatic individuals (\(\eta _{I}\)) to represent the intensified isolation efforts that occur upon the late identification of previously unreported cases. This approach is modeled to correct for initial underreporting and to mitigate the risk of continued disease transmission from these individuals.

The SAIUGR model, grounded in empirical data and aligned with existing research, is designed as a comprehensive nonlinear dynamical system as established in (1), effectively describing the transmission and progression of COVID-19.

In the next section, we analyze the model mathematical structure and explore its predictive capabilities across different scenarios, examining its epidemiological implications.

Positivity and boundedness of the model solutions

The analysis of positivity and boundedness validates fundamental properties to maintain biological relevance and practical utility. Positivity guarantees non-negative populations across all compartments throughout an epidemic duration. Similarly, boundedness ensures that solutions do not unrealistically approach infinity. Positivity and boundedness aspects not only validate the model biological soundness but also set the stage for further stability and sensitivity explorations. Theorem 1 postulates these aspects.

Theorem 1

Let all the solutions \((S(t), A(t), I(t), U(t), G(t), R(t))\) of the structure stated in (1) have initial values \(S(0)=S_{0}\), \(A(0)=A_{0}\), \(I(0)=I_{0}\), \(U(0)=U_{0}\), \(G(0)=G_{0}\), and \(R(0)=R_{0}\). Then, these solutions are nonnegative and uniformly bounded in the region defined as

$$\begin{aligned} \rho \equiv \left\{ (S(t), A(t), I(t), U(t), G(t), R(t)) \in {\mathbb {R}}_{+}^{6}\text{: } 0 \le S(t), A(t), I(t), U(t), G(t), R(t) \le \frac{\Lambda }{\mu } \right\} . \end{aligned}$$

The proof of Theorem 1 is provided in Appendix A.

Derivation of the basic reproduction number

In epidemiological studies, the basic reproduction number \(R_{0}\) is essential. It represents the average number of secondary infections that an infected individual is expected to cause in a wholly susceptible population3. The value of the parameter \(R_{0}\) determines the potential spread of the disease within a population. Specifically:

  • If \(R_{0} > 1\), the disease spreads and can lead to an epidemic, but the disease may reach a higher endemic equilibrium (EE) that persists in the population at a constant level.

  • If \(R_{0} < 1\), the disease will eventually die out.

  • If \(R_{0} = 1\), that is, each infected individual, on average, produces only one secondary case and the EE persists in the population at a steady level without leading to large outbreaks.

At the disease-free equilibrium (DFE) \(E^{0} = ({\Lambda }/{\mu }, 0, 0, 0, 0, 0)\), \({\Lambda }/{\mu }\) represents \(S\) in the absence of the disease. To assess the stability of the DFE and understand the potential for disease emergence or eradication within the population, we employ the Jacobian matrices \({\mathscr {F}}\) and \({\mathscr {V}}\), which are constructed to represent the rates of new infections and transitions between compartments, respectively, for the entire system. In the context of epidemiological modeling, \({\mathscr {F}}\) captures the new infections caused by different infectious compartments, while \({\mathscr {V}}\) captures the movement between compartments and the removal of individuals from the infected classes.

To calculate \(R_0\), we use the next-generation matrix method as outlined in42,43. To do this calculation, we construct the matrices \({\mathscr {F}}\) and \({\mathscr {V}}\), which represent the rates of new infections and transitions between compartments for the entire system, and are defined as

$$\begin{aligned} {\mathscr {F}} = \begin{pmatrix} \beta \zeta _{A} & \beta \zeta _{I} & 0 & \beta \zeta _{G} \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ \end{pmatrix},\quad {\mathscr {V}} = \begin{pmatrix} (\omega + \mu ) & 0 & 0 & 0 \\ -\theta \,\omega & (\eta _{I}+\gamma _{I}+\mu _{I}+\mu ) & 0 & 0 \\ -(1-\theta )\omega & 0 & (\eta _{U}+\gamma _{U}+\mu _{U}+\mu ) & 0 \\ 0 & -\eta _{I} & -\eta _{U} & (\gamma _{G}+\mu _{G}+\mu ) \\ \end{pmatrix}. \end{aligned}$$

The next-generation matrix is given by

$$\begin{aligned} {\mathscr {F}}{\mathscr {V}}^{-1}= & \begin{pmatrix} \beta \zeta _{A} & \beta \zeta _{I} & 0 & \beta \zeta _{G} \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ \end{pmatrix}\\ & \times \begin{pmatrix} \frac{1}{\omega + \mu } & 0 & 0 & 0 \\ \frac{\theta \omega }{(\omega + \mu )(\eta _i + \eta + \mu _i + \mu )} & \frac{1}{\eta _i + \eta + \mu _i + \mu } & 0 & 0 \\ \frac{-(1 - \theta ) \omega }{(\omega + \mu )(\eta _u + \eta + \mu _u + \mu )} & 0 & \frac{1}{\eta _u + \eta + \mu _u + \mu } & 0 \\ \frac{\omega (\theta (\eta _u + \mu _u + \eta ) \eta _i + \eta _u (\eta _i - (1 - \theta )(\eta _i + \eta + \mu _i + \mu )))}{(\omega + \mu )(\eta _g + \mu _g + \eta )(\eta _u + \eta + \mu _u + \mu )(\eta _i + \eta + \mu _i + \mu )} & \frac{\eta _i}{(\eta _g + \mu _g + \eta )(\eta _i + \eta + \mu _i + \mu )} & \frac{\eta _u}{(\eta _g + \mu _g + \eta )(\eta _u + \eta + \mu _u + \mu )} & \frac{1}{\eta _g + \mu _g + \eta } \end{pmatrix}. \end{aligned}$$

By calculating the largest eigenvalue of \({\mathscr {F}}{\mathscr {V}}^{-1}\), we can determine the basic reproduction number as

$$\begin{aligned} R_{0} = \frac{\beta \zeta _{A}}{\omega + \mu } + \frac{\beta \theta \,\omega \zeta _{I}}{(\eta _{I} + \gamma _{I} + \mu _{I} + \mu )(\omega + \mu )} + \frac{\beta \zeta _{G}\eta _{U}\theta \,\omega (\eta _{U} + \gamma _{U} + \mu _{U} + \mu ) + \beta \zeta _{G}\eta _{U}(1-\theta )\omega (\eta _{I} + \gamma _{I} + \mu _{I} + \mu )}{(\eta _{U} + \gamma _{U} + \mu _{U} + \mu )(\eta _{I} + \gamma _{I} + \mu _{I} + \mu )(\gamma _{G} + \mu _{G} + \mu )(\omega + \mu )}. \end{aligned}$$

Stability analysis of the disease-free equilibrium

The DFE of the model is determined by setting the rates of change of all compartments to zero, as per the system of equations given in (1). Specifically, this implies \({\text{d}S}/{\text{d}t}=0\), \({\text{d}A}/{\text{d}t}=0\), \({\text{d}I}/{\text{d}t}=0\), \({\text{d}U}/{\text{d}t}=0\), \({\text{d}G}/{\text{d}t}=0\), and \({\text{d}R}/{\text{d}t}=0\). In the DFE, the infected compartments are all zero, that is, \(A = 0\), \(I = 0\), \(U = 0\), \(G = 0\), and \(R = 0\). The susceptible population \(S\) at the DFE can be deduced and is typically represented as the entire population in the absence of infection, which is \(\Lambda /\mu\) under the assumption of balanced birth and death rates. Theorems 2 and 3 state the stability of this DFE of the SAIUGR model.

Theorem 2

Let \(R_{0} < 1\). Then, the DFE \(E^{0}\) of the model described in (1) is asymptotically stable in a local sense.

Theorem 3

For \(R_{0} < 1\), the DFE \(E^{0}\) of the system defined in (1) is asymptotically stable in a global sense.

The proofs of Theorems 2 and 3 are provided in Appendix A.

Stability analysis of endemic equilibrium

Let \(\Sigma ^{*}=(S^{*}, A^{*}, I^{*}, U^{*}, G^{*}, R^{*})\) be the EE point of the structure given in (1) of the SAIUGR model and

$$\begin{aligned} \chi ^{*} = \left( \zeta _{A}\,A^{*} + \zeta _{I}\,I^{*} + \zeta _{G}\,G^{*}\right) \left( \frac{1}{N^{*}}\right) \end{aligned}$$
(2)

be the effective transmission rate (force of infection). In addition, suppose that \({\text{d}S}/{\text{d}t}=0\), \({\text{d}A}/{\text{d}t}=0\), \({\text{d}I}/{\text{d}t}=0\), \({\text{d}U}/{\text{d}t}=0\), \({\text{d}G}/{\text{d}t}=0\) and \({\text{d}R}/{\text{d}t}=0\). Then, we have that

$$\begin{aligned} & S^{*} = \dfrac{\Lambda }{\chi ^{*}+\mu }, \quad A^{*} = \dfrac{\chi ^{*}S^{*}}{K_{1}}, \quad I^{*}=\dfrac{\theta \,\omega \chi ^{*}S^{*}}{K_{1}K_{2}}, \quad U^{*} =\dfrac{(1-\theta )\omega \chi ^{*}S^{*}}{K_{1}K_{3}}, \quad G^{*}=\dfrac{(\eta _{I}\theta \,\omega K_{3}+\eta _{U}(1-\theta )\omega K_{2})\chi ^{*}S^{*}}{K_{1}K_{2}K_{3}K_{4}}, \\ & R^{*}=\dfrac{\gamma _{I}\theta \,\omega \chi ^{*}S^{*}}{\mu K_{1}K_{2}}+\dfrac{\gamma _{U}(1-\theta )\omega \chi ^{*}S^{*}}{\mu K_{1}K_{3}}+\dfrac{\gamma _{G}(\eta _{I}\theta \,\omega K_{3}+\eta _{U}(1-\theta )\omega K_{2})\chi ^{*}S^{*}}{\mu K_{1}K_{2}K_{3}K_{4}}, \end{aligned}$$

where \(K_{1}=(\omega +\mu )\), \(K_{2}=(\eta _{I}+\gamma _{I}+\mu _{I}+\mu )\), \(K_{3}=(\eta _{U}+\gamma _{U}+\mu _{U}+\mu )\), and \(K_{4}=(\eta _{G}+\mu _{G}+\mu )\). Substituting all the above expressions in (2), we get the nonzero equilibrium of the system stated in (1) satisfying the linear equation in terms of \(\chi ^{*}\) as \(P_{0}\chi ^{*}+P_{1} =0\), where \(P_{0}=\mu K_{2}K_{3}K_{4}+(\gamma _{I}+\mu )\theta \,\omega K_{3}K_{4}+\gamma _{U}(1-\theta )\omega K_{2}K_{4}+(\gamma _{G}+\mu )(\eta _{I}\theta \,\omega +\eta _{U}(1-\theta )\omega K_{2})\) and  \(P_{1}=\mu K_{1}K_{2}K_{3}K_{4} (1-R_{0})\), with \(P_{0}\) encapsulating various transmission and recovery dynamics, and \(P_{1}\) reflecting the impact of changes in the disease transmission rate and natural mortality rate. Clearly, \(K_{1}>0\), \(K_{2}>0\), \(K_{3}>0\), and \(K_{4}>0\). Since \((\gamma _{I}+\mu )>0\) and \((\gamma _{G}+\mu )>0\), we have \(P_{0}>0\). Similarly \(P_{1}>0\) as \(R_{0}>1\). Therefore, we can conclude that the system formulated in (1) possesses a unique positive EE point if \(R_{0}>1\), while it does not have any positive EE point if \(R_{0}<1\), which is formalized in Theorems 4 and 5. The proofs of Theorems 4 and 5 can be found in Appendix A.

Theorem 4

Let \(R_{0} > 1\). Then, the EE point \(\Sigma ^{*}\) is asymptotically stable in a local sense.

Theorem 5

For \(R_{0} > 1\), the EE point \(\Sigma ^{*}\) is asymptotically stable in a global sense.

Concluding our analysis of the SAIUGR model, we have detailed the transmission dynamics of COVID-19, providing an understanding of how the disease spreads. This analysis sets the stage for applying the model to empirical data, allowing for further validation and refinement of the results obtained.

Results

This section focuses on the empirical aspects of our study, demonstrating its application and validation through a series of analytical steps. We begin by calibrating the model to real-world COVID-19 case data from India. Then, we explore the application of optimal control theory to our epidemiological model, aiming to minimize the impact of COVID-19. We explore various control strategies and their effectiveness, both in isolation and in combination, through a cost-effectiveness analysis.

Model calibration to data

Building upon the analysis presented earlier, we now empirically calibrate the SAIUGR model using data from reported COVID-19 cases in India. This step aligns the model with real-world observations and compares theoretical forecasts against real-life epidemiological patterns.

The dataset, covering daily and cumulative confirmed cases from 30 January 2020 to 24 November 2020, a span of 300 days, was sourced from the COVID-19 API data portal (www.covid19india.org, accessed on 21 July 2024). We estimate the parameters \(\beta\), \(\eta _{I}\), \(\eta _{U}\), \(\gamma _{I}\), and \(\gamma _{U}\) during model calibration. We employ the Lsqnonlin function of MATLAB44, a non-linear least-square method, to optimize these parameters by minimizing the residual sum of squares between the observed data and model predictions.

Using the mentioned dataset, we calculated the basic reproduction number as \(R_{0} = 2.147\), using the approaches outlined in the present article. This calculation reports the severity of the COVID-19 outbreak in India at that time and is important for understanding the virus transmission dynamics as per our model. Figure 2 shows the alignment of the SAIUGR model with both the daily and cumulative confirmed COVID-19 cases in India. The empirical data are depicted using blue markers, while the simulation from the model is represented by a continuous red curve, illustrating the model fit to the real-world data.

Fig. 2
figure 2

Plots of simulations using the SAIUGR model with daily (a) and cumulative (b) numbers of COVID-19 cases in India from 30 January 2020 to 24 November 2020.

Figure 3 extends the model predictions, projecting the likely trajectory over an impending 50-day span, concluding on 12 January 2021. In this figure, the steadfast blue trajectory symbolizes the model projections for both daily new infections and the cumulative incidence of COVID-19 cases. These predictions hint at a potential downturn in case numbers over this forecasted interval. The methodology employed in this phase, utilizing non-linear least-square fitting in MATLAB, demonstrates a rigorous approach to model calibration. The alignment of the SAIUGR model with empirical COVID-19 data substantiates the robustness of the chosen method and its effectiveness in parameter estimation.

Fig. 3
figure 3

Plots of predictions for the next 50 days using the calibrated model compared to daily (a) and cumulative (b) numbers of COVID-19 cases in India from 30 January 2020 to 24 November 2020.

Analysis of transcritical bifurcation

To understand the behavior of the DFE and EE in the system formulated in (1), we analyze the presence of a transcritical bifurcation at the critical point where \(R_{0} = 1\). This analysis is essential to comprehend the critical thresholds at which the system dynamics change, marking the transition from a state where the disease can be eradicated to a state where it persists within the population.

In Fig. 4, we plot the parameter \(\chi ^{*}\) against \(R_{0}\), where \(\chi ^{*}\) represents the effective transmission rate, as mentioned. For \(R_{0} < 1\), the DFE exhibits local asymptotic stability, as indicated by the solid green line. Conversely, for \(R_{0} > 1\), the DFE loses its stability, transitioning to an unstable state, depicted by the dashed red line. The ascending blue line in Fig. 4 denotes a stable EE, indicating a shift from DFE to EE, which suggests the sustained presence of the virus in the population.

Fig. 4
figure 4

Plot of the transcritical bifurcation based on \(R_{0}\) versus \(\chi ^*\) using COVID-19 data from India.

From “Stability analysis of endemic equilibrium”, we know that the condition \(P_{0} \chi ^{*} + P_{1} = 0\) plays a relevant role in the stability analysis of the EE. Here, \(P_{0}\) and \(P_{1}\) are parameters derived from the model stability analysis. Specifically, \(P_{0}\) encapsulates various transmission and recovery dynamics, while \(P_{1}\) reflects the impact of changes in the disease transmission rate and natural mortality rate, as mentioned.

For \(R_{0} > 1\), the application of the Descartes rule of signs, combined with the condition that \(P_{0} > 0\) and \(P_{1} < 0\), suggests the presence of a singular EE. Conversely, the absence of an EE is noted for \(R_{0} < 1\). Figure 4 illustrates these equilibriums, showing that the system experiences a transcritical bifurcation at \(R_{0}=1\). In this figure, the green line represents the stable DFE \(E^{0}\) for \(R_{0} < 1\), while the red dashed line represents the unstable DFE point for \(R_{0} > 1\). The ascending blue line represents the stable branch of the EE point \(\Sigma ^{*}\). Therefore, \(\Sigma ^{*}\) is stable for \(R_{0} > 1\) and unstable for \(R_{0} < 1\). Similarly, \(E^{0}\) is stable for \(R_{0} < 1\) and becomes unstable for \(R_{0} > 1\). Thus, the SAIUGR model, as defined in (1), indicates that the coronavirus disease spreads in the population when \(R_{0} > 1\), as evidenced by the stable presence of the point \(\Sigma ^{*}\) in this scenario.

Sensitivity analysis

The sensitivity analysis states the importance of various parameters in the disease spread dynamics. An exposition concerning sensitivity analyses for malaria is provided in45, offering insights for enhancing the robustness of our model. This analysis grants us the capability to identify parameters that have a substantial influence on \(R_{0}\). By computing the sensitivity indexes of these control parameters, we obtain the contribution of each parameter to disease proliferation and persistence.

Adopting the methodology delineated in46, the normalized forward sensitivity index of a variable M with respect to a parameter q is represented as

$$\begin{aligned} \varGamma _{q}^{M}= \frac{\text{d} M}{\text{d }q} \frac{q}{M}. \end{aligned}$$

Utilizing this index, one can deduce the normalized forward sensitivity index corresponding to every parameter integrated into the expression for \(R_{0}\). For example, the normalized forward sensitivity index of \(R_{0}\) concerning the parameter \(\beta\) equates to one and it is given by

$$\begin{aligned} \varGamma _{\beta }^{R_{0}}= \frac{\text{d} R_{0}}{\text{d} \beta } \frac{\beta }{R_{0}}. \end{aligned}$$

Figure 5 shows the key parameters for diminishing the value of \(R_0\) beneath the threshold of one, inhibiting the viral spread.

Fig. 5
figure 5

Bar plot of the normalized forward sensitivity index of \(R_{0}\) for the indicated parameter using Indian COVID-19 data.

The positive sensitivity indices, associated with \(\zeta _{A}\), \(\zeta _{I}\), \(\zeta _{G}\), \(\theta\), and \(\beta\), predominantly influence the frequency of disease manifestations, with \(\beta\) having the highest positive index, indicating its crucial role in transmission dynamics. In contrast, negative sensitivity indices linked to parameters \(\omega\), \(\eta _{I}\), \(\eta _{U}\), \(\gamma _{I}\), \(\gamma _{U}\), \(\gamma _{G}\), \(\mu _{I}\), \(\mu _{U}\), \(\mu _{G}\), and \(\mu\) attenuate the disease transmission, with \(\gamma _{U}\) showing the most negative impact. Remarkably, parameters such as the isolation rate of non-disclosed cases (\(\eta _{U}\)), recovery rates of both non-disclosed symptomatic individuals (\(\gamma _{U}\)), and isolated entities (\(\gamma _{G}\)) wield an effect on curbing the dissemination of COVID-19. This highlights the importance of improving detection and isolation measures to control the spread of the virus effectively. Furthermore, to scrutinize the impact of model parameters on \(R_{0}\), contour representations of \(R_{0}\) in relation to specific parameters are exhibited in Fig. 6. In Fig. 6a, the contour map shows that, increasing the number of individuals in isolation, through joining individuals from the reported symptomatic infected group, reduces \(R_{0}\) and hence COVID-19 spread declines. Figure 6b shows that, as the isolation of both unreported and reported symptomatic infected individuals increases, with the value of \(R_{0}\) decreasing as a result, the COVID-19 spread reduces.

To enrich our comprehension regarding the behavior of \(R_{0}\), its relation with the disease transmission rate \(\beta\) and the transition rate \(\omega\) from asymptomatic to symptomatic cases has been graphically represented in Fig. 7. The three-dimensional representation in this figure features a ’folded leaf’ structure, signifying a critical area where small variations in \(\omega\) and \(\beta\) lead to changes in \(R_0\), particularly when \(R_0\) approaches and surpasses the critical threshold of the value one, indicating increased disease spread.

Fig. 6
figure 6

Contour plots of the relation of \(R_{0}\) with (a) \(\beta , \eta _{I}\) and (b) \(\eta _{I}, \eta _{U}\) using Indian COVID-19 data.

Fig. 7
figure 7

A three-dimensional plot of the basic reproduction number, as contingent upon the disease transmission rate \(\beta\) and the shift rate \(\omega\) from asymptomatic progressions to symptomatic manifestations using Indian COVID-19 data.

Impact of parameters on prevalence of COVID 19

Next, we conduct a time series analysis to explore the effect of the parameters \(\eta _{I}\) and \(\eta _{U}\) on the total symptomatic infected and isolated populations over the period [0,400] days. Figure 8a illustrates the impact of varying isolation rates on the total symptomatic infected population (\(I+U\)). As depicted by the bell-shaped curves in different colors, each representing a unique pair of isolation rates (\(\eta _{I}\), \(\eta _{U}\)), we observe a distinct pattern. The curve with the lowest peak, in blue, corresponds to higher isolation rates (\(\eta _{I}=0.0775\), \(\eta _{U}=0.1368\)), demonstrating a reduced number of symptomatic infections. As the isolation rates decrease (moving from violet to black, and then to red), the pattern shows a peak of the curve increases, signifying a higher number of symptomatic infections. This pattern reveals a clear relation: higher isolation rates lead to a decrease in symptomatic infections, emphasizing the effectiveness of isolation measures in controlling the spread of the virus. In Fig. 8b, the focus shifts to the isolated population (\(G\)) over time, with each curve representing different combinations of isolation rates. Similar to Fig. 8a, the lowest curve in blue (\(\eta _{I}=0.002\), \(\eta _{U}=0.046\)) indicates fewer isolated individuals, relating with lower isolation rates. As the isolation rates increase (violet to black, and then to red), the number of isolated individuals rises, reaching its peak in the red curve (\(\eta _{I}=0.017\), \(\eta _{U}=0.076\)). The overlapping tails of these curves on the right side suggest that irrespective of the initial differences in isolation rates, the number of isolated individuals converges over time, reflecting the dynamic nature of the pandemic progression and the impact of varying isolation strategies.

The analysis considers a higher \(\eta _{U}\) compared to \(\eta _{I}\), reflecting the model strategy to represent a rapid response following the late identification of these cases. This higher rate aims to demonstrate the effectiveness of isolation in reducing the spread of the disease once unreported cases are detected.

Fig. 8
figure 8

Time series of (a) symptomatic infected \((I+U)\) population and (b) isolated (G) population with respect to isolation rates \(\eta _{I}\) and \(\eta _{U}\) over a period [0, 400] days using Indian COVID-19 data.

Next, we explore the application of optimal control theory to our epidemiological model, aiming to minimize the impact of COVID-19. We explore various control strategies and their effectiveness, both in isolation and in combination, through a cost-effectiveness analysis.

Formulation of the optimal control problem

We implement optimal control theory to our epidemiological model to minimize the number of infected individuals. The model incorporates two control variables, \(u_1\) and \(u_2\), each targeting different aspects of disease management.

The susceptible population becomes aware of disease and their control techniques when they are provided with enough statistics, leading to a shift in population behavior. Due to the systematic dissemination of news, people have started practicing wearing face masks, adhering to social distancing measures, maintaining good cleanliness, observing quarantine protocols, and even using home isolation strategies. Therefore, we alter the rate of isolation by implementing the control factor \((1 - \varepsilon _{1}u_{1}(t))\), which measures the external efforts to prevent or control interactions between isolated \(G\) and susceptible \(S\) populations, where the parameter \(\varepsilon _1\) represents the effectiveness of these isolation measures.

The control variable \(u_1(t)\), ranging from zero to one, modifies the isolation rate and is defined by the factor \((1 - \varepsilon _{1}u_{1}(t))\). Here, \(u_1(t)\) regulates the interactions between \(G\) and \(S\) populations. A higher value of \(u_1(t)\) (closer to one) indicates more stringent isolation measures, aimed at reducing the disease transmission rate. Administering an optimal treatment strategy to persons who are isolated in the early stage of COVID-19 infection can reduce the mortality rate of the disease. The framework that is highly anticipated is when \(\varepsilon _{1}u_{1}(t)=1\), meaning that the interactions between isolated and susceptible populations are effectively avoided, resulting in a successful prevention of transmission and reducing the total transmission rate to zero. In this context, the value 0 represents a lack of response, whereas the value 1 represents a complete response as a result of the complete isolation provided.

Early in the epidemic progression, appropriate treatment strategies, inclusive of medical kit provisions and vaccination programs for isolated individuals, can mitigate the associated mortality rate. With these strategies, the epidemiological model, as set out in (1), is adjusted by integrating the treatment stated as \(-\varepsilon _{2}u_{2}(t)G\), where \(\varepsilon _{2}\) represents the treatment rate associated with the control variable \(u_{2}(t)\). Various expenses are associated with medication, such as the cost of drugs, healthcare services, isolation measures, vaccinations, and medical supplies. These costs should be considered during the medication period. Therefore, we define \(u_{2}(t)\) as a control variable that ranges from 0 to 1, with 0 indicating no reaction and 1 indicating a complete response following the therapy.

In this context, \(u_{2}\), also within the range [0,1], is introduced to optimize treatment strategies. A value of \(u_2\) near to one denotes a comprehensive implementation of treatment measures, potentially maximizing the per capita efficiency of vaccination rates. The parameter \(\varepsilon _2\) reflects the effectiveness of these treatment measures. Thus, the utilization of \(u_2\) reflects the model capacity to adjust treatment responses in line with the disease progression. The formulation for the optimal control model is presented as

$$\begin{aligned} \frac{\text{d}S}{\text{d}t}&= \Lambda -\beta (\zeta _{A}\,A+\zeta _{I}\,I+\zeta _{G}(1-\varepsilon _{1}u_{1})\frac{S}{N}-\mu S,\nonumber \\ \frac{\text{d}A}{\text{d}t}&=\beta (\zeta _{A}\,A+\zeta _{I}\,I+\zeta _{G}(1-\varepsilon _{1}u_{1})\frac{S}{N}- (\omega + \mu )A,\nonumber \\ \frac{\text{d}I}{\text{d}t}&=\theta \,\omega \,A-(\eta _{I}+\gamma _{I}+\mu _{I}+\mu )I,\nonumber \\ \frac{\text{d}U}{\text{d}t}&= (1-\theta )\omega \,A - (\eta _{U}+\gamma _{U}+\mu _{U}+\mu )U,\nonumber \\ \frac{\text{d}G}{\text{d}t}&=\eta _{I}\,I+\eta _{U}\,U-(\gamma _{G}+\mu _{G}+\mu +\varepsilon _{2}u_{2})G,\nonumber \\ \frac{\text{d}R}{\text{d}t}&= \gamma _{I}\,I + \gamma _{U} U +\varepsilon _{2}u_{2}G+ \gamma _{G}\,G- \mu R, \end{aligned}$$
(3)

subject to minimizing the objective function given by

$$\begin{aligned} {\mathscr {M}}(u_{1}(t),u_{2}(t))=\int _{0}^{t} \left( C_{1}A+C_{2}I+C_{3}U+\frac{1}{2}(C_{4}u^{2}_{1}+C_{5}u^{2}_{2})\right) \ \text{d}u, \end{aligned}$$
(4)

where \(C_{1}\), \(C_{2}\), \(C_{3}\), \(C_{4}\), and \(C_{5}\) are nonnegative weight constants. Drawing upon the Pontryagin maximum principle47, the indispensable conditions for this optimal control challenge are deduced following the methodology presented in48. The integrand cumulative costs are represented by the Lagrangian function detailed as \({\mathscr {L}}(A,I,U,u_{1}(t),u_{2}(t)) = C_{1}A+C_{2}I+C_{3}U+0.5(C_{4}u^{2}_{1}+C_{5}u^{2}_{2}).\) For this problem, the Hamiltonian function \({\mathscr {H}}\) is given by

$$\begin{aligned} {\mathscr {H}}= & C_{1}A+C_{2}I+C_{3}U+\frac{1}{2}(C_{4}u^{2}_{1}+C_{5}u^{2}_{2}) + \lambda _{1}\left( \Lambda -\beta \left( \zeta _{A}\,A+\zeta _{I}\,I+\zeta _{G} (1-\varepsilon _{1}u_{1}G)\frac{S}{N}\right) -\mu S\right) \\ & + \lambda _{2}\left( \beta \left( \zeta _{A}\,A+\zeta _{I}\,I+\zeta _{G} (1-\varepsilon _{1}u_{1}G)\frac{S}{N}\right) - (\omega + \mu )A\right) + \lambda _{3}\left( \theta \,\omega \,A-(\eta _{I}+\gamma _{I}+\mu _{I}+\mu )I\right) \\ & + \lambda _{4}\left( (1-\theta )\omega \,A - (\eta _{U}+\gamma _{U}+\mu _{U}+\mu )U\right) + \lambda _{5}\left( \eta _{I}\,I+\eta _{U}\,U-(\gamma _{G}+\mu _{G}+\mu +\varepsilon _{2}u_{2})G\right) \\ & + \lambda _{6}\left( \gamma _{I}\,I + \gamma _{U} U +\varepsilon _{2}u_{2}G+ \gamma _{G}\,G- \mu R\right) , \end{aligned}$$

where \(\lambda _{1}\), \(\lambda _{2}\), \(\lambda _{3}\), \(\lambda _{4}\), \(\lambda _{5}\), and \(\lambda _{6}\) are the adjoint variables. The corresponding differential equations for the adjoint variables are stated as

$$\begin{aligned} \frac{\text{d}\lambda _{1}}{\text{d}t}= & -\frac{\partial {\mathscr {H}} }{\partial S} =\lambda _{1}-\lambda _{2}\beta (\zeta _{A} A+\zeta _{I} I)+\zeta _{G} (1-\epsilon _{1} u_{1} G)\frac{1}{N}+\lambda _{1}\mu ,\\ \frac{\text{d}\lambda _{2}}{\text{d}t}= & -\frac{\partial {\mathscr {H}}}{\partial A} =-C_{1}+(\lambda _{1}-\lambda _{2})\beta \frac{\zeta _{A}S}{N} +(\lambda _{2}-\lambda _{4})\omega +(\lambda _{3}-\lambda _{4})\omega \theta + \mu \lambda _{2},\\ \frac{\text{d}\lambda _{3}}{\text{d}t}= & -\frac{\partial {\mathscr {H}} }{\partial I} =-C_{2}+(\lambda _{1}-\lambda _{2})\beta \frac{\zeta _{I}S}{N} +(\lambda _{3}-\lambda _{5})\eta _{I}+ (\lambda _{3}-\lambda _{6})\gamma _{I}-(\mu +\mu _{I}) \lambda _{3},\\ \frac{\text{d}\lambda _{4}}{\text{d}t}= & -\frac{\partial {\mathscr {H}} }{\partial U} =-C_{3}+(\lambda _{4}-\lambda _{5})\eta _{U}+(\lambda _{4}-\lambda _{6})\gamma _{U}+(\mu _{U}+\mu )\lambda _{4},\\ \frac{\text{d}\lambda _{5}}{\text{d}t}= & -\frac{\partial {\mathscr {H}} }{\partial G} =(\lambda _{1}-\lambda _{2})\beta (1-\epsilon _{1}u_{1}G)\frac{S}{N}+(\lambda _{5}-\lambda _{6})(\gamma _{G}+\epsilon _{2}u_{2})-\lambda _{5}(\mu _{G}+\mu ),\\ \frac{\text{d}\lambda _{6}}{\text{d}t}= & -\frac{\partial {\mathscr {H}} }{\partial R} =\mu \lambda _{6}. \end{aligned}$$

Next, Theorem 6 establishes the existence of optimal controls. Having established the existence of optimal controls, we now aim to characterize them in Theorem 7. The proofs of Theorems 6 and 7 are provided in Appendix A.

Theorem 6

Let \(\Psi \equiv \{u_{1}(t),u_{2}(t)\text{: } 0\le u_{1}(t),u_{2}(t)\le 1,t\in [0,T]\}\). Then, under \(\Psi\), optimal controls \(u^{*}(t)=(u_{1}^{*}(t),u_{2}^{*}(t))\) exist and minimize the objective function \({\mathscr {M}}(u_{1}(t),u_{2}(t))\) stated in (4).

Theorem 7

Let optimal controls \(u_{1}^{*}(t)\) and \(u_{2}^{*}(t)\) minimize \({\mathscr {M}}(u_{1}(t),u_{2}(t))\) over the region \(\Psi\). Then, we have that \(u_{1}^{*}(t) = \min \left\{ 1, \max \left\{ 0,\bar{u_{1}}\right\} \right\}\) and \(u_{2}^{*}(t) = \min \left\{ 1, \max \left\{ 0,\bar{u_{2}}\right\} \right\}\), where \(\bar{u_{1}}\) and \(\bar{u_{2}}\) are expressed as

$$\begin{aligned} \bar{u_{1}} =\frac{(\lambda _{2}-\lambda _{1})\beta \zeta _{G}\,GS}{N C_{4}}, \quad \bar{u_{2}} = \frac{(\lambda _{3}-\lambda _{6})I}{C_{6}}. \end{aligned}$$

Optimal control model

Optimal control simulations are performed in MATLAB utilizing model parameters and fixed parameters mentioned in Table 1 over the time interval [0, 400] days. We employ the forward and backward difference approximation method for solving the extended system presented in (3), as per the approach outlined in49. We set the control weights as \(C_{1} = 1\), \(C_{2} = 1\), \(C_{3} = 1\), \(C_{4} = 50\), and \(C_{5} = 50\).

Figure 9 illustrates the effect of control interventions on the dynamics of the populations A, I, U, and G over the period [0, 400] days. Note that the application of the control measures \(u_{1}\) and \(u_{2}\) leads to a substantial reduction in the number of infections, compared to scenarios where no control interventions are implemented. Figure 10 presents the temporal profiles of the control variables \(u_{1}\) and \(u_{2}\) throughout the simulation period. The analysis indicates that maintaining these controls at their maximum value of one for a prolonged duration is essential for effectively eradicating the COVID-19 disease from the population. Figure 11 illustrates how the control profiles for \(u_1\) and \(u_2\) vary with the increasing cost associated with these control measures. Note that, as the cost of implementing the control parameters escalates, the duration for which it is necessary to maintain these controls at their maximum level of one reduces. This highlights the trade-off between the intensity and cost-effectiveness of control strategies in the management of the disease.

Our simulations underscore implications for controlling the spread of COVID-19 in the population. The control variables \(u_{1}\) and \(u_{2}\) must be maintained at a high level for extended periods to completely eradicate the disease. This insight emphasizes the importance of consistent efforts to contain the virus spread, even as the immediate threat of the pandemic recedes. When considering the rising cost, it becomes evident that the longer these controls are maintained at their maximum value, we have greater economic implications. Policymakers must weigh the economic against the public health benefits of these controls. In some contexts, the economic strain of rigorous controls might outweigh the health benefits of fewer infections, and vice-versa. Every decision must take into account the circumstances of the community and the potential impacts of these controls. Our results can also inform future research on optimal control strategies for infectious diseases. By incorporating additional parameters and constraints, models may be refined to more accurately reflect real-world complexities such as new variants, vaccination effects, and consequences of prematurely lifting controls. Therefore, our simulations offer vital insights into the control of COVID-19 spread. Understanding various control strategies and their trade-offs allows policymakers to make informed decisions that best serve their communities.

Fig. 9
figure 9

Plots of states for populations (a) A, (b) I, (c) U, and (d) G under control scenarios using Indian COVID-19 data.

Fig. 10
figure 10

Plots of optimal controls (a) \(u_{1}\) and (b) \(u_{2}\) using Indian COVID-19 data.

Fig. 11
figure 11

Plots of optimal controls (a) \(u_{1}\) and (b) \(u_{2}\) for the indicated \(C_4\) and \(C_5\) using Indian COVID-19 data.

Cost-effectiveness analysis

To reduce the spread of COVID-19 effectively and economically, we focus on identifying the most cost-efficient control strategy. This identification evaluates the efficacy of two distinct and combined control strategies: isolation control (strategy \(E_{1}\)), treatment control (strategy \(E_{2}\)), and their combined application (strategy \(E_{12}\)). We employ the incremental cost-effectiveness ratio (ICER)50,51 to assess the cost-effectiveness of these strategies. ICER is calculated by dividing the cost difference between strategies X and Y by the difference in their effectiveness, measured as the number of infections averted, where \(X, Y \in \{E_{1}, E_{2}, E_{12}\}\). Thus, for two strategies X and Y, with Y being more effective than X, that is, \(\text {TA}(Y) > \text {TA}(X)\), ICER values are determined as

$$\begin{aligned} \text {ICER}(X) = \frac{\text {TC}(X)}{\text {TA}(X)}, \quad \text {ICER}(Y) = \frac{\text {TC}(Y) - \text {TC}(X)}{\text {TA}(Y) - \text {TA}(X)}, \end{aligned}$$

where TC is the total cost for each strategy calculated using the objective functional stated in (4), and TA are the total infections averted by comparing the total number of COVID-19 infections with and without the corresponding strategy.

Table 2 displays the total infections averted and the total costs for each of the control strategies \(E_{1}\) and \(E_{2}\), individually and combined, sorted by the total infections averted. We now calculate and compare the effectiveness and costs of strategies \(E_{1}\) and \(E_{2}\), as detailed in Table 3. Comparing strategies \(E_{1}\) and \(E_{2}\), ICER(\(E_{1}\)) is lower, indicating \(E_{2}\) is less cost-efficient than \(E_{1}\). Therefore, we discard \(E_{2}\) as a standalone strategy.

The comparison of \(E_{1}\) and \(E_{12}\) is presented in Table 4. The ICER for strategies \(E_{1}\) and \(E_{12}\) are calculated as

$$\begin{aligned} \text {ICER}(E_{1}) = \frac{8.63 \times 10^{5}}{7.3 \times 10^{7}} = 0.0118 \end{aligned}$$

and

$$\begin{aligned} \text {ICER}(E_{12}) = \frac{-1.58 \times 10^{5}}{3.1 \times 10^{7}} = -0.0051. \end{aligned}$$

Since ICER(\(E_{12}\)) is less than ICER(\(E_{1}\)), this indicates that the combined strategy \(E_{12}\) is more cost-effective and efficient compared to the isolation control strategy \(E_{1}\) alone. Therefore, integrating both control strategies is identified as the most effective approach to reducing the burden of COVID-19 in the population.

Table 2 Total averted infections and total costs for the indicated control strategy using Indian COVID-19 data.
Table 3 Total averted infections, total costs, and ICER for the indicated control strategy using Indian COVID-19 data.
Table 4 Total averted infections, total costs, and ICER for the indicated control strategy using Indian COVID-19 data.

Discussion and conclusions

The present study developed the SAIUGR model, a six-compartment framework for analyzing COVID-19 transmission dynamics. Unlike traditional models, this model introduces a novel compartmentalization strategy by differentiating between reported and unreported symptomatic infections. Alongside the other compartments—namely susceptible, asymptomatic infected, isolated, and recovered—the model collectively encapsulates the various stages of disease progression and response. Our study provides a more nuanced understanding of disease spread, particularly in the Indian context, where diverse social and medical factors influence disease dynamics.

The structure of the model, with its six key compartments, enabled an in-depth exploration into the complexities of COVID-19 transmission. Utilizing 11 months of empirical data from India, the basic reproduction number was calculated as 2.147. This value of 2.147, a critical indicator of the pandemic intensity and potential spread, is shaped by several key parameters within the model. Specifically, the transmission rate, transition rate from asymptomatic to symptomatic, and detection rate of symptomatic cases played significant roles in influencing the dynamics observed. These parameters provided valuable insights into the interactions between different stages of infection, underscoring the importance of effective detection and isolation strategies.

To further understand these dynamics, a sensitivity analysis was conducted using the normalized forward sensitivity index, which assessed how variations in model parameters impact infection spread. This analysis highlighted the crucial role of accurate detection and efficient isolation in managing the pandemic, revealing that enhancing these rates can significantly alter the infection trajectory. Additionally, we extended the model to include optimal control methods, exploring the impacts of isolation and treatment strategies. Simulations performed in MATLAB demonstrated the potential effectiveness of these strategies in managing the pandemic. The cost-effectiveness analysis, using the incremental cost-effectiveness ratio, compared the cost and effectiveness of various health interventions, providing insights into the additional cost required to achieve a health benefit, such as a reduction in infection rates.

Building on the insights gained from our sensitivity analysis and control strategy simulations, our findings revealed that a combined control strategy, integrating both isolation and treatment measures, is not only more effective but also economically viable compared to implementing these strategies in isolation. This highlights the advantage of a synergistic approach in managing the pandemic efficiently. Furthermore, our study investigated the phenomena of transcritical bifurcation and endemic equilibrium. Transcritical bifurcation refers to the point at which the disease-free equilibrium state transitions to an endemic equilibrium state, indicating the potential for the disease to become a persistent part of the population. Endemic equilibrium represents a state where the disease consistently exists within the population at a steady level. By analyzing these phenomena, we gained insights into the stability of disease-free equilibrium and endemic equilibrium states under varying conditions. In addition, we explored the impact of specific parameters, such as isolation rates for reported and unreported symptomatic infections, on the prevalence of COVID-19, which enhanced our understanding of the dynamics and spread of the disease.

An essential aspect of our model is the inclusion of unreported symptomatic individuals, which enhances the accuracy of our transmission dynamics predictions. These individuals, although not detected by health authorities, contribute to the transmission of the virus and increase the overall disease burden. Our simulations demonstrated that unreported cases play a critical role in the spread of COVID-19, highlighting the importance of comprehensive detection and isolation strategies. By considering both reported and unreported cases, our model provides a more accurate and holistic view of the pandemic dynamics.

While our model provides valuable insights, it is important to acknowledge its inherent limitations, particularly regarding potential underreporting and assumptions about population mixing. One limitation is the omission of spatial diffusion effects, which can play a crucial role in the spread of infectious diseases. Spatial diffusion impacts how an epidemic propagates through different regions and populations, influencing the effectiveness of intervention strategies.

Future research should incorporate spatial dynamics to provide a more comprehensive understanding of disease spread. For instance, in52, a novel approach was proposed to investigate the stability analysis and dynamics of a reaction-diffusion susceptible-vacined-infectious-recovered epidemic model, highlighting the importance of spatial components in epidemic modeling. Additionally, in53, an optimal control problem of a SIR reaction-diffusion model with inequality constraints was discussed, which can inform future studies on the spatial aspects of epidemic control measures; see also54,55,56 for other models in this line. Future research directions also involve expanding the model to account for vaccination dynamics and environmental factors, aiming to enhance the model applicability and robustness against COVID-19 and other infectious diseases.

The choice of isolation and therapy as the main controls in our study was guided by their critical roles in managing infectious diseases. Isolation reduces the contact rate between infected and susceptible individuals, directly impacting the transmission dynamics. Therapy mitigates the severity and duration of the disease, improving recovery rates and reducing the overall infection burden. These controls were selected based on their proven effectiveness in epidemiological studies and their practical applicability in real-world public health strategies.

In real-life applications, the optimal control strategies suggested by our model can be translated into actionable public health measures. For instance, the optimal isolation strategy can be implemented through timely and widespread testing, contact tracing, and mandatory quarantine for infected individuals. The optimal therapy strategy can be applied by ensuring adequate healthcare resources, timely medical interventions, and efficient vaccination programs. This strategys, if implemented effectively, can reduce the spread of COVID-19 and other infectious diseases, thereby protecting public health and saving lives.

In conclusion, our research contributes to epidemiological modeling by demonstrating the importance of a multifaceted approach in understanding and combating COVID-19, as well as in preparing for future public health crises.