Evaluation of six months sputum culture conversion as a surrogate endpoint in a multidrug resistant-tuberculosis trial

Paul Meyvisch; Chrispin Kambili; Koen Andries; Nacer Lounis; Myriam Theeuwes; Brian Dannemann; An Vandebosch; Wim Van der Elst; Geert Molenberghs; Ariel Alonso

doi:10.1371/journal.pone.0200539

Abstract

The emergence of multidrug resistant-tuberculosis (MDR-TB), defined as Mycobacterium tuberculosis strains with in vitro resistance to at least isoniazid and rifampicin, has necessitated evaluation and validation of appropriate surrogate endpoints for treatment response in drug trials for MDR-TB. The trial that has demonstrated efficacy of bedaquiline, a diarylquinoline that inhibits mycobacterial ATP synthase, possesses the requisite features to conduct this evaluation. Approval of bedaquiline for use in MDR-TB was based primarily on the results of the controlled C208 Stage II study (ClinicalTrials.gov number, NCT00449644) including 160 patients randomized 1:1 to receive bedaquiline or placebo for 24 weeks when added to an 18–24-month preferred five-drug background regimen. Since randomization in C208 Stage II was preserved until study end, the trial results allow for the investigation of the complex relationship between sustained durable outcome with either Week 8 or Week 24 culture conversion as putative surrogate endpoints. The relationship between Week 120 outcome with Week 8 or Week 24 culture conversion was investigated using a descriptive analysis and with a recently developed statistical methodology for surrogate endpoint evaluation using methods of causal inference.

The results demonstrate that sputum culture conversion at 24 weeks is more reliable than sputum culture conversion at 8 weeks when assessing the outcome of adding one new drug to a MDR-TB regimen.

Figures

Citation: Meyvisch P, Kambili C, Andries K, Lounis N, Theeuwes M, Dannemann B, et al. (2018) Evaluation of six months sputum culture conversion as a surrogate endpoint in a multidrug resistant-tuberculosis trial. PLoS ONE 13(7): e0200539. https://doi.org/10.1371/journal.pone.0200539

Editor: Alejandro Escobar-Gutiérrez, Instituto de Diagnostico y Referencia Epidemiologicos, MEXICO

Received: November 24, 2017; Accepted: June 27, 2018; Published: July 19, 2018

Copyright: © 2018 Meyvisch et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting Information files.

Funding: Janssen funded the C208 Stage II trial and was involved in its design and conduct and in the collection and analysis of the data. All the authors who worked for Janssen at the time of the analysis and manuscript preparation (Paul Meyvisch, Chrispin Kambili, Koen Andries, Myriam Theeuwes, Brian Dannemann, An Vandebosch, Wim Van der Elst) played a role in the decision to publish and preparation of the manuscript.

Competing interests: We have the following interests: Janssen funded the C208 Stage II trial. Koen Andries, Nacer Lounis, An Vandebosch and Wim Van der Elst are full-time employees of Janssen Pharmaceutica. Chrispin Kambili is employed full-time by Johnson and Johnson. Paul Meyvisch, Myriam Theeuwes and Brian Dannemann were previously employed by Janssen; all are potential stockholders of Johnson and Johnson. Paul Meyvisch is a voluntary researcher at the I-BioStat at KU Leuven and Universiteit Hasselt and is currently employed by Galapagos NV, Mechelen (Belgium). Geert Molenberghs is a full-time employee of IBioStat at UHasselt and KU Leuven, Belgium. Ariel Alonso is a full-time employee of IBioStat at KU Leuven, Belgium. Koen Andries is co-inventor and has three patents on the use of quinoline derivatives for the treatment of mycobacterial diseases (rights of which have been transferred to Johnson & Johnson): 1. WO 2004/011436 – Quinoline Derivatives and Their Use as Mycobacterial Inhibitors; 2. WO 2005/117875 – Use of Substituted Quinoline Derivatives for the Treatment of Drug Resistant Mycobaterial Diseases; 3. WO 2006/067048 – Quinoline Derivatives for the Treatment of Latent Tuberculosis. There are no further patents, products in development or marketed products to declare. This does not alter our adherence to all the PLOS ONE policies on sharing data and materials.

Introduction

The use of surrogate endpoints in drug development as a basis for reaching conclusions about the benefits of therapy has been received with enthusiasm and concern [1–4]. Surrogates can hasten treatment benefits for patients when the surrogate proves to predict clinical benefit, but use of surrogates could result in the adoption of questionable therapies if insufficient rigor is applied in surrogate evaluation. This becomes paramount when assessing the effectiveness of adding an experimental drug to a regimen for treatment of multidrug resistant-tuberculosis (MDR-TB), defined as Mycobacterium tuberculosis resistant to at least isoniazid and rifampicin. Because tuberculosis drug trials are usually very long, valid surrogate endpoints measured during or at the end of treatment could reduce both the time and cost of assessing the efficacy of new regimens or drugs.

There has been much debate about biomarkers and surrogate endpoints in drug trials for drug-sensitive-tuberculosis (DS-TB) and MDR-TB. Treatment for DS-TB began as an 18-month regimen until the British Medical Research Council (BMRC) randomized controlled studies showed that a 6-month, short-course regimen was feasible [5,6]. Sputum culture conversion on solid media after 2–3 months of anti-DS-TB treatment has been proposed as a surrogate marker of relapse-free cure [7]. While this is likely the best dichotomous biomarker available, its statistical validity as a surrogate is questionable based on a re-analysis of the BMRC trial data [8,9]. Also, Phillips et al [10], in an analysis of the tREMoxTB trial, found that sputum-based markers poorly discriminate between favorable and unfavorable outcomes. In contrast, Wallis et al [11–13] have claimed that 2-month sputum culture status is a good predictor for outcome in DS-TB when combined with duration of treatment as an independent variable.

Design features of MDR-TB clinical trials are not always suitable for surrogate endpoint evaluation. Randomization needs to be preserved until the study end and treatment duration for each treatment group should preferably be kept the same in order to estimate the treatment difference on both the putative surrogate and long-term outcome. While some authors have chosen 8 weeks for surrogate assessment [14,15], there is growing evidence that for MDR-TB, sputum culture conversion at 24 weeks or later is of greater prognostic value for clinical outcome [16,17]. A key distinction to make is to differentiate between a prognostic marker and a surrogate endpoint. A prognostic marker relates to a clinical endpoint and is an indicator of treatment response. In contrast, a surrogate endpoint is intended to substitute for a clinical endpoint, predict clinical outcome, and is statistically evaluated [18]. So, while the association of week 24 culture conversion with clinical outcome may be stronger than earlier time points, this by itself does not necessarily demonstrate the statistical validity of the surrogate.

In a re-analysis of the 120-week bedaquiline (BDQ) Phase II trial (C208 Stage II) (ClinicalTrials.gov number, NCT00449644) [19], we examined the complex association between outcome and either Week 8 or 24 culture conversion as putative surrogate endpoints using a recent information-theoretic approach for statistical evaluation of surrogate endpoints that is based on causal inference [20].

Methods

C208 study design

The clinical trial demonstrating efficacy of BDQ in newly diagnosed patients with pulmonary MDR-TB was a randomized, placebo-controlled trial (C208) for which treatment assignment was not changed until study end [19,21]. Each site obtained approval of the study protocol from at least one (or more, if required by local regulations) independent ethics committee or institutional review board (S1 Table). The trial was conducted in accordance with the principles of Good Clinical Practice and the Declaration of Helsinki. All patients provided written informed consent before trial entry.

The trial consisted of two stages. The independent first stage was a single-country, placebo-controlled, randomized trial in a small group of MDR-TB patients (N = 47) to compare the safety and efficacy of adding BDQ for 8 weeks to a preferred five-drug MDR-TB regimen [21]. The second stage (main trial) was a multi-country, placebo-controlled, randomized trial in a larger group of MDR-TB patients (N = 160). C208 Stage II compared the efficacy and safety of BDQ given for 24 weeks (BDQ 400 mg once daily for 2 weeks, followed by 200 mg three times a week for 22 weeks) versus placebo when added to a preferred five-drug MDR-TB regimen that was given for 18–24 months [19]. While national treatment-program regimens were respected, the preferred five-drug background regimen was ethionamide, pyrazinamide, ofloxacin, kanamycin, and cycloserine [19].

In both stages, the primary endpoint was time to confirmed sputum culture conversion from positive to negative in liquid broth. Changes in the background regimen were allowed according to the results of drug susceptibility testing, because of unacceptable adverse events or supply interruption of the drugs. The background regimen was continued for 12–18 months after the planned end of BDQ treatment, with an anticipated ≥6-month treatment-free follow-up period. This allowed clinical outcome to be assessed at 104 weeks (26 months) in Stage I and 120 weeks (30 months) in Stage II after randomization.

While regulatory approval of the drug was obtained on the basis of ‘time to confirmed sputum culture conversion’ during the first 24 weeks in stage II, a binary endpoint of sputum culture conversion (achieved or not) was also used to demonstrate superiority of the BDQ-containing treatment group. Sputum culture conversion was evaluated after 24 weeks (treatment completion of BDQ or placebo) and again 24 weeks after the anticipated completion of the entire treatment regimen at the 120 week endpoint. The CONSORT flow diagram for the C208 Stage II trial is shown in Fig 1.

Download:

Fig 1. CONSORT flow diagram for the C208 Stage II trial [19].

*The modified intent-to-treat population was a subset of the intent-to-treat population that excluded nine patients (6 BDQ and 3 placebo) with Mycobacteria Growth Indicator Tube results that did not allow for primary efficacy evaluation (no evidence of culture positivity prior to first intake of blinded study drug or no results during the first 8 weeks after first intake), seven patients (3 BDQ and 4 placebo) infected with extensively drug-resistant tuberculosis, eight (4 BDQ and 4 placebo) with drug-sensitive tuberculosis, and four patients (0 BDQ and 4 placebo) for whom the multidrug-resistant tuberculosis status could not be confirmed.

https://doi.org/10.1371/journal.pone.0200539.g001

Data collection and definition of endpoints in C208 Stage II

Spot sputum samples to assess the presence or absence of Mycobacterium tuberculosis were collected in triplicate at every visit and qualitative assessment was done in liquid medium (Mycobacteria Growth Indicator Tube [MGIT], Becton Dickinson) [19,22]. Visits were scheduled weekly during the first 8 weeks and biweekly until Week 24. From Week 24 onwards, visits were scheduled monthly until Week 36 and three-monthly thereafter until the end of the trial (Week 120).

Sputum processing.

Sputum samples were decontaminated with N-acetyl-L-cysteine and sodium hydroxide solution and inoculated into MGIT tubes with the addition of oleic acid, albumin, dextrose and catalase (OADC) and polymyxin B, amphotericin B, nalidixic acid, trimethoprim and azlocillin (PANTA). MGIT tubes were incubated in the MGIT machine for 42 days, but were removed earlier if cultures flagged instrument-positive. The positive MGIT cultures were checked for growth of contaminating organisms by sub-inoculating the MGIT culture on a blood agar plate with overnight incubation. In addition, Ziehl-Neelsen staining was performed on each positive MGIT culture to check for the presence of acid-fast bacilli (AFB). Identification of the Mycobacterium tuberculosis complex was done on every positive MGIT culture using either the MPT64 antigen test or molecular tests. Quality control checking was done by growing the pan-sensitive H37Rv Mycobacterium tuberculosis strain on every new batch of MGIT tubes.

Aggregate scoring.

Triplicate culture results were summarized prior to analysis into one single measure with values ‘negative’, ‘positive’, ‘contaminated’ or ‘missing’ [22]. This single aggregate measure was ‘negative’ when at least one of the three samples was negative and none were positive and was ‘positive’ when at least one of the three samples was positive.

Drug-susceptibility testing.

Per protocol, drug-susceptibility testing (DST) was performed at a central laboratory (Institute of Tropical Medicine, Antwerp, Belgium) as previously described [19]. For DST, a culture was grown on Löwenstein–Jensen medium to generate enough colonies. DST for isoniazid, rifampicin, ethambutol, ofloxacin, ethionamide, kanamycin and capreomycin were performed on 7H11 agar (proportion method) and for pyrazinamide in the MGIT960 system. The quality control was performed by testing the pan-sensitive H37Rv Mycobacterium tuberculosis strain grown on 7H11 agar or using the MGIT960 system (pyrazinamide) with the same drug concentrations, each time the sensitivity of a clinical isolate was tested.

Study population and primary efficacy analysis.

All patients contributing to the efficacy analyses were culture positive at baseline. Patients whose pre-randomization sputum sample was culture negative were excluded from the efficacy analysis, as were those whose culture was shown to be drug-sensitive or extensively drug-resistant, defined as MDR-TB with additional resistance to injectable second-line drugs (amikacin, kanamycin, or capreomycin) and fluoroquinolones. After exclusion of these categories of patients, the number of patients retained for primary efficacy analysis was 132 (66 patients in each treatment group) [19].

Regarding baseline susceptibility to drugs in the background regimen, 51 patients in the BDQ group vs 54 patients in the placebo group were infected with MDR-TB, and 15 vs 12, respectively, were infected with pre-extensively drug-resistant TB, defined as MDR-TB isolates also with resistance to second-line injectables or fluoroquinolones [19].

Sputum culture conversion at 8 weeks, 24 weeks, and at the end of the trial used the same criterion for confirmed conversion, i.e., the patient had to have at least two consecutive negative cultures at least 25 days apart (with no positive intermediate cultures). Patients who prematurely dropped out of the trial were considered failures from time of drop-out onwards, irrespective of whether they culture converted at the time they dropped out. We emphasize that this approach is consistent with the primary analysis as reported previously [19]. Since in this definition, data for patients who dropped out were imputed with the same outcome irrespective of their actual conversion at that time, using this missing = failure analysis was expected to increase the level of association (surrogacy) between the endpoints at different time points. Therefore, a sensitivity analysis using multiple imputation statistical techniques [23] to impute the data after drop-out was also conducted.

Statistical evaluation of surrogate endpoints when both the surrogate and true endpoint are binary outcomes

In a single-trial setting, when both endpoints are expected to be normally distributed, a commonly used measure to evaluate surrogacy at the level of the individual patient is the adjusted association (γ), which is defined as γ = corr(S,T|Z), where corr is the Pearson correlation coefficient, S is the surrogate endpoint, T is the true endpoint, and Z is an indicator variable for treatment [2]. However, when we move away from settings in which both endpoints are normally distributed it is no longer clear how γ should be quantified. Importantly, it has now been clearly established that an association between the putative surrogate and the true endpoint does not guarantee the validity of the former. Indeed, the main idea behind the use of a surrogate endpoint is the prediction of the treatment effect on the true endpoint based on the treatment effect on the surrogate endpoint. Although desirable, a strong association between S and T is not enough to achieve this goal.

Individual causal association.

Recently, a metric of surrogacy was proposed [20], the individual causal association (ICA), which uses information-theoretic concepts and a causal inference model for a binary surrogate endpoint and a true endpoint. The fundamental quantities that are used to determine the ICA are the Individual Causal Treatment Effects. The term ‘causal treatment effect’ refers to the causal effect of a given treatment on an outcome of interest like S and T. In the Neyman-Rubin ‘Potential Outcomes Framework’ of causality, an individual causal treatment effect is defined for each individual patient in terms of two ‘potential outcomes’. Each patient has one outcome that would manifest if the patient were exposed to the treatment and another outcome that would manifest if they were exposed to the control. The individual causal treatment effect is the difference between these two potential outcomes for the true endpoint and the surrogate endpoint, respectively (ΔT or ΔS). However, these individual-level causal treatment effects are unobservable because individual patients can only receive the treatment or the control, but not both. For any individual patient, ΔT and ΔS will be -1 (Harm), 0 (Equal), and 1 (Benefit), depending on how BDQ performs versus placebo on either endpoint.

The ICA is defined as the association between both individual causal treatment effects for the true endpoint and the surrogate endpoint, i.e., the association between ΔT and ΔS. As specified [20], the ICA always lies in the unit interval and has a simple and appealing interpretation in terms of uncertainty reduction. It takes a value of 1 if there is a deterministic relationship between ΔS and ΔT, and therefore ΔS predicts ΔT without error, i.e., knowing the treatment effect on the surrogate endpoint provides full information about the treatment effect on the true endpoint. In addition, when the ICA equals zero, both treatment effects are independent, and knowing the treatment effect on the surrogate endpoint does not inform about the treatment effect on the true endpoint. Like the individual causal treatment effects, ΔT and ΔS, the ICA cannot be estimated from the data directly. A two-step Monte-Carlo procedure was introduced to assess the value of the ICA [20]. The immediate consequence of ΔS and ΔT being non-identifiable is that estimation of the ICA results in a density rather than a fixed value.

While the interpretation of the ICA is straightforward, there is currently little guidance on how large it should be for a surrogate to qualify as acceptable. However, it is plausible to assume that when comparing two or more surrogates, the one with the largest ICA can be considered the best.

Surrogate predictive function.

Another technique that has been developed to assist in interpreting the relationship between ΔT and ΔS is the Surrogate Predictive Function (SPF) [24]. As previously stated, S is a good surrogate for T, when ΔS can predict ΔT with a certain level of precision. Even though the ICA offers a general assessment of the surrogate predictive power, further insight can be gained from studying the conditional distribution of ΔT given ΔS. The general idea is to quantify the individual probabilities of all possible outcomes for the treatment effect on the true endpoint, given the treatment effect on the surrogate endpoint P(ΔT = i|ΔS = j) for i,j ϵ {-1,0,1}. All analyses are consistent and complimentary to the derivation of the ICA in the sense that it is based on the same two-step Monte-Carlo procedure as referenced above [20].

Results

Descriptive analysis

Denoting Week 8 culture conversion as the surrogate endpoint S₈, Week 24 culture conversion as the surrogate endpoint S₂₄ and favorable outcome at Week 120 as the true endpoint T, Tables 1 and 2 present the relationship between S₈ and T and between S₂₄ and T, respectively, for BDQ and placebo.

Download:

Table 1. Relationship between S₈ and T for BDQ and placebo.

https://doi.org/10.1371/journal.pone.0200539.t001

Download:

Table 2. Relationship between S₂₄ and T for BDQ and placebo.

https://doi.org/10.1371/journal.pone.0200539.t002

A first observation is that there is a clear treatment effect on S₈ (p = 0.005, 95% CI: 7.8–40.7%)_, S₂₄ (p = 0.009, 95% CI: 5.7–36.7%) and on T (p = 0.036, 95% CI: 1.4–34.9%) using the Pearson χ² test.

Secondly, the off-diagonal elements for the relationship between S₂₄ and T shown in Table 2 (0 and 11) and (4 and 13), are generally smaller than for the relationship between S₈ and T shown in Table 1 (15 and 11) and (17 and 9), indicating that S₂₄ is in stronger ‘agreement’ with T compared with S₈. This is also apparent from the odds ratio (OR) estimates and corresponding 95% confidence intervals, which are only significant for S₂₄ in either treatment group. Do note that the OR for BDQ in Table 2 is determined using the Firth type estimate [25].

Looking specifically at the association between S₂₄ and T, 11 patients in the BDQ group and 13 patients in the placebo group who were Week 24 responders were considered non-responders at endpoint. Of the 11 patients in the BDQ group, two died, five discontinued from the trial, and four reverted to positive culture. Of the 13 patients in the placebo group, five patients discontinued and eight patients reverted to positive culture. Additionally, four patients in the placebo group who were considered non-responder at Week 24 subsequently culture converted and were considered responders at Week 120. In contrast, all patients in the BDQ group who responded at Week 120 were already responders at 24 weeks. The analysis of the relationship between S₂₄ and T shows a significant association for both BDQ and placebo patients; however, the association was stronger in the BDQ group than in the placebo group. This indicates that the addition of BDQ has a considerable impact as a greater proportion of patients culture convert and fewer relapse in the BDQ group. This change in relationship between surrogate and true endpoint upon addition of a new drug requires sound methodology to evaluate the statistical validity of the surrogates S₈ and S₂₄.

S2 Table presents the relationship between S₂₄ and T for BDQ and between S₂₄ and T for placebo, based on AFB rather than MGIT. In the AFB smear test, success was defined similarly compared to MGIT-based outcomes, i.e., the patient had to have at least two consecutive negative smears at least 25 days apart (with no positive intermediate smears). Patients who prematurely dropped out of the trial were considered failures from time of drop-out onwards, irrespective of whether their smear converted at the time they dropped out.

Values for sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) are provided in Table 3. The data show that S₂₄ is fairly specific for T with values of 100% for BDQ and 86. 2% for control, respectively, while sensitivity is much lower, i.e., 56% for BDQ and 64. 9% for control. Specificity, PPV, and NPV are consistently higher for S₂₄ than for S₈ regardless of treatment assignment.

Download:

Table 3. Sensitivity, specificity, positive predictive value and negative predictive value of S₈ and S₂₄.

https://doi.org/10.1371/journal.pone.0200539.t003

Re-analysis of the primary endpoint using multiple imputation

Among the 132 patients, 5 (3.8%) patients dropped out prior to Week 8 and an additional 17 (12.9%) patients dropped out between Week 8 and Week 24. A total of 85 (64.4%) patients completed the trial. It is further noted that dropout is monotone, i.e., response rates are available for all patients at all time points until the time of drop out. An overview of the number of observed (O) and missing (M) patients is presented in Table 4.

Download:

Table 4. Drop-out rate during the trial.

https://doi.org/10.1371/journal.pone.0200539.t004

Table 5 shows that the vast majority of patients at later time points were responders. Indeed, in the most extreme case, the response rate (M = F) in the BDQ group at Week 120 was 62% while another 35% of patients were failures as a result of drop out. This implies that for patients who were observed to complete the trial only 3% did not reach culture conversion, while for 35% of patients their outcome was not observed at trial end.

Download:

Table 5. Response rates (M = F) and % missing data during the trial.

https://doi.org/10.1371/journal.pone.0200539.t005

As the primary endpoint used a missing = failure approach as a single imputation which artificially enhances the association in the evaluation of surrogacy for early dropouts, a multiple imputation analysis was performed as a sensitivity analysis. Given that the missing data pattern is monotone and consists of binary outcome variables (response/ no response) [23], a logistic regression model was fitted for each time point, with the previous time points and treatment group as covariates. This way, the response variable at Week 8 was only regressed for treatment. The response variable at Week 24 was fitted using Week 8 response and treatment as independent variables etc… The imputed observations were subsequently obtained using the imputation algorithm as described in van Buuren, S 2012 [23]. A total of 5 imputations were deemed sufficient.

The results indicate that application of the multiple imputation increased the response rates at all time points as observed in Fig 2. In addition, the imputed profiles behaved consistently. The average of the 5 imputations at Week 8, Week 24 and Week 120 are displayed in Table 6.

Download:

Fig 2. Response rate over time in the missing = failure and multiple imputation analyses.

https://doi.org/10.1371/journal.pone.0200539.g002

Download:

Table 6. Average response rate over time in the multiple imputation analysis.

https://doi.org/10.1371/journal.pone.0200539.t006

S3 Table presents the relationship between S₂₄ and T for BDQ and between S₂₄ and T for placebo, based on culture conversion using the multiple imputation method.

In the next section, individual-level surrogacy using the Week 8 and Week 24 interim results as putative surrogate endpoints is evaluated. The multiple imputations serve as a sensitivity analysis to gain additional insights.

Individual causal association

The ICA for each of the two putative surrogate endpoints S₈ and S₂₄ can be assessed using the R library Surrogate, which is available in CRAN [26]. The R code used is available upon request. In addition to the two binary surrogate endpoints S₈ and S₂₄, we also investigated a third putative surrogate endpoint, denoted as S₈×S₂₄, where success is defined as culture conversion at both Week 8 and Week 24, and a fourth putative surrogate endpoint, which is based on AFB, rather than MGIT. The densities of the ICA with distributional statistics for each of the four binary surrogate endpoints are depicted in Fig 3.

Download:

Fig 3. Densities and distribution of the individual causal association between ΔT and ΔS for S₈, S₈xS₂₄, S₂₄ based on MGIT, and S₂₄ based on AFB.

https://doi.org/10.1371/journal.pone.0200539.g003

S₂₄ performs consistently better than S₈ and S₈×S₂₄, both of which perform similarly (Table 7). It is clear from above analysis that S₂₄ is the most predictive surrogate even though the density of ICA is also relatively low for S₂₄. In addition, culture conversion at Week 24 was more predictive than smear conversion.

Download:

Table 7. Distribution of individual causal association between ΔT and ΔS for S₈, S₈xS₂₄, S₂₄ based on MGIT, and S₂₄ based on AFB.

https://doi.org/10.1371/journal.pone.0200539.t007

Sensitivity analyses evaluating the ICA using the multiple imputation method revealed that the result worsened for Week 24 culture conversion as the surrogate endpoint compared with the missing = failure method. The impact of missing data on the Week 24 surrogate endpoint was larger than on the Week 8 surrogate endpoint (Fig 4). In addition, large differences were observed among the ICA densities of the imputed data sets. Note however that there were also large differences in odds ratios as shown in the S3 Table.

Download:

Fig 4. ICA evaluated using the missing = failure and multiple imputation methods at A) Week 8 and B) Week 24.

https://doi.org/10.1371/journal.pone.0200539.g004

Surrogate predictive function

Fig 5 shows conditional probabilities of possible outcomes for the treatment effect on the true endpoint (ΔT), given the effect on the surrogate endpoint (ΔS₈ or ΔS₂₄). This graphical representation of the SPF provides more granular insight as to how predictive ΔS₂₄ is for ΔT. From the nine conditional probabilities of ΔT given ΔS₂₄ depicted in Fig 5, it is apparent that two conditional probabilities, i.e., P(ΔT = 1|ΔS₂₄ = -1) and P(ΔT = -1|ΔS₂₄ = 1) are very low, which has a straightforward interpretation. Indeed, a harmful effect of BDQ on the surrogate endpoint rules out a beneficial effect of BDQ on the true endpoint. The converse is also true, i.e., a beneficial effect of BDQ on the surrogate endpoint is unlikely to result in a harmful effect of BDQ on the true endpoint. These probabilities remained low after multiple imputation of the missing data.

Download:

Fig 5.

Surrogate predictive value for A) S₈ and B) S₂₄. The conditional probabilities of (A) ΔT given ΔS₈ or (B) ΔT given ΔS₂₄ are shown. For any individual patient, ΔT and ΔS will be -1 (Harm), 0 (Equal), and 1 (Benefit).

https://doi.org/10.1371/journal.pone.0200539.g005

Conversely, the two probabilities that were clearly highest were P(ΔT = 1|ΔS₂₄ = 1) and P(ΔT = 0|ΔS₂₄ = 0). This means that patients in whom BDQ will be beneficial at the surrogate endpoint (i.e, would have a positive response with BDQ but not with placebo) are also expected to benefit from BDQ at the true endpoint (ΔT = 1|ΔS₂₄ = 1), with a median probability of 74%. Patients in whom the outcome on BDQ and placebo was equal at the surrogate endpoint, (i.e., would fail on either treatment or alternatively respond to either treatment) are expected to also have an equal outcome at the true endpoint (ΔT = 0|ΔS₂₄ = 0), with a median probability of 70%. Note that the latter median probabilities P(ΔT = 0|ΔS₂₄ = 0) were further increased to 81%, 73%, 76%, 79% and 78% respectively using the imputed data. In contrast, the median probabilities of P(ΔT = 1|ΔS₂₄ = 1) substantially decreased to 37%, 66%, 62%, 21% and 58% respectively, at the expense of comparable increases of the median probabilities of P(ΔT = 0|ΔS₂₄ = 1).

A less clear picture was seen from ΔS₈, as all median probabilities were in the range of 10% to 50%, which makes it difficult to draw firm conclusions as to which value of ΔS₈ is likely (or unlikely) to correspond with a value of ΔT.

Discussion

In the BDQ registrational Phase II trials (C208 Stage II and C209) [19,27], the selection of Week 24 instead of Week 8 culture conversion as an interim endpoint was based on a number of considerations. First, the treatment duration of BDQ or placebo (added to a preferred five-drug MDR-TB regimen) was 24 weeks, and it was plausible to evaluate at the end of the Week 24 treatment period. We also anticipated that culture conversion in patients with MDR-TB would generally be slower than in patients with DS-TB [28]. Another argument in favor of a Week 24 interim endpoint was our belief that culture in liquid media would be more sensitive than traditionally used solid media and paradoxically increase sputum culture conversion times. In light of the results presented here, we conclude that the selection of 24 weeks as an interim endpoint was a reasonable approach.

The current work sought to compare Week 8 versus Week 24 sputum culture conversion as a surrogate endpoint for favorable outcome at Week 120 in the treatment of MDR-TB that was based on the results of a single Phase II trial. Our results show that Week 8 culture conversion is a poor surrogate whereas Week 24 culture conversion performs better, in terms of specificity, PPV, NPV, and the ICA and SPF. This finding is important both for the design of future MDR-TB trials and in the context of individual patient care where failure to culture convert may prompt re-evaluation of ‘a failing regimen’. Even though the ICA was relatively low for the Week 24 time point, the analysis based on the SPF clearly shows that some situations can be confidently ruled out. For instance, it seems to be very unlikely that a beneficial effect of BDQ on Week 120 culture conversion (the true endpoint) is to be expected given a harmful effect at Week 24 (the surrogate endpoint). Conversely, a beneficial effect of BDQ on Week 24 culture conversion (surrogate endpoint) is unlikely to result in a harmful effect at Week 120 (true endpoint).

The relatively low ICA values are not surprising for a number of reasons. First, substantial information is lost in dichotomizing surrogate endpoints (i.e., S₈, and S₂₄). Alternative surrogate endpoints such as ‘time to sputum culture conversion’ and rate of bacterial load decline may perform better but require further investigation [19,29]. A methodology to evaluate continuous surrogate endpoints in combination with binary true endpoints is currently being developed and will allow evaluation of these putative surrogates with S₂₄, using the ICA as a common measure of association. Another reason for the low ICA density is the relatively high number of patients on placebo who either converted after Week 24 or who relapsed towards the end of the trial. Finally, a sensitivity analysis using multiple imputation has demonstrated that the high rate of dropout may artificially enhance the agreement between S₂₄ and the true endpoint (Week 120), resulting in elevated values for the ICA.

In retrospect, given that C208 Stage II had an add-on superiority design during which BDQ was added for 24 weeks to a standardized background regimen, acceptance of culture conversion at 24 weeks as a surrogate endpoint appears to have been a reasonable approach. BDQ not only shortened the time to culture conversion but also prevented relapse many months after treatment with BDQ stopped, in a setting where both treatment groups had a background regimen that was similar in composition and duration, acknowledging that more changes to the background regimen were made in the placebo group [19].

The field of MDR-TB treatment is evolving. In 2016, the WHO recommended the 9-month short-course regimen for the treatment of uncomplicated MDR-TB [30]. Various clinical trial initiatives are looking at shortened, simplified regimens for treatment of MDR-TB, including the endTB initiative (NCT02754765), TB PRACTECAL (NCT02589782), and Nix-TB (NCT02333799) [31], looking at 6–9 month, all oral regimens containing three to five drugs, most of which combine potent new drugs with existing or repurposed group five drugs [28]. As the field moves away from the current 18–24 month regimens and towards newer simplified short-course regimens composed of several strong drugs, it is possible that time to culture conversion will occur earlier, possibly nearing 100% by 2 months akin to DS-TB, in which case a 6-month surrogate endpoint may no longer have an advantage over the 2-month surrogate endpoint [28,31,32].

Supporting information

S1 Table. Institutional review boards.

https://doi.org/10.1371/journal.pone.0200539.s001

(DOCX)

S2 Table.

A) Relationship between S₂₄ (on the basis of AFB smear conversion) and T for BDQ; B) Relationship between S₂₄ (on the basis of AFB smear conversion) and T for Placebo control.

https://doi.org/10.1371/journal.pone.0200539.s002

(DOCX)

S3 Table.

A) Relationship between S₂₄ (on the basis of culture conversion) and T for BDQ: imputed values; B) Relationship between S₂₄ (on the basis of culture conversion) and T for Placebo control: imputed values.

https://doi.org/10.1371/journal.pone.0200539.s003

(DOCX)

Acknowledgments

We would like to thank the patients and their families for their participation and support during the C208 study, as well as study center staff, principal investigators and the Janssen study team. We acknowledge Ian Woolveridge of Zoetic Science, an Ashfield company, Macclesfield, UK, for assistance in technical editing and coordinating and collating author contributions, which was funded by Janssen.

References

1. Alonso A, Molenberghs G. Surrogate endpoints: Hopes and perils. Expert Rev Pharmacoecon Outcomes Res. 2008;8: 255–259. pmid:20528377
- View Article
- PubMed/NCBI
- Google Scholar
2. Buyse M, Molenberghs G. Criteria for the validation of surrogate endpoints in randomized experiments. Biometrics. 1998;54: 1014–1029. pmid:9840970
- View Article
- PubMed/NCBI
- Google Scholar
3. Buyse M, Molenberghs G, Burzykowski T, Renard D, Geys H. The validation of surrogate endpoints in meta-analyses of randomized experiments. Biostatistics. 2000;1: 49–67. pmid:12933525
- View Article
- PubMed/NCBI
- Google Scholar
4. Fleming TR. Surrogate endpoints and FDA’s accelerated approval process. Health Aff (Millwood). 2005;24: 67–78.
- View Article
- Google Scholar
5. East African/British Medical Research Council. Controlled clinical trial of four 6-month regimens of chemotherapy for pulmonary tuberculosis. Second report. Am Rev Respir Dis. 1976;114: 471–5. pmid:788570
- View Article
- PubMed/NCBI
- Google Scholar
6. Singapore Tuberculosis Service/British Medical Research Council. Clinical trial of six-month and four-month regimens of chemotherapy in the treatment of pulmonary tuberculosis. The results up to 30 months. Tubercle. 1981;62: 95–102. pmid:7029838
- View Article
- PubMed/NCBI
- Google Scholar
7. Wallis RS, Wang C, Doherty TM, Onyebujoh P, Vahedi M, Laang H, et al. Biomarkers for tuberculosis disease activity, cure, and relapse. Lancet Infect Dis. 2010;10: 68–69. pmid:20113972.
- View Article
- PubMed/NCBI
- Google Scholar
8. Phillips PPJ, Fielding K, Nunn AJ. An evaluation of culture results during treatment for tuberculosis as surrogate endpoints for treatment failure and relapse. PLoS One. 2013;8: e63840. pmid:23667677
- View Article
- PubMed/NCBI
- Google Scholar
9. Davies GR. Early clinical development of anti-tuberculosis drugs: Science, statistics and sterilizing activity. Tuberculosis. 2010;90: 171–176. pmid:20382567
- View Article
- PubMed/NCBI
- Google Scholar
10. Phillips PP, Mendel CM, Burger DM, Crook AM, Nunn AJ, Dawson R, et al. Limited role of culture conversion for decision making in individual patient care and for advancing novel regimens to confirmatory clinical trials. BMC Medicine. 2016;14: 19. pmid:26847437
- View Article
- PubMed/NCBI
- Google Scholar
11. Wallis RS, Peppard T. Early biomarkers and regulatory innovation in multidrug-resistant tuberculosis. Clin Infect Dis. 2015; 61 Suppl 3: S160–3. https://doi.org/10.1093/cid/civ612.
- View Article
- Google Scholar
12. Wallis RS, Peppard T, Hermann D. Month 2 culture status and treatment duration as predictors of recurrence in pulmonary tuberculosis: model validation and update. PloS ONE. 2015;10: e0125403. pmid:25923700
- View Article
- PubMed/NCBI
- Google Scholar
13. Wallis RS, Wang C, Meyer D, Thomas N. Month 2 culture status and treatment duration as predictors of tuberculosis relapse risk in a meta-regression model. PLoS ONE. 2013;8: e71116. pmid:23940699
- View Article
- PubMed/NCBI
- Google Scholar
14. Lee M, Lee J, Carroll MJ, Choi H, Min S, Song T, et al. Linezolid for treatment of chronic extensively drug-resistant tuberculosis. N Engl J Med. 2012;367: 1508–1518. pmid:23075177
- View Article
- PubMed/NCBI
- Google Scholar
15. Gler MT, Skripconoka V, Sanchez-Garavito E, Xiao H, Cabrera-Rivero JL, Vargas-Vasquez DE, et al. Delamanid for multidrug-resistant pulmonary tuberculosis. N Engl J Med. 2012;366: 2151–2160. pmid:22670901
- View Article
- PubMed/NCBI
- Google Scholar
16. Holtz TH, Sternberg M, Kammerer S, Laserson KF, Riekstina V, Zarovska E, et al. Time to sputum culture conversion in multidrug-resistant tuberculosis: predictors and relationship to treatment outcome. Ann Intern Med. 2006;144: 650–659. pmid:16670134
- View Article
- PubMed/NCBI
- Google Scholar
17. Kurbatova EV, Cegielski JP, Lienhardt C, Akksilp R, Bayona J, Becerra MC, et al. Sputum culture conversion as a prognostic marker for end-of-treatment outcome in patients with multidrug-resistant tuberculosis: a secondary analysis of data from two observational cohort studies. Lancet Respir Med. 2015;3: 201–209. doi: published online Feb 26. pmid:25726085
- View Article
- PubMed/NCBI
- Google Scholar
18. Lassere MN, Johnson KR, Boers M, Tugwell P, Brooks P, Simon L, et al. Definitions and validation criteria for biomarkers and surrogate endpoints: development and testing of a quantitative hierarchical levels of evidence schema. J Rheumatol. 2007;34: 607–615. pmid:17343307
- View Article
- PubMed/NCBI
- Google Scholar
19. Diacon AH, Pym A, Grobusch MP, de los Rios JM, Gotuzzo E, Vasilyeva I, et al. Multidrug-resistant tuberculosis and culture conversion with bedaquiline. N Engl J Med. 2014;371: 723–732. pmid:25140958
- View Article
- PubMed/NCBI
- Google Scholar
20. Alonso A, Van der Elst W, Molenberghs G, Buyse M, Burzykowski T. An information-theoretic approach for the evaluation of surrogate endpoints based on causal inference. Biometrics. 2016;72: 669–677. pmid:26864244
- View Article
- PubMed/NCBI
- Google Scholar
21. Diacon AH, Pym A, Grobusch M, Patientia R, Rustomjee R, Page-Shipp L, et al. The diarylquinoline TMC207 for multidrug-resistant tuberculosis. N Engl J Med. 2009;360: 2397–2405. pmid:19494215
- View Article
- PubMed/NCBI
- Google Scholar
22. Diacon AH, van Brakel E, Lounis N, Meyvisch P, Van Baelen B, De Marez T, et al. Triplicate Sputum Cultures for Efficacy Evaluation of Novel Antituberculosis Regimens. Am J Respir Crit Care Med. 2017;196: 1612–5. pmid:28448721
- View Article
- PubMed/NCBI
- Google Scholar
23. van Buuren S. Flexible imputation of missing data. Boca Raton: Chapman & Hall/CRC; 2012.
24. Alonso A, Van der Elst W, Meyvisch P. Assessing a surrogate predictive value: A causal inference approach. Stat Med. 2017;36: 1083–1098. pmid:27966231
- View Article
- PubMed/NCBI
- Google Scholar
25. Heinze G, Schemper M, A solution to the problem of separation in logistic regression. Stat Med. 2002;21: 2409–2419. pmid:12210625
- View Article
- PubMed/NCBI
- Google Scholar
26. Van der Elst W, Meyvisch P, Alonso A. Evaluation of surrogate endpoints in clinical trials. https://cran.r-project.org/web/packages/Surrogate (accessed: June 13, 2017).
27. Pym AS, Diacon AH, Tang SJ, Conradie F, Danilovits M, Chuchottaworn C, et al. Bedaquiline in the treatment of multidrug- and extensively drug-resistant tuberculosis. Eur Respir J. 2016;47: 564–574. pmid:26647431
- View Article
- PubMed/NCBI
- Google Scholar
28. World Health Organization. Treatment of tuberculosis: guidelines. 4th Edn (WHO/HTM/TB/2009.420). www.aidsdatahub.org/dmdocuments/Treatment_of_TB_Guidelines.pdf (last accessed: November 23, 2017).
29. Svensson EM and Karlsson MO. Modelling of mycobacterial load reveals bedaquiline's exposure-response relationship in patients with drug-resistant TB. J Antimicrob Chemother. 2017;72: 3398–3405. pmid:28961790
- View Article
- PubMed/NCBI
- Google Scholar
30. World Health Organization. Rapid diagnostic test and shorter, cheaper treatment signal new hope for multidrug-resistant tuberculosis patients. May 12 2016. http://www.who.int/mediacentre/news/releases/2016/multidrug-resistant-tuberculosis/en/ (last accessed: November 23, 2017).
31. Conradie F, Diacon AH, Everitt D, Mendel C, van Nieckerk C, Howell P, et al. The NIX-TB trial of pretomanid, bedaquiline and linezolid to treat XDR-TB. Presented at the 2017 Conference on Retroviruses and Opportunistic Infections (CROI), CROI Foundation in partnership with the International Antiviral Society-USA, Seattle, WA, February 13–16, 2017. Abstract 80LB.
32. Dawson R, Harris K, Conradie A, Burger D, Murray S, Mendel C, et al. Efficacy of bedaquiline, pretomanid, moxifloxacin & PZA (BPaMZ) against DS- & MDR-TB. Presented at the 2017 Conference on Retroviruses and Opportunistic Infections (CROI), CROI Foundation in partnership with the International Antiviral Society-USA, Seattle, WA, February 13–16, 2017. Abstract 724LB.

[ref1] 1. Alonso A, Molenberghs G. Surrogate endpoints: Hopes and perils. Expert Rev Pharmacoecon Outcomes Res. 2008;8: 255–259. pmid:20528377
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Buyse M, Molenberghs G. Criteria for the validation of surrogate endpoints in randomized experiments. Biometrics. 1998;54: 1014–1029. pmid:9840970
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Buyse M, Molenberghs G, Burzykowski T, Renard D, Geys H. The validation of surrogate endpoints in meta-analyses of randomized experiments. Biostatistics. 2000;1: 49–67. pmid:12933525
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Fleming TR. Surrogate endpoints and FDA’s accelerated approval process. Health Aff (Millwood). 2005;24: 67–78.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref5] 5. East African/British Medical Research Council. Controlled clinical trial of four 6-month regimens of chemotherapy for pulmonary tuberculosis. Second report. Am Rev Respir Dis. 1976;114: 471–5. pmid:788570
View Article
PubMed/NCBI
Google Scholar

[17] View Article

[18] PubMed/NCBI

[19] Google Scholar

[ref6] 6. Singapore Tuberculosis Service/British Medical Research Council. Clinical trial of six-month and four-month regimens of chemotherapy in the treatment of pulmonary tuberculosis. The results up to 30 months. Tubercle. 1981;62: 95–102. pmid:7029838
View Article
PubMed/NCBI
Google Scholar

[21] View Article

[22] PubMed/NCBI

[23] Google Scholar

[ref7] 7. Wallis RS, Wang C, Doherty TM, Onyebujoh P, Vahedi M, Laang H, et al. Biomarkers for tuberculosis disease activity, cure, and relapse. Lancet Infect Dis. 2010;10: 68–69. pmid:20113972.
View Article
PubMed/NCBI
Google Scholar

[25] View Article

[26] PubMed/NCBI

[27] Google Scholar

[ref8] 8. Phillips PPJ, Fielding K, Nunn AJ. An evaluation of culture results during treatment for tuberculosis as surrogate endpoints for treatment failure and relapse. PLoS One. 2013;8: e63840. pmid:23667677
View Article
PubMed/NCBI
Google Scholar

[29] View Article

[30] PubMed/NCBI

[31] Google Scholar

[ref9] 9. Davies GR. Early clinical development of anti-tuberculosis drugs: Science, statistics and sterilizing activity. Tuberculosis. 2010;90: 171–176. pmid:20382567
View Article
PubMed/NCBI
Google Scholar

[33] View Article

[34] PubMed/NCBI

[35] Google Scholar

[ref10] 10. Phillips PP, Mendel CM, Burger DM, Crook AM, Nunn AJ, Dawson R, et al. Limited role of culture conversion for decision making in individual patient care and for advancing novel regimens to confirmatory clinical trials. BMC Medicine. 2016;14: 19. pmid:26847437
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref11] 11. Wallis RS, Peppard T. Early biomarkers and regulatory innovation in multidrug-resistant tuberculosis. Clin Infect Dis. 2015; 61 Suppl 3: S160–3. https://doi.org/10.1093/cid/civ612.
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref12] 12. Wallis RS, Peppard T, Hermann D. Month 2 culture status and treatment duration as predictors of recurrence in pulmonary tuberculosis: model validation and update. PloS ONE. 2015;10: e0125403. pmid:25923700
View Article
PubMed/NCBI
Google Scholar

[44] View Article

[45] PubMed/NCBI

[46] Google Scholar

[ref13] 13. Wallis RS, Wang C, Meyer D, Thomas N. Month 2 culture status and treatment duration as predictors of tuberculosis relapse risk in a meta-regression model. PLoS ONE. 2013;8: e71116. pmid:23940699
View Article
PubMed/NCBI
Google Scholar

[48] View Article

[49] PubMed/NCBI

[50] Google Scholar

[ref14] 14. Lee M, Lee J, Carroll MJ, Choi H, Min S, Song T, et al. Linezolid for treatment of chronic extensively drug-resistant tuberculosis. N Engl J Med. 2012;367: 1508–1518. pmid:23075177
View Article
PubMed/NCBI
Google Scholar

[52] View Article

[53] PubMed/NCBI

[54] Google Scholar

[ref15] 15. Gler MT, Skripconoka V, Sanchez-Garavito E, Xiao H, Cabrera-Rivero JL, Vargas-Vasquez DE, et al. Delamanid for multidrug-resistant pulmonary tuberculosis. N Engl J Med. 2012;366: 2151–2160. pmid:22670901
View Article
PubMed/NCBI
Google Scholar

[56] View Article

[57] PubMed/NCBI

[58] Google Scholar

[ref16] 16. Holtz TH, Sternberg M, Kammerer S, Laserson KF, Riekstina V, Zarovska E, et al. Time to sputum culture conversion in multidrug-resistant tuberculosis: predictors and relationship to treatment outcome. Ann Intern Med. 2006;144: 650–659. pmid:16670134
View Article
PubMed/NCBI
Google Scholar

[60] View Article

[61] PubMed/NCBI

[62] Google Scholar

[ref17] 17. Kurbatova EV, Cegielski JP, Lienhardt C, Akksilp R, Bayona J, Becerra MC, et al. Sputum culture conversion as a prognostic marker for end-of-treatment outcome in patients with multidrug-resistant tuberculosis: a secondary analysis of data from two observational cohort studies. Lancet Respir Med. 2015;3: 201–209. doi: published online Feb 26. pmid:25726085
View Article
PubMed/NCBI
Google Scholar

[64] View Article

[65] PubMed/NCBI

[66] Google Scholar

[ref18] 18. Lassere MN, Johnson KR, Boers M, Tugwell P, Brooks P, Simon L, et al. Definitions and validation criteria for biomarkers and surrogate endpoints: development and testing of a quantitative hierarchical levels of evidence schema. J Rheumatol. 2007;34: 607–615. pmid:17343307
View Article
PubMed/NCBI
Google Scholar

[68] View Article

[69] PubMed/NCBI

[70] Google Scholar

[ref19] 19. Diacon AH, Pym A, Grobusch MP, de los Rios JM, Gotuzzo E, Vasilyeva I, et al. Multidrug-resistant tuberculosis and culture conversion with bedaquiline. N Engl J Med. 2014;371: 723–732. pmid:25140958
View Article
PubMed/NCBI
Google Scholar

[72] View Article

[73] PubMed/NCBI

[74] Google Scholar

[ref20] 20. Alonso A, Van der Elst W, Molenberghs G, Buyse M, Burzykowski T. An information-theoretic approach for the evaluation of surrogate endpoints based on causal inference. Biometrics. 2016;72: 669–677. pmid:26864244
View Article
PubMed/NCBI
Google Scholar

[76] View Article

[77] PubMed/NCBI

[78] Google Scholar

[ref21] 21. Diacon AH, Pym A, Grobusch M, Patientia R, Rustomjee R, Page-Shipp L, et al. The diarylquinoline TMC207 for multidrug-resistant tuberculosis. N Engl J Med. 2009;360: 2397–2405. pmid:19494215
View Article
PubMed/NCBI
Google Scholar

[80] View Article

[81] PubMed/NCBI

[82] Google Scholar

[ref22] 22. Diacon AH, van Brakel E, Lounis N, Meyvisch P, Van Baelen B, De Marez T, et al. Triplicate Sputum Cultures for Efficacy Evaluation of Novel Antituberculosis Regimens. Am J Respir Crit Care Med. 2017;196: 1612–5. pmid:28448721
View Article
PubMed/NCBI
Google Scholar

[84] View Article

[85] PubMed/NCBI

[86] Google Scholar

[ref23] 23. van Buuren S. Flexible imputation of missing data. Boca Raton: Chapman & Hall/CRC; 2012.

[ref24] 24. Alonso A, Van der Elst W, Meyvisch P. Assessing a surrogate predictive value: A causal inference approach. Stat Med. 2017;36: 1083–1098. pmid:27966231
View Article
PubMed/NCBI
Google Scholar

[89] View Article

[90] PubMed/NCBI

[91] Google Scholar

[ref25] 25. Heinze G, Schemper M, A solution to the problem of separation in logistic regression. Stat Med. 2002;21: 2409–2419. pmid:12210625
View Article
PubMed/NCBI
Google Scholar

[93] View Article

[94] PubMed/NCBI

[95] Google Scholar

[ref26] 26. Van der Elst W, Meyvisch P, Alonso A. Evaluation of surrogate endpoints in clinical trials. https://cran.r-project.org/web/packages/Surrogate (accessed: June 13, 2017).

[ref27] 27. Pym AS, Diacon AH, Tang SJ, Conradie F, Danilovits M, Chuchottaworn C, et al. Bedaquiline in the treatment of multidrug- and extensively drug-resistant tuberculosis. Eur Respir J. 2016;47: 564–574. pmid:26647431
View Article
PubMed/NCBI
Google Scholar

[98] View Article

[99] PubMed/NCBI

[100] Google Scholar

[ref28] 28. World Health Organization. Treatment of tuberculosis: guidelines. 4th Edn (WHO/HTM/TB/2009.420). www.aidsdatahub.org/dmdocuments/Treatment_of_TB_Guidelines.pdf (last accessed: November 23, 2017).

[ref29] 29. Svensson EM and Karlsson MO. Modelling of mycobacterial load reveals bedaquiline's exposure-response relationship in patients with drug-resistant TB. J Antimicrob Chemother. 2017;72: 3398–3405. pmid:28961790
View Article
PubMed/NCBI
Google Scholar

[103] View Article

[104] PubMed/NCBI

[105] Google Scholar

[ref30] 30. World Health Organization. Rapid diagnostic test and shorter, cheaper treatment signal new hope for multidrug-resistant tuberculosis patients. May 12 2016. http://www.who.int/mediacentre/news/releases/2016/multidrug-resistant-tuberculosis/en/ (last accessed: November 23, 2017).

[ref31] 31. Conradie F, Diacon AH, Everitt D, Mendel C, van Nieckerk C, Howell P, et al. The NIX-TB trial of pretomanid, bedaquiline and linezolid to treat XDR-TB. Presented at the 2017 Conference on Retroviruses and Opportunistic Infections (CROI), CROI Foundation in partnership with the International Antiviral Society-USA, Seattle, WA, February 13–16, 2017. Abstract 80LB.

[ref32] 32. Dawson R, Harris K, Conradie A, Burger D, Murray S, Mendel C, et al. Efficacy of bedaquiline, pretomanid, moxifloxacin & PZA (BPaMZ) against DS- & MDR-TB. Presented at the 2017 Conference on Retroviruses and Opportunistic Infections (CROI), CROI Foundation in partnership with the International Antiviral Society-USA, Seattle, WA, February 13–16, 2017. Abstract 724LB.

Abstract

Figures

Introduction

Methods

C208 study design

Data collection and definition of endpoints in C208 Stage II

Sputum processing.

Aggregate scoring.

Drug-susceptibility testing.

Study population and primary efficacy analysis.

Statistical evaluation of surrogate endpoints when both the surrogate and true endpoint are binary outcomes

Individual causal association.

Surrogate predictive function.

Results

Descriptive analysis

Re-analysis of the primary endpoint using multiple imputation

Individual causal association

Surrogate predictive function

Discussion

Supporting information

S1 Table. Institutional review boards.

S2 Table.

S3 Table.

Acknowledgments

References

Cookie Preference Center

Customize Your Cookie Preference