Skip to main content

Analysis of multivariate longitudinal kidney function outcomes using generalized linear mixed models

Abstract

Background

Renal transplant patients are mandated to have continuous assessment of their kidney function over time to monitor disease progression determined by changes in blood urea nitrogen (BUN), serum creatinine (Cr), and estimated glomerular filtration rate (eGFR). Multivariate analysis of these outcomes that aims at identifying the differential factors that affect disease progression is of great clinical significance. Thus our study aims at demonstrating the application of different joint modeling approaches with random coefficients on a cohort of renal transplant patients and presenting a comparison of their performance through a pseudo-simulation study. The objective of this comparison is to identify the model with best performance and to determine whether accuracy compensates for complexity in the different multivariate joint models.

Methods and results

We propose a novel application of multivariate Generalized Linear Mixed Models (mGLMM) to analyze multiple longitudinal kidney function outcomes collected over 3 years on a cohort of 110 renal transplantation patients. The correlated outcomes BUN, Cr, and eGFR and the effect of various covariates such patient’s gender, age and race on these markers was determined holistically using different mGLMMs. The performance of the various mGLMMs that encompass shared random intercept (SHRI), shared random intercept and slope (SHRIS), separate random intercept (SPRI) and separate random intercept and slope (SPRIS) was assessed to identify the one that has the best fit and most accurate estimates. A bootstrap pseudo-simulation study was conducted to gauge the tradeoff between the complexity and accuracy of the models. Accuracy was determined using two measures; the mean of the differences between the estimates of the bootstrapped datasets and the true beta obtained from the application of each model on the renal dataset, and the mean of the square of these differences. The results showed that SPRI provided most accurate estimates and did not exhibit any computational or convergence problem.

Conclusion

Higher accuracy was demonstrated when the level of complexity increased from shared random coefficient models to the separate random coefficient alternatives with SPRI showing to have the best fit and most accurate estimates.

Background

Monitoring kidney function is a necessity for patients who underwent kidney transplantation to assess progression of renal disease. Typically three markers are measured repeatedly over time after transplantation to ensure that no signs of kidney problems or risk of renal failure due to graft rejection are present and these include blood urea nitrogen (BUN), serum creatinine (Cr) and estimated glomerular filtration rate (eGFR). These markers are correlated and needed to ensure an accurate evaluation of the kidney function since each has its own limitations and could be influenced by demographical and physiological characteristics of the patient [15]. Once a patient experiences graft rejection then measurements on the kidney markers won’t be recorded and patients will no longer be monitored for kidney function resulting in missing data. Given the interdependence of these outcomes in determining kidney function, it is important to evaluate the factors that affect the rate of change in these outcomes in a joint manner. The motivation of this study is to assess the association between the three outcomes (BUN, creatinine and eGFR) and key demographic and clinical factors accounting for the correlation between these markers in a multivariate fashion.

Analysis of multiple outcomes is given special attention in the literature. For example, multivariate linear mixed models (MLMM) were proposed to analyze multiple outcomes to assess and test for global exposure effect across outcomes while assuming flexible correlation structure for the multiple outcomes [6]. The idea was to provide robust estimates for the mean by separating it from the correlation parameters. This MLMM model was limited to Gaussian outcomes and did incorporate longitudinal data. Joint modeling was also used in the context of jointly studying time to clinical event and repeated measures on surrogate outcomes [7]. Others included joint modeling of the multilevel item response theory (MLIRT) and Cox’s proportional hazard model for time to dependent terminal event with shared random effects to link the two models [8]. A modeling framework for MLIRT also referred to as latent regression was widely considered and was based on the idea that the observed measurements are a result of some imperfect interaction between subject-specific latent variables and measurement-specific parameters [913]. The latent traits are considered as response variables and are regressed on a set of covariates, hence the name of latent regression. The advantage of the MLIRT models resides in the separation of the measurement-specific parameters and subject-specific covariates from manifest data and simultaneous estimation of these parameters and covariate effects [1416]. However, deviation from the normality assumption could affect the accuracy of the estimates and the inferences for the high-level outcomes [17]. Multivariate joint modeling was also an alternative wherein a joint distribution is specified to jointly model all random effect [1820]. Despite that data transformation solved the issue of non-normality it still had its imbedded disadvantages such as difficulty in the interpretation of the estimated parameters on transformed scale. In addition, joint normality may not be always achieved especially if transformation was conducted on a component basis, and lack of universal transformation that could be appropriate for the majority of datasets [21]. An alternative approach that is not constrained by the issue of non-normality is the joint modeling of multivariate longitudinal outcomes discussed in [22]. In this approach the interdependence among outcomes is captured by using shared latent variable(s) or the correlation between random coefficients corresponding to each outcome and the non-normality is solved using distributions and link functions from the list of general linear models. They describe the following four approaches, (1) shared random intercept (SHRI); (2) shared random intercept and slope (SHRIS); (3) separate random intercepts (SPRI); (4) separate random intercepts and slopes (SPRIS). The application and assessment of performance for these models were limited to health services outcomes in extremely large datasets and were not applied on clinical outcomes from a relatively small sample size.

The goals of our study were

  1. 1.

    To identify the best fitting joint model from the four approaches listed in terms of the tradeoff between the complexity of the model and precision of the estimates including convergence related complexities that could be due to small sample size.

  2. 2.

    To demonstrate the application of these models on a cohort of renal transplant patients assuming that the three correlated kidney markers follow the lognormal distribution. Hence in this research work we are proposing a novel application of the joint modeling approaches, and we are conducting an innovative thorough assessment of the performance of each approach in a clinical setting characterized by its small sample size. Studies with small sample sizes are commonly encountered in clinical settings of high cost, or where focus is on rare diseases, pilot studies as well as basic science research. Hence demonstrating the application and understanding the performance of these models in a dataset characterized by its small sample size, and multiple outcomes with non-normal distribution and high correlation is the key novel feature of this study.

Methods

The joint modeling approach investigated in this study is the multivariate generalized linear mixed models (mGLMM) which is a generalization of the linear mixed models. Fixed and random effects are both included and are referred to as \( \beta \) and bi respectively in the below equations of the joint models. Fixed effects represent the average rate of change in the outcome attributed to specific covariates at a population level; however, the random effects represent subject specific rate of change. The correlation between repeated measures on a certain outcome pertaining to the same individual subject is accounted for through the random effects. The correlations between the different multiple outcomes are incorporated through the variance–covariance matrix of the shared or separate random effects depending on the specific mGLMM. While linear mixed models are limited to normally distributed outcomes, mGLMM are extended to multiple correlated outcomes with other distributions, not necessarily normal, each of which is determined by its corresponding link function. The mGLMM is comprised of a set of generalized linear mixed models (GLMM) linked together by a SHRI, SHRIS, SPRI or separate random intercept and slope (SPRIS).

Let Yij1, Yij2,…, Yijk be the multivariate outcomes for subject i, measured at times j = 1 to ti, for outcomes 1 to k, the different joint models can be described as such:

SHRI

For three outcomes, SHRI model can be described as follows:

$$ \begin{aligned} E\left( {Y_{ij1} |b_{0i} } \right) = g^{ - 1} \left( {\beta_{01} + b_{0i} + \beta_{11} t_{i} + \beta_{21} x_{i} } \right) \hfill \\ E\left( {Y_{ij2} |b_{0i} } \right) = g^{ - 1} \left( {\beta_{02} + b_{0i} + \beta_{12} t_{i} + \beta_{22} x_{i} } \right) \hfill \\ E\left( {Y_{ij3} |b_{0i} } \right) = g^{ - 1} \left( {\beta_{03} + b_{0i} + \beta_{13} t_{i} + \beta_{23} x_{i} } \right) \hfill \\ \end{aligned} $$

where \( b_{0i} \sim MVN\left( {0,\sigma_{b}^{2} } \right) \), \( \sigma_{b}^{2} \) is the variance for the intercept, \( x_{i} \) is the covariate of interest and \( g^{ - 1} \) is the link function. The model can accommodate more than one covariate. The correlation between the outcomes is assumed to be captured by the shared-common intercept.

SHRIS

$$ \begin{aligned} E\left( {Y_{ij1} |b_{i} } \right) = g^{ - 1} \left( {\beta_{01} + b_{0i} + \left( {b_{1i} + \beta_{11} } \right)t_{i} + \beta_{21} x_{i} } \right) \hfill \\ E\left( {Y_{ij2} |b_{i} } \right) = g^{ - 1} \left( {\beta_{02} + b_{0i} + \left( {b_{1i} + \beta_{12} } \right)t_{i} + \beta_{22} x_{i} } \right) \hfill \\ E\left( {Y_{ij3} |b_{i} } \right) = g^{ - 1} \left( {\beta_{03} + b_{0i} + \left( {b_{1i} + \beta_{13} } \right)t_{i} + \beta_{23} x_{i} } \right) \hfill \\ \end{aligned} $$

\( b_{i} = \left( {\begin{array}{*{20}c} {b_{0i} } \\ {b_{1i} } \\ \end{array} } \right) \sim MVN\left( {0,D_{2 \times 2} } \right) \) where \( D \) is the variance–covariance matrix of the shared intercept and slope so intuitively it includes the variances of the intercept and slope and their covariance. The correlation between the outcomes is assumed to be captured by the shared-common intercept and slope.

SPRI

The concept of capturing the correlation between the outcomes through intercept only, still applies here as in SHRI but the difference is in relaxing the shared intercept theory and assuming separate intercept associated with every outcome. Relaxing this assumption adds a layer of complexity by increasing the number of parameters to be estimated in the variance–covariance matrix. SPRI can be described as such

$$ \begin{aligned} E\left( {Y_{ij1} |b_{0i} } \right) = g^{ - 1} \left( {\beta_{01} + b_{0i1} + \beta_{11} t_{i} + \beta_{21} x_{i} } \right) \hfill \\ E\left( {Y_{ij2} |b_{0i} } \right) = g^{ - 1} \left( {\beta_{02} + b_{0i2} + \beta_{12} t_{i} + \beta_{22} x_{i} } \right) \hfill \\ E\left( {Y_{ij3} |b_{0i} } \right) = g^{ - 1} \left( {\beta_{03} + b_{0i3} + \beta_{13} t_{i} + \beta_{23} x_{i} } \right) \hfill \\ \end{aligned} $$

\( b_{0i} = \left( {\begin{array}{*{20}c} {b_{0i1} } \\ {b_{0i2} } \\ {b_{0i3} } \\ \end{array} } \right) \sim MVN\left( {0,D_{3 \times 3} } \right) \) where the number of parameters to be estimated increases to 6 attributed to the 3 variances and 3 covariances between the intercepts. The correlation between the 3 outcomes is assumed to be captured by the correlation between the separate intercepts.

SPRIS

The concept of capturing the correlation between the outcomes as in SHRIS is applied here with the difference in assuming separate intercept and slope for every outcome which adds another level of complexity on SPRI. SPRIS can be described as such:

$$ \begin{aligned} E\left( {Y_{ij1} |b_{i1} } \right) = g^{ - 1} \left( {\beta_{01} + b_{0i1} + \left( {b_{1i1} + \beta_{11} } \right)t_{i} + \beta_{21} x_{i} } \right) \hfill \\ E\left( {Y_{ij2} |b_{i2} } \right) = g^{ - 1} \left( {\beta_{02} + b_{0i2} + \left( {b_{1i2} + \beta_{12} } \right)t_{i} + \beta_{22} x_{i} } \right) \hfill \\ E\left( {Y_{ij3} |b_{i3} } \right) = g^{ - 1} \left( {\beta_{03} + b_{0i3} + \left( {b_{1i3} + \beta_{13} } \right)t_{i} + \beta_{23} x_{i} } \right) \hfill \\ \end{aligned} $$

\( b_{i} = \left( {\begin{array}{*{20}c} {b_{i1} } \\ {b_{i2} } \\ {b_{i3} } \\ \end{array} } \right) = \left( {\begin{array}{*{20}c} {\left( {\begin{array}{*{20}c} {b_{0i1} } \\ {b_{1i1} } \\ \end{array} } \right)} \\ {\left( {\begin{array}{*{20}c} {b_{0i2} } \\ {b_{1i2} } \\ \end{array} } \right)} \\ {\left( {\begin{array}{*{20}c} {b_{0i3} } \\ {b_{1i3} } \\ \end{array} } \right)} \\ \end{array} } \right) \sim MVN\left( {0,D_{6 \times 6} } \right) \) where D is the variance–covariance matrix for the 3 intercepts and 3 slopes associated with every outcome resulting in an increase of the number of parameters to be estimated to 21. For all the models, the dependence between and within outcomes is captured by the structure of the variance–covariance matrix that can be determined by the measures of model fitting AIC, BIC and −2log l.

The univariate counterpart of these models can be reduced to GLMM with random intercept only (RI) and with random intercept and slope (RIS).

Pseudo-simulation bootstrap study

To assess the performance of each model a pseudo simulation bootstrap study was conducted using a renal dataset and assuming that the three kidney outcomes (BUN, Cr and eGFR) follow the lognormal distribution. The covariates were limited to time and square of time since SPRIS exhibited convergence problems when other covariates were included. The square of time was included in the analysis to account for the curvilinear relationship between time and each of the outcomes. We implemented 2,000 replications each with a sample size of 110 individuals boot strapped from the cohort of renal patients. The performance of the 4 multivariate models (SHRI, SHRIS, SPRI, SPRIS) and that of the univariate random intercept model (RI), and random intercept and slope model (RIS) was examined. The performance was determined in terms of accuracy assessed by two measures; the mean of the differences between the estimates of the bootstrapped datasets and the true beta obtained from the application of each model on the renal dataset, and the mean of the square of these differences. Specifically, true betas are the fixed effects \( \left( {\beta_{11} ,\beta_{12} ,\beta_{13} ,\beta_{21} ,\beta_{22} ,\beta_{23} } \right) \) in the models’ equations. These parameters are estimated by applying the generalized linear mixed models on the renal transplant dataset to reflect the estimated effects of the corresponding covariates on the changes in kidney outcomes. In Table 1, the differences between the estimates generated in the simulation studies and these betas are computed to determine the accuracy of the various models, noting that the smaller the difference the more accurate the model. For simplicity, we refer to these measures as mean difference, and mean squared difference respectively. This comparison helped us achieve the following objectives:

Table 1 Bootstrapping on the (a) multivariate models and (b) univariate models: replicate 2,000 sample size 110
  1. 1.

    Identify the multivariate models that exhibited better performance than the other models.

  2. 2.

    Determine whether the complex separate random models (SPRI and SPRIS) had any advantage in terms of accuracy of the estimates over the simpler shared random effect models (SHRI and SHRIS). Accordingly, we can gauge complexity versus precision.

  3. 3.

    Examine whether including the slope as random effect along with the intercept as in the SHRIS and SPRIS models has any respective advantage over the models that assume solely random intercept as in SHRI and SPRI.

  4. 4.

    Evaluate whether there was a gain in accuracy under the multivariate modeling versus the univariate separate analysis implemented on every outcome.

The bootstrap results are presented in Tables 1, 2, and 3 for the multivariate and univariate implementation of the models with 2,000 replications. In Table 1 we present the mean difference and mean squared difference for the estimates under every model while in Table 2 we provide percent reductions in these measures for a specific multivariate model versus the others. In Table 3 the percent reductions in these measures between univariate and multivariate models are presented.

Table 2 Percent reduction in mean difference and mean squared difference that correspond to: SPRI compared to all other multivariate models (a, b, and c)b; SPRI compared to SHRI (d); SPRIS compared to SHRIS (e); SPRI compared to SPRIS(f)
Table 3 Percent reduction in mean difference that correspond to: SPRI Compared to RI (a); SPRIS compared to RIS (b)

Objective 1 As evidenced in Tables 1 and 2 (see also 2nd footnote labeled as b in Table 2, and Table 2 columns a, b, and c), we notice that 6 out of 12 estimates i.e. 50% of the estimates generated under SPRI had the smallest mean difference, and mean squared difference compared to those generated under the other models that include SHRI, SHRIS, and SPRIS. Specifically, 50% of the estimates generated under SPRI had an associated mean difference, and mean squared difference that are respectively 58.7% and 12% smaller on average than those generated under the other models. Hence, this leads to the conclusion that SPRI demonstrated the best performance compared to the other 3 models.

Objective 2 Compared to SHRI, 83% of the estimates generated under SPRI had associated mean difference that is 40% smaller on average, and mean squared difference that are close in value (15% reduction in mean squared differences under SPRI in 50% of the estimates) (see Table 2 column d, and footnote Table 2). In this context, we denote that the eGFR outcome had a significant 32% reduction in mean squared difference under SPRI compared to SHRI. When we compared SPRIS to SHRIS (see Table 2 column e, and footnote Table 2), we realized that about 50% of the estimates generated under SPRIS exhibited an approximate average reduction of 73% in mean difference compared to SHRIS; meanwhile, an observed reduction of 19% in mean squared difference was captured in only 33% of the estimates under SPRIS. Hence, mean squared difference did not greatly differ between SHRIS and SPRIS. Accordingly, one can conclude that separate modeling of the random effect had it be solely intercept (SPRI), or both intercept and slope (SPRIS) had an advantage specifically in mean difference reduction over the simpler models that assume shared random effects (SHRI and SHRIS).

Objective 3 To assess whether the increase in complexity when going from the simpler model (SPRI) to the more complex one (SPRIS) generates more accurate results, we observed that 66% of the estimates under SPRI had an associated mean difference that is 72% smaller on average than those generated under SPRIS, and 100% of the estimates under SPRI had smaller mean squared difference (9% average reduction in mean squared difference) than those generated under SPRIS (see Table 2 column f, and footnote Table 2). Hence, the simpler model (SPRI) appeared to generate more accurate estimates than the complex model (SPRIS). This conclusion supports the finding established in objective 1. However, this was not the case with SHRI and SHRIS. Specifically, when the performance of the simpler model (SHRI) was compared to that of the more complex one (SHRIS), similar overall performance was observed for both models wherein both models had close mean difference and mean squared difference (see Table 1 SHRI and SHRIS models).

Hence SPRI appeared to have better performance than the other models and increasing the level of complexity by going from SPRI to SPRIS did not improve on the accuracy of the estimates that were generated under SPRI. Hence, these results lead to the conclusion that modeling the slope as a random effect along with the intercept did not improve the performance of the associated models (SHRIS and SPRIS) compared to the simpler models that assume solely random intercept (SHRI and SPRI).

Objective 4 Our bootstrap pseudo simulation study also aimed at investigating the effect of multivariate modeling versus the univariate implementation of the models that assume that the outcomes are independent. In this regard a comparison was undertaken between the performance of the RI model and that of SHRI and SPRI, and between RIS and that of SHRIS and SPRIS (Tables 1, 3). The performance of SHRI and RI did not significantly differ since their associated mean difference and mean squared differences were close (see Table 1). Hence, this observation indicates that under models with random intercept, the multivariate shared model (SHRI) did not improve the accuracy of the estimates compared to those generated under the univariate random intercept model (RI). However, this conclusion did not hold when we compared SPRI and RI. Specifically, we noticed that 66% of the estimates generated under SPRI had a 42% average reduction in mean difference compared to those estimates generated under RI (see Table 3 column a, and footnote Table 3). For instance, 77 and 73% reductions in mean difference were observed for the slope estimates of the outcomes creatinine and eGFR generated under SPRI compared to those generated under RI. Nonetheless, the mean squared differences did not greatly differ in the estimates of these two models.

When SHRIS and RIS were compared, a minimal reduction (only 4%) was denoted for the mean squared differences in the majority of the estimates (about 80%) that were generated under SHRIS compared to RIS; but the mean differences did not differ between the 2 models. Hence SHRIS and RIS exhibited similar performance. Compared to RIS, about 50% of the estimates generated under SPRIS exhibited an observed reduction in the mean difference by about 41% on average (see Table 3 column b, and footnote Table 3), while the mean squared difference did not greatly differ between the 2 models. These results suggest that multivariate modeling specifically SPRI and SPRIS had an advantage in performance specifically in Mean Difference compared to the univariate models RI and RIS respectively (Table 3). However, this conclusion did not hold for the shared multivariate models SHRI and SHRIS wherein their associated mean difference and mean squared difference did not greatly differ from those under the univariate RI and RIS respectively. Hence only the separate multivariate models (SPRI and SPRIS) exhibited improved performance over their univariate counterpart models RI and RIS. This result also endorses our conclusion in objective 2 which stated that separate random effect models (SPRI and SPRIS) had better performance compared to the shared random effect models (SHRI and SHRIS).

Motivating example: renal transplant application

The current study was motivated by the cohort of renal transplant patients. The dataset consisted of 110 patients who underwent renal transplantation in the year 2000 and were followed for 3 years (till year 2003) post-transplant until they experience graft rejection or end of study period. After the scheduled termination date of the study in year 2003, patients were no longer followed and assessment of their kidney function was not undertaken afterwards.

In case of graft rejection, patients had to revert back to the dialysis regimen and were not monitored afterwards. About 19% of the patients in this cohort experienced graft rejection. Renal function and progression of renal disease is assessed by 3 correlated markers, BUN, serum Cr, and eGFR. These outcomes follow the lognormal distribution. In this study BUN and Cr had a positive correlation of 0.624 (P value <0.0001), while BUN and eGFR had a negative correlation of −0.63 (P value <0.0001) and that of Cr and eGFR was −0.71 (P value <0.0001). Changes in the levels of markers are what usually determine the progression of renal disease. In this regard, decrease in the level of BUN and Cr indicates an improvement in the kidney function over time. However, the opposite is true for eGFR where an increase in the levels of this marker indicates improved kidney function. This explains the positive correlation between BUN and Cr, and the negative ones between these markers and eGFR. The normal values for BUN are between 8 and 25 mg per 100 mL of blood, for Cr the normal range is between 0.7 and 1.3 mg per 100 mL of blood, and for eGFR this range is approximately between 90 and 120 mL/min/1.73 m2 for men and women respectively. Measurements on the three markers were taken on every patient for a 3-year period starting with the baseline measure pre-transplant registered as month 0, then at months 1, 3, 6, 12, 24, and 36 post-transplant.

The four models examined in this study were applied on this cohort of patients and the objective was to estimate the slope for every outcome over time along with some associated key covariates that include patient’s age, gender, and race and the vital status (deceased or living) of the respective donor along with some basic demographics such as donor’s gender and race (see Table 4). Applying the mGLMM on this cohort of patients enabled us to assess the effect of these various covariates on all the renal markers (BUN, creatinine and eGFR) collectively through the joint modeling approach. Moreover, the random effects term in the mGLMM accounts for the correlations between the repeated measures for every patient and the different markers respectively, and for the between-patients variation in the length of stay and completeness of measurements of renal outcomes. This variation emerges between patients who may dropout before the intended termination date of the study. In addition, the link functions and the generalized approach available in mGLMM make it feasible to model the renal outcomes measured in this study and which were found to follow the lognormal distribution without necessitating any transformation to achieve normality.

Table 4 Application of the four multivariate models on the renal transplant dataset with all covariates

By applying the four models we were able to identify the one(s) that had the best fit determined by measures such as AIC, BIC and −2 log likelihood (see Table 5), and compare the associated standard errors for each model. In addition we applied the univariate random intercept model (RI) and univariate random intercept and slope (RIS) on every outcome separately (see Table 6) to compare the estimates obtained under the multivariate application versus the univariate ones. The multivariate model SPRIS did not converge when all these covariates were included concurrently but just with the covariates time and time square. The relation between time and the three markers appeared to be curvilinear; hence, time squared was also included in the model to account for this non-linear relationship. The fitting measures when the 4 models were applied on the renal dataset with only time and time squared as covariates are also presented in Table 5. The variance–covariance matrices of the multivariate and univariate models are reported in Appendix 1 and Appendix 2, respectively.

Table 5 Measures of fit for the multivariate models applied on renal transplant dataset with (a) all covariates (b) time and time squared
Table 6 Application of the two univariate models on the renal transplant dataset with all covariates
Table 7 Variance–covariance matrix of SHRI and SHRIS when applied on renal transplant dataset with all covariates
Table 8 Variance–covariance matrix of SPRI when applied on renal transplant dataset with all covariates
Table 9 Variance–covariance matrix of SPRIS when applied on renal transplant dataset with covariates time and time squared
Table 10 Variance–covariance matrix of RI and RIS when applied on renal transplant dataset with all covariates

As an overall discussion of the estimates we can deduce that for all multivariate models (see Table 4) patients appear to have improved kidney function over time as evidenced by the negative slopes of BUN and Cr, and positive ones for eGFR. This improvement was demonstrated by the estimated decrease of 1.02 mg per 100 mL of blood in BUN on average across all multivariate models and 1.1 mg per 100 mL of blood for Cr. Along the same line, eGFR levels were estimated to increase by an average across all multivariate models of 1.1 mL/min/1.73 m2. This significant improvement in the progression of renal disease over time detected in the multivariate models was also established in univariate models (see Table 6). Gender appeared to be significantly associated with the changes in BUN and Cr but not with eGFR, and males patients were shown to have inferior kidney function compared to females. This conclusion was demonstrated in the multivariate models with the attributed increase in BUN levels for males versus females of 1.2 mg per 100 mL of blood and of 1.3 mg per 100 mL of blood for Cr (see Table 4). A similar conclusion was reached under the univariate models (see Table 6). This detected increase in the levels of these markers in males compared to females led to the conclusion that female patients have better prognosis of kidney disease compared to males that could be attributed to the protective effect of estrogen [23].

Race was found to be associated with BUN and Cr in the SHRI and SPRI models and only Cr in SHRIS. In the univariate models (see Table 6), race was found to be associated only with Cr and, unlike the multivariate models, no association was detected with BUN. Specifically, non-African American patients appeared to be at a disadvantage compared to other races due to their 1.1 mg per 100 mL of blood increase in BUN levels on average and 1.3 mg per 100 mL of blood increase on average in Cr (see Table 4). This conclusion was also valid under the univariate models; nevertheless, it was just demonstrated with Cr.

Age appeared to be positively correlated with BUN in all models and with every 1 year increase in age of transplantation corresponds to an increase in BUN levels of approximately 1.6 mg per 100 mL of blood on average. Hence those patients who had their renal transplantation at older ages were at a disadvantage in terms of renal function and progression of kidney disease compared to those patients who had the transplant at younger ages. This conclusion holds for all models had it be multivariate or univariate.

Males were better donors than females since patients who received their kidneys from male donors exhibited a decrease of approximately 1.2 mg per 100 mL of blood in their Cr levels, and an increase of approximately 1.2 mL/min/1.73 m2 in eGFR levels compared to patients who had female donors (Tables 4, 6). This result is in line with other studies that showed females to be poorer donors than males, an observation attributed to nephron underdosing and hence nephron overload in females [2325]. No association was detected between the donor’s race and BUN in all models. The other covariates such as donor’s gender and vital status were found to be not associated with any of the outcomes and as a result were not included in any of the models.

Comparison of models fit

SPRI exhibited the best fit as demonstrated by its lowest associated pseudo AIC, BIC and −2log l (see Table 5) wherein average decreases of 641, 631 and 650 in these measures were denoted between SPRI compared to the other models SHRI and SHRIS. Compared to the univariate model RI, the standard errors generated under SPRI were mostly slightly larger than the ones generated under RI. However, this was not the case when we compared the standard errors under SHRI and those under the univariate model RI wherein all the estimates under SHRI had smaller SE than those under RI. The same conclusion was observed when we compared SHRIS to RIS and we denoted that all estimates under SHRIS had smaller standard errors than those with RIS. This conclusion was anticipated since only one additional parameter (variance of the intercept) needed to be estimated under SHRI as indicated in the variance–covariance matrix (see Appendix 1). The same case was denoted for RI where also one additional parameter (variance of the intercept) needed to be estimated for every outcome (see Appendix 2) leading to three additional parameters that pertain to the three outcomes. The same can be said for SHRIS and RIS models. However SPRI had additional six parameters in the variance–covariance matrix (see Appendix 1). A comparison of the four multivariate models based on the application to the renal data with time and time squared as covariates (see Table 5), SPRI still exhibited the best fit and its associated AIC, BIC and −2log l were the smallest compared to SHRI, SHRIS and SPRIS.

Discussion

In this study we demonstrate a novel application of the different mGLMM approaches to a renal transplant dataset characterized by its small sample size of 110 patients and correlated multiple outcomes with lognormal distribution. While mGLMM is based on multivariate modeling of the correlated outcomes via joint modeling of latent variables, other approaches have also been used. For example, earlier analysis of the same renal transplant data [26] was based on a Two-stage likelihood-based approach wherein the individual observations for every patient are reduced into slopes computed in stage 1. A jointly modeling of individual slopes and their correlations along with the number of visits for every patient is conducted in stage 2. Maximization of the likelihood function that is integrated over the random effects results in maximum likelihood estimates of the population slopes and empirical Bayes estimates for the individual slopes. Since individual observations for every patient are reduced to the individual slopes then this required imposing an assumption that only patients with two visits or more can be included in the study. Moreover, it was assumed that all sufficient information is contained in the slopes and that the intercept could be discarded and excluded from the likelihood function. These two assumptions are no longer required in the mGLMM approaches presented in this study since intercepts were incorporated as a shared random effect or separate ones and individual observations are modeled rather than their corresponding slopes, hence the restriction of eligibility for those with two observations and more was relaxed here. In this study we gave additional insight into the assessment of shared versus separate models in clinical settings of small sample size. Our current assessment also includes the level of computational complexity and accuracy of the different models. As expected, there has been some discrepancy in the computational burden to fit each of the models. For example, the SPRIS model did not converge when multiple covariates were included along with time and time squared. A natural explanation for this phenomenon is having a large number of 21 parameters in the variance–covariance matrix of SPRIS that needed estimation (Appendix 1) and a small sample size of 110 did not help achieve convergence. This observation highlights a drawback with SPRIS and suggests that SPRIS should be used with caution in settings of small sample size. SPRI had the best fit when applied to the renal dataset and generated the most accurate estimates with minimum mean difference and mean squared differences. This advantage could be due to the fact that the number of parameters in the variance–covariance matrix for SPRI (6 parameters) is midway between the SHRI’s (1 parameter) and SHRIS’ (3 parameters) and that for SPRIS (21 parameters). This result is in agreement with what has been shown in studies with large sample size [22] where SPRI also demonstrated the best fit in terms of smallest AIC, BIC and −2log l. To validate these models using another independent dataset, the mGLMM approach was applied on a longitudinal study of type 1 diabetic patients with large sample size and multiple cardiovascular disease outcomes that have mixed non-normal distributions [27, 28]. Our results showed that SPRI had the best fit in terms of smallest AIC, BIC, and -2log l compared to the other models and had the minimum mean difference and mean difference squared in the majority of the estimates. This confirms again that SPRI had the best performance and best fit in the context of large datasets as was the case with the small dataset.

Models with separate random effects exhibited better performance in terms of mean difference and mean squared differences compared to those with shared ones and also demonstrated superior performance than the univariate RI and RIS models. Nonetheless, increasing the level of complexity from SPRI to SPRIS by adding the slope as a random effect along with the intercept did not increase the level of accuracy; on the contrary convergence problems were encountered. Hence it appears that the SPRI model that is midway in complexity was favored by the renal dataset and the bootstrap study.

Conclusion

In the context of a relatively small sample size study, we were able to demonstrate that the mGLLMs with SPRIs and separate random intercepts and slopes (SPRIS) generated more accurate estimates compared to the SHRI and SHRIS models. Similar conclusion was reached when we compared SPRI and SPRIS to their univariate alternatives RI and RIS. Convergence issues were encountered with SPRIS when the number of covariates increased, however, SPRI had the best fit and most accurate performance compared to the other models and its successful convergence was consistently achieved.

Abbreviations

BUN:

blood urea nitrogen

Cr:

serum creatinine

eGFR:

estimated glomerular filtration rate

mGLMM:

multivariate generalized linear mixed model

MLIRT:

multilevel item response theory

MLMM:

multivariate linear mixed model

SHRI:

shared random intercept

SHRIS:

shared random intercept and slope

SPRI:

Separate random intercept

SPRIS:

separate random intercept and slope

References

  1. James GD, Seally JE, Alderman M, Ljungman S, Mueller FB, Pecker MS et al (1988) A longitudinal study of urinary creatinine and creatinine clearance in normal subjects. Race, sex and age differences. Am J Hypertens 1:124–131

    Article  CAS  PubMed  Google Scholar 

  2. Schrier RW (2008) Blood urea nitrogen and serum creatinine not married in heart failure. Circ Heart Fail 1:25–33

    Article  Google Scholar 

  3. Froissart M, Rossert J, Jacquot C, Paillard M, Houillier P (2005) Predictive performance of the modification of diet in renal disease and Cockcroft-Gault equations for estimating renal function. J Am Soc Nephrol 16:763–773

    Article  PubMed  Google Scholar 

  4. Verhave JC, Fesler P, Ribstein J, du Cailar G, Mimran A (2005) Estimation of renal function in subjects with normal serum creatinine levels: influence of age and body mass index. Am J Kidney Dis 46:233–241

    Article  CAS  PubMed  Google Scholar 

  5. Cirillo M, Anastasio P, De Santo NG (2005) Relationship of gender, age, and body mass index to errors in predicted kidney function. Nephrol Dial Transplant 20:1791–1798

    Article  PubMed  Google Scholar 

  6. Sammel M, Lin X, Ryan L (1999) Multivariate linear mixed models for multiple outcomes. Stat Med 18:2479–2492

    Article  CAS  PubMed  Google Scholar 

  7. Xu J, Zeger SL (2001) The evaluation of multiple surrogate endpoints. Biometrics 57:81–87

    Article  CAS  PubMed  Google Scholar 

  8. He B, Luo S (2013) Joint modeling of multivariate longitudinal measurements and survival data with applications to Parkinson’s disease. Stat Methods Med Res (In Press Published on line first on April 16, 2013)

  9. Adams R, Wilson M, Wu M (1997) Multilevel item response models: an approach to errors in variables regression. J Educ Behav Stat 22(1):47–76

    Article  Google Scholar 

  10. Andersen E (2004) Latent regression analysis based on the rating scale model. Psychol Sci 46:209–226

    Google Scholar 

  11. Christensen K, Bjorner J, Kreiner S, Petersen J (2004) Latent regression in loglinear Rasch models. Commun Stat Theory Methods 33(6):1295–1313

    Article  Google Scholar 

  12. Mislevy R (1985) Estimation of latent group effects. J Am Stat Assoc 80:993–997

    Article  Google Scholar 

  13. Zwinderman A (1991) A generalized Rasch model for manifest predictors. Psychometrika 56(4):589–600

    Article  Google Scholar 

  14. Maier K (2001) A Rasch hierarchical measurement model. J Educ Behav Stat 26(3):307–330

    Article  Google Scholar 

  15. Kamata A (2001) Item analysis by the hierarchical generalized linear model. J Educ Meas 38(1):79–93

    Article  Google Scholar 

  16. Fox J (2005) Multilevel IRT using dichotomous and polytomous response data. Br J Math Stat Psychol 58(1):145–172

    Article  PubMed  Google Scholar 

  17. Pinheiro J, Liu C, Wu Y (2001) Efficient algorithms for robust estimation in linear mixed-effects models using the multivariate t distribution. J Comput Graph Stat 10(2):249–276

    Article  Google Scholar 

  18. Fieuws S, Verbeke G (2004) Joint modeling of multivariate longitudinal profiles: pitfalls of the random effect approach. Stat Med 23:3093–3104

    Article  PubMed  Google Scholar 

  19. Fieuws S, Verbeke G, Molenberghs G (2007) Random-effects models for multivariate repeated measures. Stat Methods Med Res 16(5):387–397

    Article  CAS  PubMed  Google Scholar 

  20. Bandyopadhyay S, Ganguli B, Chatterjee A (2011) A review of multivariate longitudinal data analysis. Stat Methods Med Res 20(4):299–330

    Article  CAS  PubMed  Google Scholar 

  21. Lachos V, Bandyopadhyay D, Dey D (2011) Linear and nonlinear mixed-effects models for censored HIV viral loads using normal/independent distributions. Biometrics 67:1594–1604

    Article  PubMed  Google Scholar 

  22. Gebregziabher M, Zhao Y, Dismuke CE, Axon N, Hunt KJ, Egede LE (2013) Joint modeling of multiple longitudinal cost outcomes using multivariate generalized linear mixed models. Health Serv Outcomes Res Method 13:39–57

    Article  Google Scholar 

  23. Zeier M, Dohler B, Oplez G, Ritz E (2002) The effect of donor gender on graft failure. J Am Soc Nephrol 13:2570–2576

    Article  PubMed  Google Scholar 

  24. Kasiske BL, Umen JA (1986) The influence of age, sex, race and body habitus on kidney weight in humans. Arch Pathol Lab Med 110:55–60

    CAS  PubMed  Google Scholar 

  25. Brenner BM, Cohen RA, Milford EL (1992) In renal transplantation, one size may not fit all. J Am Soc Nephrol 3:162–169

    CAS  PubMed  Google Scholar 

  26. Jaffa MA, Woolson RF, Lipsitz SR (2011) Slope estimation for bivariate longitudinal outcomes adjusting for informative right censoring by using a discrete survival model: application to the renal transplant cohort. J Royal Stat Soc A 174:387–402

    Article  Google Scholar 

  27. EDIC Research Group (1999) Epidemiology of diabetes interventions and complications (EDIC): design, implementation and preliminary results of long-term follow up of diabetes control and complications trial cohort. Diabetes Care 22:99–111

    Article  Google Scholar 

  28. The DCCT Research Group (1986) The diabetes control and complications trial (DCCT): design and methodologic considerations for the feasibility phase. Diabetes 35:530–545

    Article  Google Scholar 

Download references

Authors’ contributions

Methodology: MAJ and MG. Acquisition of data: MAJ. Analysis and interpretation: MAJ and MG. Clinical interpretation: AAJ and MAJ. Drafting of the manuscript: MAJ, MG and AAJ; Critical revision of the clinical intellectual content: MAJ and AAJ; Critical revision of the important intellectual content: MAJ, MG, and AAJ. All authors read and approved the final manuscript.

Acknowledgements

All authors have contributed significantly to the manuscript.

Compliance with ethical guidelines

Competing interests The authors declare that they have no competing interests.

Ethics This study was granted exemption from requiring ethics approval from the Institutional Review Board at the Medical University of South Carolina.

Funding

This work was supported by the National Institutes of Health Grants HL077192 and HL087986 (AAJ).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Miran A Jaffa.

Appendices

Appendix 1

See Tables 7, 8, 9.

Appendix 2

See Table 10.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jaffa, M.A., Gebregziabher, M. & Jaffa, A.A. Analysis of multivariate longitudinal kidney function outcomes using generalized linear mixed models. J Transl Med 13, 192 (2015). https://doi.org/10.1186/s12967-015-0557-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12967-015-0557-2

Keywords