Robust regression with compositional covariates including cellwise outliers

We propose a robust procedure to estimate a linear regression model with compo- sitional and real-valued explanatory variables. The proposed procedure is designed to be robust against individual outlying cells in the data matrix (cellwise outliers), as well as entire outlying observations (rowwise outliers). Cellwise outliers are first filtered and then imputed by robust estimates. Afterwards, rowwise robust composi- tional regression is performed to obtain model coefficient estimates. Simulations show that the procedure generally outperforms a traditional rowwise-only robust regression method (MM-estimator). Moreover, our procedure yields better or comparable results to recently proposed cellwise robust regression methods (shooting S-estimator, 3-step regression) while it is preferable for interpretation through the use of appropriate coordinate systems for compositional data. An application to bio-environmental data reveals that the proposed procedure-compared to other regression methods-leads to conclusions that are best aligned with established scientific knowledge.
Refereed journal
Output Tags
WP 1.2 Water resources and flood risk management (RESAS 2016-21)