259424 Variable Selection in Multivariate Modeling of Drug Product Formula and Process

Tuesday, October 30, 2012: 9:50 AM
Allegheny II (Westin )
Yong Cui1, Xiling Song2, King Chuang1, Cadapakam Venkatramani1, Sueanne Lee3, Gregory Gallegos4, Thirunellai Venkateshwaran5 and Minli Xie6, (1)Small Molecule Pharmaceutical Sciences, Genentech, Inc. , South San Francisco, CA, (2)Small Molecule Pharmaceutical Sciences, Genentech, Inc., South San Francisco, CA, (3)Samll Molecule Pharmaceutical Sciences, Genentech, Inc. , South San Francisco, CA, (4)Small Molecule Pharmaceutical Sciences, Genentech Inc., South San Francisco, CA, (5)Small Molecule Pharmaceutical Sciences, Genentech, Inc. , South san francisco, CA, (6)Small Molecule Pharmaceutical Sciences, Genentech, Inc. , south san francisco, CA

Multivariate data analysis methods such as partial least square (PLS) modeling have been increasingly applied to product development, particularly in the quality-by design paradigm. This study applied the PLS modeling to analyze a product development dataset combining a design of experiment study and historical batch data. Attention was paid in particular to the assessment of the importance of predictor variables, and subsequently the variable selection in the PLS modeling. The results indicate that in conducting PLS modeling irrelevant and collinear predictors could be extensively present in the initial model. Therefore, variable selection is an important step in the model optimization. The VIP and coefficient values can be employed to rank the importance of predictors and to help remove irrelevant predictors. To reduce collinear predictors, on the other hand, multiple rounds of PLS modeling on different combinations of predictors may be necessary. To this end, stepwise reduction of predictors based on their VIP/coefficient ranking was introduced and appeared to be an effective approach to identify and remove redundant collinear predictors.

Extended Abstract: File Not Uploaded