Parallel Optimization Methods for Simulation-Based Problems in Nanoscience Juan Meza, Michel van Hove, Zhengji Zhao Lawrence Berkeley National Laboratory Berkeley, CA http://hpcrd.lbl.gov/~meza Supported by DOE/MICS SIAM CSE Conference, Orlando, FL, Feb. 12-15,2005 C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Many scientific applications require the solution of an optimization problem C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Low-energy electron diffraction (LEED) Goal is to determine surface structure through low energy electron diffraction (LEED) Inverse problem consists of minimizing so-called R-factor - a measure of fitness between experiment and theory Combination of global/local optimization Inherently noisy optimization problem Low-energy electron diffraction pattern due to monolayer of ethylidyne attached to a rhodium (111) surface C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Surface structure determination from experiment Forty five structural models were proposed in a complex surface structure determination by low energy electron diffraction experiments Lattice sites can be occupied by Ni or Li atoms, or have a vacancy. In addition continuous fit parameters corresponding to local relaxation of positions are also allowed here. Many arrangements of Ni atoms (light and dark green) and Li atoms (yellow and orange) are possible within the outlined 2-dimensional square unit cell. C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Surface structure determination from experiment Electron diffraction determination of atomic positions in a surface: Li atoms on a Ni surface Global optimization of structure type: which of these 45 structure types best fits experiment? Local optimization of structure parameters: which are the best interatomic distances and angles? C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Low Energy Electron Diffraction R-Factors C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Characteristics of optimization problem Inverse problem minimize R-factor - defined as the misfit between theory an experiment Several ways of computing the R-factor Combination of continuous and categorical variables • • Atomic coordinates, i.e. x, y, z Ni, Li No derivatives available - standard issue with “black- box” simulations Invalid structures lead to function being undefined in certain regions and/or discontinuous C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Pendry R-factor where the intensity curve, I, is computed by the LEED code C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Previous Work Previous attempt used genetic algorithms to solve the global optimization method. Large number of invalid structures generated (more on this later). Overall, a solution was found - after adding sufficient constraints. Global Optimization in LEED Structure Determination Using Genetic Algorithms, R. Döll and M.A. Van Hove, Surf. Sci. 355, L393-8 (1996). A Scalable Genetic Algorithm Package for Global Optimization Problems with Expensive Objective Functions, G. S. Stone, M.S. dissertation, Computer Science Dept., San Francisco State University, 1998. C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Pattern search methods Pattern search methods, Torczon, Lewis & Torczon, Lewis, Kolda, Torczon (2004), etc. Extension to mixed variable problems by Audet and Dennis (2000). Case of nonlinear constraints studied in Abramson’s PhD dissertation (2002). A frame-based Mesh Adaptive Direct Search (MADS) method proposed by Audet and Dennis (2004) that removes restriction of a finite number of poll directions. Good software available APPSPACK (Kolda), NOMADm (Abramson), OPT++ (Hough, Meza, Williams) C O M P U T A T I O N A L R E S E A R C H D I V I S I O N NOMADm Variables can be continuous, discrete, or categorical General constraints (bound, linear, nonlinear) Nonlinear constraints can be handled by either filter method or MADS-based approach for constructing poll directions Objective and constraint functions can be discontinuous, extended-value, or nonsmooth. Available at: http://en.afit.edu/ENC/Faculty/MAbramson/NOMADm.html C O M P U T A T I O N A L R E S E A R C H D I V I S I O N MVP Algorithm 1. Initialization: Given D , x0 , M0, P0 2. For k = 0, 1, … a) SEARCH: Evaluate f on a finite subset of trial points on the mesh Mk Global phase can include user heuristics or surrogate functions b) POLL: Evaluate f on the frame Pk Local phase more rigid, but necessary to ensure convergence 3. If successful - mesh expansion: a) xk+1 = xk + Dk dk 4. Otherwise contract mesh C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Convergence properties Assuming f(x) is suitably smooth... For unsuccessful iterations, k rf(xk) k is bounded as a function of the step length Dk Via globalization, lim inf Dk = 0 Conclude: lim inf k rf(xk) k = 0 C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Test problem Model contains three layers of atoms Using symmetry considerations we can reduce the problem to 14 atoms 14 categorical variables 42 continuous variables Positions of atoms constrained to lie within a box Best known previous solution had R-factor = .24 Model 31 from set of TLEED model problems C O M P U T A T I O N A L R E S E A R C H D I V I S I O N GA results - atomic coordinates constrained to 0.4 Angstrom (invalid structures) C O M P U T A T I O N A L R E S E A R C H D I V I S I O N NOMAD results for minimization with respect to the continuous variables Best known solution: R-factor = 0.24 C O M P U T A T I O N A L R E S E A R C H D I V I S I O N NOMAD results for minimization with respect to the continuous variables R-factor = 0.12 # of func call = 591 Best known solution: R-factor = 0.24 C O M P U T A T I O N A L R E S E A R C H D I V I S I O N GA results - categorical variable search with fixed atomic positions best known solution: 11111222222222 Li Ni 11111122211122 11111122221122 11111222221222 11111222222222 21112222222222 Remark: population size = 10 / Generation C O M P U T A T I O N A L R E S E A R C H D I V I S I O N NOMAD results for categorical variables with fixed atomic positions Best known solution (R = 0.24): 11111222222222 Li Ni 11111122211122 R = 0.2387 # of func call = 49 11111222222222 C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Robustness of NOMAD: 15 of 20 initial guesses (poll step only) Best known solution (R = 0.24): 11111222222222 Li Ni AVG initial R = 0.5671 AVG # of func call = 63 C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Five of 20 trials are trapped in local minima using only poll step Best known solution (R = 0.24): 11111222222222 Li C O M P U T A T I O N A L R E S E A R C H Ni D I V I S I O N LHS search + GSS poll escapes from local minima (R = 0.24): 11111222222222 Li Ni New minimum found (R = 0.1184): 22222112111111 N C O M P U T A T I O N A L R E S E A R C H D I V I S I O N NOMAD results for 20 trials using LHS + GSS 20 trials of identity search AVG intial R= 0.5243 R = 0.2387 AVG # of func call =73 R = 0.1184 AVG # of func call =152 Best known solution (R = 0.24): 11111222222222 New minimum found (R = 0.1184): 22222112111111 C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Minimization with respect to both continuous and categorical variables Simultaneous relaxation of both continuous and categorical variables removes restriction on coordinates R-factor = 0.24 # of func call = 212 Best known solution: R-factor = 0.24 R-factor = 0.2151 # of func call = 1195 C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Minimization with respect to both types of variables removes coordinate constraints Penalty R-factor = 1.6 (invalid structures) Best known solution: R-factor = 0.24 C O M P U T A T I O N A L R E S E A R C H D I V I S I O N LEED Chemical Identity Search: Ni (100)-(5x5)-Li Best known solution (R = 0.24) New structure found (R = 0.1184) C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Conclusions GSS methods for mixed variable problems were successful in solving the surface structure determination problem On average NOMAD took 60 function evaluations versus 280 for previous solution (GA) Improved solutions from previous best known solutions found in all cases Generation of far fewer invalid structures Algorithm appears to be fairly robust, with a better structure found in all 20 trial points Ability to minimize with respect to both categorical and continuous variables a critical advantage for these types of problems C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Future work Implement parallel version of algorithms Improve the probability of not being trapped in local minima Develop new SEARCH strategies, especially for categorical variables Develop automatic strategies for switching between different structure models Improve objective function call Develop new validity check Experiment with other R-factor formulations to increase the sensitivity Implement simultaneous minimization of other physical quantities (e.g., energy) C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Acknowledgements Chao Yang Lin-Wang Wang Xavier Cartoxa Andrew Canning Byounghak Lee C O M P U T A T I O N A L R E S E A R C H D I V I S I O N Questions C O M P U T A T I O N A L R E S E A R C H D I V I S I O N
© Copyright 2026 Paperzz