Parallelization Load Balance Strategy for a Global Grid-Point Model

+ Author Affiliations + Find other works by these authors
  • Funds:

    Supported by the National S&T Infrastructure Program for the 11th Five-Year Period under Grant No. 2006BAC02B00 and the National Natural Science Foundation of China under Grant Nos. 40575050 and 40775073

PDF

  • The Global/Regional Assimilation and PrEdiction System (GRAPES) is a new-generation operational numerical weather prediction (NWP) model developed by the China Meteorological Administration (CMA).It is a grid-point model with a code structure different from that of spectral models used in other operational NWP centers such as the European Centre for Medium-Range Weather Forecasts (ECMWF), National Centers for Environmental Prediction (NCEP), and Japan Meteorological Agency (JMA), especially in the context of parallel computing. In the GRAPES global model, a semi-implicit semi-Lagrangian scheme is used for the discretization over a sphere, which requires careful planning for the busy communications between the arrays of processors, because the Lagrangian differential scheme results in shortened trajectories interpolated between the grid points at the poles and in the associated adjacent areas. This means that the latitude-longitude partitioning is more complex for the polar processors. Therefore, a parallel strategy with efficient computation, balanced load, and synchronous communication shall be developed. In this paper, a message passing approach based on MPI (Message Passing Interface) group communication is proposed.Its key-point is to group the polar processors in row with matrixtopology during the processor partitioning.A load balance task distribution algorithm is also discussed. Test runs on the IBM-cluster 1600 at CMA show that the new algorithm is of desired scalability, and the readjusted load balance scheme can reduce the absolute wall clock time by 10% or more. The quasi-operational runs of the model demonstrate that the wall clock time secured by the strategy meets the real-time needs of NWP operations.
  • 加载中
  • 加载中
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Parallelization Load Balance Strategy for a Global Grid-Point Model

  • 1. School of Computer Science,National University of Defense and Technology,Changsha 410073;
    Center for Numerical Weather Prediction,National Meteorological Center of CMA,Beijing 100081;
    School of Computer Science,National University of Defense and Technology,Changsha 410073;
    Center for Numerical Weather Prediction,National Meteorological Center of CMA,Beijing 100081;
    Center for Numerical Weather Prediction,National Meteorological Center of CMA,Beijing 100081;
    Center for Numerical Weather Prediction,National Meteorological Center of CMA,Beijing 100081
Funds: Supported by the National S&T Infrastructure Program for the 11th Five-Year Period under Grant No. 2006BAC02B00 and the National Natural Science Foundation of China under Grant Nos. 40575050 and 40775073

Abstract: The Global/Regional Assimilation and PrEdiction System (GRAPES) is a new-generation operational numerical weather prediction (NWP) model developed by the China Meteorological Administration (CMA).It is a grid-point model with a code structure different from that of spectral models used in other operational NWP centers such as the European Centre for Medium-Range Weather Forecasts (ECMWF), National Centers for Environmental Prediction (NCEP), and Japan Meteorological Agency (JMA), especially in the context of parallel computing. In the GRAPES global model, a semi-implicit semi-Lagrangian scheme is used for the discretization over a sphere, which requires careful planning for the busy communications between the arrays of processors, because the Lagrangian differential scheme results in shortened trajectories interpolated between the grid points at the poles and in the associated adjacent areas. This means that the latitude-longitude partitioning is more complex for the polar processors. Therefore, a parallel strategy with efficient computation, balanced load, and synchronous communication shall be developed. In this paper, a message passing approach based on MPI (Message Passing Interface) group communication is proposed.Its key-point is to group the polar processors in row with matrixtopology during the processor partitioning.A load balance task distribution algorithm is also discussed. Test runs on the IBM-cluster 1600 at CMA show that the new algorithm is of desired scalability, and the readjusted load balance scheme can reduce the absolute wall clock time by 10% or more. The quasi-operational runs of the model demonstrate that the wall clock time secured by the strategy meets the real-time needs of NWP operations.

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return