دورية أكاديمية

Realistic Workload Modeling and Its Performance Impacts in Large-Scale eScience Grids.

التفاصيل البيبلوغرافية
العنوان: Realistic Workload Modeling and Its Performance Impacts in Large-Scale eScience Grids.
المؤلفون: Hui Li1 hui.li@computer.org
المصدر: IEEE Transactions on Parallel & Distributed Systems. Apr2010, Vol. 21 Issue 4, p480-493. 14p.
مصطلحات موضوعية: *MATHEMATICAL models, *HIGH performance computing, *DISTRIBUTED computing, GRID computing, CYBERINFRASTRUCTURE
مستخلص: Grid computing proves to be a successful paradigm for large-scale distributed data processing, and global eScience Grids have been in production for years (e.g., LCG and OSG). The majority of applications running on these production environments can be characterized as massive CPU-intensive batch jobs (or "bag-of-tasks"), sometimes considered as the "killer" application for the Grid. A deep understanding of its main workload characteristics is not only necessary for realistic performance evaluation of the existing system, but also crucial to generate new insights into better resource allocation schemes. This paper presents a comprehensive statistical analysis of the workloads on production eScience Grid environments. We focus on second-order statistics and the scaling behavior of main job characteristics, namely job arrivals and job runtimes. A range of autocorrelation structures is identified and analyzed, including pseudoperiodicity, short-range dependence (SRD), and long-range dependence (LRD). We further develop mathematical models that are able to capture these salient properties in the workloads. Workload models, in turn, enable us to quantitatively evaluate the performance impacts of autocorrelations in Grid scheduling. The results indicate that autocorrelations in workloads result in system performance degradation, sometimes the difference can be as large as up to several orders of magnitude. Nevertheless, better performance can be achieved at the Grid level under bursty local background workloads. Such effects of workloads on systems are extensively analyzed and explained. [ABSTRACT FROM AUTHOR]
Copyright of IEEE Transactions on Parallel & Distributed Systems is the property of IEEE and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Business Source Index
الوصف
تدمد:10459219
DOI:10.1109/TPDS.2009.99