

Type of Document: Master's Thesis
Author: Banerjee, Shankha
URN: etd-03162012-020243
Title: MPIOR: A Framework to Analyze File System Performance of MPI Applications
Degree: Master of Science
Department: Computer Science and Applications
Advisory Committee
- Varadarajan, Srinidhi (Committee Chair)
- Ribbens, Calvin J. (Committee Member)
- Tilevich, Eli (Committee Member)
Keywords
- I/O
- replay
- MPI
- trace
Date of Defense: 2012-02-17
Availability: unrestricted
Abstract
MPI I/O replay (MPIOR) is an I/O performance modeling and prediction tool that traces and replays a parallel application to determine its performance under a new I/O subsystem. The trace collector deduces the synchronization inter-dependencies between nodes and the I/O demands placed by each node on the storage subsystem. It uses a novel runtime graph traversal technique to filter and log only those MPI calls that affect I/O, thus substantially reducing both the number of runs and the size of the trace file. Unlike other such tools, MPIOR collects a valid trace in a single run and does not rely on node sampling or I/O sampling. MPIOR's post-processing engine analyzes the trace files and sets up the re-player. Because trace collection adds minimal overhead, MPIOR can be used during production runs rather than just as a debugging tool. The re-player mimics the behavior of the application across a variety of storage systems by mapping multiple processes to multiple threads running on a single node. We show that the average replay error for parallel applications is below 30%.
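The replay idea in the abstract (one thread per traced process, all on a single node) can be illustrated with a minimal sketch. This is not MPIOR's implementation: the trace record format, the `replay` function, and the use of `threading.Barrier` to stand in for the synchronization dependencies between processes are all assumptions made for illustration.

```python
import threading
import time

# Hypothetical trace format: one list of records per traced MPI rank.
# Each record is ("write", offset, nbytes), ("read", offset, nbytes),
# or ("barrier",). Every rank must contain the same number of barriers.
def replay(traces, path):
    """Replay per-rank I/O traces against `path`, one thread per rank.

    Returns (bytes_done, elapsed_seconds), where bytes_done[i] is the
    number of bytes rank i read plus wrote.
    """
    open(path, "ab").close()                   # make sure the target file exists
    barrier = threading.Barrier(len(traces))   # stand-in for MPI synchronization
    bytes_done = [0] * len(traces)

    def run_rank(rank, records):
        # Each thread opens its own unbuffered handle, so seeks are private
        # and writes reach the file system before the next barrier.
        with open(path, "r+b", buffering=0) as f:
            for rec in records:
                if rec[0] == "barrier":
                    barrier.wait()
                elif rec[0] == "write":
                    _, offset, nbytes = rec
                    f.seek(offset)
                    bytes_done[rank] += f.write(b"\0" * nbytes)
                elif rec[0] == "read":
                    _, offset, nbytes = rec
                    f.seek(offset)
                    bytes_done[rank] += len(f.read(nbytes))

    threads = [threading.Thread(target=run_rank, args=(rank, records))
               for rank, records in enumerate(traces)]
    start = time.perf_counter()
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return bytes_done, time.perf_counter() - start
```

For example, two ranks that each write a 4096-byte block, synchronize, and then read the other rank's block could be described as `[[("write", 0, 4096), ("barrier",), ("read", 4096, 4096)], [("write", 4096, 4096), ("barrier",), ("read", 0, 4096)]]`. Pointing `path` at files on different storage systems would then give a rough comparison of elapsed replay time, which is the kind of question the tool is described as answering.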
Files
Filename: Banerjee_Shankha_T_2012.pdf
Size: 419.35 Kb
Approximate Download Time (Hours:Minutes:Seconds)
- 28.8 Modem: 00:01:56
- 56K Modem: 00:00:59
- ISDN (64 Kb): 00:00:52
- ISDN (128 Kb): 00:00:26
- Higher-speed Access: 00:00:02