From: tingyu (tz9_at_msstate.edu)
Date: Mon Jul 07 2003 - 08:28:45 PDT
Dear Sir, There is a question making me little puzzle on ur problem: since the project supports MPI program checkpoint, how do u define the "parallel MPI program checkpoint" ? As far as i understand, it seems in ur implementation, there is mechanism for cleaning up the messages transmitted in the network, then all of the processes invloved in this communication will be suspended, and later all of the processes (? not for sure) will be migrated to other node ans restarted. Is it correct? I checked the paper but still didn't get a comprehensive image.. Thanks a lot for ur help! Tingyu July 7,2003