cr_checkpoint error

From: Yuan Wan (ywan_at_ed.ac.uk)
Date: Fri Feb 29 2008 - 05:04:25 PST

  • Next message: Paul H. Hargrove: "Re: cr_checkpoint error"
    Hi all,
    
    I get the following error messege during my cross-node checkpoint/restart 
    test:
    ---------------------------------------------------------------------------
    Checkpoint command: cr_checkpoint -f context_4644030.2 --run 10983
    ioctl(/proc/checkpoint/ctrl,  CR_OP_CHKPT_REQ): Operation not permitted
    ---------------------------------------------------------------------------
    
    the status value returned by this operation is 1 ranther than 0
    
    This error appears randomly on some nodes for some jobs, but the same 
    checkpoint operation of other jobs which are exactly of same codes works fine.
    
    Can anyone explain this error?
    
    --Yuan
    
    Yuan Wan
    -- 
    Unix Section
    Information Services Infrastructure Division
    University of Edinburgh
    
    tel: 0131 650 4985
    email: ywan@ed.ac.uk
    
    2012 Computing Services, JCMB
    The King's Buildings,
    Edinburgh, EH9 3JZ
    

  • Next message: Paul H. Hargrove: "Re: cr_checkpoint error"