A new version of the ULFM specification based on the upcoming MPI 3.1 and the discussions going on at the MPI Forum Meeting in Chicago in December 2013 has been posted under the ULFM Specification item. Head to ULFM Specification for more info.
An flyer has been created for SC’13. It puts further emphasis on use-cases and features updated graphs showcasing performance while failures are being digested by the system.
Download the flyer
A new version of the ULFM specification based on the upcoming MPI 3.1 has been posted under the ULFM Specification item. Head to ULFM Specification for more info.
To better accommodate management of the ULFM repository, the URL has changed. It can now be found at:
All of your previous SSH keys will continue to work. All you need to do is change your .hg/hgrc file to point to the new repository.
To clarify the difference between installation/setup and usage, the old Usage Guide has been moved to ULFM Setup and a new Usage Guide has been put in place to provide instruction and examples for using ULFM constructs in MPI code. For now, this example section provides the code outlined in the ULFM specification, but this will eventually be amended to include more complete and unique examples. You can find the both of these pages in the menu bar, under User Level Failure Mitigation.
The User Level Failure Mitigation team will be at this year’s Supercomputing conference in Salt Lake City. Come visit us at The University of Tennessee booth (#3010) to hear more about our work as well as lots of other interesting research going on at UTK.
We have a flyer for User Level Failure Mitigation to show new results and design. It can be found at this link: SC12 ULFM Flyer.
The second beta for the ULFM implementation in Open MPI has been posted. This is a minor update to fix agreement operations. The changelog is relatively small and can be found in the Release Notes section of ULFM.