CIFellows Postdoc Job Opportunities

A courtesy job posting service for computing postdocs

* Post a Position
* Edit a Position

Click for Available Candidate Profiles

Postdoctoral or Post-Master’s Research Associate

University/Research Lab: Oak Ridge National Laboratory

Department: Computer Science and Mathematics Division
Location: Oak Ridge, TN
Salary:
Original Posting Web Page: http://orise.orau.gov/sep/needs/files/ORNL10-05-CSMD.pdf
Keywords: , , , , ,

Posted on: Monday, October 19th, 2009

Job Details:

The Oak Ridge National Laboratory invites applications for Postdoctoral and Post-Master’s term research appointments in the Computer Science Research Group of the Computer Science and Mathematics Division. The appointment is initially limited to one year from start with an option for a second year contingent to funding availability. It is part of a research project that aims at developing a soft error resilience strategy for future-generation extreme-scale high-performance computing (HPC) systems. Soft errors are an emerging threat for these systems. Uncorrectable errors that occur in an ECC memory module once within a few million hours can cause a system error rate of a few hours. Also, undetected errors (silent data corruptions) are becoming a problem as well.

This project targets two different solutions: (1) checkpoint storage virtualization to significantly improve checkpoint/restart efficiency, and (2) software dual-modular redundancy (DMR) to eliminate rollback/recovery. The planned checkpoint storage virtualization aggregates a variety of back-end resources, such as flash, memory, or both, and uses them in conjunction with traditional parallel file systems. The core concept of the planned DMR technology relies on software-level replication of computational processes and on process cloning for fast recovery. The project will involve multiple facets of software research and development (design, implementation and evaluation) in storage systems, rollback recovery and process redundancy, including but not limited to:

  • Storage: Distributed storage system, parallel I/O, user-space file system (FUSE), aggregated storage, caching, storage and I/O virtualization, LUSTRE parallel file system, SSD, performance analysis of storage systems
  • Redundancy: Active replication, state-machine replication, process group communication, process cloning/migration, message logging, performance analysis of fault tolerance protocols
  • OS/RTE: Linux, MPI

Qualifications:

The successful candidate is expected to have experience in C programming, system software development (Linux kernel development is a plus), high-performance computing, file systems development, network programming, and fault tolerance for parallel and distributed systems (active replication is a plus). This person will have excellent communication skills and the ability to work as part of a team. Applicants should clearly state relevant experience and skills, including level of proficiency, and indicate whether they are seeking a research-oriented or software development position.

For a postdoctoral position, an earned Ph.D. in computer science, computer engineering or a closely related discipline is required. A successfully defended Ph.D. dissertation is considered acceptable to start the appointment with the expectation that the Ph.D. will be granted at the close of the next academic term of the granting institution.

For a post-master’s position, an earned master’s in computer science, computer engineering or a closely related discipline is required. For this appointment, the master’s degree must have been completed and granted by the university at the time of starting the appointment.

Applicants cannot have received the most recent degree more than five years prior to the date of application and must complete all degree requirements before starting their appointment.

How to Apply:

Qualified applicants may apply online at https://www2.orau.gov/ORNL_POST/. All applicants will need to register before they can begin the online application. For complete instructions, on how to apply, please see the instructions at
http://www.orau.gov/orise/edu/ornl/ornl-pdpm/application.htm.

NOTE: Please use Microsoft Internet Explorer Browser to apply for the position you desire. Once at the site, please select the “New Applicant user? Click the Here to Register” link. Once you have registered you will receive a confirmation email. At that point you will be able to fill out the on line application.

This appointment is offered through the ORNL Postgraduate Research Associates Program and is administered by Oak Ridge Associated Universities (ORAU). This appointment is open to all qualified U.S. and non-U.S. citizens without regard to race, color, age, religion, sex, national origin, physical or mental disability, or status as a Vietnam-era veteran or disabled veteran.

Application Deadline:

Contact Information:

Send questions about this position to: Christian Engelmann

E-Mail: EMAIL OBFUSCATED

Phone: 865 574 3132

Application Deadline:

Categories Posted To:

Networks / Operating Systems, Numerical/Scientific Computing / HPC / Data-Intensive Scalable Computing

twitter-icon

Browse Posts in other Research Areas