Senior Software Engineer - HPC

Job Locations US-Remote
Job ID
2024-4786
Name Linked
Remote: US
Country
United States
City
Remote

Overview

DDN Storage is seeking great candidates to join our dynamic team of passionate customer-enabling technologists!

 

This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DDN Storage is a global market leader renowned for powering many of the world's most demanding AI data centers, in industries ranging from life sciences and healthcare to financial services, autonomous cars, Government, academia, research and manufacturing.

 

"DDN's A3I solutions are transforming the landscape of AI infrastructure." – IDC

 

“The real differentiator is DDN. I never hesitate to recommend DDN. DDN is the de facto name for AI Storage in high performance environments” - ~ Marc Hamilton VP, Solutions Architecture & Engineering | NVIDIA

 

DDN Storage is the global leader in AI and multi-cloud data management at scale. Our cutting-edge storage and data management solutions are designed to accelerate AI workloads, enabling organizations to extract maximum value from their data. With a proven track record of performance, reliability, and scalability, DDN Storage empowers businesses to tackle the most challenging AI and data-intensive workloads with confidence.

 

Our success is driven by our unwavering commitment to innovation, customer-centricity, and a team of passionate professionals who bring their expertise and dedication to every project. This is a chance to make a significant impact at a company that is shaping the future of AI and data management.

 

Our commitment to innovation, customer success, and market leadership makes this an exciting and rewarding role for a driven professional looking to make a lasting impact in the world of AI and data storage.

Job Description

We are looking for a Senior Software Engineer for our Lustre Networking team which focuses on developing and maintaining the Lustre Network (LNet) kernel module and associated test infrastructure. The ideal candidate will have experience designing, implementing, and shipping software using modern development tooling and practices. This is remote position. 


Responsibilities for this role include, but are not limited to the following:

• Working with next-generation networking technologies to ensure that DDN’s solutions remain the most scalable and performant available.
• Evolve LNet codebase to interact effectively with kernel and other third-party software.
• Design and develop tests and test suite infrastructure for LNet.
• Working with customers, both interactively in live debug sessions and analyzing logs in order to provide code fixes where applicable
• Provide code reviews for LNet patches submitted by other engineers
• Work with the Engineering manager and a geographically distributed team to understand product requirements, feature development, and support priorities.
• Contribute to LNet product documentation for new features, while validating and updating the existing documentation.

 

Qualifications:

• BS/MS in Computer Science, Computer Engineering or equivalent degree/experience.
• 5+ years of experience working in Linux environments with C, Python, Bash scripting.
• Familiarity with networking technologies, including TCP, Infiniband, RDMA.
• Understanding of open-source Kernel development.
• Good grasp of multi-threaded programming and parallel computing.
• Advanced problem-solving skills
• Knowledge of Linux internals and administration is desired
• Strong team player with good written and verbal English communication skills
• Excellent time management skills
• Familiarity with Parallel File Systems, in particular Lustre, is desirable.
• Experience with JIRA, Jenkins, Gerrit, Git and Github preferred

 

#LI-Remote

DDN

Our team is highly motivated and focused on engineering excellence.

We look for individuals who appreciate challenging themselves and thrive on curiosity.

Engineers are encouraged to work across multiple areas of the company.

We operate with a flat organizational structure.

All employees are expected to be hands-on and to contribute directly to the company’s mission.

Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important.

All engineers and researchers are expected to have strong communication skills.

They should be able to concisely and accurately share knowledge with their teammates.

 

Interview Process: After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 30-minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews:

  • Coding assessment in a language of your choice.
  • Systems design: Translate high-level requirements into a scalable, fault-tolerant service.
  • Systems hands-on: Demonstrate practical skills in a live problem-solving session.
  • Project deep-dive: Present your past exceptional work to a small audience.
  • Meet and greet with the wider team.
  • Our goal is to finish the main process within one week.
  • Every application is reviewed by a member of our technical team.

 

DataDirect Networks, Inc. is an Equal Opportunity/Affirmative Action employer.  All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity, gender expression, transgender, sex stereotyping, sexual orientation, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.

Options

Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed