Senior DevOps Engineer

Job Locations: US-Remote
Job ID: 2024-4934
Name Linked: Remote: US
Country: United States
City: Remote
Worker Type: Regular Full-Time Employee

Overview

This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DataDirect Networks (DDN) is a global market leader renowned for powering many of the world's most demanding AI data centers, in industries ranging from life sciences and healthcare to financial services, autonomous cars, government, academia, research, and manufacturing.

 

"DDN's A3I solutions are transforming the landscape of AI infrastructure." – IDC 

 

“The real differentiator is DDN. I never hesitate to recommend DDN. DDN is the de facto name for AI Storage in high performance environments” - Marc Hamilton, VP, Solutions Architecture & Engineering | NVIDIA 

 

DDN is the global leader in AI and multi-cloud data management at scale. Our cutting-edge data intelligence platform is designed to accelerate AI workloads, enabling organizations to extract maximum value from their data. With a proven track record of performance, reliability, and scalability, DDN empowers businesses to tackle the most challenging AI and data-intensive workloads with confidence. 

 

Our success is driven by our unwavering commitment to innovation, customer-centricity, and a team of passionate professionals who bring their expertise and dedication to every project. This is a chance to make a significant impact at a company that is shaping the future of AI and data management. 

 

Our commitment to innovation, customer success, and market leadership makes this an exciting and rewarding role for a driven professional looking to make a lasting impact in the world of AI and data storage. 

Job Description

About the Team:  

 
Our DevOps team is at the forefront of innovation, driving high-performance and highly scalable emerging technologies with Infinia Software Defined Storage (SDS). Infinia SDS is designed to support multiple client protocols, delivering robust, enterprise-grade storage solutions for cutting-edge use cases. With a global presence of expert developers, QA, and DevOps engineers, we are transforming the way software-defined storage operates in both on-premises and cloud environments. 

 

Why Join Us? 

 

We provide a unique blend of on-premises and cloud DevOps—more than just managing infrastructure, you will have the chance to deploy critical applications on Kubernetes and shape infrastructure as code (IaC) from the ground up. Here’s a glimpse of the exciting work we do: 

 

  • Infrastructure-as-Code: Evolve our infrastructure with Terraform, Python, and Bash, building cloud infrastructure on GCP, with plans to expand to other cloud providers. 

  • On-Premises Excellence: Deploy self-sustaining infrastructure that delivers mission-critical applications faster while improving flexibility and performance. 

  • Build and Release Acceleration: Reduce build times by integrating with Nexus/JFrog Artifactory, streamlining build workflows, and connecting tools to self-managed artifact repositories. 

 

Cutting-Edge Tools We Use:

 

Our infrastructure and tooling reflect our commitment to modern DevOps practices. Here’s how we empower engineers to stay at the top of their game: 

 

  • Kubernetes and ArgoCD: Orchestrate scalable deployments with Kubernetes and leverage ArgoCD for automated and declarative application management. 

  • Packer + MaaS: Create custom bare-metal images with Packer and deploy them seamlessly using MaaS (Metal as a Service) for Ubuntu. 

  • K3s + KubeVirt: Build a self-provisioning VM environment to empower developers and QA engineers. 

  • MinIO & Harbor: Utilize MinIO for S3 storage services, and Harbor for container image management. 

  • GitHub Enterprise (GHE) + Actions: Collaborate using GHE, with DevOps leading the development and maintenance of workflow actions for CI/CD pipelines. 

  • Docker: Build containerized environments for both cloud and on-prem applications. 

  • Quay & GCS Buckets: Manage external artifacts with Quay and Google Cloud Storage (GCS). 

 

What You’ll Work On:

 

  • Self-Deployable Environments: Our DevOps engineers are building a seamless, self-service environment to empower developers and QA teams. 

  • Bare-Metal & Cloud Integration: Get hands-on experience with bare-metal provisioning using MaaS, while deploying infrastructure in hybrid models with GCP and expanding to other cloud providers. 

  • Streamlining Build Systems: We integrate tools like Nexus/JFrog Artifactory to accelerate builds and optimize workflows, ensuring smoother development and faster delivery. 

  • Constant Innovation: We believe in continuous improvement, constantly enhancing infrastructure-as-code to make deployments faster, more reliable, and easily replicable. 

 
 
Required Skills: 

 
All of the following skills are mandatory: 

  • Application Deployments within Kubernetes using tools like Helm or ArgoCD. 

  • Docker experience. 

  • Configuration Management with Ansible or similar tools. 

  • Monitoring and Metrics using Prometheus, Grafana, or equivalent. 

  • Bash Scripting expertise. 

  • CI/CD Pipeline Development using GitHub/GitLab, Jenkins, and related tools (e.g., YAML, Groovy). 

  • Basic Linux Proficiency for underlying infrastructure management. 

Preferred Skills: 

The following skills are desirable but not mandatory: 

  • Linux Administration, Networking, and Troubleshooting. 

  • On-prem Kubernetes Deployment and Administration. 

  • Virtualization experience. 

  • Python or Golang programming knowledge. 

 

DDN

Our team is highly motivated and focused on engineering excellence.

We look for individuals who appreciate challenging themselves and thrive on curiosity.

Engineers are encouraged to work across multiple areas of the company.

We operate with a flat organizational structure.

All employees are expected to be hands-on and to contribute directly to the company’s mission.

Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important.

All engineers and researchers are expected to have strong communication skills.

They should be able to concisely and accurately share knowledge with their teammates.

 

Interview Process: After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 30-minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews followed by a meet and greet:

  • Coding assessment in a language of your choice.
  • Systems design: Translate high-level requirements into a scalable, fault-tolerant service.
  • Systems hands-on: Demonstrate practical skills in a live problem-solving session.
  • Project deep-dive: Present your past exceptional work to a small audience.
  • Meet and greet with the wider team.

Our goal is to finish the main process within one week. We don’t rely on recruiters for assessments; every application is reviewed by a member of our technical team.

 

DataDirect Networks, Inc. is an Equal Opportunity/Affirmative Action employer.  All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity, gender expression, transgender, sex stereotyping, sexual orientation, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.

 

