DevOps

Senior

-

Ongoing

TotalCloud – Where multi-cloud future is engineered

TotalCloud is an automation-first technology leader, focused on helping clients maximize their multi-cloud (AWS, Azure, GCP) value. Our mission is to transition clients to a state of Continuous IT Operation through advanced automation, DevOps transformation, and security best practices.

Our core focus areas are: Cloud Operations &Automation, Cost Optimization, and building robust, reliable CI/CD pipelines. Join us to define the future of cloud environments!

Why TotalCloud?

Our success is driven by innovation, customer-centricity, and a team of passionate professionals. This is your chance to make a significant impact in the world of AI and data management.

Our success is driven by our unwavering commitment to innovation, customer-centricity, and a team of passionate professionals who bring their expertise and dedication to every project. This is a chance to make a significant impact at a company that is shaping the future of AI and data management.

Your mission

We are seeking a hands-on, self-motivated DevOps engineer to support and manage our mission-critical dedicated benchmarking and ISV (Independent Software Vendor) lab environments. You will be a key enabler for high-performance benchmarking, AI/ML workloads, and ISV testing. This role will oversee physical and virtual infrastructure, including Linux systems, containers, virtual machines, networks, and storage.

Key responsibilities (The core)

  • Linux Mastery: Maintain and manage a fleet of Linux-based servers, including provisioning, monitoring, and patching.
  • Virtualization & Containers: Deploy and manage VMs and containers (Docker/K8s), supporting diverse benchmarking workloads.
  • Network engineering: Configure and maintain network infrastructure, including IP address management, routing, VLANs, and firewall rules.
  • Storage systems: Set up, maintain, and optimize storage systems (Object, Block, and NFS).
  • Secure access: Coordinate and manage secure user access to lab systems and software environments.
  • CI/CD automation: Support DevOps tooling and automation efforts using CI/CD pipelines where applicable.
  • Lab health ownership: Own lab health, including hardware lifecycle, physical troubleshooting, and environmental monitoring.
  • Documentation: Document procedures, topologies, and system configurations meticulously.

Required qualifications (The must-haves)

3+ years of hands-on experience with:

  • Linux system administration.
  • Networking (routing, subnets, firewalls, VLANs).
  • Virtualization (KVM, VMware, or similar).
  • Containers (Docker, Pod man, Kubernetes).
  • Hardware troubleshooting (x86 servers, switches, cabling).
  • Strong experience with shell scripting, basic automation, and infrastructure management tools.
  • Hands-on expertise with storage constructs (RAID, NFS,LUNs, object stores).
  • Ability to work independently, troubleshoot complex systems, and manage multiple priorities.
  • Excellent documentation and communication skills.
  • Experience that sets you apart (nice-to-haves)
  • DevOps tooling: Experience with Ansible, Terraform, Jenkins, GitLab CI/CD, or similar tools. High-performance systems: Exposure to benchmarking environments, ISV certification labs, AI/ML workloads, HPC systems, or GPU-based systems.

TotalCloud's 4 pillars of culture

Any successful employee at TotalCloud will demonstrate these four essential capabilities:

  • Self-starter: Takes independent action to identify and solve problems. Seeks out relevant information needed to make decisions. Gets involved with new initiatives.
  • Success/achievement orientation: Delivers quality results consistently. Targets, achieves (or exceeds) measurable results. Sets challenging goals, focuses on critical priorities, and is accountable.
  • Problem solving: Recognizes problems and responds with a systematic assessment that identifies and addresses the cause of the issue. Practical, realistic, and resourceful.
  • Innovative: Builds and improves key business processes that enhance the effectiveness of TotalCloud. Generates new ideas, challenges the status quo, and solves problems creatively.

Apply for this job

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.