About the role
<p>We are seeking a Hardware Solutions Engineer to design, validate, and support customized server and infrastructure solutions for customers across AI, HPC, and enterprise environments. This role bridges customer requirements with engineering execution, ensuring scalable, high-performance, and reliable hardware solutions.</p><p></p><h3><strong>Key Responsibilities</strong></h3><ul><li>Design, architect, and implement high-performance computing (HPC) and AI/ML infrastructure solutions, including GPU-accelerated clusters, high-speed storage, and low-latency networking.</li><li>Lead the end-to-end solution lifecycle from pre-sales architecture through system integration, validation, and deployment in customer environments.</li><li>Collaborate with sales and customers to translate requirements into scalable AI/HPC system designs, including compute, GPU topology, interconnect (InfiniBand/Ethernet), and storage architectures.</li><li>Drive OEM server and appliance configuration, including BOM definition, firmware/BIOS tuning, thermal/power considerations, and manufacturability alignment.</li><li>Work closely with manufacturing and integration teams to ensure system-level validation, rack integration, burn-in testing, and production readiness for complex server appliances.</li><li>Deliver technical presentations, demos, and proof-of-concepts (POCs) focused on AI workloads (training/inference), HPC simulations, and data-intensive applications.</li><li>Support cluster bring-up and optimization, including OS provisioning, workload managers (e.g., Slurm), container platforms, and GPU software stacks (CUDA, drivers, AI frameworks).</li><li>Provide advanced troubleshooting and performance tuning across compute, GPU, storage, and networking subsystems.</li><li>Perform on-site or remote deployment support, including rack-level integration, cluster commissioning, and acceptance testing.</li><li>Develop and maintain technical documentation, including solution architectures, integration guides, and manufacturing/test procedures.</li><li>Interface with cross-functional teams (engineering, manufacturing, support) to resolve complex system-level issues and improve product quality.</li><li>Stay current with emerging technologies in AI infrastructure, HPC architectures, GPU platforms, and data center design.</li></ul>