Solutions Architect (Cloud & MLOps)

Job Description

About the Company

Our client is at the forefront of the AI revolution, providing cutting-edge infrastructure that's reshaping the landscape of artificial intelligence. They offer an AI-centric cloud platform that empowers Fortune 500 companies, top-tier innovative startups and AI researchers to drive breakthroughs in AI. The company is committed to building full-stack infrastructure to service the explosive growth of the global AI industry, including large-scale GPU clusters, cloud platforms, tools, and services for developers.

  • Company Type: Publicly traded

  • Product: AI-centric GPU cloud platform & infrastructure for training AI models

  • Candidate Location: Remote anywhere in the US

Their mission is to democratize access to world-class AI infrastructure, enabling organizations of all sizes to turn bold AI ambitions into reality. At the core of their success is a culture that celebrates creativity, embraces challenges, and thrives on collaboration.

The Opportunity

We're seeking a Solutions Architect to join our client's dynamic team. This role offers a unique chance to shape the adoption of AI technology and work with cutting-edge cloud and AI infrastructure. As a trusted advisor, you'll help clients leverage GPU-powered cloud solutions to optimize their AI and machine learning workflows.

What You'll Do

  • Serve as a trusted technical advisor to clients on GPU cloud technologies

  • Conduct PoCs, workshops, and training sessions to educate clients

  • Develop tailored solution architectures based on client requirements

  • Design and document Infrastructure as Code solutions

  • Optimize client pipeline performance, scalability and resource utilization

  • Act as the primary expert on customer scenarios for the Product, Technical Support, and Marketing teams

What You Bring

  • At least 5 years of experience as a Cloud Solutions Architect, System/Network Engineer, Developer, or similar technical role focused on cloud computing

  • Strong hands-on experience with Infrastructure as Code and configuration management tools (Terraform, Ansible) and Kubernetes, plus Python coding skills

  • Solid understanding of GPU computing practices for ML training and inference workloads, including GPU software stack components (CUDA, OpenCL)

  • Ideally, experience with HPC/ML orchestration frameworks (Slurm, Kubeflow) and deep learning frameworks (TensorFlow, PyTorch)

  • Knowledge of cloud ML tools from industry leaders (NVIDIA, AWS, Azure, Google)

  • Bachelor's degree in a relevant field (advanced degree is a plus)

  • Excellent communication skills

Key Attributes for Success

  • Passion for AI and transformative technologies

  • Results-driven mindset and problem-solver mentality

  • Adaptability and ability to thrive in a fast-paced startup environment

  • Comfortable working with an international team and diverse client base

Why Join?

  • Competitive compensation: $140,000-$175,000 per year

  • Full medical benefits: 100% company-paid medical, dental, and vision coverage for employees and families

  • 401(k) plan with a 4% match and immediate vesting

  • Company-paid short-term disability, long-term disability, and life insurance coverage

  • 20 weeks paid parental leave for primary caregivers, 12 weeks for secondary caregivers

  • Flexible remote work environment

  • Up to $85/month for mobile and internet

  • Startup culture with stability: excitement and innovation of a startup backed by the resources of an established company

  • Work with state-of-the-art AI and cloud technologies, including the latest NVIDIA GPUs (H100, L40S, with H200 and Blackwell chips coming soon)

  • Be part of a team that operates one of the most powerful commercially available supercomputers

  • Contribute to sustainable AI infrastructure, with energy-efficient data centers that recover waste heat to warm nearby residential buildings

Interviewing Process

  • Level 1 - Interview with the Talent Acquisition Lead (General fit, Q&A)

  • Level 2 - Interview with the Hiring Manager (Skills assessment)

  • Level 3 - Interview with the C-level (Final)

  • Reference and Background Checks: conducted after successful interviews

  • Job Offer: provided to the selected candidate

We are proud to be an equal opportunity workplace and are committed to equal employment opportunity regardless of marital status, ancestry, physical or mental disability, genetic information, veteran status, gender identity or expression, or any other characteristic protected by applicable federal, state or local law.

Compensation Range: $140K - $180K

San Francisco, CA
Full time

Published on 11/25/2024
