Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Vast.ai’s cloud powers AI projects and businesses all over the world. We are democratizing and decentralizing AI computing—reshaping our future for the benefit of humanity.
We are a growing and highly motivated team dedicated to an ambitious technical plan. Our structure is flat, our ambitions are out‑sized, and leadership is earned by shipping excellence.
We seek engineers with strong intrinsic drive, a true passion for advancing the state of the art, and a mix of architecture, coding, and communication skills.
LOCATION: On-site at our office in San Francisco or Westwood, Los Angeles.
About the Role
As a Systems Software Engineer, you will work on key performance critical systems such as the daemon which orchestrates every host in our fleet. You will improve the performance, reliability and capability of our infrastructure and containerization technologies including monitoring and telemetry.
- Full-time
- On-site at either our SF or LA offices
C, C++, Python, Linux
Ideal Experience
- Programming: Strong programming skills in at least one language, ideally C++
- Linux and Virtualization: Extensive knowledge of Linux kernel internals, containerization technologies, and virtualization
- Isolation Techniques: Deep understanding of workload and network isolation techniques in multi-tenant environments
- Cloud Security: Experience in securing and hardening cloud infrastructure, particularly in environments with untrusted workloads
- Multi-tenant Security: Strong background in workload and network isolation, network security, and cloud-native security practices
- GPU Security: Experience with GPU programming and an understanding of GPU-specific security concerns.
- Expand and extend our GPU cloud daemon
- Design and deploy market-based resource management systems
- Harden code and infrastructure to meet zero‑trust standards
- Benchmark, profile, and eliminate bottlenecks across hypervisor, container, and network layers
After submitting your application, our technical team reviews your credentials. If selected, you'll proceed through the following stages:
- 15 min - Initial screening (virtual)
- 45 min - Quick dive into Vast, systems and architectures (virtual)
- 1 hour - LLM-assisted coding assessment (virtual)
- 2 hours - Meet and greet with coding assessment (on-site)
$120,000 – $180,000 + equity + benefits
Vast.ai is hiring across all experience levels with compensation commensurate with background, experience and potential.