Role: AI/ML Engineer
Location: San Jose, CA (5 days WFO)
Skills: Cloud resource allocation, auto-scaling, performance tuning, DevOps
Notice period: 2 weeks
Visa: Any (except OPT and CPT)
Note: Need at least 1 or 2 resumes by today EOD; please try to submit profiles.
Job Description
- Design and implement AI agents to optimize cloud resource allocation, auto-scaling, and performance tuning.
- Develop predictive models for failure detection, incident management, and system health monitoring.
- Automate operational workflows using machine learning and intelligent scripting.
- Integrate AI-driven insights with existing cloud monitoring tools.
- Collaborate with DevOps and SRE teams to deploy, monitor, and improve ML models in production environments.
- Conduct anomaly detection for security, cost optimization, and performance analytics.
- Continuously evaluate emerging AI technologies and tools for operational improvements.
- Maintain documentation and best practices for AI/ML integration in cloud systems.
- Bachelor's or master's degree in Computer Science, Data Science, or a related technical field, or equivalent experience.
- Proven ability to build and deploy ML models, with at least 2 years focused on infrastructure or cloud operations.
- Solid knowledge of hybrid cloud technologies (AWS, GCP, OpenStack, Kubernetes).
- Experience with Python, Jupyter, and ML libraries such as PyTorch, TensorFlow, or scikit-learn.
- Familiarity with cloud-native monitoring, logging, and automation tools (e.g., Terraform, Ansible, Prometheus, Splunk, AppDynamics).
- Comfortable working with streaming data, APIs, and telemetry systems.
- Strong communication and cross-functional collaboration skills.
- Experience with Agile and DevOps operating models, including project tracking tools (e.g., Jira), Git (or any other version control system), and CI/CD systems (e.g., GitLab, GitHub Actions, Jenkins).
- Proficient in general-purpose programming languages (Python, Go, Bash, and/or C/C++) and related development platforms and technologies.
- Deep understanding of operating systems and experience with Cisco technologies (UCS, Nexus, ThousandEyes).
- Established record of leading technical initiatives and delivering results, with a commitment to fostering a supportive work environment.
- Hard-working and dedicated to providing quality support for customers.
Key Skills
Ranked by relevance
cloud
ai
python
devops
machine learning
prometheus
tensorflow
terraform
openstack
ansible
pytorch
golang
gitlab
splunk
nexus
bash
cicd
jira
git
aws
gcp
Posted: Aug 11, 2025
Type: Full-time
Level: Entry
Location: San Jose
Company: Snowrelic Inc
Industries: IT Services, IT Consulting
Categories: Engineering, Information Technology