ML Engineer – Computer Vision (Generative AI)

Track This Job

Add this job to your tracking list to:

Monitor application status and updates
Change status (Applied, Interview, Offer, etc.)
Add personal notes and comments
Set reminders for follow-ups
Track your entire application journey

Save This Job

Add this job to your saved collection to:

Access easily from your saved jobs dashboard
Review job details later without searching again
Compare with other saved opportunities
Keep a collection of interesting positions
Receive notifications about saved jobs before they expire

AI-Powered Job Summary

Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.

Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.

About the Role

We are looking for a Machine Learning Engineer – Computer Vision with experience in image, audio and video generation to join our Computer Vision team. You’ll work on cutting-edge projects involving multi modal models, diffusion models, GANs to develop social media content editing and generation tools.

Key Responsibilities

Design and implement generative models for image, audio and video creation.
Build robust CV pipelines for object detection, segmentation, classification, clustering and dataset augmentation.
Collaborate with marketing campaign management to generate real time social media content optimization.
Manage image / video datasets and preprocessing strategies for high-quality generation.
Stay current on advancements in generative media and advise / inform the team on updating our models / feature space.

Requirements

Required:
3 years of hands-on ML / DL experience with a focus on Computer Vision.
1 to 2 years of experience in using Generative AI models, ideally with diffusion or GAN-based methods.
Strong background in python programming and DL frameworks such as PyTorch, TensorFlow, scikit-learn, Hugging Face, etc.
Proficiency in building and managing visual data pipelines and augmentations.
Understanding of social media content creation strategies and the use of generative AI in creating content.
Nice-to-have:
Masters’ or PhD in a relevant field.
Working knowledge of LLMs, text-to-image, image-to-video, etc. relevant to multi-modal models.
Some knowledge of audio processing and generation models or frameworks.

Apply

Post Date

2025-06-13

Job Type

Employment type

Full-time