Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
About the Role
We are looking for a Machine Learning Engineer – Computer Vision with experience in image, audio and video generation to join our Computer Vision team. You’ll work on cutting-edge projects involving multi modal models, diffusion models, GANs to develop social media content editing and generation tools.
Key Responsibilities
- Design and implement generative models for image, audio and video creation.
- Build robust CV pipelines for object detection, segmentation, classification, clustering and dataset augmentation.
- Collaborate with marketing campaign management to generate real time social media content optimization.
- Manage image / video datasets and preprocessing strategies for high-quality generation.
- Stay current on advancements in generative media and advise / inform the team on updating our models / feature space.
Requirements
- Required:
- 3 years of hands-on ML / DL experience with a focus on Computer Vision.
- 1 to 2 years of experience in using Generative AI models, ideally with diffusion or GAN-based methods.
- Strong background in python programming and DL frameworks such as PyTorch, TensorFlow, scikit-learn, Hugging Face, etc.
- Proficiency in building and managing visual data pipelines and augmentations.
- Understanding of social media content creation strategies and the use of generative AI in creating content.
- Nice-to-have:
- Masters’ or PhD in a relevant field.
- Working knowledge of LLMs, text-to-image, image-to-video, etc. relevant to multi-modal models.
- Some knowledge of audio processing and generation models or frameworks.