Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
We are looking for an innovative AI Data Scientist to design and implement synthetic data solutions that power advanced AI workflows. This role involves creating high-fidelity, privacy-compliant datasets, applying generative modeling techniques, and collaborating with cross-functional teams to integrate data into GenAI applications.
Key Responsibilities
- Synthetic Data Generation: Design, generate, and validate synthetic datasets across multiple formats (tables, time-series, images, PDFs, JSON, Excel, CSV).
- Generative Modeling & Privacy: Apply techniques such as GANs, VAEs, statistical resampling, and data masking to simulate realistic, privacy-aware data.
- Data Enrichment & Classification: Clean, tag, classify, and enrich real or synthetic data to support risk detection, insights, and LLM-driven decision workflows.
- Research & Innovation: Explore emerging techniques for structured output extraction and data simulation; share findings, tools, and best practices with the team.
- Visualization & Reporting: Build dashboards and visualizations to summarize metrics, simulation results, and insights for stakeholders.
- Cross-Functional Collaboration: Work closely with AI Data Engineers and Frontend Developers to embed synthetic and structured data into GenAI applications and POCs.
- Compliance & Governance: Ensure all data usage adheres to privacy, regulatory, and compliance standards in partnership with legal and risk teams.
- Strong proficiency in Python and libraries such as SDGym, Synthpop, Faker, or Synthea.
- Expertise in pandas, NumPy, scikit-learn, and data wrangling for structured/unstructured data.
- Experience with Jupyter-based experimentation and visualization tools (matplotlib, seaborn, Plotly).
- Familiarity with structured output tooling (e.g., Pydantic, LangChain) and integration with LLM pipelines.
- Bonus: Hands-on experience with PyTorch or TensorFlow for custom generative models.
- Comfortable working in notebook environments (Jupyter, Databricks) for exploration and modeling.
- Knowledge of BI tools (Power BI, Streamlit, Dash) for stakeholder-facing dashboards is a plus.
About Us
Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 50 countries to navigate their digital transformation. With over four decades of experience in managing the systems and workings of global enterprises, we expertly steer our clients through their digital journey. We do it by enabling the enterprise with an AI-powered core that helps prioritize the execution of change. We also empower the business with agile digital at scale to deliver unprecedented levels of performance and customer delight. Our always-on learning agenda drives their continuous improvement through building and transferring digital skills, expertise, and ideas from our innovation ecosystem.
EEO
Infosys provides equal employment opportunities to applicants and employees without regard to race; color; sex; gender identity; sexual orientation; religious practices and observances; national origin; pregnancy, childbirth, or related medical conditions; or disability.
Key Skills
Ranked by relevanceReady to apply?
Join Infosys and take your career to the next level!
Application takes less than 5 minutes

