TechSci Research - Lead Data Scientist - LLM Models

TechSci ResearchIndia5 days ago

Full-timeInformation Technology

Track This Job

Add this job to your tracking list to:

Monitor application status and updates
Change status (Applied, Interview, Offer, etc.)
Add personal notes and comments
Set reminders for follow-ups
Track your entire application journey

Save This Job

Add this job to your saved collection to:

Access easily from your saved jobs dashboard
Review job details later without searching again
Compare with other saved opportunities
Keep a collection of interesting positions
Receive notifications about saved jobs before they expire

AI-Powered Job Summary

Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.

Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.

About The Role

The design and development of a cutting-edge application powered by large language models (LLMs).

This tool will provide market analysis and generate high-quality, data-driven periodic insights.

You will play a critical role in building a scalable and intelligent system that integrates structured data, NLP capabilities, and domain-specific knowledge to produce analyst-grade content.

Key Responsibilities

Design and develop LLM-based systems for automated market analysis.
Build data pipelines to ingest, clean, and structure data from multiple sources (e.g., market feeds, news articles, technical reports, internal datasets).
Fine-tune or prompt-engineer LLMs (e.g., GPT-4.5, Llama, Mistral) to generate concise, insightful reports.
Collaborate closely with domain experts to integrate industry-specific context and validation into model outputs.
Implement robust evaluation metrics and monitoring systems to ensure quality, relevance, and accuracy of generated insights.
Develop and maintain APIs and/or user interfaces to enable analysts or clients to interact with the LLM system.
Stay up to date with advancements in the GenAI ecosystem and recommend relevant improvements or integrations.
Participate in code reviews, experimentation pipelines, and collaborative research Required :
Strong fundamentals in machine learning, deep learning, and natural language processing (NLP).
Proficiency in Python, with hands-on experience using libraries such as NumPy, Pandas, and Matplotlib/Seaborn for data analysis and visualization.
Experience developing applications using LLMs (both closedand open-source models).
Familiarity with frameworks like Hugging Face Transformers, LangChain, LlamaIndex, etc.
Experience building ML models (e.g., Random Forest, XGBoost, LightGBM, SVMs), along with familiarity in training and validating models.
Practical understanding of deep learning frameworks: TensorFlow or PyTorch.
Knowledge of prompt engineering, Retrieval-Augmented Generation (RAG), and LLM evaluation strategies.
Experience working with REST APIs, data ingestion pipelines, and automation workflows.
Strong analytical thinking, problem-solving skills, and the ability to convert complex technical work into business-relevant insights.

Preferred

Familiarity with the chemical or energy industry, or prior experience in market research/analyst workflows.
Exposure to frameworks such as OpenAI Agentic SDK, CrewAI, AutoGen, SmolAgent, etc.
Experience deploying ML/LLM solutions to production environments (Docker, CI/CD).
Hands-on experience with vector databases such as FAISS, Weaviate, Pinecone, or ChromaDB.
Experience with dashboarding tools and visualization libraries (e.g., Streamlit, Plotly, Dash, or Tableau).
Exposure to cloud platforms (AWS, GCP, or Azure), including usage of GPU instances and model hosting services.

(ref:hirist.tech)

Key Skills

Ranked by relevance

Ready to apply?

Join TechSci Research and take your career to the next level!

Application takes less than 5 minutes

Apply