Senior Data Scientist with LLM experience

Fusemachines

Canada · Full-time · Mid-Senior

About Fusemachines

Fusemachines is a leading provider of AI strategy, talent, and education services. Founded by Dr. Sameer Maskey, an Adjunct Associate Professor at Columbia University, our mission is to democratize AI. With a presence in four countries—Nepal, the United States, Canada, and the Dominican Republic—and a team of over 350 full-time employees, we leverage our global AI expertise to drive innovation and transformation for businesses worldwide.

This is a remote role. 6 months contract with us and after that will be hired as a full time employee directly with the client.

About The Role

As a Data Scientist on our team, you will contribute to new product development in a collaborative, small-team environment, writing production code for both run-time and build-time applications.

You will help design and implement data-driven solutions for complex business challenges by discovering, extracting, and modeling knowledge from large-scale natural language datasets. Your work will involve prototyping new ideas and collaborating with data scientists, product designers, data engineers, front-end developers, and domain experts to drive innovation.

This role offers the opportunity to work in a fast-paced, start-up-like culture while leveraging the resources and scale of an established company.

Responsibilities

Develop and implement LLM-based applications for various use cases
Evaluate and maintain data assets and training/evaluation datasets
Design and build pipelines for preprocessing, annotating, and managing large-scale text datasets
Collaborate with domain experts to understand requirements and ensure ML applications align with business needs
Conduct experiments and evaluate model performance to drive continuous improvements
Fine-tune and deploy large language models(LLMs) to enhance their performance on specialized tasks
Interface with other technical teams to finalize requirements
Work closely with development teams to understand complex product requirements and translate them into scalable software solutions
Implement development best practices, including coding standards, code reviews, and production-ready implementations

Requirements

Practical experience with large language models (LLMs), prompt engineering, fine-tuning RAG-based applications, and benchmarking using frameworks like LangChain
Strong background in natural language processing (NLP) with experience using spaCy, word2vec, Flair, BERT
Formal training in machine learning, including dimensionality reduction, clustering, embeddings, and sequence classification algorithms
Proficiency in Python and experience working with ML frameworks like PyTorch, TensorFlow, and Hugging Face Transformers
Experience with cloud platforms such as AWS, GCP, or Azure
Understanding of data modeling principles and complex data architectures
Experience working with relational and NoSQL databases and vector stores (e.g., MySQL, Postgres, Solr, Elasticsearch, OpenSearch)
Familiarity with distributed computing frameworks like Spark, Scala, or Ray (highly preferred)
Knowledge of API development, containerization (Docker, Kubernetes), and ML deployment (highly preferred)
Hands-on experience with ML Ops/AI Ops, including experiment tracking tools like LangFuse and DVC
Experience with deep learning frameworks such as PyTorch, Tensorflow and Hugging Face Transformers

Preferred Qualifications

MS in Data Science, Computer Science, Statistics, Machine Learning, or related field
5+ years of relevant work experience

Equal Opportunity Employer: Fusemachines is committed to fostering a diverse and inclusive workplace. We welcome applications from all qualified individuals regardless of race, color, religion, sex, sexual orientation, gender identity, national origin, age, genetic information, disability, protected veteran status, or any other legally protected status.

Powered by JazzHR

VgHk9GKT4N

Key Skills

Ranked by relevance

ai machine learning tensorflow pytorch natural language processing distributed computing containerization elasticsearch deep learning prototyping kubernetes python docker scala nosql mysql cloud spark aws gcp

Related Jobs

3 roles aligned with this opportunity

View all jobs

Data Scientist_ML (India)

2026-05-21

Full-time

Mid-Senior

India

Internet Publishing

Engineering

Data Scientist

2026-05-24

Full-time

Not Applicable

Canada

Insurance

Engineering

Senior PHP Developer

2026-05-25

Full-time

Not Applicable

Spain

Internet Publishing

Engineering

🇨🇦

Country Guide

Canada

Express Entry & tech-friendly immigration

Posted: Sep 05, 2025
Type: Full-time
Level: Mid-Senior
Location: Toronto
Company: Fusemachines

Industries

Internet Publishing

Related Jobs

3 roles aligned with this opportunity

View all jobs

Data Scientist_ML (India)

2026-05-21

Full-time

Mid-Senior

India

Internet Publishing

Engineering

Data Scientist

2026-05-24

Full-time

Not Applicable

Canada

Insurance

Engineering

Senior PHP Developer

2026-05-25

Full-time

Not Applicable

Spain

Internet Publishing

Engineering

Senior Data Scientist with LLM experience

Key Skills

Related Jobs

Data Scientist_ML (India)

Data Scientist

Senior PHP Developer

Related Jobs

Data Scientist_ML (India)

Data Scientist

Senior PHP Developer

Cookie Settings