-
M Science

Data Scientist

M Science
United States · Full-time · Associate

Title: Data Scientist

Location: New York, NY


About M Science:

M Science is a data-driven research and analytics firm, uncovering new insights for leading financial institutions and corporations. M Science is revolutionizing research, discovering new data sets, and pioneering methodologies to provide actionable intelligence. Our research teams have decades of experience working with massive amounts of unstructured data in near real-time to discern critical insights that help clients make smarter, more informed decisions. We combine the best of finance, data, and technology to create a truly unique value proposition for both financial services firms and major corporations.


Job Overview:

We are seeking a highly skilled Data Scientist to design and develop pipelines and AI/ML models and workflows on our 50+ alternative data panels. The ideal candidate will have deep expertise in mathematical and statistical modeling, and will have built models with Python, SQL, and PySpark. This person will test new data assets for the firm, develop AI/LLM tools and agents, contribute to the firm’s analytics library, and use traditional machine learning and statistical methods to improve panel data. M Science expects its data scientists to implement production code, so the ideal candidate will have experience writing well tested, performant, object-oriented code.


Responsibilities:

  • Develop agentic workflows for automated insight retrieval, data analysis, data download, and data forecasting
  • Contribute to the firm’s documented, unit-tested analytics library
  • Process, cleanse, and verify the integrity of data used for analysis
  • Create automated alerting and notification systems for deviations in data quality, validation failures, or unusual patterns
  • Evaluate new datasets for the firm
  • Design, develop, and optimize scalable and fault-tolerant data ingestion pipelines using Databricks, Airflow, Python, and Spark
  • Build resilient data pipelines that handle vendor-related issues such as delayed deliveries, schema changes, incomplete records, and data corruption


Qualifications:

  • Advanced Python for data processing, scripting, and automation
  • Fluency in PySpark/Spark and distributed data processing; pandas, dask, daft, polars also a plus
  • Excellent knowledge of multivariate statistical analysis, including but not limited to ordinary least squares, principal component analysis, factor analysis, LDA, and panel methods
  • Excellent knowledge of other ML methods including additive modeling and ensemble modeling
  • Some knowledge of LLMs and common LLM orchestration frameworks like LangChain and LangGraph
  • Experience with named entity resolution methods a strong plus
  • Familiarity with cloud data platforms (AWS) and cloud-based storage solutions
  • Strong troubleshooting skills to diagnose and resolve performance bottlenecks in data pipelines


Primary Location: New York, NY

Salary Range: $90,000-$175,000 USD/Annual

The salary offered will take into consideration an individual’s experience level and qualifications. In addition to salary, M Science offers, for eligible employees, an annual discretionary incentive bonus, competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, disability insurance; and wellness programs. M Science also offers paid time off packages that include planned time off (vacation), unplanned time off (sick leave), paid holidays and paid parental leave.

Key Skills

Ranked by relevance

python cloud machine learning data analysis storage pandas sql aws
Login to Apply
Posted
Apr 08, 2026
Type
Full-time
Level
Associate
Location
New York City Metropolitan Area
Company
M Science

Industries

Financial Services Market Research

Categories

Finance Research Engineering

Related Jobs

3 roles aligned with this opportunity

View all jobs
View Job Details
GAP Solutions, Inc.
Related

Health Data Scientist

2026-04-08

Full-time
Mid-Senior
United States
Public Health
Information Technology
View Job Details
BBVA
Related

Data Scientist GenAI (LLMs)

2026-04-08

Full-time
Mid-Senior
Spain
Banking
Engineering
View Job Details
Emonics LLC
Related

Entry Level AI Engineer

2026-04-08

Full-time
Entry
United States
Staffing
Engineering