M Science
Data Scientist Intern
M ScienceUnited States1 day ago
InternshipRemote FriendlyEngineering

Data Scientist Intern – Alternative Data

Anticipated to graduate in May 2027


Location: New York, NY | Hybrid


About M Science

M Science is a data-driven research and analytics firm, uncovering new insights for leading financial institutions and corporations. We revolutionize research by discovering new data sets and pioneering methodologies to provide actionable intelligence. Our research teams have decades of experience working with massive amounts of unstructured data in near real-time, helping clients make smarter, more informed decisions. By combining finance, data, and technology, M Science delivers a unique value proposition to financial services firms and major corporations.

The Role

We are seeking a Data Scientist Intern to join our team for the summer. This internship is designed for students passionate about alternative data, AI, and data engineering. You will gain hands-on experience building and testing data pipelines, developing AI/ML tools, and applying statistical models to real-world alternative datasets. Interns at M Science work alongside experienced data scientists and engineers to support impactful research used by hedge funds, asset managers, and other institutional clients.


Key Responsibilities

  • Assist in building and optimizing data ingestion pipelines using Python, SQL, Databricks, and Spark.
  • Support the development of AI/ML workflows and agents for automated insights, data analysis, and forecasting.
  • Contribute to data testing, validation, and quality assurance, including anomaly detection and error handling.
  • Process, cleanse, and verify the integrity of large and complex datasets.
  • Help evaluate new alternative data assets for potential research and investment use.
  • Work with senior data scientists to implement and document unit-tested analytics tools and models.
  • Collaborate with cross-functional teams to support analytics, research, and client deliverables.


What You’ll Learn

  • How to leverage alternative data and AI tools in investment research and analytics.
  • Best practices in data engineering, testing, and pipeline optimization.
  • Application of Python, SQL, PySpark, and Databricks for large-scale data processing.
  • Fundamentals of statistical modeling and machine learning, including multivariate and ensemble methods.
  • Introduction to AI/LLM frameworks and agent-based workflows.


Preferred Qualifications

  • Currently pursuing a Bachelor’s or Master’s degree in Computer Science, Data Science, Statistics, Mathematics, Engineering, or a related quantitative field anticipated to graduate in May 2027.
  • Strong programming skills in Python and SQL; experience with PySpark/Spark is a plus.
  • Familiarity with cloud platforms (AWS, Databricks) and distributed data processing.
  • Interest or exposure to machine learning, AI agents, or large language models.
  • Highly analytical, detail-oriented, and able to work independently and collaboratively.
  • Strong written and verbal communication skills.


Nice-to-Have Skills

  • Experience with LLM orchestration frameworks like LangChain or LangGraph.
  • Familiarity with named entity resolution or other NLP methods.
  • Prior experience in data testing, anomaly detection, or production-quality code.


Why M Science?

  • Hands-on experience with real alternative datasets and AI tools used in financial research.
  • Work alongside senior data scientists and engineers on impactful projects.
  • Hybrid work model offering both in-office collaboration and flexibility.
  • Exposure to cutting-edge analytics in finance, technology, and data science.


Hourly Rate: $45/hr USD

Key Skills

Ranked by relevance