10 Hrs Per Week for 6 months
About the Company: Artificial General Intelligence (AGI) Data Services is at the forefront of AI innovation, specializing in the development and refinement of large language models (LLMs). Our mission is to create AI systems that can understand and solve complex mathematical problems, revolutionizing the fields of scientific computing, data analysis, and mathematical research.
About the Role: As an LLM Evaluation Expert specializing in Mathematics, you will play a crucial role in assessing and improving our language models' mathematical capabilities. Your expertise will be instrumental in evaluating LLM-generated mathematical solutions, making high-level judgments, and setting the standard for what constitutes excellent AI-assisted mathematical problem-solving.
Responsibilities:
- Critically analyze and evaluate mathematical responses generated by our LLMs across various fields of mathematics (e.g., algebra, calculus, statistics, number theory)
- Exercise expert judgment to select the most appropriate and efficient mathematical solutions from multiple LLM-generated options
- Make informed decisions on behalf of our customers, ensuring that selected solutions meet rigorous mathematical standards, are logically sound, and address specific research or application needs
- Develop and write mathematical demonstrations to illustrate "what good looks like" in AI-generated solutions, setting benchmarks for accuracy, elegance, and insight
- Provide detailed feedback and explanations for your evaluations, helping to refine and improve the LLM's understanding and output of mathematical concepts
- Collaborate with the AI research team to identify areas for improvement in the LLM's mathematical reasoning and problem-solving capabilities
- Stay abreast of the latest developments in mathematics, mathematical software, and AI to ensure our evaluations remain cutting-edge
Qualifications:
- Advanced degree (Ph.D. preferred) in Mathematics, Applied Mathematics, or a closely related field
- Extensive experience (5+ years) in mathematical research, problem-solving, or applied mathematics across multiple subfields
- Demonstrated ability to critically evaluate mathematical proofs, solutions, and reasoning for correctness, efficiency, and elegance
- Strong analytical and decision-making skills, with the ability to make complex judgments in abstract and theoretical contexts
- Excellent written and verbal communication skills, with the ability to explain complex mathematical concepts clearly
- Experience in technical writing, particularly in creating mathematical proofs, explanations, or tutorials
Preferred Skills:
- Previous experience working with or evaluating AI systems, particularly in the context of mathematical problem-solving
- Familiarity with computer algebra systems, numerical computing environments, and mathematical software packages
- Understanding of machine learning concepts, particularly as they apply to mathematical reasoning and problem-solving
- Experience in creating or contributing to mathematical textbooks, research papers, or educational content
Key Skills
Ranked by relevance
Related Jobs
3 roles aligned with this opportunity
Data Scientist, Mid
2026-05-20
Data Scientist (m/w/d)
2026-05-28
Full Stack Engineer (Remote Ireland / UK)
2026-05-28
- Posted
- Dec 19, 2024
- Type
- Part-time
- Level
- Mid-Senior
- Location
- United States
- Company
- Motion Recruitment
Industries
Categories
Related Jobs
3 roles aligned with this opportunity
Data Scientist, Mid
2026-05-20
Data Scientist (m/w/d)
2026-05-28
Full Stack Engineer (Remote Ireland / UK)
2026-05-28