Appen

Engineer Intern, GenAI Research (Summer Internship)

Posted 13 Days Ago

Remote

Hiring Remotely in US

Internship

Remote

Hiring Remotely in US

Internship

Design and implement supervised fine-tuning pipelines, create benchmarks, analyze and improve production LLMs, run experiments and hyperparameter searches, build Python tooling for training/evaluation, and document research results.

The summary above was generated by AI

Why Join This Team

Appen’s GenAI research team advances how frontier models are evaluated, improved, and deployed in production environments.

The purpose of this role is to design and implement research and engineering workflows that strengthen model performance, create new benchmarks, and improve production models without regressing on core characteristics.

This role provides hands on ownership of training and evaluation pipelines, benchmark development, and model improvement initiatives that directly influence deployed systems.

Your Impact

Design and implement a lightweight supervised fine tuning training pipeline using open source LLMs.
Create new benchmarks to evaluate frontier models across defined scientific and performance criteria.
Analyze production models to identify measurable areas for improvement.
Improve model performance through targeted retraining and hyperparameter search.
Deploy improved models while maintaining core model characteristics and avoiding regression.
Build Python tooling to automate training, evaluation, benchmarking, and experimentation workflows.
Implement structured evaluation methods, including rubric based scoring and LLM as a judge workflows.
Document experimental design, benchmark methodology, and performance results with clarity and precision.
Iterate rapidly in a research driven environment to increase model quality and reliability.

What You Bring

Current enrollment in or recent completion of a Master’s or PhD in Computer Science, AI, Machine Learning, Computer Engineering, or a closely related technical field.
Strong experience working with large language models, including supervised fine tuning, prompt engineering, or model evaluation.
Hands on experience building machine learning pipelines or research infrastructure.
Experience improving model performance through retraining or hyperparameter tuning.
Proficiency in Python and comfort working with machine learning frameworks and open source model ecosystems.
Familiarity with cloud environments such as AWS or Azure.
Strong technical problem solving ability, including use of LLMs as development aids for building and iteration.
Ability to work independently with minimal hand holding.
Strong written communication skills for summarising research and drafting technical documentation.
Ability to collaborate effectively in a remote research environment.

Additional Details

Duration: June-August
Schedule: Full-time
Work Type: Remote

Why You'll Love Working Here

At Appen, we foster a culture of innovation, collaboration, and excellence. We value curiosity, accountability, and a commitment to delivering the highest quality AI solutions for frontier models.

You’ll work on complex challenges that shape the future of AI across industries and geographies, alongside talented people in a culture that values humility over ego. You’ll have the flexibility to deliver in a way that works for you and your team, supported by tools, resources and development opportunities to continue to build your capability over time.

About Appen

Appen has been a leader in AI training data for over 30 years. We specialise in human generated data to train, fine tune, and evaluate models across generative AI, large language models, computer vision, and speech recognition. Our AI assisted data annotation platform and global crowd of more than 1 million contributors in over 200 countries support model pre training, supervised fine tuning, evaluation and benchmarking, safety and red teaming, and multilingual global expansion.

Top Skills

Python,Large Language Models,Open Source Llms,Supervised Fine Tuning,Prompt Engineering,Hyperparameter Tuning,Machine Learning Frameworks,Aws,Azure

Similar Jobs

Unanet

Architect

26 Minutes Ago

Remote

United States

157K-185K Annually

Senior level

157K-185K Annually

Senior level

Enterprise Web • Fintech • Marketing Tech • Software

The Enterprise Technical Architect oversees complex technical issues, enhances product support, and collaborates cross-functionally to refactor existing solutions for better performance and sustainability.

Top Skills: AntApache TomcatAPIsJavaMiddlewarePythonRestSQL

GameChanger

Software Engineer

27 Minutes Ago

Remote

United States

160K-180K Annually

Senior level

160K-180K Annually

Senior level

Computer Vision • Digital Media • Kids + Family • Mobile • Software • Sports

The Senior Backend Software Engineer will enhance youth sports applications by building scalable backend services, improving user experiences, and collaborating with cross-functional teams while ensuring system reliability and performance.

Top Skills: AWSNode.jsPostgresPythonRedisTypescript

BuildOps

Project Manager

29 Minutes Ago

Easy Apply

Remote or Hybrid

United States

Easy Apply

105K-140K Annually

Mid level

105K-140K Annually

Mid level

Cloud • Mobile • Software

As an Implementation Project Manager at BuildOps, lead cross-functional teams for customer onboarding, manage timelines, and ensure successful project delivery for enterprise accounts.

Top Skills: Communication ToolsProject Management SoftwareSaaS

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories