As a Prompt Engineer, you'll design and evaluate AI prompts, lead analysis teams, and collaborate on improving AI solutions for restaurants.
As a Prompt Engineer focused on Data Science and Quality Analysis, you’ll design, test, and evaluate prompts for AI systems that interact with real-world restaurant data. You’ll work cross-functionally to develop AI solutions that drive operational efficiency, improve data interpretation, and support smarter decision-making for restaurant operators.
Your work will directly shape how AI models perform in high-stakes, dynamic environments like order processing, reporting, support automation, and performance analysis.
Essential Job Functions:
- Prompt Design & Evaluation: Develop, test, and refine prompts for tasks such as text generation, question answering, data classification, and structured data extraction to optimize Voice AI performance.
- Data-Driven Analysis & Quality Measurement: Design evaluation frameworks and analyze prompt outputs using quantitative metrics, human-in-the-loop evaluation, and user feedback to identify improvement opportunities.
- Experimentation & Iteration Conduct experiments to test prompt variations, measure their business and operational impact, and iterate to enhance accuracy, consistency, and safety.
- Regression Testing & Compliance Build principled regression test suites using tools like LangFuse and Galileo to ensure prompts remain compliant and high-performing as models and use cases evolve.
- Collaboration Across Teams: Work closely with data science, product, legal, engineering, and operations teams to align prompt designs with business goals, operational workflows, and compliance requirements.
- Model Adaptation & Strategy Development prompts across multiple LLMs (GPT, LLaMA, Gemini, and Checkmate’s fine-tuned models), understanding model differences to optimize outputs effectively.
- Team Leadership & Mentorship Lead a team of analysts focused on prompt evaluation and data quality analysis, guiding prioritization, experimentation, and reporting. Collaborate with ops teams for seamless deployment and feedback loops.
- Research & Continuous Learning Stay up to date on emerging prompting techniques, LLM behaviors, evaluation frameworks, and AI safety practices to keep Checkmate’s AI solutions best-in-class.
- Strong analytical and data science skills, with hands-on experience in Python (pandas, NumPy, scikit-learn)
- Experience designing and conducting experiments and evaluations in applied AI or NLP contexts
- Proficiency in SQL and working with relational databases (e.g. MySQL, PostgreSQL, Oracle, MS SQL)
- Good understanding of data processing, quality measurement, and testing fundamentals
- Experience leading analyst or operations teams, with strong prioritization, mentorship, and collaboration skills
- Strong problem-solving mindset with a drive to explore, optimize, and automate workflows
- Excellent communication skills for presenting insights to technical and non-technical stakeholders
- Bachelor’s degree in Data Science, Computer Science, Statistics, Engineering, or a related field
- Flexible to work US hours until at least 6 pm ET, with a strong remote setup
Preferred Qualifications
- Experience with LLM evaluation and prompt engineering workflows
- Familiarity with tools like LangFuse and Galileo for prompt evaluation and analysis
- Knowledge of cloud platforms (AWS, GCP, Azure) and data pipeline tools
- Familiarity with machine learning concepts and NLP workflows
- Master’s or PhD in Data Science, Statistics, Computer Science, or a related field
- Health Care Plan (Medical, Dental & Vision)
- Retirement Plan (401k)
- Life Insurance (Basic, Voluntary & AD&D)
- Flexible Paid Time Off
- Family Leave (Maternity, Paternity)
- Short Term & Long Term Disability
- Training & Development
- Work From Home
- Stock Option Plan
Top Skills
AWS
Azure
Galileo
GCP
Langfuse
Ms Sql
MySQL
Numpy
Oracle
Pandas
Postgres
Python
Scikit-Learn
SQL
Similar Jobs
Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
The BSA Compliance Consultant I will review compliance regulations, ensure adherence, analyze transactions, and assist the compliance team for financial clients.
Top Skills:
Anti-Money Laundering RegulationsBank Secrecy LawsCustomer Due DiligenceKnow Your Customer
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
The Advisory Solutions Consultant will support sales teams by understanding customer needs, providing product demonstrations, and participating in the sales process, focusing on Identity Security solutions.
Top Skills:
AWSAzureGCPJavaJSONLdapSQLXML
Machine Learning • Payments • Security • Software • Financial Services
Manage a team of software developers, oversee application projects, ensure quality standards, and foster professional growth.
Top Skills:
Agile DevelopmentGitJIRA
What you need to know about the Boston Tech Scene
Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.
Key Facts About Boston Tech
- Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
- Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
- Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
- Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories