The Machine Learning Engineer will develop and deploy large language models focused on preprocessing datasets, training, fine-tuning, and maintaining ML pipelines, primarily in healthcare settings.
Position Summary
Requirements
Benefits
The Machine Learning Engineer will be responsible for the end-to-end development and deployment of Large language and machine learning models, with a primary focus on data preprocessing, model training, and fine-tuning using large-scale healthcare datasets. This role requires a strong understanding of Large language models, machine learning principles, data engineering, and experience working with sensitive healthcare data.
Key Responsibilities- Data Preprocessing: Clean, transform, and prepare large, complex healthcare datasets for machine learning model development. This includes handling missing values, outlier detection, feature engineering, and data normalization. Identify, collect, and curate relevant, industry-specific datasets for model retraining. Format data appropriately for the chosen LLM and training pipeline
- Model Training & Fine-Tuning: Design, train, and fine-tune various LLMs on extensive healthcare data to solve specific clinical or operational problems. Set up and manage the training environment, including GPU instances and required software. Train and fine-tune pre-trained LLMs on the custom dataset to achieve specific goals. Experiment with and fine-tune hyperparameters such as learning rate, batch size, and training epochs to optimize model performance. Integration of structured + unstructured data (multi-modal/multi-input models)
- Model Evaluation & Optimization: Evaluate model performance using appropriate metrics, identify areas for improvement, and implement optimization strategies.
- Pipeline Development: Develop and maintain robust and scalable data and ML pipelines for model training, inference, and deployment.
- Collaboration: Work closely with data scientists, clinicians, and software engineers to understand requirements, integrate models into production systems, and ensure data privacy and security compliance.
- Research & Development: Stay up-to-date with the latest advancements in machine learning and healthcare AI, and explore new technologies and methodologies to enhance our solutions.
- Documentation: Maintain clear and comprehensive documentation of models, data pipelines, and experimental results.
Requirements
- Education: Bachelor's or Master's degree in Computer Science, Machine Learning, Artificial Intelligence, or a related quantitative field.
- Experience:
- 5+ years of experience in Machine Learning Engineering or a similar role.
- Proven experience with large-scale data preprocessing, LLM/model training, and fine-tuning.
- Experience with distributed training (PyTorch Distributed, DeepSpeed, Ray, Hugging Face Accelerate).
- Experience with GPU/TPU optimization, memory management for large language models.
- Experience working with healthcare data is highly desirable.
- Technical Skills:
- Proficiency in Python and relevant ML libraries (e.g., TensorFlow, PyTorch, Scikit-learn, Pandas, NumPy).
- Strong understanding of various machine learning algorithms,Large Language Models, and deep learning architectures.
- Experience with cloud platforms (e.g., GCP, AWS) and distributed computing frameworks (e.g., Spark) is a plus.
- Familiarity with MLOps practices and tools.
- Soft Skills:
- Excellent problem-solving and analytical skills.
- Strong communication and collaboration abilities.
- Ability to work independently and as part of a team in a fast-paced environment.
Benefits
Why Join Us?
Joining C the Signs is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.
Benefits:
- Competitive salary and benefits package.
- Flexible working arrangements (remote or hybrid options available).
- The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
- Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity.
- Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.
Top Skills
AWS
GCP
Numpy
Pandas
Python
PyTorch
Scikit-Learn
Spark
TensorFlow
Similar Jobs
eCommerce • Healthtech • Pet • Retail • Pharmaceutical
As a Machine Learning Engineer II, you'll design, implement, and test machine learning systems, collaborating with various stakeholders to enhance predictive applications.
Top Skills:
AWSDatabricksPythonPyTorchTensorFlow
Cloud • Enterprise Web • Software • Infrastructure as a Service (IaaS)
The Applied Machine Learning Engineer will design and deploy generative AI applications, optimize RAG architectures, and collaborate on AI prototypes to drive business value.
Top Skills:
Hugging FaceLangchainLlamaindexPythonPyTorchTensorFlow
Artificial Intelligence • Machine Learning • Robotics • Software • Transportation • Design • Manufacturing
Develop and enhance behavioral models for road users in autonomous vehicles, optimizing perception models to support safe driving decisions.
Top Skills:
Deep LearningMachine LearningPython
What you need to know about the Boston Tech Scene
Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.
Key Facts About Boston Tech
- Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
- Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
- Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
- Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories



