Optimize and compress large language and vision models for on-device inference. Build distillation and hardware-specific compilation pipelines, benchmark performance across NPU/GPU architectures, and deploy models to edge environments.
We are looking for an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. Your expertise will be crucial in developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.
Responsibilities:
- Compress and optimize large language and vision models for on-device inference.
- Develop pipelines for model distillation and hardware-specific compilation.
- Benchmark performance across various NPU/GPU architectures.
Qualifications:
- Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
- Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
- Strong C++ and Python skills.
Similar Jobs
Agency • Artificial Intelligence • Blockchain • Web3
Design and run adversarial tests on language and multimodal models, build guardrails and real-time filters for autonomous tools, and develop constitutional AI principles and RLHF alignment pipelines to ensure safe AI deployment.
Top Skills:
Adversarial MlAutomated Red-Teaming FrameworksGuardrailsJailbreak TaxonomiesLlmsMultimodal AgentsPrompt EngineeringReal-Time FilteringRlhf
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
As a Manager in Oracle HCM, you'll help clients optimize HR processes by implementing Oracle solutions, leading teams, and ensuring project success through effective problem-solving and innovation.
Top Skills:
Cc&BEbsFusionHyperionOracle ApplicationsOracle Hcm CloudPeoplesoftRiceSiebel
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead and deliver specialized tax strategies focused on R&D tax credits, manage client engagements from planning to completion, analyze complex tax regulations, mentor junior staff, uphold professional standards, embrace technology to improve delivery, and build strong client relationships to identify tax optimization opportunities.
What you need to know about the Boston Tech Scene
Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.
Key Facts About Boston Tech
- Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
- Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
- Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
- Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

