EverAI

Mid/Senior LLM Inference Engineer (Remote - Worldwide)

Reposted Yesterday

In-Office or Remote

8 Locations

Mid level

In-Office or Remote

8 Locations

Mid level

As an LLM Engineer, you will optimize large language models for user interactions, oversee algorithm development, manage datasets, and enhance product features.

The summary above was generated by AI

Our Vision & Products

🚀 EverAI — Building the Future of AI Companionship

One of the Top 15 Largest & Fastest-Growing AI Companies in the World

30+ Million Users in under 2 years — Help Us Reach 100M first, 500M next

At EverAI, we’re shaping what it means to connect with AI. With 30+ million users and counting, we're not just building products — we're creating entirely new categories.

Our flagship product is the world's largest AI girlfriend/boyfriend platform, redefining relationships for millions. And we’re only just getting started.

Up next? We’re scaling our second product to revolutionize the creator economy. Think best-in-class AI content engines for video and image generation — designed to put world-class tools in every creator’s pocket.

All of this is governed by our proprietary moderation system, EverGuard — an internal AI designed to ensure everything we build is safe, ethical, and human-first.

Our Team

We are an enthusiastic, passionate and hardworking team of 70 people. Our founding team has strong entrepreneurial experience building and scaling web products from 0 to IPO.

Alexis Soulopoulos [CEO]

• 10+ years in Tech Executive Leadership

• Co-Founder Mad Paws Holdings (from 0 to IPO)

• Forbes 30 under 30 + Deloitte TechFast50 ’22 & ‘23

Michael Monin [Co-founder & CTO]

• 10+ years as CTO / COO (web2/web3), 1+ year in AI/LLM

• Serial-entrepreneur: MTK Digital (exited / 0->$20m revenue) and Zipchat (AI Chatbot for E-commerce brands)

Thomas Lacroix [Co-founder & CMO]

• 8+ years in Customer Acquisition & E-commerce Growth

• Serial-entrepreneur: Curatible (sold to Blackstone) and MTK Digital (exited / 0->$20m revenue)

Maruša Fasano [CFO/Legal]

• 25+ years in Finance, Strategy, M&A

• Ex-CFO/M&A @Curatible (exited to Blackstone)

• Ex-President of the Board @SotremoSA (exited)

• Co-founder/CFO @SoftOne (exited)

Your Role

🚀 Architect the Future of AI Relationships

As our LLM Engineer, you'll fine-tune and optimize large language models that power conversations for over 30 million users, processing more than 5 million messages daily. You'll be at the forefront of developing AI companionship technology that scales globally while maintaining personalized and meaningful interactions.

Key Responsibilities

Interact with stakeholders (Co-founders, Web Engineers, DevOps Engineers) to bring your project to life.
Oversee the creation and optimization of algorithms for LLM behavior adjustments based on user interactions, focusing on fine-tuning and prompt engineering.
Develop features to improve the richness of the product (multi-character chats, gamification, etc)
In addition to chat, interacting with modalities managed by other team members (audio, image, video), and collaborating with them
Adaptation and fine-tuning of base models for multilingual support
Manage the creation and maintenance of diverse datasets critical for training and improving the performance of LLMs.
Assess and determine the best technological approaches, selecting between classifiers, fine-tuning, and other methods based on the specific project's needs.

Your Qualifications

Must-Haves

Python Mastery: 5+ years building production‑grade, modular, maintainable codebases
LLM Architecture Expertise: Deep understanding of transformers and their training dynamics (attention, positional encodings, samplers, tokenizers, post-training, reasoning LM)
Inference Optimization at Scale: Expert with vLLM / TensorRT‑LLM (or similar); proven record of reducing latency and memory via quantization and/or distillation
Distributed Training: Hands‑on multi‑GPU / multi‑node fine‑tuning using FSDP, DeepSpeed, or accelerate; comfortable with mixed‑precision, gradient checkpointing, and memory‑aware scheduling
Performance Profiling & Optimization: Skilled at identifying and resolving compute or memory bottlenecks across CPU/GPU pipelines with industry‑standard profiling workflows

Nice‑to‑Haves

Concurrency & Runtime Engineering: Strong with asyncio, multiprocessing, or equivalent backend/batch‑scheduling patterns
Low‑level Systems: Practical CUDA / Triton experience; able to write or debug custom kernels
Open‑Source Impact: Contributor to core LLM tooling (vLLM, HF Transformers, Triton, etc.)
Real‑time Deployments: Built or maintained latency‑critical, multi‑user LLM services (RAG, streaming, agents, chatbots)
Specialized Generation Use Cases: Exposure to erotic role playing, multi‑turn instruction tuning, or non‑English quality alignment

Soft Skills

🗣 Strong communication & collaborative skills (perfectly fluent in English)

🎯 Goal-oriented, ownership and commitment

⚡️ Doer mindset - we are moving fast and we need people who can find the right balance between executing, planning and strategy

🧢 Humble - willing to learn, open to feedback

🍭 #NSFW - you are comfortable building products that are based on uncensored models and content

Why EverAI?

📈 Exponential Growth: From 30M+ users in 18 months, to 100M next — and 500M beyond

🚀 Track Record of Category-Creating Innovation: We consistently launch world-first AI applications — setting the pace, not following it

🌍 Global Impact: Top-tier user growth, real-world adoption, and cultural relevance

🧠 Proven Leadership: A senior team that’s launched, scaled, and exited & IPO’d multiple scale ups — now fully focused on reshaping AI companionship

👥 Elite Remote Team: 100% remote and built to win — world-class talent from Tier 1 tech companies, with a culture of ownership, velocity, and radical creativity

🛡️ Ethical Core: Our AI ecosystem is governed by EverGuard, our proprietary AI moderation technology, ensuring responsible development at scale

What We Offer

✍️ We prefer a B2B contract but we can be flexible, as long as you’re in it for the long haul

📍 Full-remote (you work from the place that suits you best)

🏝️ 4 weeks PTO

👨‍👩‍👧‍👦 Annual gathering to get to know each other better

❤️‍🩹 Health & Wellness support: Up to $200 for your wellbeing expenses + Access to unlimited 1:1 sessions with psychologists and lifestyle experts through OpenUp (also available to up to three of your family members)

📚 Learning budget

💻 Company laptop

⚡️ GPT-4, Mistral and Hugging Face Pro plan

🎯 Top Tier Talent Is Our Multiplier

We’re a fully remote group of A-players from Tier 1 tech, led by an exec team who’ve launched, scaled, and exited multiple companies. We move fast, and care deeply about what we build — and who we build it with.

We’re looking for exceptional talent ready to ship & distribute world-first AI products at scale, fast, and co-create with us this category-defining business.

If that’s you — reach out and apply!

Top Skills

Cuda

Deepspeed

Fsdp

Python

Tensorrt-Llm

Triton

Similar Jobs

Airwallex

Account Executive

3 Hours Ago

In-Office or Remote

Mid level

Artificial Intelligence • Fintech • Payments • Financial Services • Generative AI

This role involves establishing relationships with C-level executives, managing outbound sales processes, and negotiating contracts to drive business growth for SMEs.

Top Skills: Google SuiteLinkedin Sales NavigatorOutreachSalesforceZoominfo

WeLocalize

Linguistic Quality Control Specialist

3 Hours Ago

In-Office or Remote

Junior

Machine Learning • Natural Language Processing

The role involves performing quality control checks on translated documents, ensuring adherence to guidelines, using CAT tools, and collaborating with project management.

Top Skills: MemoqMS OfficePhraseTradosXtm

Motive

Sales Engineer

3 Hours Ago

Easy Apply

Remote

Canada

Easy Apply

138K-200K Annually

Mid level

138K-200K Annually

Mid level

Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation

As a Sales Engineer for Mid Market, you will partner with Account Executives to drive revenue growth through product demos, RFIs, and POCs, while becoming an expert on the product and competition.

Top Skills: APIsSaaS

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories