Etech Global Services Logo

Etech Global Services

Sr Data Engineer

Posted 26 Days Ago
Remote
Hiring Remotely in USA
Senior level
Remote
Hiring Remotely in USA
Senior level
The Senior Data Engineer will design data pipelines for the AI ecosystem, manage Vector Databases, and ensure data governance, optimizing schemas for AI consumption.
The summary above was generated by AI

This is a remote position.

Senior Data Engineer - AI Context & Knowledge Systems
We are looking for a Data Engineer to build the "memory" and "knowledge" backbone of our Agentic AI ecosystem. You will be responsible for designing data pipelines that feed into our Model Context Protocol (MCP) servers, ensuring that AI agents managed via Gravitee have real-time access to accurate, secure, and contextually relevant enterprise data.
Key Responsibilities
  • Context Engineering: Design and optimize data schemas specifically for LLM consumption, ensuring that data retrieved via MCP servers is structured to minimize token usage and maximize reasoning accuracy.
  • Hybrid Pipeline Development: Build robust data pipelines using Python (for AI/ML workflows) and C#/.NET (for enterprise integration) to move data from legacy systems into AI-ready formats.
  • Vector Database Management: Implement and maintain Vector Databases (e.g., Pinecone, Weaviate, or Milvus) to support Retrieval-Augmented Generation (RAG) alongside live API tool calls.
  • Data Governance for AI: Work with the Gravitee API Gateway to enforce data masking, PII redaction, and fine-grained access control before data reaches an LLM.
  • Metadata Orchestration: Manage the OpenAPI and MCP metadata that allows AI agents to "understand" the data they are querying.
Technical Qualifications
  • Languages: Expert-level Python (Pandas, PySpark, SQLAlchemy) and strong familiarity with C# for interacting with .NET-based data layers.
  • AI Data Stack: Hands-on experience with Vector Databases and embedding models.
  • API Management: Understanding of how data is exposed through Gravitee APIM and secured via MCP-specific authorization flows.
  • Modern Data Stack: Experience with SQL/NoSQL databases, dbt, and cloud data warehouses (Snowflake, BigQuery, or Databricks).
  • Protocol Knowledge: Familiarity with the Model Context Protocol (MCP) and how it standardizes data retrieval for AI agents.
Preferred Skills
  • Experience building Knowledge Graphs to provide relational context to AI agents.
  • Familiarity with semantic caching to reduce LLM costs and improve response times.
  • Knowledge of Gravitee Observability for monitoring data drift in agentic conversations.


Similar Jobs

2 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
186K-222K Annually
Senior level
186K-222K Annually
Senior level
eCommerce • Healthtech • Kids + Family • Retail • Social Media
Design and scale data pipelines and ML/LLM systems, build agentic automation for pipeline generation and maintenance, improve data monitoring, and collaborate with analysts, product, and ML teams to deliver reliable end-to-end data and AI infrastructure for a high-growth e-commerce platform.
Top Skills: AirflowAws Ec2Aws EksAws LambdaAws S3DbtLlmsMcp ServersMl PipelinesPythonRagSnowflake
3 Days Ago
Remote
United States
164K-193K Annually
Senior level
164K-193K Annually
Senior level
Healthtech • Social Impact • Software • Telehealth
Build and maintain scalable, secure data pipelines and platforms to enable AI-driven analytics. Partner with analytics, product, and marketing to translate requirements, deploy production systems, implement data governance and access controls, and support AI applications at scale.
Top Skills: AWSIcebergKafkaLarge Language Models (Llms)Model Context Protocols (Mcps)PythonSemantic LayersSnowflakeSparkSQLTerraform
4 Days Ago
Easy Apply
Remote or Hybrid
Easy Apply
100K-150K Annually
Senior level
100K-150K Annually
Senior level
Fintech • Software • Financial Services
Design, build, and maintain scalable batch and real-time data pipelines and data lake architecture. Improve observability and SLOs, optimize ETL/ELT, develop dbt workflows, support event-driven architectures, integrate financial APIs, and collaborate with analytics and ML teams to deliver reliable, model-ready data products.
Top Skills: AirflowApache IcebergAvroAws AthenaAws GlueAws KinesisAws S3BigQueryCi/CdCloud SqlCloud StorageDbtEmrGCPGitopsParquetPrefectPythonSQLTerraform

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account