Tiger Analytics

Gen AI Data Engineer

Reposted 17 Days Ago
Remote
Hiring Remotely in United States
Expert/Leader
The Gen AI Data Engineer will design and build distributed data systems, develop data pipelines, manage data infrastructure, and integrate technologies for real-time and batch processing, contributing to scalable analytics solutions.
The summary above was generated by AI

Tiger Analytics is looking for experienced Machine Learning Engineers with Gen AI experience to join our fast-growing advanced analytics consulting firm. Our employees bring deep expertise in Machine Learning, Data Science, and AI. We are the trusted analytics partner for multiple Fortune 500 companies, enabling them to generate business value from data. Our business value and leadership have been recognized by various market research firms, including Forrester and Gartner.

We are looking for top-notch talent as we continue to build the best global analytics consulting team in the world.

Technical Skills Required:

Programming Languages: Proficiency in Python, SQL, and PySpark.

Data Warehousing: Experience with Snowflake, NoSQL, and Neo4j.

Data Pipelines: Proficiency with Apache Airflow.

Cloud Platforms: Familiarity with AWS (S3, RDS, Lambda, AWS Batch, SageMaker Processing jobs, CloudFormation, etc.) or GCP (Vertex AI RAG, Data pipeline, BigQuery, GKE).

Operating Systems: Experience with Linux.

Batch/Realtime Pipelines: Experience in building and deploying various pipelines.

Version Control: Experience with GitHub.

Development Tools: Proficiency with VS Code.

Engineering Practices: Skills in testing, deployment automation, DevOps/SysOps.

Communication: Strong presentation and communication skills.

Collaboration: Experience working with onshore/offshore teams.


Requirements

Desired Skills:

·        Big Data Technologies: Experience with Hadoop and Spark.

·        Data Visualization: Proficiency with Streamlit and dashboards.

·        APIs: Experience in building and maintaining internal APIs.

·        Machine Learning: Basic understanding of ML concepts.

·        Generative AI: Familiarity with generative AI tools and techniques.

Additional Expertise:

·        Knowledge Graphs: Experience with creation and retrieval.

·        Vector Databases: Proficiency in managing vector databases.

·        Data Persistence: Ability to develop and maintain multiple forms of data persistence and retrieval methods (RDBMS, vector databases, buckets, graph databases, knowledge graphs, etc.).

·        Cloud Technologies: Experience with AWS, especially SageMaker, Lambda, OpenSearch.

·        Automation Tools: Experience with Airflow DAGs, AutoSys, and CronJobs.

·        Unstructured Data Management: Experience in managing data in unstructured forms (audio, video, image, text, etc.).

·        CI/CD: Expertise in continuous integration and deployment using Jenkins and GitHub Actions.

·        Infrastructure as Code: Advanced skills in Terraform and CloudFormation.

·        Containerization: Knowledge of Docker and Kubernetes.

·        Monitoring and Optimization: Proven ability to monitor system performance, reliability, and security, and optimize them as needed.

·        Security Best Practices: In-depth understanding of security best practices in cloud environments.

·        Scalability: Experience in designing and managing scalable infrastructure.

·        Disaster Recovery: Knowledge of disaster recovery and business continuity planning.

·        Problem-Solving: Excellent analytical and problem-solving abilities.

·        Adaptability: Ability to stay up-to-date with the latest industry trends and adapt to new technologies and methodologies.

·        Team Collaboration: Proven ability to work well in a team environment and contribute to a positive, collaborative culture.

GenAI Engineer Specific Skills:

·        Industry Experience: 8+ years of experience in data engineering, platform engineering, or related fields, with deep expertise in designing and building distributed data systems and large-scale data warehouses.

·        Data Platforms: Proven track record of architecting data platforms capable of processing petabytes of data and supporting real-time and batch ingestion processes.

·        Data Pipelines: Strong experience in building robust data pipelines for document ingestion, indexing, and retrieval to support scalable RAG solutions. Proficiency in information retrieval systems and vector search technologies (e.g., FAISS, Pinecone, Elasticsearch, Milvus).

·        Graph Algorithms: Experience with graphs/graph algorithms, LLMs, optimization algorithms, relational databases, and diverse data formats.

·        Data Infrastructure: Proficient in infrastructure and architecture for optimal extraction, transformation, and loading of data from various data sources.

·        Data Curation: Hands-on experience in curating and collecting data from a variety of traditional and non-traditional sources.

·        Ontologies: Experience in building ontologies in the knowledge retrieval space, schema-level constructs (including higher-level classes, punning, property inheritance), and openCypher.

·        Integration: Experience in integrating external databases, APIs, and knowledge graphs into RAG systems to improve contextualization and response generation.

·        Experimentation: Experience conducting experiments to evaluate the effectiveness of RAG workflows, analyzing results, and iterating to achieve optimal performance.


Benefits

This position offers an excellent opportunity for significant career development in a fast-growing and challenging entrepreneurial environment with a high degree of individual responsibility.

Top Skills

Apache Airflow
AWS
CloudFormation
Docker
GCP
Git
GitHub Actions
Hadoop
Jenkins
Kubernetes
Linux
Neo4j
NoSQL
PySpark
Python
Snowflake
Spark
SQL
Streamlit
Terraform
VS Code


