Yahoo

Sr. Data Engineer

Reposted 18 Days Ago
United States of America
128K-267K Annually
Entry level
The Data Engineer will develop and improve data infrastructures and pipelines for machine learning and analytics. Responsibilities include designing systems for efficient data processing, collaborating with teams to implement algorithms, and troubleshooting data issues, while managing large volumes of data at petabyte scale.
The summary above was generated by AI

Yahoo Mail is the ultimate consumer inbox with hundreds of millions of users. It’s the best way to access your email and stay organized from a computer, phone or tablet. With its beautiful design and lightning fast speed, Yahoo Mail makes reading, organizing, and sending emails easier than ever.

A Little About Us

Yahoo makes the world’s daily habits inspiring and entertaining. By creating highly personalized experiences for our users, we keep people connected to what matters most to them, across devices and around the world. Yahoo’s vast businesses span across Search, Communications, Media, and many other verticals.

 

Yahoo generates terabytes of data every day, and it is critical to collect, manage and process that data at petabyte scale to provide timely and accurate insights to executives, sales, product managers and product developers on all aspects of user interaction.

The Mail Analytics Engineering team at Yahoo is responsible for building mission-critical data systems, pipelines, warehouses, analytics systems, and Machine Learning/AI/data mining programs for the Communications business, which includes Yahoo Mail, with 200M monthly active users. We are constantly pushing the envelope of data platforms due to the insane amount of data we need to harness.

A Lot About You

As part of the Mail Analytics Engineering team, you will be working on data engineering infrastructures, pipelines and next generation Machine Learning- and AI-based data infrastructure, supporting new functionalities on existing platforms, and mining data for analytics insights and product features. 

Our Big Data footprint is among the largest in the world, at double-digit petabyte scale. Developing this infrastructure presents many technical challenges in the areas of efficient query processing, large-scale stream processing, machine learning and modeling, as well as satisfying complex business rules.

If you are someone who is passionate about harnessing data at insane scale, enjoys working with new technologies, setting up petabyte data infrastructures and implementing new machine learning solutions and metrics systems, we want to hear from you!

Your Day

  • Develop new or improve existing data infrastructures for data processing, machine learning, and deep learning using your core expertise

  • Work with other engineers to implement algorithms and systems in an efficient way

  • Take end-to-end ownership of Machine Learning-based distributed data systems, from data and training pipelines to real-time data serving engines

  • Develop complex queries, very large volume data pipelines, and analytics applications

  • Develop complex queries and software programs to solve analytics and data mining problems

  • Interact with data analysts, data scientists, product managers, and software engineers to understand business problems and technical requirements and to deliver data solutions

  • Prototype new metrics or data systems

  • Lead data investigations to troubleshoot data issues that arise along the data pipelines

  • Maintain and improve released systems

  • Provide engineering consulting on large and complex warehouse data

You Must Have

  • BS/MS/PhD in Computer Science/Electrical Engineering, or related engineering disciplines, ideally with specialization in Data Engineering or Machine Learning

  • 6+ years of hands-on experience in relevant fields, including data engineering

  • Strong fundamentals: algorithms, distributed computing, data structures, databases

  • Fluency with: Python/Java/SQL

  • Self-driven, challenge-loving, and detail-oriented, with team spirit, excellent communication skills, and the ability to multitask and manage expectations

Preferred

  • Experience in Hadoop technologies (MapReduce, Pig, Hive, HBase, Storm, Spark, Kafka, Oozie)

  • Experience with Google Cloud Platform (BigQuery, Dataproc, Dataflow, etc.) a big plus

  • Experience with machine learning algorithms, NLP, and/or statistical methods a big plus

  • Experience in any of: machine learning, analytics, data mining, or data mart and warehouse

  • Experience with Deep Learning platforms (TensorFlow/Keras/Spark MLlib) and SQL/Unix/Shell

The material job duties and responsibilities of this role include those listed above as well as adhering to Yahoo policies; exercising sound judgment; working effectively, safely and inclusively with others; exhibiting trustworthiness and meeting expectations; and safeguarding business operations and brand integrity.

At Yahoo, we offer flexible hybrid work options that our employees love! While most roles don’t require regular office attendance, you may occasionally be asked to attend in-person events or team sessions. You’ll always get notice to make arrangements. Your recruiter will let you know if a specific job requires regular attendance at a Yahoo office or facility. If you have any questions about how this applies to the role, just ask the recruiter!

Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo will consider for employment qualified applicants with criminal histories in a manner consistent with applicable law. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form (www.yahooinc.com/careers/contact-us.html) or call +1.866.772.3182. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.

We believe that a diverse and inclusive workplace strengthens Yahoo and deepens our relationships. When you support everyone to be their best selves, they spark discovery, innovation and creativity. Among other efforts, our 11 employee resource groups (ERGs) enhance a culture of belonging with programs, events and fellowship that help educate, support and create a workplace where all feel welcome.

The compensation for this position ranges from $128,250.00 to $266,875.00/yr and will vary depending on factors such as your location, skills and experience. The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions. Our comprehensive benefits include healthcare, a great 401k, backup childcare, education stipends and much (much) more.

Currently work for Yahoo? Please apply on our internal career site.
