19 Apr
Senior Data Engineer
California, Lajolla , 92037 Lajolla USA

Vacancy expired!

In partnership with ExxonMobil, Synthetic Genomics, Inc. (SGI) is growing algae biofuels to one day power planes, propel ships and fuel trucks - ultimately offering the potential to cut emissions in half. SGI's research spans from developing genetically engineered algae strains to cultivating acres of energy-rich algae at our state-of-the-art farm in California's Imperial Valley. At the center of this research is SGI's Integrated Data Platform, which is responsible for the automated collection and analysis of IoT sensor data and sophisticated laboratory measurements. SGI's Integrated Data Platform provides a common operating picture that fosters cross team collaborations and provides an increased understanding of the factors driving performance variation across the scales from lab to farm. To improve automation and reduce time to actionable insights, SGI is looking for a Senior Data Engineer to join its Integrated Data Platform team. We are looking for creative problem solvers with both a passion for innovation and a focus on delivering technical solutions. As a Senior Data Engineer, your work will improve the quality, reliability, accuracy and consistency of our research data. You will also work with the team to design, build and deploy data science and analytic solutions at scale.Responsibilities

  • Build project-specific data pipelines (ETL processes) and validation tools using Python, SQL and AWS cloud technologies
  • Partner with members of the data scientists and laboratory scientists to define requirements for the Integrated Data Platform
  • Implement data models, database schemas, data structures and processing logic to support automated insights
  • Use best practices for code development, optimization and unit testing
  • Collaborate with data scientists to define SLAs for data availability, quality, usability and correctness.
  • Develop and maintain automated data availability, quality monitoring, and alerting for the Integrated Data Platform
  • Manage concurrent requests from multiple research teams and strategically, prioritize when necessary
Qualifications
  • BS in Computer Science, Mathematics or similar field with a MS preferred
  • 7+ years of data engineering or software engineering experience
  • 3+ years of experience building and operating scalable data pipelines
  • 3+ years of hands-on experience developing solutions with a MPP data warehouse (e.g., Redshift, Teradata, Vertica MPP)
  • 1+ year working with AWS Ecosystem
Key Skills
  • Programming Languages: proficient using Python with a willingness and ability to learn other languages
  • Technical expertise with relational and non-relational databases
  • Enjoys working with all aspects of data: analyzing, organizing, improving quality and efficient delivery
  • Experience building data pipelines sourced from both Web APIs and Web-Service APIs
  • Effectively communicate and collaborate with business and scientific leads from other organizations
  • Ability to work with ambiguous requirements and be comfortable exploring new technology and making your own tools when standard approaches don't meet requirements
  • Passionate about delivering high quality, data solutions to further scientific research within an algal biofuels program

Vacancy expired!


Report job