• Design and build standalone services to support a variety of reporting needs, using data sources like SQL & Redshift (and eventually Kafka) and design patterns of your choice.
  • Leverage knowledge of dimensional modelling techniques and the ability to come up with a model which helps business users easily understand the data instead of expecting business users to understand how the operational system works.
  • Advise in the performant design, creation, management, and implementation of large datasets.
  • Document and map the existing data structures from the operational system to the new curated data structures.
  • Assess new technologies and assess their practicality for integration into our existing BI and DS pipelines and infrastructure.
  • Build efficient, flexible, extensible, and scalable ETL and reporting solutions.
  • Work with engineering and BI/DS to enable the appropriate capture and storage of key data points.
  • Collaborate with a cross-functional team to Implement processes and systems that automate manual processes, optimize data delivery, and re-design existing infrastructure for greater scalability.
  • Contribute views on approach and architecture. We like to collaborate, review each other’s code, and pair from time to time.

Core Competencies

  • 5+ years data engineering experience.
  • Deep experience with at least one compiled language (Java or Scala preferred) and one scripting language (Python).
  • Advanced working SQL knowledge and experience working with relational databases.
  • Experience with AWS cloud services, especially Redshift, EMR, Glue, Athena, Lambda, Snowflake.
  • Experience with Airflow or other workflow management tools (Dagster, Prefect, etc.) highly preferred.
  • Knowledge of distributed data processing and distributed data stores.
  • Strong analytic skills related to working with unstructured datasets.
  • Excellent business and communication skills; able to work with cross-functional business owners to develop and define key business questions, and to build data sets that answer those questions.
  • Excellent organization and attention to detail with the ability to prioritize multiple concurrent projects while still delivering timely and accurate results

 To be considered for this or any other exciting role please email a copy of your CV to cloud


Apply for this position