SENIOR DATA ENGINEER, ANALYTICS SYSTEMS
PENNSYLVANIA
Responsibilities:
- Design and build standalone services to support a variety of reporting needs, using data sources like SQL & Redshift (and eventually Kafka) and design patterns of your choice.
- Leverage knowledge of dimensional modelling techniques and the ability to come up with a model which helps business users easily understand the data instead of expecting business users to understand how the operational system works.
- Advise in the performant design, creation, management, and implementation of large datasets.
- Document and map the existing data structures from the operational system to the new curated data structures.
- Assess new technologies and assess their practicality for integration into our existing BI and DS pipelines and infrastructure.
- Build efficient, flexible, extensible, and scalable ETL and reporting solutions.
- Work with engineering and BI/DS to enable the appropriate capture and storage of key data points.
- Collaborate with a cross-functional team to Implement processes and systems that automate manual processes, optimize data delivery, and re-design existing infrastructure for greater scalability.
- Contribute views on approach and architecture. We like to collaborate, review each other’s code, and pair from time to time.
Core Competencies
- 5+ years data engineering experience.
- Deep experience with at least one compiled language (Java or Scala preferred) and one scripting language (Python).
- Advanced working SQL knowledge and experience working with relational databases.
- Experience with AWS cloud services, especially Redshift, EMR, Glue, Athena, Lambda, Snowflake.
- Experience with Airflow or other workflow management tools (Dagster, Prefect, etc.) highly preferred.
- Knowledge of distributed data processing and distributed data stores.
- Strong analytic skills related to working with unstructured datasets.
- Excellent business and communication skills; able to work with cross-functional business owners to develop and define key business questions, and to build data sets that answer those questions.
- Excellent organization and attention to detail with the ability to prioritize multiple concurrent projects while still delivering timely and accurate results
Job Location: PENNSYLVANIA