Looking for a Software Engineer to join our Customer Data Engineering team. In this role you
will play a key part in developing application/data processes and services for our customer data
to support all our production applications, operation processes, data sciences, and business
intelligent data processing.
Our team works with:
Microservices and distributed applications in Java, Python, Spark
Relational data modeling and batch processing on SQL databases
Distributed batch and streaming frameworks like Spark, Spark Streaming
Streaming data platforms like Kafka/Kinesis
PaaS offerings on AWS such as S3, RDS, EC2, Glue, EMR, Redshift, Athena
Primary Responsibilities
Design, develop and maintain scalable applications and services to manage customer
related data
Design, develop and maintain scalable Java/Python/SQL based ELT/ETL workflows to
transform and integrate data into Data Platform/Data Lake/Data Warehouse
Play multiple roles that span data architecture, design, data
warehousing, DevOps, and ELT/ETL processes
Work closely with product management, business, engineers, cross-functional analysts and
data scientist to uncover new insights from our data
Enhance scalability, performance, and stability of data infrastructure and processes
Technical Requirements and Desired Skills
8+ years of experience with Java with a focus on modern Java development paradigms
(Java8+, Spring Boot, ReactiveX)
5+ years of expertise in Python, Java, Spark with wrangling structured and unstructured data –
Parquet/CSV/JSON, and SQL databases – MySQL/Postgres
3+ years of experience working within an Agile/Scrum and CI/CD environment, using
TDD/BDD patterns (ex. JUnit, TestNG, PyTest)
Strong knowledge of computer science fundamentals: object-oriented design and
programming, data structures, algorithms, databases, networking
Extensive experience working with structured and unstructured data platforms, ELT/ETL
processes and platforms, and data/domain modeling
Experience in the design of new systems or redesign of existing systems to meet
business requirements or changing needs, or newer technology
Bonus points for experience with ML and Stats frameworks, NoSQL databases (Mongo,
Cassandra), or search (Solr, ElasticSearch)
Data Engineer candidates require to complete DE exercise