Openings at Glassbox


Data Engineer

Full-time · Senior

About The Position

Glassbox is looking for a Data Engineer to join our R&D team.

Glassbox is a world leader in recording analytics. This is an opportunity to work with bleeding-edge technology and top-tier customers while solving problems at an incredible scale.

When was the last time you were excited to go to work?

What you'll be doing

As our Data Engineer, you will have the exciting opportunity to handle exceptional volumes of customer data (hundreds of billions of events per month!) and help us run cutting-edge Machine Learning technologies at large scale for some of the largest organizations in the world.

You will work closely with several R&D teams within Glassbox (Product, Software and Data Science) and take a pivotal role in defining our next-generation data pipeline and ML platform infrastructure.


  • Utilize a combination of cloud-based and open-source frameworks to solve our most complex data problems
  • Define, design and implement multiple data pipelines and ETL processes (normalization, aggregation, transformation, data movement, etc.)
  • Work closely with multiple R&D teams to understand their needs, build a best-in-class data architecture that solves their problems, and shape our long-term data strategy
  • Develop processes and tools to monitor, analyze, maintain and improve data pipeline operation, performance and usability


Requirements

  • Enjoy building and owning solutions end to end, from proof of concept through to production of highly scalable, enterprise-class systems
  • Expert skills in Python and SQL, plus the ability to read and understand other languages (e.g. Java, JavaScript) - a must
  • Proven experience in building, optimizing and maintaining big data pipelines using popular open-source frameworks (Kafka, Spark, Hive, Presto, Airflow, etc.)
  • Deep understanding of and experience with data storage in RDBMS, NoSQL databases, data lakes and data warehouses (e.g. S3, Redshift, ClickHouse, Postgres, Elasticsearch, Cassandra, etc.)
  • Hands-on experience with public cloud platforms and services (e.g. AWS S3, RDS/Aurora, EMR, Redshift, Step Functions, Athena, etc.)
  • Knowledge of data pipeline optimization for execution time, complexity and compute cost
  • Independent self-starter and team player with excellent communication and interpersonal skills
  • Degree or equivalent experience in Computer Science or a similar engineering field
  • Fluent written and spoken English

An advantage

  • Big advantage: prior experience with and understanding of digital customer experience (CX) or a similar business domain
  • Thrives in and enjoys working in startups and fast-paced environments
