What do we do?
We are a community of millions of educators who come together to share their work, their insights, and their inspiration with one another. We are the world’s first and largest open marketplace where teachers share, sell, and buy original educational resources with over 2/3 of US teachers using the site.
The New York Times called the phenomenon we lead “A Sharing Economy Where Teachers Win”.
Our data pipeline is currently owned by our Search and Personalization team. As our data engineering needs grow in scope, we would like to build a dedicated data engineering team. Members of this team will be able to focus on improving our data pipeline and helping our product teams build data-backed features for our users.
Our data stack is built in Python with Airflow and PySpark. We pull our data from Google Analytics, BigQuery and MySQL databases and use the results for recommendation and search products,Teacher-Author dashboards and business intelligence.
There are many areas where you can contribute as a Data Engineer, from performance and reliability to strategy and design.
As we process ever-increasing amounts of data, we will encounter performance problems and you will need to help optimize how we use our tools.
Data changes. Code changes. Sometimes these changes cause failures. You will improve how we test and monitor our pipeline so we find out sooner when something doesn’t work.
The impact of our data pipeline is dependent on how we apply it. You will collaborate with data scientists and product engineers on how to best use the pipeline and identify new enhancements to the system.
In the long term, you will have a hand in developing our strategy for data engineering and be a critical part in helping to realize it with incremental steps and progressive improvement.
1 year of professional experience building and operating data pipeline
To see more Data Engineer Jobs click here