Experienced data engineer capable of creating and implementing a large-scale, online data
management system for a start-up technology company focused on using both structured and
unstructured data to train predictive models and drive real-time business decisions.
If you enjoy learning new things and are excited by the challenge of building a modern cloud-native
data stack from the ground up without being constrained by legacy technology, this job is for you!
Duties & Responsibilities:
Lead design, development, and optimization of a modern data technology stack (including ELT pipelines, cloud-based data warehouse, and large-scale online processing systems)
Understand all data sources and structures in order to facilitate data transformation and maintenance
Develop set processes for data modeling, mining, and production. Design and implement processes to prepare data models for use in predictive modeling and BI workstreams.
Apply software engineering best practices to the production and maintenance of analytics code.
Ensure data models are properly tested and documented and run reliably in a production environment
Serve as both technical lead and manager for a growing data engineering team. Think strategically about business, product, and technical challenges
Strong commitment to teamwork, sharing knowledge, and constant learning. No one can be expected to know everything about rapidly changing best practices in data technology – let’s learn together!
Bachelor’s degree in computer science, engineering, or similar
Minimum of 5 years of experience. Financial technology or technology startup experience would be ideal
Familiar with agile methodologies (e.g., Scrum, Kanban)
Must know Git; familiarity with Bitbucket is useful
Must be familiar with automated tests to ensure code quality & accuracy
Experience with continuous integration of cloud-based ELT pipelines for ingesting and replicating data. Some familiarity with managed ELT pipeline solutions (e.g., Stitch, Fivetran) is a plus
Practical experience working with an analytics team using BI software (e.g., Looker, Mode, Tableau, or similar) and preparing data models for predictive modeling. Familiarity with, or even curiosity about, building or deploying machine learning models is a plus
3+ years of development experience with large-scale, cloud-based database platforms (e.g., AWS Redshift, BigQuery, Azure, Snowflake)
Strong knowledge of and hands-on expertise with Python, Jenkins, and SQL
Experience working with a scripting language such as Ruby or Go is a plus
Experience with streaming data sources and decision models (e.g., Apache Kafka)
Experience with database architecture including SQL and NoSQL (e.g., Redis, MongoDB, ELK)
Familiarity with data transformation modeling software (e.g., Apache Airflow, dbt) is a plus
Speaks Spanish & English
Job opening: Data Engineer Lead, CodersLink - Gustavo A. Madero, Ciudad de México