Accountabilities:
- Design, implement, and maintain scalable, high-performance data architectures connecting relational and non-relational systems.
- Manage end-to-end data pipelines, ensuring seamless ingestion from scrapers to AI/ML workflows.
- Audit and optimize existing workflows for efficiency, accuracy, and flexibility.
- Provide operational support for production systems, ensuring high reliability of scrapers and ML pipelines.
- Collaborate with cross-functional teams to translate business needs into technical solutions and maintain clear documentation.
- Apply best practices for data modeling, storage, and retrieval across relational and NoSQL databases.
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
- 5+ years of experience in data engineering, with a proven track record managing large-scale pipelines.
- Strong proficiency in Python with 5+ years of professional experience.
- Advanced skills in data modeling, schema design, and modern data warehousing concepts.
- Expertise with relational databases (MySQL, PostgreSQL) and NoSQL systems (MongoDB).
- Hands-on experience with cloud infrastructure (AWS) and big data frameworks (Spark, Kafka, Hadoop).
- Fluent in English and Spanish, with strong communication skills to articulate technical solutions to diverse stakeholders.
- Preferred: experience with MLOps, containerization (Docker), orchestration (Kubernetes), CI/CD pipelines, and data privacy/security best practices.
- Fully remote work from anywhere in the world, with flexibility to manage your own schedule.
- Opportunity to work on cutting-edge data and AI technologies with global impact.
- Professional growth through challenging projects and exposure to innovative workflows.
- Collaboration with a diverse, international team of passionate professionals.
- Supportive work culture that values autonomy, accountability, and technical excellence.