Spring Labs is redefining how data is exchanged for the new age of data sharing, security, and consumer privacy through decentralization. Our Spring Protocol Tech Stack, which includes the use of Blockchain and Cryptography, allows institutions to share information among themselves to verify identities and reduce fraud – all while protecting consumer data.
Working at Spring Labs is about being part of a collaborative team composed of some of the most talented people in the industry. You would be welcomed into a fun, inclusive environment where we care as much about our employees as we do about our product.
At Spring Labs, the systems that power our data science products are currently built and maintained by our software and infrastructure teams. As the scale and traction of our data science projects grow, we are hiring our first Data Engineer to take ownership of these systems. This role will be responsible for the design, deployment, and maintenance of our data pipelines. You will work closely with the Data Science, Software Engineering, and Infrastructure teams to identify the best solutions for their needs and to ensure our data collection, transformation, and storage efforts run smoothly and at scale.
Spring Labs has an in-office culture (with partial remote work during Covid-19) that fosters a highly creative and collaborative work environment.
If you are motivated by solving real-world problems with an extremely talented, accomplished team and want to learn from the top engineers in the area, we want to hear from you.
What you’ll do
- Configure and manage the internal AWS services needed to operate our pipelines: Lambda, EKS, EMR, Elasticsearch, RDS, S3, etc.
- Architect and implement the infrastructure required to handle ETL processes
- Build tools to automate data cleaning and validation
- Perform root cause analysis for data quality issues
- Investigate and resolve technical issues
- Design procedures for system troubleshooting and maintenance
About you
- Degree in CS, Engineering, or Mathematics preferred
- Basic understanding of modern machine learning techniques
- 3+ years of experience building and operating production data pipelines
- 5+ years of experience in configuration and operations of AWS infrastructure
- Experience with common data pipeline components: Apache Spark, Kafka, EMR, relational and non-relational databases
- Driven to optimize the processing of large data sets
- Motivated by a fast-paced, proactive team environment
- Thrives in a highly collaborative environment
- Values and produces high-quality documentation
- Enjoys learning and teaching others
Benefits
- Casual Work Environment
- Fully Stocked Kitchen
- Free Gym
- Weekly Office Events
- Flexible PTO
- 401(k)
Equal Opportunity Statement:
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.