Hire
Data Systems Scala Developers
Built through years working with teams and engineers in the Scala ecosystem.
We specialize in functional backend systems, data platforms, and distributed infrastructure.
Focused, relevant introductions from a curated network.
Available Developers
Data Engineer
Engineered large-scale data processing pipelines using Apache Spark and Scala, significantly enhancing data throughput. Developed real-time data ingestion systems with Apache Kafka, ensuring high availability and fault tolerance. Expert in streaming analytics, applying Flink for dynamic data transformations in IoT applications.
- Integrated Scala Play for responsive web data visualization
- Utilized Cats Effect for functional programming concurrency control
- Designed a custom schema registry for Kafka message validation
- Implemented end-to-end data encryption in distributed environments
JVM Developer / Data Engineer
Designed and optimized a high-throughput real-time data processing pipeline using Kafka and Spark, achieving sub-second latency. Developed microservices architecture for a large-scale distributed system leveraging Akka and Spring frameworks.
- Implemented event-driven architectures with Apache Kafka
- Built scalable data pipelines with Apache Spark
- Optimized distributed system performance for large-scale deployments
Senior Data Engineer
- Led migration of monolithic system to microservices architecture
- Built high-throughput data pipeline processing 1M+ events per second
- Designed and implemented real-time monitoring and alerting platform
Senior Data Engineer
Architected and implemented end-to-end ETL pipelines in Scala and Python, handling petabyte-scale datasets in cloud environments. Spearheaded the development of a real-time analytics platform for IoT data, enhancing data throughput and processing speed.
- Led migration of legacy systems to modern cloud infrastructures
- Developed custom data serialization protocols in Go
- Automated data workflows using advanced Bash scripting techniques
Senior Engineer / Technical Lead
Led the design and implementation of a distributed event streaming platform using Akka and Kafka. Spearheaded big data processing initiatives with Spark on AWS for analytics.
- Developed large-scale data pipelines with Apache Spark
- Managed team of engineers for complex system integrations
- Optimized server-side rendering with Play Framework
Data Engineer / Contractor
Specialized in building data pipelines using Scala and Java for large-scale ETL processes. Designed graph database solutions using Cypher and CQL to improve data retrieval efficiency.
- Automated data processing workflows with Groovy
- Enhanced data integration using Python scripts
- Implemented graph algorithms for advanced data insights
Staff Data Engineer
Architected large-scale data pipelines using Apache Spark and Scala, optimizing for real-time analytics in the e-commerce domain. Spearheaded the migration of data infrastructure to Databricks, enhancing processing speed and reliability. Developed a custom data validation framework in Python, ensuring data integrity across distributed systems.
- Implemented Rust-based data transformation tools for performance gains
- Designed SQL-based ETL workflows for complex data models
- Automated data quality checks using custom-built Python scripts
Solution Architect
Architected enterprise-level solutions utilizing REST and SOAP services for seamless integration across legacy and modern systems. Designed and implemented cloud-native applications with an emphasis on scalability and maintainability. Pioneered the adoption of microservices in large-scale e-commerce platforms, enhancing modularity and system resilience.
- Migrated monolithic applications to microservices architecture
- Integrated cross-platform APIs for heterogeneous environments
- Developed custom middleware solutions for data synchronization
Senior Data Engineer
Designed and implemented robust data pipelines using PySpark and Scala for large-scale analytics. Developed and optimized Lakehouse and Medallion architectures to streamline data processing and storage. Spearheaded the migration of legacy data systems to cloud-based solutions, improving performance and scalability.
- Built real-time data ingestion frameworks with Python and SQL
- Engineered automated ETL processes for high-volume data environments
- Enhanced data governance through advanced schema management techniques
Software Developer
Developed large-scale data processing systems using Scala and Python, optimizing real-time data ingestion pipelines for financial analytics. Engineered robust database solutions, focusing on high availability and fault tolerance for transactional systems.
- Implemented distributed data storage solutions for big data projects
- Optimized SQL queries for performance in high-volume environments
- Designed scalable architectures for data-intensive applications
Lead Backend Engineer
Designed backend systems using Akka HTTP and Play Framework for high-load applications. Developed GraphQL interfaces to streamline data access across distributed services.
- Implemented service mesh architecture for microservices
- Enhanced data serialization techniques for better efficiency
- Led backend modernization efforts with Scala and Java
Senior Software-Data Engineer
Developed a scalable recommendation engine using Cats and Scala, improving user engagement metrics. Built a high-performance web application with Play framework, handling thousands of concurrent users. Implemented data transformation workflows in Python for automated ETL processes.
- Utilized ZIO for managing complex asynchronous workflows
- Enhanced API performance with Spring Boot optimizations
- Migrated legacy systems to a microservices architecture
Founding Database Engineer
Pioneered database internals and distributed systems design. Advanced knowledge in operations and systems design using AWS.
- Pioneered database internals
- Expert in distributed systems design
- Advanced AWS operations knowledge
Senior Data Architect
Led the design of data lake architectures using Delta Lake and Databricks to support high-volume, real-time analytics. Developed robust ETL pipelines with Spark and PySpark to ensure data quality and consistency across distributed systems. Migrated legacy data warehouses to Snowflake, optimizing query performance and storage costs.
- Architected scalable data models for predictive analytics
- Integrated machine learning workflows with Databricks
- Automated data lineage tracking with custom DLT solutions
- Enhanced data governance with role-based access controls
Senior Software Engineer
Designed and implemented a real-time data ingestion pipeline using Apache Pulsar for a healthcare analytics platform. Leveraged ZIO in Scala for building highly concurrent, fault-tolerant backend services. Developed a serverless architecture on S3 for scalable storage solutions.
- Built a reactive web application using Elm and JavaScript
- Integrated third-party APIs for enhanced data processing capabilities
- Optimized data serialization formats to reduce network latency