Hire Data Systems Scala Developers
Built through years of working with teams and engineers in the Scala ecosystem.
We specialize in functional backend systems, data platforms, and distributed infrastructure.
Focused, relevant introductions from a curated network.
Available Developers
Big Data Solutions Architect / Tech Lead
Designed and implemented a petabyte-scale data processing platform using distributed systems technologies. Spearheaded the development of a real-time streaming analytics solution for IoT data ingestion.
- Architected a multi-tenant data platform
- Led a team in building a batch processing system
- Implemented a fault-tolerant message queue
Data Engineer
Engineered large-scale data processing pipelines using Apache Spark and Scala, significantly enhancing data throughput. Developed real-time data ingestion systems with Apache Kafka, ensuring high availability and fault tolerance. Expert in streaming analytics, applying Flink for dynamic data transformations in IoT applications.
- Integrated the Play Framework (Scala) for responsive web data visualization
- Used Cats Effect for functional concurrency control
- Designed a custom schema registry for Kafka message validation
- Implemented end-to-end data encryption in distributed environments
Senior Data Engineer
- Led migration of monolithic system to microservices architecture
- Built high-throughput data pipeline processing 1M+ events per second
- Designed and implemented real-time monitoring and alerting platform
Architect / Lead Data Engineer
Designed and optimized large-scale data lakes and real-time streaming architectures using Apache Spark and Flink. Led migration of on-premise data warehouses to AWS cloud infrastructure, improving scalability and reducing latency. Specialized in developing ETL pipelines for financial analytics and risk management systems.
- Developed proprietary data ingestion framework for high-throughput processing
- Integrated machine learning models into data pipelines for predictive analytics
- Architected multi-region data replication solutions for disaster recovery
Senior Data Engineer
Developed real-time data processing systems utilizing Apache Kafka and Apache Flink to handle streaming data at scale. Designed and optimized ETL pipelines for large-scale data ingestion and transformation using Apache Spark. Implemented HBase solutions for high-throughput, low-latency data storage in big data environments.
- Architected distributed systems for high-volume data processing
- Built scalable data lakes with fault-tolerant architectures
- Optimized data transformation pipelines for performance and reliability
JVM Developer / Data Engineer
Designed and optimized a high-throughput real-time data processing pipeline using Kafka and Spark, achieving sub-second latency. Developed microservices architecture for a large-scale distributed system leveraging Akka and Spring frameworks.
- Implemented event-driven architectures with Apache Kafka
- Built scalable data pipelines with Apache Spark
- Optimized distributed system performance for large-scale deployments
Senior Data Engineer
Architected and implemented end-to-end ETL pipelines in Scala and Python, handling petabyte-scale datasets in cloud environments. Spearheaded the development of a real-time analytics platform for IoT data, enhancing data throughput and processing speed.
- Led migration of legacy systems to modern cloud infrastructures
- Developed custom data serialization protocols in Go
- Automated data workflows using advanced Bash scripting techniques
Solution Architect
Architected enterprise-level solutions utilizing REST and SOAP services for seamless integration across legacy and modern systems. Designed and implemented cloud-native applications with an emphasis on scalability and maintainability. Pioneered the adoption of microservices in large-scale e-commerce platforms, enhancing modularity and system resilience.
- Migrated monolithic applications to microservices architecture
- Integrated cross-platform APIs for heterogeneous environments
- Developed custom middleware solutions for data synchronization
Data Engineer / Contractor
Specialized in building data pipelines using Scala and Java for large-scale ETL processes. Designed graph database solutions using Cypher and CQL to improve data retrieval efficiency.
- Automated data processing workflows with Groovy
- Enhanced data integration using Python scripts
- Implemented graph algorithms for advanced data insights
Quant Dev
Developed quantitative trading algorithms with a focus on low-latency execution and risk management. Built containerized microservices for financial data analysis using Rust and Python, optimizing for performance and scalability. Managed deployment and orchestration of applications on AWS using Kubernetes.
- Automated CI/CD pipelines with GitLab for seamless deployments
- Integrated real-time data feeds with complex event processing engines
- Optimized performance-critical components using Rust for speed and efficiency
- Leveraged AWS services for scalable cloud-based computation
Staff Data Engineer
Architected large-scale data pipelines using Apache Spark and Scala, optimizing for real-time analytics in the e-commerce domain. Spearheaded the migration of data infrastructure to Databricks, enhancing processing speed and reliability. Developed a custom data validation framework in Python, ensuring data integrity across distributed systems.
- Implemented Rust-based data transformation tools for performance gains
- Designed SQL-based ETL workflows for complex data models
- Automated data quality checks using custom-built Python scripts
Senior Data Engineer
Designed and implemented distributed data processing pipelines using Apache Spark, optimizing for large-scale data ingestion and transformation. Developed real-time analytics systems in the finance sector, leveraging Scala and Akka to ensure high throughput and low latency. Spearheaded a project to refactor legacy data systems, enhancing performance and scalability by 40%.
- Led migration from monolithic to microservices architecture
- Implemented CI/CD pipelines for data workflows
- Optimized SQL queries for a 200TB data warehouse
- Integrated machine learning models into data processing pipelines
Founding Database Engineer
Pioneered work on database internals and distributed systems design. Advanced knowledge of operations and systems design on AWS.
- Pioneered database internals
- Expert in distributed systems design
- Advanced AWS operations knowledge
Senior Big Data Engineer
Engineered large-scale data processing workflows with Apache Spark, optimizing ETL operations. Developed data warehousing solutions using Hive for efficient querying and reporting. Managed Hadoop clusters, ensuring high availability and fault tolerance.
- Implemented real-time analytics using Spark Streaming
- Enhanced SQL query performance with custom indexing
- Automated data pipeline monitoring and alerting
Senior Software Engineer / Solution Architect / Trainer
Architected and implemented large-scale distributed systems for real-time data processing in the telecommunications domain. Developed scalable, fault-tolerant microservices using Scala and Java, optimizing for high throughput and low latency. Played a key role in migrating legacy systems to cloud-native architectures using functional programming paradigms.
- Designed a custom DSL for domain-specific data transformations
- Led the development of a real-time analytics platform handling millions of events
- Implemented a multi-tenant solution for scalable SaaS applications
- Conducted workshops on functional programming for enterprise teams