Scala Developer

Distributed Systems Engineer – Rust/Scala – Hpc, Multithreaded Application

Hello !

I am an Engineer specialized in developing multithreaded applications and distributed systems (Big Data).

These two paradigms, while distinct, share a common philosophy: maximizing computational power through parallelization !

From constructing ETLs with Scala and Python to developing multithreaded applications with Akka (Scala) and Tokio (Rust), I leverage frameworks tailored to each challenge. My AWS certification and Databricks training further strengthen my expertise in scalable cloud solutions and Big Data processing.

I am able to work with you on:
o Conception of diverse ETL, databases, multithreaded applications.
o Realisation, industrialization of data processing pipelines.
o The collect and exploration of your data.
o Automatization of Data Ops : git, CI/CD
o Contribute to the constant improvement of your team's agility
o The development of Spark applications for the processing of big data at scale

I am also skilled with : Agile methodology, AWS, Kubernetes, Databricks, TDD.

Experience: 3 years

Yearly salary: $60,000

Hourly rate: $290

Nationality: 🇫🇷 France

Residency: 🇫🇷 France

Experience

Big Data Mission in Market Finance

NATIXIS corporate & investment banking

2024 - 2024

Development of Spark/Scala Workflows: Automated reserve calculations across various financial assets, optimized for high data volumes. Data Integration from Multiple Sources: Integrated data from Hive, IQ, and ASE for rapid access by financial analysts and data scientists. CI/CD Pipelines: CI/CD pipelines with Jenkins to automate and ensure reliable deployments. BDD Testing with Cucumber: Implemented comprehensive functional testing for calculation workflows. Job Scheduling on Control-M: Scheduled and managed jobs to ensure efficient processing. Hive Scripting for Intermediate Tables: Developed Hive scripts to create intermediate tables and optimize data access.

Arbitrage Bot (MEV)

CyptoLayout

2024 - 2024

Scalable Cross-EVM Solution: Millisecond-level arbitrage detection with an asynchronous, multithreaded architecture and secure data sharing (thread-safe). High-Performance Token Pricing Algorithm: Recursive algorithm enabling token price estimation with detection at sub-millisecond precision. Advanced Tracing and Real-Time Monitoring: Arbitrage opportunity tracking with Elastic Stack (Filebeats, Kibana). Robust Unit Testing and Generic HTTP Solution: Comprehensive unit testing and a generic solution for HTTP request/response management. Deployment Scripting and Binary Versioning. BSC Node Configuration: Configured BSC node with Geth, currently testing with Reth for enhanced performance.

Big Data Mission on a CRM Tool (Data Flow Migration)

BNP Parisbas

2022 - 2022

Code Coverage (90%) in Scala/Maven. Data ingestion package maintenance (Hive to Cassandra). Adaptation of Spark processing. Implementation of Oozie scheduler. Statistical analysis in Cassandra using Spark-shell (Data Integrity).

Big Data Developer Mission for Fraud Prevention

BNP Parisbas

2022 - 2023

High-Performance API Development for Real-Time Fraud Detection: Developed on a multithreaded, scalable API with Play/Akka (actor model) capable of processing millions of records daily, achieving millisecond-level fraud detection. Continuous Data Stream Integration: Integrated Kafka for real-time data flow within the API, enhancing responsiveness and data handling. Large-Scale Data Processing: Developed and optimized Spark/Scala jobs for efficient processing of high data volumes. Data Modeling and Query Optimization: Created CQL (Cassandra Query Language) models and queries for efficient storage and rapid retrieval (Design by query). CI/CD Pipeline with Real-Time Monitoring: Set up a Jenkins CI/CD pipeline with real-time monitoring and alerting via Splunk and Dynatrace. Comprehensive Testing: Implemented unit and integration tests to ensure 90% code coverage.

Prove of Concept

Aubay

2021 - 2022

Deployment of Auto Machine Learning Platform on AWS Instance. Kubernetes cluster virtualization on Docker using kind. Deployment of Kubeflow on the cluster (auto-ML platform). External access to pods: user interface, Kibana. Implementation of machine learning pipelines coded in Python. ELK index mapping and prediction visualization on Kibana.

Data scientist (Internship)

Aubay

2021 - 2021

Computer Vision Algorithm: Object detection, segmentation, and distance estimation of objects in a monocular image. Use of Pre-trained Models: maskRCNN, Yolo, detectron2, OpenCV. Geometric Method: Inverse Perspective Mapping (IPM). Development: Application with ReactJS (frontend) and Flask (backend).

Skills

hadoop

rust

scala

english

french

Create Profile Hire Scala Developers