frankxieke

Senior Java Engineer

Distributed systems such as message queues

Distributed object storage systems based on Haystack or Paxos


Experience: 5 years

Yearly salary: $12,000

Hourly rate: $20

Nationality: 🇨🇳 China

Residency: 🇨🇳 China


Experience

Big data warehouse construction
sidian
2022 - 2024
Big data warehouse construction, for example ETL work that syncs data from Hive to Doris to improve query efficiency for BI use. Built an ETL service that scans all tables in Hive, detects each Hive schema, and creates the corresponding table in the OLAP database (Doris). It supports full sync for small tables and incremental sync for partitioned Hive tables, based on the partition field. Built the company's data warehouse, especially the ODS layer synced from the source systems, and built wide tables for the DWD layer with multiple dimensions. Used Doris as the storage layer and the open-source BI tool DataEase to explore data and build dashboards.
Open-source message queue: Pulsar, BookKeeper
Beijing StreamNative Information Technology Co.
2021 - 2022
Open-source message queue Pulsar/BookKeeper
Distributed file storage system operations; back-end API gateway and microservice development
Beijing 58 Tongcheng Information Technology Co.
2019 - 2021
Built a distributed file system, similar to the open-source CubeFS, that can be mounted in containers using FUSE. An in-memory metadata server stores file names and directories. Data nodes store the partitioned files, and their locations are stored on the metadata server.
Mainly responsible for developing and maintaining the message queue components of the big data infrastructure
Beijing ByteDance Network Technology Co.
2017 - 2019
For example, a log data collector to Kafka, a Kafka-to-HDFS dumper, and Kafka mirroring between clusters. Built a distributed Kafka-to-HDFS sink service, KafkaStreamDumper, which uses a NoSQL store to record the committed offsets of each partition. A master node splits partitions among workers, and each worker consumes messages and writes them to HDFS independently.
Mainly responsible for the company's internal distributed NoSQL database, distributed object storage, and other development work
Beijing 58 Tongcheng Information Technology Co.
2015 - 2017
Built a distributed NoSQL platform that uses RocksDB as the storage layer, with a stateless proxy to handle client requests. Provided users with SDKs for different programming languages. Built a distributed object storage system with a RESTful interface. The principle is similar to Haystack from Facebook: an uploaded file is split into small needles with multiple replicas, and many needles are stored on a data node as one big file. The file metadata (name, location, bucket) is stored in the NoSQL database.
Software R&D Engineer
Nanjing Research Institute, Huawei
2014 - 2015
Mainly responsible for the development and maintenance of CDN (content distribution network) media servers.

Skills

big-data
golang
java
redis
english