Data Engineer

Moscow, Russia
Middle • Senior
Analytics, Data Science, Big Data • OLAP • Engineer • Developer • Data Science • Python • Hadoop • ClickHouse • MapReduce • OLAP • Vertica
Remote work
3 to 5 years of work experience
Resume file available (protected)
About me

Currently working as a Data Engineer.

My skills and experience

  1. NDA, Senior Data Engineer | New York, USA | June [resume access required] Present
    Data Engineer/Architect at a creator-focused social network in the US market
    • Led and executed a full migration of the core analytical platform from AWS Athena to ClickHouse, reducing analytics costs by 17% and improving query latency and overall analytical performance
    • Re-architected real-time ingestion by migrating from Kinesis Streams to AWS Lambda–based processing, cutting monthly infrastructure costs from $22k to $12k
    • Implemented end-to-end monitoring and alerting for data pipelines, reducing incident resolution and support effort by 40%; fully owned the production data infrastructure (Airflow, DMS, streaming and ETL jobs)
    • Designed and built a reliable end-to-end ETL pipeline ingesting data from PostgreSQL, DynamoDB and third-party APIs (AppsFlyer, etc.), delivering production-grade data for in-app recommendation systems (feed/reels-like experience)
    • Defined ClickHouse data architecture used as the single source of truth for all analytical dashboards; enabled migration to self-hosted Redash, eliminating BI licensing costs
     
  2. Wildberries, Senior Data/DWH Engineer | Moscow, Russia | July [resume access required] June 2025
    • Designed and optimized a hot–cold analytical storage architecture based on ClickHouse and HDFS, handling 100+ TB of data and reducing query costs by 30%
    • Improved ClickHouse storage efficiency by applying optimal ORDER BY strategies, ZSTD compression codecs and LowCardinality column types, reducing storage costs by 25%
    • Optimized Spark-based ETL pipelines and cluster configurations, reducing batch execution time by 1.5–2x
    • Reworked Hive STG layer using Kafka-based incremental updates, cutting processing time from 40 to 10 minutes (4x)
    • Refactored complex analytical SQL queries in ClickHouse, achieving 5x performance improvements for critical BI workloads
     
  3. BERESNEV Games, Middle Data/DWH Engineer | Prague, Czech Republic | May [resume access required] July 2024
    • Optimized core analytical pipelines in ClickHouse (indexes, projections, MergeTree engines), reducing end-to-end processing time from 3 hours to 30 minutes (6x)
    • Designed and developed an internal analytics service using FastAPI for ad monetization analysis (“waterfall”), saving 5+ hours per week of manual analyst work
    • Established a single source of truth for core product KPIs and data quality, increasing analytics team efficiency by 20%
    • Developed a Python library for ingesting and normalizing data from external APIs and S3, reducing pipeline errors by 30%
     
  4. OZON, Data Analyst/ETL Developer | Moscow, Russia | June [resume access required] May 2022
    • Designed and maintained large-scale analytical datamarts (billions of rows) in Vertica and ClickHouse, supporting daily reporting for 10–20 BI analysts
    • Containerized and deployed data services (Airflow) using Docker, reducing setup and deployment time by 20%
    • Improved ETL pipeline stability and monitoring, reducing incident frequency related to data delays and schema issues

