Nikita Data Engineer
Summary
A seasoned Data Engineer with over 6 years of experience in the field of software and big data engineering. Holds a strong academic background in Computer Science and Software Engineering, certified as a Google Cloud Professional Data Engineer. Demonstrates deep expertise in high-load system design, performance optimizations, and domain-specific solutions for Healthcare, Fintech, and E-commerce. Proficient in Python and SQL, with significant exposure to data engineering tools such as Apache Hadoop, Apache Spark, and Apache Airflow, and cloud technologies from AWS and GCP. Adept at working with various databases and message brokers, excelling in data modeling, BI, and data visualization using tools like Looker, Power BI, and Tableau. Enhanced system efficiencies through SQL and data pipeline optimizations, driving significant improvements in processing speed and system performance. A collaborative engineer with a strong grasp of DevOps practices, committed to best-in-class data governance and security standards.
Work Experience
Data Engineer, Data Mesh Implementation for Healthcare Data
Duration: 08.2021 - till nowSummary:
- A data-driven startup leveraging the concept of data mesh to process and transform massive healthcare data
- Promoting data democratization within the organization
Technologies: Python, SQL, Apache Spark, PySpark, Apache Airflow, AWS, Kafka, Redis, Oracle, Pandas, NumPy, Tableau, Bash scripting, CI/CD, Docker, Docker Compose, Kubernetes, GitHub Actions, GitHub
Data Engineer, Financial Data Management and Analytics Platform
Duration: 07.2019 - 07.2021Summary:
- Revolutionizing financial data management and analysis with a unified platform that brings together disparate data sources and facilitates advanced analytics
- A combination of traditional DWH and Data Lake features for overcoming data silos, incomplete insights, and inefficient analysis methods
Technologies: Python, SQL, Apache Airflow, Apache Spark, PySpark, GCP, Redis, MongoDB, PostgreSQL, Pandas, NumPy, Tableau, Matplotlib, Scikit-learn, Jenkins, CI/CD, Bash scripting, Docker, Docker Compose, GitHub
Data Engineer, Sales Analysis and Business Performance Improvement
Duration: 12.2018 - 07.2019Summary:
- Sales analysis project to improve business performance through insights into customer behavior by utilizing data visualization and statistical analysis to identify trends and opportunities for growth
- Optimized pricing, promotions, and marketing campaigns
Technologies: Python, SQL, Kafka, Apache Spark, PySpark, Apache Hadoop, AWS, MS SQL, Redis, MongoDB, Power BI, Pandas, NumPy, Bash scripting, CI/CD, Docker, Docker Compose, Kubernetes, Bitbucket
Data Engineer, E-commerce Platform for Healthy Lifestyle Products
Duration: 01.2018 - 12.2018Summary: E-commerce platform for a healthy lifestyle online store, providing product selection, ordering, delivery, and advice in sports and nutrition, with a focus on convenience and customer engagement.
Responsibilities: Provide cloud solutions with AWS, resolve RabbitMQ issues, secure Spark Streaming applications, automate data processes in Power BI, develop RabbitMQ messaging components, configure EC2 performance, orchestrate containers with Kubernetes, implement AWS Lambda serverless parts, write SQL queries, monitor RabbitMQ clusters, implement data backup and recovery strategies, troubleshoot Apache Spark, assist with data modeling, optimize SQL queries.
Technologies: Python, SQL, RabbitMQ, Apache Spark, PySpark, AWS, MongoDB, PostgreSQL, Power BI, Pandas, NumPy, Docker, Docker Compose, Kubernetes, CI/CD, Bash scripting, GitLab
Education
- Computer Science and Software Engineering
Certification
- Google Cloud Certified – Professional Data Engineer
2023