Marcio A. Data Engineer

Data Engineer, Data Science

Summary

- 20+ years of experience in IT
- Experience in roles and areas including Support Analyst, Systems Analysis, Software Development, Data Engineering, Database Engineering, Systems Integration, Integration Development, Data Analysis, Data Structures, Data Integration, Database Migration, Database Modeling, Database Analysis/Development, Database Design, Database Management, Data Warehousing, Data Discovery, ELT/ETL (Extract, Transform, Load), Data Pipelines, Data Replication, Business Intelligence, Data Modeling (Dimensional and Relational), and Performance Tuning (SQL, PL/SQL applications); experience working with large databases and datasets; testing, validation, implementation, and troubleshooting skills; performance and resource optimization; maintaining development, test, and production environments; experience with process automation (monitoring, alerting, auto-corrective) and distributed systems; strong knowledge of PL/SQL, SQL, Oracle Database and
- Knowledge of Agile methodologies (Scrum, Kanban).
- Upper-Intermediate English
- Available ASAP
- No vacation plans

Experience

Data Engineer, Systems Integration Consulting, ETL/ELT, Data Engineering, NEORIS

May 2020 – May 2021 (1-year project)
Responsibilities:

  • I worked as an IT Consultant assigned by NEORIS to British American Tobacco Holding.
  • Developed and supported the metadata ingestion process to help the company create a modern Data Warehouse/Data Lake from many sources (databases, datasets, files, APIs) using the WhereScape RED tool, ensuring that it ran efficiently, reliably, and scalably through the initiation and monitoring of ingestion processes.
  • Collaborated closely with internal and external stakeholders in the ingestion process. Contributed to the design, development, testing, integration, support, and documentation of user-facing applications and back-end systems.

Technologies: SQL, Snowflake, WhereScape RED, Microsoft Azure (Azure Blob Storage, Azure DevOps, Azure Data Factory)

Data Engineer, ETL Developer, Data Integration, Database Developer, Software Engineer, System Analyst, IT Consultant

September 1994 – Present (more than 25 years)
Responsibilities:

  • Requirements Analysis, System Analysis, Software Developer
  • Database Developer, Database Design, Database Modeler, Database Object Management
  • Database Analyst, Data Analyst, ETL Developer, Oracle Developer, Software Developer, Data Integration
  • Data Replication, Performance Tuning and Query Optimization, Process Automation, experience working remotely when needed.

Data Engineer, ETL, Database Developer, Systems/Data Analyst, Software Engineer, DBC Company

March 2015 – April 2019 (4 years 3 months)
Responsibilities:

  • I worked as an IT Consultant for clients of DBC Company (www.dbccompany.com.br). Some of the clients I was assigned to are listed below.
  • Systems Analyst, Oracle Developer, Database Developer

Dana Holding Corporation Brazil

March 2015 – October 2015 (7 months)
Responsibilities:

  • I worked alongside the client’s business area. My main tasks were task management, business process improvement, process automation, requirements analysis, system analysis, software development, database design, database modeling, database object management, and maintaining development, test, and production environments.

Technologies: Oracle Database (10g, 11g), Oracle Forms 10g, Oracle Reports 10g, Oracle PL/SQL, SQL, Windows, Microsoft Office.

ETL, Database Developer, Systems Analyst, Data Analyst, Software Engineer, Grupo DIMED

November 2015 – April 2019 (3 years 6 months)

Description: This project made it possible to keep critical data up to date in near real time, including promotions, prizes, item prices, sales, sale items, taxation, and customer information, avoiding integrity errors and creating predictive alerts to resolve possible errors. In another special project, I worked on improving the PBM (Program Benefits Management) validation process based on sales information and on optimizing the data warehouse integration process.
Responsibilities:

  • I worked as a System Analyst, Data Analyst, ETL Developer, Oracle Developer, and Software Developer, covering data integration, data replication, performance tuning, and query optimization, and worked remotely when needed.
  • My main tasks were analysis, development, process optimization, requirements analysis, process automation, business process improvement, system analysis, database design, database modeling, database object management, database analysis, ETL, data integration, and the data replication process.
  • Monitored more than 400 Oracle instances (pharmacy stores) and maintained development, test, and production environments.
  • I was responsible for analyzing, designing, and developing a new integrated system with a data pipeline from 4 main Oracle instances and more than 400 other Oracle instances outside the company, located in many cities across 4 provinces.

Technologies: Oracle Database (10g, 11g, 12c), Oracle Forms 10g, Oracle Reports 10g, Oracle PL/SQL, SQL, Oracle APEX, Python, Linux, Windows, Microsoft Office.


São João Pharmaceutical Group

2014
Analysis and Development - Data Integration Process Monitoring for more than 400 Database Instances (pharmacy stores), maintaining development, test and production environments.

Court of Justice of Rio Grande do Sul – Brazil

2013
Analysis and Development - Data Replication Process Monitoring for about 100 Oracle instances (county courthouse databases), maintaining development, test, and production environments.

TNT - Brazil, Electronic Document Management System

2010

Analysis and Development, maintaining development, test, and production environments, and implementing near-real-time tracking control based on EDI files for Apple products.
Technologies: GCP, AWS, Azure, Python, Oracle Database (9i, 10g, 11g), Oracle Forms (6i, 10g), Oracle Reports (6i, 10g), Oracle PL/SQL, SQL, ODI 10g (Oracle Data Integrator), MySQL, PostgreSQL, Pentaho Data Integration, Metabase, Java, Microsoft Visual Basic, Microsoft SQL Server, T-SQL, ZIM, Microsoft Office, Windows, Unix, Linux, SVN, CVS, GitHub.

Education

Artificial Intelligence, Machine Learning and Data Science - Lato Sensu Postgraduate

Lato Sensu - Postgraduate Degree - Online, Information Technology
Universidade - Centro de Inovação VincIT – UNICIV – On-line
https://www.uniciv.com.br/cursos/intelig%C3%AAncia-artificial%2C-machine-learning-e-data-science

Database Technology
Associate Degree – Online, Information Technology
Universidade Cruzeiro do Sul – On-line - www.cruzeirodosulvirtual.com.br/graduacao/banco-de-dados
January 2018 – June 2021

Bootcamp Data Lakehouse - Snowflake (8 hours)
triggo.ai – www.learning.trigoo.ai/
On-line, October 2021

Bootcamp Cloud Data Engineering (148 hours)
IGTI - www.igti.com.br
On-line, August and September 2021

Data Engineering – Apache Kafka
Apache Kafka, Data Architecture
Training at One Way Solution - www.linkedin.com/company/one-way-solution
On-line, August 2021

Kubernetes Bootcamp 2.0
Formação KubeDev – kubedev.io
On-line, July 2021

Data Engineer on Databricks
Databricks Academy - academy.databricks.com/data-engineer
On-line, June 2021

Data Engineering with Apache Spark
Spark, Kafka, Databricks, Cloud Stacks, Architecture, Streaming and Batch Processing
Training at One Way Solution - www.linkedin.com/company/one-way-solution
On-line, May 2021

Data Engineering on Databricks
Architecture, Streaming and Batch Processing, Databricks, Kafka, Spark, Cloud Stacks
Training at One Way Solution - www.linkedin.com/company/one-way-solution
On-line, November 2020

Data Engineer (304 hours)
Professional Extension Program at Data Science Academy - www.datascienceacademy.com.br
On-line, October 2020

Data Engineering on Apache Kafka Confluent
Kubernetes, Kafka, Apache Druid, Spark, Cloud Stacks
Training at One Way Solution - www.linkedin.com/company/one-way-solution
On-line, September 2020

Bootcamp DevOps Professional (148 hours)
IGTI - www.igti.com.br
On-line, September and October 2020

Big Data Support and Infrastructure (176 hours)
Specialization Program at Data Science Academy - www.datascienceacademy.com.br
On-line, July 2020

Google Cloud Platform – Architecting with Google Compute Engine
Specialization Program at Coursera - www.coursera.org
On-line, June 2020

Google Cloud Platform – Cloud Engineering with Google Cloud
Specialization Program at Coursera - www.coursera.org
On-line, June 2020

Google Cloud Platform - Big Data, Data Science and Machine Learning
Specialization Program at Data Science Academy - www.datascienceacademy.com.br
On-line, April 2019

AWS Solutions Architect
Specialization Program at Data Science Academy - www.datascienceacademy.com.br
On-line, December 2018

Coursera - www.coursera.org
Data Engineering on Google Cloud Platform Specialization - Extension Program – Online
- Google Cloud Platform Big Data and Machine Learning Fundamentals
- Unstructured Data with Cloud Dataproc
- Serverless Data Analysis with BigQuery and Cloud Dataflow
- Serverless Machine Learning with TensorFlow
Associate Cloud Engineer - Extension Program – Online
- Essential Google Cloud Infrastructure: Foundation
- Google Cloud Platform Fundamentals for AWS Professionals
- Google Cloud Platform Fundamentals: Core Infrastructure
- Essential Google Cloud Infrastructure: Core Services
- Elastic Google Cloud Infrastructure: Scaling and Automation
- Reliable Cloud Infrastructure: Design and Process
- Preparing for the Google Cloud Associate Cloud Engineer Exam

In Progress/Soon (On-Demand):
- Kubernetes Training - https://kubedev.io
- Data Scientist
Specialization Program at Data Science Academy - www.datascienceacademy.com.br
- Data Architect
Specialization Program at Data Science Academy - www.datascienceacademy.com.br
- Machine Learning Engineer
Specialization Program at Data Science Academy - www.datascienceacademy.com.br
- Blockchain Engineer
Specialization Program at Data Science Academy - www.datascienceacademy.com.br

Together, these programs will complement and improve my skills in many areas, such as:
- Data Warehouse - Design and Implementation
- Data Lake - Design, Project and Integration
- Data Engineering with Hadoop Ecosystem and Spark
- Big Data Real-Time Analytics with Python and Spark
- Big Data, Data Science and Machine Learning - Google Cloud Platform
- Machine Learning and Artificial Intelligence in Distributed Environments
- Security and High Availability of Data
- Analytics, Visualization, Reports and Decision-Making
- Distributed Databases
- SQL and NoSQL Databases
- Database Management and Data Governance
- Big Data Analytics
- Data Mining and Predictive Analysis
- Programming in Python, Scala, Java, JavaScript, Go, Solidity
- Blockchain Foundations and Development of Decentralized Applications
- Machine Learning with Scala and Spark
- Predictive Analysis with Machine Learning in Java
- AI Applications with Deep Learning in Java
- Project Management of Big Data
- DevOps, DataOps, Docker, Kubernetes
- Google Cloud Platform, Amazon Web Services, Azure