Upstaff Sign up
Oleh O
🇺🇦Ukraine (UTC+02:00)
Created AtUpstaffer since January, 2025

Oleh O — AI/ML Engineer and Data Scientist

Expertise in Data Science (7.0 yr.), AI and Machine Learning (4.0 yr.).

Last verified on February, 2025

Core Skills

Python
Python
Azure
Azure
TensorFlow
TensorFlow
PyTorch
PyTorch

Bio Summary

* Data Scientist with a Master’s Degree in Computer Science and extensive experience in machine learning, deep learning, and cloud services ( Azure,AWS and GCP).
* 7 years of experience, proficient in building ML pipelines and deploying scalable solutions.
* Developed and deployed a voice-to-voice pipeline for a call center VoIP system using Whisper, Llama, and MMS-TTS, including API integrations and Docker deployment.
* Implemented a Google Doc AI Wrapper, improving document recognition accuracy through advanced preprocessing techniques.
* Designed a deforestation detection system using clustering and Azure-based services to monitor forest areas.

Technical Skills

Programming LanguagesPython
AI & Machine LearningAmazon Lex, AWS ML (Amazon Machine learning services), Google Document AI, Keras, Kubeflow, Linear models, LLaMA, ollama, OpenCV, OpenVoice, PCA, Prompt Engineering, PyTorch, Spacy, TensorFlow, Transformer models, vLLM, Whisper
.NET PlatformAzure
Python Libraries and ToolsKeras, Matplotlib, NLTK, Pandas, Pillow, Pvporcupine, PyTorch, Seaborn, TensorFlow
Mobile Frameworks and LibrariesKVO
JavaScript Libraries and Toolsp5.js
Data Analysis and Visualization TechnologiesDecision Tree, ETL, Jupyter Notebook, Pandas
Cloud Platforms, Services & ComputingAWS, Azure
Amazon Web ServicesAWS Boto3, AWS LightSail, AWS ML (Amazon Machine learning services)
Google Cloud PlatformGCE, GCP BigQuery, Google Docs
SDK / API and IntegrationsAPI, Google API
Third Party Tools / IDEs / SDK / ServicesAsterisk, Sublime Text
iOS Libraries and ToolsCore Audio
Virtualization, Containers and OrchestrationDocker, GCE
Web/App Servers, MiddlewareMSCS (Microsoft Cluster Server)
PlatformsUnity
Mail / Network Protocols / Data transferVoIP
Other Technical SkillsMIP, MIPS, MLP, MMS, MMS-TTS, PELCO-D, Rhino Grasshopper

Work Experience

Caller - VoIP server for call center

Implemented voice-to-voice pipeline in Hebrew for VoIP server for call center.

Responsibilities:

  • Implemented voice-to-voice pipeline Whisper (STT) - Llama (text-only) - mms-tts (TTS) - OpenVoice (voice cloning),
  • Implemented API to interact with models and to integrate them with the VoIP server,
  • Prepared dataset for training text-to-speech and speech-to-text models,
  • Trained whisper model,
  • Deployed models using Docker, created Dockerfiles for models,
  • Prompt engineered LLama,
  • Configured Asterisk VoIP server,

Tools and technologies:Docker, AWS, ollama, vLLM, Llama, Whisper, MMS-TTS, OpenVoice, Prompt Engineering, Asterisk

Google Doc AI Wrapper

Implemented wrapper for Google Doc AI service in order to improve service accuracy in recognizing freeform documents

Responsibility:

  • Used image preprocessing techniques to improve document recognition by Google service,
  • Parsed recognized document data and searched for desired form fields, tables and other valuable data,
  • Implemented processing and filtering recognized table data,
  • Implemented optimization function that combine different image preprocessing methods in order to find combination that provide most recognized data that are searched for

Tools and technologies: Google document ai, OpenCV, pillow, Pandas

Assets tracking with camera POC

tracking asset’s location: real-time positioning, collecting information about cargo and handling equipment using machine vision

Responsibilities:

  • Trained container segmentation model, added OCR,
  • Created a pipeline for localizing an object on a map using a depth camera,
  • Implemented optimization function that combine different image preprocessing methods.

Tools and technologies: Google Vision API, OAK-D (OpenCV Kit), Tensorflow, Keras, Unity3D (synthetic data generation, visualization)

LiveScetchScanner

Implemented camera image and mask processing module; implemented voice command interface

Responsibilities:

  • Extraction image part based on mask from NN and Hough line detector,
  • Image transformation, processing, filtering,
  • implementation voice command interface using Amazon Lex bot API and rhino model

Tools and technologies: OpenCV, Pyaudio, Pvporcupine, rhino, Amazon Lex, boto3

Meowtalk

App to translate the cat’s meows

Responsibilities:

  • Created Docker containers with code parts needed for model training,
  • set up training pipeline for Meowtalk model using Kubeflow Pipeline in GCP,
  • Modified data preprocessing to achieve better accuracy of the model.

Tools and technologies: Docker, Kubeflow pipeline, GCP, Keras

Creating the model to predict the playoff refsults of the World Cup

Responsibilities:

  • Data analysis, preprocessing, data cleaning
  • Search for additional data
  • Feature development based on all available data
  • Dataset formation for training models
  • Selection of models (Linear models, Decision trees, MLP) for prediction and analysis of results

Tools and technologies used: scikit-learn, PCA, Linear models, Decision trees, MLP

Meeting summarizer

Implemented model for creation summaries of audio meetings based on article

Responsibility and achievements:

  • Research of existing abstractive text summarization methods
  • Implemented tree-based method

Tools and technologies used: Python, Azure, spacy, NLP-abstractive text summarization, MIP

Deforestation detection

Developing model for deforestation detection on Ukraine territory

Achievements:

  • Collected dataset of map fragments: writing scripts to work with the API, searching for the necessary fragments by coordinates, date, filtering by cloudiness and types of fragments and their download
  • Created a dataset for network training by clustering images (k-means, image clustering), searching for outliers / anomalies, and then selecting images from different clusters
  • Implemented forest watch service based on Azure function
  • Implemented a model for deforestation detection.

Tools and technologies used: Rasterio, MS Azure, image clustering

Education

  • Master’s Degree in Computer Science,National University

How to hire with Upstaff

1

Talk to Our Talent Expert

Our journey starts with a 30-min discovery call to explore your project challenges, technical needs and team diversity.

2

Meet Carefully Matched Talents

Within 1-3 days, we’ll share profiles and connect you with the right talents for your project. Schedule a call to meet engineers in person.

3

Validate Your Choice

Bring new talent on board with a trial period to confirm you hire the right one. There are no termination fees or hidden costs.

Why Upstaff

Upstaff is a technology partner with expertise in AI, Web3, Software, and Data. We help businesses gain competitive edge by optimizing existing systems and utilizing modern technology to fuel business growth.

Real-time project team launch

<24h

Interview First Engineers

Upstaff's network enables clients to access specialists within hours & days, streamlining the hiring process to 24-48 hours, start ASAP.

x10

Faster Talent Acquisition

Upstaff's network & platform enables clients to scale up and down blazing fast. Every hire typically is 10x faster comparing to regular recruitement workflow.

Vetted and Trusted Engineers

100%

Security And Vetting-First

AI tools and expert human reviewers in the vetting process is combined with track record & historically collected feedbacks from clients and teammates.

~50h

Save Time For Deep Vetting

In average, we save over 50 hours of client team to interview candidates for each job position. We are fueled by a passion for tech expertise, drawn from our deep understanding of the industry.

Flexible Engagement Models

Arrow

Custom Engagement Models

Flexible staffing solutions, accommodating both short-term projects and longer-term engagements, full-time & part-time

Sharing

Unique Talent Ecosystem

Candidate Staffing Platform stores data about past and present candidates, enables fast work and scalability, providing clients with valuable insights into their talent pipeline.

Transparent

$0

No Hidden Costs

Price quoted is the total price to you. No hidden or unexpected cost for for candidate placement.

x1

One Consolidated Invoice

No matter how many engineers you employ, there is only one monthly consolidated invoice.

Ready to hire Oleh O
or someone with similar Skills?
Looking for Someone Else? Join Upstaff access to All profiles and Individual Match
Start Hiring