Angel N., Web Scraping Engineer with Data Analytics Skills

Data Scraping (8.0 yr.), Data Analyst (DA) (8.0 yr.)

Summary

- 10+ years of experience in data and web scraping, data analysis, and crawler development;
- Developed 200+ web crawlers for fashion brands and 700+ crawlers for job offer aggregation including Fortune 500 companies;
- Led a team of junior engineers in data crawling projects;
- Managed scraping infrastructure, data parsing, and cleaning procedures using pandas and regex;
- Skilled in Python, PHP, JavaScript, SQL, and tools like Scrapy, Selenium, Pandas, BeautifulSoup, TensorFlow, and Regex;
- Analyzed data and built visualizations with Tableau and Power BI;
- Experienced with version control (Git) and Agile/Scrum methodologies;
- Used reverse engineering, XPath, CSS selectors, and AJAX calls to handle dynamic websites;
- Developed software architecture and contributed to test automation;
- Worked on projects involving distributed task processing with Gearman and browser automation with Selenium and Puppeteer;
- Stayed updated on web scraping best practices and emerging technologies.

Web Scraping Experience

Web Scraping Engineer, NDA

Duration: Nov 2021 – Present

  • Led a 5-person engineering team focused on dynamic pricing projects for e-commerce;
  • Designed and maintained scrapers for major European retailers and marketplaces;
  • Oversaw large-scale data collection and adaptation to seasonal website changes.

Scraped Resources:

  • Retail & Marketplaces: amazon.de, amazon.es, amazon.fr, amazon.it, amazon.nl, amazon_uk, carrefour.es, carrefour.fr, auchan.fr, leclerc.fr, elcorteingles.es, mediamarkt.at, mediamarkt.de, mediamarkt.es, mediamarkt.nl
  • Tech & Electronics: acer.de, asus.de, dell.de, hp.de, hp.uk, irobot.de, irobot.es, irobot.fr, irobot.nl, irobot.uk, otto.de, skullcandy.eu
  • Home & DIY: fnac.fr, leroymerlin.fr, electrodepot.fr, lechai.fr

Web Scraping Engineer, Recruit.net

Duration: June 2020 – Oct 2021

  • Led a team of two junior engineers, managing over 1M job offer crawls daily;
  • Created 700+ scrapers for job boards and enterprise career portals;
  • Parsed XML-based job data and ensured stable data pipelines under high load.

Scraped Resources:

  • Big Tech & SaaS: amazon, apple, google, facebook, stripe, tiktok, netflix, uber, vmware, docker, intel, ibm, cisco, taleo_net, join_com
  • Fortune 500 & Enterprises: boeing, tesla, daimler, general_motors, verizon, ups, pepsico, nike, dell, paypal, campbellsoupcompany, coca_cola_company, disney
  • HR Platforms & Job Boards: applicantpool, careerplug, allhires, aviahire, bankofamerica, phillips66_jobs

Web Scraping Engineer, Tool.Domains

Duration: June 2019 – May 2020

  • Engineered scraping tools to collect metadata, tags, links, and screenshots from millions of domains;
  • Applied distributed scraping using Gearman and browser automation with Selenium and Puppeteer.

Technologies: Python, PHP, JavaScript, Selenium, Puppeteer, Gearman

Web Scraping Engineer, Clippings

Duration: Sept 2017 – Oct 2018

  • Collected and structured product data for furniture brands and online retailers;
  • Applied data science tools for product matching and analysis.

Technologies: Pandas, NLTK, TensorFlow

Web Scraping Engineer, Styloko

Duration: July 2015 – June 2017

  • Created ~200 fashion e-commerce crawlers using Scrapy and AWS-based pipelines;
  • Applied XPath, regex, CSS selectors, and reverse-engineered AJAX calls.

Scraped Resources: alexandermcqueen, allsaints, armani, anthropologie, annataylor, gap, nike, mango, rayban, lacoste, pandora, ragbone, sephora, avon, cultbeauty, jimmychoo, stellamccartney, viviennewestwood

Work Experience

Founder, NDA

Duration: Nov 2021 - Current

Summary: Software development company delivering innovative business solutions across various industries. Projects focus on custom applications to optimize business processes.

Responsibilities:

  • Established a leading software development company specializing in innovative solutions for businesses;
  • Developed the company's vision, mission, and strategic direction, driving growth and profitability;
  • Oversaw all aspects of business operations, including budgeting, resource allocation, and performance management, ensuring optimal efficiency and productivity.

Web Scraping Engineer, Recruit.net

Duration: June 2020 - Oct 2021

Summary: A global job search platform that aggregates listings from company websites, agencies, and job boards across 30+ countries. Offers candidate matching, recruitment ads, and analytics tools for employers.

Responsibilities:

  • Monitored scraping infrastructure, including servers and proxies, to ensure uninterrupted data extraction;
  • Implemented data parsing and cleaning procedures to transform raw scraped data into usable formats, using tools like pandas and regex;
  • Stayed updated with the latest web scraping techniques, emerging technologies, and industry best practices to continuously enhance scraping efficiency and effectiveness.

Technologies: Python, Pandas, Regex.

Web Scraping Engineer & Data Analyst, Edoms

Duration: June 2019 - May 2020

Summary: A platform focused on domain name research, acquisition, and SEO services, with access to a large database of expired and aged domains. Provides tools for evaluating domain value and supports SEO-driven domain selection and strategy.

Responsibilities:

  • Created documentation for web scraping processes, data collection methodologies, and data analysis workflows to ensure knowledge sharing and replication of results;
  • Collaborated with stakeholders to understand business needs and requirements, translating them into data-driven solutions and actionable recommendations;
  • Actively participated in team meetings and contributed to discussions on data strategies, project planning, and process improvements.

Technologies: Python, documentation tools.

Engineer & Data Analyst, Develated Ltd.

Duration: Apr 2019 - May 2019

Summary: Software company.

Responsibilities:

  • Proactively monitored data quality, identifying and resolving data discrepancies and anomalies to ensure the integrity of data assets;
  • Actively participated in code reviews, providing constructive feedback to peers and improving overall code quality and maintainability;
  • Contributed to the continuous improvement of development processes and methodologies, identifying areas for optimization and implementing streamlined workflows.

Technologies: Python, Git.

ERP Project Manager, AutoHit

Duration: Mar 2019 - Apr 2019

Summary: AutoHit is a company specializing in automotive software solutions, focusing on vehicle diagnostics and repair management.

Responsibilities:

  • Collaborated closely with stakeholders, including senior management, department heads, and cross-functional teams, to gather requirements, define project scopes, and align project objectives with overall business goals;
  • Conducted thorough project assessments, including risk analysis, resource allocation, and project feasibility studies, to ensure successful project outcomes;
  • Managed project budgets, tracked expenses and resource utilization, and delivered projects within budgetary constraints.

Technologies: Project management tools.

Web Scraping Engineer & Data Analyst, Clippings

Duration: Sept 2017 - Oct 2018

Summary: A digital procurement platform connecting interior designers with over 650 brands and millions of products.

Responsibilities:

  • Analyzed scraped data to identify patterns, trends, and insights, contributing to data-driven decision-making within the organization;
  • Created interactive visualizations and reports using tools like Tableau and Power BI to communicate data findings and provide actionable insights to stakeholders;
  • Conducted data cleansing and preprocessing tasks to ensure data accuracy, consistency, and integrity, leading to improved data quality across internal databases.

Technologies: Python, Tableau, Power BI.

Web Scraping Engineer & Data Analyst, Styloko

Duration: July 2015 - June 2017

Summary: Styloko was a London-based fashion discovery platform aggregating products from major UK retailers. It offered personalized recommendations and price tracking features to enhance the shopping experience.

Responsibilities:

  • Developed web scraping scripts using Python and BeautifulSoup to extract relevant data from various websites, ensuring accurate and reliable data collection for analysis purposes;
  • Collaborated with cross-functional teams, including data scientists and business analysts, to understand data requirements and implement effective web scraping solutions;
  • Stayed up to date with industry trends and best practices in web scraping techniques and data analysis methodologies, actively seeking opportunities to enhance technical skills and knowledge.

Technologies: Python, BeautifulSoup.

JavaScript ExtJS 4 & 6 Engineer, MittagQI

Duration: Oct 2015 - Feb 2016

Summary: MittagQI develops software solutions for the translation industry, including the open-source translation management system translate5. The company provides process automation, machine translation integration, and consulting for language service providers.

Responsibilities:

  • Implemented efficient data binding techniques and utilized RESTful APIs to retrieve and manipulate data from server-side systems, ensuring optimal performance and data integrity;
  • Conducted thorough code reviews and debugging sessions to identify and resolve issues, ensuring the delivery of a clean, maintainable codebase;
  • Actively participated in Agile development, attending daily stand-up meetings and contributing to sprint planning and retrospective sessions.

Technologies: JavaScript, ExtJS, REST APIs.

Engineer, Perennial AG 

Duration: Aug 2014 - June 2015

Summary: Perennial SA is an independent insurance brokerage firm based in French-speaking Switzerland, specializing in corporate insurance, occupational pensions, absence management, and wealth planning.

Responsibilities:

  • Collaborated with the development team at Perennial AG, a leading software solutions provider, on various projects for clients in diverse industries;
  • Developed responsive websites and web applications using HTML5, CSS3, JavaScript, and modern frameworks such as React.js and AngularJS;
  • Utilized version control systems such as Git for efficient code management and collaboration.

Technologies: HTML5, CSS3, JavaScript, React.js, AngularJS, Git.

Web Engineer, ClearWare Ltd

Duration: Mar 2014 - Aug 2014

Summary: ClearWare Ltd is a software development company specializing in custom solutions, AI integration, and system automation across multiple industries.

Responsibilities:

  • Maintained existing websites and applications, monitoring performance metrics and implementing improvements to enhance functionality and user satisfaction;
  • Conducted research and kept abreast of emerging web technologies and trends, applying the latest tools and techniques to continuously improve development processes and deliver innovative solutions;
  • Participated in code reviews, providing constructive feedback to peers to promote code quality, maintainability, and adherence to established coding standards.

Technologies: HTML, CSS, JavaScript.

Senior Web Engineer, Stock Logistic Ltd.

Duration: Feb 2009 - Mar 2014

Summary: Stock Logistic Ltd is a logistics operator providing comprehensive transportation solutions by sea, rail, air, and road.

Responsibilities:

  • Led the development and implementation of responsive web designs, ensuring seamless user experiences across multiple devices and browsers;
  • Mentored junior engineers, providing guidance on coding best practices, architectural design, and troubleshooting techniques;
  • Worked closely with stakeholders to gather feedback, assess user needs, and continuously improve web applications based on usability testing and analytics data.

Technologies: HTML, CSS, JavaScript.

Web Engineer, Homesite

Duration: Sept 2008 - Dec 2008

Summary: Homesite is a digital insurance provider specializing in homeowners and renters insurance with a focus on online policy management.

Responsibilities:

  • Optimized websites for search engines (SEO) and implemented best practices for website performance and accessibility;
  • Resolved technical issues, provided timely support, and performed regular website maintenance tasks;
  • Stayed up to date with emerging web technologies and industry trends, proactively recommending improvements to enhance the overall user experience.

Technologies: HTML, CSS, SEO tools.

Web Engineer, Megalan Network Ltd

Duration: Apr 2008 - May 2008

Summary: Megalan Network Ltd was a Bulgarian telecom company providing internet and digital TV services before being acquired by Mobiltel.

Responsibilities:

  • Utilized version control systems, such as Git, to manage the codebase and facilitate collaboration among team members;
  • Integrated third-party APIs, such as payment gateways and social media platforms, to enhance website functionality and user engagement;
  • Contributed to the continuous improvement of web development processes and standards within the company.

Technologies: Git, API integration.

Web Engineer, Gameloft

Duration: July 2007 - Feb 2008

Summary: Gameloft is a French video game company developing and publishing mobile, console, and PC games worldwide.

Responsibilities:

  • Designed and implemented responsive web designs to ensure optimal user experience across various devices and screen sizes;
  • Implemented SEO best practices to optimize web applications for search engines and improve online visibility;
  • Worked closely with content management systems (CMS) to update and manage website content efficiently.

Technologies: HTML, CSS, CMS.

Web Engineer, Mirchev Ideas

Duration: Dec 2006 - May 2007

Summary: Mirchev Ideas is a software company specializing in e-commerce solutions, including the Summer Cart shopping platform.

Responsibilities:

  • Assisted in optimizing websites for search engines (SEO) and implemented best practices to improve organic rankings and visibility;
  • Actively participated in project planning and scoping activities, providing technical expertise and suggestions for improving user experience and functionality;
  • Stayed up-to-date with emerging web technologies and trends, proactively incorporating new tools and techniques into projects to enhance overall quality and efficiency.

Technologies: SEO tools, HTML, CSS.

Web Engineer, WebGate JSC

Duration: Dec 2005 - June 2006

Summary: WebGate JSC is a Bulgarian software company specializing in mobile apps and digital solutions.

Responsibilities:

  • Conducted thorough testing and debugging to identify and fix any issues or bugs, ensuring high-quality deliverables;
  • Participated in code reviews and provided constructive feedback to peers, promoting code quality and best practices;
  • Actively contributed to project planning and estimation, providing technical expertise and guidance to ensure successful project execution.

DB Manager, Ratola

Duration: June 2005 - Oct 2005

Summary: Ratola Corporation is a Bulgarian automotive company and exclusive distributor for SsangYong and BRABUS brands.

Responsibilities:

  • Planned database upgrades and migrations, ensuring minimal disruption to business operations and seamless transition to new technologies;
  • Maintained comprehensive documentation, including data dictionaries, system diagrams, and operational procedures, to ensure knowledge sharing and facilitate efficient database management;
  • Provided technical guidance and mentorship to junior team members, fostering their professional growth and promoting a culture of continuous learning within the department.

Technologies: SQL, database management tools.

Web Engineer, Mochanin

Duration: Sept 2004 - Feb 2005

Summary: Mochanin is a software development company based in Sofia, Bulgaria, specializing in PHP/MySQL-based frameworks and web applications.

Responsibilities:

  • Utilized version control systems (e.g., Git) to track and manage code changes, facilitating seamless collaboration and enabling efficient troubleshooting;
  • Optimized websites for search engines (SEO) by implementing best practices for meta tags, keyword placement, and content organization;
  • Monitored website analytics using tools like Google Analytics to track user behavior, gather insights, and propose data-driven improvements.

Technologies: Git, Google Analytics, SEO tools.

Education

Sofia University St. Kliment Ohridski, M.D.: Theology/Theological Studies (03/2000)