Gustavo R Santos
I solve business challenges through the power of data.
BIO
I’m a passionate Data Scientist who thrives on solving complex problems through data analysis, visualization, and machine learning. I excel in delivering agile, impactful solutions—from statistical analysis to building advanced models—using Python, R, and SQL to turn raw data into strategic insights.
As a published author with Packt and an instructor on Udemy, I love sharing knowledge, evidenced by my blog attracting 20k+ views monthly. Whether I am exploring datasets or helping leadership make data-driven decisions, I bring curiosity, creativity, and a hands-on approach to every project. My goal? To transform data into value and inspire others along the way.
PROFESSIONAL EXPERIENCE
Food Lion, Salisbury, NC – USA
November, 2020 – August, 2024
Data Scientist
As part of the Corporate Strategy team, I performed analysis and data wrangling leveraging PySpark and SQL to improve data quality, helping to boost e-commerce model precision by 20%.
Applied statistical analysis and optimization algorithms to solve business challenges, including developing a checkout lane recommendation algorithm that cut labor costs by 33%.
Built machine learning models in Python and R, delivering actionable insights via Power BI dashboards, such as HR incident tracking. Utilized advanced techniques like clustering and NLP to enhance business outcomes, including customer segmentation, sentiment analysis and store traffic predictions with <2% MAPE.
IBM, Brazil
July, 2012 – July, 2019
Data Analyst
Analyzed projects financial data using SQL and Excel to generate insights for decision-making. Developed KPI reports, forecast tracking tools, and dashboards, improving executive processes. Implemented 6+ semi-automated reports, cutting lead time by 70%, reducing input errors by 99%, and decreasing overtime by 50% through process improvements.
IBM, Brazil
March, 2010 – July, 2012
Billing Lead
Led a 3-member team managing billing processes, ensuring accurate and timely reporting. Redesigned workflows with VBA, cutting lead time by 40% and automating Excel tasks, significantly reducing manual effort. Streamlined data workflows to achieve 99% billing accuracy, enhancing efficiency and client satisfaction.
CORE COMPETENCIES
Python | R Language | SQL | Databricks | Spark | Power BI | Excel
EDUCATION BACKGROUND
2023 – USP, Brazil – MBA Data Science and Analytics – Completed GPA 98%.
2020, Data Science Academy — Completed GPA 90%
Online program of 6 courses including machine learning and statistics, totaling 432 hours.
2020, MIT, Online — Extension – Completed GPA 98%
Professional certificate. Data Science and Big Data Analytics: Making Data-Driven Decisions.
2006, FGV, Brazil — MBA Marketing
PUBLICATIONS
2023
Data Wrangling with R
A book about modern techniques to load, clean, and transform data using the R language.
2023
Mastering Data Wrangling with PySpark in Databricks
Complete course about how to manipulate, clean and transform Big Data using PySpark in Databricks.