Projects

Here is my portfolio of projects, where you can see what I am working on and how I approach different kinds of problems.

Data Science Project Lifecycle with CRISP-DM

This project walks through an end-to-end data science project lifecycle following the CRISP-DM framework.

News Classifier with Multinomial Naive Bayes

A project that classifies news headlines into categories such as weather, politics, and sports.
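As a rough illustration of the technique named in the title, here is a minimal from-scratch sketch of a multinomial Naive Bayes classifier with Laplace smoothing. It is not the project's code: the training headlines, the function names `train_nb` and `predict_nb`, and the whitespace tokenizer are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict

# Tiny made-up training set (hypothetical, not the project's data).
TRAIN = [
    ("heavy rain and storms expected this weekend", "weather"),
    ("sunny skies and warm temperatures ahead", "weather"),
    ("senate votes on new budget bill", "politics"),
    ("president signs election reform law", "politics"),
    ("home team wins the championship final", "sports"),
    ("striker scores twice in cup match", "sports"),
]


def train_nb(samples):
    """Count words per class and documents per class."""
    word_counts = defaultdict(Counter)
    class_counts = Counter()
    for text, label in samples:
        class_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, class_counts, vocab


def predict_nb(text, word_counts, class_counts, vocab, alpha=1.0):
    """Return the class with the highest log-posterior (Laplace smoothing)."""
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in class_counts:
        # Log prior: share of training documents in this class.
        score = math.log(class_counts[label] / total_docs)
        denom = sum(word_counts[label].values()) + alpha * len(vocab)
        for word in text.lower().split():
            if word in vocab:  # ignore words never seen in training
                score += math.log((word_counts[label][word] + alpha) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train_nb(TRAIN)
print(predict_nb("storms and rain this weekend", *model))
```

Working in log space avoids numeric underflow when many small word probabilities are multiplied together.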

Data Engineering Pipeline for Stocks Data

In this project, I built a data pipeline that scrapes telecom stock data from the web and carries it through to a final dashboard.

Regression Model Car Pricing App

This is a complete regression analysis project that trains a Random Forest regression model to estimate car prices in Brazil.

Before and After AB Testing

An easy-to-follow guide to performing Before versus After Treatment AB tests in Python.
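One common way to compare the same units before and after a treatment is a paired t-test. The sketch below assumes that approach, which may differ from the method used in the guide. The sample values and the helper name `paired_t_statistic` are hypothetical.

```python
import math
import statistics

# Hypothetical metric for the same 8 units before and after a change.
before = [12.0, 15.0, 11.0, 14.0, 13.0, 16.0, 12.0, 15.0]
after = [14.0, 17.0, 12.0, 15.0, 15.0, 18.0, 13.0, 16.0]


def paired_t_statistic(before, after):
    """Return (t statistic, degrees of freedom) for paired samples."""
    if len(before) != len(after):
        raise ValueError("paired samples must have the same length")
    # Work on per-unit differences, which is what makes the test "paired".
    diffs = [a - b for a, b in zip(after, before)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation
    t_stat = mean_diff / (sd_diff / math.sqrt(n))
    return t_stat, n - 1


t_stat, dof = paired_t_statistic(before, after)
print(f"t = {t_stat:.3f}, degrees of freedom = {dof}")
```

The t statistic would then be compared against a t distribution with the reported degrees of freedom to obtain a p-value.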

Restaurant Dashboard App with Streamlit

Restaurant Dashboard application written and deployed with Streamlit and Python.

Text Sentiment Analysis App with Shiny and R

This project uses R to create a Shiny web application that reads a text and returns its word frequencies, a word cloud, and a sentiment analysis chart.

Predicting Results of the FIFA® World Cup 2022

I have created a model to predict the results of the FIFA World Cup games.

CSV to Parquet File Converter Windows .exe

Windows application that converts files from CSV to Parquet format or Parquet to CSV.

Flight Prices Scraped, Saved to SQLite and Sent via WhatsApp

This project scrapes flight prices, saves them to a SQLite database, and sends a message via WhatsApp.

Predicting The 30-Year Fixed Mortgage Rates with Time Series

This project uses the Darts and LightGBM Python packages to decompose, analyze, and model 30-year fixed mortgage rates in the USA, then forecast them for the first months of 2025.

Optimized Price and Demand Elasticity Modeled with PyGAM in Python

This project analyzes the pricing history of five products and builds a model to find the price that maximizes revenue.

Movie Recommendation System with FastAPI + Streamlit

This project uses Python to build a full-stack movie recommendation system. The backend is built with FastAPI, and the front end is coded with Streamlit, with a click-tracking system and data stored in SQLite.

Project Documentation with MkDocs in Python

Documenting data projects is very important: it makes the project easier to navigate, helps it work as designed, and helps solve any bugs that come up.

AI-Powered File Reader Assistant in Python

Enhancing the capacity of an LLM with Retrieval-Augmented Generation (RAG) in Python to create an AI-Powered File Reader Assistant that can respond to questions about any PDF.

Search By Similarity with Vector Databases

This project uses Python to create vector databases in three different systems: ChromaDB, Pinecone, and Qdrant. It includes an app for performing similarity search.
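To show the idea behind similarity search without depending on any particular database, here is a minimal brute-force sketch using cosine similarity. It does not use the ChromaDB, Pinecone, or Qdrant APIs. The three-dimensional vectors, the document ids, and the `search` helper are hypothetical; real embeddings come from an embedding model and have many more dimensions.

```python
import math

# Hypothetical 3-dimensional embeddings keyed by document id.
DOCS = {
    "doc_a": [0.9, 0.1, 0.0],
    "doc_b": [0.1, 0.9, 0.0],
    "doc_c": [0.7, 0.3, 0.1],
}


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def search(query, docs, top_k=2):
    """Return the ids of the top_k documents most similar to the query."""
    scored = [(cosine_similarity(query, vec), doc_id) for doc_id, vec in docs.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [doc_id for _, doc_id in scored[:top_k]]


print(search([1.0, 0.0, 0.0], DOCS))
```

This version compares the query against every stored vector, which is fine for small collections. Vector databases typically rely on approximate nearest-neighbor indexes to keep the same kind of query fast at scale.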