Projects

AI-Powered File Reader Assistant in Python

Enhancing the capacity of an LLM with Retrieval-Augmented Generation (RAG) in Python to create an AI-Powered File Reader Assistant that can respond to questions about any PDF.

Search By Similarity with Vector Databases

The project uses Python Language to create vector databases in three different DBs: ChromaDB, Pinecone and Qdrant. There is an app to perform search by similarity

Creating Agentic AI Crew to Write Blog Posts with CrewAI

This project utilizes the CrewAI framework to automate the creation of a blog post. It defines a crew of agents, each with specific roles, and tasks that guide the workflow from content planning to final editing and illustration.

From API To Dashboard: AI-Powered Weather ETL Pipeline with Databricks and GPT-4o.

This project builds a complete Data Pipeline implemented in Databricks. The pipeline automates the extraction of real-time weather data, uses AI to generate clothing suggestions, and stores the results in a structured Lakehouse environment.

Flight Prices scraped, saved to SQLite and Send message via WhatsApp

The project uses R Language to create a Shiny web application that reads a text and returns the word frequencies, Word Cloud and a graphic with sentiment analysis.

Optimized Price and Demand Elasticity Modeled with PyGAM in Python.

Analyze the pricing history of 5 products and generate a model to optimize the price and get the maximum revenue.

Here is my portfolio of projects, where you can see what I am working on and how I approach different kinds of problems.

Data Science Project Lifecycle with CRISP-DM

This project goes over an end-to-end Data Science project lifecycle following CRISP-DM lifecycle framework.

News Classifier with Multinomial Naive Bayes

Project to classify news headlines into categories like weather, politics, sports.

Data Engineering Pipeline for Stocks Data

In this project, I created a Data Pipeline from web scraping TeleCom stocks data and working them through the final dashboard.

Regression Model Car Pricing App

This is a complete Regression Analysis Project to train a Random Forest Regression model to estimate prices of cars in Brazil.

Before and After AB Testing

An easy to follow guide to perform Before versus After Treatment AB tests in Python.

Restaurant Dashboard App with Streamlit

Restaurant Dashboard application written and deployed with Streamlit and Python.

Text Sentiment Analysis App with Shiny and R

The project uses R Language to create a Shiny web application that reads a text and returns the word frequencies, Word Cloud and a graphic with sentiment analysis.

Predicting Results of the FIFA® World Cup 2022

I have created a model to predict the results of the FIFA World Cup games.

CSV to Parquet File Converter Windows .exe

Windows application that converts files from CSV to Parquet format or Parquet to CSV.

Movie Recommendation System with FastAPI + Streamlit

The project uses Python Language to create a full-stack movie recommendation system. The backend is built with FastAPI, while the front end piece is coded with Streamlit, with click tracking system and data stored in SQLite.

Project Documentation with MkDocs in Python

Documenting data projects is very important, as it gives the power to navigate the project, make it work as designed and solve eventual bugs.

Generating Data Dictionary for Excel Files with openpyxl + LLM

This project uses an AI agent to read an Excel file, determine the data types and descriptions of each column, and generate a data dictionary. The data dictionary is then added as comments to the header of the Excel file.

Predicting The 30-Year Fixed Mortgage Rates with Time Series

This project uses Darts and LightGBM packages from Python to decompose, analyze, fit and predict the first months of 2025 of the 30-year Fixed Mortgage rates in the USA.

Project TBD

The project  is To Be Defined…

Project TBD

The project  is To Be Defined…