AI-Powered File Reader Assistant in Python
Uses Retrieval-Augmented Generation (RAG) in Python to extend an LLM into an AI-powered file reader assistant that can answer questions about any PDF.
Search By Similarity with Vector Databases
This project uses Python to build vector databases in three different stores (ChromaDB, Pinecone and Qdrant) and includes an app for similarity search.
Creating Agentic AI Crew to Write Blog Posts with CrewAI
This project utilizes the CrewAI framework to automate the creation of a blog post. It defines a crew of agents, each with specific roles, and tasks that guide the workflow from content planning to final editing and illustration.
From API To Dashboard: AI-Powered Weather ETL Pipeline with Databricks and GPT-4o
This project builds a complete data pipeline in Databricks. The pipeline automates the extraction of real-time weather data, uses AI to generate clothing suggestions, and stores the results in a structured Lakehouse environment.
Flight Prices Scraped, Saved to SQLite and Sent as a WhatsApp Message
Here is my portfolio of projects, where you can see what I am working on and how I approach different kinds of problems.
Data Science Project Lifecycle with CRISP-DM
This project walks through an end-to-end data science project following the CRISP-DM framework.
News Classifier with Multinomial Naive Bayes
A project that classifies news headlines into categories such as weather, politics and sports.
Regression Model Car Pricing App
A complete regression analysis project that trains a Random Forest regression model to estimate car prices in Brazil.
Before and After A/B Testing
An easy-to-follow guide to performing before-versus-after treatment A/B tests in Python.
Text Sentiment Analysis App with Shiny and R
This project uses R to build a Shiny web application that reads a text and returns its word frequencies, a word cloud and a sentiment analysis chart.
Predicting Results of the FIFA® World Cup 2022
I have created a model to predict the results of the FIFA World Cup games.
Movie Recommendation System with FastAPI + Streamlit
This project uses Python to build a full-stack movie recommendation system. The backend is built with FastAPI and the front end with Streamlit, with a click-tracking system and data stored in SQLite.
Project Documentation with MkDocs in Python
Documenting data projects is very important: good documentation helps readers navigate the project, run it as designed and fix any bugs that arise.
Generating Data Dictionary for Excel Files with openpyxl + LLM
This project uses an AI agent to read an Excel file, determine the data types and descriptions of each column, and generate a data dictionary. The data dictionary is then added as comments to the header of the Excel file.
Predicting The 30-Year Fixed Mortgage Rates with Time Series
This project uses the Python packages Darts and LightGBM to decompose, analyze and fit a time series of 30-year fixed mortgage rates in the USA, and to forecast the rates for the first months of 2025.
