Yo! My name is Rafael Barbosa, a statistician graduated from the Federal University of Pará (UFPA). I currently work as a Data Bricklayer at a growing company.
Previously, I worked at:
- MarketData as a Data Scientist
- Juntos Somos Mais as a Data Analyst
- Statistics Intern at Secretaria Estadual da Fazenda - PA (SEFA-PA)
If you’d like to see some of the projects I’ve worked on using exploratory data analysis, visualization, and statistical modeling/machine learning check it out:
- Customer churn prediction (EN)
- Multiple time series forecast (PT-BR)
- Tutorial about Polars in Python (PT-BR)
- Roadmap in Data Science
- Back order prediction (PT-BR)
- Maps in R (PT-BR)
- The seconde one - House prices prediction (PT-BR)
- The only one - Titanic dataset classification (PT-BR)
- Twitter scrapping for #BelemAlagada (PT-BR)
- COVID Analysis (PT-BR)
- Telegram scrapping R-Brasil group (PT-BR)
At some point in my life I have worked with:
- Data Analysis: Excel, R/Rstudio (Tidyverse), Python (Pandas, Dask, Polars), Databricks (PySpark), Minitab and SPSS
- Machine Learning: R/Rstudio (glm, forecast, tidymodels), Python (sklearn, keras, hierarchical_forecast), MLFLow and Orange
- Deep Learning: Keras and TensorFlow
- Artificial Inteligence: LangChain, HuggingFace and OpenAI API
- Database: MySQL, PostgreSQL, Google BigQuery, SQLite, HiveSQL, PrestoSQL, SparkSQL and DBT
- Data visualization: Excel, R/Rstudio (ggplot2, plotly, Rmarkdown, shiny), Python (matplotlib, seaborn, plotly, plotnine, bokeh, streamlit), PowerBI, Tableau and DataStudio/LookerStudio
- Code versioning: Git (Github, Gitlab)
- Orchestration: Airflow
- Cloud: AWS, Azure and GCP