Skip to content
View lukasrozado's full-sized avatar

Organizations

@TransferoNovaIguacu

Block or report lukasrozado

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
lukasrozado/README.md

๐Ÿ‘‹ Hi, I'm Lukas Rozado

Data Analyst & Data Engineering Enthusiast โ€” I build reliable data pipelines, analytical systems, and long-term predictive modeling projects.

stack stack stack stack


๐Ÿš€ About Me

I work with data engineering and analytics, designing pipelines, building structured datasets, and creating analytical models used for decision-making.
For the last 4+ years, Iโ€™ve also been developing a personal predictive modeling system, where I handle everything end-to-end: data ingestion, feature engineering, modeling, evaluation, automation, and monitoring.

I enjoy solving real problems with pragmatic, clean, and reliable data solutions.


๐Ÿ”ฌ What I Work On (Personally)

๐Ÿ“ˆ Long-term Predictive Modeling System (4+ years)

A large personal project focused on applying statistical modeling, simulation, and ML techniques.
Core components include:

  • Automated data ingestion & cleaning
  • Feature engineering pipelines
  • Statistical models + ML ensembles
  • Bootstrap confidence, backtesting & metric monitoring
  • Continuous experiment cycles
  • Full reproducibility + versioning

This project reflects my technical depth, persistence, and ability to maintain a complex system long-term.


๐Ÿ’ผ Professional Experience (Transfero)

I currently work as a Data Analyst, contributing to the companyโ€™s data infrastructure and analytics stack.

Key areas:

  • Resilient ETL pipelines with Python (stream processing + idempotency patterns)
  • Data ingestion from multiple APIs and sources
  • Serverless pipelines with Azure Functions
  • Checkpointing, retries, and performance improvements
  • Data Lake โ†’ Data Warehouse transformations (PostgreSQL)
  • Structured logging and secure workflow (Key Vault, Managed Identity)

My role strengthens my foundation in data reliability, pipeline design, and cloud-based ingestion.


๐Ÿ›  Technical Skills

Languages & Tools

  • Python (Pandas, NumPy, Polars, SQLAlchemy)
  • SQL (PostgreSQL, query optimization, indexing)
  • Azure Functions, Blob Storage, Key Vault
  • Git, GitHub Actions
  • Docker
  • Jupyter, VSCode
  • Basic ML (scikit-learn, statsmodels, PyTorch for experimentation)

Engineering Practices

  • Idempotent ETL design
  • Retry & checkpoint strategies
  • Data quality checks
  • CI pipelines
  • Modular project structures
  • Reproducible experiments

Pinned Loading

  1. operacaofuturoseguro operacaofuturoseguro Public

    Lead capture and life insurance simulation website specialized for public safety agents, developed for Blรก Insurance Broker.

    HTML

  2. lukasrozado.github.io lukasrozado.github.io Public

    HTML