Lubo Bali - Data Engineer | AI Systems Builder

Chicago, IL • (312)-358-0008 • data@lubobali.com •

See My Portfolio



Lubo Bali – Data Engineer | AI Systems Builder

  • 14 years data experience: 12 in financial operations ($500K+ annual revenue) + 2 in data engineering

  • Endorsed by Zach Wilson (ex-Netflix, Airbnb, Meta): "Best project in the bootcamp, ready to make money"

  • Solo-built LuBot.ai - production AI analytics platform (112K lines, 36 tables, 6 NVIDIA models)



WORK EXPERIENCE


LUBOT.AI                                                                                                                                                                        Jun 2025 – Present

AI/Data Engineer (Founder)                                                                                              

  • Serves real users with personalized AI analytics - built the entire platform solo: 112K lines Python, 248 files, 36

database tables delivering insights that adapt to each user

  • Designed 36-table PostgreSQL schema tracking user behavior patterns - clicks, queries, preferences, interaction

history powering the personalization engine

  • Implemented 18 nightly workers that learn while users sleep - route optimization, user profiling, baseline calculations

make each session smarter than the last

  • Built 9-module Intelligence Engine: anomaly detection, driver analysis, correlation discovery, forecasting,

concentration risk - all with domain-aware context

  • Engineered zero-hallucination pipeline with source citations - every insight traces back to actual data

  • 100% NVIDIA Nemotron stack - 6 models + self-hosted RTX 4090. 4-tier intent routing, 3-tier response system

  • Tech stack: Python, PostgreSQL, FastAPI, Docker, NVIDIA NIM API, FAISS, Redis, Next.js, React


REMAX                                                                                                                                                                       Jan 2024 – Jan 2025

Data Engineer                                                                                               

  • Built Python ETL pipeline processing 10K+ residential property listings from MLS APIs into MySQL - real estate agents needed fast access to market changes

  • Cut 3+ hours of daily manual data work to 15 minutes with automated refreshes and validation checks

  • Created Tableau dashboards tracking property inventory and pricing trends across 20+ neighborhoods - helped 15+ agents spot opportunities faster than competitors

  • Tech stack: Python, MySQL, Tableau, MLS APIs


2828 ARTHINGTON CONDOMINIUM ASSOCIATION Jan 2013 – Jan 2024   Financial Data Manager

  • Tracked $500K+ annual revenue across 50 units for 12 consecutive years - managed financial operations using SQL

    and Excel for lease agreements, vendor contracts, and budget planning

  • Generated monthly/quarterly reports on revenue trends, expense patterns, and budget variances - board used these to make decisions on $100K+ capital improvements

  • Cut month-end close from 3 days to under 2 days by automating reconciliation in Excel - eliminated manual errors from copy-paste workflows

  • Tech stack: PostgreSQL, MySQL, SQL, Excel, Power BI, Tableau, financial reporting



TECHNICAL PROFICIENCIES


Data & Pipelines: PostgreSQL, Neon, Snowflake, SQL, dbt, Databricks, Delta Lake, Airflow, Soda, Apache Iceberg, Python,

TypeScript, FastAPI, REST APIs, Redis

Streaming: Apache Spark, Flink, Kafka AI/ML: NVIDIA Nemotron (Ultra 253B, Nano 8B, Vision 12B), NIM API, AdalFlow,

FAISS, Prophet, Ollama

Data Patterns: Change Data Capture (CDC), SCD, Dimensional Modeling, Growth Accounting

Infrastructure: AWS, Hetzner, Linux, Nginx, Docker, Distroless Containers, Backblaze B2, Git, Tableau

Frontend: Next.js, React, Tailwind CSS, Plotly

Certifications: DataExpert.io Analytics Engineering Excellence (Feb 2026), Data Engineering (Aug 2025), Data

Analytics Accelerator (Jun 2025)



EDUCATION

 

Associate of Applied Science in Computer Systems Expected Jul 2026

Lincoln Land Community College


Data/AI Analytics Engineering Bootcamp Apr 2026

DataExpert.io - Snowflake, dbt, Airflow, Apache Iceberg, Databricks, Delta Lake, Lakehouse architecture


Data Engineering Bootcamp Aug 2025

DataExpert.io - Dimensional modeling, Apache Spark, Flink, Kafka, Airflow, data quality


Data Analytics Bootcamp Jun 2025

Data Career Jumpstart


PROJECTS

     

MERGEAI - 5-Agent AI Data Analyst (Scored 100/100)

  • Upload any CSV, ask questions in plain English - 3 NVIDIA agents collaborate to write SQL, validate results, and

return insights.

  • Tech stack: TypeScript, Next.js 15, NVIDIA NIM API, Neon PostgreSQL, Drizzle ORM, Vercel

  • Live: merge-ai-omega.vercel.app | Demo: youtu.be/Yr0CkXKNF0M | GitHub: github.com/lubobali/mergeAI


AIRFLOW DATA QUALITY PIPELINE

  • Production-grade Airflow DAG with write-audit-publish pattern - catches bad data before it hits production.

  • Tech stack: Airflow, Soda, PostgreSQL, Python | GitHub: github.com/lubobali/airflow-dq-pipeline


ADDITIONAL PROJECTS

View all at: GitHub | LuboBali.com | LuBot.ai

Lubo Bali

Chicago, IL data@lubobali.com See My Portfolio

Data Engineer | AI Systems Builder 

  • 14 years data experience: 12 in financial operations ($500K+ annual revenue) + 2 in data engineering

  • Endorsed by Zach Wilson (ex-Netflix, Airbnb, Meta): "Best project in the bootcamp, ready to make money"

  • Solo-built LuBot.ai - production AI analytics platform (112K lines, 36 tables, 6 NVIDIA models)



WORK EXPERIENCE

   

LUBOT.AI                                         Jun 2025 – Present

AI/Data Engineer (Founder) 

  • Serves real users with personalized AI analytics - built the entire platform solo: 112K lines Python, 248 files, 36

database tables delivering insights that adapt

to each user

  • Designed 36-table PostgreSQL schema tracking user behavior patterns - clicks, queries, preferences, interaction

history powering the personalization engine

  • Implemented 18 nightly workers that learn while users sleep - route optimization, user profiling, baseline calculations

make each session smarter than the last

  • Built 9-module Intelligence Engine: anomaly detection, driver analysis, correlation discovery, forecasting,

concentration risk - all with domain-aware context

  • Engineered zero-hallucination pipeline with source citations - every insight traces back to actual data

  • 100% NVIDIA Nemotron stack - 6 models + self-hosted RTX 4090. 4-tier intent routing, 3-tier response system

  • Tech stack: Python, PostgreSQL, FastAPI, Docker, NVIDIA NIM API, FAISS, Redis, Next.js, React


REMAX Jan 2024 – Jan 2025

Data Engineer   

  • Built Python ETL pipeline processing 10K+ residential property listings from MLS APIs into MySQL - real estate agents needed fast access to market changes

  • Cut 3+ hours of daily manual data work to 15 minutes with automated refreshes and validation checks

  • Created Tableau dashboards tracking property inventory and pricing trends across 20+ neighborhoods - helped 15+ agents spot opportunities faster than competitors

  • Tech stack: Python, MySQL, Tableau, MLS APIs


2828 ARTHINGTON CONDOMINIUM ASSOCIATION  Financial Data Manager Jan 2013 – Jan 2024

  • Tracked $500K+ annual revenue across 50 units for 12 consecutive years - managed financial operations using SQL

    and Excel for lease agreements, vendor contracts, and budget planning

  • Generated monthly/quarterly reports on revenue trends, expense patterns, and budget variances - board used these to make decisions on $100K+ capital improvements

  • Cut month-end close from 3 days to under 2 days by automating reconciliation in Excel - eliminated manual errors from copy-paste workflows

  • Tech stack: PostgreSQL, MySQL, SQL, Excel, Power BI, Tableau, financial reporting


TECHNICAL PROFICIENCIES


Data & Pipelines: PostgreSQL, Neon, Snowflake, SQL, dbt, Databricks, Delta Lake, Airflow, Soda, Apache Iceberg, Python,TypeScript, FastAPI, REST APIs, Redis

Streaming: Apache Spark, Flink, Kafka AI/ML: NVIDIA Nemotron (Ultra 253B, Nano 8B, Vision 12B), NIM API, AdalFlow, FAISS, Prophet, Ollama

Data Patterns: Change Data Capture (CDC), SCD, Dimensional Modeling, Growth Accounting

Infrastructure: AWS, Hetzner, Linux, Nginx, Docker, Distroless Containers, Backblaze B2, Git, Tableau

Frontend: Next.js, React, Tailwind CSS, Plotly

Certifications: DataExpert.io Analytics Engineering Excellence (Feb 2026), Data Engineering (Aug 2025), DataAnalytics Accelerator (Jun 2025)


EDUCATION


Associate of Applied Science in Computer Systems Expected Jul 2026

Lincoln Land Community College


Data/AI Analytics Engineering Bootcamp Apr 2026

DataExpert.io - Snowflake, dbt, Airflow, Apache Iceberg, Databricks, Delta Lake, Lakehouse architecture


Data Engineering Bootcamp Aug 2025

DataExpert.io - Dimensional modeling, Apache Spark, Flink, Kafka, Airflow, data quality


Data Analytics Bootcamp Jun 2025

Data Career Jumpstart




PROJECTS


     

MERGEAI - 5-Agent AI Data Analyst (Scored 100/100)
  • Upload any CSV, ask questions in plain English - 3 NVIDIA agents collaborate to write SQL, validate results, and

return insights.

  • Tech stack: TypeScript, Next.js 15, NVIDIA NIM API, Neon PostgreSQL, Drizzle ORM, Vercel

  • Live: merge-ai-omega.vercel.app | Demo: youtu.be/Yr0CkXKNF0M | GitHub: github.com/lubobali/mergeAI


AIRFLOW DATA QUALITY PIPELINE

  • Production-grade Airflow DAG with write-audit-publish pattern - catches bad data before it hits production.

  • Tech stack: Airflow, Soda, PostgreSQL, Python | GitHub: github.com/lubobali/airflow-dq-pipeline


ADDITIONAL PROJECTS

View all at: GitHub | LuboBali.com | LuBot.ai

Download Resume📄(PDF)



Created by Lubo Bali


© Copyright 2025. All rights reserved Privacy Policy



Created by Lubo Bali

All rights reserved

© Copyright 2025. Privacy Policy