Lubo Bali - Data Engineer | AI Systems Builder
Chicago, IL • (312)-358-0008 • data@lubobali.com
14 years data experience: 12 in financial operations ($500K+ annual revenue) + 2 in data engineering
Endorsed by Zach Wilson (ex-Netflix, Airbnb, Meta): "Best project in the bootcamp, ready to make money"
Solo-built LuBot.ai - production AI analytics platform (112K lines, 36 tables, 6 NVIDIA models)
WORK EXPERIENCE
LUBOT.AI Jun 2025 – Present
AI/Data Engineer (Founder)
Serves real users with personalized AI analytics - built the entire platform solo: 112K lines of Python, 248 files, and 36 database tables delivering insights that adapt to each user
Designed 36-table PostgreSQL schema that tracks user behavior patterns (clicks, queries, preferences, interaction history) and powers the personalization engine
Implemented 18 nightly workers that learn while users sleep - route optimization, user profiling, and baseline calculations that make each session smarter than the last
Built 9-module Intelligence Engine covering anomaly detection, driver analysis, correlation discovery, forecasting, and concentration risk - all with domain-aware context
Engineered zero-hallucination pipeline with source citations - every insight traces back to actual data
100% NVIDIA Nemotron stack - 6 models + self-hosted RTX 4090. 4-tier intent routing, 3-tier response system
Tech stack: Python, PostgreSQL, FastAPI, Docker, NVIDIA NIM API, FAISS, Redis, Next.js, React
REMAX Jan 2024 – Jan 2025
Data Engineer
Built Python ETL pipeline processing 10K+ residential property listings from MLS APIs into MySQL - real estate agents needed fast access to market changes
Cut 3+ hours of daily manual data work to 15 minutes with automated refreshes and validation checks
Created Tableau dashboards tracking property inventory and pricing trends across 20+ neighborhoods - helped 15+ agents spot opportunities faster than competitors
Tech stack: Python, MySQL, Tableau, MLS APIs
2828 ARTHINGTON CONDOMINIUM ASSOCIATION Jan 2013 – Jan 2024
Financial Data Manager
Tracked $500K+ annual revenue across 50 units for 12 consecutive years - managed financial operations using SQL and Excel for lease agreements, vendor contracts, and budget planning
Generated monthly/quarterly reports on revenue trends, expense patterns, and budget variances - board used these to make decisions on $100K+ capital improvements
Cut month-end close from 3 days to under 2 days by automating reconciliation in Excel - eliminated manual errors from copy-paste workflows
Tech stack: PostgreSQL, MySQL, SQL, Excel, Power BI, Tableau, financial reporting
TECHNICAL PROFICIENCIES
Data & Pipelines: PostgreSQL, Neon, Snowflake, SQL, dbt, Databricks, Delta Lake, Airflow, Soda, Apache Iceberg, Python, TypeScript, FastAPI, REST APIs, Redis
Streaming: Apache Spark, Flink, Kafka
AI/ML: NVIDIA Nemotron (Ultra 253B, Nano 8B, Vision 12B), NIM API, AdalFlow, FAISS, Prophet, Ollama
Data Patterns: Change Data Capture (CDC), SCD, Dimensional Modeling, Growth Accounting
Infrastructure: AWS, Hetzner, Linux, Nginx, Docker, Distroless Containers, Backblaze B2, Git, Tableau
Frontend: Next.js, React, Tailwind CSS, Plotly
Certifications: DataExpert.io Analytics Engineering Excellence (Feb 2026), Data Engineering (Aug 2025), Data Analytics Accelerator (Jun 2025)
EDUCATION
Associate of Applied Science in Computer Systems Expected Jul 2026
Lincoln Land Community College
Data/AI Analytics Engineering Bootcamp Apr 2026
DataExpert.io - Snowflake, dbt, Airflow, Apache Iceberg, Databricks, Delta Lake, Lakehouse architecture
Data Engineering Bootcamp Aug 2025
DataExpert.io - Dimensional modeling, Apache Spark, Flink, Kafka, Airflow, data quality
Data Analytics Bootcamp Jun 2025
Data Career Jumpstart
PROJECTS
MERGEAI - 5-Agent AI Data Analyst (Scored 100/100)
Upload any CSV, ask questions in plain English - 3 NVIDIA agents collaborate to write SQL, validate results, and return insights.
Tech stack: TypeScript, Next.js 15, NVIDIA NIM API, Neon PostgreSQL, Drizzle ORM, Vercel
Live: merge-ai-omega.vercel.app | Demo: youtu.be/Yr0CkXKNF0M | GitHub: github.com/lubobali/mergeAI
AIRFLOW DATA QUALITY PIPELINE
Production-grade Airflow DAG with write-audit-publish pattern - catches bad data before it hits production.
Tech stack: Airflow, Soda, PostgreSQL, Python | GitHub: github.com/lubobali/airflow-dq-pipeline
ADDITIONAL PROJECTS
View all at: GitHub | LuboBali.com | LuBot.ai


