Skip to content
View juandelaf1's full-sized avatar

Block or report juandelaf1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
juandelaf1/README.md

Juan de la Fuente Banner

👋 Juan — Data Analyst

LinkedIn GitHub Email Madrid

GitHub followers GitHub stars GitHub repos

Data Analyst · SQL · Python · Power BI · Business & Operations Analytics

Background in business operations, now focused on data. I combine domain knowledge in sales, costs, and margins with analytical rigor in Python, SQL, and BI. I don't just extract value from data — I understand what value looks like because I've operated where it's created.

Every project here produces measurable impact: predictive models that businesses trust, dashboards that inform critical decisions, data pipelines that transform raw inputs into actionable intelligence.


🎯 Core Capabilities

  • Operational Intelligence — Translating business metrics into data strategies that drive real change
  • End-to-End Solutions — From data extraction to delivery: APIs, dashboards, ML models, automated pipelines
  • Domain Expertise — Understanding sales, costs, margins, forecasting because I built the systems that manage them
  • Statistical Rigor — Hypothesis testing, validated models, clean pipelines, transparent methodologies

📌 Featured Projects

🌍 Climate & Sustainability

☀️ SKY CAST

Enterprise Climate Intelligence Platform

  • AEMET OpenData integration, crowdsourcing, smart alerts
  • 158 automated tests with CI/CD pipeline
  • Real-time monitoring, predictive analytics
  • Stack: FastAPI · PostgreSQL · Streamlit · Docker

AI-Powered Life Cycle Assessment

  • ISO 14040/14067 compliant LCA automation
  • LangChain + Gemini for intelligent document processing
  • Generates sustainability seals and QR codes
  • Stack: FastAPI · LangChain · Gemini · FAISS · PostgreSQL

EU Telecom Network Stress Simulation

  • 41,937 rows CNMC regulatory data + 1.8M+ Eurostat macrodata
  • 5/6 hypotheses confirmed on market dynamics
  • Traffic CAGR: +127% vs Revenue CAGR: -0.4% (Scissors Effect)
  • Stack: Python · Pandas · Power BI · DuckDB · Docker

🚚 Logistics & Mobility

Multi-Market Logistics Intelligence

  • 15+ KPIs, 5+ data sources, multi-regional adaptability
  • ML cost prediction model (R² = 0.987)
  • End-to-end ETL pipeline from acquisition to visualization
  • Stack: Python · Pandas · Streamlit · Plotly · PostgreSQL

🚜 EnRuta

Collaborative Rural Logistics Platform

  • 335 matches between producers and transporters
  • 3.5T CO₂ emissions reduction
  • Coverage of 360 municipalities in Spain
  • OSRM + SQLite integration
  • Stack: Python · Dash · OSRM · SQLite

🏕️ Parada-Viva

Rural Commerce Network for Travelers

  • PWA mobile app for vanlifers and caravanists
  • Map-based discovery (Leaflet + OpenStreetMap)
  • QR check-in, magic link authentication
  • Stripe integration for premium features
  • Stack: Next.js · TypeScript · Prisma · Supabase · Stripe

🤖 AI & Machine Learning

Hotel Revenue Management

  • Random Forest + Optuna hyperparameter tuning
  • +30% net revenue improvement vs OTA at 30% commission
  • AI-driven dynamic pricing reducing commission dependency
  • Stack: Python · scikit-learn · Optuna · Streamlit

Digital Attention Economics

  • 16,402 posts from 7 platforms, 4,779 users analyzed
  • NLP sentiment prediction and engagement modeling
  • Global Gini coefficient: 0.974 (extreme concentration)
  • Power law analysis (Pareto principle across digital platforms)
  • Stack: Python · Polars · Transformers · Streamlit · scikit-learn

Intelligent Web Scraper API

  • Selenium + Gemini for dynamic website scraping
  • Structured data extraction (name, price, images, URLs)
  • Handles JavaScript, cookies, anti-bot measures
  • Docker containerization for deployment
  • Stack: FastAPI · Selenium · Gemini · Docker

90s Nostalgia AI Generator

  • Google Gemini LLM for nostalgic storytelling
  • REST API with Swagger documentation
  • SQLite persistence, Docker deployment
  • Stack: FastAPI · Gemini · SQLite · Docker

📊 Analytics & Business Intelligence

10,000+ Spanish Wines Analysis

  • 60+ regions, 10,000+ wines comprehensive EDA
  • Price-rating correlations and regional insights
  • Statistical models identifying quality-value relationships
  • Stack: Python · Pandas · Matplotlib · Seaborn · Jupyter

🧰 Technical Stack

Core Tools

Python Pandas SQL Power BI Streamlit

Infrastructure

FastAPI Docker Git PostgreSQL

ML & AI

scikit-learn LangChain Next.js TypeScript


📊 GitHub Profile


🏆 Key Achievements

  • Predictive Modeling: R² > 0.98 on cost forecasting (DashLogistics)
  • Scale: 16K+ data points analyzed across 7 digital platforms
  • Efficiency: Automation of manual reporting processes reducing time-to-insight by 80%
  • Impact: 3.5T CO₂ savings through optimized logistics matching
  • Validation: 5/6 scientific hypotheses confirmed in telecom analysis
  • Deployment: Production-ready APIs and dashboards serving business needs

💼 Approach

  • Domain-driven — I come from operations. I know what margins, costs, and forecasts mean because I've worked with them directly.
  • Technical — Python, SQL, Power BI, ML. I build pipelines, models, and dashboards end-to-end.
  • Outcome-focused — Every project here has a measurable result: R² scores, CO₂ saved, revenue impact, hypotheses tested.

Let's Connect → All Projects →


Pinned Loading

  1. SkyCast SkyCast Public

    Enterprise Climate Intelligence Platform - FastAPI, Streamlit, PostgreSQL, AEMET OpenData

    Python 1

  2. OptimusPrice OptimusPrice Public

    Optimus Price is an ML-powered revenue management tool for hotels, delivering real-time pricing based on demand patterns and seasonality.

    Jupyter Notebook

  3. DashLogistics DashLogistics Public

    End-to-end ETL pipeline for logistics data, including advanced KPI calculations, analysis, and visualization for actionable insights.

    Jupyter Notebook 1

  4. attention-observatory attention-observatory Public

    Empirical Economics of Digital Attention - Modeling human attention as a finite ecological resource. 7 data sources, NLP, power law analysis, and econometric validation.

    Python 1

  5. DataScope DataScope Public

    Forked from Anasfady/Pearsons_Four

    Multi-Source Salary Analytics

    Jupyter Notebook 1

  6. NetTension NetTension Public

    NetTension - EU Telecom Network Stress Simulation Framework

    Python 1