  • How Data Teams Are Validating Pipelines with Schema-on-Read
    Jun 29 2026
    Episode 80 of The Data Business Podcast explores how data teams are using schema-on-read validation to catch pipeline failures before they corrupt downstream analytics. Lucas and Luna discuss a real case at a mid-sized e-commerce company where a misclassified field in a Parquet file caused a $200,000 reporting error. They break down the difference between schema-on-write and schema-on-read, explain how tools like Apache Iceberg and Delta Lake enable late-binding schema enforcement, and walk through the trade-offs: flexibility versus performance. The episode also covers how this approach fits into broader data observability and data contract strategies, with practical advice on when to use schema registries like Confluent Schema Registry versus file-level validation. Listeners learn one concrete technique they can apply to their own pipelines. #SchemaOnRead #DataValidation #ApacheIceberg #DeltaLake #DataObservability #DataContracts #DataPipelines #Parquet #SchemaRegistry #DataQuality #DataEngineering #Analytics #EcommerceData #PipelineReliability #BusinessAndTechnology #FexingoBusiness #BusinessPodcast #TheDataBusinessPodcast Keep every episode free: buymeacoffee.com/fexingo
    12 mins
  • How Data Teams Are Using Data Contracts for Cost Allocation
    Jun 28 2026
    Episode 79 of The Data Business Podcast. Lucas and Luna explore how forward-thinking data teams are using data contracts not just for reliability but for precise cost allocation. They dive into a case at a mid-size fintech that cut its cloud data warehouse bill by 28 percent by attaching consumption tags to contract clauses. Lucas explains the mechanics of 'cost-attributed schemas' and Luna questions whether this creates perverse incentives for data producers. The conversation covers implementation gotchas, the role of open table formats, and why this approach beats traditional chargebacks. They also touch on how the practice is spreading from finance to retail and ad tech. A clear, practical episode for anyone running a data platform or paying a six-figure Snowflake bill. #DataContracts #CostAllocation #CloudDataWarehouse #Fintech #DataEngineering #DataGovernance #Snowflake #DataPlatform #FinOps #DataCosts #Chargeback #OpenTableFormats #DataProducers #DataConsumers #Analytics #BusinessPodcast #FexingoBusiness #DataInfrastructure Keep every episode free: buymeacoffee.com/fexingo
    12 mins
  • Why Data Teams Are Using Contract Testing for Data Pipelines
    Jun 28 2026
    Data contracts are getting a lot of attention as a way to enforce schema and quality guarantees between producers and consumers. But a new practice is emerging: applying contract testing, borrowed from software engineering, to data pipelines. Lucas and Luna explore how companies such as Monzo are using consumer-driven contract tests to catch breaking changes before they hit production. They walk through a concrete example: a finance team's daily revenue report breaks because a source table column gets renamed. With contract testing, the pipeline fails fast during CI instead of surfacing as a 3 a.m. Slack alert. The episode covers the tooling landscape (from open-source Pact to dbt-expectations), the organizational shift required, and why this approach is especially powerful for data mesh architectures. A practical look at how treating data pipelines like distributed services can reduce downtime and rebuild trust. #DataContracts #ContractTesting #DataPipelines #DataQuality #DataEngineering #Monzo #Pact #dbt #DataMesh #CIForData #PipelineTesting #DataObservability #DataGovernance #Business #Technology #FexingoBusiness #BusinessPodcast #DataBusinessPodcast Keep every episode free: buymeacoffee.com/fexingo
    9 mins
  • How Data Teams Are Using Data Clean Rooms for Privacy-Compliant Analytics
    Jun 27 2026
    Episode 77 of The Data Business Podcast explores the rise of data clean rooms—secure environments where companies can join datasets for analytics without exposing raw data. Lucas and Luna dissect a specific case: how a major retailer and a CPG brand used a clean room to measure ad effectiveness without sharing customer-level data. They walk through the architecture, the trade-offs (query performance vs. privacy guarantees), and why clean rooms are becoming essential for compliance with regulations like GDPR and CCPA. Lucas brings numbers from a recent industry report showing a 140% increase in clean room adoption among Fortune 500 companies since 2023. Luna challenges whether clean rooms are a genuine privacy solution or a PR shield. The episode also covers the technical distinction between differential privacy and secure multi-party computation within clean rooms, and why data teams need to rethink their data-sharing contracts. Hosted by Lucas and Luna. #DataCleanRooms #PrivacyCompliance #GDPR #CCPA #DataSharing #Analytics #AdMeasurement #DifferentialPrivacy #SecureMultiPartyComputation #Retail #CPG #Fortune500 #DataContracts #DataArchitecture #PrivacyEngineering #Business #FexingoBusiness #BusinessPodcast Keep every episode free: buymeacoffee.com/fexingo
    9 mins
  • Why Data Teams Are Building Semantic Layers for Business Users
    Jun 27 2026
    In Episode 76 of The Data Business Podcast, Lucas and Luna explore the growing trend of semantic layers — a middle layer between raw data and business tools that lets non-technical users query metrics like 'monthly recurring revenue' without knowing SQL. They examine how companies like Airbnb and Intuit have implemented semantic layers using tools like Looker's LookML and Apache Calcite to reduce data team bottlenecks, improve governance, and speed up decision-making. The episode dives into a real example: how a fintech firm cut reporting turnaround time from two weeks to under an hour by adopting a semantic layer. They also discuss trade-offs like maintenance overhead and the risk of oversimplification. If you're building or running a data-driven organization, this episode offers concrete insights on whether a semantic layer is right for your team. #DataBusiness #SemanticLayer #BusinessIntelligence #DataGovernance #Looker #LookML #ApacheCalcite #EnterpriseData #SelfServiceAnalytics #DataArchitecture #MetricsStore #NoSQL #DecoupledDataStack #DataProduct #BusinessTechnology #AnalyticsEngineering #FexingoBusiness #BusinessPodcast Keep every episode free: buymeacoffee.com/fexingo
    11 mins
  • How Data Teams Are Using Query Forecasting to Control Cloud Costs
    Jun 26 2026
    Episode 75 of The Data Business Podcast dives into the rising practice of query cost forecasting — a technique data teams use to predict and control cloud spending before bills arrive. Lucas and Luna break down how a mid-market fintech company used a simple cost-per-query model to reduce unexpected overages by 40% in Q1 2026. They explore the difference between reactive cost monitoring and proactive forecasting, the role of active metadata in building accurate models, and why CFOs are starting to demand this from data leaders. The episode also touches on open-source tools like DuckDB and the shift toward pre-commit cloud contracts tied to forecasted usage. If your data team is tired of surprise cloud bills, this conversation offers a practical playbook. #QueryForecasting #CloudCostOptimization #DataEngineering #ActiveMetadata #DuckDB #DataInfrastructure #FinOps #CostPerQuery #SQL #CloudSpend #DataObservability #BusinessIntelligence #DataLeadership #CFO #FexingoBusiness #BusinessPodcast #DataProducts #DataMonetization Keep every episode free: buymeacoffee.com/fexingo
    11 mins
  • Why Your Data Team Needs Reverse ETL
    Jun 26 2026
    Episode 74 dives into reverse ETL — the practice of syncing data from a warehouse back into operational tools like CRMs, ad platforms, and customer support systems. Lucas and Luna unpack how companies like Airbnb and Uber use this approach to activate customer data in real time, without building bespoke integrations. They discuss the rise of reverse ETL platforms like Hightouch and Census, and why data teams are shifting from 'data as a report' to 'data as an action.' Specific numbers: how one mid-size e-commerce company reduced customer churn by 12% by syncing churn-risk scores to its support agents' dashboard. If data analytics is about insight, reverse ETL is about making that insight do something. A concrete look at an emerging data infrastructure pattern that's changing how teams operationalize their warehouse. #ReverseETL #DataActivation #DataEngineering #DataInfrastructure #Hightouch #Census #CustomerData #DataDriven #OperationalAnalytics #DataWarehouse #Business #Technology #DataProducts #ChurnReduction #RealTimeData #FexingoBusiness #BusinessPodcast #DataBusiness Keep every episode free: buymeacoffee.com/fexingo
    13 mins
  • Why Data Teams Are Adopting Data Product Contracts
    Jun 25 2026
    In this episode, Lucas and Luna explore the emerging practice of data product contracts: formal agreements between data producers and consumers that specify schema, freshness, quality SLAs, and pricing. They use the example of a mid-sized e-commerce company that reduced data incidents by 40 percent after implementing contracts for its top 20 data products. The hosts discuss how these contracts differ from data contracts (which focus on pipeline-level guarantees), why they are gaining traction as data teams shift toward product thinking, and what happens when a data contract is violated, including automatic rerouting to backup datasets. Luna pushes back on whether contracts add bureaucracy, and Lucas shares how one team tied contract compliance to engineering performance reviews. A concrete look at how data teams are formalizing trust. #DataProductContracts #DataProducts #DataGovernance #DataQuality #DataEngineering #DataManagement #DataContracts #DataSLA #BusinessPodcast #TechnologyPodcast #DataBusiness #FexingoBusiness #DataAnalytics #DataInfrastructure #DataTeam #ProductThinking #DataObservability Keep every episode free: buymeacoffee.com/fexingo
    9 mins