The data generated by a business should be owned by that business, for its own benefit and that of its customers.


  1. Own data warehouse over vendor lock-in
  2. Central data warehouse over silos
  3. Open, transferable data formats over vendor-proprietary ones
  4. Decoupled warehouse, ETL, and business analysis tools over a monolith
  5. Open-source over proprietary


  1. Owning your warehouse doesn’t necessarily mean owning the physical infrastructure. Using AWS, Azure, or another cloud provider is fine as long as you have a plan for how to move out if required.