Mark Rittman is joined in this Looker JOIN 2018 Special by long-term friends of the show Tristan Handy from Fishtown Analytics, and Stewart Bryson from Red Pill Analytics to talk about dbt and enabling data engineering for data analysts; the state of modern data analytics consulting today, and what we’re looking forward to hearing about at next week’s Looker JOIN 2018 conference in San Francisco, CA.
Mark Rittman is joined in this episode by Jonathan Palmer from King Games to talk about the role of analytics in the development of Candy Crush Saga and other King games, their use of Looker along with Google BigQuery and Exasol to provide analytics capabilities to their game designers and product owners and his approach to doing all of this in a fast-moving, technology-driven internet business.
Mark Rittman is joined by Neil Barton, Chief Technology Officer at WhereScape to talk about metadata-driven data warehouse design, automating the build and management of data warehouse infrastructure and the thinking behind his company's WhereScape Red and Wherescape 3D tools.
In this specially-extended episode just before ODTUG KScope'18, Mark Rittman is joined by Matt Yorke from Qubix to talk about Oracle Essbase Cloud, Oracle Analytics Cloud and the business of Oracle Cloud analytics consulting
Mark Rittman is joined by Yali Sassoon from Snowplow to talk about data pipelines and Hadoop in the cloud; how web analytics evolved from counting pageviews to today's event-level analysis of consumer behavoir across all digital channels; why digital analytics is hard but interesting; and Snowplow's approach to building a successful hybrid open-source/commercial software business that competes successfully with megavendors such as Google and Adobe.
Snowplow Insights commercial hosted service details
Mark Rittman is joined in this episode by Greg Michaelson from DataRobot, talking about the benefits of automating the discovery and automation of analytics and machine learning in financial services and other industries.
Mark Rittman is joined by ThoughtSpot's Chief Data Evangelist Doug Bordonaro to talk about the value of data, issues around trust and consent raised by the EU's new GDPR regulations, and how ThoughtSpot are applying ideas from search engines combined with artificial intelligence smarts to surface insights and drive real value for business users from their analytics investment
Mark Rittman is joined by returning Special Guest Mark Grover to talk about his move from Cloudera and product engineering to a product manager role at Lyft; analytics use-cases in the ride-sharing industry; and the move from conversations about ETL tools, technology and engines to templates, paradigms and developer productivity.
- Mark Grover LinkedIn Profile and Github Profile
- "Hadoop Application Architectures"
- "Drill to Detail Ep. 7 'Apache Spark and Hadoop Application Architectures'
- Lyft Engineering Blog
- "Software Engineer to Product Manager" blog by Gwen Shapira
- "Introduction to the Oracle Data Integrator Topology" from the Oracle Data Integrator docs site
- Apache Airflow and Amazon Kinesis homepages
- "Experimentation in a Ridesharing Marketplace" by Nicholas Chamandy, Head of Data Science at Lyft
- "How Uber Eats Works with Restaurants"
- "Deliveroo has built a bunch of tiny kitchens to feed more hungry Londoners" - Wired.co.uk
Mark Rittman is joined by Special Guest Fangjin Yang to talk about the history of Druid, a high-performance, column-oriented, distributed data store originally developed by the team at Metamarkets to provide fast ad-hoc access to large amounts of event-level marketing data, and his work at Imply to commercialise Druid and build a suite of supporting query and data management tools.
Mark Rittman is joined in this 50th Episode Special by our original guest on the first episode of Drill to Detail, Stewart Bryson, to talk about developing agile BI applications using FiveTran, SnowflakeDB and Looker and his recent work developing a BI solution for Google Play Marketing using Google Data Studio and Google Cloud Platform. We're also joined later in the show by Alex Gorbachev from Pythian, our mystery guest who Stewart then interviews flawlessly armed only with a set of questions given to him as the guest was unveiled ... though be sure to listen past the final closing music for the bonus out-takes.
Mark Rittman is joined by Will Davis from Trifacta to talk about the public beta of Google Cloud Dataprep, Trifacta's data wrangling platform and topics including metadata management, data quality and data management for big data and cloud data sources.
- Google Cloud Dataprep on Google Cloud Platform
- "Google Cloud Dataprep: Spreadsheet-Style Data Wrangling Powered by Google Cloud Dataflow"
- "A New Cloud-Based Data Prep Solution from Google & Trifacta"
- Trifacta website
- "A Breakthrough Approach to Exploring and Preparing Data"
- Trifacta platform architecture
- "Garbage In, Garbage Out: Why Data Quality Matters"
- "How to Put an Effective Metadata Strategy in Place"
Drill to Detail returns after the New Year break with Special Guest Julian Hyde from Hortonworks to talk about bitmap indexes and CASE tools, Mondrian and open-source OLAP analysis, and Apache Calcite's mission to bring sanity, cost-based optimisers and support for OLAP workloads to today's dis-aggregated, distributed new-world database engines.
- Oracle Designer page on Oracle.com
- Bitmap Index page on Wikipedia
- Mondrian project page on Github
- Mondrian OLAP Server page on Wikipedia
- MultiDimensional eXpressions (MDX) page on Wikipedia
- Julian Hyde blog
- Apache Calcite project homepage
- Apache Calcite Introduction and Overview deck
- Streaming SQL presentation at Apex Big Data World 2017, Mountain View, California
Mark is joined by long-term industry veteran and friend Christian Berg to talk about surviving fifteen years as a contractor in analytics industry, changes he's seen in the market and in how project are approached, the value in getting involved in the community, and in a specially extended Christmas and New Year edition we look back at what was topical in 2017 and what are Christian's predictions for 2018 ... and appoint Christian as Head of our Best Practices Found on the Internet.
Mark Rittman is joined in this episode of Drill to Detail by Dr. Carsten Bange from BARC to talk about findings from the recently completed BI Survey 17 including the continuing move to modern BI platforms and self-service desktop tools, analytics adoption trends and the increasing incorporation of BI functionality within business applications, the surprising topicality of master data management and data governance ... and whatever happened to Nigel Pendse and his legendary OLAP Report?
- The BI Survey 17: The World’s Largest Annual Survey of BI Users
- Master Data and Data Quality Management Now the #1 Trend in BI
- BI Trend Monitor 2018 Infographic: The Evolution of Trends
- The Business Intelligence Industry Continues Its Ongoing Empowerment of Business Users
- The OLAP Report: The origins of today’s OLAP products (c. 2005, from the Internet Archive)
Mark Rittman is joined in this episode by returning special guest Jen Underwood to talk about what's new and innovative in the BI and analytics industry right now, and how AI and machine learning are this year's data discovery and data visualization.
- "Between The Lines At Tableau Conference" - Jen Underwood.com blog
- "Transform The Business With Automated Embedded Artificial Intelligence" - JenUnderwood.com blog
- "Moving From Bi To Machine Learning With Automation" - JenUnderwood.com blog
- "How Smart Data Discovery Will Radically Transform Analytics" - Tellius Webinar with Jen Underwood
- YellowFin BI - homepage
- Paxata - homepage
- "Drill To Detail Ep.8 'Self-Service BI, Data Prep & Big Data Vendor Strategy' With Special Guest Jen Underwood"
Mark is joined in this episode of Drill to Detail by Wes McKinney, to talk about the origins of the Python Pandas open-source package for data analysis and his subsequent work as a contributor to the Kudu (incubating) and Parquet projects within the Apache Software Foundation and Arrow, an in-memory data structure specification for use by engineers building data systems and the de-facto standard for columnar in-memory processing and interchange.
- Python Data Analysis Library
- "Ibis on Impala: Python at Scale for Data Science"
- Drill To Detail Ep.3 'Apache Kudu And Cloudera's Analytic Platform' With Special Guest Mike Percy
- Apache Arrow homepage
- "Apache Arrow and the "10 Things I Hate About pandas"
- "Apache Arrow vs. Parquet and ORC: Do we really need a third Apache project for columnar data representation?"
- "Some comments to Daniel Abadi's blog about Apache Arrow"
- Wes McKinney homepage
Mark is joined by Mike Durran from the Oracle Analytics Product Management team in this UKOUG Tech’17 special to talk about his route into product management via the Oracle Discoverer BI tool, Oracle’s latest product in this space Oracle Data Visualization Desktop 4 and its new features, and Mike’s upcoming sessions at the UK Oracle User Group’s Tech’17 event next week in Birmingham, UK.
Mark is joined in this episode by Avi Zloof from Evaluex to talk about the new world of elastically-provisioned cloud-hosted analytic databases such as Google BigQuery and Amazon Athena, how their pricing model and vendor strategy differs from the traditional database vendors, and how machine learning can be used to automate performance tuning and optimize workloads in this new world of large-scale distributed query and storage.
Mark is joined in this episode by Google Cloud Platform Developer Advocate Felipe Hoffa, talking about getting started as a developer using Google BigQuery along with Google Cloud Dataflow, Google Cloud Dataprep and Google Cloud Platform's machine learning APIs.
Mark Rittman is joined in this episode by Taylor Brown from Fivetran to talk about middleware for SaaS data, their focus on integrations with SaaS vendors and how this differentiates their offering, his thoughts on packaged analytic applications announced at the recent Looker Join conference ... and where the name "Fivetran" came from.