In this specially-extended episode just before ODTUG KScope'18, Mark Rittman is joined by Matt Yorke from Qubix to talk about Oracle Essbase Cloud, Oracle Analytics Cloud and the business of Oracle Cloud analytics consulting.
Mark Rittman is joined by ThoughtSpot's Chief Data Evangelist Doug Bordonaro to talk about the value of data, issues around trust and consent raised by the EU's new GDPR regulations, and how ThoughtSpot are applying ideas from search engines combined with artificial intelligence smarts to surface insights and drive real value for business users from their analytics investment.
Mark Rittman is joined by returning Special Guest Mark Grover to talk about his move from Cloudera and product engineering to a product manager role at Lyft; analytics use-cases in the ride-sharing industry; and the move from conversations about ETL tools, technology and engines to templates, paradigms and developer productivity.
- Mark Grover LinkedIn Profile and Github Profile
- "Hadoop Application Architectures"
- "Drill to Detail Ep. 7 'Apache Spark and Hadoop Application Architectures'"
- Lyft Engineering Blog
- "Software Engineer to Product Manager" blog by Gwen Shapira
- "Introduction to the Oracle Data Integrator Topology" from the Oracle Data Integrator docs site
- Apache Airflow and Amazon Kinesis homepages
- "Experimentation in a Ridesharing Marketplace" by Nicholas Chamandy, Head of Data Science at Lyft
- "How Uber Eats Works with Restaurants"
- "Deliveroo has built a bunch of tiny kitchens to feed more hungry Londoners" - Wired.co.uk
Drill to Detail returns after the New Year break with Special Guest Julian Hyde from Hortonworks to talk about bitmap indexes and CASE tools, Mondrian and open-source OLAP analysis, and Apache Calcite's mission to bring sanity, cost-based optimisers and support for OLAP workloads to today's dis-aggregated, distributed new-world database engines.
- Oracle Designer page on Oracle.com
- Bitmap Index page on Wikipedia
- Mondrian project page on Github
- Mondrian OLAP Server page on Wikipedia
- MultiDimensional eXpressions (MDX) page on Wikipedia
- Julian Hyde blog
- Apache Calcite project homepage
- Apache Calcite Introduction and Overview deck
- Streaming SQL presentation at Apex Big Data World 2017, Mountain View, California
Mark is joined in this episode of Drill to Detail by Wes McKinney to talk about the origins of the Python Pandas open-source package for data analysis; his subsequent work as a contributor to the Kudu (incubating) and Parquet projects within the Apache Software Foundation; and Arrow, an in-memory data structure specification for use by engineers building data systems and the de-facto standard for columnar in-memory processing and interchange.
- Python Data Analysis Library
- "Ibis on Impala: Python at Scale for Data Science"
- Drill To Detail Ep.3 'Apache Kudu And Cloudera's Analytic Platform' With Special Guest Mike Percy
- Apache Arrow homepage
- "Apache Arrow and the '10 Things I Hate About pandas'"
- "Apache Arrow vs. Parquet and ORC: Do we really need a third Apache project for columnar data representation?"
- "Some comments to Daniel Abadi's blog about Apache Arrow"
- Wes McKinney homepage
Mark is joined in this episode by Avi Zloof from Evaluex to talk about the new world of elastically-provisioned cloud-hosted analytic databases such as Google BigQuery and Amazon Athena, how their pricing model and vendor strategy differ from those of the traditional database vendors, and how machine learning can be used to automate performance tuning and optimize workloads in this new world of large-scale distributed query and storage.
Mark is joined in this episode by Google Cloud Platform Developer Advocate Felipe Hoffa, talking about getting started as a developer using Google BigQuery along with Google Cloud Dataflow, Google Cloud Dataprep and Google Cloud Platform's machine learning APIs.
Mark Rittman is joined by Industry Analyst Mark Madsen to talk about marketing analytics and the rise of the omni-channel consumer, the use of AI in analytics and personalization and what this all means for brands, for advertisers and for marketers.
Mark Rittman is joined by Donald Farmer to talk about his work at Microsoft on SQL Server Analysis Services and Integration Services, why he moved to Qlik and the challenges of evolving a BI product strategy from focusing on desktops to focusing on the enterprise, and some advice for customers, software vendors and partners working with data and analytics tools.
In this episode Mark is joined by Tristan Handy from Fishtown Analytics to talk about building-out analytics functions in high-growth startups, and three related blog posts he wrote on this topic.
Mark Rittman is joined by Gwen Shapira from Confluent to talk about Apache Kafka, streaming data integration and how it differs from batch-based, GUI-developed ETL development, the problem with architects, exactly-once processing and how data governance is coming to Kafka development with Confluent's new schema registry server.
Mark Rittman is joined by Maxime Beauchemin to talk about analytics and data integration at Airbnb, the Apache Airflow and Superset open-source projects he helped launch and now works with day-to-day at Airbnb, and his recent Medium article on "The Rise of the Data Engineer".
- "The Rise of the Data Engineer" blog by Maxime Beauchemin
- Apache Airflow
- Airbnb Superset
- "Engineers Shouldn’t Write ETL: A Guide to Building a High Functioning Data Science Department" blog by Jeff Magnusson
Mark Rittman is joined by Timo Elliott, originally of Business Objects and now Innovation Evangelist for SAP, to talk about the origins of self-service BI with Business Objects' innovative "Universe" and the role analytics now plays within SAP; why analytics is the most important function within your organization and why the vast majority of analytics is still reporting (which isn't so bad); and the role AI and other innovations will play in analytics in the future.
Mark Rittman is joined by Daniel Mintz from Looker to talk about BI and analytics on Google BigQuery, data modelling on the new generation of cloud-based distributed-data warehousing platforms, and Looker's re-introduction of semantic models to big data analytics developers.
Mark Rittman is joined by Independent Consultant Chris Webb to talk about MDX & DAX, MSAS and SQL Server, and the fall ... and rise of Microsoft BI.
Mark Rittman is joined by Gartner analyst and ex-Oracle Database Cloud Service PM Rick Greenwald to talk about IT’s continuing (and essential!) role in corporate BI&DW deployments and the debate around Mode1 vs. Mode2 Analytics, how we got here, and the future of data warehousing database platforms as we move into the cloud.
Mark Rittman is joined by Gluent's Tanel Poder to talk about Hadoop, Gluent Data Platform, the coming of the hybrid world and how Hadoop will evolve as it moves into the cloud.
Mark Rittman is joined by Graham Spicer in this UKOUG Tech'16 Special to talk about BI past, present and future including the history of Oracle OLAP, running and owning a consulting business over 30 years of changes in the industry, what's hot on the market today, and what skills and techniques are still relevant even as we adopt big data products and look to new methods of analysis.
Mark Rittman is joined once more by Stewart Bryson, talking about Oracle's recent reboot of its cloud big data platform at Oracle OpenWorld 2016, thoughts on DataFlowML and comparisons with Google's Cloud DataFlow and Amazon Kinesis, and data storytelling with Oracle Data Visualisation Desktop 2.0.
Mark Rittman is joined by Jen Underwood to discuss the aftermath of the Gartner BI&A Magic Quadrant 2016 and the rise of self-service, Mode-2 analytics; innovation in predictive analytics and data preparation tools, and how the big data cloud vendors are differentiating themselves (or not).