VLDB

Reading Parallel Execution Plans With Bloom Pruning And Composite Partitioning

October 12, 2010
By

You’ve probably heard sayings like “sometimes things aren’t always what they seem” and “people lie”. Well, sometimes execution plans lie. It’s not really by intent, but it is sometimes difficult (or impossible) to represent everything in a query execution tree in nice tabular format like dbms_xplan gives. One of the optimizations that was introduced...

Read more »

The Core Performance Fundamentals Of Oracle Data Warehousing – Set Processing vs Row Processing

July 20, 2010
By

In over six years of doing data warehouse POCs and benchmarks for clients there is one area that I frequently see as problematic: “batch jobs”.  Most of the time these “batch jobs” take the form of some PL/SQL procedures and packages that generally perform some data load, transformation, processing or something...

Read more »

The Core Performance Fundamentals Of Oracle Data Warehousing – Data Loading

April 23, 2010
By

Getting flat file data into your Oracle data warehouse is likely a daily (or more possibly frequent) task, but it certainly does not have to be a difficult one.  Bulk loading data rates are governed by the following operations and hardware resources: How fast can the data be read How fast can data be...

Read more »

The Core Performance Fundamentals Of Oracle Data Warehousing – Parallel Execution

April 19, 2010
By
The Core Performance Fundamentals Of Oracle Data Warehousing – Parallel Execution

Leveraging Oracle’s Parallel Execution (PX) in your Oracle data warehouse is probably the most important feature/technology one can use to speed up operations on large data sets.  PX is not, however, “go fast” magic pixi dust for any old operation (if thats what you think, you probably don’t understand the parallel...

Read more »

The Core Performance Fundamentals Of Oracle Data Warehousing – Partitioning

January 25, 2010
By

Partitioning is an essential performance feature for an Oracle data warehouse because partition elimination (or partition pruning) generally results in the elimination of a significant amount of table data to be scanned. This results in a need for less system resources and improved query performance. Someone once told me “the fastest...

Read more »

The Core Performance Fundamentals Of Oracle Data Warehousing – Table Compression

January 19, 2010
By

Editor’s note: This blog post does not cover Exadata Hybrid Columnar Compression. The first thing that comes to most people’s mind when database table compression is mentioned is the savings it yields in terms of disk space. While reducing the footprint of data on disk is relevant, I would argue it...

Read more »

The Core Performance Fundamentals Of Oracle Data Warehousing – Balanced Hardware Configuration

December 22, 2009
By
The Core Performance Fundamentals Of Oracle Data Warehousing – Balanced Hardware Configuration

If you want to build a house that will stand the test of time, you need to build on a solid foundation. The same goes for architecting computer systems that run databases. If the underlying hardware is not sized appropriately it will likely lead to people blaming software. All too often...

Read more »

The Core Performance Fundamentals Of Oracle Data Warehousing – Introduction

December 14, 2009
By

At the 2009 Oracle OpenWorld Unconference back in October I lead a chalk and talk session entitled The Core Performance Fundamentals Of Oracle Data Warehousing. Since this was a chalk and talk I spared the audience any powerpoint slides but I had several people request that make it into a presentation so they could...

Read more »

Oracle OpenWorld 2009: The Real-World Performance Group

July 20, 2009
By

Even though Oracle OpenWorld 2009 is a few months away, I thought I would take a moment to mention that the Oracle Real-World Performance Group will again be hosting three sessions. Hopefully you are no stranger to our Oracle database performance sessions and this year we have what I think will be a very...

Read more »

Oracle Parallel Execution: Interconnect Myths And Misunderstandings

July 6, 2009
By
Oracle Parallel Execution: Interconnect Myths And Misunderstandings

A number of weeks back I had come across a paper/presentation by Riyaj Shamsudeen entitled Battle of the Nodes: RAC Performance Myths (avaiable here). As I was looking through it I saw one example that struck me as very odd (Myth #3 – Interconnect Performance) and I contacted him about it. After further review...

Read more »

Facebook: Hive – A Petabyte Scale Data Warehouse Using Hadoop

June 10, 2009
By

Today, June 10th, marks the Yahoo! Hadoop Summit ’09 and the crew at Facebook have a writeup on the Facebook Engineering page entitled: Hive – A Petabyte Scale Data Warehouse Using Hadoop. I found this an very interesting read given some of the Hadoop/MapReduce comments from David J. DeWitt and Michael Stonebraker as well...

Read more »

The Impact Of Good Table And Query Design

March 19, 2009
By

There are many ways to design tables/schemas and many ways to write SQL queries that execute against those tables/schemas. Some designs are better than others for various reasons, however, I think that frequently people underestimate the power of SQL (for both “good” and “evil”). All too often in data warehouses, I see tables designed...

Read more »