Cost/Benefit Case for IBM DB2 10.5 for High Performance

Management Report
October 2014
Cost/Benefit Case for IBM DB2 10.5 for
High Performance Analytics
Compared to Microsoft SQL Server 2014
International Technology Group
609 Pacific Avenue, Suite 102
Santa Cruz, California 95060-4406
Telephone: 831-427-9260
Email: [email protected]
Website: ITGforInfo.com
International Technology Group
October 2014
Table of Contents
EXECUTIVE SUMMARY
The Players
The Landscape Changes
Lost Opportunity Costs
Technology Differentiators
Performance
Complexity
Conclusions
SOLUTIONS
SQL Server 2014
Overview
Clustered Columnstore Indexes
DB2 10.5
Overview
Overall Capabilities
DETAILED DATA
Basis of Calculations
Installations
Costs of Ownership
Costs Breakdowns
1 1 2 3 4 4
5
5 6
6 6
6 6 6
8
10
10 10
10
11 List of Figures
1.
Three-year Costs of Ownership for IBM DB2 10.5 with BLU Acceleration and
Microsoft SQL Server 2014 with Clustered Columstore Indexes
1 2.
Key Technologies Incorporated in IBM DB2 10.5 with BLU Acceleration and
Microsoft SQL Server 2014 with Clustered Columnstore Indexes
2 3.
Three-year Lost Opportunity Costs for Use of IBM DB2 10.5 with BLU Acceleration and
Microsoft SQL Server 2014 with Clustered Columnstore Indexes – All Installations
3 4.
BLU Acceleration Data Reduction Processes
5.
Principal IBM DB2 10.5 BLU Acceleration Technologies for Analytics Processing
6.
Principal Capabilities of Overall IBM DB2 10.5 Environment
7.
Installations Summary
8.
Three-year Costs of Ownership Breakdowns for IBM DB2 10.5 and Microsoft SQL Server 2014
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
4 7 8 10 11 International Technology Group
October 2014
Executive Summary
The Players
The database world is undergoing unprecedented change. Data growth continues to accelerate, and database
structures and contents to become more complex. New challenges must be met as Big Data technologies gain
traction. Cloud computing defines new deployment and operating models. Demand for increasingly powerful
analytics solutions has become pervasive.
These shifts have changed the strategies of major database vendors. IBM and Microsoft have implemented new
technologies in their mainstream databases. For high-performance analytics applications, which are the focus of
this report, key new capabilities were provided in IBM DB2 10.5 with BLU Acceleration, and in Microsoft SQL
Server 2014 with Clustered Columnstore Indexes.
There are some commonalties between these solutions. User experiences, however, indicate that BLU
Acceleration is more powerful, incorporates a broader range of technologies and is better optimized to deliver
sustained performance for high-volume analytics queries.
BLU Acceleration, moreover, employs a dramatically simplified SQL design. Application delivery times are
reduced, database administrator (DBA) staffing is lower and system overhead is less than for use of Clustered
Columnstore Indexes.
Data compression and space reclamation are also a great deal more effective. BLU Acceleration employs global
table-wide technology; i.e., the system searches for compression opportunities across entire tables. Users report
higher compression rates than for Microsoft’s segment-based approach.
In further contrast, space reclamation in BLU Acceleration is an automated online process; i.e., space is reclaimed
on an ongoing basis during production operations. Clustered Columnstore Indexes require offline administrator
intervention; i.e., it is a great deal slower. Users may not be able to realize the full potential of compression, or
may not apply it all to avoid the service interruptions and administrative overhead of the Microsoft approach.
These differences are reflected in costs of ownership. In representative installations presented in this report, threeyear costs for use of SQL Server 2014 with Clustered Columnstore Indexes ranged from 1.7 to 2.1 times more,
and averaged 1.9 times more than for DB2 10.5 with BLU Acceleration. Figure 1 illustrates these results.
IBM DB2 10.5
641.6
1,217.4
Microsoft SQL Server 2014
$ thousands
Databases
Servers
Deployment
Personnel
Facilities
Figure 1: Three-year Costs of Ownership for IBM DB2 10.5 with BLU Acceleration and
Microsoft SQL Server 2014 with Clustered Columnstore Indexes
Lost opportunity costs – meaning bottom-line business losses due to deployment delays – averaged 2.6 times
more for use of SQL Server 2014 with Clustered Columnstore Indexes than for DB2 10.5 with BLU Acceleration.
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
1
International Technology Group
October 2014
The Landscape Changes
Growing demands for information to deal with more complex and volatile business conditions have intersected
with an acceleration of decision-making cycles at all levels of organizations. Increasingly, information must be
analyzed in real time.
Bottlenecks, however, have emerged in the ability of conventional data warehouse architectures to meet these
demands. One is that application development and deployment practices often impose delays that business users
find unacceptable. A second is that, in conventional architectures, latencies in movement of data between
processors and disks impair throughput. In high-volume environments, performance impact may be severe.
New technologies to address this constraint have emerged. Columnar data structures process only data in specific
columns, and enable significantly higher levels of compression. In-memory technologies maintain data in RAM
rather than on disk, increasing performance by wide margins. Data skipping avoids processing of data
unnecessary to specific queries, further reducing I/O loading.
Latest-generation server platforms also accelerate performance through single instruction, multiple data (SIMD)
techniques that parallel and vector processing at the microprocessor level. The is the case for Intel E7-based
systems, as well as IBM Power Systems, including new models based on IBM POWER8 processors.
POWER8-based systems offer new DB2 10.5 performance optimization capabilities, including support for largescale concurrent multithreading (up to eight threads per core) and exploitation of 128-bit registers. New reliability
features are also implemented.
The extent to which vendors have exploited new technologies varies. SQL Server 2014 with Clustered
Columnstore Indexes, and IBM DB2 10.5 with BLU Acceleration implement the technologies shown in figure 2.
Technology
IBM DB2 10.5 with
BLU Acceleration
Microsoft SQL Server 2014 with
Clustered Columnstore Indexes
High-performance compression
Table-based
Segment-based
In-memory technology
✔
✔
Data skipping
✔
Limited
SIMD exploitation
✔
N/A
Figure 2: Key Technologies Incorporated in IBM DB2 10.5 with BLU Acceleration and
Microsoft SQL Server 2014 with Clustered Columnstore Indexes
A DB2 10.5 capability introduced in August 2014 allows users to create columnar shadow tables of row-based
transactional data, and to execute queries directly on these. Shadow tables are updated automatically. Early
adopters have typically employed this capability for real-time operational queries and reporting. Users report that
DBA overhead is minimal, and that there is no impact on transactional performance.
Other distinctive DB2 10.5 BLU Acceleration features include automatic workload management (Clustered
Columnstore Indexes requires manual techniques); full separation of data management and security privileges
(not supported by Microsoft); and the ability to use key and unique constraints to avoid duplication of data in
tables (supported by SQL Server 2014 only for non-clustered columnstores, which cannot be updated).
Costs of Ownership
Costs of ownership calculations include database licenses and support; server hardware and operating systems;
personnel costs for database administration and related tasks; deployment costs and facilities (primarily energy)
costs. Acquisition, maintenance and support costs are based on discounted prices reported by users.
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
2
International Technology Group
October 2014
Database costs are similar. Although Microsoft per core pricing is aggressive, IBM per terabyte pricing leverages
BLU Acceleration strengths in data compression to reduce overall costs. Server costs are marginally higher for
SQL Server 2014 with Clustered Columnstore Indexes, reflecting use of Windows Server rather than the less
expensive Linux distribution employed by IBM.
The largest disparities are in people-related costs for database administration (costs for use of SQL Server 2014
with Clustered Columnstore Indexes average three times more than for DB2 10.5 with BLU Acceleration) and
deployment (costs for use of SQL Server 2014 with Clustered Columnstore Indexes average 2.6 times more). In
these and other areas, the key differentiator is that DB2 10.5 with BLU Acceleration is less complex.
Lost Opportunity Costs
Analytical applications may yield significant bottom-line gains in a matter of weeks to months. The corollary is
that delays in bringing such applications into production may represent significant lost revenue and profit.
This effect is apparent in the same installations employed for costs of ownership comparisons. In these cases, lost
opportunity costs for use of SQL Server 2014 with Clustered Columnstore Indexes ranged from 2.5 to three times
more than for DB2 10.5 with BLU Acceleration. Figure 3 illustrates disparities.
Financial Services Company
Manufacturing Company
22.8
60.8
IBM DB2 10.5 with BLU Acceleration
Microsoft SQL Server 2014 with
Clustered Columnstore Indexes
87.9
263.6
426.1
IT Services Company
1,065.3
$ thousands
Figure 3: Three-year Lost Opportunity Costs for Use of IBM DB2 10.5 with BLU Acceleration and
Microsoft SQL Server 2014 with Clustered Columnstore Indexes – All Installations
These costs are for initial applications only. In practice, organizations would continue to deploy new applications.
The cumulative impact of faster deployment over multi-year periods would be a great deal larger. Disparities in
lost opportunity costs would increase by wide margins.
These and other results presented in this report are based on input from 24 companies in the same industries and
size ranges, with generally similar business profiles employing IBM DB2 10.5 with BLU Acceleration or
Microsoft SQL Server 2014 with Clustered Columnstore Indexes in comparable roles.
Further information on profiles, methodology and assumptions employed for calculations, along with cost
breakdowns for installations and platforms may be found in the Detailed Data section of this report.
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
3
International Technology Group
October 2014
Technology Differentiators
Performance
In terms of performance, the two platforms are differentiated as follows:
1. SQL Server 2014 incorporates in-memory and columnar technology based on Microsoft’s earlier VertiPaq
engine and on the company’s Apollo development project. Columnar technology was introduced in SQL
Server 2008 R2 and enhanced in SQL Server 2014.
Early limitations limited adoption. For example, implementation of columnar technology was limited to
indexes, and could be employed only in read-only mode. Tables could not be changed without extensive
workarounds. In SQL Server 2014, tables may be more easily modified.
Users have reported significantly higher performance compared to use of SQL Server without Clustered
Columnstore Indexes. Microsoft benchmark tests have shown acceleration levels of 8 to 20 times using
cold buffer pools (i.e., data is fully loaded into RAM) and 4 to 10 times using warm pools (i.e., data is
divided between RAM and disk) for queries that benefit from columnar technology.
Overall performance gains are reported to be typically two to five times, with an average of slightly more
than 2.9 times. Performance increases of 300 to 800 times have been reported for individual queries.
Although Microsoft has variously claimed up to seven and up to 10 times compression, users report
between 40 and 80 percent (1.7 to 5 times). The norm appears to be two to three times compared to pagelevel compression employed in SQL Server 2014. These values, moreover, refer to raw and actual data
sizes for indexes only. SQL Server 2014 continues to employ row-based structures for other data.
2. DB2 10.5 with BLU Acceleration, introduced in April 2013, implements a range of new-generation
technologies in a more integrated and optimized manner. These include columnar and in-memory
processing, high-performance compression and caching, data skipping (i.e., the ability to avoid processing
data that is not necessary to specific queries), and microprocessor-level parallel and vector processing.
Users have reported increases of between 5 and 74 times in overall query performance compared with
previous DB2 versions (typically DB2 10 or DB2 9.7), with an overall average of 31.6 times.
Compression levels ranged from 9 to more than 15 times, and averaged around 12.6 times. In comparison
with Clustered Columnstore Indexes, which compress only at the column segment level, BLU
Acceleration extends compression across entire tables.
Key technologies in BLU Acceleration are combined in a manner that progressively reduces the amount
of data that must be processed and moved through I/O to boost performance. The sequence of processes
through which this occurs is illustrated in figure 4.
IBM BLU ACCELERATION
User Data
10 TB
Actionable
Compression
1 TB
Column
Processing
10 GB
Data
Skipping
1 GB
Parallel
Processing*
31.25 MB
Vector
Processing*
7.8 MB
*32 cores
Figure 4: BLU Acceleration Data Reduction Processes
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
4
International Technology Group
October 2014
These numbers should be taken as indicative – actual volumes may vary widely by database and workload. But
they highlight the efficiency of the BLU Acceleration design.
A further BLU Acceleration characteristic should be highlighted. The IT world has seen a progressive shift
toward higher-density memory media for high-performance workloads. This shift has been reflected in use or
RAM for in-memory databases. BLU Acceleration is designed to move even beyond this stage, to the point where
processing is conducted overwhelmingly in cache.
In addition, new shadow tables allow concurrent DB2 10.5 processing of queries and transactions. Row-based
transactional data is continuously and automatically replicated to columnar tables, and execution of queries to
these fully leverages BLU Acceleration in-memory technology. Users have reported the same performance
improvements of 10x or more as for dedicated DB2 10.5 query workloads.
Shadow tables are automatically synchronized with corresponding row-oriented tables. If access to other data
sources is not required, users may avoid delays due to use of extract, transformation and load (ETL) tools.
Complexity
In contrast to SQL Server 2014 with Clustered Columnstore Indexes, BLU Acceleration employs a simple SQL
design. The system does not, for example, employ schemas, indexes, or aggregate tables. Simplicity reduces the
time required for tasks such as system design, application development, testing and tuning, and ongoing
administration. Deployment also becomes a faster and more reliable process.
Applications may be created and deployed in a few comparatively simple steps. Once individuals became
proficient with the system, these took minutes…less than 20 minutes…less than an hour…a few hours. One
commented: We create tables and load data. Period. A key benefit was reported to be that end users might
develop server-based applications directly, rather than going through programming staff.
In normal operations, processes such as tuning, optimizer and compression administration, space reclamation,
database reorganizations, statistics collection and reporting, and workload management are largely automated.
Performance tuning requirements were said to be minimal…virtually non-existent. An IBM-supplied tool
automated conversion of row-based tables to columnar format.
Organizations reported that, once initial deployment had been completed, DBA overhead for BLU Acceleration
was minimal. User estimates ranged from six hours a week to maybe a quarter of an FTE (full time equivalent).
Degrees of complexity affect comparative deployment times. Users of Clustered Columnstore Indexes reported
that systems were typically brought into production in two weeks to six months – most responses were in the six
weeks to six months range – with an average of around 93 days. In comparison, BLU Acceleration users reported
eight days to three months, with an average of around 38 days.
Conclusions
The selection of DB2 10.5 or SQL Server 2014 also involves a larger choice as to how use of high-performance
analytics will evolve within organizations. The Microsoft approach focuses on end-user control, and assumes that
Microsoft will supply basic tools that will be enhanced by third parties, customized, applied and maintained by
DBAs and system administrators.
This is the traditional Microsoft approach to IT, and it will no doubt appeal to many organizations. But DB2 10.5
represents a stronger offering in terms of performance, architectural simplicity, back-end data integration,
manageability and time to bring new applications into production. Its focus is on quality and timeliness of
information. Where these are critical business priorities, DB2 10.5 with BLU Acceleration is a better option.
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
5
International Technology Group
October 2014
Solutions
SQL Server 2014
Overview
SQL Server 2014 is the latest version of Microsoft’s core database, which originated in the 1990s. Since the mid2000s, Microsoft has positioned SQL Server aggressively for data warehousing and business intelligence (BI)
applications. Clustered Columnstore Index was added as part of Microsoft’s xVelocity solution in SQL Server
2008 R2, and enhanced in SQL Server 2014.
Other SQL Server 2014 analytics-related features include an expanded version of the Microsoft extract,
transformation and load (ETL) suite, SQL Server Integration Services. New components include Master Data
Services, a master data management tool originally developed by Stratature, which Microsoft acquired in 2007;
and Data Quality Services for data quality management.
SQL Server 2014 also supports Microsoft AlwaysOn clustering and Windows Server Core Support, which
reduces memory footprint and disk space consumption.
Clustered Columnstore Indexes
This technology organizes data in memory into columnar form, and compresses it. While not all data required for
a query must fit into memory, performance will be significantly higher when this is the case.
Data may be processed in batch mode; i.e., multiple rows of data are fetched and processed in a single operation.
According to Microsoft, this approach is most effective for queries involving large numbers of joins, filters and
aggregations. It employs a proprietary Microsoft form of vector processing.
The determination of whether to use batch or row-by-row processing for a given query is made by the SQL Server
Query Optimizer. Normally, batches contain around 1,000 rows. Smaller data blocks are processed row-by-row.
Clustered Columnstore Indexes incorporate a limited form of data skipping, which enables the system to bypass
segments of data that are not required for a specific query. Segment size is, however, comparatively large – one
million rows – which means that in practice large amounts of unnecessary data are often processed.
DB2 10.5
Overview
Introduced in 1996, DB2 for Linux, UNIX and Windows (LUW) has progressively evolved toward greater
performance and functionality. Recent versions have included DB2 9.5 (2008), DB2 9.7 (2009), DB2 9.8 (2010),
DB2 10 (2012) and DB2 10.5, which added BLU Acceleration (2013).
In its initial form, BLU Acceleration is designed primarily to support data warehouse systems with from 1 TB to
10 TBs of raw user data. It is a single-server solution, although in-memory, caching and processor optimization
technologies enable levels of analytics performance and capacity utilization that are significantly higher than for
most x86 platforms.
BLU Acceleration is supported on x86 hardware using Red Hat Enterprise Linux (RHEL) 5 or 6 x86-64 or SUSE
Linux 10 or 11 x86-64, and on IBM Power Systems, including new POWER8-based models.
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
6
International Technology Group
October 2014
The principal BLU Acceleration technologies for analytics processing are summarized in figure 5.
ANALYTICS PROCESSING
DB2 integration
New runtime technology embedded in DB2 kernel.
Columnar & row-based tables can be processed simultaneously on the same system, & employ the same schema.
Can also be accessed using the same SQL & language interfaces, process model, storage, memory & utilities.
Tables can be accessed within the same SQL statement.
BLU Acceleration uses existing DB2 client server infrastructure, compilers, tablespaces, buffer pools, sort heap &
package cache, & utilities including LOAD, BACKUP, RESTORE, EXPORT, SNAPSHOT, db2top, db2pd & others.
New utility enables conversion of row-based to columnar tables in a specified database. Row-organized tables
remain online during processing. System monitors conversion processes.
DB2 tooling supports conventional & BLU Acceleration functions. Tools include Optim Query Workload Tuner (may
be employed to recommend BLU Acceleration deployments & table transformations), along with IBM Data Studio,
InfoSphere Data Architect, InfoSphere Optim Performance Manager, & InfoSphere Optim Configuration Manager.
In-Memory & Caching Dynamic in-memory technology loads & processes data in RAM.
Technologies
New memory paging architecture means entire database table does not have to reside in main memory to be
processed. Blocks of BLU data may be moved into main memory as needed to query. According to IBM,
expectation is that 70 to 80 percent of active data will reside in RAM. Performance may, however, be maintained
even when the volume of data processed exceeds RAM capacity.
Scan-friendly memory caching algorithm, unique to BLU Acceleration, optimizes cache performance for scanintensive workloads. Automatically adapts operation to data characteristics. Represents alternative to least recently
used (LRU) algorithms designed primarily for transactional applications. Enables even (egalitarian) access to
cache resources for commonly used values.
Register-friendly encoding enables compressed data to be packed into cache structures for further efficiency in use
of processor, memory & I/O resources. Encoded values are packed into bits matching CPU register width –
includes support for 128-bit wide POWER8 registers.
Data Compression &
Space Reclamation
Actionable Compression enables processing of columnar data while still compressed; i.e., analytics may be
performed without decompression. Operates on row-based & columnar structures. Automatically adapts to data
characteristics.
Combines multiple IBM compression techniques including register-friendly encoding, described above. Users have
reported compression rates of 10 times or more compared to uncompressed tables, with corresponding
performance enhancements & storage savings.
Real-time automated space reclamation extends to row-based & columnar data. Space is freed online during
processing. DBA intervention not required for space management & REORGs.
Column Store
IBM implementation of technology enabling higher performance & reduced consumption of processor, memory &
I/O resources for analytics workloads. Scans are directed to values in a particular column or columns, avoiding the
need to process all data in a table.
Data Skipping
Reduces processor, memory & I/O resource consumption by excluding data unnecessary to query from
processing. Process is automatic (no DBA intervention is required), based on system-stored metadata on parent
table columns.
Shadow Tables
New feature in DB2 Fix Pack 4 allows concurrent processing of queries & transactions. Row-based transactional
data is continuously & automatically replicated to columnar tables. Execution of queries to these fully leverages
BLU Acceleration in-memory technology. Shadow tables are automatically synchronized with corresponding roworiented tables. Shadow tables are implemented as a form of Materialized Query Table (MQT) using InfoSphere
Data Replication Change Data Capture (CDC).
Intel CPU
Optimization
Exploits latest Intel Single Instruction, Multiple Data (SIMD) parallel processing enhancements for E5 processors,
including expanded Streaming SIMD Extensions (SSE) & Advanced Vector Extensions (AVX) instructions.
According to Intel, enables parallel execution across multiple cores on single E7 processor, for up to 8x
performance boost on 8-core processor. Vector processing provides additional performance of up to 4x for floating
point-intensive applications. Actual performance boosts depend upon workloads. 10-20x improvement is common.
POWER CPU
Optimization
Exploits SIMD parallel processing & other performance-related features on Power Systems. Optimization for
POWER8-based systems includes support for concurrent multithreading; use of 128-bit registers; & enhanced
reliability features including improvements in Data Page Memory Checking & expanded integrity checking.
Workload
Management
Enables simplified operation of DB2 Workload Manager (WLM) for BLU Acceleration mode. Maintains concurrency
subject to predefined threshold criteria. Optimizes use of processor, memory & I/O resources, & performs
automatic, ongoing performance tuning based on knowledge of underlying hardware. Optionally, allows users to
define more complex policies based on DB2 WLM. Where shadow tables are employed, WLM automatically routes
queries to these, & transactions to row-based tables.
Figure 5: Principal IBM DB2 10.5 BLU Acceleration Technologies for Analytics Processing
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
7
International Technology Group
October 2014
BLU Acceleration leverages a number of established DB2 strengths. In particular, the core DB2 Workload
Manager (WLM) supports a new BLU Acceleration mode; and IBM- and Optim-branded DBA tools have been
adapted to support BLU Acceleration as well as conventional administration and optimization functions.
Overall Capabilities
DB2 10.5 leverages longstanding DB2 strengths in such areas as performance optimization, data compression,
workload management, high availability, and simplification and automation. Capabilities of the overall DB2 10.5
environment are summarized in figure 6.
GENERAL CAPABILITIES
Time Travel Query
Differentiates system time (when an event is logged) & business time (an alternative date &/or time
associated with the event) in maintaining & querying records. Obviates need for custom-developed
applications to analyze multiple timelines. Complies with temporal features of ANSI/ISO SQL:2011.
Continuous Data Ingest
Employs IBM parallel loading technology for extremely fast, low-overhead data transfers. Enables realtime data warehousing applications. Offers alternative to conventional batch & trickle feed techniques.
AVAILABILITY & RECOVERY
DB2 pureScale
Enables scale-out failover clustering for continuous availability. Generates <5% system overhead with
clusters of up to 64 nodes (installations are typically in this range), & <16% with 128 nodes. Based on IBM
Parallel Sysplex Data Sharing & General Parallel File System (GPFS). Migration requires no application
changes. Does not currently support BLU Acceleration, but may be employed for other workloads.
High Availability Disaster
Recovery (HADR)
Enables replication of data changes to one or more standby servers, & recovery from these. Supports up
to three hot standby servers, & allows delays to be set to prevent replication of problems.
Tivoli System Automation
Mainframe-derived high availability & policy-based automation solution. Manages failover, restart &
recovery within pureScale & HADR clusters.
Online Reorg
Reduces time when data is not available to users during reorg processes. Data remains available during
reload & rebuild phases.
PERFORMANCE-RELATED
Query Parallelism
Enables parallel query execution for more efficient use of CPU & I/O resources. Most effective for longrunning queries reading large amounts of data.
Table (Range) Partitioning
Allows data in a single table to be placed in multiple tablespaces for greater scalability, processing
efficiency & data roll-in/roll-out.
Materialized Query Tables
Query-specific table structure offering higher performance than indexes for certain types of query,
especially complex queries.
Multi-Dimensional Clustering
Enables flexible clustering of across multiple dimensions. Optimized for use in large data warehouse
environments. Typically accelerates query performance by around three times, & improvements of ten
times or more have been reported.
Scan sharing
Enables sharing of system resources by multiple scans. May significantly improve concurrency &
performance, & reduce I/O loading for high-volume scanning workloads.
DATA COMPRESSION
Actionable Compression
Enables processing of columnar data while still compressed. Operates on row-based & columnar
structures. Automatically adapts to data characteristics. Combines multiple IBM compression techniques
including register-friendly encoding. Supports real-time automated space reclamation, described above.
Adaptive Compression
Algorithms integrate table, index & compression. Overall rates are typically four to ten times, with average
of around seven times.
Backup Compression
Enables compression of data in all DB2 structures during backup.
STORAGE-RELATED
Multi-temperature Data
Management
Enables automated storage tiering for higher performance & lower overall disk costs. Obviates need for
controller-based tiering for most workloads. Tightly integrated with DB2 workload management.
pureXML Storage
Enables storage of IBM pureXML (IBM implementation of Extensible Markup Language) data in native
hierarchical mode.
Advanced Copy Services
(ACS)
Supports fast copying by IBM DS8000, Storwize V7000, SAN Volume Controller (SVC) & XIV systems
during backup & restore operations. Includes Tivoli Storage FlashCopy Manager. ACS Scripted Interface
enables use with non-IBM storage.
Figure 6: Principal Capabilities of Overall IBM DB2 10.5 Environment
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
8
International Technology Group
October 2014
Recent features include temporal processing (Time Travel Query), software-based storage tiering (Multi
Temperature Data Management) and an IBM parallel loading tool (Continuous Data Ingest) designed for realtime data warehouse updates.
IBM has also moved to enable integration of new Big Data types, including Hadoop and MapReduce. DB2 10.5
supports SPARQL, Resource Definition Framework (RDF), JavaScript Object Notation (JSON) data interchange
format, and other emerging standards. JSON has proved increasingly popular as an alternative to XML.
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
9
International Technology Group
October 2014
Detailed Data
Basis of Calculations
Installations
Cost comparisons presented in this report were based on the installations summarized in figure 7.
FINANCIAL SERVICES COMPANY
MANUFACTURING COMPANY
IT SERVICES COMPANY
BUSINESS PROFILE
Diversified retail bank
$70+ billion assets
5,000+ employees
250+ branches
Contract electronics manufacturer
$6+ billion sales
40,000+ employees
50+ manufacturing plants
IT outsourcing & professional services
$500+ million sales
6,000+ employees
20+ facilities
Sales & customer profitability analysis,
demand forecasting & related
Customer financial & operational key
performance indicator (KPI) applications
4/32 x Intel E5
0.25 FTE DBA
Deployment time: 4 weeks
8/64 x Intel E5
0.55 FTE DBA
Deployment time: 10 weeks
APPLICATIONS
Risk & compliance analysis/reporting,
financial & profitability analysis
IBM DB2 10.5 WITH BLU ACCELERATION
2/16 x Intel E5
0.15 FTE DBA
Deployment time: 3 weeks
MICROSOFT SQL SERVER 2014 WITH CLUSTERED COLUMNSTORE INDEXES
2/16 x Intel E5
0.35 FTE DBA
Deployment time: 2 months
4/32 x Intel E5
0.65 FTE DBA
Deployment time: 3 months
8/64 x Intel E5
1.5 FTE DBAs
Deployment time: 6 months
Figure 7: Installations Summary
Hardware platforms for both solutions are based on x86 servers from major vendors. Configurations and FTE
DBA staffing levels were based on user-reported data.
Costs of Ownership
These were calculated as follows:
•
DB2 10.5 with BLU Acceleration costs were calculated for per terabyte licenses, plus two years of support
(the first year is included in initial licenses); hardware acquisition and three years of maintenance for x86
servers; and three-year premium Linux subscriptions.
•
SQL Server 2014 with Clustered Columnstore Indexes costs were calculated for SQL Server 2014 R2
Enterprise Edition per core licenses, and Windows Server 2012 R2 Datacenter Edition per processor
licenses and Client Access Licenses (CALs), plus three-year Microsoft Software Assurance coverage.
Calculations also include hardware acquisition and three years of maintenance for x86 servers.
All maintenance and support costs for both platforms are for 24/7 coverage with four-hour response time.
•
Personnel costs were calculated based on annual salaries of $104,316/year for DB2 10.5 DBAs with BLU
Acceleration training and $97,103/year for SQL Server 2014 DBAs with Clustered Columnstore Indexes
training. Salaries were increased by 56.7 percent for bonuses, benefits and other per capita costs, and
multiplied for three years.
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
10
International Technology Group
October 2014
Calculations also include appropriate training courses provided by Microsoft Learning Partners
(Microsoft does not offer classroom training directly) or IBM. The duration of these, and the number of
individuals trained varied between installations.
•
Deployment costs were calculated for external professional services staff, charged at $2,000 or $3,000 per
person-day, depending on required skill levels, plus travel and entertainment (T&E) expenses.
•
Facilities costs are for energy consumption. Calculations are based on vendor specifications and assume
near-24/365 operations over a three-year period. A conservative assumption for average cost per kilowatthour was employed.
All cost values are for the United States.
Costs Breakdowns
Costs of ownership breakdowns are presented in figure 8.
Financial Services
Company
Manufacturing
Company
IT Services
Company
157,920
236,880
394,800
IBM DB2 10.5 WITH BLU ACCELERATION
Databases
Servers
9,915
42,803
169,810
Deployment
69,834
95,386
238,420
Personnel
82,759
131,798
282,215
Facilities
TOTAL ($)
1,745
2,848
7,631
322,173
509,715
1,092,876
MICROSOFT SQL SERVER 2014 WITH CLUSTERED COLUMNSTORE INDEXES
Databases
115,483
230,966
461,933
26,274
75,314
233,075
Deployment
190,736
429,156
596,050
Personnel
201,698
338,643
738,632
Servers
Facilities
TOTAL ($)
2,043
3,274
8,820
536,234
1,077,353
2,038,510
Figure 8: Three-year Costs of Ownership Breakdowns for
IBM DB2 10.5 and Microsoft SQL Server 2014
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
11
International Technology Group
October 2014
International Technology Group
ITG sharpens your awareness of what’s happening and your competitive edge
. . . this could affect your future growth and profit prospects
International Technology Group (ITG), established in 1983, is an independent research and management consulting firm
specializing in information technology (IT) investment strategy, cost/benefit metrics, infrastructure studies, deployment
tactics, business alignment and financial analysis.
ITG was an early innovator and pioneer in developing total cost of ownership (TCO) and return on investment (ROI)
processes and methodologies. In 2004, the firm received a Decade of Education Award from the Information Technology
Financial Management Association (ITFMA), the leading professional association dedicated to education and advancement
of financial management practices in end-user IT organizations.
Client services are designed to provide factual data and reliable documentation to assist in the decision-making process.
Information provided establishes the basis for developing tactical and strategic plans. Important developments are analyzed
and practical guidance is offered on the most effective ways to respond to changes that may impact complex IT deployment
agendas. A broad range of services is offered, furnishing clients with the information necessary to complement their internal
capabilities and resources.
Clients include a cross section of IT end users in the private and public sectors representing multinational corporations,
industrial companies, financial institutions, service organizations, educational institutions, federal and state government
agencies as well as IT system suppliers, software vendors and service firms. Federal government clients have included
agencies within the Department of Defense (e.g., DISA), Department of Transportation (e.g., FAA) and Department of
Treasury (e.g., US Mint).
Copyright © 2014 International Technology Group. All rights reserved. Material, in whole or part, contained in this document may not be
reproduced or distributed by any means or in any form, including original, without the prior written permission of the International Technology
Group (ITG). Information has been obtained from sources assumed to be reliable and reflects conclusions at the time. This document was
developed with International Business Machines Corporation (IBM) funding. Although the document may utilize publicly available material from
various sources, including IBM, it does not necessarily reflect the positions of such sources on the issues addressed in this document. Material
contained and conclusions presented in this document are subject to change without notice. All warranties as to the accuracy, completeness
or adequacy of such material are disclaimed. There shall be no liability for errors, omissions or inadequacies in the material contained in this
document or for interpretations thereof. Trademarks included in this document are the property of their respective owners.
Cost/Benefit Case for IBM DB2 10.5 for High Performance Analytics:
Compared to Microsoft SQL Server 2014
12