Reference Architecture Guide EMC XTREMIO AND MICROSOFT EXCHANGE DATABASES Preliminary findings: Efficiency of various production samples Market overview and adoption of all-flash arrays Techniques for estimating efficiency EMC Solutions June 2015 Copyright © 2015 EMC Corporation. All Rights Reserved. EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice. The information in this publication is provided as is. EMC Corporation makes no representations or warranties of any kind with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness for a particular purpose. Use, copying, and distribution of any EMC software described in this publication requires an applicable software license. For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com. All trademarks used herein are the property of their respective owners. Part Number H14246 EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 2 Table of contents Reference architecture overview ........................................................................................................... 4 Document purpose .......................................................................................................................... 4 Business challenge .......................................................................................................................... 4 Technology solution ......................................................................................................................... 5 Solution value .................................................................................................................................. 6 Solution architecture ............................................................................................................................ 7 Architecture diagram ........................................................................................................................ 7 Hardware resources ......................................................................................................................... 8 Software resources .......................................................................................................................... 8 Technology overview .......................................................................................................................... 10 Introduction ................................................................................................................................... 10 EMC XtremIO .................................................................................................................................. 10 EMC Data Domain .......................................................................................................................... 11 EMC PowerPath/VE ........................................................................................................................ 11 VMware vSphere 5.5 ...................................................................................................................... 11 Microsoft Exchange Server 2010 .................................................................................................... 11 Validation testing and results ............................................................................................................ 13 Introduction ................................................................................................................................... 13 Methodology.................................................................................................................................. 13 Results........................................................................................................................................... 13 Analyzing your environment ............................................................................................................... 15 Conclusion ......................................................................................................................................... 17 References.......................................................................................................................................... 18 EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 3 Reference architecture overview Document purpose This reference architecture guide describes the solution that EMC has developed to demonstrate the efficiency and benefits of storing Microsoft Exchange 2010 and Exchange 2013 mailbox databases on an EMC® XtremIO™ storage array. The guide explains key elements of storage efficiency, the status of the all-flash storage array marketplace, and the position of XtremIO in the storage marketplace for Exchange and SharePoint Server. This reference architecture guide documents the methods that EMC used to analyze a currently installed Exchange 2010 Mailbox server implementation and provides a summary of the benefits of deploying the XtremIO solution. The guide also describes how to use a Mitrend analyzer tool to estimate the efficiency improvements that you can realize by deploying this XtremIO solution on your Microsoft Exchange databases. Business challenge Customers require reliable storage solutions for Microsoft Exchange that are easy to acquire, manage, configure, and scale. Microsoft Exchange is often at the center of an organization’s collaboration tool suite. Customers require Exchange to be operational and cost-effective. Migration from on-premises systems to off-premises/public cloud solutions introduces additional complexities and concerns regarding data sovereignty, service levels, and authentication management. Many customers require messaging data to remain onsite or under their own control. This desire to lower costs, complexities, and configuration issues has resulted in a re-evaluation of how messaging systems are deployed and operated. Several new technologies are available to assist customers in building a messaging solution. Customers need to know how to best use these technologies to maximize their investment, support service-level agreements, and reduce their total cost of ownership (TCO). This solution demonstrates potential efficiencies delivered by XtremIO for any Exchange customer’s messaging environment, and it validates the storage infrastructure for performance, scalability, and efficiency. By implementing EMC XtremIO for Microsoft Exchange, as compared to EMC VNX and to Microsoft’s preferred architecture of direct-attached storage (DAS) (single spindles), customers will experience: Reduced overall storage capacity Better end-user productivity with more consistent performance Simplified management with the environment contained in their data center or at a qualified co-location Better support of service level agreements and compliance initiatives Lower comparative installation, operational, and maintenance costs A February 2015 paper from IDC, The Copy Data Problem: Analysis Update, outlined the issue of retaining primary and backup copies of data within corporate environments. IDC's advice is simple—"use the most advanced storage array EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 4 technologies that support real-time thin provisioning, de-duplication, compression, and snapshot capabilities to reduce your overall storage footprint." IDC also completed an analysis of the all-flash array marketplace in May 2015, graphically depicted in Figure 1. Its findings were compelling. Although most organizations deploying all-flash arrays for use in messaging and collaboration workloads have more than 1,000 users, all-flash arrays have been and will continue to be deployed across organizations of all sizes. Figure 1. Technology solution All-flash array marketplace analysis This solution demonstrates how to use an XtremIO array, in the context of Exchange mailbox databases, for both primary storage and database copies used in conjunction with Exchange continuous replication and/or database availability groups (DAGs). Planning and designing the storage infrastructure for Exchange on XtremIO becomes a non-critical step because the shared storage can absorb large bursts of input/output (I/O) that occur during the course of a day. In typical direct-attached single-spindle architectures, user demand and index rebuilds, coupled with database maintenance task and backup operations, can lead to periods of erratic and unpredictable client/user performance. Users can adapt to slow performance, but unpredictable performance will quickly frustrate them. To provide predictable performance in a messaging infrastructure, the storage must be able to handle the peak I/O load from the clients without resulting in a high response time. Designing storage for this workload typically involves deploying advanced storage arrays that use caching algorithms, disks deployed in storage pools, and/or multitiered storage subsystems to handle brief periods of extreme I/O pressure, all of which are complex to design and expensive to implement and maintain. EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 5 The XtremIO array circumvents the storage inefficiencies of advanced multitier storage arrays and the inflexible, underperforming nature of direct-attached singlespindle architectures. Solution value XtremIO reduces database and data center footprints and complexity with unparalleled in-line data reduction capabilities, addressing storage sprawl for Exchange databases. The XtremIO array eliminates storage usage in several ways: Globally deduplicating every redundant 8 KB block that enters the array for storage Reducing repeating sequences of data (compression) Eliminating fully-formatted volumes Eliminating volume boundaries Initial findings from our tests show impressive efficiency rates ranging from 2.5:1 to more than 8:1 depending on the number of mailboxes and the number of DAG copies. Our tests indicate that the likelihood of duplicated blocks within an Exchange system increases as the number of mailboxes increases. This is logical since email messages and email attachments are likely to be sent to multiple mailboxes. And, because XtremIO globally deduplicates redundant blocks of data, the more copies of messages that exist in the entire system, the higher the deduplication rate will be. Furthermore, because XtremIO never writes a previously written block to its flash drives, redundant copies of Exchange databases created by Exchange continuous replication/DAG consume no space on the XtremIO array. In the vast majority of installations, customers typically observe an average reduction of 60 percent in actual storage usage for the active copy. Comparing EMC XtremIO to the SATA DAS drives proposed by Microsoft’s costreduction strategy, we find that XtremIO offers efficiencies that SATA DAS does not, as shown in Table 1. Table 1. Comparison of XtremIO and SATA DAS capabilities Feature Note: XtremIO SATA DAS Global deduplication - Global compression - Thin provisioning - Elimination of 8 KB blocks containing all zeros - XtremIO uses deduplication to eliminate storage of 8 KB blocks containing all zeros. EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 6 Solution architecture Architecture diagram Figure 2 depicts the overall physical architecture of the solution. Figure 2. Solution architecture EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 7 Hardware resources Table 2 lists the hardware resources used in this solution. Table 2. Solution hardware Equipment Configuration Storage system XtremIO array, single X-Brick™ SAN switches 8 Gb/s Fibre Channel switches Physical server Rack server: CPU: E7-2870, 2.4 GHz, two sockets with 10 cores per socket Memory: 160 GB Virtual machines Six virtual machines, each configured with: Four vCPUs 16 GB memory Backup equipment EMC Data Domain® (used to restore Exchange databases) Software resources Table 3 lists the software resources used in this solution. Table 3. Solution software Resource Version Purpose EMC XtremIO operating environment 3.0.3 XtremIO operating environment EMC XtremIO Management Server (XMS) 3.0.3 XtremIO management software and graphical user interface EMC PowerPath®/VE 6.0 Advanced multipathing for ESXi host Microsoft Exchange Server 2010 Email servers Microsoft Windows Server 2008 Operating system for Exchange 2010 VMware ESXi 5.5 Hypervisor software Use XMS to Configure the XtremIO array Display information about the array Run the CLI Make any configuration changes Create and size volumes Map volumes to hosts EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 8 Figure 3 displays the XMS console view of the XtremIO X-Brick in the rack. Figure 3. XMS console view of X-Brick in rack Figure 4 displays the configuration of the XtremIO array, as shown in the XMS console. Figure 4. XtremIO array configuration as viewed in XMS console EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 9 Technology overview Introduction EMC XtremIO This solution includes the following key technology components: EMC XtremIO EMC Data Domain EMC PowerPath Virtual Edition (VE) VMware vSphere 5.5 Microsoft Exchange Server The EMC XtremIO storage array is an all-flash system with a scale-out architecture. The system uses X-Brick building blocks that you can cluster together to scale performance and capacity as required. This solution uses two X-Brick blocks clustered together as a single logical storage system. XtremIO flash storage delivers value across the following main functional areas: Performance—Regardless of how busy the system is, and regardless of storage capacity utilization, latency and throughput remain consistent, predictable, and constant. Latency within the array for an I/O request is typically far less than one millisecond. Scalability—Based on a scale-out architecture, a single X-Brick is the XtremIO building block. You can cluster multiple X-Brick blocks together to provide increased performance or capacity. Performance scales linearly to ensure that two X-Brick blocks supply twice the IOPS and four X-Brick blocks supply four times the IOPS of the single X-Brick configuration. At the same time, the latency remains consistently low as the system scales out. XtremIO arrays can scale up for any required performance or capacity level. In-line data reduction—The core XtremIO engine implements content-based inline data reduction. XtremIO automatically deduplicates and compresses data as the system processes it. This reduces the amount of data written to flash, improving longevity of the media and reducing cost. Volumes are always thinprovisioned without any loss of performance, over-provisioning of capacity, or fragmentation. Data protection—XtremIO uses a proprietary flash-optimized data protection algorithm, XtremIO Data Protection (XDP), which provides superior data protection while enabling performance surpassing that of any existing RAID algorithms. Optimizations in XDP also result in fewer writes to flash media for data protection purposes. Functionality—XtremIO supports high-performance and space-efficient snapshots, in-line data reduction, thin provisioning, and full VMware vSphere Storage APIs – Array Integration (VAAI) with support for Fibre Channel (FC) and iSCSI protocols. EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 10 EMC Data Domain Simplicity—Provisioning storage with XtremIO is as simple as deciding how large a LUN you want to create. You no longer need to select the RAID type, create a RAID group, or decide whether or not to enable thin provisioning or deduplication. These functions are already built into XtremIO. EMC Data Domain deduplication storage systems continue to revolutionize disk backup, archiving, and disaster recovery with high-speed, in-line deduplication. Data Domain systems reduce the amount of disk storage required to retain and protect data by an average of 10–30x. This makes Data Domain systems a cost-effective alternative to comparable capacity tape solutions. Data Domain systems provide high scalability, performance, and efficiency in a single system, which allows customers to keep data online and onsite for longer retention periods than previously feasible. EMC PowerPath/VE EMC PowerPath Multipathing is a host-resident software application that intelligently manages host-to-storage I/O data paths to maximize the performance and availability of mission-critical applications. It combines automatic load balancing, path failover, and multiple path I/O capabilities in one integrated package. EMC PowerPath/VE delivers PowerPath Multipathing features to optimize VMware vSphere and Microsoft Hyper-V virtual environments. PowerPath/VE for VMware supports multiple paths between a vSphere host and an external storage device. Having multiple paths enables the host to access a storage device, even if a specific path is unavailable. Multiple paths can also share the I/O traffic to a storage device. VMware vSphere 5.5 VMware vSphere 5.5 transforms a computer’s physical resources by virtualizing the CPU, RAM, hard disk, and network controller. This transformation creates fully functional virtual machines that run isolated and encapsulated operating systems and applications just like physical computers. VMware vSphere High Availability (HA) provides easy-to-use, cost-effective high availability for applications running in virtual machines. The vSphere vMotion and vSphere Storage vMotion features of vSphere 5.5 enable seamless migration of virtual machines and stored files from one vSphere server to another, with minimal or no performance impact. Coupled with vSphere Distributed Resource Scheduler (DRS) and vSphere Storage DRS, virtual machines have access to the appropriate resources at any time through load balancing of the compute and storage resources. Microsoft Exchange Server 2010 Microsoft Exchange Server 2010 is an enterprise email and communication system that enables businesses and customers to collaborate and share information. EMC enhances Exchange Server 2010 with a selection of storage platforms, software, and services. With Exchange 2010, Microsoft presents a new, unified approach to high availability and disaster recovery by introducing features such as DAG and online mailbox moves. Mailbox servers can now be implemented in mailbox resiliency configurations with database-level replication and failover. Major improvements with the application database structure and I/O reduction include support for a larger variety of disk and RAID configurations, including high- EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 11 performance flash drives, FC, and serial-attached SCSI (SAS) drives, and slowerperforming SATA and near-line SAS (NL-SAS) drives. EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 12 Validation testing and results Introduction This section briefly describes the steps we took to analyze the potential storage capacity reduction of Exchange 2010 databases loaded on an XtremIO storage array, and the results achieved from the testing. Note: We tested this solution on a Microsoft 2010 Exchange server configuration, but the solution will work equally well on a Microsoft 2013 Exchange server. Methodology We borrowed unique active (production) databases from EMC IT’s operational email installation for use during these efficiency tests. We created six logical volumes (LUNs) on the XtremIO array. We masked one LUN to each of the six virtual machines. We initialized each LUN, formatted it using a block allocation unit size of 64 KB, and assigned it a drive letter. We sequentially restored EMC IT’s Exchange mailbox databases from the Data Domain system. We recorded the status of XtremIO capacity, volume use, compression, deduplication, and thin provisioning totals for analysis after copying each database. This section identifies the capacity savings elicited by XtremIO and reports each element of XtremIO’s data reduction capabilities through: Results EMC XtremIO testing results Thin provisioning Removal of zeroes Removal of duplicate blocks Removal of repeating characters within blocks Table 4 displays how the EMC XtremIO array responded to the migration of six unique Microsoft Exchange mailbox databases. XtremIO storage efficiency is calculated by dividing the storage provisioned to all hosts by the actual storage consumed by those hosts. Table 4. Results of Exchange 2010 databases storage efficiency testing on XtremIO Removal of repeating characters within blocks Cumulative database capacity of host volumes (TB) Cumulative capacity used (TB) Cumulative capacity savings per database file (TB)* Database sample Removal of zeros Cumulative removal of duplicate blocks Database 1 46.67% 23.08% 16.67% 1.3 1.075 0.609 Database 2 42.67% 52.38% 16.67% 2.6 1.579 0.881 EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 13 Database 3 33.67% 52.38% 16.67% 3.9 1.979 2.021 Database 4 36.67% 47.37% 16.67% 5.2 2.167 2.924 Database 5 38.67% 47.37% 16.67% 6.5 2.619 3.472 Database 6 27.67% 47.37% 16.67% 7.8 2.9866 4.4734 * Cumulative capacity savings derived from deduplication and compression (ignores savings from thin provisioning and elimination of zeros) In our tests, 18 TB of storage was provisioned and 2.9866 TB of storage was consumed on the XtremIO array. Based on the cumulative storage consumed versus the cumulative storage provisioned, the EMC IT mailbox infrastructure observed a 6:1 storage efficiency improvement on the primary (active) copy of mailbox databases. All databases were 1.3 TB in size and contained an average of 230 mailboxes. Each database was copied onto a separate 3 TB volume (per Microsoft best practice for provisioning storage volumes), resulting in thin provisioning savings of 56 percent for each of the volumes. As shown in Figure 5, excluding the benefits of thin provisioning, six 1.3 TB databases (totaling 7.8 TB) were copied onto the XtremIO array. The data consumed 2.9866 TB of capacity on the XtremIO flash drives. Not including the benefits of thin provisioning of the volume, the XtremIO array reduced the capacity necessary to store 7.8 TB of Exchange databases by 4.8 TB or 61.7 percent. Size (TB) XtremIO Storage Efficiency Analysis 20 18 16 14 12 10 8 6 4 2 0 Database 1 Database 2 Database 3 Database 4 Database 5 Database 6 Cumulative capacity provisioned (TB) 3 6 9 12 15 18 Cumulative Database Size on Volume 1.3 2.6 3.9 5.2 6.5 7.8 1.075 1.579 1.979 2.167 2.619 2.9866 Cumulative capacity used (TB) Figure 5. XtremIO storage efficiency analysis EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 14 Analyzing Your Environment Use the Mitrend XtremIO Efficiency Analyzer tool to determine what results you might expect from deploying the XtremIO solution with your own Microsoft Exchange databases. The results contained in this reference architecture report on a representative sampling of EMC IT’s Exchange 2010 mailbox databases. Every Exchange database has different amounts of duplicated email, attachments, and calendar data. Every database has varying amounts of empty database pages. And every database has varying content that responds to compression algorithms differently. Customers can evaluate database volumes before migrating them to XtremIO by using the EMC-developed Mitrend XtremIO Efficiency Analyzer tool. With this evaluation tool, customers can scan individual volumes to predict the effectiveness of XtremIO data efficiency. Refer to the following website, where you can download the tool and obtain detailed information on how the XtremIO Efficiency Analyzer works and how to get dependable results: https://app.mitrend.com/emc/#instructions/XtremIO_Reduction Note: If you click the Mitrend link and are asked for credentials that you cannot provide, contact your EMC Partner, EMC Sales Representative, or EMC Systems Engineer for assistance. To avoid running the scan on the production database, simply restore a representative sampling of the database to an alternate server (non-production virtual machine) and run the Mitrend tool against those volumes. After you collect the Mitrend data, ask your EMC Representative to submit your data collection to the Mitrend site. Mitrend produces a report, like the one shown in Figure 6, that you can use to make decisions. Total Scanned Zero Data Deduplication Compression Required 120,000 100,000 Capacity (GB) Introduction 80,000 60,000 40,000 20,000 0 Total Scanned Savings Required Category Figure 6. Mitrend sample report Data reduction analysis measures the data reduction benefits of thin provisioning, deduplication, and compression of file system data. For best results, scan all of the data that will be located on the same system to calculate the benefits of global deduplication. Scanning just one or two file systems will show thin and compression savings, but will not show the entire benefit of global deduplication. EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 15 The Mitrend Data Analyzer works by analyzing the individual blocks of data on each file system. It computes a small fingerprint of each file system and loads it into a file named output.mitrend. The individual output.mitrend fingerprints can then be loaded to a single Mitrend assessment, where they are merged and used to compute the global deduplication savings. The Mitrend Data Analyzer can run in raw or file system mode. In raw mode, the entire raw block device is scanned (including unallocated blocks) and any data found is analyzed. With the exception of modern systems that use the UNMAP and TRIM SCSI commands, file systems do not erase data upon file deletion. As a result, a raw scan can find more data than is actually consumed by the file system. EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 16 Conclusion EMC has responded to wide-ranging Exchange environment storage concerns with one comprehensive offering: EMC XtremIO is a thoroughly tested, production-proven, enterprise-class array. XtremIO has been tested and evaluated by dozens of customers including EMC IT. Initial findings report impressive and compelling efficiency rates ranging from 2.5:1 to more than 8:1, depending on the number of mailboxes and the number of DAG copies. In the vast majority of installations, customers typically observe an average reduction of 60 percent in actual storage usage for the active copy. EMC provides a viable and reliable method for analyzing currently installed Exchange environments in an effort to predict storage reduction ratios. EMC has verified that its Mitrend tool can furnish customers with insights into the storage efficiency benefits provided by XtremIO without the need to conduct an on-premises evaluation of the product. The XtremIO array provides cost-effective, highly-efficient storage for Exchange 2010 and Exchange 2013 mailbox databases. XtremIO enables customers to reduce the overall capacity required to store mailbox databases and operate their Exchange messaging systems more efficiently, compared to DAS and conventional block storage arrays. EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 17 References The following documents provide additional and relevant information: Best Practice Considerations for Testing All-Flash Arrays (available on EMC Community Network) Introduction to the EMC XtremIO Storage Array (Ver. 3.0) White Paper (available on EMC.com) EMC XtremIO and Microsoft Exchange Databases Reference Architecture Guide 18
© Copyright 2026 Paperzz