Shekhar Dasgupta is the Founder & CEO of GreenField Software Private limited, a venture pioneering next generation data center technologies for cloud infrastructures.
Data Center Infrastructure Management (DCIM) software has now come out of the shadows of an emerging technology it had started with a decade ago as a tool integrating Building Management System (BMS) and a Systems Management software. It has matured as a Data Center Operations, Planning and Management software vital for Data Center staff, managers as well as the CIO. Of course, evolution will further continue with Software Defined Data Centers (SDDCs) and DCIM becoming a natural inheritor of an Industrial Internet of Things (IoT) application.
Beyond monitoring and sending critical alerts, DCIM addresses the common data center operation problems that happen due to the absence of a comprehensive, up-to-date and accurate asset database. A Move-Add-Change operation goes untracked in a spreadsheet. DCIM automates this function with workflow-based approvals. The asset database itself can be created through an auto-discovery process and populating static attributes from in-built OEM libraries. Please click here to see this at work.
Beyond Data Center Operations, DCIM helps in Planning and Management. It helps identify the best-fit racks to place a new server, based on available space and remaining allocated power. It provides alerts when a Preventive Maintenance date is due. So downtime can be properly planned especially when there is possibility of downstream cascading impact. A Power Chain provides ready visualization to help avoid an adverse situation.
Finally, DCIM provides Data Center metrics. On the dashboard one can get rack space utilization, PUE and rack, row & room temperature for the month as against the SLA terms.
In a world of always-on service delivery, data center failures are unthinkable. The financial implication and loss of reputation alone make it imperative that we put in place systems that prevent such failures. At the same time, CFOs and CIOs are grappling with increasing capital expenditures and operating costs involved in running a High Availability Data Center.
GFS Crane DCIM delivers both on the High Availability promise as well enabling a leaner, greener and operationally more efficient data center. For more on GFS Crane DCIM, please see our new brochure.
We are privileged to be again participating in Intel DCM Booth at DCD New York on 19th & 20th April, 2016 at Marriott Marquis, Times Square.
We will be announcing our new version GFS Crane DCIM v3.0 with a number of enhancements.
With these enhancements, we will show how GFS Crane is a complete DCIM for Enterprise as well as Colocation Data Centers. We will demonstrate how it helps in Data Center Operations, Planning & Management.
We will demonstrate the integration with Intel Data Center Manager and how that reduces cost of ownership for better capacity planning, power, space and cooling management.
Last month, we did a joint webinar with Intel DCM Group where we showed how GFS Crane mitigates risks of data center failures. Those who missed that may see the recorded version. If you are attending DCD New York, please visit us at the Intel DCM booth to see GFS Crane as a Policy-driven DCIM. Threshold levels for each monitored parameter and SLA terms are defined within the Policies section of GFS Crane DCIM. When a threshold level is breached Active Alerts are sent to designated users via email or text messages or sent as SNMP traps to an ITSM tool. Dashboards and analytic reports provide Data Center Managers key metrics against SLA terms such as average PUE for the month, temperature range and uptime standards for critical devices.
You can read more from a recent case study of GFS Crane DCIM with a telecom operator.
Our recent DCIM implementations provide an insight to actual use pattern with enterprise DCIM customers in the South Asia region. While analyst reports, mostly focused on the North American and Western Europe markets, suggest Energy Efficiency, Capacity Planning and Compliance considerations as foremost reasons for DCIM deployment, here’s our observations:
- While 80% of our enterprise data center customers have licensed for GFS Crane DCIM full suite, the principal (but not only) reason for their deployment was to prevent a data center outage.
- To prevent this outage, customers needed instant monitoring and getting alerts from all critical infrastructures.
- If customer had a BMS, DCIM had to integrate with that.
- If customer did not have a BMS, then DCIM had to integrate directly with the devices, and specifically with those they perceived as the MOST critical, or the weakest link in the chain.
- The other DCIM functions in order of importance were: management dashboards with KPIs, data center visualization, asset and change management (with workflow approvals and audit trails) and capacity planning.
The above use patterns are for enterprise DCIM, as against DCIM in multi-tenant data centers who of course have additional reasons for DCIM deployment like automating customer on-boarding process, capacity planning and power/space inventory management, energy billing and offering customer portals for self-service.
Most enterprise data centers in India are less than 50 Racks and a large proportion do not have BMS or instrumentation for monitoring of physical infrastructure. They rely on periodic manual monitoring taking readings from device consoles, room thermometers and hand-held power meters. The inadequacy of this archaic approach is obvious to all. Hence the options are BMS, DCIM or a combination of both, latter two when customers are looking beyond monitoring and sending alerts.
The weakest link with one customer, operating in a region with daily twelve-hour power outages, were DG sets and fuel supply. Hence, GFS Crane DCIM had to offer a comprehensive fuel automation system including 24x7 hour monitoring of DG sets and fuel tanks and controlling fuel levels in the tanks.
With a High Performance Computing customer, paranoid about poor power quality or extended power outage damaging expensive equipment, GFS Crane DCIM provided extensive alerts as well as analytics, not just on individual UPS devices, but also on banks of them with DR policies defined within the DCIM. Passive alerts were converted to actionable instructions for preventing an application outage, and quickly isolating any expensive equipment from such power related incidents. Of course, both these customers are also benefiting from GFS Crane DCIM’s comprehensive asset & change management, capacity planning, power and environment management capabilities across both physical as well as IT infrastructure – the latter with Intel Data Center Manager.
I take this opportunity to wish all our customers, partners and visitors to our web site a Very Happy & Prosperous New Year.
GFS is a Knowledge Partner at DCD CONVERED Bangalore taking place this week. We are unveiling here the integrated solution of GFS Crane DCIM with Intel Data Center Manager. DCD Attendees can see this tomorrow (16th July) at Intel Booth.
GFS is sponsoring a Panel Discussion this afternoon “Key Metrics for Data Center Operations to Measure Reliability & Efficiency.” Moderated by our Vice President – Development, Parikshit Bhaduri, the panel includes
- Mr. N. Subramanyam, CIO, Diageo
- Mr. Rupinder Goel, Global CIO, Tata Communications
- Ms. Shaheen Meeran, Managing Director, Schnabel DC Consultants India
Data Center Metrics is a growing subject in data center operations management. Starting from the days when Power Usage Effectiveness (PUE) was first defined, Green Grid subsequently expanded with multi-level PUE measurements, Carbon Usage Effectiveness (CUE), and Water Usage Effectiveness (WUE) to round-off a set of sustainability metrics for the data center.
Other organizations, notably eBay, defined other measurement criteria for different data center roles. This will be the topic for today’s GFS sponsored panel. DCIM is expected to deliver all the key metric s for different data center roles real-time, to indicate not only the current state of health of the data center, but to enable instant corrective actions (if required) to maintain uptime and SLAs.
An example of this comes from our collaboration with Intel. Integrating Intel Data Center Manager gives us interesting measures on Rack Utilization Effectiveness (RUE) with respect to their Power, Space and Temperature on real-time basis. Instant view of RUEs helps in accurate Move-Add-Change (AC) decisions as well as Capacity Planning and Forecasting.
In the first part of this blog on Business Analytics for Data Centers, we explored why Analytics has become critical for Data Center operations . In this second part, we will explore how DCIM fulfills this role as a Business Analytic tool for Data Center operations.
While DCIM in its early days was largely seen as a bridge between Facilities and the IT Infrastructure Groups, it is now being recognized as an analytic tool for data center operations. Maturity in DCIM technology has meant that huge amounts of data from different devices are captured on a real time basis. Data Center Managers rightly expect that DCIM must now be more than just a monitoring tool and deliver meaningful insights from the data lake of power and environment monitoring, server utilization and threshold breaches.
At configuration stage, DCIM is mapped with the critical relationships and dependencies between all the assets, applications and business units in the data center. This makes it possible to identify cascading impacts of an impending failure. DCIM Analytics however goes deeper. Over a period of time, data patterns emerge which lend themselves to modern predictive and prescriptive analytics. Predictive analytics gives the data center team enough time to take measures to either avoid or reduce the impact of the failure when it happens. Prescriptive analytics, on the other hand, provides suggestions on how to achieve or improve benchmark levels on each of the metrics specified in advance.
DCIM works with environment probes that measure rack, row and room temperatures and humidity levels. Analytics can help to determine which areas in the data center need more cooling than others and even which PAC unit may be turned off in the data center at certain times of the day or month. Advanced DCIM, through analytics, recommends ways to reduce power consumption in the data center by raising temperature in zones that do not need extra cooling.
Other Benefits Using DCIM
There is a frequent Move-Add-Change (MAC) in data centers. DCIM has the capacity to deal with these MACs, as well as sudden surges in demand for data center resources. This works especially well with multiple virtual servers in the cloud. Most businesses today do not own just one data center housed in a single location – their data centers are spread around the world. Some are in-house and others are hosted by third-parties. DCIM is the only technology that lets business users control all their data center assets and resources from a single platform.
Data centers are notorious for their high power consumption. Advanced DCIM provides business and operational intelligence to maximize rack space use, minimize power distribution losses and optimize cooling while ensuring the data center meets SLA standards for temperature, availability and energy efficiency metrics like PUE (Power Usage Effectiveness).
Most businesses are finding it hard to make most of the existing space in their data centers, and the use of DCIM software mitigates this problem to a great extent. DCIM can help with reduced rack and floor space utilization, by providing detailed real-time reports on server utilization and capacity. Server utilization reports provide suggestions which of them can be decommissioned or virtualized and therefore overcome space constraints in the data center.
Finally, the most important function of DCIM is to prevent data center failures which can permanently damage the reputation of a business. In an age when a major data center failure can prove fatal for a business, DCIM provides monitoring as well as predictive analytic capability to prevent such a disaster.
A new breed of Data Center Infrastructure Management (DCIM) software is now emerging out of the shadows of being just a monitoring and tracking tool. Advanced DCIM are now providing the much needed Business Analytics for data centers. This can be a boon for both C-level executives and data center managers looking to cut costs while meeting demands for High Availability. This is a two-part blog. In this first part, we explore why Analytics has become critical for Data Center operations in the new world order of Internet of Things.
As data centers are growing in complexity, the need to keep them functioning at an optimum level, while cutting down on costs, is a challenge facing both the CIO as well as the CFO. Large businesses are spending millions to keep their data centers up and running and it is directly affecting their bottom line and ROI. Companies can no longer afford to let their data centers run under-utilized, nor can they afford failures. Sadly, most organizations are struggling to make the most of their data center investments.
Business Analytics and DCIM – An Introduction
Business Analytics software provides a broad set of capabilities for gathering and processing business data, and includes functions such as reporting, analysis, modeling and forecasting - all of which give business users the ability to make informed decisions and initiate actions directly from their dashboards.
In order to understand how a few of the advanced Data Center Infrastructure Management (DCIM) software provides similar capabilities for data centers, we have to first look at the challenge of running data centers effectively and at minimal cost. While the foremost responsibility of the Data Center Manager is maintaining High Availability, the challenge, somewhat ironical, can be summed up in one sentence:
Extreme redundancies with lots of assets increase the vulnerable points!
Not to mention, they also consume large amount of resources, and typically remain under-utilized.
Data center assets comprise both physical as well as IT infrastructure. The resources to keep them running include space and networks and also power and cooling without which the assets would not be able to function.
Advanced DCIM gives data center operators the ability to manage all their data center assets and resources from a single dashboard. Through real-time monitoring of all assets and resources, they can determine correlations between different parameters, thereby making their DCIM a powerful platform for deep analytics and business intelligence. DCIM Analytics ensures that all data center assets are in good health while consuming the least amount of resources and provides complete visibility to the power chain, enabling tracking and eliminating potential vulnerable points of failures.
In the second part, we will explore "How DCIM Business Analytics Works."
As we approach end of another year, it is time to reflect what is in store for us in the Data Center world in 2015 and beyond.
Up until now, from the dot com boom days, we witnessed first the proliferation and later consolidation of data centers.
Now that is about to change. Gartner recently pointed out there is a new disruption about to happen in the data center world. Growth of Internet of Things (IOT) would churn out hitherto unseen volumes of data every minute from zillions of Internet-enabled wearables, devices and industrial equipment all around the world. This data would have to be analyzed instantaneously for operational intelligence, predictive and prescriptive analytics and even national security. This would have major impact on data center operations.
The current paradigm of centralized operations means transmitting data over thousands of miles from point of data origination to central servers and then sending back to originating point and elsewhere, again over thousands of miles, and a few times over. This would require massive bandwidths and massive computing power. Instead we will see an emergence of a “glocal” model, with distributed data centers serving immediate local needs and then transmitting raw as well as processed data to centralized processing centers for deeper analysis for global consumption.
When this happens, it will bring about an overhaul of business processes and have a transformational effect on data center operations. Data Center operations will look like an intricate supply chain network, involving business partners, transfer pricing, and running transportation and cost optimization models. Data Center operations will be software-driven, even transcending what is currently defined within the boundaries of a Software Driven Data Center (SDDC), which is primarily about virtualization of computing resources. Data Center Infrastructure Management (DCIM) will be at the heart of this transformation, with a workflow-driven Business Process Management (BPM) layer that would auto-enable data traffic via a set of algorithms working on constraint-based planning.
The use cases of Internet of Things, from consumer efficiency to health care to transportation and even agriculture is fascinating. What is less written about is the profound impact this will have back-end on the Data Centers that would have to support this new era of a ubiquitous connected world.
In my last blog on this site I had stated: “Mapping the entire chain from application and the application’s business owner to the VM on which it is residing and all the way up in the power and network chain to the source of power (in small data centers mostly up to the UPS) and network routers and switches would introduce far greater financial savings that could be a blog topic by itself”.
I view DCIM Asset Management delivering in three different ways:
- Data Center Asset Database
- Data Center Asset Relationship Mapping
- Data Center Asset Lifecycle Management
In this blog I will focus on Data Center Asset Relationship Mapping and its potential ROI.
Data Center Asset Relationship mapping draws out the entire chain, provides the linkages as well as the hierarchy of all the IT and physical infrastructure assets in a data center. This starts from source of power right up to the VM Server. This mapping, sometimes referred to as power or network chain, can be depicted in a visual diagram which provides easy understanding of the cascading impact should there be a failure of any device in the chain. It also helps to understand the precise redundancies required to run a 24x7 operation within defined uptime levels. Enterprise-level DCIM Asset Management Software provides this Asset Relationship Mapping.
Next, where’s the ROI from this?
- Uncovering under-utilized or over-provisioned assets
- Recovering stranded capacities in data center assets
- Deferring or avoiding new data center construction/retrofit investments
Any of above can mean significant savings in capital costs and in cost of capital. We are not even going into the areas of potential savings in energy costs and opportunity costs of outages that can be prevented due to better visibility of the cascading impacts.
DCIM Asset Management, specifically Asset Relationship Mapping will be an important deliverable in our new wins at a High Performance Computing Center in a leading higher educational institute and in the data centers of a leading mobile operator in South Asia.
An often asked question: Is DCIM overkill for data centers that have less than 1000 square feet white space or those with connected power less than 500 KW?
This question comes up because of a commonly held myth that DCIM is only about power savings. The argument here is that monetary savings from reducing power consumption in a small data center would be minimal. While I would not dispute that, it may interest the CFO that the annual power cost in running this data center is approximately US$ 330,000 assuming PUE of 2.5 (typical of such a data center that has not undertaken any energy efficiency measures) and power tariff at $ 0.13 per kwh. Reducing the PUE to 2.0 would bring the annual power bill to U$ 260,000.00. While a small saving, I am not sure if $ 70,000 is totally irrelevant.
But DCIM tools are not just about cutting power costs, although that is indeed an important reason for larger data centers, and more so with multi-tenant data centers where over 40% of the operating costs are power related. In my mind, the fundamental reason why even a small data center should invest in DCIM software is for asset management, specifically asset relationship mapping. Mapping the entire chain from application and the application’s business owner to the Virtual Machine on which it is residing and all the way up in the power and network chain to the source of power (in small data centers mostly up to the UPS) and network routers and switches would introduce far greater financial savings that could be a blog topic by itself.
DCIM Asset Management’s relevance in a smaller data center is similar to the relevance of ERP in a smaller discrete manufacturing unit which may still have multiple levels of Bill of Material (BOM). Getting the BOM levels and the inter-relationships right is fundamental to lean manufacturing that brings about huge cost savings. It is also similar to the relevance of a CRM in a smaller Bank where customers may still have multiple banking relationships and CRM tracks that, enabling the Bank to provide better service and garner higher wallet share of the customer. A complete asset relationship mapping in a 1000 square feet data center, which may have as much as 500 inter-related devices, helps to avoid over provisioning (read wasted capital expenditure), better availability due to visibility (read avoiding costly failures) of the cascading impacts due to a device failure in the chain.
Just as an ERP or a CRM is universally adopted in a SMB enterprise for higher profitability, so also should DCIM find place in a smaller data center for lowering capital costs and mitigating the risks of a data center failure.
This was my first time at AFCOM’s Data Center World. Held in Las Vegas between 28th April and 2nd May, 2014 it was refreshingly different from other conferences as sponsor speakers could not advertise their wares during their speaking sessions. The sessions were aptly called “educational” as they provided an opportunity to hear some vendor-neutral versions of latest technologies. Of course, the hint was not lost when delivered by a vendor, but one was spared brazen advertisements. There were some outstanding educational sessions, including panel discussions on risk assessments, big data and of course also on DCIM. My session was on “Digital Services Efficiency: A New Management Scorecard.” The theme was how next-gen Data Center Infrastructure Management (DCIM) software would have role-based KPIs based on individual key result areas. The full version of the presentation is available here.
The exhibition floor displayed latest on power, cooling, security and fire suppression systems. That leads me to what I really want to discuss in today's blog.