

This is a Wikibon Voice of the Community Report, sponsored by ExtraHop Networks Inc. Voice of the Community posts are identified paid posts that appear on all pages of SiliconANGLE.com, supporting editorial efforts.
Information technology infrastructure data can provide valuable intelligence for reducing business overhead, improving back-office and customer-facing processes and ensuring compliance with key mandates. However, automating the distillation and delivery of those insights grows increasingly difficult as the scale and complexity of the infrastructure and its operational data skyrockets. Enterprises are swamped with ever greater volumes, varieties and velocities of operational data streaming in from public, private and hybrid clouds. In many organizations, this never-ending data avalanche threatens to overwhelm IT operational management staff. Machine learning is a promising tool for assisting IT infrastructure managers in deriving data-driven insights in support of root cause analysis, workload management, performance monitoring and other key operational tasks.
IT infrastructure processes almost every interaction, transaction and communication in our lives, and much of that infrastructure has already shifted to the cloud.
Cloud IT infrastructure has become the integration fabric for every type of application. For IT operations personnel, sifting through all of that cloud-sourced data — relevant to services, applications, networking, storage, devices and other IT infrastructure–can be a daunting challenge. They must monitor and correlate data at every level of the infrastructure stack — including hardware, networks, middleware and applications—and use it to manage service levels in real time. Just as important, they must leverage that data for anomaly detection, situation response, remote diagnostics, root cause analysis, capacity management and many other applications.
How can cloud IT infrastructure managers harness infrastructure data to extract the insights needed to do their jobs at every level of necessary detail? More to the point, how can they do this in the face of growing “data friction.” This refers to obstacles that prevent infrastructure managers from extracting the operational insights they need to do their jobs effectively.
Data friction stems from the complexities that administrators encounter in the processes of collecting, aggregating, analyzing, and otherwise using this data to monitor, maintain and improve service levels. And it may be partly exacerbated by enterprise investments in underpowered analytics tools.
Analytics can be a key for reducing data friction and powering agile data-driven decision support in many operational scenarios. With infrastructure wire data, the key task-focused insights may remain beyond infrastructure managers’ reach if they merely try to view them using simple queries, pre-existing reports and dashboards, and other standard analytic views. For deeper insights into more complex infrastructure issues, managers may need to call on more sophisticated real-time, historical and predictive analytics.
To explore these issues, Wikibon convened a Voice of the Community CrowdChat to discuss data friction and the techniques — such as machine learning – for addressing it proactively. For an hour in September 2017, 58 industry experts who are part of the Wikibon community met online to discuss this topic. The conversation comprised 226 collaborative observations that generated close to 1 million customer impressions on social-media channels.
What follows are key findings that were surfaced in the CrowdChat:
*******************************************
What is a CrowdChat?
CrowdChat is a community engagement tool used by Wikibon to research innovation. A CrowdChat brings together – online – experts in a domain to discuss complex technology, social, and business issues. Wikibon posts questions to these experts, which catalyzes a bloom of conversational interactions about the subject. Wikibon analysts then combine these interactions with other research sources to develop the findings that we publish in a Voice of the Community research paper.
*******************************************
Wikibon will now drill down into the free-form comments and poll results that were produced in this CrowdChat, revealing the Voice of the Community on particular issues and opportunities.
… on growth in the volume of infrastructure data
Infrastructure management data is growing rapidly. In the CrowdChat, participants cited annual growth rates in infrastructure data volumes of between 100 and 500 percent as typical in many organizations. Much of this growth stems from the expanding scale and complexity of the infrastructure from which the data is being sourced and in which it’s being consumed. Another key factor is the fact that more applications have data and analytics at their heart. Yet another growth factor is the expansion of edge computing, especially the Internet of Things, which is a non-stop source of data sensing, capture, and inference.
As regards the growing scale of infrastructure data, noteworthy comments from the community included:
Figure 1 presents a poll of CrowdChat participants on the subject of infrastructure data growth:
Figure 1: CrowdChat Poll: How rapidly is your enterprise infrastructure management data growing in volume?
… on the ease of extracting value from infrastructure data
Data growth challenges infrastructure managers to find golden needles in unfathomably massive haystacks.
In the CrowdChat, participants repeatedly commented on the fact that infrastructure data’s expanding scale, scope, and speed make it difficult to drill through it all for task-specific operational intelligence. In addition, the ever-expanding range of data-driven decision support scenarios challenge infrastructure managers to implement analytics that are agile enough to address future query, reporting, dashboarding and other decision-support use cases.
As regards the ease of distilling actionable insights from infrastructure data, noteworthy comments from the community included:
Figure 2 presents a poll of CrowdChat participants on the subject of ease of extracting relevant and actionable insights from infrastructure data:
Figure 2: CrowdChat Poll: Is it getting harder or easier to extract value from your infrastructure data?
…. on whether infrastructure data has become a strategic asset
IT managers are well aware of the promise of data-driven infrastructure intelligence.
In the CrowdChat, participants agreed that there’s strategic decision-support potential locked up in infrastructure data, but the requisite analytic and data management tools and analytics are largely lacking from their operations. A key consideration is the extent to which organizations can create assets — such as machine learning and predictive analytics models — that are derived from infrastructure data and can be reused flexibly to address future decision support and automation requirements for IT operational management.
As regards infrastructure data’s strategic potential, noteworthy comments from the community included:
Figure 3 presents a poll of CrowdChat participants on the subject of the strategic business value of infrastructure data:
Figure 3: CrowdChat Poll: Is your infrastructure data becoming more of a strategic asset or overhead burden?
… on whether the IT organization is a facilitator in gaining value from infrastructure data
IT managers struggle against constraints that prevent them from evolving their operational analytics.
In the CrowdChat, participants generally agreed that IT professionals would generally embrace better infrastructure data analytics, but are often hamstrung by constraints in staffing, skills, tooling, budget and other areas that deter or discourage them from taking the necessary next step. Though IT professionals take pride in their data-driven ingenuity, they also generally operate as a cost center that makes the best of whatever resources (albeit limited) are at their disposal.
Figure 4 presents poll of CrowdChat participants on the subject of IT managers’ commitment to maximizing the operational business value of infrastructure data:
Figure 4: CrowdChat Poll: Is your IT organization a facilitate or obstacle to gaining value from your infrastructure data?
… on the extent to which infrastructure data-driven insights permeate their organizations
IT managers have limited experience using sophisticated decision-support tools.
As discussed in the CrowdChat, data-driven decision making tends to be limited to a senior cadre of business analysts in most organizations, though acknowledged widely as an aspiration. Infrastructure data can be fearsomely hard for users to find, aggregate, and analyze, though machine learning can be useful in distilling relevant patterns and correlations. Of course, this situation parallels the meager adoption of operational business intelligence tools by knowledge workers in general, which consistently is limited to 5 to 10 percent of organizational staffs.
As regards IT management groups’ use of infrastructure data-driven insights, noteworthy comments from the community included:
… on the ease of finding infrastructure data and associated analytics
IT managers have limited experience using sophisticated decision-support tools.
As discussed in the CrowdChat, data-driven decision making tends to be limited to a senior cadre of business analysts in most organizations, though acknowledged widely as an aspiration. Infrastructure data can be fearsomely hard for users to find, aggregate and analyze, though machine learning can be useful in distilling relevant patterns and correlations. Bear in mind that the same can be said for big data in general, especially as it grows in volume, velocity and variety.
As regards the ease of discovering relevant infrastructure data and analytics, noteworthy comments from the community included:
Figure 5 presents a poll of CrowdChat participants on the subject of easy of finding infrastructure data and associated analytics:
Figure 5: CrowdChat Poll: How easy is it for user to find the infrastructure data & associated analytics they need?
… on the veracity of infrastructure data
IT managers have varying confidence on the quality of their infrastructure data.
As discussed in the CrowdChat, IT managers, like users in general, have varying degrees of confidence in the correctness, consistency, freshness, and other quality metrics surrounding their infrastructure data. Confidence in the data — hence in any data-derived infrastructure insights — depends on the extent to which the manager has visibility into the data’s provenance, handling and lineage. When it comes to “wire data” — in other words, raw data that streams direct from the applications and infrastructure nodes being managed — confidence tends to be high, due to the fact that it is perceived to be trustworthy (albeit raw and therefore difficult to interpret without further analytics).
In this regard, noteworthy comments from the community included:
*******************************************
What does the Wikibon community say about CSPs and innovation?
“Very high confidence in highly stylized data. Modest confidence in correlated data. Limited confidence in data that ties to business activity.”
*******************************************
Figure 6 presents a poll of CrowdChat participants on the subject of confidence in the veracity of infrastructure data:
Figure 6: CrowdChat Poll: How much confidence do users have in the veracity of infrastructure data?
… on using machine learning to unlock insights from infrastructure data
Machine learning, as a tool for surfacing IT infrastructure data insights, is embryonic.
As noted in the CrowdChat, IT infrastructure managers’ operational analytics have barely begun to incorporate machine learning for incident monitoring, anomaly detection, root cause analysis, predictive maintenance and other critical data-driven inferences. One key reason for this is that commercial IT operations management tools have only recently begun to incorporate machine learning. Another factor is that enterprises that have invested in IT operations management tools may lag considerably in upgrading or migrating to solutions that benefit from machine-learning-driven analytics.
In this regard, noteworthy comments from the community included:
Figure 7 presents a poll of CrowdChat participants on the subject of whether ML has made it easier to unlock insights in infrastructure data:
Figure 7: CrowdChat Poll: Has machine learning made it easier to unlock insights from your infrastructure?
Data friction stems from the complexities that administrators encounter in the processes of data ingestion (e.g., discovery, collection, transformation), governance (e.g., aggregation, cleansing, version control) and consumption (e.g, delivery, analysis, visualization, contextualization, anomaly detection, alerting). Increasingly, Wikibon sees infrastructure managers applying the most sophisticated machine learning, deep learning, and artificial intelligence tools for operational decision support. These technologies can help to elucidate the most noteworthy correlations, anomalies and other insights using statistics-based approaches such as artificial neural networks, support vector machines, and cluster analysis.
Wikibon recommends that IT managers use the following multifaceted approach for deriving actionable insights from their infrastructure data:
THANK YOU