Günther Niederreiter
11. February 2014 0

Detecting incidents before they occur

Not only does big data involve analyzing huge amounts of data, it can also be used to make predictions that will help T-Systems reduce the number of incidents and continue to improve the quality of services.

The employees in the Service Center are working feverishly and coordinating all of their efforts. A customer line from Munich to New York is being interrupted sporadically, and the connection is in danger of breaking down completely. The customer is unaware of this, but it won’t take long before the outages become apparent. A module in the switching node in Munich is faulty. And just yesterday the new release was installed. Everything was running smoothly until then.
In my conversation with Gerhard Keller, Head of Production Support Systems and Project Manager for Big Data in the Quality Area, I learn that situations like this one are not uncommon: New releases are deployed, everything is stable, yet after a while an outage can occur.

Troubleshooting through review

I also learn that statistical analyses based on historical data are conducted as a way to determine correlations during such incidents. These statistics are quite precise and enable experts to make forecasts about potential future incidents. This is the approach taken by T-Systems when intensively analyzing major problems, known as critical and high incidents, to discover weak spots that could negatively impact the service delivered to customers. For example, an analysis can reveal whether issues arose due to previous changes. However, only some of the available data is isolated and actually analyzed.

Big data thinks ahead – predictive analytics

Today big data can help service personnel look into the future: Several data sources are compiled and subjected to an integrated analysis in real time – for example, outages and symptoms, also known as incidents and events, can be analyzed thoroughly. At T-Systems all the incidents and events from the past two years are analyzed, and this unstructured data is then mapped. Thanks to specific analysis tools, this data can be correlated for the first time. All it basically takes is a click of the mouse to analyze incidents based on the age of the hardware, the manufacturer, operating systems, time of day, occurrence during vacation time and data center location.

Results will improve quality

This big data approach is scheduled for implementation in Delivery and will be productive before the end of 2014. The next step will be to include order management data and unstructured information from relevant log files in the analysis.

The big data early warning system

This method will uncover hidden spatial, temporal and even logical patterns among incidents and events so that an early warning system can be established to prevent incidents from taking place. Correlations between incidents and the age of the hardware or the manufacturer can be identified as well. This information can then be used as the basis for investment decisions as to when hardware needs to be replaced or which manufacturer provides the most stable hardware and operating system combination.
In the case of the line between Munich and New York, this approach would have indicated – in advance of the new release – that potential problems or channel fluctuations could be expected. And this would have enabled engineers to take steps in advance to prevent the incident from actually happening.

Let’s communicate big
Günther Niederreiter

Thanks go to Gerhard Keller, Head of Production Support Systems and Big Data Project Manager in the Quality Area

Leave a Reply

Your email address will not be published. Required fields are marked *

By sending this comment you accept our comment policy.

a) Blog visitors are always invited to comment.

b) Comments are supposed to increase the value of this weblog.

c) Comments will be activated only after validation.

d) Comments which do not relate to the topic, obviously violate copyrights, have offensive content or contain personal attacks will be deleted.

e) Links can be inserted to the comment but should refer to the topic of the blog post. Links to other websites or blogs which do not refer to the posting will be considered as spam and will be deleted.



tsystemsCom @tsystemsCom
T-Systems  @tsystemsCom
Let's take a quiz: What do you remember about #WannaCry? @BackofenD about the need of a comprehensive immunization… https://t.co/jZmL5zeSTr 
T-Systems  @tsystemsCom
#Digitisation also holds great promise in Public #Healthcare but fundamental challenges be overcome to create the r… https://t.co/o82LNVq9dt 
T-Systems  @tsystemsCom
#DigitalTransformation is front of mind for many senior executives, but too often #security is left behind, knows… https://t.co/7INkOONQca 
T-Systems  @tsystemsCom
Making the most from your #cloud: 10 best practices recommended by cloud computing experts and IT managers who are… https://t.co/MLF3xxrVDu 
T-Systems  @tsystemsCom
The Internet of Things needs your #Mobile: #IoT can’t connect the next billion until we reinvent mobile @SAP:… https://t.co/WxDHNUPfZP 
T-Systems  @tsystemsCom
Seven techniques that will help create natural project checkpoints, stakeholder feedback loops, and system adjustme… https://t.co/KowaYzEy4W 
T-Systems  @tsystemsCom
Big savings with the #cloud: Just being in the cloud costs about the same as for 500 workloads as it does for 2,000… https://t.co/kWygNZYJdf 
T-Systems  @tsystemsCom
Digital twins are the next innovation in manufacturing and by 2020 at least 50% of big manufacturers will have a… https://t.co/oS4ZAbQPVw 
T-Systems  @tsystemsCom
#Blockchain and your enterprise: There are four areas to consider when deciding if your business is ready for block… https://t.co/Gg99AqmudS