Misplaced Pages

Artificial Intelligence for IT Operations: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 06:54, 29 October 2021 editSauer202 (talk | contribs)Extended confirmed users20,267 editsNo edit summary← Previous edit Revision as of 07:02, 29 October 2021 edit undoSauer202 (talk | contribs)Extended confirmed users20,267 edits Process: since we are talking about machine learning it is most likely meant to refer to 'Normalization (statistics)', but may also refer to 'Database normalization'Next edit →
Line 9: Line 9:


==Process== ==Process==
The normalized data is suitable to be processed through machine learning algorithms to automatically reduce noise and identify the probable root cause of incidents. The main output of such stage is the detection of any abnormal behavior from users, devices or applications. The ] data is suitable to be processed through machine learning algorithms to automatically reduce noise and identify the probable root cause of incidents. The main output of such stage is the detection of any abnormal behavior from users, devices or applications.


Noise reduction can be done by various methods, but most of the researches in the field points to these three actions: Noise reduction can be done by various methods, but most of the researches in the field points to these three actions:

Revision as of 07:02, 29 October 2021

Some of this article's listed sources may not be reliable. Please help improve this article by looking for better, more reliable sources. Unreliable citations may be challenged and removed. (March 2021) (Learn how and when to remove this message)

Artificial Intelligence for IT Operations (AIOps) is a term coined by Gartner in 2016 as an industry category for machine learning analytics technology that enhances IT operations analytics. AIOps is the acronym of "Algorithmic IT Operations". Such operation tasks include automation, performance monitoring and event correlations among others.

There are two main aspects of an AIOps platform: machine learning and big data. In order to collect observational data and engagement data that can be found inside a big data platform and requires a shift away from sectionally segregated IT data, a holistic machine learning and analytics strategy is implemented against the combined IT data.

The goal is to enable IT transformation, receive continuous insights which provide continuous fixes and improvements via automation. This is why AIOps can be viewed as CI/CD for core IT functions.

Given the inherent nature of IT operations being closely tied to cloud deployment and the management of distributed applications, AIOps has increasingly led to the coalescence of machine learning and cloud research.

Process

The normalized data is suitable to be processed through machine learning algorithms to automatically reduce noise and identify the probable root cause of incidents. The main output of such stage is the detection of any abnormal behavior from users, devices or applications.

Noise reduction can be done by various methods, but most of the researches in the field points to these three actions:

  1. Analysis of all incoming alerts;
  2. Remove duplicates;
  3. Identify the false positives;
  4. Early anomaly, fault and failure (AFF) detection and analysis.

Anomaly detection - another step in any AIOps process is based on the analysis of past behavior of users, equipment and applications. Anything that strays from that behavior baseline is considered unusual and flagged as abnormal.

Root cause determination is done usually by passing incoming alerts through algorithms that takes into consideration correlated events as well as topology dependencies. The algorithms on which AI are basing their functioning can be influenced directly, essentially by "training" them.

Use

A very important use of AIOps platforms is related to the analysis of large and unconnected datasets, such as the Johns Hopkins Covid-19's data published through GitHub. The data in this example is pulled from a large number of un-normalized databases - aggregated data (10 sources), US regional data (113 sources) and Non-US data (37 sources), which are unuseable considering the needed emergency response time by the traditional analysis models.

Generally, the main areas of use for AOIps platforms and principles are

References

  1. Jerry Bowles (January 28, 2020). "AIOps and service assurance in the age of digital transformation". Diginomica.
  2. "Algorithmic IT Operations Drives Digital Business: Gartner - CXOtoday.com". Cxotoday.com. Archived from the original on January 28, 2018. Retrieved January 28, 2018.
  3. "Market Guide for AIOps Platforms". Gartner. Retrieved January 28, 2018.
  4. "Comprehensive approach for Artificial Intelligence for IT Operations transformation" (PDF). Deloitte. Retrieved January 28, 2018.
  5. "ITOA to AIOps: The next generation of network analytics". TechTarget. Retrieved January 28, 2018.
  6. "An Introduction to AIOps". The Register. Retrieved January 28, 2018.
  7. "AIOps - The Type of 'AI' with Nothing Artificial About It - Dataconomy". Dataconomy.com. Retrieved January 28, 2018.
  8. "AIOps: Managing the Second Law of IT Ops - DevOps.com". devops.com. 22 September 2017. Retrieved 24 January 2018.
  9. "What is AIOps or Artificial Intelligence for IT Operations. Top 10 Common AIOps Use Cases".{{cite web}}: CS1 maint: url-status (link)
  10. Harris, Richard. "Explaining what AIOps is and why it matters to developers". appdevelopermagazine.com. Retrieved 24 January 2018.
  11. Masood, Adnan; Hashmi, Adnan (2019), Masood, Adnan; Hashmi, Adnan (eds.), "AIOps: Predictive Analytics & Machine Learning in Operations", Cognitive Computing Recipes: Artificial Intelligence Solutions Using Microsoft Cognitive Services and TensorFlow, Apress, pp. 359–382, doi:10.1007/978-1-4842-4106-6_7, ISBN 978-1-4842-4106-6
  12. Duc, Thang Le; Leiva, Rafael García; Casari, Paolo; Östberg, Per-Olov (September 2019). "Machine Learning Methods for Reliable Resource Provisioning in Edge-Cloud Computing: A Survey". ACM Comput. Surv. 52 (5): 94:1–94:39. doi:10.1145/3341145. ISSN 0360-0300.
  13. WISC.edu - International Conference on Service Oriented Computing
  14. Machine Learning
  15. Importing COVID-19 data into Elasticsearch
  16. UPC.edu - Top 10 Artificial Intelligence Trends in 2019
Category: