Misplaced Pages

Artificial Intelligence for IT Operations

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.

This is an old revision of this page, as edited by 49.207.202.150 (talk) at 11:55, 16 September 2021. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Revision as of 11:55, 16 September 2021 by 49.207.202.150 (talk)(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)
Some of this article's listed sources may not be reliable. Please help improve this article by looking for better, more reliable sources. Unreliable citations may be challenged and removed. (March 2021) (Learn how and when to remove this message)

Artificial Intelligence for IT Operations (AIOps) is a term coined by Gartner in 2016 as an industry category for machine learning analytics technology that enhances IT operations analytics. AIOps is the acronym of "Algorithmic IT Operations". Such operation tasks include automation, performance monitoring and event correlations among others.

There are two main aspects of an AIOps platform: Machine learning and big data. In order to collect observational data and engagement data that can be found inside a big data platform and requires a shift away from sectionally segregated IT data, a holistic machine learning and analytics strategy is implemented against the combined IT data.

The goal is to enable IT transformation, receive continuous insights which provide continuous fixes and improvements via automation. This is why AIOps can be viewed as CI/CD for core IT functions.

Given the inherent nature of IT operations being closely tied to cloud deployment and the management of distributed applications, AIOps has increasingly led to the coalescence of machine learning and cloud research.

Process

The normalized data is suitable to be processed through machine learning algorithms to automatically reduce noise and identify the probable root cause of incidents. The main output of such stage is the detection of any abnormal behavior from users, devices or applications.

Noise reduction can be done by various methods, but most of the researches in the field points to these three actions:

  1. Analysis of all incoming alerts;
  2. Remove duplicates;
  3. Identify the false positives;
  4. Early anomaly, fault and failure (AFF) detection and analysis.

Anomaly detection - another step in any AIOps process is based on the analysis of past behavior of users, equipment and applications. Anything that strays from that behavior baseline is considered unusual and flagged as abnormal.

Root cause determination is done usually by passing incoming alerts through algorithms that takes into consideration correlated events as well as topology dependencies. The algorithms on which AI are basing their functioning can be influenced directly, essentially by "training" them.

Use

A very important use of AIOps platforms is related to the analysis of large and unconnected datasets, such as the Johns Hopkins Covid-19's data published through GitHub. The data in this example is pulled from a large number of un-normalized databases - aggregated data (10 sources), US regional data (113 sources) and Non-US data (37 sources), which are unuseable considering the needed emergency response time by the traditional analysis models.

Generally, the main areas of use for AOIps platforms and principles are

  • Automation of tasks (DevOps)
  • Machine Learning Platforms
  • Augmented Reality
  • Agent-Based Simulations
  • IoT (Internet of Things)
  • AI Optimized Hardware
  • Natural Language Generation
  • Streaming Data Platforms
  • Conversational BI and Analytics

References

  1. Jerry Bowles (January 28, 2020). "AIOps and service assurance in the age of digital transformation". Diginomica.
  2. "Algorithmic IT Operations Drives Digital Business: Gartner - CXOtoday.com". Cxotoday.com. Archived from the original on January 28, 2018. Retrieved January 28, 2018.
  3. "Market Guide for AIOps Platforms". Gartner. Retrieved January 28, 2018.
  4. "Comprehensive approach for Artificial Intelligence for IT Operations transformation" (PDF). Deloitte. Retrieved January 28, 2018.
  5. "ITOA to AIOps: The next generation of network analytics". TechTarget. Retrieved January 28, 2018.
  6. "An Introduction to AIOps". The Register. Retrieved January 28, 2018.
  7. "AIOps - The Type of 'AI' with Nothing Artificial About It - Dataconomy". Dataconomy.com. Retrieved January 28, 2018.
  8. "AIOps: Managing the Second Law of IT Ops - DevOps.com". devops.com. 22 September 2017. Retrieved 24 January 2018.
  9. "What is AIOps or Artificial Intelligence for IT Operations. Top 10 Common AIOps Use Cases".{{cite web}}: CS1 maint: url-status (link)
  10. Harris, Richard. "Explaining what AIOps is and why it matters to developers". appdevelopermagazine.com. Retrieved 24 January 2018.
  11. Masood, Adnan; Hashmi, Adnan (2019), Masood, Adnan; Hashmi, Adnan (eds.), "AIOps: Predictive Analytics & Machine Learning in Operations", Cognitive Computing Recipes: Artificial Intelligence Solutions Using Microsoft Cognitive Services and TensorFlow, Apress, pp. 359–382, doi:10.1007/978-1-4842-4106-6_7, ISBN 978-1-4842-4106-6
  12. Duc, Thang Le; Leiva, Rafael García; Casari, Paolo; Östberg, Per-Olov (September 2019). "Machine Learning Methods for Reliable Resource Provisioning in Edge-Cloud Computing: A Survey". ACM Comput. Surv. 52 (5): 94:1–94:39. doi:10.1145/3341145. ISSN 0360-0300.
  13. WISC.edu - International Conference on Service Oriented Computing
  14. Machine Learning
  15. Importing COVID-19 data into Elasticsearch
  16. UPC.edu - Top 10 Artificial Intelligence Trends in 2019
Category: