logo

NJP

What's new in the Washington DC release: ITOM Health

Import · Mar 21, 2024 · video

hello everyone um welcome to the call today thank you for staying on while there's been no intro music I do apologize for that but on that note I hope you're still all excited to learn about what is new this quarter in itom AI Ops so you may have noticed that we are referring to it as itom aiops because it is the solution that we offer through the specific product of itom Health if anyone needed some more clarification and with that explanation I just want to introduce myself for today my name is Victoria low and I am one of the outbound product managers for itom and I focus specifically on itom AI Ops and I'll be going through the session with you today with my um team member Jason Smith also supporting me here in the call so if you have any questions that may need to be answered in the Q&A module then he will most likely be the one to answer it because for me to speak in type at the same time I'm not very good at that so thank you Jason and I will be joining you today from Toronto Canada it's kind of cold up here and it was snowing this morning and I hope spring is actually coming soon and Jason will be joining us from Stockholm Sweden for anyone else who is in the area and we will be covering itom aops today but tomorrow at the same time we'll be having a webinar session for iton visibility as well so make sure you're already signed up for that if you are not already so let's get started so in case we make any slightly forward-looking statements um The Safe Harbor notice supplies here and you can read it below or in the slide right here so yes um but there likely be none because this is mostly just the webinar focused on what's new for this quarter that being said if you're interested in joining us for any of our future webinars please see the schedule here you can sign up with your QR code um quickly scan it right now but also you could always just go to live on service now on the community to look at all the other workshops that or sorry webinars that are available and I would like to note that as of last week we did have to just switch the webinars for itom AI Ops and itom visibility just because of some schuling conflicts so I hope you do stay for this webinar but in any case the other webinar for iton visibility will be tomorrow and I just want to go through some housekeeping so we save time at the end of this call for Q&A but please use the Q&A button at the bottom of your screen along the way so that Jason Smith or maybe me if we have some time can answer the questions throughout the session the presentation itself will also be recorded and shared on the service now community so in the case where you want to send this out to any of your other team members who may have not been able to join this call then you can do so by sending the link to them after the session and also after this event you will be prompted to fill out a short survey and we would really appreciate it because your feedback matters to us so thank you in advance for that so the agenda for today is that I'll be going over the product Evolution slide of itom AI Ops then I'll be going through several overviews of what we came out with in the February release as well as what's come out in the March store release which was just on March 7th and then I'll also be going over the end of life and sunset capabilities then I'll go more into depth about each of the release items from February and March and then I'll discuss some research opportunities that you could sign up for if you're interested and then I'll also be giving you some information about upcoming itom AI Ops events and then after all this we will have some time for Q&A so if you have any questions that aren't answered yet throughout the session then we could address them then so I have a few polls set up for this session and we would really appreciate if you could answer them so we could just get a gauge on how you use AI Ops and I can slightly change the um the way that I speak about some things depending on how mature some of you are with the product so I'll close the poll in a few more seconds I can see that some of you use it yourselves but more or less your organization uses it or you may not be using it so in the case that you're not using it I think that a lot of the capabilities that we've come out with in the past quarter will be very interesting to you and most likely very helpful to you so please let us know if you have any additional questions or if you want to reach out to learn even more about the product you can reach out directly to Jason Smith or ey at victorial servicenow.com or jason. Smith now.com I'm just going to end the poll right now and then we will move on with the rest of the presentation so here we can see there's the product Evolution slide you can see how we've evolved on itom aops from Rome all the way to Washington DC and how we have slowly been prioritizing different areas of aiops throughout each story I mean sorry throughout each um release and at this time if you're raising your hand I will not be able to answer because I just want to make sure we get through all the content so at the end of the call we will make time for discussion so just keep your hands down for now and we'll address it after if anything you can also put your questions in the Q&A and Jason Smith can help answer it while I am presenting so in Washington DC we had a few items that came out in the February store release around metric intelligence specifically and an accm check while in the March store release we focused on enhancements of the aops experience including the express list and Integrations Launchpad and we came up with a few items in The Innovation lab which include link View and alert automations we also have service reliability management or otherwise known as srf which is currently in control go to market and there's also analysis for itom alert simplification which went GA on March 7th which is our use of geni in itom AI Ops and I'll will go more into depth about all these items in this Washington DC column shortly so here is an overview of the February family store release features so we have made an update to the ux of metric rules previously known as static thresholds and I'll talk about that later and then we also have released in the store the Azure accm check making it easier for you to monitor your Azure VMS without having multiple checks you can monitor thousands of EMS with just one check instead as for the March stor leas we have the alert simplification like I mentioned hasone GA some enhan enhancments for Express list link views in Innovation lab there's also alert automations which is an innovation lab and an integration Launchpad enhancement as well as again service reliability which is currently in control good Market you'll have access to this um presentation after so in the case where you want to reference any of these you can look at the recording so here is also an overview of end of life and sset capabilities so Cloud native operations the store app is reaching Sunset capability where the capabilities for visibility will be available through accv but currently we do not have a direct replacement for any of the CNO for monitoring capabilities as well as the operator workspaces reaching sunset in Washington which is the release that just passed or that has come out today and if you're not already on the service operations workspace please start the migration because we are no longer um going to be enhancing the operator workspace but it will still be supported until it is completely end of life and then there's also the event management mobile app which will be passing in xanadoo which is later this year and we may have a replacement coming later this year as well as for end of life capabilities there are the metric intelligence connectors and it's already passed in Vancouver but in the case where some of you may have not yet upgraded to Vancouver I just want to let you know in case you're using the connectors for scom at this time and we hopefully will be releasing some new scal connectors in the future as well so I'll be going over the February store leas feature overview now and starting off with the metric rules ux update so previously when you had to set up these static thresholds it was on a legacy ux where it definitely did not look like this and on a previous module that would have been called Static thresholds but now we provided a new experience for you to set up your static thresholds for metric intelligence which is one of the applications of itom Health which is the product that we offer the our itom a op solution for and through this new ux we have a guided experience now and in any case if anyone is currently using deck there is a shared unified experience between these two applications for metric rule setup and I would like to note that Vancouver patch six is required for this but I will now go through a few more slides which will show you just how straightforward the setup experience is so right now we can see the module that you would see if you open metric rules and in the case that you are opening the new module when you select new metric rule or an existing metric rule you'll then be brought to this page to create a new metric Rule and now you can see that there's three specific steps that we guide you through to create it so first we ask you to select which CI class that you want to apply the metric rule to so this is where we will look at the specific CI classes metrics in order to compare it to the rule that you're setting right now so you select the CI class and then you can select additional filtering to find which specific CI in that class should this rule apply to and then at the point when you've defined this first criteria then you set up the alert criteria yourself so you set which specific value should be exceeded or um it should fall under for a specific alert to be triggered and you can also select the alert severity and you can also add additional alert severities for different ranges for the comparison operator if you have various values as well you can also choose exactly when the alert should be triggered based off of the breaching criteria which can be adjusted in the area down here and we made it even easier for you to understand when alerts will be opened and closed through this text box right here explaining that in this specific example the alert will automatically automatically be cleared when the node CPU value is greater than or equals to 100 and then at this point when you've selected all the criteria for the specific CI and when the alert should be triggered then we have the name setup so once you put the name and make sure that it is active rule then when you press finish it'll automatically be activated now moving on to the Azure accm check enhancement so previously you would have to have run multiple checks in order to monitor your various Azure VMS but now we're using a Azure batch API where we can now monitor thousands of VMS with one check as opposed to having to set up multiple checks for each VM itself and we're running this by using the resource ID from cloud Discovery and if you wanted to use this new check just make sure to update your metric intelligence Store app and then there is the March store leas feature overviews so starting with NIS for itom we have the alert simplif ification which is gon GA and this is the first capability that we're releasing as NIS for itom as part of itom AI Ops and we can see here that now we have an additional option in the express list where when an operator is looking at Express list and they may see a description that they may not be familiar with and in any case maybe they're a less experienced operator or a new person to the organization it can be hard for them to understand all the alerts that are being triggered by an organization's various monitoring tools right when they get started so we're making this easier and more approachable for these types of personas by leveraging generative AI using our internal now large language model instead of a thirdparty llm where through analyzing the short description that is just generated by your various monitoring tools we will generate a alert simplification where we provide to you a summary of the alert itself as well as we go beyond that and provide an alert analysis and by doing this we enable operators to more easily understand the alerts without having to reassign it or potentially escalate it which would drive down meantime to resolution while enabling them to just remed mediate themselves through the inert alert analysis which we provide information on exactly what may have triggered the alert in the first place as well as useful next steps for how you should continue monitoring or understanding how to fix the alert itself and this will actually be part of one of the new licenses that we have which will be part of the Pro or enterprise Enterprise Plus so if you are interested in this please contact your account teams um if you wanted to use Alert simplification in your existing itom instances and I just want to note as well that Vancouver patch 7 is the minimum requirement to use this new capability that we're offering and I just have a quick recorded demo of alert simplification which I'll just play right now and also talk through so right now we can see that this is actually me I I recorded this demo um we are on the express list which is the live alert list that is in the service operations workspace and for those who aren't familiar with the service operations workspace it is a shared workspace between itm and iton personas where they can work more closely together while being in the same workspace while having the specific spefic modules that apply most to them so here in the express list we can see now that I'm clicking on a specific alert that maybe I don't understand so much and I'm looking at the description and it doesn't really under make much sense to me and all the other information that we provide also isn't really helping me that much so what's the next best option since I have alert simplification I can just simplify the alert so now I can see that the summary is broken down of what on Earth the description is saying in the first place so I can more easily understand it and then I can see that there's also the alert analysis right below it which provides additional information as to what is happening in the alert itself and explaining more on what the addition what the original cause could have been which could have been that the node is under heavy load and maybe struggling to perform its tasks efficiently as well as the recommended steps to take which is investigating the cause of the high load and then taking the appropriate measures to reduce it such as optimizing resource allocation scaling the application or adding additional resources so through all this information as an operator I would now know what to do in order to remediate this alert without having to contact or reassign this alert to other members on on my team and that is also it for um itom alert simplification but moving on to express list enhancements so again Express list is the live alert list that we offer as part of itom aiops and we can see here that we've already released this specific list back last year in the q1 release of last year but now we're adding even more features so one of the first enhancements that we have is predefined team views so now for various users or groups you can set up predefined team views in the administration for event management where for alerts that satisfy specific conditions you can create predefined filters for the specific users and teams that you want to assign this filter to so that when they open Express list themselves they will have the options for whatever filters you've assigned for them so you can see right here that the Delta team is one of the predefined filters that we have or that you could set up and these are all the other filters that may have been assigned to the specific user at the time so we're making it easier for teams to manage their Express list while not having to constantly tell their own teams what's being changed in the scope of maybe what their monitoring you can update the predefined view from the admin console and you won't have to communicate it with your team it'll just be updated for everyone that the predefined view is assigned to and then another enhancement that we have is that for alert information in the express list side panel we've added more information so we can see now that we have additional information including alert Trends so so we can see that alert Trends include information about the Sim other similar alerts that have happened recently as well as if this exact alert has happened recently as well this enables operators to have access to more useful information that will help them in remediating the issue faster driving down again meantime to resolution by having access to other alerts that may have already been resolved in the past and then we're also providing additional information regarding probable root cause such as changes other related changes other related alerts incidents or problems or anything else that may be related to the alert that's been opened at this time so this is the second enhancement for Express list and then I'll be moving on to the Integrations Launchpad enhancements so Integrations Launchpad is our low code no code solution to integrating monitoring tools into itom so for the itom health specific product of event management you need to integrate your monitoring tools into service now so that we can receive all the events that are generated by your various monitoring tools that may send over events they may send over logs they may send over metrics but you all need to connect these tools to service now so that we can start processing them and creating alerts where necessary so we have already come out with a custom web hook option because we do have some out of thebox guided setups for very common monitoring tools but there are some that we haven't created out of the box options for yet and in those cases for monitoring tools that generate events we have the the custom web hook option and now we made it e even easier to set up this custom option where after we provide you the or you provide service now the necessary details to set up it initially after we get all the information that's needed we have the option to either gather real Source events from this new source so that you can map it from fields from the monitoring tool to the service now standard alert alert fields or the option for using a Json payload to map the fields is also an option but there is sometimes the roadblock of where you may not have access to this Json payload of the original monitoring tool source so we made it even easier by enabling you to gather some sample events essentially so that you can do the mapping without having to copy and paste the Json payload from The Event Source itself and the other steps of the guided setup are still the same after you gather the data that you need then you just do the field mapping from the monitoring tool fields to the standard service now alert Fields then we have link view which is part of the Innovation lab so link view is currently in the Innovation lab like I said so it will be in the service now store but you just have to navigate to The Innovation lab section and Link view is a topological view of alert groups themselves specifically for tag based alert groups to help you understand the alert blast radius of an alert group where you can easily see all the relevant information for you to understand the impact of this alert group on the greater scope of your service now environment so through understanding the additional information that's connected such as related IP addresses the related metrics or the configuration items as well as node or the source you can better understand the impact of the alert group on the greater area of your service now environment that you have set up so you don't necessarily need a mature cmdb to leverage this because we are supporting this through alert tags but it is an option um in the future most likely to also leverage um this for if you do leverage cmdb based grouping and there's also the compatibility which is quite important so you will need just leave Vancouver Family release in order for you to get a look at how link view works in your own environments so this will be again through the link view app and then there's also alert automations which is in Innovation lab so we are currently centralizing a lot of the common features or capabilities of itom Health's event management into the service operations workspace so one of the areas that we're centralizing or one of the capabilities that we're centralizing in the service operations workspace is alert aler automations so one of the alert automations is adding business contexts otherwise known as enriching alerts so we're making it easier for you to extract alert fields which would have been otherwise through going to the event rules module or the event field mapping module and now you can do it all in the service operations workspace yourself without having to navigate to several different modules in order to further enrich your alerts that you have coming into your service now environment you can also create alert tags now through this as opposed you going to the tag base um alert or tag based alert correlation engine which is what the um module is currently called you can just create the alert tags through this now more easily then there's also the alert grouping which is now being centralized into the service operations workspace so as opposed to having to navigate previously to the um alert grouping rules you can now do it and set those rules within the service operations workspace instead of having to navigate to a separate module you can do it all here so you can also easily see all the existing rules that you also have so that you can also reference them before you create potentially new automation or just see what you have currently in your environment that is active or not active so this for just a bit further information we do support various types of grouping so for those who may not be as familiar with alert grouping rules we do support grouping for alerts with common fields common tags CIS and more and the grouping can be based on exact match fuzzy match which can mean you can set a percentage of 50% 80% fuzzy match or pattern matching as well and then we also have included common Alert escalations in alert automations where we took the time to understand common questions or S sorry not common questions common escalations of customers and what they want to do with alerts and we made it easier for you to easily create incidents automatically from alerts or to send emails directly from alerts that satisfy specific conditions and I did want to note that we will have time at the end for live Q&A so I just want to make sure that I can make it through all the content before um answering additional questions myself and please put any questions you have in the Q&A we do have Jason Smith ready to answer answer anything you want to ask there and I just want to note that this is again is part of innovation lab and Washington patch one is required and to finish off the what's new content for this quarter there's also service reliability management currently is it it is in a controlled go to market release but we do plan to go ga later this year and in SRM which is a much easier way to save service reliability management we're providing self-service tools for distributed teams whether it is an Sr team or whether you may just have distributed teams that manage different application services or monitor different um services in general we're giving them more self-service options to manage their Integrations manage their specific alerts manage their event rules manage their event field mapping rules and so on as well as their sis and slos and air budgets so that they do not have to rely specifically on an admin to set up all these rules for them so all the features that I showed previously in alert automations this will be able this will be leverageable for these other members of teams if they are distributed if you're utilizing SRM as well and with all that information I did want to understand of all the things I showed you which were quite a few I wanted to know what you are most excited about and what you think that you may be able to implement first of all these things that we've talked about today and while you answering that and while I'm taking a short speaking break I'll just quickly move on to the iarm research opportunities that we have so with itom we are constantly looking at how we can improve our products or build products that are better to support you and we have three current areas and actually some more but these are the ones that apply most to the itom aop solution which is provided through the itom health application or itom health product if there was any confusion there and the three Focus areas that we have right now are the operator Excellence admin time to value and distributed teams so so operator Excellence is focused on making it easier for operators to analyze alerts and resolve them and the specific people we want to Target or if you are interested and you fit this Persona are knock operators and L2 responders and then there's also the admin time to Value which is focused on speeding up time to setup and optimize event management so that is why we are focusing on centralizing a lot of the core cap abilities into the service operations workspace and if you are a centralized monitoring team orme um we are interested in talking to you as well as the final um research opportunity but there's obviously some more because we have a greater product than just itom health is distributed team so focusing on operational working flows and team collaboration for especially teams that may not work directly together but work on infrastructure that relates to each other these are the specific areas that we want to focus on and the personas for this one are going to be subject matter experts in areas such as application infrastructure and technology that engage with monitoring and observability tools so if you have any additional questions about this you can contact ofer at his email down here he is the staff ux researcher but in any case if you are interested in signing up for research I am putting the link in the chat right now otherwise you can reach out to ofer right here if you're interested in learning more or if you don't want to sign up right away um that is also fine but make sure to reach out to ofer in case you wanted to know any more information before signing up for you search and I'm just going to share the results of the poll as well in case anyone was interested and you may also see Austin in the chat Austin Lazarus so in the case where you want to learn a bit more she's also put a bit more information about the research opportunities as well and then last but not least before we move on to Q&A and discussion we are having some upcoming itom aops events so for those of you who are in America we have some in-person workshops coming up in San Diego and New York City in April and those are the dates right there if you're interested in signing up I have put the links to sign up in the chat to register and these workshops will be in person so you will have to be there in person we will not be offering these remotely and more cities are to be announced we will be having some workshops as well in Europe Jason Smith will be hosting those so if anything if you're interested in workshops in Amia and you are not based in the US please contact Jason Smith servicenow.com I also put his email in the chat as well and then we'll also have a what's new webinar in Q2 in June we have not yet scheduled that but please keep in touch and stay tuned for when we do post the registration link for that as well and like I said we've already posted the registration Link in the zoom chat in case you are interested so yeah it looks like questions have slowed down to the point when they have stopped now so I I think I'm going to end the webinar very slowly in case anyone else wants to slip in a last minute City suggestion or question oh I also see a question about from anit about aiops and observability um we have things coming out within the next year planned that would help unify aops in observability but it is in development but please come to the June webinar to learn more about how we are better unifying AI Ops and observability so that is the answer I'll give you for now and there's a chance that we may also have observability Cloud observability joining us for our next what's new session or maybe hosting another session separately for them so just also stay tuned for that thank you for all the questions anit you have asked many today which I've enjoyed engaging with

View original source

https://www.youtube.com/watch?v=Uo4RcoENXaA