Business service boards
SL1 contains three virtual business services dashboards on the Dashboards page:
l NOC dashboard overview
l Business Services Dashboard
l Business service details dashboard
For more information on these dashboards, please see our Dashboards Guide.
In addition to these virtual dashboards, you can also create custom business dashboards
Example: retail banking
Using SL1 to monitor a business service is a quick way to determine if a service is available and running
Expected for a customer or end user. For example, a banking company wants to ensure that its retail banking services
The service is available worldwide. It uses the following workflow to set up its services in SL1:
Business service dashboards:
1. Since the company has offices around the world, it creates several services that the devices need to organize
Devices based on location or region. The company adds all of its devices to its associated device services.
2. The company is implementing many projects and advanced technological services to monitor the services of the devices from the first step, includingSeparate IT services for internet banking, cash register systems and ATM networks.
3. The company has a department that deals with retail sales. It works with all categories of its customers It includes all IT services (from level 2) that deal with retail banking.
Administrative operations for corporate services:
Two administrative processes (System> Settings> Administration processes) are used for the Business Service account
Worth:
l Business services: service management engine. This process collects all of the metric information
From your devices and services to compile health, availability and risk metrics. That lasts a long time
The process usually runs every 15 minutes. You should only run it often when you have a file
Cluster database (CDB) with space for the most common collection.
l Business Services: the service topology engine. This process calculates the relationships between you
Your services and devices. If your services and your device
Relationships are relatively stable, you can reduce the frequency to every 15 minutes.
Understand condition, availability and risks:
Understand availability
Availability measures whether something is accessible or working at a useful level. Here are a few
Examples to better understand availability:
for the side. The website URL must be responsive, i.e. it must either respond with the expected page or
With an error page indicating that the site is not accessible (up / down). Website response needs too
Be fast enough not to let users leave the page due to the slow response time. That should be considered
during the availability check.
l A group of database servers. For example, suppose a single database server can process 1,000 transactions per second
With good response times. In order to meet these response times of 3,000 transactions per second, four
The configured database servers are placed equivalently in a cluster. This method enables every single database
Server failure without loss of throughput and acceptable response. When there are three servers in the database
The cluster becomes unavailable, the remaining database server cannot maintain throughput, or
Answer so that the block is effectively unavailable.
for operations. Remember, Process A queues work to Process B. When the depth of the queue sits
0, it means that process A is not doing any new work and is considered unavailable. if it was
The queue is growing to some extent, this suggests that process B is and is not pulling work from the queue
considered unavailable.
Understand health:
A deterioration in the health of a particular service or device means that one or more key performance indicators (KPIs) are in place
Deterioration. If not identified, it would likely degrade throughput or responsiveness. Here are a few
Examples of health problems:
l database server. On an SL1 database server, the main function of the database is to retrieve and store events and files
Dynamic application data. You can create device service policies that lower the integrity of a volume
Move up the HF ranks as it suggests that CDB is being overloaded or being processed slowly
received data. This can lead to delays in events from compilers that are subjected to automation processes, etc.the
Page events and can affect the overall performance of the system.
for Windows servers. In some cases, the CPU queue depth on the Windows server is starting to increase, indicating that
The CPU does not have enough bandwidth to handle its workload. When this happens all processes or
Applications running on a Windows server run slowly, making them less responsive or productive.
You can create a policy to let you know when this happens on a Windows server.
for the side. The website that is the interface of the application shows the increase in the response times of the web URL
Confirmation upon delivery of the URL. If the url is known to be unavailable in 5 seconds,
This means that your customer can give up and go to another seller and then adjust their health status to suit for 1. worsened
A notification is sent for up to 4 seconds that the validity of the service is offensive and investigations and decisions can also be made
Was created before the URL reached an unavailable state.
Understand the risks
When assessing risk, consider the consequences of a KPI deteriorating. When a certain KPI is known that indicates attitudes
If left untreated, it will have a health impact or, for availability, you will need to create a policy for it. some
Examples:
l If the InnoDB table is empty on an SL1 database server, MariaDB is stopped, which leads to
The database server becomes unavailable. The shrinking level of available InnoDB space does not deteriorate
MariaDB response and throughput, and therefore the database server, can indicate that a file
The availability of the database server is at risk.