Downtime

src: 1technation.com

The term end time is used to refer to the period when a system is not available. Stop Time or duration of outages refers to the time period when the system fails to provide or perform its primary function. Reliability, availability, recovery, and unavailability are related concepts. unavailability is the proportion of time ranges that the system is unavailable or offline. This is usually the result of a system failing to function due to unplanned events, or due to routine maintenance (planned events).

This term is generally applied to networks and servers. Common reasons for unplanned outages are system failures (such as crashes) or communication failures (commonly known as network outages ).

This term is also commonly applied in industrial environments in relation to failure in industrial production equipment. Some facilities measure downtime that occurs during work shifts, or over a 12 or 24 hour period. Another common practice is to identify each downtime event as having an operational, electrical or mechanical origin.

The opposite of downtime is working time.

Video Downtime

Jenis

The industry standard for the term "Damage Duration" or "Maintenance Duration" can have different initiation and completion points so the following clarifications should be used to avoid conflicts in contract implementation:

"Turnkey" is the most exciting of all types of blackouts. Outages or Maintenance begins with the plant operator or equipment pressing the shutdown or stop button to initiate the termination of operation. Unless otherwise stated, Outage or Maintenance is deemed complete when the plant or equipment is back in normal operation ready to begin production or ready to be synchronized with the system or grid or ready to perform the task as a pump or compressor.
"Breaker to Breaker" This outages or Maintenance begins with a factory operator or equipment that releases power circuits (main "Off" or "Removable" or "On-Cooldown") not the control circuit of operation. This still allows the equipment to be cooled or carried to the ambient so that the blackout/maintenance work can be prepared or started. Depending on the type of equipment, the termination of "Breaker to Breaker" may be advantageous if contracting control-related maintenance as this type of maintenance work can be performed when the main equipment is still cold or on standby. Unless otherwise noted, this type of blackout is considered complete when the power circuit is recharged through a power smelter.
"Lock-out/Tag-out Completions" This outages or Maintenance (sometimes mistaken for "Off-Cooldown" but not the same) starts with the factory operator or equipment that releases the power circuit, disconnects the control circuit and neutralizes the potential resources and other hazards (commonly called Lock-Out, Tag-Out "LOTO") This maintenance point is usually the last stage of the blackout initiation stage before actual work begins in facilities, factories or equipment. Safety direction should always follow LOTO activity, before any work is done. Unless otherwise specified, this type of blackout is deemed complete when the equipment has reached a mechanical finish and is ready to be placed on a slow-roll for many heavy rotating equipment, rotary inspection or round inspections for the motor, etc. However, it must follow a return or work permit per LOTO procedure.

Any on-line testing, performance testing and required adjustments shall not be counted against the duration of the blackout because this activity is usually performed after the completion of the shutdown or maintenance event and is outside the control of most maintenance contractors.

Maps Downtime

Characteristics

Unplanned downtime may be the result of software bugs, human errors, equipment failures, malfunctions, high bit error rates, power failures, overloads over channel capacity, cascade failures, etc.

Telecommunication shutdown classification

The stop time may be caused by a failure in hardware (physical devices), software (logic control equipment), interconnection equipment (such as cables, facilities, routers,...), wireless transmissions (wireless, microwave, satellite), and/or capacity (system limit).

Failure can occur due to malfunction, failure, design, procedural (improper use by humans), engineering (use and dispersion), overload (traffic or system resources pressed beyond designed limits), environment (support systems such as power and HVAC), unscheduled time stops (blackouts designed into the system for purposes such as software upgrades and equipment growth), others (none of the above but known), or unknown.

Failure may be the responsibility of service providers/customers, vendors/suppliers, utilities, governments, contractors, end customers, public individuals, natural acts, others (none of the above but known), or unknown.

src: f4.bcbits.com

Impact

Failures caused by system failures can have serious impact on users of computer systems/networks, especially those that rely on services for almost 24 hours:

Medical information
Nuclear power and other infrastructure
Banks and other financial institutions
Flights, airlines
News reporting
E-commerce and online transaction processing
Continuous online game

Also affected can be ISP users and other customers of the telecommunications network.

Corporations may lose business because the network goes out or they may fail in the contract, resulting in financial losses. According to Veeam Availability Report, organizations experience unplanned downtime, on average, 13 times per year with an average cost of one hour of downtime for mission-critical applications of $ 82,864.

People or organizations affected by downtime can be more sensitive to certain aspects:

some are more affected by length of interruption - it's important for them how much time it takes to recover from the problem
others are sensitive to shutdown time - off during peak hours affecting them at most

The most demanding users are those who need high availability.

Real-Time Machine Monitoring with Downtime Cause Tracking and ...

src: i.ytimg.com

Famous annoyance

On Mother's Day, Sunday, May 8, 1988, a fire broke out in the main switching room of Hyndale Headquarters of the Illinois Bell telephone company. One of the largest state switching systems, the facility processes more than 3.5 million calls daily while serving 38,000 customers, including many businesses, hospitals, and O'Hare Airport and Midway Chicago.

Almost the entire AT & T network of 4ESS tandem switches in and out of service over and over again on January 15, 1990 disrupted long distance service to the entire United States. The problem goes away by itself when traffic slows down. Software bug found.

AT & amp; T lost its frame relay network for 26 hours on 13 April 1998. It affected thousands of customers, and bank transactions were one victim. AT & amp; T fails to meet service level agreements on their contracts with customers and must return 6,600 customer accounts, for millions of dollars.

Xbox Live experienced intermittent downtime during the 2007-2008 holiday season which lasted thirteen days. Increased demand from Xbox 360 buyers (the largest number of new user enrollments in Xbox Live history) is given as a reason for downtime; to make amends for service issues, Microsoft offers its users a chance to receive free games.

Sony PlayStation Network April 2011 blackout, starting on April 20, 2011, and gradually restored on May 14, 2011 starting in the United States. This blackout is the longest time PSN has been offline since its inception in 2006. Sony has stated that the problem is caused by external interference resulting in the seizure of personal information. Sony reported on April 26, 2011 that a large amount of user data has been obtained by the same hack that resulted in downtime.

The Telstra Ryde switch fails at the end of 2011 after the water goes out to the electrical switch board from the ongoing wet weather. Switch Ryde is one of the largest switch areas in Australia, and affects more than 720,000 services.

Miami ServerAxis offline data center was not announced on 29 February 2016 and was never restored. It affects many providers and hundreds of websites. This outage affects the coverage of Women's NCAA Women's Division Basketball Tournament 2016 as WBBState, one of the affected sites, by far the most comprehensive provider of women's basketball statistics available.

Too Much Unplanned Downtime? Here's the Network Support You Need

src: www.tvgconsulting.com

Service level

In service level agreements, it is common to specify a percentage value (per month or per year) calculated by dividing the sum of all time-downtime downtime with the total reference time span (eg a month). 0% downtime means the server is available at any time.

For Internet servers, downtime above 1% per year or worse can be considered unacceptable because this means a stop time of more than 3 days per year. For e-commerce and other industrial uses, any value above 0.1% is usually considered unacceptable.

Cove Point LNG Shutting Down for Maintenance This Fall ...

src: marcellusdrilling.com

Response and impact reduction

It is the task of the network designer to ensure that network interruptions do not occur. When that happens, a well-designed system will further reduce the blackout effect by having a local outage that can be detected and repaired as soon as possible.

A process needs to be done to detect the damage - network monitoring - and restore the network to working conditions - this usually involves a problem-solving assistance team, composed of trained engineers; separate help desk teams are usually required to input user input, which can be very demanding during downtime.

A network management system can be used to detect damaged or degrading components before customer complaints, with proactive rectification corrections.

Risk management techniques can be used to determine the impact of network intrusion on the organization and what actions are required to minimize risk. Risks can be minimized by using reliable components, by maintenance, such as upgrades, using redundant systems or by having contingency plans or business continuity plans. Technical tools can reduce errors by fixing code errors, retransmissions, checksums, or diversity schemes.

One of the biggest causes of downtime is a misconfiguration, where the planned change is false. Usually organizations rely on manual efforts to manage the configuration backup process, but this requires highly skilled engineers with time to manage processes across multi-vendor networks. Automation tools are available to manage the backup, but there are very few solutions that handle the configuration recovery needed to minimize the overall impact of outages.

src: f4.bcbits.com

Planning

The planned discharge is the result of activities planned by the system owner and/or by the service provider. This outage, often scheduled during the maintenance window, can be used to perform tasks including the following:

Treatments suspended, e.g., suspended hardware repairs or restart delays to clean up chaotic memory
Diagnostics to isolate detected errors
Repair a hardware error
Fix errors or omissions in database configuration or omissions in recent configuration database changes
Fixed errors in the app database or errors in recent app database changes
Software update/update to fix software errors.

Extinction can also be planned as a result of predictable natural events, such as Sun outage.

Maintenance of downtimes should be carefully scheduled in industries that rely on computer systems. In many cases, system-wide downtime can be avoided using so-called "rolling upgrades" - the process gradually takes part of the system to improve, without affecting the overall functionality.

How to Create a Website Downtime Strategy | SiteUptime Blog

src: www.siteuptime.com

Avoidance

For most websites, website monitoring is available. Website monitoring (synthetic or passive) is a service that "monitors" the time and the user on the site.

src: www.professionalacademy.com

Other uses

Time can also refer to the time when human capital or other assets fell. For example, if employees are in meetings or can not do their work because of other obstacles, they are down. This can be equally expensive, and could be the result of another asset (ie computer/system) being down. This is also known as "dead time".

The term is used also in factories or industrial purposes. View total productive maintenance (TPM).

The High Price Of IT Downtime | Network Computing

src: www.networkcomputing.com

Measure stop time

There are many external services that can be used to monitor work time and downtime as well as availability of services or hosts.

Downtime Monkey | Website Downtime Alerts & Monitoring

src: downtimemonkey.com

References

Intro and Website Downtime Tips

Source of the article : Wikipedia

Downtime

Minggu, 08 Juli 2018

Downtime

Video Downtime

Jenis

Maps Downtime

Characteristics

Telecommunication shutdown classification

Impact

Famous annoyance

Service level

Response and impact reduction

Planning

Avoidance

Other uses

Measure stop time

See also

References

Minggu, 08 Juli 2018

Downtime

Video Downtime

Jenis

Maps Downtime

Characteristics

Telecommunication shutdown classification

Impact

Famous annoyance

Service level

Response and impact reduction

Planning

Avoidance

Other uses

Measure stop time

See also

References

Share this