SPC, Downtime, and Overall Equipment Effectiveness
As 2011 draws to end, we hope that you had a wonderful year both in your personal life and your work life. And, we wish you a Happy New Year and the best of luck in 2012. This marks the end of our 8th year of monthly newsletters for us - this is number 96. We hope you have enjoyed the newsletters over the years and that the information in them has been helpful to you.
We close this year's newsletters with one on how to use SPC in tracking uptime and efficiency. Equipment downtime in a plant is almost always an issue, particularly if the downtime occurs at the bottleneck in the plant. This newsletter describes how to track downtime (and overall equipment efficiency) using control charts and Pareto diagrams.
Uptime by itself does not tell the full story. You can be up 100% of the time one day, but if you are running at half of what the equipment can do, you are really not performing very well. If you are simply reworking product, you are not adding to the income of the company. So, we also go beyond just uptime and include the rate at which the line is running, the planned "not scheduled" time, and the rate of quality product to determine an overall efficiency. You will find this called the Overall Equipment Effectiveness in the literature.
In this issue, we answer the following questions:
- Do I track individual pieces of equipment, a production or the entire plant?
- How do I determine the reasons for downtime?
- How do I collect the data?
- How do I define uptime?
- How do I define availability?
- How do I define the rate of quality product?
- How do I define the performance efficiency?
- How do I define the overall equipment efficiency?
- How do I present the results?
- How do I improve the results?
- What are some potential goals?
- Quick Links
We will talk about uptime/downtime to start with and then expand to the overall efficiency. You may leave a comment at the end of the newsletter.
Uptime on Equipment/Lines
The first step is to decide what you want to track. You want to track the uptime on the machines/equipment that are critical to the throughput in the plant, i.e., when the machine is down, the production in the plant will be down. If a machine is independent and goes down without taking down other parts of the plant, you would track the uptime on that individual machine. However, if there are machines that are linked together in a production line, you would track the uptime of the entire production line.
If the machine is not important to plant throughput, don't track its uptime. For example, suppose you have a mill that is used to prepare production samples for testing. If that mill goes down, will it limit plant production? It might if the sample has to be tested before the production run. In that case, it is a critical piece of equipment. However, if there is a spare mill that can be use, it is not critical and you would not track its uptime.
So, the rules are:
- Track only those machines/production lines that are critical to the throughput in the plant.
- If the machine is independent of other machines, track the uptime for that individual machine.
- If the machine is part of a production line, track the uptime for the entire production line.
Knowing the uptime on machines/production lines is important. But if you are going to want to improve that uptime (which you will want to do), you need to know the reasons for downtime. This means you will have to collect data not only on the time you are up and running but on the time you are down and the reasons for the downtime.
The reasons for downtime vary immensely depending on your process. You will want a list of reasons for downtime. This is best accomplished by meeting with those who are closest to the process (e.g., the operators) and brainstorming a list of downtime reasons. This list will be a starting point for you and will need to be revised over the next few months as you gain experience on tracking the reasons for downtime.
It is helpful to think about how downtime can occur due to three major areas: mechanical, process, and people. Mechanical reasons include equipment breakdowns such as pumps failing, belts breaking, etc. Process reasons include things such as sampling, moving material, waiting on material, setting up a machine, etc. People reasons include the machine not being scheduled, meetings, lunch, etc. You will most likely want to develop "downtime codes" for each reason to facilitate data recording.
You might also want to have codes for uptime. For example, you are up and running when you are making the product for the first time. But you are also up and running when you are reworking product that is out of specification. You might want different uptimes code for those types of situations.
Collecting the Data
How you collect the data depends on your setup. You might be able to automatically detect when equipment is running. But in the end, when a machine is down, someone has to decide why it is down and record the reason for the downtime - either manually or into a computer. Obviously, the more information you can enter into a program, the easier the analysis will be. Even if you don't have a fancy computer system, you can set up something in a database (e.g., Access) or spreadsheet (e.g., Excel) to do the analysis.
You will need to decide how often you collect the data. If you run 24 hours a day, 7 days a week, collecting data in 15 minute intervals is usually sufficient. You can set up an Excel spreadsheet, for example, as the table shown below.
Table 1: Spreadsheet Example for Downtime Data Collection
|Time||Material||Customer||Uptime Code||Downtime Code||Comments|
An operator would simply record if the machine/line was up during that 15 minute period or, if it is down, the reason code for why it is down. Always leave a space for comments from the operator in case there is something special the operator wants to enter.
Beware that it will take time to work out the data collection. It will not be perfect at first and will need to be revised over time.
Below we develop the definitions on which to base the calculations. It is best to calculate the data daily so you will be looking at daily results.
Uptime is simply the number of hours you are up and running each day. It can be running product for the first time or rework. But anytime the machine or line is up and processing material, it is uptime.
The availability of a machine/line is defined as the following percentage:
% Availability = 100(Uptime/Loading Time)
where loading time is the total available hours minus any not scheduled or other idle time. Since we are talking about taking the data daily, the total available hours are 24. The not scheduled time is included so you are not punished for things beyond your control. Not scheduled includes no material to run, holidays, experimental runs, etc.
For example, if you were scheduled to run all 24 hours and had 18 hours of uptime and 6 hours of downtime, your % availability for the day would be:
% Availability = 100(Uptime/Loading Time) = 100(18/24) = 75%
If you had four hours planned down because there was no material, then your % availability for the day would be:
% Availability = 100(Uptime/Loading Time) = 100(18/(24-4))= 100(18/20) = 90%
Planned downtime for maintenance is not considered "not scheduled." This should be included in the scheduled hours.
Rate of Quality Product
The rate of quality product is fairly straight forward. It simply is the ratio of good product to total product for the time period. If we are looking at daily results, it is that ratio for the day. So, assuming you just have the two possibilities of scrap and rework, the rate of quality product based on weight is:
Rate of Quality Product = (Run Weight - Scrap Weight - Rerun Weight)/Run Weight
You could also base the rate of quality product on pieces instead of weight. It will depend on your particular situation.
Continuing with our example above where our % availability was 90. Suppose that the capacity of our machine/line is 1000 pounds per hour. During the 20 hours we were up that one day, we ran 18,900 lbs. Of that, 150 pounds were reworked and 50 pounds were scrapped. The rate of quality production is then:
Rate of Quality Product = (18,900 - 150 - 50)/18,900 = 0.989 or 98.9%.
Performance efficiency compares what you ran to the best run rate. The best run rate could be the equipment capacity or the "ideal" based on everything working correctly. The "run weight" includes all material - good and bad. So, the % performance efficiency is:
% Performance Efficiency = 100(Run Weight/Uptime)/(Best Run Rate)
We will continue with the example above. We ran 18,900 pounds in 20 hours. The best run rate is 1000 pounds per hour. So, the performance efficiency is:
% Performance Efficiency = 100(18900/20)/(1000) = 94.5
Overall Equipment Effectiveness
The overall equipment effectiveness (OEE) is a combination of the availability, rate of quality product and performance efficiency.
Overall Equipment Efficiency = Availability x Rate of Quality Products x Performance Efficiency
So, for our example, the overall equipment efficiency is given by:
Overall Equipment Efficiency = 0.90 x 0.989 x 0.945 = 0.841 or 84.1%
Using Control Charts to Display the Results
The availability, rate of quality product, performance efficiency and the overall equipment effectiveness should each be displayed on their own control chart. The control chart allows you to see the underlying variation for each metric and to see how the metric changes over time - is it increasing, decreasing or staying the same. The individuals chart (X-mR) is the best chart to use for this. Each chart will show its normal range of variation and point out special causes when they occur.
Let's take a look at some example data. The data in the table below shows our results for 20 days. The collected data includes day number, uptime hours, loading time, pounds of product, and pounds of rework and scrap. The other numbers are calculated as shown above.
Table 2: Uptime Results
|Day Number||Uptime Hours||Loading Time||% Availability||Lbs Product||Lbs Rework and Scrap||Rate of Quality Product||Performance Efficiency (Best Rate = 1000 lbs/hr)||OEE|
The four calculated metrics will now be charted. These are the four that we want to monitor over time. The four control charts are shown below.
Figure 1: Availability
Figure 3: Performance Efficiency
Figure 4: Overall Equipment Effectiveness
How to Improve the Results
You use the control charts to guide your process improvement efforts. If you are new to control charts, please take a look at our newsletter on the purpose of control charts. If there are any special causes of variation, the reasons for the out of control points should be found and eliminated. Note that there is one out of control point on the availability control chart. Something happened that caused the machine/line to be down longer than "normal." The reason for this should be found and eliminated. This also created the out of control point on the OEE chart.
Once the process is fairly stable, you can determine which metric needs to be worked on first. Many times it will be the availability metric. You should be keeping a Pareto diagram on reasons for downtime. This chart will help you determine the major reasons for downtime and which reason should be worked on first (our newsletter on Pareto diagrams is here).
The objective is always to continually improve over time. However, leadership seems to like to have some goals to work towards when you first start. Those goals are up to you and depend on your situation. However, here are some recommendations to get you started.
- Availability: greater than 90%
- Rate of quality product: greater than 99%
- Performance Efficiency: greater than 95%
- Overall Effectiveness: greater than 85%
This newsletter has shown how you can use SPC to monitor downtime/uptime in a plant. In addition, we went a step further to include the three components of overall equipment effectiveness: availability, rate of quality product, and performance efficiency. The best control chart to use to monitor these metrics is the individual control chart. In addition, a Pareto diagram should be kept on the reasons for downtime.
Thanks so much for reading our publication. We hope you find it informative and useful. Happy charting and may the data always support your position.
Dr. Bill McNeese
BPI Consulting, LLC
Connect with Us
Control Chart Examples
- << Return to Categories
- An Example of the Misuse of SPC in Health Care
- Attribute Control Charts in Health Care
- Baseball Has Changed
- COVID-19 Data and Variation
- COVID-19 Data: Does a Control Chart Add Anything to the Analysis?
- Control Charts and America's Favorite Pastime - Baseball
- Control Charts and Purchasing
- Control Charts and Surveys
- Control Charts and Website Data
- Global Warming: A Trend or Step Changes?
- KPIs, Control Charts and Linking of Measurements
- Making Sense of Data: SPC and Aviation
- Monitoring Customer Complaints Using SPC
- My Blood Pressure is What???
- Plotting the Data and Immigration
- SPC & Process Improvement in the Warehouse
- SPC and Customer Service
- SPC and Global Warming - 2017 Update
- SPC and Global Warming
- SPC and Global Warming Update
- SPC and Global Warming: 1880 - 2020
- SPC and On-Time Performance
- SPC and Pharmaceutical Finished Product Quality Specifications
- SPC and Pharmaceutical In-Process Control
- SPC and Selecting a Supplier
- SPC and Your Suppliers
- SPC, Downtime, and Overall Equipment Effectiveness
- Trend Control Charts and Global Warming
SPC Knowledge Base
Click here to see what our customers say about SPC for Excel!
SPC Around the World
SPC for Excel is used in over 60 countries internationally. Click here for a list of those countries.