Is this a “problem?”

This morning I got an email from a friend that recounts a (still ongoing) story of a failed freezer.

We arrived home Tuesday from a week away to find the “extra” freezer in the garage totally kaput…..much of the stuff inside already ruined but some still partially frozen. It’s only 4 years old and within warranty, so [we] go on line and schedule an appointment with GE service for the next day, and spend hours sorting what [food] might be savable, getting bags of ice to try and bridge the time until (you would assume) they will exchange this unit with a new one. Tech comes out the next day, announces that the compressor is fried, and that he’ll order the part and see you in a week to install.

Needless to say, the customer is not exactly happy here. What could be saved now cannot. When they elevate the problem to “Customer Care” on the phone, the answer is basically holding the line to the warranty terms which give the company the option of replacing or repairing the unit.

Aside from speculation that the response would be different if this had been a commercial unit for a large corporate customer, this story brings up some interesting issues.

Clearly the company here is well within their agreement with the customer. That is (apparently) spelled out in black and white in the warranty, all approved by the legal department. And repair of the unit is the logical economic choice for the company.

But equally clearly, the customer here is not happy with the response.

All of my protestations about how an exchange unit shipped from their warehouse in Kent today would allow my wife to save her food falls on deaf ears. Not even a transfer to a “supervisor” for exception resolution could be arranged. If you don’t like it, tough luck..not buy another GE product? “hey, your choice” hard to believe!

And a customer with a technical problem has likely been turned into a customer for the competition.

So here is the question.

“Is this a problem?”

And when I say “problem” I mean, is this a “problem” from the standpoint of the company’s internal process?

I have my thoughts, and I’ll share them in a day or so. But I’d like to hear what you think.

How Many Production Decisions?

Whether in service delivery (including health care delivery), manufacturing, or any other production environment, your team members are likely having to make lots of decisions under perceived time pressure. Even with great visual aids, many of these processes are mistake-prone.

One of the reasons I like pre-kitting parts for a specific option configuration is that it separates the process of deciding which parts to pick from the process of installing them.

This might not seem that big a deal.

Fortunately, if you have a copy of Windows Vista, it comes with a great simulation that shows just how this can feel on the line.

Look under “Games” and start the “Purple Place” game. Select the building in the middle of the screen, and you will find yourself in a cake factory.

cakes

The idea is to look at the TV monitor on the left, and produce a cake that matches the picture. You can move the belt forward and back to position the cake under the various applicators. Then you simply select the correct choice. Seems pretty simple.

Go ahead, try it.

This screen shot is from the “Advanced” level, but even the “Beginner” is pretty easy to screw up unless you are focused and paying attention all of the time.

So – if you find yourself saying “All the employees need to do is look at the picture, and follow the directions – why is that so difficult?” then see how well you do on this game. Play it from the start of your regular work shift until the first break, say two hours, and see how many mistakes you make.

Now consider that your production environment is likely orders of magnitude more complex than this game for little kids. And you are expecting people to work all day, every day, without ever making a mistake.wrongcake

If you are in health care delivery – think about the picture of the finished cake as the physician’s instructions, and the production line as the actual process of filling the prescriptions, administering medications, protocols for preventing infections, record keeping procedures, and ask yourself if there aren’t many more opportunities for error – that are far better concealed – than the ones in this little game.

Just a thought for the day. Meanwhile – enjoy finding a work-related reason to have “Games” loaded onto your Vista machine! 🙂

Cool Email Mistake Proofing

My main desktop computer runs Ubuntu Linux. The default email client is called Evolution. A recent upgrade introduced a very cool feature. When I hit “Send” it looks for language in the email that might indicate I meant to include an attachment. If there is no attachment, it pops up this handy reminder:

screenshot-attachment-reminder

Maybe Microsoft Outlook does this too, I haven’t used the latest version, so I don’t know. But in any case, this is a great example of catching a likely error before it escapes the current process. I can’t count the number of times I have hit “Send” only to get an email reply “You didn’t include the attachment.” Obviously I was about to do it again, or I wouldn’t be writing this. 😉 Since I am sending out things like resumes right now, that is something I would really like to avoid.

When talking about mistake proofing, or poka-yoke, there are really three levels.

The first level prevents the error from happening in the first place. It forces correct execution of the correct steps in the correct order, the correct way. While ideal, it is sometimes easier said than done.

The next level detects an error as it is being made and immediately stops the process (and alerts the operator) before a defect is actually produced. That is the case here.

The third level detects a defect after it has occured, and stops the process so that the situation can be corrected before any more can be made.

Each has its place, and in a thorough implementation, it is common to find all of them in combination.

Related to this are process controls.

Each process has conditions which must exist for it to succeed. Having some way to verify those conditions exist prior to starting is a form of mistake-proofing. Let’s say, for example, that your torque guns rely on having a minimum air pressure to work correctly. Putting a sensor on the air line that shuts off the gun if the pressure drops below the threshold would be a form of stopping the process before a defect is actually produced.

A less robust version would sound an alarm, and leave it to the operator to correctly interpret the signal and stop the process himself. Your car does this if you start the engine without having the seatbelt fastened. (back around 1974-75 the engine would not start (see above), but too many people (i.e. Members of Congress) found this annoying so the regulation was repealed.)

Consider the question “Do I have all of the parts and tools I need?” What is the commonly applied method to ensure, at a glance, that the answer to this question is “Yes?”

If you answered “5S” then Ding! You’re right. That is one purpose of 5S.

A common question is how mistake-proofing relates to jidoka.

My answer is that they are intertwined. Jidoka calls for stopping the process and responding to a problem. Inherent in this is a mechanism to detect the problem in the first place.

The “respond” part includes two discrete steps:

  • Fixing or correcting the immediate issue.
  • Investigating, finding the root cause, and preventing recurrance.

Thus, the line stop can be initiated by a mistake-proofing mechanism (or by a person who was alerted by one), and mistake-proofing can be part of the countermeasure.

But it is not necessary to have mistake proofing to apply jidoka. It is only necessary for people to understand that they must initiate the problem correction and solving process (escalate the problem) whenever something unprogrammed happens. But mistake-proofing makes this a lot easier. First, people don’t have to be vigilant and catch everything themselves. But perhaps more importantly, they don’t have to take the (perceived) psychological risk of calling out a “problem.” The mechanics do that for them. It is safer for them to say “the machine stopped” than to say “I stopped the machine.”

Back to my email…

A3 – A Process, Not A Form

Kris Hallan is a frequent contributor on the LEI forum at lean.org.
In this post, he outlines some great experiences with trying to implement the “A3 process” in his organization. Lean Forums – A3.

One thing that really drove home what goes wrong most of the time with the A3 process, and frankly, with most well-intentioned efforts to bring good analysis into organizations, was his experience of an early effort to try to “require” it without having the behaviors to back it up:

One of the worst things you can do is require an A3 be written and then allow a poor A3 to get past you. This has a tendency to happen when you put an A3 mandate on something that you don’t necessarily have control over. For instance, we started by requiring all CAPEX [capital expenditure – ed] projects to be proposed using the A3 format (hoping that the A3 thought process would come with the format).

What we got was a lot of projects summarized on A3s and virtually no feedback to go back and improve anything. No one learned anything from the process, no hanei occured, and nemawashi was non-existent. It became a box that everyone had to check. This can have a very detrimental effect on people’s attitude toward A3. Since they don’t take it seriously, they can’t really learn anything from it. I would say that this actually moved us backwards in our understanding of problem solving.

I could not agree more. I have seen this in other companies. This is PLAN-DO without the CHECK and ACTion. Set an expectation, go through the motions of compliance, but don’t ever bother to see if it is actually working the way that is expected.

The good news, further into his post, is that Kris’s organization figured it out and found that doing it thoroughly is more important (and quicker!) than doing it fast.

Andon Leadership

On a world-class automobile assembly line, the actual work is continuously being compared to the planned work. In each work zone, there is a planned sequence of tasks which are expected to produce a specified output.

If there is any departure at all from the planned sequence, if things get behind the planned timeline, if the necessary conditions are not there, if any process step does not complete as required then either the Team Member or an automated system turns on a help call – an andon.

The response is immediate. The first response is within seconds with a priority of clearing the problem. The line itself is still moving, but if the problem is not cleared before the end of the takt time the line automatically stops – things go from “yellow” to “red.” When that happens, the responsibility for the problem also shifts up a level in the response chain.

The priority is still to clear the problem quickly – and clearing the problem means to restore conditions required for safety and quality without compromise.

Once the problem is cleared, and the line re-started, the rest of the problem solving process engages to find the cause of the problem and address it in the system itself. If the problem is outside of the bounds, or outside of the capability, of the original Team Leader to solve, then his chain-of-support will engage to help him solve it so that his skills can be improved.

In summary – we have a sequence of tasks to be accomplished over about six hours that, in the end, will result in a car. That sequence of tasks is further broken down into sub-intervals where progress against the plan is checked. If problems develop, the system responds, swarms the problem to clear it, understand it, and adjust the process if necessary.

None of the above should be a surprise.

But what happens in your management monthly reviews?

Huh? How are management monthly reviews related to an assembly line?

Let’s see. In a reasonably functional organization the business plan is (or should be!) a series of tasks to be accomplished over a period of time that, in the end, will deliver specified results. That sequence of tasks is further broken down into sub-intervals where progress of the plan is checked.

But in a typical monthly review, the analogy ends there. When problems develop, the reasons are discussed, mentioned, and worked around. In dysfunctional organizations only the objectives are discussed, and in truly dysfunctional organizations the objectives themselves are adjusted to match what is accomplished.

Shift your thinking a bit, and apply the assembly line analogy to its end-state.

The monthly review is the fixed-stop position. Up to this point, the person responsible for the task “owns” it. He should be checking progress and ensuring that things are getting done as specified, and that the results being achieved are as anticipated (predicted). He should be monitoring conditions and making sure that the operating assumptions are holding. Ideally those checks are built-in to the process and planning so that they are at least semi-automatic.

If all is “green” going into the monthly review, then things are great.

But if something is “off” – yellow – that is equivalent to an andon call. The responsible person is that first-responder. He has until the next review to clear the problem. Think about the ramifications here. This means that if the reviews with the boss are every month, the responsible manager better not wait until the week before to find out what is happening so he can report on it. He has to stay in touch with what is happening.

The monthly review is the fixed-stop-position at the review, and the “andon” goes red.

The reviewing manager now “owns” the problem. His job is now to ensure that there are effective countermeasures in place to get things back on track. That does not absolve the responsible manager, but rather, this becomes a “check” and an “act” for his professional development by the more senior leader.

Again, think of the ramifications. The responsible manager must not only be in touch with what is happening, he also needs to make sure his people are being developed and pushed to fully understand the problems as they occur.

The job of these leaders, at each level, is not only to keep intimately in touch with what is going on, but also be fully aware of the skills, gaps and development of their people. If someone should be able to handle an issue, but can’t, that is a skill gap that, like any other gap, must be addressed (in this case by developing the person).

In mediocre organizations, professional development is owned and overseen by H.R. and may be tied to the annual review process. In organizations that “get it” professional development is a natural part of the leadership process, and happens at all levels in the natural course of getting things done.

One-off and Customization

One of the questions that comes up frequently in “lean” discussions is the issue of non-repetitive work. This is especially an issue with complex information processes such as bid proposals, estimates, etc.

While it is true that breaking down and understanding truly repetitive work is easier, the same principles apply to more complex tasks.

But first, I’d like to challenge some thinking. If you were to tell me that “every time we do this it is totally custom,” my question would be “If it has never been done before, why can’t everyone do it just as well as you?” What makes up your expertise? Why do your customers come to you?

I’ll answer: Because you know how to do it. Obvious, right? But that’s the point. You have experience, because you have done it before. You have a process of some kind. And you carry out that process to produce your customized product.

Further, you have some kind of idea of what a “defect free product” is. You understand what you want to accomplish.

Complex operations are built up from simple ones. The tasks that are done over and over are these simple operations. These simple operations are great sources of waste because (typically) no one ever examines them.

The idea that “lean” only works for repetitive processes is largely the fault of the “lean industry.” It focuses so much on takt time and removing waste that it has not, so far, gotten to the actual heart of what makes continuous improvement work.

In an ideal repetitive process running to takt time, there are a number of key ingredients.

  • There is a highly specified work sequence that calls out the content, timing, sequence and intended outcome of the work. EVERY time that work sequence is carried out, the actual content, sequence, timing and outcome is being compared to what was specified. ANY departure from the specified work (or result) is going to trigger some kind of immediate response to correct the immediate issue, and then start the process of problem solving to find the reason this happened. This is jidoka.
  • Although I mentioned timing above, the importance of time is elevated. Taiichi Ohno is quoted as saying “Time is the shadow of motion.” Ohno was a great student. The idea that it is important to study motion itself rather than focusing on time comes from the pioneer Frank Gilbreth. In application, by specifying how long each normal operation should take, and checking the actual timing, it is possible to detect when unplanned motions creep into the work. This is the purpose of takt time and work balance. It is no more, and no less, than a tool for verifying planned vs. actual so that action can be taken immediately if there is a difference.
  • There is a specified amount of work-in-process. Each piece has a reason to be there. There is some means of detecting any departure from that standard, and any departure triggers an immediate response to understand why.

As you scale up from a work cell to an entire factory, these mechanisms scale as well, but the principle is always the same:

  • Tightly specify what you expect to do, when it should be done, who should do it, where it should happen, and how it should happen.
  • Tightly specify (understand) the intended “defect free” result of each step.
  • Have a way to verify, continuously, that what you are actually doing (and when, who, where and how) matches what you planned.
  • If When you DETECT any departure from what you specified, STOP pretending you are still running to plan; FIX or CORRECT whatever got you off plan and get back on plan; then work to understand and SOLVE THE PROBLEM. This is how the organization learns and acquires what Deming calls “profound knowledge” of itself and its processes.

In a complex one-off situation, you might never carry out that plan again. But plan carefully, applying everything you know… all of your expertise. Understand the elemental tasks that you do all of the time. Understand how long each should take. Understand what result you should get at each step.

As you carry out the plan, if when there are unforeseen real world issues, STOP long enough to understand their impact and start asking questions. First, what must we do to get this back on track?

If someone had to improvise, why? What knocked him off the plan or process?

If someone had to finish up work he got from upstream, why? Was the specification for a “defect free” transfer vague? Does it need clarification?

This can go on, but the bottom line is that most organizations with complex processes expect their people to accommodate and cope with noise in the system. They say “every time is different, we can’t possibly plan that well.”

The ones who are getting better every day say “We should be able to plan this perfectly. Why couldn’t we this time?” In simply asking that question, they get better each time they do it.

So – in the context of the original question. When putting together a complex bid and proposal, there are (I presume) steps you routinely go through. This is so even if you are not aware of what they are. Study those steps. Study the intermediate products. Understand where one step ends and the next begins. Understand what must be present (information, resources, etc.) for that step to execute successfully. If, at any point, those things are not there then STOP and understand WHY. Don’t just ask the people who SHOULD have those things to go find them.

You wouldn’t have an assembler on the shop floor hunt down missing parts. Don’t have a professional engineer hunt down information he should have received.

Check, at each intermediate step, that it was done as you expect.

At the end of the process, check that the bid is what you want it to be. If it needs rework (meaning someone didn’t approve it), understand why, and work on what information the process didn’t have the first time through. Next time, get it right.

The proposal itself is packaging. It is also your first cut at the plan for execution.

If you get the bid, and execute your plan, any departure from what you originally proposed should be understood. Why? What happened? What didn’t you see coming? Some things are truly different, and you are only estimating. But compare your actual vs. your estimate so that next time you can do a better job estimating.

There will always be imperfections. The question is in whether the organization chooses to bury them in waste or learn from them.

Hospital Error – Heparin in the news again

Corpus Christi, Texas: Hospital error blamed for more infant overdoses – Yahoo! News

Key points of the story are:

  • 14 babies received heparin overdoses while in intensive care.
  • Two premature twins died, though it is unknown if this was the cause.

…pharmacy workers at Christus Spohn Hospital South made what the hospital called a "mixing error." The two workers went on voluntary leave.

The heparin, which was 100 times stronger than recommended, was given to 14 infants in the hospital’s neonatal intensive care unit on July 4.

As I have cited on previous posts, medication errors are one of the most common ways hospitals unintentionally injure or kill patients in their efforts to treat them.

I am reasonably certain that the two workers who went on "voluntary leave" (yeah, right) will absorb more than their share of blame as the system solves the problem by asking the "Five Who?" questions.

The article then goes on to cite a history of similar incidents in hospitals all over the country, though it is sometimes unclear in the writing if it is talking about this case or a previous one.

So what should happen?

First, determine where the actual root error occurred. If the manufacturer shipped mislabeled product, for example, then the problem happened THERE, not in the hospital. That isn’t to say we can’t do a better job in the hospital catching those things, but that isn’t the root cause in this case.

Any investigation should center on an assumption that the workers were operating in good faith, paying as much attention as can be expected of a normal human being, and were positive that they were doing this right. They did not want to injure or kill their customers.

Somehow, then, the process of mixing (either in the hospital or at the manufacturer) allows a 100x error to go by without SCREAMING for attention. And scream it must. Humans doing routine things in routine ways operate on an assumption that everything is routine until presented with overwhelmingly compelling evidence to the contrary.

Any hunt for "who did it" is motivated by extracting retribution rather than solving the problem. Once again, I refer the reader to the great work by Sidney Dekker on human error. We can all learn and apply it to everything that people need to do correctly – safety, quality, any other process or procedure.

Safety and Lean Manufacturing

This is a (belated) response to a post from Patsi Sells on The Whiteboard. She asked about safety and kaizen.

When first implementing some of the tools and mechanics of the TPS (especially in a manufacturing environment), many of the initial efforts seem to run afoul of the industrial safety professionals. My experience suggests a couple of basic causes.

  • Safety countermeasures are not seen as necessarily directly contributing to production of the product. Thus, they are often lumped in to “not value added” or “waste” – at least by implication, if not directly, by over-enthusiastic kaizen event leaders.
  • Safety professionals can be stuck on specific countermeasures vs. looking at the actual risk and identifying other possible countermeasures.
  • Safety professionals are concerned (and rightly so) about repetitive motion injury. They perceive that takt driven standard work might increase the risk of this.
  • There is overall a general lack of understanding of the difference between regulatory compliance and keeping people safe. While these two things overlap, neither is necessary nor sufficient to assure the other.

Let me start with the last point since it probably raised the most eyebrows.

An industrial safety program has two distinct and discrete objectives:

  1. To keep people from getting hurt.
  2. To comply with all laws and regulations

As I said above, while meeting both of these objectives are absolutely necessary, meeting the requirements of one does not guarantee meeting the requirements of the other. To put it more directly –

  • It is possible to be fully compliant with all health, safety and environmental regulations and still have a workplace which is extremely dangerous.
  • It is possible to have a very healthy, safe and environmentally friendly workplace and still run afoul of laws and regulations.

Thus, it is necessary to ensure that each of these things are discretely addressed in any successful program.

It is important for both kaizen and safety professionals to be aware of this. Some things are simply required by law, even if they are wasteful, and sometimes interpretation is up to the whim of a local inspector who might be having a bad day. Whatever the case, for each regulatory requirement there must be a specific, targeted countermeasure, just as there must be one for each physical risk. Sometimes these things overlap, but sometimes they do not, and that is the key point here.

I will digress for a paragraph and make this point however: If you have the basics of workplace organization in place, you make a much better first impression on a health and safety inspector than if the place is a mess. If your fire department sees the fire extinguishers are all where they should be and up to date, access aisles and evacuation doors clear, etc. he is more biased toward the assumption that you have your basic act together than if the place is a mess. Everyone has some violations, but when they are blatant, simple things it starts you off with a bad impression, and inspectors usually start digging.

Moving beyond simple regulatory compliance, the objectives of “lean manufacturing” and safety are 100% congruent. Ethically it is never acceptable to knowingly put people in harm’s way for the sake of production. At a more pragmatic level, a hazardous workplace will adversely affect quality, production, cost, delivery and morale. I will not descend to the level of cost-justifying safety simply because it isn’t necessary. But it is equally true that an unsafe workplace has higher costs than a safe one.

The good news is that the kaizen process is incredibly effective at dealing with safety hazards.

Moving up on the above list, one of the biggest places where the safety professionals can help smooth out the work is with their expertise in ergonomics.

What the kaizen people should take a little time to understand is this: Motions with poor ergonomics nearly always take longer than motions with good ergonomics. To put it a little more clearly: Ergonomic improvements are kaizen. Once a good team understands the difference between good and bad ergonomics, they can quickly see many small improvements which all accumulate.

By standardizing the method, we standardize the good ergonomics. Where there are no standard methods, the Team Members will each develop their own which may, or may not, be the safest way to do the job.

Standardizing the right motions reduces the chance of repetitive motion injury. And paying attention to why unusual motions are necessary comes back to reducing variation and overload (muri, mura) in the workplace.

At the next level up, standard work is your best friend.

To develop any process you first take a good look and specify exactly what result you expect to achieve.
Define a defect-free outcome.
Part of that definition is “perfectly safe.” No one is going to argue with this. Of course you want a process that is perfectly safe, and produces a defect free outcome. Who wants the opposite? But until “defect free” and “perfectly safe” are explicitly defined or specified, there may be room for interpretation. Do this before working on the process steps.

Next define the work you believe will deliver a perfectly safe, defect-free outcome. Define the content, sequence and timing of the work.

Now you have a specification for content, sequence, timing and outcome – the four elements of activity that Steven Spear called out in “Decoding the DNA of the Toyota Production System.”

Next – try it. Run the process exactly as you specified it. Verify two things:

  • That the Team Member can perform the process as specified – there is nothing keeping him from doing it.
  • That the Team Member actually does it that way and put in guides, controls, mistake-proofing where things go off track.

Check your results.
Was it perfectly safe? Did you see any risks? How are the ergonomics?
Adjust as necessary, then repeat.

Was the result defect free? How do you know? Is there a way for the Team Member (or an automatic step) to verify the result?

In practice, you are “trying it” and “checking the results” not just when you are developing the process, but every time it is done.

If a defect is produced at some point, then you know either:

  • The process didn’t work as you expected.
  • For some reason (which you don’t know yet), the Team Member did something different, or omitted a step.

Likewise, if the Team Member so much as skins a knuckle, you know the same things: The process is not perfectly safe or the process wasn’t (or couldn’t be) followed.

In most cases where the process was not followed you likely had a Team Member doing something in good faith to “get the job done.” This is the time to gently remind him that, when he can’t follow the process as designed, to please pull the andon and let someone know. Maybe you will have to make an exception to correct the current situation, but you HAVE to know if the process isn’t working or is unworkable.

Bottom line?
You get perfect safety exactly the same way you get perfect quality.
The methods and approach for getting it, and the methods and approach for correction and countermeasure are exactly the same.

Remember:
The right process produces the right result.

If you aren’t getting the result you want, then take a look at the process.

Chatter in an ISO Process

I have been in, or encountered, a number of organizations which had (or were working on) ISO-900x quality registrations. While I am fully aware of the intent of the ISO requirements, in the cases I have seen, the effect seems to fall well short of the goal.

On the surface, the types of processes mandated by ISO 9001 seem quite reasonable. They require knowing what your processes are, having documentation for them, and having systems that address problems to root cause.

The requirements are not a lot different from what I mentioned a couple of posts ago in Chatter as Signal. The example of the U.S. Navy’s nuclear propulsion operations would certainly meet the criteria (and then some).

So the question is:
What is fundamentally different about an organization with a “paper only” ISO certification that struggles with chaos every day vs. one which is truly process driven (whether they have an ISO certification or not)?

Chatter as noise.
Chatter as signal.

The Importance of Heijunka

My friend Tom poses an interesting question to production managers:

“If I ask you to produce different quantities and types of products every day, what quantity of people, materials, machines, and space do you need?”

Of course the answer is usually, at best, inarticulate and, at worst, a blank stare. There isn’t any way to know. Add to this the well-established research of the “bullwhip effect” which amplifies the magnitude of these fluctuations as you move up the supply chain, and it is easy to see the suppliers are really set up to fail.

Then he asks another question:

“If I ask you to produce the same quantities and types of products every day, or every hour, could you then answer the question?”

And, of course, the answer is that this is a “no brainer.” It would be very easy.

So the rhetorical question to ask is: Why does Toyota place such emphasis on heijunka?”
But my question is “Why don’t we all do it?”

Heijunka is a process of dampening variation from the production schedule. In English it is called “production leveling.” It comes in two steps:

  • Leveling the daily workload – smoothing out variations in the overall takt time.
  • Leveling the product mix within the daily work load – smoothing out variations in the demand from upstream processes.

Production leveling, however, is difficult, and the management has to have the fortitude to do it. Honestly, most don’t. They don’t like to deliberately set the necessary inventory and backlog buffers into place. So I’d like to explore some of the consequences of not doing it and then ask if these costs are worth it.

Consider this analogy.

Take a look at what modifications are necessary for a vehicle to traverse a rough, irregular road. The suspension must be beefed up, more power is required, the drive train is far more complex for 4×4. The road itself is unmarked, so the driver does not know where he is or where he is going without sophisticated navigation equipment.

Most of this additional hardware is actually unnecessary most of the time. But it might be needed, so it has to be there… just in case.

The driver of this vehicle is primarily concerned with what is right in front of the vehicle, and far less concerned about what is a mile (or even a few dozen meters) up the road. He will deal with that problem when he comes to it. Driving in this environment requires skill, training, experience, and continuous vigilance. People do this for recreation just for these reasons.

Now smooth out the road. Straighten out the hard curves. Give it some pavement. Put in signs so it is possible to navigate as you go. The same speed can be maintained by a vehicle which is lighter, has less power, a simpler drive system, a simpler suspension. Even though the engine is smaller, it is more efficient because it can run at a constant power output, the sudden accelerations are not necessary. Everything is easier.

The vehicle is much less expensive and much more efficient. The driver’s task is far simpler.

When you allow outside-induced variation to work its way through your system, you are putting potholes in the road. You are introducing sudden turns, sudden changes. Sometimes you are washing out entire bridges. People must be more and more vigilant and they simply miss more things. Their mental planning horizon shrinks to what they are working on right now, and maybe the next job. They certainly aren’t checking what they need tomorrow. They will worry about that in the morning.

The Effect on Materials

Even in the worst managed operations, people generally want to be able to provide what they are supposed to. They are motivated to be “good suppliers.” They also intrinsically understand that if they are idle (not producing) this is not good. Even if they are not provided with the tools and resources to do so, they will do the best they can to succeed – even if those things hurt the overall organization.

(I should note that most “management by measurement” systems actually encourage people to do things that hurt the overall organization, but that is another article.)

When these well meaning people encounter problems, they try to mitigate the effects of those problems with the resources they have available.

  • If their upstream suppliers do not deliver reliably they will add inventory so they have what they need.
  • Likewise, if their upstream suppliers do not deliver good quality, they will add some more inventory to make sure they have enough good material.
  • If there is quality fallout within their own process, they will add inventory and up production to cover that. By the way, that increase of production also increases the demand on the upstream suppliers, sometimes in unpredictable ways.
  • If their customers have irregular demand patterns, they will add inventory so the customers can have what they need, when they need it.
  • If there is batch transportation either upstream or downstream from them, they have to accumulate inventory for shipping.
  • If there are on a different shift schedule from either their customer or supplier, inventory accumulates to accommodate the mismatch.

Do you notice a theme here? The key point is: Without the system level view from their leadership, and without the problem solving support, all they can do is add inventory to cope.

Without leveling, any variation in demand will propagate upstream. At each step, two things happen:

  1. Processes that accumulate and batch orders progressively add to the amplitude of the variation.
  2. Irregularities within each process are added to the variation that comes from the customer.

By the time this hits your supply base, it is a tsunami. First the beach goes dry as it looks like the order base has dried up. (This is why you need to constantly reassure your suppliers with a forecast – because they can’t see regularity in your orders.) Then all of that water comes rushing in at once – and your suppliers can’t cope. Worse, they may have allocated the capacity elsewhere because they were tired of waiting on you. Lead times go up, things get ugly.

But even internally, all of this self-protection just adds more and more noise to the system.
So they add more and more inventory.

For a management team that is reluctant to deliberately add some inventory or backlog buffer to contain sources of variation and protect the rest of the system here is some news: Your people are already doing it, and in aggregate, they are adding FAR more inventory than would be needed with a systematic approach. They can only see the local problems, and each is just trying to be reliable – even if their efforts work in the opposite direction and actually introduce more variation into the system.

The Effect on People

I live in the Pacific Northwest of the USA. A fact of life living here is that, occasionally, the earth literally moves under our feet. I can tell you from experience that this is psychologically unsettling.

In our factories we do the same thing to people when the schedule changes every day. In the name of flexibility we shift requirements up and down. Add to that chasing shortages and hot list jobs around, and the daily work place is chaos. People are not sure if they are succeeding. Or, at best, they declare victory because they were not buried today.

Daily kaizen? That is just not going to happen in this environment. When you start talking about introducing flow, you threaten the self-protecting inventory buffers, and I can assure you that you will have a fight on your hands. Why? Because your people believe they need these buffers to get the job done – the job you want them to do. Now you are taking that away? Are you insane?

This is why it is critical to establish a basic takt as early as possible, then immediately start aligning the expectation to just meeting that takt.

Anything that keeps people from meeting takt becomes a problem and must be addressed. This is jidoka. Heijunka is a block in the foundation of the TPS “house” for a reason. Unless people are standing on solid ground, they can’t even consider anything like “just in time” or “stop and respond to problems” because they are spending all of their mental bandwidth just trying to figure out what is going on hour by hour.

Conclusion

When I was a military officer, we were trained in tactics designed to present our opponent with a constantly changing picture of what was happening. We wanted to inject as much confusion and uncertainty as possible. The mechanics of defeat on the battlefield are simple: The force subjected to this first shifts from action to reaction. They lose initiative, and therefore lose psychological control. Next the horizontal control linkages start breaking down. Each sub-unit starts to feel isolated from the others. They feel less a team and more on their own. Then, as more and more of their attention is shifted to self-preservation, the vertical chain of command breaks down. Each sub-unit is now mentally isolated and can be defeated in detail.

Ironically many factories are managed such that the workers on the shop floor are subjected to exactly these same conditions – and we wonder why they have a cynical view. We are defeating ourselves.