|
Patient
Waiting Lists & Healthcare
In the previous page we saw how to apply
replenishment to log marshalling. Of
course people are not logs – we know that.
People are much more perishable, especially when they are unwell. However, is there something that we can
learn from log marshalling in general that can also be applied to
healthcare? Let’s run a test, let’s
compare the patient waiting/referral process against log marshalling and see
if it is different or similar. If it
is similar then maybe we already have a reference environment from which we
can extract the principles and apply to healthcare. Let’s see.
Firstly, however, if you have arrived at this page
directly rather than sequentially through the replenishment page and the
distribution page, then please consider reading the replenishment page
first. This will ensure your
understanding of the technical solution (the planning and control system)
that we are going to apply in this case.
Forearmed with such knowledge you will be in a much better position to
evaluate the description of the current problem and also the potential for
the detail of the solution.
The “producers” or the source node in this system –
the general practitioner, or local doctor – is in this case a service
operation having no set-up and appears to produce stock for the system –
patients – at a random rate and in units of one. Now, however, there is no longer a
geographic many-to-one relationship between producer and the next node, the
specialist. But rather there is a many
to many relationship. Any general
practitioner may wish to refer a patient to any one of a number of different
specialists depending upon the nature of the illness.
Let’s try to draw this network.
All we have done here is to have changed the
relationship from one-to-one in log marshalling to many-to-one in public
healthcare and changed the labels. As
a generalization then, the marshalling network seems to describe the flow of
patients. There is however one
interesting and critical difference in this network, the actual production
part isn’t at the source nodes any more, here labeled the local doctor. The actual production is at the end node,
the intervention/admission.
Health professionals “bristle” when manufacturing
analogies are used in health, “we don’t make cans of beans you know!” However, we need to use manufacturing
terminology for a moment to describe part of this system in consistent
terms. When the intervention is
carried out in a theatre, that part of the process is a production
process. It has a set-up, it has particular
equipment and staff for particular procedures, and it may even have particular
rooms. We need to know this to
distinguish it from the supply chain portion, the referral and waiting list
part, because the way in which the supply chain portion and production parts
are managed are intrinsically different.
How can we be sure that intervention is a production
process and not a project process?
Well, if we look at an operating list we can characterize it in terms
of patients per day, rather than days per patient. Therefore we can be quite certain that from
the system’s perspective the intervention node is a production process. This is important because it is likely that
the intervention node will always be the control point – the drum in
drum-buffer-rope terminology – regardless of whether the constraint is
internal (we don’t have enough capacity) or external (we don’t have enough
patients).
How can we be sure that this node will always be the
control point? Well, at a guess, it is
the most capital extensive step, either a new theater in surgery or a new
ward in medical cases. It certainly
will be the operationally most expensive step in terms of on-going staffing
and support. Therefore it is unlikely
that new theaters will be built in rapid response to demand. The control point is therefore unlikely to
shift somewhere else as a consequence of additional theatres being built.
So it looks as though there is some validity in
investigating marshalling or a convergent supply chain as a descriptor for
patient waiting lists. However, this
brings us to a significant issue – waiting.
On the very first page, the introduction to these webpages, we
mentioned that we can batch in time or we can batch in quantity. Supply chains are dominated by batching in
time, and the patient waiting list is a supply chain. Batching in time is so pervasive in
healthcare that we absolutely accept it as normal. We fail to even question the reasons for
its existence. Whenever we batch in
time we cause waiting to occur.
Reducing batching in time, along with removing the policy constraints
that limit output; will substantially reduced patient waiting lists. Would this be a worthwhile cause to
pursue? I think so. Are you interested?
Good, then we need a plan of attack.
Plan Of Attack
There is only one plan of attack, the 5 step focusing
process that we have used to date in the analysis of all of our logistical
endeavors. Let’s repeat it here for
good measure;
(1)
Identify the system’s constraints.
(2)
Decide how to
Exploit the
system’s constraints.
(3)
Subordinate everything
else to the above decisions.
(4)
Elevate the system’s constraints.
(5)
If in the
previous steps a constraint has
been broken Go back to
step 1, but do not allow inertia to cause a system constraint. In other words; Don’t Stop.
What is the constraint in this system? What are we trying to protect? The constraint isn’t a lack of “customers”
in this instance; there is no shortage of patients. It something else; it is expensive and
finite capacity – limited physical space somewhere, possibly “funded beds”
(as opposed to beds that exist but are deemed to be unfunded) for medical
conditions or theatre space for surgical conditions. That almost answers the second question –
how to exploit the constraint. We lose
output from the system whenever we have patients who need intervention, but
who are not in the right place at the right time to receive that
intervention. Healthcare is a service;
we can’t store the intervention for use at a later time.
In order to exploit the system we need to ensure
that we can never waste an opportunity to carry out an intervention. We will need to deduce how to best do this
– a strategy for exploitation. We will
also need to develop how to best subordinate the rest of the system once the
exploitation strategy is in place.
To properly and successfully exploit and subordinate
the valuable capacity in this system would mean that we begin to have shorter
waiting times and eventually spare capacity.
First however in order to be able to determine the exploitation and subordination
strategy we need to examine the properties of waiting list networks in a
little more detail. Let’s do that.
The Waiting List Network
Let’s acknowledge right at the outset that there are
two ways that we could effectively exploit the constraint – one way is to
decrease the input and the other is to increase the output. We could express this as follows;
(1)
Approaches that avoid
illness in the first place.
(2)
Approaches that mitigate
or cure illness once contracted.
Although there is now an increased emphasis on
pro-active prevention rather than reactive intervention, the fact remains
that much illness still requires active intervention and moreover there is a
backlog of work. This backlog arises
from rising general expectations from taxpayers and improved levels of
care/technology from practitioners.
Technology is a double edge sword here, it both substantially reduces
the effort in simple interventions freeing up bed space that was unimaginable
20 or 40 years ago, and at the same time making possible interventions that
tie up bed space that was unimaginable 20 or 40 years ago.
We need to recognize that the patient waiting list
network in the main is concerned with non-acute admissions although how we
handle this network strongly impinges upon acute work also. Public health systems must deal with
non-acute, acute, and emergency patients; all at the same time. But this is not unusual. I have not yet seen a process that didn’t
operate a concurrent but differential priority system of some sort. Concurrent differential priorities are the
rule not the exception in serial processes and health professionals need to
recognize this commonality. In any
system; manufacturing, service, or supply chain, the only way to manage
differential priorities concurrently is to have adequate sprint capacity and/or
buffering.
Let’s examine our non-acute network then, from the
perspective of a patient/taxpayer (tired pun intended). It is a bit radical to take a patient’s
perspective but let’s press on. The
patient books an appointment with the local general practitioner who
determines that the problem more correctly needs specialist assessment. The doctor might ask “public or private”
implying some differential service but we won’t go there. Our patient paid taxes damn it and is going
to go public. Well, please wait 2
weeks and you will get a letter from the hospital telling you when to attend
an outpatient’s clinic. Two weeks pass
and the letter arrives – please attend a clinic in 6 weeks time! Six weeks pass and the specialist
appointment date arrives. And isn’t it
funny how all the other people in the clinic seemed to have similar
conditions. Anyway an assessment is
made and intervention is recommended within 6 months. Ah, that doesn’t mean 3 or 4 months, that
means something like 5˝ months or 6
months. That’s 6 months waiting plus 1˝ months waiting
plus ˝ a months waiting. That’s 8
months waiting all up. That is, if the
specialist didn’t refer you back to the general practitioner.
Why all the waiting?
Well, we are just trying to be efficient didn’t you know!
If you look at the marshalling system it looks like
an “A” – upside down I grant you.
However A-plants describe a situation where there is general
convergence in manufacturing.
Patients’ waiting lists don’t manufacture anything, but as a supply
chain they seem to exhibit the same behavior as A-plants. Let’s look at an A-plant description; under
traditional management practices in A-plants the tendency is to misallocate
resource time in an attempt to maximize efficiency and utilization figures. Large batches are used to keep the
measurements high resulting in a poor component mix and constant shortage of
the right parts (1). Furthermore these
large batches move in waves throughout the plant causing temporary
bottlenecks to wander from resource to resource. Since material is constantly out of
balance, overtime is used to ‘catch up’ so that shipments can be made on time
(1). We can expect similar things to
happen in patient waiting lists.
What evidence do we have of local efficiency
measures to substantiate this assertion?
Well just reflect for a moment on the number of measures that we must
compile for reporting and how many of these directly relate to improving
wellness in the community. Aren’t most
really local efficiency figures?
Aren’t most of them sheer frustration to substantiate even on a good
day?
What, then, if we were to increase the frequency of
clinics in such as system – and indeed interventions as well. Nothing major, moving things from once a
fortnight to once a week or from once a week to twice a week. What effect would that have if we could
simultaneously address the backlog?
Think about it.
But Demand Will Increase!
Well, there is a very peculiar notion in healthcare
that if we improve patient service levels then demand will also
increase. We need to examine
this. This notion belongs on another
planet.
Would you ever wish upon yourself a serious
illness? Pretty damn unlikely. So if patient waiting lists decrease, and
service improves, are people going to become ill more often just to avail
themselves to the new levels of service?
Pretty damn unlikely also. So
where does this curious notion arise from?
One situation where it might arise from is if we are
currently failing to meet a real need (as opposed to a desire, or a want, or
what the marketers would call a latent need – one when you go out and buy
something you didn’t even know that you wanted). If we are failing to meet a real need and
we increase availability then of course there will be an apparent rise in
demand. However, that demand was
already present, it simply wasn’t being met.
Failure to meet a present and real demand is not a reason to restrict
services if it is possible to improve access those services using existing
resources.
There is another aspect to this apparent
paradox. Healthcare has very strong
negative reinforcing loops operating in it.
Failure to respond to a need at an early stage means that the need
when eventually met consumes far more resources. This might help explain why demand is
perceived to increase. It is not
necessarily the total incidence that is increasing but rather the severity of
the individual incidences once they reach an agreed level for intervention. We will return to this thought later.
You Don’t Understand We Have Too Many Acute Patients
Yes but, we have too many acute patients. This is an interesting problem. Some people become acutely ill
suddenly. Others become acutely ill
over time – time spent waiting for non-acute intervention. Governments find it hard not to fund acute
work and not so hard not to fund non-acute work, so we can guess why acute
work load is, in part, so significant.
Moreover, we already know how to break this vicious cycle – earlier
intervention. Too many acute patients
is really just an excuse. There are
others too.
The local population is too old; the local
population is too young. We have
diverse socio-economic challenges in our area, the area is too rural and
dispersed, the area is too urban and condensed. We can’t retain good staff, we can’t
attract talented doctors to our specialty, our specialty is under-recognized,
our staff are older and more expensive than the mean and so forth.
What about; our young doctors are attracted to the
large cities, or (if you are in a large city) our young doctors are attracted
overseas. And you can’t get good
locums anymore. Our buildings are over
40 years old, our buildings are an earthquake risk, the air conditioning is
antiquated, our total corridor length is much greater than anywhere else (its
true trust me). The flu season was
early/late this year – but never on time.
We have an unreasonable orthopedic load, we are a national center for
……… but this isn’t recognized in the funding.
In fact, you have probably worked out that there is
no end to this list. But don’t worry
these are not the problems either.
They may be symptoms of people’s frustrations, but they are not the
core problem, and therefore solving them is not the solution (but that has
never stopped anyone yet).
Let’s move on.
Cart Before The Horse
Improving the patient waiting list network is indeed
a fine ideal, especially if it ensures that we don’t miss an opportunity to
do an intervention because we didn’t have a patient ready – even though there
are 100’s of patients on the list (it happens!). However, the reason supply chain follows
production on these webpages is that we have to sort out our production first
if we are to improve our supply chain.
Now there are exceptions to this; for instance where we don’t own the
production stage or it is beyond our span of control or sphere of
influence. In these instances then,
yes, indeed we have to do our best in spite of the limitations. However, this
is not the case in public health systems, the production stage and the supply
chain stage are integral. So we should
address the production side first. And
if we can’t win there, then that shouldn’t be an excuse not to look at the
supply chain mechanics nonetheless.
In order to increase the production side we must
address a policy constraint. Let’s
have a look then at that.
There Is No Goal In Public Health
It has been said that if there is no goal then the
absence of a goal is the
constraint. We also noted in the
measurements section that the goal of a system is in fact open-ended. You can’t have enough of the goal. In contrast the necessary conditions that
support the goal can be viewed as having limits. Once we satisfy a necessary condition
additional satisfaction does not improve the rate at which the organization
moves towards its goal.
A problem arises however in not-for-profit
organizations of which a public health service is just one example. Scheinkopf notes that in not-for-profit
organizations “there is a tendency to believe that the measures are so
intangible and that attainment of purpose is such a subjective call, that
such measures are simply not discussed.
The focus ends up to be on measuring and managing the things we call
‘tangible,’ such as money (2).”
New
Zealand health boards must currently meet an 11% “capital charge” on some
types of new investment (the rate-of-return incidentally is one that some
public companies in the free market can currently only dream of). This means that the Government must pay the
health boards pro rata 11% too much to cover this capital charge, which the
boards then pay back to the Government showing that they are indeed
efficient. This financial efficiency
can be met by restricting access by raising the level (points) for non-acute
admissions. At best, meeting the
capital charge is a necessary condition and a perverse one at that.
Because
the capital charge is imposed upon the system from outside and must be met, it is a necessary
condition for success. However,
necessary condition aside, it is just a Government policy, this doesn’t mean
that its validity shouldn’t be challenged, nor the cost mentality behind
it. However the very real danger is
that once the necessary condition has been satisfied (boards run a “balanced”
budget) there is no driver for further improvement. The necessary condition, due to its
prominence, is mistaken as a goal – which emphatically it is not.
It is
no exaggeration to say that public health is missing a goal. Instead it has, as an objective, a
necessary condition – meet budget. We
can illustrate this further.
“The Nelson-Marlborough District Health Board confirmed all elective
surgery will be postponed for about six weeks over summer.
The moves come at a time when some patients are waiting up to five
years for non-urgent surgery, and the board is preparing to cut people from
its waiting lists if their conditions are not considered serious enough to
warrant treatment in the public health system.”
“The Health Ministry contracted the board to do fewer operations than
it had the capacity to perform. As a
result it was already significantly over budget less than four months into
the financial year.
The purpose of the cuts was to reduce surgery to contracted levels and
save money (3).”
If you believe
in reductionist/local optima viewpoint you will also believe that each
operation has a cost and by avoiding operations we can avoid all the costs
associated with them and thereby save money.
If you understand the systemic/global optimum viewpoint then you know
that such efforts will hardly save a penny.
Sure it will save on some variable costs. However, using the quote above as an
example, we should ask what will the buildings do for 6 weeks, what will the
staff do for 6 weeks, and what will the air conditioners do for 6 weeks? They are not variable expenses. And of course what is the final cost to the
system when the work is finally undertaken – is it more or is it less?
Let’s
have a look then at evidence of, not of postponement, but of removal from a
list.
“Many gallstone patients in Auckland must now suffer at least four
attacks of severe pain and vomiting in a year to qualify for surgery.
Or they must have two attacks of gall bladder inflammation, or
experience worse symptoms or complications.
Less that four pain and vomiting episodes, called biliary colic, and
you would probably fall below the cut-off point – set in response to
Government funding levels – for elective surgery at North Shore Hospital
(4).”
In the
same article but a different hospital.
“Waitemata officials started to introduce their new scheme in November
after struggling, like all district health boards, with having more patients
than can be treated. They hope to
extend it to other types of surgery later.
Under it, about 40 patients have already been taken off the surgery
waiting list because they are not considered sick enough.”
Note
that the qualifier is 4 attacks in one year – and then you go onto an
“elective” waiting list; but there was no mention of how long the list is until
intervention. Removing people from
waiting lists who are deemed not ill enough to warrant treatment conforms
exactly to one of Senge’s system thinking archetypes – eroding goals (5).
So I think
it is safe to say that the objective illustrated here is characterized by a
limiting necessary condition and that
necessary condition is to meet budget.
We don’t have a goal at present.
Who Should Set The Goal Then?
So if
we don’t have a goal at present in the health system, who should set the goal
then? Well, the answer is clear, the
owner of the system should set the goal.
And the owners are the taxpayers aren’t they? Sure, but the Government of the day
administers the health service on behalf of the taxpayers; so it is the
Government who is the proxy owner of the system in this instance, and it is the Government that should set the goal.
The
Government currently sets a number of necessary conditions that are financial
in nature because articulating a non-financial goal and the fundamental
measures to support it is deemed to be too difficult. But is it really that difficult? Let’s try.
How Do We Set The Goal?
Let’s
try and set a goal for public health so that we can move along back to our objective
of showing marshalling as a viable model for public waiting lists. How do we do that? How do we set the goal? I guess that we need to ask where we want
the public health service to be at the present. That would be a good place to start.
A politically
correct goal might then become; a timely and appropriate outcome. But what is the outcome? Is it community wellness? If it is community wellness, are we seeking
to maximize it? That certainly seems
open ended as a goal should be.
However, it might also imply incorrectly that funding should be
maximized and clearly there is a problem here because most people don’t want
taxes to increase which is exactly where the funding must come from.
Then,
how about; improve community wellness, as an appropriate outcome? Improving community wellness seems
sufficiently open-ended at this point in time (maybe even bottomless), we
could certainly do with a lot, lot, more of it. Why don’t we run with this for a while and
see if it will work for us. Thus the
trial goal for a public health system is to; improve community wellness now
and in the future. Let’s write that.
Establishing a
goal is fine; however, we now need to ask what are the absolute necessary
conditions or inputs that will give rise to this goal. In order to obtain this goal it seems that
there are at least 2 necessary conditions that we must satisfy. We alluded to these in defining the
goal. A timely and appropriate outcome
implies a timely and appropriate input.
The appropriate input could be pro-active
prevention or the reactive
intervention that we carry out.
The timeliness depends more upon availability at this moment than
anything else. So let’s add these two
necessary conditions to our goal.
It seems that
the appropriateness of the intervention isn’t so much in contention as the
timeliness. It seems then that one
necessary condition is currently satisfied – the appropriateness. Medical professionals do not appear averse
to taking up new approaches or technologies in either treatment or
prevention. However, the other
necessary condition – timeliness, isn’t currently satisfied.
In
fact, satisfying this non-financial necessary condition looks a little
untenable. The proverbial rock and a
hard place. We need to improve the
outcome – community wellness – with a level of availability and therefore
timeliness that many would consider is currently “insufficient.” It therefore would be too easy to write
another necessary condition leading into the current one saying “secure
sufficient funding” in order to increase the level of availability and
therefore increase the timeliness – however, it would be quite another thing
to actually receive that funding.
We
should also remember from the measurements page that a not-for-profit
organization such as a public health service must watch its operating
expenditure against its existing fixed level of funding least it runs a
deficit (6). So running in the red and
hoping is out as well. How then do we
ensure sufficient timeliness and maintain our operating expenditure at
the same time?
Let’s
go back to one of the most important statements in Theory of Constraints;
Productivity =
Throughput / Operating Expense
We have
muddied the water a little because our goal is now non-financial and
throughput, as defined, is a financial measure (sales – totally variable
costs excluding direct labor).
However, we can jury-rig another equation that will do just about as
well – we will substitute output for throughput;
Productivity =
Output / Operating Expense
We can
measure our output – patients. We can
measure our input – operating expense.
If output goes up and input goes down or stays the same then we have increased
our productivity and we have also moved towards our goal.
Let’s
be clear however, increased productivity doesn’t mean working harder. It does mean though, knowing sufficient
about the system, its dependencies, and the variability in and between dependencies
that we can protect the most valuable or most important part, that part that
we have the least capacity to spare.
Let’s make “not working harder” an explicit necessary condition to our
goal so that this aspect is not misinterpreted or misrepresented. Let’s draw it.
This then is the
goal and the necessary conditions for a public health system. We have identified a non-financial
necessary condition – timeliness – that is currently not being satisfied.
How Then Do We Measure Progress Towards The Goal?
Using
productivity as a measure of progress towards the goal is a bit of a blunt
weapon – in fact it is more an indication of the method than the measurement
that we should use. The fundamental measurement then is our
non-financial necessary condition, the one that we are failing to meet
currently – timeliness.
The Government – the owner of the system
– must set maximum national patient
wait-times that must be met. We
can measure this performance and it is non-financial. Moreover we can see that meeting an
increase in demand at static maximum wait criteria and funding must mean an
increase in productivity. Also meeting
a lowered maximum wait criteria at static demand and funding means an
increase in productivity. We can
measure progress towards or away from the fundamental measurement with two
local measures; patient-days-wait, and patient-days-late.
Yes But, The Government Already Uses Maximum Patient Wait Times
So how
is our proposal different then; the Government already uses maximum patient
wait times for many aspects of healthcare?
That is true, but how is the issue managed at present? Timeliness is currently managed not by
increasing productivity but by decreasing productivity. It is managed by raising the criteria for
consideration, so that the patient wait times may remain high and constant
but the level of unwellness in the waiting list becomes greater over time and
the number of people treated becomes fewer and fewer – we saw direct evidence
of this in the earlier quotes.
Moreover,
the maximum wait times are measured purely by the number of patients. Our local measures; patient-days-wait and
patient-days-late are much more revealing about the true nature of the
waiting list. But we ourselves must
wait a little before we can investigate this aspect further.
Broader Issues
First,
however, there is a broader aspect to productivity that applies to a public
health system. Public health systems
are not “stand-alone,” public health productivity impinges upon the
productivity of the whole nation/state.
Consider for instance a country with first class productivity in one
of the primary industries such as; agriculture, fisheries, forestry, or
mining, or first class productivity in any one of the secondary manufacturing
industries. These activities generate
national income. Why do we constantly
strive to increase the effectiveness of these national income generating
activities if a major consumer of this income, healthcare, operates on
assumptions once thought valid in a previous century – and I mean the 19th
not the 20th century. Other sections
of the economy have moved on.
Currently
most hospitals are implementing some form of patient information management
system and some form of enterprise-wide scheduling system. Enterprise-wide scheduling systems were
described in the section on production, essentially they are finite
scheduling solutions based upon a reductionist/local optima approach. As we know from manufacturing, reliance on
these techniques depends on excellent data integrity but generally results in
increased work-in-process because they fail to protect the system from
variation even through they have ample protection embedded within the
schedule – in short they fail to protect the constraint – output goes down,
work-in-process increases. Increased
work-in-process in this environment means more patients-in-waiting and
waiting for longer.
The
reality is that in both manual and automated scheduling systems many theater
opportunities are lost due to poor protection of the constraint. These losses are buried in the general
theater utilization hours, we have to scratch the surface to find them, but
they are real, and they do present a real opportunity to improve output at
current operating expense. And that
brings us to our critical erroneous assumption.
A Critical Erroneous Assumption
There
is an assumption that we totally fail to challenge – the assumption that we
are sufficiently productive and that we can not improve further. The pervasiveness of this assumption can be
demonstrated every time someone says; ”yes we could process more patients if
only we had access to more funding.”
The hospital in the earlier quotation is very likely to have
sufficient productivity – it could do more operations than contracted for
(don’t be fooled by contracted cost, you need to see the flow of money in and
out of the system). The real issue is
that if one hospital provides a better level of service than others it is
defying a charter that requires equitable access to all people in all parts
of the country. This means other
hospitals are currently not as productive.
The most productive hospital and all other hospitals in between must
be hobbled to the level of the least productive hospital in the system in
order to ensure equitable access.
Think about it.
Yes
but, all the other hospitals could improve to the same higher level couldn’t
they? Well you would think so; this
would be the ideal situation. However,
there are two reasons why this doesn’t happen. Firstly under a reductionist/local optima
costing process, if we improve our productivity our unit costs will go down
and next funding round we will receive less to do the same number of
procedures rather than the same amount to do more. This is a very real fear of hospital
management.
The
other reason is more important.
Currently in the health system there is little knowledge of the rules
of engagement that we first saw in the measurements section. Let’s repeat them here;
(1)
Define the system.
(2)
Define the goal of the system.
(3)
Define the necessary conditions.
(4)
Define the fundamental measurements.
(5)
Define the role
of the constraints.
As you
can see, in health at present we have just a few financial-based necessary conditions;
we are missing so much of the whole picture.
Why won’t we do this if it is so simple? Are we scared? No, I don’t think so. It might be that many people simply don’t
know how to evaluate the role of the constraints in this system yet – or that
they do know how to but common practice runs counter to this.
Well,
fortunately we are using our common sense rather than common practice, so
let’s continue with our examination of patient waiting lists and
marshalling. We really ought to stop
looking at the problem and start looking at the solution.
Semantics
How can we describe the actions of the nodes in the
patient waiting list network? We have
suggested that the supply chain here is a marshalling supply chain, or more
accurately marshalling and consolidation. Patients are marshaled in by referral from
numerous local doctors and consolidated into specialties and then lists. The consolidation is carried out in
accordance to a push-to-need basis.
General practitioners feel that a particular patient
needs specialist expertise (and it is the expertise of the general
practitioner to know when this is required) and “launches” the patient into
the process and hopes that the outcome will be favorable (and timely). As in all other supply chain solutions here
we need to replace this with some sort of pull-and-replace system. The constraint, the most valuable and
limited resource, must pull the patients via the waiting list network to a
position where they are ready to receive intervention as soon as possible. Maybe we should call this a pull-to-cure or
a pull-and-cure system.
If at some future point in time there are
insufficient patients to fully load the system then we are moving in the
right direction. And if currently we
can at least stop waiting list expansion (without fiddling with the criteria)
and affect a contraction then we know that we will eventually reach that
future point. The key is that the system must initially pull at a
faster rate than the incidence rate of the problem. How are we going to achieve that?
Well, unlike the distribution problem or the log
marshalling problem, where the constraints in the system were non-production
constraints, here the constraint is a production constraint. The intervention produces something; it
produces favorable outcomes – but currently it produces an insufficient
number of them. Thus we need to break
our solution into two subsystems;
(1)
Production
subsystem – Intervention.
(2)
Supply chain
subsystem – Patient Waiting List
Network.
And as you know, common sense tells us that the
answers are already in the system. So
let’s have a look.
General Solution – Part One; Intervention & Drum-Buffer-Rope
A solution with an unusual name and very powerful
consequences. Drum-buffer-rope is the
Theory of Constraints production solution, it is a logistical solution. It is fully described in the section on
production; it is really a way of thinking more than anything else – a way of
thinking that enables substantially increased output from constrained
situations without recourse to additional funding or manpower. There is a good example from neurosurgery
in the United Kingdom (7).
The Radcliffe Infirmary went from canceling 64
elective neuro-surgical procedures over a 3 month period to canceling none in
the same period the next year.
Out-of-hours operations were drastically cut and output went up by
16%. Would a reduction in non-acute
cancellations be useful to you? Would
reduced out-of-hours operating be useful to you? Would an increase in output be useful to
you? This is not a trivial solution.
We could get away here with just briefly mentioning
some aspects of the drum and the buffer.
The drum is the constraint, it beats out the rate at which the system
works at. In our case the constraint
is most probably a surgical theater or a medical bed. Let’s draw this using our systemic model
that we developed earlier. The
constraint – our drum – is the rate limiting step.
A buffer is quite tightly defined – in this
situation it is a measure of time, the time for a patient from the moment of
admission to the beginning of intervention.
To properly exploit our scarce resource in surgical cases we must
admit patients in good time so that they are always ready for
intervention. However, to properly subordinate
the scarce resource we must also not admit too many patients at any one time.
Watch the distinction; it is very, very,
important. After all, one of our local
ward measures is average “bed nights” or some such similar measure. Having a lot of patients for a short time
is locally positive; having few patients at any one time for longer is
locally negative. The current local
measures do not support the global objective of the system. If we have too many patients waiting for
too short a period we will absolutely miss some interventions “because the
patient wasn’t ready”. Hell, the
patient was ready. It was the system
that wasn’t ready. Our output goes
down.
If we have fewer patients waiting longer between
admission and intervention we won’t miss an intervention, output goes
up. System operating expense remains
the same. It seems counterintuitive,
but if it was intuitive we would have done it – right?
We could summarize this as follows;
Introducing constraint buffers and
decreasing process batch size automatically
aligns the process with the goal
What do we mean by process batch? Well, I guess that an operating list is a
process batch. The other sort of batch
size that we might refer to is a transfer batch, and in a service operation
like this a transfer batch will be in units of 1 – the patient. To decrease the size of the process batch
means that instead of operating all day only on Tuesdays for instance – and
causing uneven ward work-load, how about Tuesday and Monday and Wednesday
mornings instead. Forget the detail,
it is simply that we are trying to decrease the number of patients at any one
point and increase the frequency.
Really we are trying to better balance the
flow. Again be careful, we never
balance capacity but we always try to balance the flow – just the opposite
from local optimization. Of course there
are practical limits to this, but we should make sure that the limits are
real and not policy. We need to make
sure that the policy is not some assumption rooted in the 1960’s or the
1950’s. Increasing the production
frequency is the primary driver that flows on back up into the supply chain –
the patient waiting list network. We
had better look at that next.
General Solution – Part Two; Patient Waiting Lists & Replenishment
The constraints in this system are in the
intervention stage, the stage located within a hospital, and this is the
stage that we must exploit. Therefore,
all other stages are non-constraints and we must subordinate these to the
constraint. The patient waiting list
network, like the log marshalling network, must subordinate to the constraint
until such time as there is a substantially reduced waiting list and
additional patients present at admission at a rate that is less than rate of
intervention.
To properly subordinate we must ensure that the
waiting list network never “starves” the production node. It starves the production node when it
fails to produce a patient for admission in good time. It happens.
Talk to a scheduling clerk and you will hear stories
like “I need a patient for the operating list on Tuesday fortnight – and I have
rung and rung around the patients on the waiting list but do you think that I
can find one!” It’s amazing, but true,
and very frustrating for those trying to do their very best. Thus our intuition as well as a good dose
of common sense suggests that we should move patients through the waiting
list network as quickly as possible to the place of greatest aggregate safety
for both the patient and the system – just prior to admission.
In fact, in medical cases, it is likely that the
supply chain prior to admission will also form part of the constraint
buffer. This type of situation is not
so uncommon in manufacturing especially where the first step in the process
is so capital-intensive that to “buy another one” is prohibitive. The expense in both cases here is the
bricks and mortar and the considerable number of skilled staff required to
run the facilities around the clock.
Let’s consider some questions then;
What would happen if we could increase the frequency
of clinics prior to acceptance for intervention? As an example, instead of holding a clinic
once a month for a day (because it is efficient for staff) what about holding
a clinic for half a day every fortnight, or until mid-morning every week? It is kind of like waiting for
one 747 or one of two 737’s. The total
waiting time is less for the smaller more frequent service.
What potential could that have?
What about if we could remove nodes completely or
combine nodes so that they occur at the same time and place, maybe carry out
some tests on the same day in the same place for instance? What potential could that have?
Hold on to these thoughts for a moment.
As in linear supply chain, distribution, and log
marshalling, we need to introduce into this system the Theory of Constraints
supply chain solution – replenishment.
If you are unfamiliar with fixed-frequency variable-quantity
replenishment then please check the explanation on the replenishment page –
it is important.
Each node in the waiting list network becomes a
buffer for the next node containing sufficient patients to ensure that it can
supply the next node down while it pulls patients from the next node up. The constraint, the drum, in the production
portion is the originator of this pull signal. Let’s draw the supply chain portion then.
If we carry out replenishment correctly we will move
safety to the area that is most important, the area closest to
admission. Let’s draw that.
And if we increase the frequency of the clinics and
other waiting list processes then these buffers can be very small indeed and passage
from one end of the list to the other will be very rapid. That way once a referral is made the
patient can move through the system quickly and be available to be “worked
upon” – either an operation or a medical treatment if required as soon as possible. We can summarize this;
Introducing replenishment buffers and
increasing resupply frequency automatically
aligns the process with the goal
Now if we return to those thoughts that you are
holding on to, there is probably a big red flag saying’ “yes but there are
too many patients-in-waiting in the system now to make such a process
work.” Yes there are. But unless we get the appropriate mechanism
in place even before it is apparently needed things simply can’t
improve. If we were to size our buffers
today we would find that they are way over-full. But at least we know where we are heading.
In every situation where the system is drowning in
work-in-process, people are reluctant to give up the system that causes the
work-in-process that drowns them; “because there is so much work in the
system that doing this will have no effect.”
Exactly wrong.
We recognize how chaotic huge numbers of
patients-in-waiting are because periodically we “fiddle” with the criteria to
try and reduce the numbers. Unfortunately
that just feeds a negative reinforcing loop – we get more acute
patients. The only solution is to
maintain the criteria and increase productivity. You will be very, very surprised at the
effects. Patients are not logs, and
they are not cars, but that doesn’t mean that we can dismiss the
principles. Well in fact we can
dismiss them, but they won’t dismiss us.
We need to cut the strong negative reinforcing loops
and replace them with strong positive reinforcing loops. We need to look for systemic/global optimum
solutions not reductionist/local optima solutions. We need to look at trying to reframe the
environment and not to continually applying band-aids. The solutions are already in the system,
and those solutions although they represent change, represent a change in
meaning only.
Let’s Put It All Together
Let’s try and pull all of this together by showing
the system in its proper order; the supply chain patient waiting list network
feeding into the intervention stage.
Likely as not there is another supply chain at the other end –
district nursing, but let’s leave that for another day.
Doing this it becomes clear that there is a feedback
between the two. We need to make sure that
we don’t ever waste our scarce intervention stage, and at the same time we
need to ensure that the supply chain doesn’t ever fail to provide an
appropriate patient at an appropriate time.
Local Performance Measures
Earlier we parachuted in a goal for public service
healthcare and looked at how to measure whether we are moving towards the
goal or away from it. The goal and
necessary conditions might provide a measure for a whole system, but how do
we know in a system as complicated as a large public hospital or a district
health board that the parts – the subsystems – are also aligned and moving in
the right direction? Really we are
asking; how do we know that the non-constraints are subordinated to the
constraints. For this we need local
performance measures.
Another way of looking at local performance
measurements is that they should judge the quality of the execution of the
exploitation plan (8). What is the
plan in this case? Surely it is to
provide a timely and appropriate outcome.
We can’t comment here on the appropriateness but we certainly can on
the timeliness.
Timeliness
is reflected in two particular measures;
(1)
Unit-days-wait.
(2)
Unit-days-late.
In fact of the two, late-days is more important, but
waiting-days always seems easier to explain first. These two measures are simply a
re-verbalization of the two measures that we have consistently applied to any
subsystems in production or supply chain processes. In fact, we used these exact measures to
introduce the concept of local performance in the measurements section. Let’s have a look at these again in detail.
Let’s
say for instance that a certain outpatients’ clinic for referrals has 50
people on the waiting list at any one time and last year these people waited
on average for 12 weeks, this year we still have 50 people on the waiting
list at any one time but they now wait on average for 16 weeks. What is the total waiting time here?
Well,
we know that last year that there was on average 12 weeks by 5 days per week
by 50 people = 3000 patient-days-wait.
In comparison, this year there are 4000 patient-days-wait on the
list. Is the performance better or
worse? It’s worse of course. If we can stop patient-waiting-days from
increasing, or better still reduce it, then we must have improved the
system. Let’s add this measurement to
a linear representation of our health system (both patient waiting list
network and hospital intervention).
So waiting-days
is one measure that we can use to evaluate a subsystem with, or indeed even
departments within a subsystem.
Another
aspect of timeliness is that regardless of how long we must wait, do we still
receive attention “in time” at the end of the wait or are we late? Let’s continue with our analogy. Let’s assume that last year our patients
were expected to be seen by a specialist within a recommended guideline of 12
weeks of referral. Some, however,
weren’t seen within this time-frame.
Let’s say that 3 patients were seen after 13 weeks and 2 were seen
after 14 weeks. Again we might argue
that just 5 out of 50 or 1 in 10 patients were not seen within the
recommended guidelines. However, a
more realistic measure is that 3 were 1 week late and 2 were 2 weeks
late. This gives us 1 week by 5 days
per week by 3 patients plus 2 weeks by 5 days per week by 2 patients = 35
patient-days-late. Is this bad? Of course it is, it should be zero. We can add this measurement to our system.
So
unit-days-late is another measure that we can use to evaluate the performance
of a subsystem with – anywhere that there is a clear hand-off to another
subsystem.
If the subsystems are aligned to the goal of the
system we should expect patient-days-wait to decline and patient-days-late to
be zero. Now, these measures are
excellent at monitoring subsystems – nodes in the waiting list network for
instance – but there is no reason why they can not be used for the whole
system as well. If we use them for the
whole system, maybe divided by specialization, then they also provide us with
a non-financial measure of system success.
They don’t measure wellness in the community directly but rather
indirectly as the absence or decrease in unwellness. We should strive to reduce the unwellness,
wouldn’t you agree?
Let’s
hope that one day we can see in district health board meetings a 12-24 month
running graph tabled for each major subsystem showing patient-days-wait and patient-days-late. Then
we will know at a glance whether we are all moving in the right direction or
not.
We can
test for obfuscation with a simple graph.
In the graph below we have some initial criteria for
admission to an elective list – patients who have managed to reach the
“access threshold.” Over time the
total number of patient-days-wait increases as the effects of system
dependency, variability, and an absence of knowledge of how to protect the
constraint cause output to be lower than input into the list.
At some time the length of wait and the number of
patients waiting becomes too great.
There is a reassessment of the “access threshold” and a limit is
imposed.
The limit is imposed via new criteria for the access
threshold. Some of the previous
patients are “parked” in new categories such as the “residual waiting
list.” Nevertheless, the
patient-days-wait continues to increase as before, and for the same reasons,
but now artificially depressed for a time by the adjustment.
Patient-days-wait increases, that is until, once again,
the length of wait and the number of patients waiting becomes too great. There is another reassessment of the
“access threshold” and a new limit is imposed.
The new limit is imposed via new criteria for the
access threshold. Some of the previous
patients are once again “parked” in new categories with new and different
names such as “active review.”
And of course once again, patient-days-wait
continues to increase as before, and for the same reasons, because we have
still failed to address the fundamentals underlying the problem.
The important point is that if we extrapolate from
the earlier data we should be able to get a good estimate of the “real”
patient-days-wait had the criteria remained unchanged. An apple with apple comparison. Let’ draw what we mean below.
So even if people say the we can’t compare different
sets of criteria, because the criteria themselves have change, that is the
make-up of the rules in the “access threshold,” then we don’t need to allow
that to divert attention away from the very real conclusion that the wait
list based upon the initial criteria is much greater and can be estimated
with relative ease.
What should we see then, if we instead address the
fundamentals in the system? What
should we see if we identify, exploit, and subordinate to the system’s
constraints? Let’s have a look.
Rather than impose a new access threshold and “bump”
people off the list, we have identified and protected our weakest link and
increased its output.
Patient-days-wait haven’t been reduced to zero, but they have been
brought under control, the backlog removed and a new, lower, and stable rate
achieved. At each steep downward step
indicates that a constraint in the system has been overcome and output has
risen (and days-wait decreased). Then
after some time the cycle “flicks” from being negative and vicious to being
positive and virtuous (removal of excessive waiting time induced acute load)
and the rate of intervention becomes greater than the rate of admission to
the list.
Don’t allow the “we can’t compare criteria” to
obfuscate the real issues.
Yes But, Do We Do Tonsillectomies Or Hip Operations?
So far
we have mainly considered patient-waiting-days as some undifferentiated
mass. Of course different
specializations will “own” different lists.
Different lists have different degrees of difficulty. We could knock-off the easy ones first and
substantially reduce total patient-waiting-days. Well such things do happen. But what do you think will happen
next? We will run out of easy jobs to
pick. Then we get down to more serious
matters. The more responsible and
boring approach is to work away at the total mix.
Patient Waiting Lists Are Not Simple Replenishment
Patient waiting list networks are not simple linear
replenishment through a series of dependent nodes. They are first and foremost a convergent
marshalling and consolidation supply chain.
Moreover, the characteristics of the supply chain are strongly affected
by the characteristics of the integral production step – the
intervention. Why can we be so sure
that this isn’t simple replenishment?
Let’s have a look;
(1)
There is a
many to many relationship from the source nodes, the local doctor, to the
specialist assessment nodes.
(2)
We must
consolidate from numerous source nodes to a limited number of intermediate
nodes.
(3)
We must
currently subordinate the whole system to the intervention stage.
(4)
We must
position the maximum buffer protection in the place that best protects the
whole system, just before the intervention node.
(5)
We must
protect the more disaggregated source nodes differentially by a higher
frequency of resupply.
Of these, positioning the protection for the system in
the place that does the most good is probably the most important; this helps
avoid a major problem, having a lot of patients on the list but no one
immediately available.
Let’s Think It Through
So what
do we do first? Two things; a chicken
and egg scenario. Here goes.
We must
raise productivity at the intervention node.
We do this by overcoming our key policy constraint (or simply ignoring
it), implementing a proper global production approach – drum-buffer-rope (a
spreadsheet will do), and improving flow from our patient waiting list
network. However, to improve our flow
from our patient waiting list network we must raise productivity at the
intervention node and implement replenishment and buffer management in the
supply chain. Yes it’s a circular
argument. And do you know how to break
it? Just go out and do something,
somewhere, anywhere. It’s shockingly
simple, do something rather than talking it to death and you will begin to
move the system towards its goal.
Once
you have tackled this crux, increasing productivity will become
self-reinforcing; improving flow in the patient waiting list network will
also become self-reinforcing. Think
smaller batches, either in time or quantity and you are on your way, think
global results not local efficiency and you can deduce the solution for
yourself.
What Are The Unavoidable Outcomes?
If we
approach the problem as suggested then the unavoidable outcomes are;
(1)
Increased
patient output.
(2)
Absence of
“no-shows” in wards/theater (zero patient-late-days).
(3)
Decreased
waiting lists (decreased patient-waiting-days).
(4)
Decreased
waiting times on the list (decreased patient-waiting-days).
(5)
Lower
severity of illness at intervention.
(6)
Shorter
overall bed stay.
(7)
Lower overall
operational expenditure.
We will arrive at a position where we can
pull-to-need or pull-to-cure with increasing rapidity.
But Wait, Reality Isn’t This Simple!
If this explanation seems quite simple and
straightforward then that it excellent; then we know that we have developed a
broad understanding of patient waiting list networks with replenishment. If experience tells us that reality is much
more complicated than this generalized explanation, then that too is
excellent. Now we are in a position to
better investigate how to apply this methodology to our own particular
situation.
Give Us Some Examples
The Radcliffe Infirmary whose results we have
already discussed is one of the published examples of Theory of Constraints
in healthcare. The closest
approximation to this approach in New Zealand that I am aware of is the
operational and logistical expertise that the Health-Med Group has brought to
the privately run Venturo public urology service. Although Theory of Constraints was not used
there is so much plain common sense that this example should be very much
better known.
In 1993 Venturo was awarded a population-based
contract to undertake the treatment of all referred patients in need of
specialist urology services within its operating area. Traditionally services had been (and still
are) contracted for in-advance by the number of procedures or pre-determined
volume rather than a fixed fee based upon the total population served.
During the first year the service sought to minimize
its risk by dealing with the existing waiting list backlog. Outpatient visits increased by 32%,
inpatient admissions increased 9%, and day-patients increased 16%. In total 70% more people were treated from
the waiting list than the previous services had in the previous year. Total waiting time between GP referral and
operation was reduced by 44% from 37 weeks to 26 weeks (9).
How did they do this?
Good Management.
Clinics, wards, and operating theatres have dedicated urology staff
and the booking systems ensure maximum utilization of both staff and
facilities. Some investment was made
in ultrasound and laser equipment which has improved the ease and timeliness
of diagnoses and treatments. Protocols
were developed for GP referral.
However, one of the most significant gains was the standardization of
clinical practices amongst urologists (9).
Now read back through the earlier arguments. It appears that Venturo ensured that
patient and system risk was reduced and managed by moving patients to the
place of maximum safety – pre-admission or thereabouts, and by increasing the
output of the intervention stage. Its
just common sense. More people were
treated sooner, and access to urology specialist services increased (9). Moreover the contract fee for this service
has remained fixed for the last 7 years.
Summary
There is a very powerful policy constraint in place
at the moment which we must break. We
must be prepared to demand that each hospital tries to raise its productivity
to the level of the very best. Clearly
we can’t all be the very best. But the
spread, or tail, down from the very best must be quite tight. Currently, rather than raise productivity,
the system policy actively hobbles each hospital to the level of the very
worst.
We don’t’ need “free” enterprise to make this
work. We do need, however, some enterprise
of thought. If we look at this at a
national level we buffer our health risk through our taxes. We gain maximal benefit from minimal
payment. It would be a shame to allow
a profit margin to accrue to “free” enterprise just because the public enterprise
is constrained by its own well intentioned policy.
In order to raise productivity we must adequately
exploit and subordinate the very valuable and limited capacity that we
already do have. We can do this
without recourse to additional funding. Part of the exploitation and subordination
is to have patients ready for intervention from the patient waiting list
network as soon as possible. We can do
this by implementing a change in mind-set away from local efficiency measures
and towards global effectiveness measures.
A crude initial indicator is whether our total
patient-waiting-days on a list is decreasing or not. There is no other measure that doesn’t seek
to obfuscate reality is some fashion.
Returning to the Radcliffe Infirmary example. One further outcome was produced; "The
difference was visible, the staff just looked much better … and happy staff
means happy patients (7)." One of
our necessary conditions for success in Theory of Constraints is to provide
employees with a secure and satisfying workplace now and in the future. It seems that at Radcliffe they achieved
all of the necessary conditions.
Now why can’t you?
References
(1) Stein, R. E., (1994) The next phase of
total quality management: TQM II and the focus on profitability. Marcel Dekker, pg 36.
(2) Scheinkopf, L., (1999) Thinking for a change:
putting the TOC thinking processes to use. St Lucie Press/APICS series on
constraint management, pg 25.
(3) Surgery slashed in bid to reduce hospitals’
costs. New Zealand Herald, 20th October
2003.
(4)
Galling delays for surgery. New Zealand Herald, 26 February 2004.
(5) Senge, P. M., (1990) The fifth discipline:
the art & practice of the learning organization. Random House, pp 383-384.
(6)
Newbold, R. C. (1998) Project management in the fast lane: applying the
Theory of Constraints. St. Lucie
Press, pg 228.
(7)
Phipps, B., (1999) Hitting the bottleneck.
Health Management Magazine, February, pp 16-17.
(8)
Goldratt, E. M., (1990) The haystack syndrome: sifting
information out of the data ocean.
North River Press, pp 144-155.
(9)
Hoskins, R., Blaxall, B., and Sceats, J., (1996) Venturo, evaluation of a
pilot specialist budget-holding contract: report of the first two years. Midland Health, Health and disability Analysis
Unit Evaluations Series Number 1, 29 pp.
This Webpage Copyright © 2003-2009
by Dr K. J. Youngman
|