January 21, 2015

Forecast Enrollment Rate in Clinical Trials

Instinctively, when there are more investigators/sites being deployed for a trial with a defined number of patients needed, we should expect shortened enrollment cycle time. This sounds right, but is it really?

On the surface, every veteran in clinical trials can tell you a lot about the different factors impacting patient enrollment cycle times. A narrowly defined patient population in a particular disease, for example, can limit the ability of sites to identify and recruit patients in a defined period of time and therefore lengthen enrollment cycle time. An experienced and successful investigator/site is better able to enroll qualified patients than an inexperienced investigator/site. A higher proportion of experienced sites in the pool of sites deployed in a clinical trial can result in shorter enrollment cycle times.

Here are two different scenarios:

As clinical development organizations are under pressure to deliver new products faster, senior management seems happy to apply “unlimited” resources behind pivotal clinical trials evaluating promising drug candidates. The simple logic is to add more sites to the pool for enrollment, aiming to proportionately shorten enrollment cycle time. But realistically, how often does this shorten enrollment cycle time? The simple answer is: rarely, if at all.

In another common scenario, when we are transitioning to a Phase III program after a successful Phase II program, we often “extrapolate” the operational results from the Phase II trial(s) to the Phase III trial(s); we use the enrollment rate from the Phase II trial(s) to calculate the number of sites needed for the Phase III trial(s), hoping to achieve enrollment cycle times similar to those in Phase II. That is all fine, except that the actual enrollment cycle time is unlikely to be close to the calculation. The enrollment cycle times are generally substantially longer in this situation.

We have long noted that adding extra sites to a clinical trial has only a limited impact on enrollment cycle time.¹ We naturally want a better, in-depth understanding of the issue: is there a pattern between the number of sites deployed and enrollment cycle time? If the answer is yes, is it possible to define that pattern in a simple and universally applicable mathematical relationship?

Interestingly, similar phenomena exist in other areas. When we track the growth of a school of fish, we find that the average size of the fish grows rapidly in the earliest days after they hatch. The incremental increase in their size diminishes with each additional time segment. Eventually, the average size of the fish hits a ceiling; they no longer grow in size.

Similarly, when we charge a battery, we can relatively quickly get to, for example, the first 50% of the charge. The charging speed then slows down until the battery hits a ceiling at some point.

We know that the fish growth pattern has been thoroughly studied by ecologists, and the pattern to charge a battery has also been thoroughly studied by physicists. Could we possibly borrow what they have learned and apply it to understand the relationship between investigator sites and enrollment rates?

We know it is never easy to find any pattern in clinical trial planning and execution, for a simple reason: each clinical trial is distinctly different, and there is no such thing as two identical clinical studies.

This article is a part of the author’s integrated effort to build a conceptual structure for managing study operations in clinical development by focusing on forecasting enrollment rate at the clinical trial level (clinical trial enrollment rate, CTER) and site level (gross site enrollment rate, GSER). We are able to establish a relationship between the site activation process and site enrollment performance.

This article will also establish the relationships between these concepts. They are truly integrated components of an increasingly comprehensive conceptual framework.

Method and approach

Clinical trial enrollment rate (CTER, number of patients per trial per month)

From a clinical development database created by the author, we built a sub-database of trials meeting the following inclusion criteria:

Interventional

With 10 or more sites

Started in year 2000 or later

Completed enrollment at the time of analysis

And we excluded the following trials:

Extension trial

Registration trial

Trials including healthy subjects

Trials with expanded access

The sub-database of relatively “homogeneous” clinical trials for a single metabolic disease condition looked like this:

We took the following steps to derive this chart:

Focus on trials with a single disease condition as primary condition

Put the clinical trial into baskets according to number of sites:
10 to 25 sites

26 to 50 sites

51 to 100 sites

101 to 200 sites

201 to 400 sites

401 to 800 sites

801 or more sites

Build a data table to pair median number of sites and median of trial level enrollment rate (CTER, clinical trial enrollment rate, number of patients enrolled per month):

Plot the data pairs in a chart
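The basket-and-median steps above can be sketched in a few lines of Python. This is only an illustration of the procedure as described: the function names and the sample trials are made up, and the real analysis would read trials from the author's database rather than from a hard-coded list.

```python
from bisect import bisect_left
from statistics import median

# Upper bounds of the site-count baskets listed above; the last basket is open-ended.
BASKET_UPPER_BOUNDS = [25, 50, 100, 200, 400, 800]
BASKET_LABELS = ["10-25", "26-50", "51-100", "101-200", "201-400", "401-800", "801+"]


def basket_label(n_sites):
    """Return the basket label for a trial with n_sites sites (10 or more)."""
    if n_sites < 10:
        raise ValueError("trials with fewer than 10 sites are excluded")
    return BASKET_LABELS[bisect_left(BASKET_UPPER_BOUNDS, n_sites)]


def median_pairs(trials):
    """Pair median site count with median CTER for each non-empty basket.

    trials: iterable of (n_sites, cter) tuples, with cter in patients per month.
    Returns a list of (label, median_sites, median_cter) in basket order.
    """
    grouped = {label: [] for label in BASKET_LABELS}
    for n_sites, cter in trials:
        grouped[basket_label(n_sites)].append((n_sites, cter))
    pairs = []
    for label in BASKET_LABELS:
        rows = grouped[label]
        if rows:
            pairs.append((label,
                          median(n for n, _ in rows),
                          median(c for _, c in rows)))
    return pairs


# Made-up trials for illustration only: (number of sites, patients per month).
sample_trials = [(12, 4.0), (20, 6.0), (24, 5.0), (40, 9.0), (900, 30.0)]
pairs = median_pairs(sample_trials)
```

The resulting pairs are the points that would be plotted, one per basket. Using medians rather than means keeps a single unusually large or fast trial from dominating its basket.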

Going through the same steps, we can draw a similar chart for trials in a single respiratory disease condition:

Or for trials in a single neurology disease condition:

As a matter of fact, we can draw similar charts for a group of clinical trials in every single disease condition, when the sample size is big enough, and the disease condition is “pure” enough.

In each of the charts, we see a generalizable pattern: as more sites are added to a clinical trial in the same disease condition, the CTER increases. However, for every equal number of sites (N) added, the benefit to CTER diminishes. Eventually, the CTER will hit some sort of ceiling: the benefit from adding more sites becomes negligible.

The mathematical relationship is exactly the same as the one used to describe the growth of a school of fish in the ocean.

Each and every one of these charts seems to have distinct sizes and shapes. For the trained eyes of a mathematician, there is a simple equation to include all the charts:

Where:

We can do the same for the other two charts:

From what we know now, there is no “proportionate” relationship between number of sites and CTER. That is to say, the relationship between sites and enrollment rate is not linear. With all other factors equal, adding sites to a clinical trial can increase CTER, but each added site contributes less than the one before it.

In other words, there is an operational boundary within which we have to plan and execute clinical trials. When we keep adding sites to a study, we will hit a ceiling at some point, beyond which there is no measurable gain in enrollment rate. It is safe to say that there is a limit to how far we can shorten enrollment cycle time by adding investigator sites. (The details of the operational boundaries are discussed in mathematical terms in the appendix.)

But why? If we add more sites to a trial relatively homogeneously, and assume that each site behaves in the same pattern as the others and does its job in recruiting patients for the trial, why can’t their contributions simply add up to a “proportional” (linear) increase in CTER?

Gross site enrollment rate

The fact is, as we add more sites to a trial, participating sites can no longer behave in the same pattern as before. Simply put, the ability for individual sites to recruit and contribute patients to the trial is suppressed continuously as more sites are added to a clinical study, when other factors are equal.

Using the same approach as we just used to understand CTER, we can learn more about site level enrollment rate (GSER, gross site enrollment rate or number of patients per site per month).

Starting from the same sub-database used to understand CTER, we took the following steps to build the chart showing the relationship between number of sites (N) and GSER:

Focus on trials with a single disease condition as primary condition

Put the clinical trial into baskets according to number of sites:
10 to 25 sites

26 to 50 sites

51 to 100 sites

101 to 200 sites

201 to 400 sites

401 to 800 sites

801 or more sites

Build a data table to pair median number of sites and median of GSER:

Plot the data pair in a chart:

Going through the same steps, we can draw similar charts for trials in a single respiratory disease condition (left), and trials in a single neurology disease condition (right):

These charts have different sizes and shapes. But the pattern is relatively simple: as the number of sites used in a set of clinical trials for a single disease condition increases, GSER decreases. It is not a linear relationship. Rather, GSER drops much more quickly when the clinical trials involve a smaller number of sites. It stabilizes at a certain level when the clinical trials become big enough.

Again, we take a shortcut by using the mathematical relationship behind this pattern:

For the same metabolic disease indication as we analyzed in CTER, we can establish the relationship as shown in the following charts:

We can do the same for the respiratory disease condition:

As well as for the neurology condition:

In the second scenario, as mentioned at the beginning of the article, we cannot simply apply the site enrollment rate in a usually smaller Phase II clinical trial to a usually much larger Phase III trial. The GSER for a smaller Phase II study, when other factors are equal, is larger than the GSER for a larger Phase III trial. When we try to extrapolate the operational results from a Phase II clinical trial to a larger Phase III clinical trial, and use the GSER to predict the enrollment cycle time for the planned larger Phase III study, we end up with disappointing results. We will have longer enrollment cycle time, and often have to launch a “rescue mission.”

Discussion

As previously mentioned, there is no “proportionate” relationship between number of sites and clinical trial enrollment rate (CTER). That is to say, the relationship between sites and enrollment rate is not linear. When other factors are equal, adding sites to a clinical trial can increase the trial-level enrollment rate, but the incremental benefit shrinks as more and more sites are added.

Let’s use Parkinson disease clinical trials as an example:

When we plug in CTER=10 patients per month in the chart, we get N=24

When we plug in CTER=20 patients per month in the chart, we get N=58

If you are a math geek, you can calculate the number of sites by plugging the CTER into the following equation without the aid of the chart:

In other words, when other things are equal, if we want to double the trial enrollment rate in order to cut enrollment cycle time in half, we need more than twice as many sites in the pool (58 sites instead of 48). It is important to note that this is just an example to illustrate the concept. In reality, it is not usually possible to cut the enrollment cycle time by half.
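For readers who prefer to compute rather than read values off a chart, the inversion can be sketched as follows. The functional form used here, CTER = A(1 - e^(BN)) + C with B negative, is an assumption: it is one saturating curve consistent with the appendix's statement that CTER approaches A+C as N grows. The constants A, B, and C in the example are hypothetical values picked so that the output lands near the 24 and 58 sites quoted above; they are not the author's fitted constants.

```python
import math


def cter(n_sites, A, B, C):
    """Assumed saturating form: CTER = A * (1 - exp(B * N)) + C, with A > 0 and B < 0."""
    return A * (1.0 - math.exp(B * n_sites)) + C


def sites_needed(target_cter, A, B, C):
    """Invert the assumed form to get the number of sites for a target CTER."""
    if not (C <= target_cter < A + C):
        raise ValueError("target must lie in [C, A + C); A + C is the ceiling")
    return math.log(1.0 - (target_cter - C) / A) / B


# Hypothetical constants, for illustration only.
A, B, C = 40.0, -0.012, 0.0
n_for_10 = sites_needed(10.0, A, B, C)  # roughly 24 sites
n_for_20 = sites_needed(20.0, A, B, C)  # roughly 58 sites
```

With these illustrative constants, doubling the target rate from 10 to 20 patients per month requires more than twice as many sites, and any target at or above A+C has no solution at all.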

This established relationship on CTER not only helps us understand the operational boundary, but also quantitatively defines the marginal benefit from adding investigator sites, which in turn helps optimize the planning and execution of clinical trials.

In a recent project to assess operational feasibility for an early-phase oncology study, the author was tasked to recommend operational parameters and to forecast operational deliverables.

By depicting the relationship between number of patients and enrollment cycle time, it became obvious that 10 patients per site would help minimize enrollment cycle time:

For a 70-patient trial, we recommended that the team use seven sites. Using the method described in this article, we can establish the relationship between number of trial sites and GSER as the following:

From the equation, we calculated the baseline enrollment rate (GSER) to be 0.2456 patients per site per month, and the baseline enrollment cycle time to be 1,221 days.
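The step from GSER to enrollment cycle time is simple arithmetic, sketched below. The article does not say how months were converted to days; a 30-day month is assumed here because it reproduces the 1,221-day figure. The function name is illustrative.

```python
def enrollment_cycle_days(patients, sites, gser, days_per_month=30):
    """Estimate enrollment cycle time in days.

    patients: total patients to enroll
    sites: number of investigator sites
    gser: gross site enrollment rate, patients per site per month
    days_per_month: assumed conversion factor (30 is an assumption, not from the article)
    """
    patients_per_month = sites * gser
    months = patients / patients_per_month
    return months * days_per_month


# 70 patients, 7 sites, GSER of 0.2456 patients per site per month.
baseline_days = enrollment_cycle_days(70, 7, 0.2456)
```

Rounded to the nearest day, `baseline_days` matches the 1,221 days quoted above.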

In our work, we continue to provide specific recommendations to improve the baseline enrollment cycle time through site enrollment performance improvement, business process improvement, site design optimization, etc. By using these approaches, it becomes feasible to shorten enrollment cycle time from 1,221 days at baseline to 705 days.

There are many factors that can be used to help us understand why larger trials have lower GSER than those of smaller trials.

We established before that the enrollment performance for the pool of sites deployed in a clinical trial, as being measured by average site enrollment rate (ASER, number of patients per site per month), is impacted by the effectiveness of the site-activation process, which is measured by the site effectiveness index (SEI, 0% < SEI < 100%). With the introduction of GSER, we can use a simple formula to link all of them together:

GSER = ASER × SEI

As more sites (N) are involved in a clinical trial, operational complexity increases, which lowers SEI and, in turn, reduces GSER.
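A minimal sketch of this product, with made-up numbers, shows how a drop in SEI alone pulls GSER down even when the average performance of activated sites is unchanged. The function name and the ASER and SEI values are hypothetical.

```python
def gross_site_enrollment_rate(aser, sei):
    """GSER = ASER x SEI, with SEI expressed as a fraction strictly between 0 and 1."""
    if not 0.0 < sei < 1.0:
        raise ValueError("SEI must be strictly between 0 and 1")
    return aser * sei


# Hypothetical values: same ASER, lower SEI in the larger trial.
gser_small_trial = gross_site_enrollment_rate(aser=0.5, sei=0.8)
gser_large_trial = gross_site_enrollment_rate(aser=0.5, sei=0.6)
```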

There is another, simpler reason. While it is always difficult to find high-performing investigator sites, it becomes even more difficult when we need to identify a larger number of them. It is not surprising that the average enrollment performance for a trial with a larger number of sites will be lower than for studies that use a smaller number of sites.

Over the years, our efforts to help and support our colleagues in planning and executing clinical trials have been focusing on the following two objectives:

Level the playing field for stakeholders in clinical trial planning and execution. By doing this, we can improve the effectiveness of communication among stakeholders, and objectively reward those colleagues who achieve quantifiable improvements.

Provide actionable opportunities to improve operational deliverables, through better site selection, better process, etc.

The establishment of a reliable way to forecast enrollment rate, both at the clinical trial level (CTER) and at the site level (GSER), will greatly enhance our ability to achieve our objectives.

This is not to say that all clinical trials will and can fit in these equations perfectly. Quite the contrary; we know that most clinical studies will not be a perfect fit. But not only are we not discouraged by this fact, we claim that the “imperfect fit” is one of the most important value propositions of our method. We predict that the following factors will cause “imperfect fit”:

A targeted age group too far away from “median” age group.

One or more biochemical and/or physiological and/or genetic measure(s) too far away from the “median” measures.

Targeted disease status too far away from a “regular” patient population.

Any other inclusion/exclusion criteria making the clinical trial too “unique.”

While this is not an exhaustive list, we are happy to say that our database is comprehensive enough to explain, often quantitatively, the impact of these factors.

Gen Li, PhD, MBA, is founder and president of PhESi. He can be reached at gen.li@phesi.com.

References

1. Gen Li, Lauri Sirabella, 2010. “Planning the Right Number of Investigative Sites for a Clinical Trial.” The Monitor. 2010; 24(4): 54-58.

2. Gen Li, 2009. “Finding the Sweet Spot.” Pharmaceutical Executive. Oct. 2, 2009. Available at: http://pharmexec.findpharma.com/pharmexec/R%26D/Finding-the-Sweet-Spot/ArticleStandard/Article/detail/631734

3. Gen Li, 2009. “Site Effectiveness Index and Methods to Measure and Improve Operational Effectiveness in Clinical Trial Execution.” U.S. patent publication No. 2010-0250273.

4. Robert Gray, Gen Li, 2011. “Performance-Based Site Selection Reduces Costs and Shortens Enrollment Time.” The Monitor. 2011; 25(7): 32-36.

Appendix 1

Terms used:

CTER: Clinical trial enrollment rate, number of patients enrolled in a defined unit period of time, which is usually a month, in the duration of clinical trial enrollment period.

GSER: Gross site enrollment rate, number of patients enrolled by a single site in a defined unit period of time, which is usually a month, in the duration of clinical trial enrollment period.

N: Number of clinical trial investigator sites

e is a mathematical constant defined as the limit of (1 + 1/n)ⁿ as n approaches infinity. It is approximately equal to 2.71828.

Clinical trial level enrollment rate (CTER, number of patients enrolled per month) is expressed by the following equation to describe the relationship between CTER and clinical investigator sites (N):

In which B is a negative constant for a defined set of clinical trials (usually a single disease condition).

When N becomes very large (when a very large number of sites is used), e^(BN) approaches zero, and CTER approaches A+C. In other words, no matter how many sites are deployed in a clinical trial, the trial-level enrollment rate cannot exceed A+C. In practice, we would like to get as close as possible to A+C while using as few sites as possible (smaller N).

A+C is the upper limit for trial level enrollment rate.

Constants A, B, and C are parameters specific to a set of clinical trials belonging to a specific and single disease condition.
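The CTER equation itself did not survive in this copy of the article. One form that agrees with the properties stated in this appendix (a negative constant B, an exponential term that vanishes for large N, and an upper limit of A+C) is sketched below. It is an assumed reconstruction, not necessarily the author's exact expression, and it additionally assumes A is positive.

```latex
\[
\begin{aligned}
\mathrm{CTER}(N) &= A\left(1 - e^{BN}\right) + C, \qquad A > 0,\; B < 0 \\
\lim_{N \to \infty} \mathrm{CTER}(N) &= A + C \\
\frac{d\,\mathrm{CTER}}{dN} &= -AB\,e^{BN} > 0, \quad \text{decreasing toward } 0 \text{ as } N \text{ grows}
\end{aligned}
\]
```

Under this form, the derivative is the marginal benefit of one more site: always positive, but shrinking with every site added.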

Site level enrollment rate (GSER, number of patients enrolled per site per month) is expressed by the following equation to describe the relationship between GSER and clinical investigator sites (N):

In which b is a negative constant for a defined set of clinical trials (usually a single disease condition). When N becomes very large (when a very large number of sites is used), e^(bN) approaches zero, and GSER approaches c. That is to say, gross site enrollment rate (GSER) cannot be smaller than c. The farther we can stay from c by reducing the number of sites deployed in a clinical trial, the better the collective site enrollment performance in that trial.

c is the lower boundary for site level enrollment rate.

Constants a, b, and c are parameters specific to a set of clinical trials belonging to a specific and single disease condition.
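The GSER equation is likewise missing from this copy. A form consistent with the description above (a negative constant b, an exponential term that vanishes for large N, and a lower boundary of c) is the exponential decay sketched below. It is an assumed reconstruction that also assumes a is positive, not a confirmed copy of the author's equation.

```latex
\[
\begin{aligned}
\mathrm{GSER}(N) &= a\,e^{bN} + c, \qquad a > 0,\; b < 0 \\
\lim_{N \to \infty} \mathrm{GSER}(N) &= c \\
\frac{d\,\mathrm{GSER}}{dN} &= ab\,e^{bN} < 0, \quad \text{with magnitude shrinking as } N \text{ grows}
\end{aligned}
\]
```

Under this form, GSER falls fastest when the number of sites is small and levels off near c for large trials, which matches the pattern described in the main text.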

Appendix 2

Definition of previously patented equations:
