Ask The Analyst
Email

Print
PDF
Bookmark
Share

Tags:

STABILITY PROGRAMS FOR BIOTECHNOLOGY PRODUCTS

This article was originally published in The Gold Sheet

15 Oct 2005
News

The Gold Sheet

Executive Summary

...may require a reassessment of development and early clinical materials to understand the significance of process, product and analytical changes made later, FDA and industry experts are cautioning. At a "CMC strategy forum" on biotech product stability, participants explored the intimate web of relationships between the analytical work done in the development phase and a viable stability program on the commercial product. Producing and retaining enough material during the earlier phases is among the ways firms must plan ahead if the program objectives are going to be achieved. Challenging issues for biotech products include: - how to set reasonable stability targets in the early clinical phases - the role of stability assessments in change control - the implications of non-conforming results, and - how to incorporate new information as assays improve. [Included are presentations from the stability strategy forum on the role of stress studies in the development process, setting up a stability program in the late clinical/application phase, and post-marketing issues].

STABILITY PROGRAMS FOR BIOTECHNOLOGY PRODUCTS may require a reassessment of development and early clinical materials to understand the significance of process, product and analytical changes made later, FDA and industry experts are cautioning. At a "CMC strategy forum" on biotech product stability, participants explored the intimate web of relationships between the analytical work done in the development phase and a viable stability program on the commercial product. Producing and retaining enough material during the earlier phases is among the ways firms must plan ahead if the program objectives are going to be achieved. Challenging issues for biotech products include: ? how to set reasonable stability targets in the early clinical phases ? the role of stability assessments in change control ? the implications of non-conforming results, and ? how to incorporate new information as assays improve. [Included are presentations from the stability strategy forum on the role of stress studies in the development process, setting up a stability program in the late clinical/application phase, and post-marketing issues].

FDA And Industry Wrestle With Vagaries Of Biotech Stability

The potential importance of retaining sample material from development and early-stage clinical batch production for later evaluation is being emphasized by FDA and industry experts in exploring how to improve stability programs for biopharmaceutical products.

The value of being able to reassess early production material as processes, formulations and particularly analytical procedures evolve was a recurrent theme at a two-day "CMC strategy forum" on the "design and successful implementation of a stability program for biotechnology-derived products," held at the National Institutes of Health (NIH) in late July.

The stability meeting was part of an ongoing series of CMC strategy forums for well-characterized biological products (WCBP) being coordinated on a semiannual basis through the California Separation Science Society (CaSSS). The forums are designed to bring industry and regulators together to discuss ideas and share experiences on key issues of concern, with the goal of developing better CMC-related standards and guidance in the evolving biotech arena.

Parallel workshops focusing on viral vaccines and biotech comparability issues, respectively, were held in January in conjunction with the annual FDA/CaSSS WCBP conference in Washington D.C. At NIH the previous July, the topic was process and product impurities ("The Gold Sheet" September 2004), with the January '04 forum addressing specification setting ("The Gold Sheet" February 2004). The forum coordinating committee is publishing white papers on each of the topics based on the presentations and extensive discussions at the forums.

In coordination with the annual CaSSS WCBP conference in January 2006 in San Francisco, a CMC strategy forum will be held on the use of reference standards to support product and method specifications for biopharmaceutical products. The forum at NIH in July '06 will focus on the "changing paradigms in process validation."

As for the other forums, a series of questions was developed for the stability meeting to focus the discussions and help shape a more meaningful consensus position.

ä The discussions and presentations which preceded them on the challenges of designing and implementing a stability program were organized into the three stages of the product life-cycle: ? early clinical ? late clinical and marketing application, and ? post-marketing.

Concerns receiving particular attention for early clinical development included the basis for selecting methods and for determining expiration/retest dates. Issues generating debate for the late-stage clinical process included the handling of small-scale studies in evaluating drug substance stability, bridging study considerations when assessing manufacturing and analytical changes, and regional differences in stability testing and filing requirements. The post-marketing discussions centered on defining and handling non-conformities in stability studies and the role of stability in change control.

ä Through the course of the stability forum, participants explored the intricate web of relationships between the analytical work done during the development/early-stage clinical phases and the establishment of a stability program that will be viable through the lifecycle of the biotech product.

Producing and retaining enough product during the earlier phases was recognized as an important component in assessing the linkages and helping to resolve the issues that arise later as new information becomes available and changes occur.

In summarizing the CMC strategy forum discussions at an IIR-sponsored workshop on "stability-indicating and forced degradation assays for proteins" held in conjunction with the forum later the same week, Biologics Consulting Group Senior Consultant Nadine Ritter noted the emphasis placed on the retained samples at the forum. Ritter serves on the FDA/industry planning committee for the strategy forums and will help in preparing the white paper that will be developed for publication based on the stability forum discussions.

ä The message, she said, is to save the earlier material "even though you don't know the real-time stability of it. We can set stuff aside in development and early clinical trials and then come back and test it."

Ritter recognized that "there may be some artifacts that are the result of degradation at the state that you held it." However, she said, "those can be adjudicated with a stress study and be proven that they are actually the result of some intermediate degradation that occurred."

Pointing out that "assays do change over time" and "your measuring stick is variable early in development," Ritter stressed that "by retaining some of those materials, you have the ability to go back and look at them with tighter methods if that is what you want to correlate to."

Another participant at the workshop commented that preserving samples that had been placed on stability, particularly in a real-time protocol, can also be valuable.

"If you can demonstrate that your product is stable at, say, -70 or -20 ^o," he said, "then you might want to store those samples for retains if you want to assess whether any future methods might pick up different impurities and so forth, so you have actually some additional real time data - not just the retains for clinical trial materials or development tox studies, but also maybe even [from] stability studies, whether accelerated or real time or frozen."

ä At the CMC forum, this sample-freezing approach was also recommended as a viable one.

"Early on, we don't have enough experience with the methods. The methods are changing and you get a lot of scattered data," commented Rinat Neuroscience scientist Roberto Rodriquez. "One practice that we have in development," he said, "is to freeze all the samples at different time points, retain the samples, and then analyze them all at once with the methods that we think are stability indicating."

Human Genome Sciences Pharmaceutical Sciences Group Executive Director Thomas Spitznagel concurred that the approach of freezing retained samples has proved valuable for his company as well, particularly during the development stage. "Even in the GMP studies, we like to have a little left over, freeze it down, and where issues come up, where it is hard to

interpret what a trend is, there have been cases where we have gone back…on the development side to take a look at" the question of whether it is "a real trend versus just noise in the data. And it has been useful."

During his presentation on "strategies for developing a comprehensive stability program for early-stage clinical products," Spitznagel pointed to improved sample handling as one of the main benefits of doing freeze/thaw stress studies (see box, pp. 4-6). "This is not necessarily for your stability-indicating nature aspect, but it is really going to help you with sample handling," he said.

Human Genome Sciences likes to do a variety of conditions and temperatures and to track by sizing as well as SDS-PAGE. The testing is informative if the bulk is going to be frozen. "But most importantly," Spitznagel emphasized, "this allows you to freeze your samples, [which] can really help dampen the noise of your assay variability."

Sample handling, he continued, "ends up being very critical in early development." In turn, "being able to freeze samples gives you some flexibility in the inevitable assay downtime. And finally, it also allows you to have meaningful retains. Early on your assays may not be fully developed. It is nice to have samples left over from your earlier stability studies when a new assay comes up. You can actually generate several years of data if you have freeze/thaw availability."

ä At the follow-up workshop, Lilly R&D advisor Jerry Lewis commented that smaller biotech firms, which often have less material to work with and less clinical delivery flexibility, may have difficulty in developing the more extended sample retention and evaluation procedures.

The smaller firms "usually have just enough to do their clinical trial," Lewis pointed out. Under trial exigencies, the material "may be shipped to the clinical site before it is officially released [from] quarantine, and it goes into the patients immediately. You may have just enough to put into your patients, to have a little bit left aside to do some formulation studies, and the process development folks are kind of left hanging because they don't have any material to play with at all." Noting that this scenario is "what most companies are facing," Lewis suggested that it is not one in which there are retains readily available or sufficient flexibility to vary the age of material going into clinical trials.

ä Ritter responded that "fighting over the amount of material that can be made early in development or who gets it" is also typical at the big biotech companies. Having the amount needed for the CMC evaluation, she stressed, is a critical issue.

"Certainly it is always an issue to be able to have enough material" to not only put in patients, but also "to get the information you need to make the right decision about the product," Ritter said. "Whenever I get in these arguments myself with project teams, I tell them it is not useful if all you can do is get enough of it to be able to get the clinical data, because without the CMC data, without the stability, it is not useful, and in fact it is possibly even dangerous because you don't know enough about it. So you have to fight those fights."

In terms of retains, Ritter added, "one of the things that stability people sometimes do that makes them either heroes or villains, depending on what side of the equation you are on, is that when they build their stability allocation, they put in overages to…account for mistakes." In turn, "sometimes these overage amounts at the end of the program become pots of gold there where people can come and get their retains if they haven't done it any place else."

Improved Assays Create Problems and Solutions

At the CMC strategy forum, the value of retained samples from earlier development and clinical phases was noted at several junctures during the extended discussions that took place on how to interpret and compare new information on degradation pathways flowing from process, product and analytical method changes.

ä The point was made, for example, in the context of the more general concern of assessing the value of new or improved assays and the additional information and issues about the product that result from their application.

Addressing this concern further at the follow-up IIR workshop, Lilly's Lewis commented that new and improved assays developed after the clinical trails have progressed may reveal changes in the impurity profile. The question is then raised whether these impurities "existed all along" or "occurred during storage. Was it in the clinical samples and what relevance does it have because your clinical trials came out okay? So you end up with a conundrum:...If you create new analytical assays and start looking for stuff, you are going to find it. But if you find it, is it clinically relevant?"

Industry Experts And Guidance On The Role Of Stress Studies

The importance of stress studies in the development of a stable product and successful stability program was highlighted by several participants at the CMC strategy forum in July. Human Genome Sciences Pharmaceutical Sciences Group Exec. Dir. Thomas Spitznagel and Genentech Early Stage Formulation Scientist Mary Cromwell provided the following discussion of their role and the related guidance as part of their presentations at the forum.

SPITZNAGEL: Stress studies are probably the most critical aspect of a good development program. Why do we do stress studies? It really is a critical component of formulation development because it helps you identify how susceptible your product is to degradation. It also helps set sample handling and shipping needs for a stability program, and it certainly provides evidence of [the] stability-indicating nature of analytical methods. That is probably the most key point and that will come into play when we do our assay qualification.

Let's go through some examples of the stress studies that we typically perform early in development. The first one is a shaking study. We simply take our finished drug product in a liquid state, place it on its side, and shake it as fast as you can - typically over several time points, typically at room temperature.

This is an SEC chromatogram that shows a variety of time points. You can see the aggregate peak goes up slowly but steadily. In a very short period of time, you have shown two things: Number one, your product can aggregate - that is certainly a good thing to know; but number two, you know your method is capable of resolving those two species and that this is likely to be a good stability-indicating method. Sizing is typically the only thing we do with shaking, although there are other assays that can be used. And in addition to the assay, you can also find issues about fill/finish, shipping and mixing. It can also help you screen excipients such as surfactants.

Another stress study we like to perform is oxidation. Here we use a peroxide. We have also used t-butyl peroxide for surface exposed residues - typically around .03%, although this will be specific to your product of choice. We look over about 24 hours. Here we normally run reverse-phase primarily, although ion exchange can also pick up oxidation from time to time. Like sizing, it helps you identify the region of the chromatogram that is going to change during stability. These studies don't actually tell you what the peak is. It is a probability perspective here.

This is actually a reduced monoclonal antibody reverse-phase method. You can see the heavy chain here. Overexposure to peroxide forms a singly, doubly, triply (you can not really see the quadruply) oxidized species. Once again you don't know that without the MS data attached to it. But just doing the study itself tells you, 'hey, this method is likely to pick up changes that are induced by oxidation.' Certainly it is a useful tool to give you samples for your qualification study.

Deamidation is the other classic chemical degradant for proteins. Here we typically look at pH 9.0 in bicarbonate buffer. Sometimes you have to back off on the pH a little bit - typically over 24-hour time points. We like to follow this by both ion exchange as well as the isoquant kit. It is a commercially available kit. The reason we like this is that it measures essentially the isoaspartic acid content. It is not exclusive to deamidation. But if you do do a forced degradation study, one of the challenges you always have is, 'did it actually happen?' If your chromatogram does not change, the isoquant kit will in fact tell you whether or not deamidation occurred. With the oxidation you tend to have to run a peptide map or something like that.

Similar to the reverse-phase in oxidation, you basically are identifying the region of your chromatogram that will change. Once again, here is a 24-hour time course of a monoclonal antibody where the acidic species increases over time which ends up being singly deamidated product. Then you can see that over a further time course you can get even a multiply deamidated product. So once again, a relatively simple experiment takes a couple of days to execute and analyze and you know that this method is capable of being stability indicating.

The last example of a stress study that I want to go through is a freeze/thaw. This is not necessarily for your stability-indicating nature aspect, but it is really going to help you with sample handling. We like to do a variety of conditions and temperatures. Once again we track primarily by sizing as well as SDS-PAGE. Really what this helps you do: Number one, if you are going to freeze your bulk, it is nice to know this. But most importantly, this allows you to freeze your samples. Freezing samples can really help dampen the noise of your assay variability. Sample handling ends up being very critical in early development. Being able to freeze samples gives you some flexibility in the inevitable assay downtime. And finally, it also allows you to have meaningful retains. Early on your assays may not be fully developed. It is nice to have samples left over from your earlier stability studies when a new assay comes up. You can actually generate several years of data if you have freeze/thaw availability.

Be careful with what conditions you look at. At -80 to 5^o, which is the typical bulk freeze/thaw, we saw a very marginal increase in aggregate levels. Where we actually saw the biggest example was actually going from -80 to -20^o. This is the formulation that did not have a cryoprotectant and had salt. You have essentially a PBS-type solution. And what is happening here is you detect it by the sodium chloride that causes freeze/thaw denaturation just above -20^o. So as you go through that cycling period you can actually get a fair amount of aggregation....This tells you [that] if you have a frozen - 20^o product, don't ship it on dry ice because it will go through that transition over and over.

Some other stress studies that I will mention here but won't go into detail on: pH and heat. These are both classical experiments, critical to doing formulation development. Actually, we find a little bit less utility in them when it comes to generating stability-indicating data because so many pathways happen with exposure to these conditions. Nevertheless they are still useful.

Light we typically don't perform in early clinical development for a variety of reasons. Mainly, the package protects the primary configuration from light generally.

Some conclusions from the stress studies: Hopefully, you have seen that certainly you verify that the assay detects the typical degradations. It will demonstrate your selectivity, which can certainly be used in qualification. One thing that is very important here is it will show the propensity of the molecule to degrade. Once again, going back to the risk-based assessment, [by looking at] more products, especially of the same class, you will be able to take your stress stability data and apply it into your database and see where on the pathway your product is likely to fall out of stability. And finally it helps you establish appropriate handling conditions.

CROMWELL: Stress studies are mentioned in several ICH guidances as I think has been highlighted throughout this meeting so far.

The goal of stress studies is to generate degraded product. The use of that is really to look at your analytical methods - to make sure you can pick up potential degradations that you are seeing, to figure out what degradation products are created so that if you have an extreme exposure in pH, what does that do to your protein? And finally, to determine whether those specific degradation products are process-related impurities or are they simply variants. To do this, we look at extremes of pH, oxidation, thermal stress and intense light, among others. So it is not limited to these that I have pointed out so far.

I am just going to walk through the guidances and highlight where it talks about stress studies....If you go to Q1A revision 2 section 2.1.2, it talks about stress testing. If you read into it, it says you should do it - include the effects of temperatures, humidity, which would be important for lyophilized products, look at a wide range of pH and do photostability testing.

Q2B: Validation of your analytical methods. Have stress samples for specificity purposes: light; heat; humidity; acid/base hydrolysis and oxidation.

Q1B is probably, in my opinion, one of the most detailed guidances that there is because it actually tells you what you need to do, which is refreshing. And on this, it takes you through a very clear decision tree. You start. You expose your product to light. Do you see a change? If you do see a change, is it acceptable? No? You put your product into its immediate packaging, which would be the vial with the label. Is it still seeing an acceptable change? No? Put it in the box. Do you still see a change? If you do see a change that is not acceptable, go back and start all over again with a redesign. Either you change your formulation [or] you change your packaging, so that you can eventually get over here to the 'end the test' because you do have an acceptable change.

Q5C, which is the biotech products guidance, also mentions stress conditions. Here it really goes into accidental exposures to conditions other than those proposed, such as those during shipping.

To highlight what is really covered in these, I think you have to look at the selection of stress conditions. It tells you that you need exposures to different pHs, you need light exposure, you need thermal exposure, you need oxidation. And what I would propose is that you not only do those, but you make them somewhat relevant to your drug substance production or what your drug substance will likely see.

The example here, with the pH extremes, is you can expose it to low pH, go down to pH 2, you can go up to pH 12, but you are probably going to see some very different things happening at those pHs, such as protein unfolding, that you would never, ever see during your manufacturing. In most processes, and here I will limit it to monoclonal antibodies, you are not going to go that far down where you would see protein unfolding. If you use the extremes for your manufacturing process, then you have somewhat of a basis to say, 'OK, these are real degradation products that I am likely to see if things go awry.'

Thermal exposure: You can get some information for that from your pharmaceutical development. For some proteins, going up to 50° is perfectly acceptable and you can do thermal exposure up there to get information. For a lot of proteins you will start unfolding and so 50° is not acceptable and you have to back off and find out a suitable temperature to do the stress condition that is not really considered accelerated stability. So you are walking somewhat of a fine line....Thermal exposure can be lack of heat as well. What if you freeze your sample? Do you see issues on freeze/thaw stability where you are generating aggregates [or] something like that?

Oxidation: You are probably going to need to experimentally determine what conditions it takes to oxidize your protein. We have had one case at Genentech where we have had a protein that we cannot oxidize to save our lives. We expose it to extreme levels of hydrogen peroxide, we see absolutely no changes on the peptide map. So the question actually became, well what do we do to the protein to make it oxidize? Do we keep throwing in more peroxide? Do we unfold the protein, make it oxidize so that we can show that our peptide map would pick it up if it were completely unfolded? I think, again, you could do that, but is it really relevant? Unfolded protein is not going to happen. If you do see unfolded protein, you are going to have other changes that you are picking up.

Since you can't go back to the point at which the material was being made for the clinic, he said, "what ends up happening is you have a lot of internal fights" about what to do with this information.

ä CDER Monoclonal Antibodies Division biologist Joseph Kutza echoed the perspective expressed by his FDA colleagues during the CMC strategy forum that the problems created by the new assay information are more than offset by its value.

"Long term, my experience has been that the new assays, although they might cause some problem at first, end up giving you really good information," Kutza stated. "So I wouldn't hold back in developing new assays just to stay with the status quo."

Improving from a 10X to a 100X microscope will allow you to see more things, Ritter added. However, if these are "characteristics of the product that you expect, then there should be nothing wrong with that."

On the other hand, she recognized the challenge in trying to "dissect apart whether or not what you are seeing was actually present at the time it was in the clinic." She noted having seen study designs "where folks have to go back and try to determine what was the state of the product then." For "product-related impurities, the degradants that have occurred, there are studies you can do to force the stuff and prove that what you are seeing occurred over time as a degradation pathway as opposed to something that came through the process."

While challenges can present themselves, Ritter concurred with the FDA view that "it has never been a bad thing to get a more sensitive, more specific assay and then go back and sort of justify what you had back then." In this context, the retained samples become key. "With the retains you can have a case to build that it was or wasn't degraded, but without the retains it is all hypothetical," she said.

Assays Can Be Added And Subtracted

Among the concerns that received significant attention at the CMC forum was deciding how much of the assay work during development needed to be built into the early-stage or ongoing stability program and specifications and, in turn, how to choose which new assay tools to incorporate.

ä Specifically, the participants were asked when selecting stability-indicating methods for early-stage stability studies if they "include all assays that show degradation during stress studies, even if they show no change at the intended storage condition," and if there are criteria for either removing or adding these assays later in development.

Shire Pharmaceuticals (formerly Transkaryotic Therapies) Pharmaceutical & Analytical Development Senior Director Zahra Shahrokh commented that "since the purpose of early studies is really getting information, I think it is safer to err on the side of collecting more data early on and then deciding which ones you want to drop later rather than not collecting because you don't see a change in real time. And then again, you don't have a lot of real time data at that point."

Recognizing that new peaks will emerge as assays are improved, Shahrokh echoed other forum participants in stressing that "the key is having retained samples from tox and clinical material" to indicate if these are new species. She added that, in her experience, it is a frequent occurrence "that your assay could change and you could start seeing new things come out."

Genentech QC Clinical Development Director Wassim Nashabeh agreed that "bringing in new technology that shows us more information" should not be a big concern, "because especially throughout clinical development, we are at the stage where we are gathering information."

The purpose of stability testing at this stage, he pointed out, "is not to show no change, it is to manage the change in correspondence with the clinical program. So if we are two years into the program and now we have new tools that show us that we have something that we didn't see before, that is fine. The material has been at the clinics being tested. We have the data that correlates there. So I think we should encourage looking for tools that are sensitive [enough] to tell us what is going on with the product and shouldn't be too concerned about how will we deal with that information."

ä CDER Division of Therapeutic Proteins Biochemistry Lab Chief Emily Shacter commented that "any assay that shows change in your product over time should be part of your stability protocol," adding that "there wouldn't really be any reason for leaving it out. Part of the point is to learn to see how that changed product might be impacting your clinical efficacy or safety, and not to test it would not be acceptable actually I think from the agency perspective."

FDA, Shacter affirmed, "certainly would not resist having any new clinical assay added. And I suppose there could also be circumstances where if you saw a certain change that was not in the tox lot, if you had and you tested your retain materials, there could be situations where there would be a requirement to…go back and do some tox studies to make sure there is nothing untoward in there that would pose a real safety risk."

Commenting on the issue of adding new methodologies to the stability program over time, MedImmune Analytical Biochemistry Senior Director Mark Schenerman agreed with Shacter and Nashabeh that "it is always beneficial to be looking at new technologies, especially technologies that are information rich that could give you more data about your molecule than the legacy methods."

However, he cautioned about the need to also be "responsible about understanding what the new technology data is saying." So that is why you have to collect data in parallel in stability studies and understand what the methods are telling you." He added that "characterization is an important part of that. Characterizing your molecule and your degradants early can give you a lot of confidence that the new technology is really something meaningful."

ä In a dialogue later during the pre-marketing forum session on changing methods, Amgen Corporate Quality Compliance Associate Director Heather Simmerman injected a cautionary note, recommending that firms carefully evaluate the implications when choices are made on new approaches.

"I think we can over-engineer the methods and I think we have to ask again what is really needed to control the product," she said. "Just because you have a new assay or you have an assay that can resolve things" isn't definitive. "It really may be additional cost without an added value in terms of the impact to the product safety or efficacy."

In this regard, Simmerman suggested "a couple of things to keep in mind" when making the choices: A more precise assay will affect not just the specification range but also what is deemed significant in the data. Also with a tighter precision, "your system suitability ought to change in concert with that" which has a cost component. "You may simply be rejecting more assays without really positively impacting the quality of the product that is out there, and that is really not to anyone's benefit either."

A further issue is interpreting the meaning of the findings. More impurities may be seen, Simmerman said, but "are they relevant to the safety or the efficacy of the product? If not, then these improved methods certainly can be used for investigation purposes. And if you subsequently find out they are relevant, you can always add them. That would be your justification. But I think just because you can come up with something or a new technology, it doesn't mean that you should rush to implement it from a specification or a control standpoint."

ä In summarizing the early stage discussions, Ritter supported the general point that firms should not be "afraid to collect the data during development so that you can provide for yourself and your reviewer and adequate amount of information on which to make sound decisions" about which assays add value and should "go forward and which assays should not."

If you have multiple assays that test the same degradation pathway and they don't provide any other orthogonal information on a different parameter, "then go with the one that is the most sensitive, because it is the one that is going to be the best for the stability indication," Ritter commented.

Lilly's Lewis also concurred with the viewpoint that characterization assays and assessment of "what your molecule might be doing under different situations" is important to carry through and "collect a body of knowledge about your product. And then if you have that information and you share that with the agency, you have a better opportunity to reduce what you might consider to be redundant assays and keep your sensitive assays in place."

Lewis stressed the value of carrying the characterization assays "all the way through up to licensure and even beyond in some cases with regard to comparability. Even if you don't have them under the stability protocol, make sure that those assays are available" later to evaluate changes to the process or container/closure.

Role of Potency Assays Debated

The role of potency assays in the stability evaluations was a particular focus of attention at the forum.

During the late clinical phase discussions, Shacter commented that "whether or not your cell-based potency assay is stability indicating or not, it should be a part of your stability protocol. There isn't really any case where we would not want to see the potency assay done at every time point." The potency assays "are not always stability indicating, but they are one of the bottom-line parameters of a product. We need to know how anything that you see might or might not be impacting potency."

In support of Shacter's point, Genentech's Nashabeh noted that depending on the development or clinical stage, "you might not have enough confidence to determine whether something affects potency or not because you are evolving in your selection of your potency tests, and I think you need to monitor anything that changes until you gather that body of data prior to going to commercial."

ä Addressing the incorporation of stress studies, Nashabeh clarified that Genentech does not view oxidation as "one of the standard tests that is on a stability protocol or release, but it can be added if the product has a certain susceptibility to oxidation based on the data we gather."

Diosynth Biotechnology Customer Project Management Director Siddharth Advant affirmed that stress stability studies "are very important" in giving information about the molecule. He queried the audience whether they repeated the stress studies as multiple lots are made through clinical development - for example, on changing the scale.

"Ultimately you want to go after real-time stability data," Advant said. "But if you start looking at some of these profiles from stress stability studies, especially temperature profiles, and if you see that multiple lots made at different times show you the same rates of degradation for example, that could give you maybe some level of confidence that at a year or year and a half, you probably are going to see the same degradation profile in your real time."

Nashabeh concurred that "especially when you go through process changes this may be one of your comparability parameters that you look at because you are not going to wait a few years to get real-time data." These stress studies, he pointed out, "are not perfect. There are a lot of unknowns with them, and the rates may not be as predictable as what you would get in the real-time studies. But it is one tool that you use as long as you use it in context."

ä Ritter added to Nashabeh's point by noting that the value of the forced degradation "goes side by side in a comparability protocol….If these things are truly comparable they should degrade with the same kinetics and make the same degradants under the same physical or chemical stresses."

She has seen "a lot of examples where people utilize that and then gather that data at the end of their product development lifecycle and present it as a nice package to say 'not only does this demonstrate comparability from change to change to change, but over time it also supports the stability-indicating methods. It supports the degradation pathways that we have mapped out. It supports the key degradants of the product.' And it makes a really nice story when you have got it at the end."

Division of Therapeutic Proteins Deputy Director Barry Cherney pointed out that the concept of using stress studies in assessing comparability is incorporated into the ICH biotech comparability document Q5E.

ä A concern was brought up at this point by Amgen's Simmerman about "the use of the concept of comparability or comparability protocols for the early development work." The concept, she said, is really intended to apply to the product in its more mature, commercial form.

During the development cycle, Simmerman stressed, "we intend to make changes and frankly we don't want things to look the same. We want to be making them better." In turn, having protocols with prospective acceptance criteria, she cautioned, is not "entirely appropriate for the early development phases." While firms want to do comparisons and determine differences at this stage, "we need to expect the differences and in fact look for those improvements…. But the use of comparability, comparability protocols, the notion of acceptance criteria that in fact they are the same, I don't think that is really what we are intending here at the development stages."

Lilly Principle Regulatory Scientist John Dougherty agreed that a comparability protocol is primarily a post-approval regulatory instrument, suggesting that terminology such as "bridging studies" is more applicable in development.

Simmerman reiterated her concern that "the clarification of the terminology and the expectations is very important, because otherwise it is looking to me like we are setting a new standard and expectation in clinical development, and I don't think it quite works….The terminology for clinical development does need to be differentiated from the post-approval stage."

She noted that Amgen has "made some pretty dramatic changes in clinical development cell lines, cell sources, all these kinds of things, and they look very different from a profile perspective. They look very different from a stability perspective as you expect. Yes there was a bridging study, yes there were comparisons made, but it wasn't framed as a comparability. The verbiage I think that was used was, 'this was equivalent and suitable to go forward with.'"

Office of New Drug Chemistry Team Leader Stephen Moore noted FDA's use of the term "linked" in relationship with toxicology or early clinical trial material.

On the other hand, fellow CDER official Cherney pointed out that Q5E, while recognizing the concerns Simmerman raised, "does talk about comparability during development." He reassured participants that the agency understands the nuances of the comparability concept and its different application in development and after pivotal trials.

Determining Safety/Efficacy Impact Is The Holy Grail

Genentech QC official Ruzica Djerki reframed the discussion of early-stage concerns into two components: how stability is monitored, and then how the information should be interpreted.

The monitoring, she said, is relatively straightforward. "Basically we all do the maximum we can," with the main reliance on concurrent testing. On the other hand, "the real challenge is…when we see the changes, how and what are the tolerances there - what are the acceptances for changes that we see at that early stage.

The struggle, Djerki said, is around "what we are sending to our clinic. Should we send the worst materials so that we have more space there? How we can be sure that all our assays are covering all these things, that we are on the safe side and that certain things are tolerable - that is much bigger question for me." It is not so much "how we do the monitoring, but when we see the changes in monitoring, how we are addressing those."

The quandaries encountered when attempting to correlate stability and other CMC development information to clinical safety and efficacy have been receiving significant attention at FDA/industry conferences over the past few years ("The Gold Sheet" February 2004), and the issues were wrestled with again at the stability strategy forum and the follow-up workshop.

Reviewing the forum discussions on the issue of clinical relevance at the workshop, Ritter commented that "trying to link the product attributes, especially a stability attribute, to whether it is going to remain safe and efficacious in the clinic…is of course the Holy Grail of development."

Stability Study Design Considerations For Late-Phase Development Programs

The following overview of late-phase stability study design considerations was presented by CDER Division of Monoclonal Antibodies Biologist Michelle Frazier-Jessen at the recent CMC Forum held at NIH. Along with general requirements, she discussed reduced stability study designs, accelerated and stress studies, and bridging studies for changes in manufacturing, formulation, container closures, and analytical assays.

I don't know how many of you have looked at the draft guidance for ICH Q8 - it is on pharmaceutical development. I like the document. I think that it has a lot interesting points in it that I look at when I am thinking about a product as it is going through development stages. Really, the aim of any pharmaceutical development program is to deliver a good quality product and to design a manufacturing process to deliver that product in a reproducible manner. Any information that you gain along the way really helps to support what you know about the product and to be able to control manufacturing process and to set specifications.

The way that I am thinking about this is more along the lines of late-phase development. I am thinking late Phase 2, pre-Phase 3 to the submission of the BLA. I am just going to give you a brief outline. I am going to talk about: ? general requirements ? some interpretations of what is meant by representative ? reduced stability study designs ? accelerated and stressed stability studies, and ? bridging studies.

When I think about the late phase program, I am really thinking at this point [about] a product. There should be enough information that has been generated in the early studies to demonstrate how the product is going to degrade, what pathways are going to be utilized. That should be fairly well understood at this point in time. I mean, certainly you can never know everything, but you should have a good idea of how your product is going to degrade under its proposed storage conditions and packaging, etc.

You should have a set of analytical assays that provide a stability-indicating profile. I feel that potency needs to be part of that profile. A potency assay may not be the most sensitive stability-indicating assay that you have in your little package, but it should be able to be stability-indicating. If it is not I think that is sort of a problem, in my mind.

Additionally, you should be well into your stability program that you are going to use to support the licensure of your product. That should be ongoing. You also might be making some changes to your product to prepare it for licensure that might alter how you do your stability program. You might be scaling up. You might be making some formulation changes or container closure changes, things like that. But you should be collecting data on all of these things at this point in time for stability.

General Requirements

Looking at the general stability requirements [in ICH Q1A and Q5C] for drug substances and drug product, you obviously need real time and you need to have some accelerated. That would help support excursions that you might have for shipping or storage or any kind of things like that. And you need at least three batches. They can be full scale or pilot scale. This is what the guidance says.

Pilot scale needs to be representative of the manufacturing process. If for some reason, and it says this in the guidance, you were to use pilot scale, you could use pilot scale for a license application with a commitment to place your first three full-scale batches into long-term stability program after approval. I don't know that that is necessarily the practice…for the obvious reason of the risk that entails. I have not seen it but others might have.

You need to have defined the storage conditions that you are operating under: The relative thermal stability of your product. You should be evaluating the sensitivity to moisture, especially if it is a lyophilized product. This should be done in a representative container closure system.

I think you need to do some work with your intermediates as far as stability goes. You need to identify them. You need to generate some in-house data on them and the process limits that you are going to operate around. Although the guidances say…it might be possible with justification to use representative scale or pilot scale, you really should be establishing any kind of stability specifications for intermediates at the validated manufacturing scale process for obvious reasons. This needs to be real-time, real-condition data and will likely require more than one batch.

What is Representative?

I think we should get into the term 'representative.' I think…there are some vagaries here [on] how 'representative' is interpreted: I think you can talk about small scale and you can talk about pilot scale.

For small scale: If you are using small scale for drug substance stability, if you look at Q5C, you can use reduced size containers. That might be acceptable for storage. But the drug substance that you use should be stored in containers and the containers need to properly represent the actual holding containers used during manufacture. So you need to justify that those containers are properly representative.

With regard to pilot scale, as we had mentioned earlier you can use a pilot scale batch with proper justification. Obviously it needs to be fully representative of, and simulating that, to be applied to the full production scale batch. It does not really define a number or anything like that. It is interesting because the EMEA has further defined this and said that it needs to be at least 10% of the production scale batch. But it is not written that way in Q1, at least not to my knowledge, though maybe it has changed now - I don't know for sure. The additional thing with pilot scale that people need to remember is that it needs to be operating under current GMPs. I think people sometimes think 'pilot scale' and they forget it still needs to be under current GMPs...

Reduced Stability Study Designs

With regard to reduced stability study designs: Certainly at this point in time, while you are doing your development, you might be considering different dosage forms or dosage sizes and things like that. It is possible to apply some reduced study designs. It is usually for drug products, but that might not be acceptable if you have a very complex drug delivery system, so that might not be the appropriate thing to do. As I said, it is going to require justification. You are going to have to have supporting data. The supporting data are going to have to show that you don't have too much variability and the product is pretty stable.

Two common methods that are used are bracketing and matrixing. Bracketing is really a design of a schedule such that you only test extremes of certain design factors. This is pretty common… if you have different strengths. I will show you an example in a minute. It is generally not applicable for drug substances. It is mainly used for drug products.

[This] is taken from Q1D. You have 50 mg, 75 mg and a 100 mg. There are three different batches and you are just testing the extremes - the 15 ml size and the 500 ml size. The important thing to remember about this is that if you should decide to drop out one of those extremes, for example, like the container size or one of the dosages - you are not going to go through licensure with that, you decide not to carry it on - if you are doing your bracketing study and you have set that up you are still going to have to carry that bracketing study through the licensure as a post-marketing commitment so that we have that data, because that is how the study was initially set up. I think that it is a good idea that if these are the kinds of things you are going to be doing and you are going to be submitting to the agency, that when you have a pre-BLA meeting you bring these things up and make sure that what you are doing is going to be okay with the review group.

Matrixing is a little bit different. You are looking at a selected subset of the total number of possible samples. It is okay if your supporting data indicate that you don't have a lot of variability in your product. The more variability you have, the more justification you are going to need to be able to do a design like this. And it is really not very useful for drug substances.

This is an example also taken from Q1D. You can see this is a one-half reduction, so you are only testing about half of the time points. It is a matrix. You are doing a little bit here and a little bit there. That might be applicable. And once again, if you are going to do this, you need during your pre-BLA meeting - which I encourage everyone to have - you should be discussing these things and seeing if they are going to be acceptable.

Accelerated and Stress Stability Studies

I saw a lot of questions before [at the CMC strategy forum] that were put out regarding the accelerated and stress stability studies and what they are and when do they need to be done or why should they be done. So I thought it would be useful to talk about this a little bit more.

Accelerated testing: It is really studies that are designed to increase the rate of chemical degradation by using exaggerated storage conditions. Stress testing is a little bit more than that. You are really trying to elucidate the intrinsic ability of the product to degrade and it is really carried out under much more severe conditions than you would use for accelerated testing. With accelerated testing, obviously I think most people think of accelerated temperatures and things like that. Stress testing can run the gamut of agitation, pH extremes, oxidation, things like that. And for drug product, if you are going to do stress testing, another one that is very common is to look at photostability.

For stressing your samples: This is really quite useful to look at different degradation pathways, and it can help you to validate different methods. I guess probably an example that I think of most commonly is…if you want to change an analytical assay out, how do you know that that assay is going to be an improvement over the initial one? One way to go about looking at that is doing some stress stability studies and determining if your new method is at least as sensitive or maybe more sensitive to picking up these stress conditions, the degradation of the product.

According to Q1A, it may not be necessary to do some of these. I think what it really depends upon is how well you understand your product and how it degrades and the justification of what you need to do based upon that knowledge. So the more knowledge you have about how your product degrades - which I would think at this point in time of development, you would know that - then that will define the kinds of stress studies that you would do.

Bridging Studies

Then bridging. Oftentimes at this point in time a lot of sponsors are making a lot of changes to be able to bring their product to licensure. They can be manufacturing changes to scale up, to remove impurities, things like that, formulation changes, changes in the container closure system or changes as I just mentioned in analytical method or methods.

Why would you make changes in manufacturing? Well, you might be improving your manufacturing process. You might be increasing the scale, which is extremely common right before Phase 3, and you want to improve your product stability because you have gained more experience with the product and you understand how to keep it more stable. Or you might have to comply with changes in regulatory requirements.

For changes in manufacturing, certain changes, even though they might seem slight, have the potential to really alter the stability of a product. Certainly any change that can alter your protein structure or purity and impurity profiles really needs to be evaluated - not just on real-time, but under accelerated and stress conditions. And not only just your drug substance and your drug product but also your intermediates. I think people often forget that and sometimes that is where you have problems.

Stability studies that are undertaken can also be really helpful if you have a subtle difference and it just may not be detectible by the characterization studies that you have. So certainly, long-term stability studies and even accelerated or stressed under these conditions, comparing the two different processes, you might pick something up. One of the common ones that comes to mind...[are] proteases and divalent ions. But certainly, even a slight alteration in a protease level that may not necessarily be picked up because of the sensitivity of your assay, or it is so small it just does not seem like much, might actually really cause a difference in the degradation profile or the pattern or just the amount of degradation that you get. You may not pick it up unless you have done the real-time studies and some of the accelerated studies. I have certainly seen that happen where we have had a change in manufacturing process and very subtle differences in the purity levels - and we are still monitoring this product, obviously, because it is fairly early on - but you do see a difference in the impurity patterns. So it does happen.

Formulation is another one. Usually I think, by this time, most people…start out with an initial formulation and during development, you might make some changes, certainly to improve the stability. But at later stages, it might also be because you might be wanting to use a new administration route or delivery for the product. I think everybody knows that not all formulations are created equal. Every product is different and what works for one product might not work for the other.

This is just a good example of what happens. This is accelerated stability of a monoclonal antibody. This is an AUC data and you can see that the buffers,...not all of them are able to keep the product stable. If you are going to be changing your buffers you need to not only evaluate them under the normal storage conditions but also under stress conditions because you might have to do more work with regard to that.

I can think of a recent example that I just had. I had a sponsor that has a monoclonal antibody under IND, and they decided to go after a new clinical indication and this clinical indication required a different administration route. That administration route was not compatible with the current formulation scheme. So they re-formulated their monoclonal antibody into a phosphate-buffered saline buffer and initiated preclinical studies to support this new route. Then because it was going to be at a couple of different clinics, they decided the best thing to do would be to store it at -20.

I don't know how many people know much about phosphate-buffered saline, or phosphate buffers and protein products, but PBS-based buffers tend to form protein adducts or you get pH gradients. So this product dramatically degraded upon storage. They were using it and not realizing it for their preclinical studies. It turned out I think that they were okay because the very high concentration that they had actually was fairly stable, but the other two concentrations degraded dramatically, and they did not run their stability studies until after they had finished their pre-clinical studies. This could have been a really big ouch. I think it was a little bit of an ouch, but certainly was very painful.

The other big thing I think that a lot of sponsors do at this point in time is changing their container closure system. Maybe they have been in a vial and they have decided they want to go to a pre-filled syringe. I think that that is becoming much more common, that choice or that pathway. Certainly a new administration route might be a reason to do that. Certainly if you are going to something like a pre-filled syringe format, that is much easier for the physician and the patient to use so that would be of benefit. You might also have to change your container closure system because what you were using before is not available or you have decided you want to make a couple different versions - not just a 100 ml vial but maybe a 150 ml vial as well.

These are the kinds of studies that tend to lend themselves well to the bracketing studies. So it is important to remember with container closure systems that they are suitable for the product that you are using, and they are suitable for the intended storage and transportation, and that you have evaluated the potential interaction between all the contact areas. Extractables and leachates are things that you need to be looking at with these kinds of things, especially for the pre-filled syringes.

Q1A says you need to have stability data in the to-be-marketed format or something that looks, that is representative - once again the 'representative' word. I think that for some of these novel or newer container closure systems that we don't have as much experience with, to me that means that it needs to be in that container closure system. So it would be hard to go from a vial and say that that was an acceptable bridging. You really need to have some data in your container closure system. And you should probably have some clinical experience with that as well.

One of the big things that I certainly see happening a lot is the change in an analytical assay. You are replacing an existing method with hopefully an improved method. That would be the reason. This is extremely common for monoclonal antibodies with regards to the potency assay at this stage. A lot of times what sponsors come in with in the initial IND is...just a binding assay for potency. While that is allowed, sponsors are reminded that that might not be acceptable for licensure. Usually during the course of development, at this point in time, a new potency assay that is representative of the biological mechanism of action hopefully is coming on board. And that assay will need to be transitioned into the stability study, and it should be stability-indicating to some degree.

Certainly this is not all of the data that might be required to support, but just an example of what you might need to do with regard to an analytical assay that you are changing out, for example for potency. You want to look at not just the normal conditions, but you want to look at some stressed and accelerated data and compare the assays and see which assay really performs better, is more sensitive to the different degradation pathways as you understand them and is not as variable. And you want to compare this data in the context of your other known stability-indicating assays. So there should be some kind of a correlation to get an idea of what is going on. That would be a good reason to change out to a new assay.

The big issue really is developing a quality product and really understanding your manufacturing process and how your product degrades. And understanding really the big picture of 'this is the indication, this is how the drug is going to be delivered, stored, transported, etc.' All those things taken into consideration and all the data that you have generated will help support your application and help you to derive your specifications and shelf-life and things like that. Any changes that you encounter, or any data that you encounter, even negative data, are extremely useful from a regulatory perspective because it allows us to get an insight into the kind of knowledge you have on your product and make good decisions with regards to these important things like specifications and shelf-life.

Reflecting the discussions, she pointed out the basic dilemma that arises in this pursuit: "The most empirical way to figure out if there is a link is to take distressed material into the clinic. But that isn't necessarily a particularly sound strategy for a variety of reasons, one of which is that you don't want to do anything to jeopardize your clinical trials - not just the trial data, but in fact the patients themselves."

Ritter noted that participants at the forum did not indicate that there was an effort to date to use aged/degraded material in a systematic way in the clinic, although retrospective monitoring of the age of the samples and the product profile when used was an approach that was being deployed to help assess clinical correlations.

ä Lilly's Lewis sees a "Catch-22" in FDA's emphasis on knowing the impurity profiles of the clinical trial materials, with the idea that data may then be available to potentially support higher levels.

"The problem is that it is a slippery slope, at least in my mind, in terms of …putting degraded material into the clinic, or even holding back some aged material and putting that into the clinic. How far do you go?" In turn, "especially early phase, there is not enough power in those studies to be able to discern whether there is any differences between a certain age material and fresh material. That is a big issue."

Lewis pointed to the idea he hears expressed that "if there is not adverse events that we find with aged material, then we are okay." However, he sees the need for "more discussion on that topic," suggesting that "it is somewhat dangerous to design these, and especially in an early phase, even if your toxicology data may support higher levels."

Ritter concurred. "One of the burdens that we bear in biotech," she said, "is not just that the loss of our active could cause a loss of efficacy in the product, but clearly the fact that our degradants are themselves potentially harmful in terms of immunogenicity. And that is not something which can easily be discerned in a short-term study or a small sample size."

Ritter pointed out that this concern is another reason for continuing with the full regimen of assays during development. Doing so allows for retrospective analysis of the batches "and actually matching up in some sort of a matrix - what was the age of the material, what was the actual state of the material at the times it was used in the clinic, and correlating that with clinical use - not necessarily in terms of being able to look at long-term safety problems, but at least in the short term being able to say that these are characteristics of the material that have been subjected to clinical trials."

The problem, she noted, is that such an approach requires "a cross-disciplined discussion, and sometimes it is very difficult to get the clinical people and the CMC people together with the right data sets to be able to overlay them and make some sense of it. But that is really what it takes to be able to match this stuff up."

EU Clinical Supply Expiry Mandate Raises Concerns

Another key focus of attention during the early-stage discussion was on the issues involved in stability determinations and dating for clinical supplies, given limited experience with the product, manufacturing and methodologies.

In her summary of the forum discussions on the implications of the limited manufacturing base and lack of knowledge of batch-to-batch variability, Ritter noted general agreement that "you always build on your experience gained from development lots." She advised firms to "try to draw on that stuff. I know that it is very difficult sometimes when a project is moved from hand to hand to hand, but that information is a very important part of the knowledge of the product, even very early on."

Noting that "we don't have a lot published right now in the biotech community about a wide variety of molecules," Ritter again stressed the importance of sequestering materials from those early development activities and "using that to build on your knowledge and be able to start to infer some of the known degradation pathways for molecular types."

In his presentation on early-stage stability assessments, Human Genome Sciences' Spitznagel also noted that "the early development studies are going to provide the most data...and you certainly want to use it." In turn, "the better you set [the studies] up, the more applicable the data is going to be." He emphasized the importance of stress studies, in particular, as "probably the most critical aspect of a good development program" (see box, pp. 4-6).

In general, Spitznagel concluded, in setting clinical expiry or retest dates, the basic point is that "no single approach will work in all cases." The key, he asserted, is knowing the propensity of the product to degrade, and then "always monitoring your clinical lot concurrently." Extrapolation is "something you certainly should do. It can certainly be comforting and can actually provide some very good justification for setting an expiry period. But you have got to be real careful with the actual predictive numerical value that comes out of there. It is often not going to give you the actual expiry period."

ä The issues around setting stability parameters for clinical trial supplies are receiving heightened attention in the wake of the new EU clinical trial directive, which calls for all clinical supplies to be labeled with an expiration date rather than the practice in the U.S. of retest dates. Participants at the CMC forum noted that the requirement is forcing firms to do extrapolations in spite of their limitations.

Ritter commented that the EU requirement is "logistically very difficult. Apart from the extrapolation or how you get to that, the logistics of physically labeling the stuff has been problematic."

Firms with European trials, she said, are "deferring right now to ICH Q1E," but it is "very difficult" to apply the guidance to extrapolations for biotech products, particularly in terms of correlating the different assays involved. The decision for some firms, Ritter explained, is to say, "it is frozen, we are going to put a year on there. We will test it. If it fails, we will pull it out of the clinic."

CDER's Shacter commented that from FDA's "internal perspective, it is more important" to use real-time evaluations to "stay ahead of the curve…so you know what to expect with the stability of your product while you are in early clinical trials…because we understand that there isn't a good basis for having an expiration date per se."

CDER biotechnology office colleague Elana Gubina also expressed the need for caution. She noted having seen examples where "these extrapolations, especially based on a not-well-validated potency assay, can lead to very serious consequences." Extrapolation "may be a very useful tool, but it is a tool. Real-time stability data so far are much more valuable."

ä Gubina joined Spitznagel in affirming the important role played by the early-stage stability studies.

Noting that Phase 2 trials are based on the dose chosen from Phase 1, Gubina pointed out that "if your product is not very stable during development, you really don't know what to base your dose on."

Spitznagel maintained that the expiry period is actually "one of the lower uses of a stability program in early development." Confirming that the product is within specs "is important from a safety perspective" and is "what you are going to file with the agency." However, he noted, "from a development perspective," the stability work "is giving you that lot-to-lot variability data. It is giving you data about your assays" and allowing "you to learn a fair amount about your product during the early clinical program. So even if it doesn't necessarily predict the expiry period beautifully, you are going to get a lot of scientific information out of it."

ä One of the focal points at the late-stage clinical/application session of the CMC forum was the use of small-scale studies to support the stability of the drug substance.

Participants agreed that small scale studies are a common and acceptable practice if the small scale is sufficiently representative. "Representive," Genentech's Nashabeh commented in summarizing the discussions, "means that you take into account material of construction and other technical issues regarding headspace and so on, and that you will preferably have a justification in terms of how you have made that selection."

Participants in the small-scale dialogue pointed out that small scale often represents a worst case situation, compared to large scale where there is less manual manipulation.

Post-Approval Testing Regimen Changes Addressed

"Let's assume that you have gone through your clinical development profile. You have done all these studies. You have shown that some of these assays are not stability indicating. At what point post marketing," Advant queried, "can you actually take them off? Is there precedence from the industry that they have been able to take them off?" A related question, he said, is "can you take off some time points if you are doing, you know, zero, three, six, nine, twelve? Can you do minimal time points if you have shown that the product is stable?"

The debate over the criteria for adding, replacing or deleting assays in the stability program continued in the post-marketing session of the forum in response to a prepared question presented to the participants by Diosynth's Advant, who was serving as moderator.

Amgen's Hasselbacher On Developing A Comprehensive Stability Plan

At the CMC Strategy Forum on stability issues for biotech products held at NIH in late July, Amgen Head of Corporate Stability Carol Hasselbacher gave the following presentation on the "Development of a Comprehensive Stability Program for Product Commercialization."

I am going to continue with our talk on late-phase stability programs by starting with an example. Amgen is a company that has been fortunate enough to have a number of licensed products on the market, and I know that a lot of those of you that I have talked to this morning and others still have products at your companies that are solely in development. So what I was asked to talk about today was what it looks like from the other side. What does it look like after you are through with process and product commercialization? What have you learned? So the questions that I am going to try to ask are, "what could you do better?" and "what did we learn?"

One thing that I have learned is that there is a difference between a stability program and a stability strategy. The stability program: We have all been talking about the requirements….These are things that are the 'how.' This is how you do it. It includes tests, time points, lots on study, special studies, etc. And to succeed, you first of all need to go through the guidance….We also need to get feedback from the agency, and we are provided with lots of opportunities to do that during development. We also need to benchmark industry standards because there are lots of times when the requirements are evolving or expectations are evolving. And just by sitting at your own company and looking at how you have always done it, you are not going to be aware of these new standards.

But what I've learned is that we also need to include a stability strategy, which is the why you do what you do. And the question to ask here is, 'how does your stability program, how does the aggregate group of protocols that you have for your product, support your product commercialization goals?'

This is just a default product development timeline. I am sure a lot of you have variations of this at your company. The arrows are familiar developmental stages from early target through Phase 1, Phase 2, Phase 3, filing and launch. If you look up above where that large yellow arrow is, that is your stability program areas as I see it. You have got pre-clinical, Phase 1 and Phase 2. And then our focus is going to be on primary stability, the primary stability process, and getting ready for commercial. But these are all parts of the program.

If you look down below that group of arrows, you see a number of portals. For example, first-in-human portal, where you decide is the product ready to go into the clinic, up to end-of-Phase 2 portal - are we ready for our pivotal studies? Commit to launch - are we ready to launch? Most companies go through this sort of staged assessment, either formally or informally. But these have more to do with your stability strategy because at each point, you want to make sure that your stability programs are ready for the product development goals.

Here is an example, and this is something that we have talked about a little bit this morning with respect to early development. For a stability program, early development, your first and only GMP lot is put on stability. This meets all your requirements. That is fine, that is your program. But what about if you would be able to manufacture a couple of smaller lots and put them on stability. What does that require? That requires a lot more work up front, probably. It requires persuasion. It requires a good participation for the stability group within the project development plan and teamwork across functions. And it also requires for you probably to have some sort of documented rationale for what you hope to gain from this. But then you end up with clinical experience that is not limited to one data point. So this is sort of the back and forth that I am talking about between work up front and fast to the clinic, and where the stability groups need to position themselves as advocates in this process.

Of course stability strategies are phase-dependent. Each stage of development has its milestones and endpoints. But nobody knows when a product development program is kicked off where you are going to end up, what the product will look like. There are too many changes along the way. So what do you do about these changes? Stability has a big problem because our work depends on real time. We can't throw resources at a problem and have it go away. We need to sit tight and let the stability samples incubate for a certain number of months. So sometimes it is helpful to just think of things more generally in terms of design phase, which would be research and pre-clinical, development phase which would accompany the body of the development work, and confirmation phase with your primary end and conformance batches.

Have Commercialization in Mind

For those of you who haven't been to this point yet, I would say always start with commercialization in mind....Think first about the commercial phase and how to realize your development objectives. Work with the product teams also to determine timing and prioritization for each requirement. And you also need to do good risk assessment. You need to define what the risks are and communicate those risks to the project team when you can't meet your timelines or when there is work that you would advocate that can't get done.

So what are the goals for commercialization? I can think of three right off hand: ? one of them would be develop appropriate stability-indicating assays ? also hand in hand with this is develop a thorough product understanding, and as I have been talking about earlier ? understand and support the product deliverables.

First of all, whether you started out with a few assays and were adding assays as you went, or you started out with a whole bucket of assays and you are refining them, you want to be at a point in the late clinical phase where your assays are appropriate and comprehensive, and each test had better have a discreet purpose. You don't want a lot of redundant testing going on. You also need to document your rationale for this panel of tests. Once you are gone and there is no record of why this set of stability tests was chosen, it is not going to be very good for future product work so you need to document everything.

Hand in hand with this you need to: document your physical/chemical characterization data and summaries; define your degradation pathways;...also establish and document the stability profile. A word about documentation. It is really important. You really have to have a system at your company for archiving information, making it easily accessible to others. This is very hard to do in real time, but it is also much harder to do retrospectively as I am sure you all know. So it is something that just needs to get done.

For analytical considerations, the first goal in the design phase…early formulation development. It is often not the commercial development. Commercial formulation and stability studies are minimal at this point. They are used to support the tox lots and recommend an early formulation. The tests as was mentioned this morning are also quite general, probably not optimized. If you are working on monoclonals you may have platform strategies that you use at this point.

During development: This is where analytical formulation, process, characterization all go on in tandem. Your assays are being developed in parallel with understanding the process and the product. So at the end you have to have robust, appropriate assays, and the commercial process, formulation and presentation must be defined.

Confirmation phase: You want to have your assays validated, tailored for the product, and suitable for monitoring quality and consistency.

For characterization, this is going on at the same time. Design phase: very little happening in the PD and quality world. A lot going on in research. In the development phase, though, you begin to understand the product degradation pathways and obtain this understanding physical/chemical analysis, theoretical deamidation sites - look for that in the primary structure, stress studies. But along with this, your stability experience is what is going to provide your practical understanding of what really happens with the product. You don't really care about some stress situation that will never happen with your product except that you want to be able to make sure that it doesn't happen. Your real-time studies and your accelerated studies are going to be very important here. Confirmation phase: product fully characterized - this is not the time to do development.

Some deliverables are covered in the stability program plan; for instance, the ICH guidance, other regulatory requirements. We all know that we can go to the guidance and find out when to do photostability studies, for instance, and how to do them. But some deliverables are going to be linked to your specific product strategy and very dependent on your individual product plan. So you have to do teamwork to understand what those project goals are.

As an example, let us say you are going to do a global Phase 3 study. How do you support clinical excursions? You may have regions where you have never been before in the clinic and these regions may not have appropriate storage units. What do you do? How do you support that? Likewise with transport. What happens if you have a study that is ongoing in Brazil and the product is sitting in a warehouse for a day? You don't want to be constantly throwing away your clinical product, so you have to be able to develop more robust systems along the way if you are going global.

How to meet regional expectations? I'll talk about this a little bit later. This is something that has been a big issue for a number of companies recently.

General Strategies and Complications

So some general strategies: Develop adequate clinical experience - goes without saying. Try to use a set of batches in clinical development with a range of attributes. Also reserve samples - we have heard that already this morning and that is an excellent idea. Demonstrate that the product can be held frozen and then freeze samples and store them because the assays will not be the same later in development.

Define your commercial manufacturing process and plan presentation as early as possible. This is not always easy because marketing is always waiting to see what people want and often you will find changes towards the end of development. But try to be an advocate for getting that information early. Design space…is something that is very important during development and the stability function can advocate for this. The design space idea is intended for process development. However, I think it does have application to stability.

Quality by design…i nvolves starting out with the idea that you are going to build quality in from the beginning. You document what is good enough, what is acceptable, and then you develop your product according to those deliverables. This concept may be applicable to stability simply because the more you know about your product, the more flexibility you are going to have at commercialization.

Product understanding and assay development also go hand in hand, as I have said. The goal is to gain an understanding of the product so that chosen tests monitor the relevant stability parameters. That is a big issue, probably one of the biggest ones. By the time you are at commercialization, you want to be sure that your tests are robust and that they are complete and that they monitor everything that should be monitored for your product.

So all of these ideas - if you can be an advocate for your group and help make sure that this development happens at an early time, you will end up with a better assay profile, better ways to do out-of-trend analysis, how to resolve nonconformances, and you will be able to bracket and matrix more easily.

Of course there are complications….Those of us who are in a business do understand that there is always change going on with development timelines, priorities, resourcing issues, groups being moved from one project to another. The idea of speed to the clinic, while it is essential, means often that there is less up front characterization and less method development up front. There are always changes in formulation, presentation, manufacturing process. And often you have a small number of bulk clinical batches so you don't have a very good understanding of the variability of your process early on.

The results of this are that your product knowledge at commercialization may be impacted and you may lack needed product understanding and expertise limits. Your options will be limited and your opportunities as well at commercialization. This is just a word to those of you who haven't been there - keep looking forward. The reason is remediation and catch-up after licensure is both risky and very expensive.

So what do you do when your project team says, 'sorry but we have to go into the clinic in two months?' You start with an understanding of what you want to have at filing. Always keep that in your mind. You actively participate on the product and project teams. You drive for needed development data and tests and escalate issues.

However, you need to use resources wisely. You can't gain credibility by asking for studies that are unnecessary or irrelevant. So you really need to understand what the issues are. Often, at least in larger companies, you will find that there are other groups that are doing comparable work, so you can actually find information that you may not know existed.

Think proactively. It is easy for those of us managing stability programs to put samples aside and use them for a sample library for later on because we know that for comparability we are going to need them.

Global Considerations

I am going to talk a little bit about global considerations now. ICH harmonization really provides a lot of value for those of us who are trying to understand what countries require. One of the things that it is hard for people to understand is that good stability programs may not ensure good global strategy for stability. The reasons for this - I think there are two reasons. One is that while the small molecules guidance is fairly prescriptive, the large molecule guidance is open to interpretation, as it has to be, and some countries interpret things differently. Also, there are regional or customary expectations and traditions that predated the adoption of the guidance or are otherwise still very ingrained in a culture in another country.

When you develop a program, you need to accommodate regional requirements to allow for business model changes, because you don't always know where you're going to be marketing your product or where you are going to want to apply for licensure later. For stability, the timeline and cost issues for doing additional studies is untenable most of the time.

A couple of things to add to this: The FDA makes it quite easy through a lot of possibilities for interaction throughout development for companies to come to the agency and say, 'you know, we are making these changes.' Or, 'this is our strategy, is this acceptable?' Other countries don't do this. I am thinking particularly of Japan where between the time that you file your application for a clinical trial and the time that you file your application for licensure, there is no contact with the agency. So all of that bulk of years of development work is invisible to the regulators, and they only see it when you file your license application. So your question has to be, 'were these changes acceptable? What changes can we make?'

There is another issue and that is the annual report structure that the US has is very helpful. And I can think of this in contrast to the European Union which doesn't have such a system. Where, for instance, for a process change maybe you would need six months data in Europe for the bulk, you might get away with one month data for the US, because the US knows they are going to be getting updates periodically. So there are good reasons why there are different requirements.

So for Japan, how do you succeed? First, you need to accept that regulatory requirements are different…. It is hard for some companies to believe that there is really a difference and so a lot of it is just getting over that initial 'that can't be true' sort of reaction. But what you will find is that there are requirements and then there are expectations.

So your first big task is to define what the requirements are. And to do this you need to work with your partners and work with your consultants. Benchmark: Crucial for our experience in being able to understand what the true requirements are is to benchmark comparable industry practices.

We have also been able to find English translations of Japanese regulatory documents. There are websites that exist that have such things. While you don't have the chance to speak formally with MHLW or other regulatory agencies in Japan, you do have the opportunity to speak informally. To do this you need an entree. You need your partner company in Japan to go to the agency and initiate an informal discussion, and then you can be invited in to tell this person about your company. This is all to do with the trust issues, with the familiarity issues, for Japan that are very important.

So once you have decided what your requirements are and what are the expectations or 'nice to haves,' for requirements you had better try and follow them if you can. Consider harmonization with other regional requirements. If Japan requirements cause you to do twice the stability work that other countries do, you might have to see if it is feasible. You might have to prioritize and your Japanese colleagues can help you with that. If you have resource constraints you might not be able to do everything. But if you don't do everything, you need to make sure that decision is clearly understood up front.

For expectations, again consider the cost of compliance with the risk of non-compliance. And in this case, some of us may not understand the powerful position that the Japanese clinical trial director holds in terms of not only implementing the clinical trial but in your own good name as a company. So if your Japanese colleagues at your partner company indicate that there is a big problem here with not meeting an expectation, it could be because of this comfort level factor - that your good name is very fragile in Japan and you really need to be sensitive to your colleagues when they tell you that, 'oh we really have to do this,' because they might be absolutely right.

Also I just wanted to say document everything again. It is especially important with working with development in Japan because, as I said, you do not have the opportunity to update the agency regularly, so you need a good record of all your changes.

Examples of issues that are very common: Repetitive testing…. Japan regulators customarily have expected that you will do three replicates of your test even if you have a test that has one replicate. You will do three of those tests on each stability time point. Seems a little counterintuitive because the guidance tells us we should be using our method validation information to decide how many replicates are required. However, that has nothing to do with it for Japan. It is an expectation but it is a very strong one. I am not saying that you need to do it but you do need to be aware that you are going to have a conversation about this.

Full specification testing at least yearly is another thing that is often asked for and again, something that will require a lot of negotiation if you don't want to do all of your specification testing every year. You will need to have a good justification for why.

I think that probably one of the biggest issues for Japan regulators and Japan industry as a whole is that they don't really understand biotech as well as other jurisdictions might. And so they do tend to adopt a lot of small molecule guidance inappropriately. A lot of what you need to do involves mentoring and educating and trying to develop a relationship with regulators informally so that you can explain your company and explain your goals.

Related issues: Specifications. This is something that is an issue because in Japan, your specs are usually set on your three conformance or confirmation lots, and your supporting data is not considered to be that relevant. So you might end up with tighter specs in Japan. Use of JP reagents and test methods, cosmetic appearance and particulates are all issues. The cosmetic appearance is not a quality issue. It is just that the clinical trial directors expect to see commercial-looking material at Phase 1. They want to see blister packs if you have got pills, and so that is going to be an issue.

Overall Benefits

I'll just say a little bit about a couple of things that we probably want to touch on in our talks. Out-of-trend investigations: There is really no consensus about this that I am aware of. There is no real guidance out there for us. There are some papers that have been developed by a PhRMA group. What level of monitoring is appropriate for biotech products? How do you identify out of trend and what assays are most significant for monitoring?...In this case, stats are not well established or widely used for this sort of activity. My point about the design space and the building in quality is that the better that you understand your product and the better you understand your assays, the better you'll be able to deal with out-of-trend issues, which you need to deal with.

Same with non-conformist resolution. The bottom line is that you will, by understanding your product better, by having more robust programs that started earlier in development, be able to address non-conformances in a more comprehensive way. And then of course, improving specifications: Specifications and stability are tied. You cannot get to commercialization without having all the tests be relevant and non-value added tests gone. You need to have that happen. The whole point of this is to reduce the risk of accepting unsuitable product or rejecting acceptable product.

So in conclusion, a comprehensive stability program includes the program and the strategy - what you have done, how and why you have done it, and document your rationale. There is a whole list of benefits which I've talked about.

A comprehensive stability strategy requires more early development work and organization. So it needs advocacy by those of us in technical fields. The work must be value-added and you have to keep the goals in mind. You partner with other functions. Sometimes you need service level agreements to make sure that things happen. You definitely need a good project plan and good program management. The benefit you will have is that there will be increased visibility to commercialization goals and improved teamwork.

Documentation: Again document everything, including your project plans, decisions, rationales, product information and also contingency plans.

It will provide a roadmap to commercialization, fewer surprises, a more robust submission, an increased ability to focus on phase-appropriate studies, incorporate global requirements as you need to, ensure that the tests are appropriate and improve your specs, address nonconformances, and evaluate out of trends and apply for post-approval assay relief as appropriate.

Genentech's Nashabeh responded "basically yes to both." Typically the assays are removed by Genentech prior to licensure, he explained. Based on all the cumulative data from the clinical phase, "we will adjust the number of assays and type of assays that are on the commercial stability protocol going forward."

Addressing the follow-up question on reduced sampling, he said that "it is not unusual that for commercial [product], for the annual commitment, that you may not have the same time points, the same frequency of testing as you would have in early development. Whether this occurs right at licensure or later on - that you will actually reduce the time points - is a matter of how much confidence you have in the prior data. That is how we have kind of done it over the years."

ä The post-marketing session participants discussed further the implications involved in making changes in the testing regimen as the analytical technology evolves.

Reiterating the agency's general interest in encouraging sponsors "to come up with better, more reliable, more precise assays," CDER official Shacter emphasized that "the trick is to be able to demonstrate the continuity."

The question, she said, is "are you really measuring the same parameter with both assays, or are you measuring different specific molecular characteristics? And if you are not, how can you link them, how can you go between them so that you have, again, continuity of your trending data - so that you will know that all of a sudden if you have a change in the stability data, was it a true change in the molecule or is it because you introduced a new assay that either has greater sensitivity or is measuring actually a different element of the molecule?"

Shacter recommended that sponsors "bring that change to the agency and discuss it with us so that we can come up with some good solution. Certainly make a proposal, teach us about what you are looking at, what the change is, and then we will work with you to institute the change."

ä PharmaNet Consulting Executive Director William Egan, who recently left the agency after an extended career in CBER review management, followed Shacter's comment by again emphasizing the need for retention samples.

"Having the retention samples to do the other comparisons is extraordinarily helpful because the material that has gone into the clinic and the material that has been going into people has not changed, only our ability to characterize it - being able to detect…the things that are actually there, whereas previously you could not. But the matter itself has not changed, only our perception of it. I think one has to keep that in mind."

Define Purpose and Goals Of Stability Program

Egan and Shacter gave presentations at the beginning of the post-marketing stability session that helped inform the ensuing discussions.

Egan stressed the need for understanding and carefully defining the purpose and goals of the stability program, while Shacter discussed the implications for changes to expiry and to the product, process, or analytical regimen. Shacter also provided case studies of stability problems that have come to the attention of the Office of Biotechnology Products regarding the products it regulates (see box on pp. 23-28).

ä Egan emphasized that understanding the purposes and goals of ongoing stability studies is necessary to determine how those studies should be set up in terms of what is evaluated and the number/spacing of the sample data points and the algorithms involved. "So I think it is actually fairly important to first answer the question, why are we doing these studies? What do we hope to get out of them? And then, how can that goal best be accomplished?"

The design of the protocol, in turn, will be dependent on those purposes and goals. The questions at issue, Egan pointed out, include: ? "Are the data from the annual studies being used, pooled, to better define the shelf life at some period?" and ? "How does one define 'out of specification' for any ongoing study? Does it refer to single data points or collective measures of those data points? And again, is that context dependent - dependent on the study that was set up?"

In making such OOS determinations, Egan continued, "certainly, if you have an ongoing stability study of three points, one at the beginning, one at the end, and one in the middle, and the one in the middle is out of specification, there is not much choice about how to deal with that. If it is an ongoing stability study and there are 50 data points and one of them is out, and the rate is very much the same as what had been observed historically, then that is not an issue. So it depends on exactly the study that was set up."

One question is whether the protocol can be defined to permit additional testing as warranted if a few points are starting to appear out of the expected range. "That protocol," Egan explained, "has to be set up so one is not testing into compliance, but rather trying to better define a particular measure - namely what is the quality of that product - and then answer the question about the consequences of being out of spec on other manufactured lots if that one lot is taken as representative of everything else that is being manufactured."

Egan summarized that ongoing stability testing programs "should be very well constructed to meet their intended goals, and those goals should be defined prospectively. And to the extent possible, I think we need to define how we are going to deal with various situations that might occur during the ongoing stability study - for example, a data point or points that fall below a certain potency limit. How are these going to be handled?" He stressed that "these stability studies are resource intensive, particularly if they are done well, so I think they should be thought through as to their purposes."

Looking to the future, Egan suggested that "what is needed is a continued dialogue on the purpose of annual stability studies and how to best achieve that purpose or purposes."

ä In her presentation, Shacter cautioned manufacturers not to think of the stability protocol as locked in stone and not needing amendment.

"A point to the wise and that is that change begets change," she advised. "Your stability protocol may have been very well designed for the original manufacturing process that you were using, but when you make a change to the process you could actually impact the relevance of the stability protocol. So you may need to modify the stability protocol to stay in concert with how you have changed the process."

For example, where formerly a firm did not need to be continuing to look at deamidation of the product, if a step is added that increases pH in the process, "you may need to go back and reassess that for stability," Shacter explained. "And similarly, if for some reason you have an extra high concentration of your protein during the process you may need to do additional aggregates testing."

The CDER biotech official further cautioned that changes in the manufacture of the drug substance will generally necessitate putting drug product lots made from that bulk on stability.

Shacter also provided the participants with some guidelines regarding making changes to stability tests: "If you are going to change a test, you should demonstrate, needless to say, that the test is either equal to, or better than, the original test that you were using. And if you want to remove a test, then you need to demonstrate that lack of utility of the test that you have been performing."

ä In line with the forum discussions, Shacter recognized the degree of concern among manufacturers over the number of stability tests that need to be done prior to and after licensure.

"We have talked a lot about the need to test orthogonal methods or any method that shows a change in stability, and I know the concern among manufacturers is that you will always have to do those tests…into perpetuity." However, she stressed, "that is not the point. I can guarantee you that we will consider the data showing the lack of utility of the stability test, so that if it is really not relevant, if it is not teaching us anything about either a critical quality parameter of the product or one that simply isn't variant even though it is critical, we will talk with you about taking that out of the stability protocol."

If a particular reviewer proves recalcitrant, Shacter urged participants to refer to the meeting concensus on this point, because "we don't want to have a burdensome number of stability tests ongoing." On the other hand, she stressed, "it is so important to have that information before licensure so that we can understand the product and know the meaning of any changes that we see."

ä Another important point made by Shacter relevant to changing a stability test - either introducing a new one or replacing an old one - is to do head-to-head comparisons.

"Don't rely on historical information from the former test," she warned, noting that "many people do try to rely on historical data versus doing side-by-side comparisons of the test." She urged sponsors to "accumulate as much comparative data as possible so that we know we have a basis for assessing the change in the test, and then submit the changes as a prior approval supplement to the agency." ¨ ¨

CDER's Shacter on Biotech Product Post-Marketing Stability Issues

The following talk on post-marketing stability issues was given by Division of Therapeutic Proteins Biochemistry Lab Chief Emily Shacter at the WCBP CMC Strategy Forum in late July. Shacter discussed the implication of manufacturing and methology changes on the stability program and cited stability-related problems encountered among the products regulated by CDER's Office of Biotechnology Products.

What I am going to focus on is what happens when you have a change in the manufacturing process, either intentional or unintentional, and what you should be doing about that. This would be a change to the process, to the product or to any assays that you are using as part of your stability protocol. The main goal of the stability studies in this case is to demonstrate comparability and continuity of both the product and your assessment of stability of that product.

I will also talk about some stability problems that have become apparent during licensure and focusing somewhat on managing and understanding OOS results, out of specification results, and some findings of inadequate stability testing. At least half of my talk will be giving you some case reports of various things that we have seen at the FDA that have happened in the course of stability testing of a product.

So just a little bit on guidance to sort of put everything in focus: It seems a little late but on the other hand some of these things actually have not been said so far in the meeting. The setting of specifications for product release needs to ensure that the product continues to have the safety, purity and potency reflected in product labeling. The emphasis for the purposes of this study is that it has to continue to have those qualities. That is the purpose of stability testing. The other most important point for us to remember is that a lot of our statutory ability to even regulate stability comes from the Federal Food, Drug and Cosmetic Act, Section 502(a) where it says that 'a biological drug product is deemed misbranded if its labeling is false or misleading in any particular.' So if your product has changed to the degree that the label is no longer reflective of the product, then the product is misbranded and you will have a compliance issue associated with that.

Advice On Making Changes

One of the common questions that comes up is 'how do you extend your expiration dating period?' Now we are talking post-licensure, and expiration dating is obviously a critical parameter, whereas yesterday when we talked about our early stages of clinical studies, 'well what is an expiration date, really?' But now we are talking about true expiration dates both for the drug substance and the drug product.

Really, all you need to do in order to extend an expiration date is to modify the stability protocol to add the longer time points, provide the updated stability data and any trend analysis that you have on those data to the agency, and if you have an approved stability protocol and you have data that meet the acceptance criteria showing that extension of the expiration date is in fact an appropriate thing to do, then you can extend the expiration date in an annual report. This obviously assumes that no new trends have been observed over the longer time course of looking at the stability of the product.

If you make a change to the manufacturing process or product, you should be putting those new lots on stability. The number of lots you need to put on stability depends on the nature of the change that you have made in the process. A significant change in the process might require something like two or three lots, whereas a relatively minor change in the process which is not expected to have a significant impact on product quality, probably one lot will do. We gauge those on a case-by-case basis depending on what we think the risk is to the stability of the product.

You should perform accelerated and or stress testing along the lines that we have talked about already during the meeting. You should do trend analysis across the lots to see if the manufacturing change has had any impact on what you know about stability of the product. You should analyze the data - and this is the most important point that I would like to make - using prospectively defined, statistically based acceptance criteria to assess the comparability to the previous process. It is not sufficient to take the accelerated stability data for example, to look at the lines compared to the old material and say, 'yes, they look similar to me.' That won't be acceptable. The other point I would like to make is that you should really be assessing again, as we have discussed, the rates of decay and not just end points over the time course of the stability testing.

After you have done all this you should submit a supplement to the agency. The level at which that supplement needs to be, whether it is an annual report, a CBE-30, or a prior approval supplement is determined based on guidance and CFR 601.12. One of the easiest routes is if you have a comparability protocol, then you can most likely submit the new information in the annual report.

A point to the wise: Change begets change. Your stability protocol may have been very well designed for the original process that you were using, but when you make a change to the process, you could actually impact the relevance of the stability protocol. So you may need to modify the stability protocol to stay in concert with how you have changed the process. Some simple examples are if you added a step where you have an increased pH in the process where you formerly did not need to be continuing to look at deamidation of the product, you may need to go back and reassess that for stability. Similarly, if you for some reason have an extra high concentration of your protein during the process you may need to do additional aggregates testing. Be sure that your stability protocol will continue to support any change that you make in the process.

A simple point: If you make a change to the manufacture of the drug substance, you also need to put drug product lots from those new drug substance lots on stability. That is a general rule.

Now if you want to make a change to your stability tests, a couple of guidelines: If you are going to change a test, you should demonstrate, needless to say, that the test is either equal to or better than the original test that you were using. If you want to remove a test, then you need to demonstrate the lack of utility of the test that you have been performing.

On this point, I know that there is a lot of concern among manufacturers over the number of stability tests that need to be done prior to and after licensure. We have talked a lot about the need to test orthogonal methods and lots of methods or any method that shows a change in stability. I know the concern among manufacturers is that you will always have to do those tests…into perpetuity. That is not the point. I can guarantee you that we will consider the data showing the lack of utility of the stability test so that if it is really not relevant, if it's not teaching us anything about either a critical quality parameter of the product or one that simply is not variant even though it is critical, we will talk with you about taking that out of the stability protocol. You can tell your reviewer since I know that there are some differences, depending on what reviewer you get, but just refer to this meeting because we do do that. We don't want to have a burdensome number of stability tests ongoing. It is so important to have that information before licensure so that we can understand the product and know the meaning of any changes that we see.

If you are going to change a stability test, introduce a new one or replace an old one, you should do head-to- head comparisons. Don't rely on historical information from the former test. Although that seems obvious probably to most of you, you would be surprised at how many people do try to rely on historical data versus doing side-by- side comparisons of the test. You should accumulate as much comparative data as possible so that we know we have a basis for assessing the change in the test. Then submit the change as a prior approval supplement to the agency.

I think an interesting question that comes up is 'how do you set comparable specifications for different types of end points?' For example, if you are looking at charge variants and you are going to replace isoelectric focusing, which is an assay with an awful lot of variability, with HPLC, which might give you more reliability, how are you going to compare the end points from those two tests since they are really very different?

There is not an a priori answer to that but it is obviously an interesting question for everybody to be thinking about. The important element here is to make sure that you have continuity. It is to make sure that the tests are really measuring the same thing, so that you know your changeover from one test to the next is still measuring the same parameter, and that you have continuity when you are doing trend analysis of your lots over time and over the course of many years. You want to be able to refer back to your older data.

Examples of The Stability Problems FDA Has Seen

The rest of the talk will be giving…case reports of some incidents that we have seen that relate to stability in the many products that come through the Office of Biotechnology Products. I didn't get any examples from the Office of New Drug Chemistry, but I assume that the stories would be rather similar.

The first one is a case [in which] the sponsor wanted to change from a lyophilized to a liquid formulation. In making that change they needed to now start testing the vials in the inverted as well as the upright configuration. What they observed was an increase in protein degradation over time. The reason for this was that now that the fluid was touching the stoppers, there was leaching of a metal from the rubber stoppers and it was activating a metalloproteinase that was present in the product. That metalloproteinase was probably always present in the product, but because there wasn't enough metal there to activate it, the changes in degradation were not seen until they went to this change in formulation. In this case, the resolution of the problem was to add a chelator to the formulation and that was acceptable.

Probably one of the most famous recent cases of a serious problem that we have had with a product is the story of Eprex and Pure Red Cell Aplasia. And I'm sure that most of you know the story but basically there was a change in formulation for this erythropoietin product - which is marketed in Europe, not in the United States - from a formulation containing human serum albumin to a formulation containing Tween. They used the same container closure system, which was a pre-filled syringe that had uncoated rubber stoppers. After the course of this change, basically what was found was that patients were coming down with a severe and life threatening disease, which as it turned out - at least the current hypothesis on it, with some good data but nobody's absolutely sure still what the cause has been - there was a leaching of organics from the rubber stoppers causing a change in probably immunogenicity of the product.

We don't usually talk about the Eprex story as having been a stability story, but the truth is that there was an increase in leaching of these organics from the rubber stoppers over time, so that at early time points the levels that were measured by HPLC were relatively low and they went up over time. There may be a correlation between the patients that actually experienced PRCA who were receiving old lots of material versus younger lots of material.

One point to make here is that there was no observable change in the drug product. The hypothesis is that the leachates that were injected into folks were actually serving as adjuvants and heightening the immunogenic response to the protein. Again this is hypothesis - it's not known, there is some data to support it - but the point is that actually the testing of the drug product wasn't detecting any change there.

The point of these two stories is that you need to revalidate the system compatibility and the stability of your drug product and drug substance if you change your formulation or the container closure system. What I think we learned from both stories is that you need to be able to examine changes both in the protein and in the impurities profile, because both of those - pretty serious changes to the product - resulted from impurities and not from product-related changes.

Another story where a sponsor wanted to go to a low dose version of a lyophilized product. This is a product that was very stable at 15 to 30^o C, had a long stability history, and in going to the smaller configuration the vial was smaller and the content was smaller, but they used the same container closure system. But what they found was that this smaller configuration was not as stable to store it at 15 to 30 ^o, basically controlled room temperature, as the larger vialed configuration. It turned out that they needed to be storing this lyophilized product at 2 to 8 ^o C instead of room temperature.

We don't understand the reason for this - possibly the ratio of the container closure system to the amount of protein in the vial. But the point is that you wouldn't necessarily expect it, but you can have a change to a lyophilized product just by changing the vial configuration.

Another observation was a series of data that were showing that there was a 20% loss in activity of a product - this was a cream in this case - over the shelf life. These were data that were discovered through post-marketing commitments to look at the ongoing stability of the product. What happened was because there was such a dramatic change in the activity of the product, it went out of spec while the product was on the market. It would have been recalled if there was any product left, but I believe that in this case, there wasn't any left. There were several lots that went out of spec. [They] were already used up by the time of the 12 month shelf life. So there wasn't any actual recall.

The resolution to the problem was, I think, a little bit creative, and that was that the sponsor added an in-process limit. So they did not change the release specification. They instituted an in house in-process limit that required that if the product, on release, was lower than a certain level, that lot could go out to market but those lots would have to be placed on stability. If the lot fell out of specification while the product was on the market, it would have to be recalled by the sponsor.

The next…is a case where the pre-market stability data were really inadequate. The sponsor extrapolated from real-time data to set an expiration date which was set wrong. This is a case where the product was actually gaining activity over time. Also to place as a reminder that products also gain activity over time, and it is not just a matter of losing activity, but also gains of activity.

This product gained activity in half of the expected time. So they had had a 24 month shelf life but within 12 months the product was out of specification. A re-analysis of the data following the guidelines in Q1E, showed that actually, the expiration date was not set appropriately and that the product had a shorter than expected shelf life at room temperature. The problem was probably aggregation in this case. I'm not sure that that was fully established, but the resolution to the problem was to change the shipping temperature and ultimately the storage temperature to 4° centigrade.

I think in this case, we don't like to admit this, but I think the FDA made a mistake in allowing the extrapolation that was made. So this really shouldn't have happened, and, in retrospect, I think that we will definitely be much more careful in terms of using the appropriate statistical criteria in setting a shelf life based on extrapolated data. We certainly don't like to do that at all. As a general rule we will extend shelf lives an incremental amount. For example, if you have 15 months of data, a shelf life of 18 months, if you know the decay rates, is a reasonable extension. But to go, for example, from 12 to 24 months is something that at least in DTP [Division of Therapeutic Proteins] we never do, and I am sure that DMA [Division of Monoclonal Antibodies] doesn't do that either. So lessons learned even for the FDA.

Now I have two examples of cases where products went out of specification for potency, largely due to poor potency assays. In this first case the potency assay had a very high degree of variability. No surprise, a number of lots were tested out of specification for potency while the product was on the market. Unfortunately in this case, this was a relatively poorly characterized molecule, so there were not good physical chemical tests to be able to tell us or the sponsor whether it was truly missing its potency specification or whether it was really an error due to a lousy potency assay that had an awful lot of variability. So there were multiple product recalls in this case. The rule of thumb coming out of this is that the looser your bioassay, the more chance you have of getting an out-of-specification result. We would encourage you, and I'm sure you want to have the same, to have the tightest assay limits that you can establish.

I know a lot of people talk about how widely variable potency assays are. I have to say that in my experience at the FDA, that is really not necessarily true. Even for assays that are around that are proliferation-based assays, so complex cell-based assays, it is not the norm to have the assay variability be in the range of two to three fold. Now some of the control comes from using an internal reference standard. But that is part of having to deal with the complexity of these assays. But I actually personally don't buy the argument that potency assays are inherently variable.

The second example of an unreliable potency assay resulting in out of specification results: This again was an enzyme that appeared to gain activity at a higher than expected rate during storage. There were both multiple product recalls and failures to release the product. In this particular case, there were two concurrent problems happening. One was that the potency assay was very inaccurate. It was actually giving them false high results, just by virtue of the design of the assay. It was a really poor choice of potency assay. But the other issue that was happening was that the product was unstable. It actually truly was unstable. It was a conjugate and the conjugate was falling apart, and as it fell apart that also changed the activity. So there were two concurrent things going on which made it very difficult to be able to predict what was going to be happening.

The resolution to this problem was to both improve the potency assay - to completely change it - so not just find better ways to tighten up this particular assay but to ditch the assay entirely. It was not destined to be a reliable assay in the first place. Then also to improve the molecular stability of the product. Obviously what you want to be doing is minimizing the variability of the assay and the potential for instability in your product.

One other point to make in this case is that since this was a conjugated product, obviously you need to test for the stability of the conjugate. Although that seems like an incredibly obvious thing, surprisingly that was not done sufficiently before this product went to market.

Now a case where we had an invalid process validation leading to products going out of specification while on the market: What happened here was that the manufacturer had in theory validated through small scale studies that a pH acidification step could go down as low as pH 2 and the product would be fine, there would not be any stability issues. But in fact what was observed was that marketed lots in which the material had been exposed to pH 2.2 actually were going out of spec for aggregates at pretty early time points - in fact early enough that it's probable that some of these lots should have been detected upon release, but because the aggregates assay was not terribly reliable, they actually were not testing it even on release. As a result of this, product lots were recalled and the manufacturer, to resolve the problem, basically raised the lower limit of the pH to which the protein could be exposed for viral inactivation studies.

Another case where there was a product that had multiple different product strengths. Issues were coming up for the low strength vials that were reconstituted in bactericidal water for injection. What was found was that the low dose versions were going out of spec for protein. In fact they were losing 20% and more of the protein in the vial. This is not something that we see very often. Most often, parameters such as pH and protein are pretty rock solid. So this was an impressive loss of protein, possibly due to aggregates in the protein coming out of solution, and it might have resulted from an interaction with the benzyl alcohol. The point was that the manufacturer hadn't validated that this product could be reconstituted and be WFI at all. So there were multiple product recalls.

Again, the resolution of this problem was for the sponsor to validate the use of WFI for reconstitution of the low dose vials. This was submitted to the agency as a prior approval supplement. The sponsor also needed to increase the protein release specification by 20%. So instead of being something like 90% to 125% of the labeled amount, it actually increased to 110% to 125%.

Now the sponsor came in and they said, 'Well, actually, we would like to loosen the specification for aggregates, because when this product gets reconstituted [in] WFI then the product goes out of spec for aggregates. But that is what is going to happen so can't we just change the specification for the allowable amount of aggregates?' Needless to say the answer was 'no.' If the specification is set wrong in the first place, then you can't retroactively come back and say, 'Well gosh it was always that way, therefore we want to change the spec back to what we should have known at the time.' That is not allowed.

We also do identify stability-related issues on inspection. I'll give you a couple examples of those. In one case the sponsor was testing the stability of a process intermediate, but the test samples were not being stored in the same configuration and the same conditions as the product intermediate was being stored. This was a 483 citation because, again it seems fairly obvious, the test samples need to be stored under the same conditions as the product itself.

In another case there wasn't any stability testing of a protein excipient that was used in the final drug product, even though it was known that this excipient would affect the PK and PD of the active ingredient. In fact that is part of why the excipient was there. The sponsor should have been testing the stability both of this critical excipient ingredient as well as the active ingredient. So that was a 483 citation.

We have also had any number of 483 citations come in from the field because of inadequate stability indicating assays in a stability protocol.

Now there is one 483 citation which we might have some reasonable disagreement on, and that is that we have seen some 483 citations coming to us for failure of a sponsor to do endotoxin testing as part of the stability protocol. The scientific rationale for requiring this testing is unclear to us. So there is a little bit of misunderstanding I think in the field about why you would or would not have to have that in there. If this happens to you while you have the field inspecting your facility, ask the inspector to please call the product review team so that we can hash this out with them because this is a 483 citation that does not need to happen in our perspective.

So some cheap advice: You should be establishing and optimizing meaningful and quantitative stability-indicating assays. You should plan for stability issues when making any change to the process or to the product. And you should institute appropriate stability testing as part of a process change. Don't operate on assumptions, because we at the FDA don't....As we like to say, 'Just show us the data.' We hope that we will exercise our own scientific acumen to negotiate with you about what the appropriate response should be.

Table
View full table

Ask The Analyst
Email

Print
PDF
Bookmark
Share

Tags:

Latest Headlines

24 Apr 2024

Pharma Can Pursue Claims Against Providers For 340B Duplicate Discounts In Medicaid Managed Care, HRSA Says

24 Apr 2024

The Aspirin Test For AI?

24 Apr 2024

Mary Beth Clarke, CDER Exec Programs Office Director And Generic User Fee Shepherd, Will Retire

24 Apr 2024

Rx Advertising: Interchangeability Doesn’t Mean Clinical Superiority To Other Biosimilars, FDA Says

See All

You must sign in to use this functionality

Authentication.SignIn.HeadSignInHeader

PS000076

Ask the Analyst is free for subscribers. Submit your question and one of our analysts will be in touch.

Your question has been successfully sent to the email address below and we will get back as soon as possible. my@email.address.

All fields are required.

Please make sure all fields are completed.

Please make sure you have filled out all fields

Please enter a valid e-mail address

Please enter a valid Phone Number

Ask your question to our analysts

Cancel

PS000076

All set! This article has been sent to my@email.address.

All fields are required. For multiple recipients, separate email addresses with a semicolon.

Please Note: Only individuals with an active subscription will be able to access the full article. All other readers will be directed to the abstract and would need to subscribe.

STABILITY PROGRAMS FOR BIOTECHNOLOGY PRODUCTS

News

Executive Summary

Pharma Can Pursue Claims Against Providers For 340B Duplicate Discounts In Medicaid Managed Care, HRSA Says

The Aspirin Test For AI?

Mary Beth Clarke, CDER Exec Programs Office Director And Generic User Fee Shepherd, Will Retire

Rx Advertising: Interchangeability Doesn’t Mean Clinical Superiority To Other Biosimilars, FDA Says

You must sign in to use this functionality

Authentication.SignIn.HeadSignInHeader

Register

Ask The Analyst

Email Article

STABILITY PROGRAMS FOR BIOTECHNOLOGY PRODUCTS

News

Executive Summary

Pharma Can Pursue Claims Against Providers For 340B Duplicate Discounts In Medicaid Managed Care, HRSA Says

The Aspirin Test For AI?

Mary Beth Clarke, CDER Exec Programs Office Director And Generic User Fee Shepherd, Will Retire

Rx Advertising: Interchangeability Doesn’t Mean Clinical Superiority To Other Biosimilars, FDA Says

You must sign in to use this functionality

Authentication.SignIn.HeadSignInHeader

Register

Ask The Analyst

Email Article

Sign In To Set a Search Alert