TL;DR: When a brief arrives in our inbox that just says “acne serum, 2% salicylic acid, clean formula, under $5 landed cost” — we know we’re going to need at least two more rounds of back-and-forth before we can touch the bench
TL;DR: A 2% salicylic acid serum sold in the US is classified as an OTC drug under the [FDA Cosmetics Guidelines](https://www.fda.gov/cosmetics) and triggers a completely different qualification process than the same formula positioned as a cosmetic exfoliant in the EU
Key Technical Parameters #
Getting from “I have a product idea” to an approved lab sample is where most first-time brand founders lose weeks — sometimes months — because the brief they hand to an OEM is missing the details that actually drive formulation decisions. This guide is written for brand founders who are ready to develop an acne or blemish control product but have never formally engaged a contract manufacturer before. The focus here isn’t the science of salicylic acid or niacinamide — those are covered elsewhere. This is operational: how to write a brief that gets you a useful first sample, how to evaluate what you receive, and how to move from sample approval to a first production batch without restarting the process three times.
What a Useful Formulation Brief Actually Contains #
When a brief arrives in our inbox that just says “acne serum, 2% salicylic acid, clean formula, under $5 landed cost” — we know we’re going to need at least two more rounds of back-and-forth before we can touch the bench. That’s not a complaint. It’s just the reality of what’s missing.
A workable brief for an acne & blemish control product covers six things.
Target market and regulatory pathway. This changes almost everything. A 2% salicylic acid serum sold in the US is classified as an OTC drug under the FDA Cosmetics Guidelines and triggers a completely different qualification process than the same formula positioned as a cosmetic exfoliant in the EU. We ask this in the first five minutes of every call. Skip it in your brief and we’ll be guessing.
Claimed benefits vs. on-pack claims. There’s a difference between what the formula does and what you’re allowed to print on the box. “Clears acne” is a drug claim in the US. “Visibly reduces the look of blemishes” is not. You don’t need to have this fully resolved before you approach us, but knowing roughly where you want to land changes which actives we’ll propose and at what concentrations.
Texture reference and format. “Serum” covers a huge range. Watery essence, 30cP gel, thick-but-clear fluid — these are different starting points with different stabilization demands. The most useful thing you can send is a benchmark product: something already on the market that’s close to the texture and finish you want. Not to copy it. To calibrate. We use it to set viscosity targets and negotiate packaging before the first batch.
Ingredient inclusions and exclusions. This is where clean beauty brands create the most complexity, and honestly that’s fine — we just need to know upfront. A “no fragrance, no essential oils, no PEG” list is easy to work with. What makes a brief harder is when exclusions arrive after the first sample, because by then we’ve already built the emulsion base around one of those ingredients. List your no-go ingredients before we start, not after.
Packaging concept or constraints. If you’ve already committed to airless pump bottles with a specific neck finish, tell us. At acne-active concentrations — salicylic acid especially — aluminium and certain zinc alloys can cause discolouration under accelerated conditions. We flag this in what we call our P-04 packaging compatibility screen, which we run on every new formula before we issue a stability protocol. Some brands come to us having already ordered packaging. That’s a harder situation to be in.
Regulatory target markets. List every country you plan to sell in at launch, plus your 12-month expansion markets. The EU, Australia, and South Korea each have their own notification or registration requirements for cosmetics with active ingredients. A formula that’s clean for one market sometimes requires reformulation for another, and it’s much easier to build that in at the start than to split SKUs later.
The Parameters That Predict Whether Your First Sample Will Be Useful #
Most samples that miss the mark on first round fail for one of four reasons: wrong texture, pH mismatch relative to claim, fragrance or sensorial expectation not communicated, or active concentration set too high for the base to stabilize.
On pH: for salicylic acid to function as a keratolytic, it needs to be below pH 4.0 — ideally in the 3.2–3.8 range. Above pH 4.5, most of the active is in its ionized, membrane-impermeable form and the exfoliation claim weakens considerably. We sometimes get briefs asking for “gentle, non-irritating BHA serum” but also “visible pore-clearing effect in 4 weeks.” Those two asks pull in opposite directions at the formulation level. At pH 3.5, you get efficacy. You also get the tingle that some consumers interpret as irritation. There isn’t a way around this — it’s not a formulation problem, it’s a claims-positioning problem. We push back on this brief almost every time.
For niacinamide at concentrations above 4%, we run compatibility screens against every other active in the formula before the first batch. Niacinamide and certain acidic actives — including low-pH vitamin C forms — generate niacin as a degradation product in the presence of heat, which causes flushing in a small but measurable percentage of users. This isn’t hypothetical; in our 2023 stability review of 11 multi-active acne formulas, 3 of them showed niacin accumulation above 0.1% after 8 weeks at 40°C. We caught it before those formulas shipped. That kind of compatibility check doesn’t happen if you don’t tell us about all the actives upfront.
A brief comparison of sample request types — and what to expect from each:
| Request Type | Typical Sample Volume | Timeline | What You Can Evaluate |
|---|---|---|---|
| Concept sample (unoptimized base) | 50–100g | 2–3 weeks | Texture, pH, initial compatibility |
| Stability-initiated formula sample | 100–200g | 4–6 weeks | Stability markers, sensory profile, packaging fit |
| Pre-production pilot batch | 5–10kg | 8–12 weeks | Reproducibility, fill performance, final sensory |
| First production batch (post-approval) | MOQ varies (typically 500–1,000kg) | 10–14 weeks from approval | Finished goods QC, shelf life initiation |
The variable most brands underestimate is the gap between concept sample and stability-initiated sample. That 4–6 week window is when we learn whether your formula is actually stable or just looks stable at room temperature. We’ve had formulas that appeared visually perfect at week 2, then separated at week 6 under 40°C/75% RH cycling. The emulsion base had been responding to the benzoyl peroxide peroxide loading in ways that weren’t visible until thermal stress was applied. One specific case from 2022 involved a 2.5% BPO gel system — phase separation occurred between weeks 5 and 6 in three of four test packaging formats. That round cost an extra four weeks. The brand founder had not included that timeline buffer in their launch plan.
Efficacy proxies are often what brands want to evaluate from a first sample, and we understand the pressure. A 2019 split-face clinical trial (n=44, 12 weeks, published in the Journal of Cosmetic Dermatology) showed that a 2% salicylic acid leave-on treatment reduced non-inflammatory lesion count by 37% vs. vehicle control — but the results only emerged clearly after week 6. Lab samples submitted at week 2 won’t show you that signal. What you can evaluate early: sebum absorption rate (blotting paper method), immediate pore appearance under magnification, sensory tolerance on inner forearm. Real-world efficacy evidence takes time. Don’t let a week-2 consumer test drive your formula decision.
Decision Framework: From Sample Approval to First Production Batch #
This is where process diverges most between brands, and where delays are usually self-inflicted.
If your sample has passed visual and sensory review and you have stability data at 8 weeks (40°C, 75% RH) showing no phase separation, no colour shift beyond ΔE 2.0, and no pH drift greater than 0.3 units — that’s a reasonable threshold to approve for production. Not every brand waits for full 12-week data before placing a production order. That’s a business risk decision, not a formulation one.
If your formula contains a regulated OTC active and you’re targeting the US market, the timeline changes significantly. Under the FDA Cosmetics Guidelines, OTC drug products require manufacturing in an FDA-registered facility, and the associated documentation package — including finished product testing, identity testing on actives, and stability data — adds weeks to the qualification timeline. We’re clear about this in every kickoff call, but it still surprises brands who didn’t realize they were in OTC territory.
If you’re launching in the EU and your formula contains zinc PCA above certain thresholds or uses any ingredient approaching a EU Cosmetics Regulation 1223/2009 Annex III restricted concentration limit, we initiate a safety assessment early — before production, not after. The CPSR (Cosmetic Product Safety Report) is a formal requirement, and the qualified person who signs it needs the stability data and exposure calculations before they can issue it. Don’t treat the safety assessment as an afterthought at the end of the development cycle.
If you’re in a time-constrained launch window and cannot wait for full stability data: the minimum we’ll sign off on is 4 weeks accelerated at 40°C with visual, pH, and viscosity checks passing. We note the limitation on the tech sheet. Some markets accept this. Others don’t. The NMPA Cosmetic Regulation in China, for instance, requires a more extensive stability dossier for registration-required categories — which includes most anti-acne products marketed with specific function claims.
Development fees and per-unit cost are structured differently depending on the project type:
- Custom formulation development: we typically charge a development fee in the range of USD 300–800 depending on formula complexity and number of active systems involved. This is credited against the first production order in most cases.
- Per-unit cost for finished goods in acne serum format (30ml, airless pump, standard actives) runs roughly in the range of USD 1.80–3.50 at 1,000-unit MOQ, with meaningful cost reduction at 3,000 units and above.
- Stability testing as a standalone service (if you need a documented stability package for regulatory submission) is quoted separately.
One thing we’ve started recommending more consistently: don’t approach the development fee as a price to minimize. The brands that negotiate development fees down to zero by using a “stock base with minimal adjustment” approach often end up paying for it in reformulation rounds later. A formula built to your brief from a clean starting point is harder to mess up at scale than a modified stock base that was never designed for your active load.
For acid exfoliation technology formats specifically — toners, serums, pads — we’ve found the most common place projects stall is the transition from approved lab sample to production batch, because the mixing order and shear conditions during scale-up affect viscosity in ways that a 200g lab sample doesn’t predict reliably. We document this in our batch protocol as a scale-up sensitivity flag. Not every ODM factory does. Worth asking about.
Formulation Notes for Brand Partners #
When you brief us on an acne or blemish control product, the first questions we ask are: what market, what format, and what’s the on-pack story? Those three things determine whether we’re building a cosmetic or an OTC drug, whether we’re working with acid actives that need pH-controlled bases, and whether the packaging you’ve fallen in love with is actually compatible with your formula.
The most common mistake we see is brands arriving with a full ingredient list — actives, texture agents, preservative, fragrance — and asking us to “just make it.” The problem is that list was usually assembled from other products’ INCI labels and doesn’t account for the interactions between those ingredients at the concentrations needed to perform. Specifically: acid actives and many commonly requested “clean” preservative systems are incompatible below pH 4.0. We reframe this early: we ask what you want the formula to do, and then we build the ingredient architecture around that, not the other way around.
Timeline: lab samples in 2–3 weeks from brief sign-off, accelerated stability initiated at the same time as sample dispatch (4–8 weeks to first stability read), 24-month real-time stability initiated concurrently. Production scheduling begins after formula approval, not before. First production batch typically 10–14 weeks from formula sign-off, depending on raw material lead times and packaging procurement.
Frequently Asked Questions #
We have a formula we developed with another lab — can you just manufacture it for me?
A: Yes, but we run it through our incoming formula review first, which covers active concentrations, preservative efficacy, and packaging compatibility. If the formula passes without changes, manufacturing can start relatively quickly. If it needs adjustment, that triggers a formulation fee and restarts the stability clock.
What’s your MOQ for samples, and do we pay for them?
A: Concept samples are 50–100g and typically covered under the development fee. Stability-initiated samples are 100–200g and may be charged separately if they require dedicated raw material procurement. Production MOQ starts at 500 units for most liquid formats, though some packaging formats require 1,000 units as a minimum fill run. We confirm this at brief intake, not at production scheduling.
We want to use benzoyl peroxide in a spot treatment — how does that change the timeline?
A: BPO is an OTC drug active in the US, which means the finished product requires OTC drug manufacturing protocols, active identity testing, and a more detailed stability package than a standard cosmetic. Under the FDA Cosmetics Guidelines, that adds roughly 4–6 weeks to the qualification timeline. Also, BPO is a strong oxidizer — packaging compatibility testing is mandatory, not optional. We’ve had BPO at 2.5% cause visible discolouration in polypropylene closures within 10 weeks. Packaging selection matters a lot here.
At what point can we lock in our packaging and start ordering it?
A: After stability-initiated sample is issued and initial compatibility results are back — roughly week 4–5 of the stability run. Ordering packaging before that point is a risk. If the formula requires pH adjustment after early stability data, the new pH can affect compatibility with certain coatings or metal components. We’ve seen projects where packaging was ordered at week 2 and had to be replaced after week-8 stability failure. The cost of waiting is usually smaller than the cost of replacing pre-ordered stock.
What’s something we should be asking about that we probably aren’t?
A: Scale-up behaviour of viscosity. Lab samples are made at 200g–500g under controlled shear conditions. A 500kg production batch uses different mixing equipment, different shear rates, and longer thermal exposure during processing. Gel and lotion formats for acne serums — especially those using carbomer or hydroxyethylcellulose at higher loadings — frequently come out slightly thinner or slightly thicker at production scale than the approved lab sample. We document the acceptable viscosity range in the formula spec before production, and we flag this to every new brand partner. Ask your OEM how they manage this transition, and ask to see the production-scale batch protocol alongside the lab sample tech sheet.
Have a product concept in mind? Contact our formulation team to request a complimentary brief review.
Challenge testing timeline is the one that always catches new clients off guard — we had a brand push back on the 28-day preservative efficacy test (USP 51 criteria B) because they’d already locked in a retailer launch window, and there’s just no way to compress that without invalidating the data.
The US OTC drug pathway was the one that blindsided us — same 2% salicylic acid formula we’d already sampled for EU launch, but the moment we flipped the target market to the US, we were suddenly looking at monograph compliance, labeling rewrites, and a completely different MOQ conversation with our manufacturer.
The China NMPA notification requirement for “special cosmetics” catches people off guard too — niacinamide at concentrations above 5% used in combination with salicylic acid can trigger a special cosmetic classification (祛痘类) that adds 6-9 months to registration timeline. We’ve had SEA clients pivot their entire China launch sequencing because of it, pushing a Tmall debut by nearly a year while the domestic-only reformulation cleared review.
Worth flagging for anyone targeting Southeast Asia specifically: Indonesia’s BPOM requires salicylic acid above 0.5% to be registered as an obat (drug product) rather than a cosmetic, which means clinical safety data, a local sponsor entity, and registration timelines that can run 12-18 months. We’ve had clients assume the ASEAN Cosmetic Directive harmonization covered them and then hit that wall hard mid-development.
Salicylic acid solubility is the thing that actually eats weeks in development — we’ve had batches where the SA looked fully dissolved at 45°C during manufacturing, then crashed out of solution at room temperature storage within the first 30 days of stability, and the pH drift that followed invalidated the entire batch. We now require our Incheon-based supplier to provide particle size distribution data alongside standard CoA, because purity on paper genuinely doesn’t tell you enough about how it’ll behave in a water-dominant base.
The pre-production pilot batch timeline in that table is optimistic if your formula has any actives requiring dissolution heat — our 1% azelaic acid + niacinamide gel went 14 weeks from stability-initiated sample to pilot sign-off because two of those weeks were just waiting on pH drift data at the 40°C/75% RH checkpoint.