Overview #
pH is not just a stability parameter in acid toner formulation. It is the primary performance lever — and the single most common point of failure when brands move from concept to commercial scale. We’ve reviewed hundreds of briefs for exfoliating toners over the years, and the pattern is consistent: brands focus on the acid name on the label, and underestimate everything else. The free acid fraction, the buffer system, the preservative compatibility, the packaging interaction — these are where products succeed or fail. This guide walks through how we evaluate briefs and what thresholds we actually use when deciding whether a formulation is manufacturable at scale.
The Four Variables That Actually Drive Performance #
Most brands come to us with a concentration request. “We want 10% glycolic.” Fine. But concentration alone tells us almost nothing about what the consumer will experience or whether the product will pass stability.
The variable that matters most is free acid concentration — the fraction of acid that exists in the undissociated, biologically active form. At pH 3.5, glycolic acid (pKa 3.83) sits at roughly 68% free acid. Drop to pH 3.0 and you’re above 87%. Raise to pH 4.0 and you’re below 40%. That’s not a small difference. That’s the difference between a product that delivers measurable exfoliation and one that doesn’t.
Here’s the framework we use internally when evaluating any acid toner brief:
1. Total acid concentration — the labeled percentage, typically 5–15% for AHAs, 0.5–2% for BHA (salicylic acid)
2. pH — target range 3.2–4.5 for most AHA formulations; below 3.5 triggers EU regulatory review under EU Cosmetics Regulation 1223/2009
3. Free acid fraction — calculated from Henderson-Hasselbalch; this is what we optimize, not the label claim
4. Buffer capacity — how much the pH drifts under manufacturing stress, packaging interaction, and temperature cycling
The fourth one is where most lab-to-production failures happen. A formula buffered at 500g behaves differently at 200kg. We’ve seen pH drift of 0.3–0.4 units just from the stainless steel vessel interaction with low-pH systems during extended mixing. That’s enough to shift your free acid fraction by 15–20% and push you outside your stability specification before the product even reaches filling.
| Acid Type | Typical Use Concentration | Target pH Range | pKa | Primary Skin Target |
|---|---|---|---|---|
| Glycolic Acid (AHA) | 5–12% | 3.2–4.0 | 3.83 | Surface exfoliation, texture |
| Lactic Acid (AHA) | 5–10% | 3.5–4.5 | 3.86 | Exfoliation + hydration (NMF) |
| Salicylic Acid (BHA) | 0.5–2.0% | 3.0–4.0 | 2.97 | Pore-clearing, sebum regulation |
| Mandelic Acid (AHA) | 5–10% | 3.5–4.5 | 3.41 | Sensitive skin, slower penetration |
| Citric Acid (AHA blend) | 1–5% (adjunct) | 3.0–4.5 | 3.13 | pH adjustment, mild exfoliation |
For brands targeting the EU market, the SCCS Scientific Opinion on AHA safety is the reference document we work from. The SCCS position on glycolic acid sets a maximum of 10% at pH ≥ 3.5 for rinse-off, and recommends sun protection labeling for leave-on products. We build that into every EU-destined brief from day one — not as an afterthought.
For brands targeting the US market, the FDA Cosmetics Guidelines don’t set hard concentration limits for AHAs in the same way, but the OTC drug pathway kicks in for salicylic acid above 2% — a detail that catches a lot of brands off guard.
Where Most Brands Get the pH Wrong #
Drop below pH 3.5 and you’re in regulatory grey territory in the EU. Most brands don’t realize this until we tell them.
The instinct is to push pH lower for stronger performance. And yes, at pH 3.0 versus pH 3.8, you will see a measurable difference in exfoliation rate. But the trade-offs stack up fast. Preservative efficacy becomes unpredictable — most broad-spectrum systems are optimized for pH 4.0–6.0, and below 3.5 you’re relying heavily on the acid itself for antimicrobial activity. That works in theory. In practice, we’ve had gram-negative contamination appear at week 8 of preservative challenge testing on a pH 3.2 formula that looked clean at week 4. The preservative was technically present. It just wasn’t working.
Skin tolerance is the other side of this. A clinical study we reference frequently in our briefs: a split-face RCT (n=42, 12 weeks) comparing 8% glycolic acid at pH 3.5 versus pH 4.0 showed equivalent improvement in skin texture scores (both groups: ~28% improvement vs. baseline) but a statistically significant difference in transient erythema — 34% incidence at pH 3.5 versus 18% at pH 4.0. The performance was essentially the same. The irritation profile was not. For most retail brands, that’s the right trade-off to make.
Honestly, most brands underestimate how fragile low-pH preservative systems become at production scale. A lot of clean beauty brands in particular push for paraben-free, phenoxyethanol-free systems at pH 3.2 and then wonder why their stability results are inconsistent. The chemistry doesn’t care about the marketing brief.
For our internal formulation approach to acid-based actives, see our acid exfoliation technology documentation.
Skin Tolerance Protocol: What We Actually Test #
This is usually where projects go sideways. Brands want a tolerance claim on pack — “suitable for sensitive skin” or “dermatologist tested” — without fully understanding what the testing protocol requires or what it costs.
Our standard tolerance protocol for acid toners runs in three tiers:
Tier 1 — In-house HRIPT screening: 50-subject repeat insult patch test, 21-day protocol. Minimum requirement for any acid toner we manufacture. Catches gross sensitization issues before external testing.
Tier 2 — Third-party clinical tolerance: 30-subject minimum, dermatologist-supervised, 4-week use study. Required for any “sensitive skin” claim. Adds 6–8 weeks to timeline and roughly $4,000–$8,000 USD depending on the CRO.
Tier 3 — Full clinical efficacy: 40+ subjects, 8–12 week duration, instrumental measurement (Corneometer, Tewameter, Visioscan). Required for any efficacy claim beyond “exfoliates.” Most indie brands skip this. We don’t recommend it, but we understand the budget reality.
The tolerance protocol also informs our formulation decisions. If a brand is targeting a sensitive skin positioning, we’ll typically recommend mandelic acid over glycolic — the larger molecular size slows penetration rate and the clinical tolerance data is more favorable. We’ll also buffer more conservatively, targeting pH 4.0–4.5 rather than 3.5–3.8, and we’ll include a barrier-support component: ceramide NP at 0.1–0.3%, or panthenol at 1–2%.
One thing we’ve learned: the packaging interaction with low-pH formulas is underestimated at brief stage. We rejected one packaging vendor’s PET bottle after pH drift testing showed a 0.25-unit increase over 12 weeks at 40°C. That doesn’t sound like much. But it moved a pH 3.5 formula to pH 3.75, which shifted the free acid fraction enough to affect the exfoliation claim. We now require all packaging vendors to submit pH compatibility data for any vessel contacting a formula below pH 4.5.
For brands developing sensitive-skin positioned products, our barrier repair and sensitive skin formulation resources cover the complementary actives we typically pair with acid systems.
The Scale-Up Failure Nobody Talks About #
Worked fine at 500g lab scale. At 180kg production, the pH was reading 0.35 units higher than target by the end of the mixing cycle. The formula used a citrate-phosphate buffer system that was sensitive to the temperature rise from the high-shear mixer. We hadn’t seen that at lab scale because the thermal mass is completely different.
We caught it before filling. But it cost us a batch hold and a reformulation cycle. The fix was straightforward — we adjusted the buffer ratio and added a post-mix pH correction step — but it added two weeks to the timeline and the brand had already committed to a launch date.
This is not unusual. It’s actually one of the more common failure modes we see with acid toners specifically, because the performance window is narrow. A moisturizer at pH 5.5 can tolerate 0.3 units of drift without anyone noticing. An acid toner at pH 3.5 cannot.
The other scale-up issue we see regularly: fragrance and botanical extract interactions. A lot of brands want to add botanical extracts to their acid toners — “clean” positioning, ingredient story on pack. Some of those extracts contain buffering compounds that interact with the acid system. We’ve seen lemon extract push pH up by 0.2 units at 1% inclusion. Green tea extract at 2% can do the same. We now run pH interaction screening on every botanical before it goes into a low-pH formula. It adds a week to development. It’s worth it.
Formulation Notes for Brand Partners #
What market? What are you expecting on-pack? Those are the first two questions we ask when an acid toner brief comes in — because the answers determine almost everything about the formulation approach.
If you’re targeting the EU, we’re working within the SCCS AHA framework from day one: maximum 10% glycolic at pH ≥ 3.5, mandatory sun protection labeling. If you’re targeting the US with a salicylic acid formula above 1%, we need to discuss the OTC drug pathway early — that’s a different regulatory track entirely. If you’re targeting markets regulated under NMPA Cosmetic Regulation, the registration timeline for certain acid concentrations adds 6–12 months to your launch plan.
On the commercial side: airless pump packaging for acid toners adds $0.40–$0.80 per unit at MOQ 3,000. Most brands want it for the premium positioning. It’s also genuinely better for pH stability — less oxygen headspace, less packaging interaction. Whether that’s worth the COGS impact depends on your retail price point.
For your brief, include: target market and regulatory pathway, desired acid type and concentration, pH preference or performance target, skin type positioning (sensitive vs. normal/oily), packaging format preference, any “free from” requirements that affect preservative selection, and your stability requirement (12-month vs. 24-month shelf life). The more specific you are upfront, the faster we can tell you what’s manufacturable.
What to include in your brief — checklist:
- Target market(s) and applicable regulatory framework
- Acid type, concentration, and pH target (or performance outcome if you’re unsure of the numbers)
- Skin type positioning and any tolerance/clinical claim requirements
- Packaging format and material (affects pH compatibility testing)
- “Free from” list (preservatives, fragrance, etc.) — be specific, not just “clean”
- Shelf life requirement (12 or 24 months, and storage conditions)
- Launch timeline — because it determines whether third-party clinical testing is feasible
Frequently Asked Questions #
Q: We want to call it “10% glycolic acid” on pack — is that actually stable at that concentration?
Yes, 10% glycolic is stable — but only if the pH is controlled tightly. We target pH 3.5–3.8 for a 10% glycolic formula, and we run 12-week accelerated stability at 40°C/75% RH as a minimum. The label claim is achievable; the question is whether your packaging can hold pH within ±0.2 units over 24 months. That’s what we test first.
Q: Can we combine AHA and BHA in the same toner?
We do it regularly. A common combination is 5% lactic acid plus 1% salicylic acid at pH 3.5–4.0. The challenge is that salicylic acid has limited water solubility — maximum around 0.2% in water alone — so you need a co-solvent system (typically 5–10% ethanol or propylene glycol) to keep it in solution. Short answer: it’s formulation-solvable, but don’t try to combine these two without addressing the solubility issue first.
Q: Our brand is “clean” — can we preserve a pH 3.2 acid toner without phenoxyethanol?
Honestly, this is one of the harder briefs we get. At pH 3.2, the acid itself provides significant antimicrobial activity, so you can sometimes pass challenge testing with a minimal preservative system — ethylhexylglycerin at 0.3–0.5% plus the inherent acid activity. But we’ve had this fail at week 8 of PCT more than once. We’re still not fully convinced the data is consistent enough to recommend it broadly. If your “clean” list excludes phenoxyethanol, we need to discuss the preservative options early and build in extra stability testing time.
Q: How long does development take for a custom acid toner?
For a straightforward formula — single AHA, standard preservative system, no clinical claims — we typically run 8–10 weeks from brief to stability-confirmed pilot batch. Add 6–8 weeks if you need third-party clinical tolerance testing. Add another 4–6 weeks if you’re targeting EU registration with a pH below 3.5. The timeline compounds quickly once you add claims.
Q: What’s the minimum order quantity for a custom acid toner, and does it affect formulation choices?
Our standard MOQ is 1,000 units for a custom formula. At that volume, encapsulated acid systems (which offer better stability and gentler delivery) roughly triple the raw material cost — so for most indie brands at MOQ 1,000, we recommend a well-buffered conventional system over encapsulation. At MOQ 5,000+, the encapsulation cost per unit becomes more defensible if the brand story supports it.
Have a product concept in mind? Contact our formulation team to request a complimentary brief review.
© 2026 Mastracare.com. All rights reserved.
Unauthorized reproduction or distribution is prohibited.