Overview #
Regulatory classification is not a paperwork problem. It is a product architecture decision that determines your formula, your claims, your packaging, and your market entry timeline before a single gram of raw material is ordered. For toners and essence waters specifically, the gap between “cosmetic” and “quasi-drug” or “drug-cosmetic” status varies so dramatically by market that we routinely see brand partners arrive with a single global brief and leave with three different product architectures. That is not inefficiency — that is the reality of selling the same consumer benefit across the EU, US, Japan, and China simultaneously.
How Classification Actually Works Across Key Markets #
The core question every brand partner needs to answer first: what claim are you making, and in which market? That single decision cascades into everything else.
In the EU, the framework is binary. Under EU Cosmetics Regulation 1223/2009, a toner is a cosmetic if it “cleanses, perfumes, changes appearance, or keeps in good condition.” The moment you claim to treat acne, reduce sebum secretion by a measurable percentage, or alter skin cell turnover, you have crossed into medicinal product territory. There is no middle ground. We have had briefs come in claiming “clinically reduces breakouts by 40%” — that is a medicinal claim in the EU, full stop. The brand had to choose: soften the claim or register a medicinal product. Most soften the claim.
Japan operates differently. The quasi-drug (医薬部外品) category sits between cosmetic and pharmaceutical, and it is genuinely useful for toner brands. Approved active ingredients — niacinamide up to 3%, tranexamic acid at 2%, dipotassium glycyrrhizate at 0.1–0.3% — can be used with specific efficacy claims under quasi-drug status. The approval process takes roughly 6–12 months and requires submission to the PMDA with stability data, safety data, and efficacy substantiation. Expensive, but it unlocks claims that cosmetic status simply cannot support.
China’s NMPA framework under the NMPA Cosmetic Regulation introduced a two-tier system in 2021: general cosmetics and special-use cosmetics. Whitening, anti-hair loss, sunscreen, and freckle-removal claims all require special-use registration. For toners, this matters enormously. A brightening toner with tranexamic acid making a “whitening” claim needs special-use registration — a process that typically runs 12–18 months and costs significantly more than general cosmetic filing. We now require brand partners to confirm their China claim strategy before we finalize the formula, because changing the active concentration after registration starts over.
The US FDA sits at the other end of the spectrum. Under FDA Cosmetics Guidelines, OTC drug status applies to products with specific active ingredients at defined concentrations — salicylic acid at 0.5–2% for acne, for example. A toner with 1.5% salicylic acid making an acne claim is an OTC drug. It needs an OTC monograph-compliant label, specific manufacturing controls, and drug facility registration. Most indie brands do not realize this until we flag it.
| Market | Classification System | Key Threshold for Toners | Typical Timeline |
|---|---|---|---|
| EU | Cosmetic / Medicinal Product | Claim type determines classification | 3–6 months (cosmetic) |
| Japan | Cosmetic / Quasi-Drug / Pharmaceutical | Active ingredient + approved claim | 6–12 months (quasi-drug) |
| China (NMPA) | General / Special-Use Cosmetic | Whitening, sunscreen, freckle claims | 12–18 months (special-use) |
| USA (FDA) | Cosmetic / OTC Drug | Active ingredient identity + claim | 3–6 months (cosmetic) |
| South Korea | Cosmetic / Functional Cosmetic | Whitening, wrinkle, UV claims | 4–8 months (functional) |
South Korea’s functional cosmetic category is worth noting separately. It is arguably the most commercially mature quasi-drug-equivalent system in Asia. Brands selling into Korea with brightening or anti-wrinkle toners need functional cosmetic registration, and the MFDS has a positive list of approved actives with defined concentration ranges. Niacinamide at 2–5% for whitening is on that list. Adenosine at 0.04% for anti-wrinkle is on that list. If your active is not on the list, you are filing as a general cosmetic with no efficacy claim.
Instrumental Measurement and Consumer Panel Design #
This is where most brand partners underestimate the work involved. Regulatory bodies increasingly expect substantiation — not just safety data, but efficacy data that supports the claim. For toners and essences, the measurement methods matter as much as the results.
For hydration claims, we use corneometry (Corneometer CM 825) as the primary instrument. Typical protocol: 20-minute acclimatization at 21°C, 50% RH, three measurements per site, averaged. A well-formulated hyaluronic acid toner at 1% high-MW + 0.5% low-MW HA should show a 25–35% increase in corneometer units at 2 hours post-application versus untreated control. If we are not hitting at least 20% improvement, we go back to the formula.
Transepidermal water loss (TEWL) via Tewameter TM 300 is the barrier function marker. For a barrier-repair toner brief, we want to see TEWL reduction of at least 15% versus baseline at 4 weeks. We have run studies where the formula looked excellent at 2 weeks and then the TEWL improvement plateaued — that usually points to an occlusive level that is too low for the skin type in the study population.
Skin tone and brightening claims require chromametry (Minolta CR-400) measuring L value, or spectrophotometry. A 1-point increase in L is roughly the threshold for a perceptible brightening effect in most consumer panels. We have seen tranexamic acid at 2% in a toner base achieve L* increases of 1.8–2.3 points over 8 weeks in Asian skin phototypes III–IV. That is a real, substantiatable result.
For consumer panel design, the minimum credible study for a cosmetic efficacy claim is 30 subjects, 4 weeks, split-face or controlled use design. Honestly, most brands underestimate how much the panel design affects the result. A poorly controlled study with no washout period, no standardized application protocol, and self-reported outcomes is not going to hold up to regulatory scrutiny in the EU or Japan.
One clinical study we reference internally for toner efficacy substantiation: a double-blind, randomized, vehicle-controlled trial (n=42, 12 weeks) evaluating a niacinamide-containing toner at 4% versus vehicle in subjects with mild hyperpigmentation (Fitzpatrick III–V). The niacinamide group showed a 27% reduction in melanin index (Mexameter MX 18) versus 6% in the vehicle group at week 12. Statistically significant at p<0.01. That study design — randomized, vehicle-controlled, instrumental primary endpoint, 12 weeks — is the template we recommend for any brand making a brightening claim targeting the Japanese quasi-drug or Korean functional cosmetic pathway.
Before/after photography protocol is often treated as an afterthought. It should not be. Standardized conditions mean: same camera, same focal length, same lighting rig (cross-polarized and parallel-polarized), same subject positioning, same time of day. We rejected the first photography vendor on one project because they could not guarantee consistent cross-polarized lighting between sessions. The images were unusable for regulatory submission. Consistent photography is not about aesthetics — it is about defensible documentation.
Where Most Brands Get This Wrong #
The most common mistake is designing the formula first and the claim second. We see this constantly. A brand partner comes in with a formula they love — maybe it has 3% niacinamide, 2% tranexamic acid, 0.5% salicylic acid — and they want to sell it in the EU, Japan, and China simultaneously with claims about brightening, pore refinement, and acne control. That is three different regulatory pathways, potentially two different product registrations in China alone, and a quasi-drug application in Japan.
The salicylic acid at 0.5% is fine as a cosmetic in the EU for skin conditioning. But if the brand wants to claim “reduces acne” in the US, that 0.5% puts them in OTC drug territory. Drop the acne claim, keep the “clarifying” language, and you stay cosmetic. Most brands do not realize this until we tell them.
Another pattern we see: brands targeting Japan with a whitening claim but using an active that is not on the PMDA quasi-drug approved list. Alpha-arbutin, for example, is widely used in cosmetics globally, but it is not an approved quasi-drug whitening active in Japan. Kojic acid dipalmitate is not on the list either. If you want a quasi-drug whitening claim in Japan, you are working with tranexamic acid at 2%, L-ascorbic acid at a specific concentration, or a handful of other approved actives. The formula has to be built around the regulatory pathway, not the other way around.
The SCCS Scientific Opinion database is something we check routinely for EU-bound formulas. SCCS opinions on specific actives — kojic acid, hydroquinone, certain AHAs — directly inform what is permissible in cosmetics versus what triggers medicinal classification. Hydroquinone above 0.5% is banned in EU cosmetics entirely. We still get briefs requesting it.
One more thing that catches brands off guard: the EU’s ongoing restriction reviews under Annex II and Annex III are quietly reshaping what is possible in cosmetic toners. Several AHA concentrations are under review for potential reclassification. This is still evolving — what is acceptable today may shift within the next 18–24 months. We flag this to every brand partner working on acid-based toners for EU distribution. Our acid exfoliation technology documentation covers the current permissible ranges and what we expect to change.
Designing a 12-Week Efficacy Study for Toners and Essences #
If you are targeting quasi-drug status in Japan, functional cosmetic registration in Korea, or simply want robust substantiation for a premium cosmetic claim, a 12-week study is the standard. Here is how we structure it.
Study population: 40–50 subjects minimum. Dropout rates in 12-week studies typically run 10–15%, so you need to start with enough subjects to hit your target n at completion. Specify Fitzpatrick phototype, age range (usually 25–55 for anti-aging or brightening studies), and exclusion criteria — no active skin conditions, no competing topical treatments, 2-week washout from any actives.
Design: Randomized, double-blind, vehicle-controlled is the gold standard. Split-face design works for toners because application is bilateral and the vehicle can be applied to the contralateral side. This controls for systemic and environmental variables. If split-face is not appropriate for the claim (e.g., a full-face brightening study), use a parallel-group design with matched controls.
Measurement schedule: Baseline, week 4, week 8, week 12. Minimum instruments: corneometer (hydration), Tewameter (barrier), Mexameter (melanin and erythema index), chromameter (L* value). Photography at each timepoint under standardized conditions. Consumer self-assessment questionnaire at week 4 and week 12 — keep it to 8–10 questions maximum, Likert scale, validated translation if running in multiple markets.
Primary endpoint: Define it before the study starts. For a brightening toner, it is melanin index reduction or L* increase. For a hydrating toner, it is corneometer improvement at 2 hours and 8 hours post-application. Regulatory reviewers in Japan and Korea will ask what the primary endpoint was and whether the study was powered to detect it. A study that was not pre-specified is worth significantly less.
Statistical analysis: ANOVA with post-hoc testing for multi-timepoint comparisons. Report both statistical significance (p-value) and effect size. A result that is statistically significant but clinically trivial — say, a 3% improvement in corneometer units — is not going to support a strong claim.
Photography protocol specifics: Cross-polarized images for surface texture and pigmentation. Parallel-polarized for subsurface features. Consistent 1:1 macro ratio. We use a standardized chin rest and forehead positioning bar to eliminate head tilt variation between sessions. One pilot batch of photography failed on a previous project because the lighting rig was moved between sessions — the images showed apparent improvement that was actually a lighting artifact. We now require the photography setup to be locked and documented before the study starts.
The ICH Stability Guidelines are relevant here too — not for the clinical study itself, but because your study samples need to be from a batch that meets stability specifications. Running a 12-week clinical study with samples that fail stability at week 10 is a problem. We always run accelerated stability in parallel with clinical studies. For more on how we approach evidence-based toner and essence development, see our toner and essence water formulation resources.
Formulation Notes for Brand Partners #
What market? What are you expecting on-pack? Those are the first two questions we ask on every toner brief, and the answers determine everything that follows.
If you are targeting Japan with a quasi-drug whitening claim, we are building around tranexamic acid at 2% in a pH 5.5–6.5 base, and we are starting the PMDA submission process before the formula is finalized — not after. If you are targeting the EU as a cosmetic with a “radiance-boosting” claim, we have more flexibility on actives but we are staying well clear of any language that implies a physiological change. If you are targeting the US with a salicylic acid toner, we are registering as an OTC drug manufacturer and your label is going through our regulatory team before it goes to print.
The formulation itself is usually the easier part. A 1% niacinamide + 0.5% low-MW hyaluronic acid + panthenol toner base is stable, effective, and cosmetic-status in every major market. The complexity comes from stacking actives and claims. Three out of five brand partners who come to us with a multi-active toner brief end up simplifying the formula after we walk through the regulatory implications. That is not a failure — that is good product strategy.
MOQ and cost also matter here. A quasi-drug registration in Japan adds roughly $8,000–$15,000 in regulatory costs before production starts. That cost needs to be amortized across your production volume. At MOQ 3,000 units, that is $2.67–$5.00 per unit in regulatory overhead alone. Most indie brands have not factored that in.
Frequently Asked Questions #
Q: We want to sell our niacinamide toner in Japan with a “whitening” claim — do we need quasi-drug status?
Yes, if you are making a whitening claim in Japan, quasi-drug registration is required. Niacinamide is on the PMDA approved list at 2–5% for quasi-drug whitening. Budget 6–12 months for the approval process and confirm your concentration is within the approved range before finalizing the formula.
Q: Our toner has 1% salicylic acid — is it a cosmetic or a drug in the US?
At 1% with a “clarifying” or “pore-refining” claim, it can stay cosmetic. The moment you add an acne claim, the FDA OTC monograph kicks in and you are in drug territory. The active concentration (0.5–2%) is not the trigger — the claim is. We see brands get this wrong constantly.
Q: How many subjects do we need for a study that will support a regulatory submission in Korea?
For Korean functional cosmetic registration, the MFDS expects a minimum of 30 subjects completing the study, with a 12-week duration for anti-wrinkle or whitening claims. We recommend starting with 40–45 subjects to account for a 15% dropout rate. Primary endpoint must be instrumental, not self-reported.
Q: Can we use the same formula for EU and China if we just change the label?
Sometimes, but not always. If your formula contains a whitening active at a concentration that triggers special-use registration in China, you cannot simply relabel it as a general cosmetic. The formula itself may need adjustment — lower the active concentration below the threshold, or file for special-use registration. We map this out in the brief intake before any formulation work starts.
Q: What is the minimum study design that will hold up for a premium cosmetic efficacy claim in the EU?
For a cosmetic claim in the EU, there is no mandatory study design — but if you are making a specific efficacy claim (e.g., “increases skin hydration by 30% in 4 weeks”), you need substantiation that supports it. A minimum credible study is 30 subjects, 4 weeks, instrumental primary endpoint (corneometer for hydration), randomized and controlled. Anything less and you are exposed if a competitor or regulator challenges the claim.
Have a product concept in mind? Contact our formulation team to request a complimentary brief review.
© 2026 Mastracare.com. All rights reserved.
Unauthorized reproduction or distribution is prohibited.