Overview #
Sensitive skin claims are not soft claims. In the EU, the US, and China, regulators are increasingly scrutinizing what brands put on pack — and the evidence required to back it up is getting harder to ignore. We see this every quarter in the briefs that come through our lab: a brand wants “clinically tested for sensitive skin” on the front panel, but hasn’t thought through what that actually requires from a study design, an instrumental measurement, or a regulatory documentation standpoint. The gap between what marketing wants and what the dossier needs to contain is where most projects stall.
What Each Regulatory Framework Actually Demands #
The three frameworks — EU Cosmetics Regulation 1223/2009, FDA Cosmetics Guidelines, and NMPA Cosmetic Regulation — treat sensitive skin claims very differently, and conflating them is a fast way to create compliance problems.
Under EU 1223/2009, “suitable for sensitive skin” is a cosmetic claim that must be substantiated under the EU Claims Regulation (655/2013). That means documented evidence — either consumer perception data, instrumental data, or both — held in the Product Information File. The SCCS Scientific Opinion framework also applies when you’re working with actives that have a dermal sensitization profile. We’ve had projects where a botanical extract with a known sensitization rate above 1% in patch testing forced a full reformulation because the brand wanted to keep the EU market open.
FDA takes a different angle. Sensitive skin products in the US are cosmetics unless they carry a drug claim — “reduces redness caused by rosacea” crosses that line immediately. The practical implication: your claim language controls your regulatory pathway. We almost always push back when a brand brief says “calms inflammation” because that’s OTC drug territory in the US, not cosmetic territory. The safer language is “helps skin look less red” or “formulated for reactive skin types.”
NMPA is the most documentation-intensive of the three. Since the 2021 Cosmetic Supervision and Administration Regulation overhaul, efficacy claims in China require substantiation filed with the registration dossier. For sensitive skin specifically, human safety assessment data — including a 30-day repeat insult patch test (RIPT) on at least 30 subjects — is now standard expectation, not optional. Three out of five new brand partners we onboard for China registration underestimate this timeline. The RIPT alone adds 6–8 weeks to your launch schedule.
| Framework | Claim Type | Key Evidence Requirement | Typical Timeline Impact |
|---|---|---|---|
| EU 1223/2009 + Claims Reg 655/2013 | Cosmetic claim, must be substantiated | Consumer perception study or instrumental data in PIF | 8–12 weeks for study + documentation |
| FDA (FD&C Act) | Cosmetic unless drug claim triggered | No mandatory pre-market filing; post-market substantiation expected | Minimal if claim language is managed |
| NMPA (2021 CSAR) | Efficacy claim requires dossier filing | RIPT (n≥30, 30 days) + efficacy substantiation data | 14–20 weeks including registration review |
The cost implication here is real. Running a compliant RIPT plus a consumer perception panel for a single SKU targeting all three markets simultaneously typically runs $18,000–$35,000 USD in third-party CRO fees, depending on the study design and geography. Most indie brands budget for one market and then scramble when they want to expand.
Instrumental Measurement: What We Actually Use in the Lab #
Consumer perception studies get the marketing attention, but instrumental data is what holds up in a regulatory challenge. For barrier-repair and sensitive skin products, the core measurement battery we recommend covers three parameters: transepidermal water loss (TEWL), corneometry (skin hydration), and erythema index.
TEWL is measured with a closed-chamber or open-chamber evaporimeter — we use both depending on the study protocol. Baseline TEWL in healthy skin typically runs 5–10 g/m²/h. In compromised barrier skin, you’re often looking at 15–25 g/m²/h or higher. A meaningful reduction after 4 weeks of product use is generally accepted as ≥20% decrease from baseline, though we’ve seen reviewers push for ≥25% in EU dossiers for stronger claim language.
Corneometry gives you capacitance-based hydration readings. The Corneometer CM 825 is the industry standard — readings below 30 AU indicate dry skin, 30–45 AU is normal range. For a sensitive skin product claiming “intensely hydrates,” we want to see a statistically significant increase of at least 15 AU from baseline at the 4-week timepoint, sustained at 8 weeks. If the numbers drop back between week 4 and week 8, that’s a formulation conversation, not a study design conversation.
Erythema index via Mexameter or Chromameter gives you objective redness quantification. This is particularly important for rosacea-adjacent claims and post-procedure skin positioning. We’ve had one project where the consumer panel said skin looked “calmer and less red” at week 6, but the Mexameter data showed no statistically significant change in erythema index. That’s a problem. The claim had to be softened to “skin feels calmer” — a perception claim, not an instrumental one.
For barrier repair and sensitive skin formulations, we also increasingly include skin microbiome diversity scoring as a secondary endpoint, particularly for products containing prebiotics or postbiotics. This is still evolving — what’s acceptable as supporting evidence today may become a primary endpoint requirement in 3–5 years, especially in the EU where the microbiome-skin axis is getting regulatory attention.
Consumer Panel Study Design: Where Most Brands Get This Wrong #
Honestly, most brands underestimate how much the panel design affects the credibility of the claim. A 20-person open-label self-assessment study is not the same as a 60-subject double-blind, vehicle-controlled trial. Both can appear on pack as “clinically tested” — but one will survive a regulatory challenge and one won’t.
The minimum we recommend for a credible sensitive skin consumer panel: 50 subjects with self-declared sensitive skin (confirmed by dermatologist screening), randomized split-face or parallel-group design, 8-week duration minimum, with both instrumental and self-assessment endpoints. The self-assessment questionnaire should use validated scales — we use a modified version of the Sensitive Scale-10 (SS-10) developed by Misery et al., which covers stinging, burning, tightness, and itching across 10 items.
One clinical reference point we use internally: a double-blind, randomized controlled trial (n=62, 12 weeks) evaluating a ceramide-dominant barrier repair emulsion showed a 34% reduction in TEWL from baseline versus 11% for vehicle control, with a statistically significant improvement in SS-10 scores at week 8 (p<0.01). That’s the kind of data architecture that supports a strong on-pack claim. The study design — parallel group, dermatologist-assessed, with both objective and subjective endpoints — is the template we walk brand partners through when they’re scoping a new sensitive skin launch.
Subject recruitment is where projects slow down. Recruiting genuinely sensitive skin subjects (not just self-declared) takes longer than brands expect. We now require CRO partners to provide a screening failure rate estimate upfront — typically 30–40% of candidates don’t pass the dermatologist screening. Build that into your timeline.
Before/After Photography Protocol: The Details That Matter #
Photography is often treated as an afterthought. It shouldn’t be. Poorly controlled before/after images are worse than no images — they invite regulatory scrutiny and consumer complaints.
The protocol we specify for sensitive skin studies: standardized lighting (5500K color temperature, cross-polarized and parallel-polarized captures), fixed camera-to-subject distance (typically 30 cm for facial close-ups), consistent positioning using a chin rest and forehead guide, and same time of day for all visits (morning, pre-product application, after 20-minute acclimatization). Subjects should arrive without makeup and with no product applied for at least 12 hours prior.
We’ve rejected photography data from two CRO partners in the past three years because their lighting setup wasn’t consistent between baseline and endpoint visits. The redness appeared to improve in the photos, but it was an artifact of slightly different light angle. That kind of data doesn’t go in a dossier.
Cross-polarized imaging specifically helps visualize subsurface redness and early erythema that standard photography misses. For sensitive skin claims, this is worth the additional equipment cost. Parallel-polarized imaging captures surface texture and scaling — relevant for barrier repair claims where visible flaking is a baseline characteristic.
Image analysis software — VISIA, Antera 3D, or similar — should be used to generate quantitative outputs, not just qualitative “looks better” assessments. Quantitative erythema mapping from photography, cross-referenced with Mexameter data, gives you two independent lines of evidence for the same endpoint. Regulators like that.
Designing a 12-Week Study for Barrier Repair & Sensitive Skin #
This is usually where the conversation gets practical. A brand comes to us with a finished formula, a target market, and a launch date. The study design has to fit all three constraints simultaneously.
For a 12-week sensitive skin efficacy study targeting EU and China registration simultaneously, here’s the architecture we recommend:
Study population: 60 subjects minimum (accounting for ~15% dropout), self-declared sensitive skin, confirmed by dermatologist using a standardized stinging test (lactic acid 10% applied to nasolabial fold — positive responders included). Age range 25–55, Fitzpatrick types II–IV for EU cohort, types III–V for China cohort if running parallel arms.
Primary endpoints: TEWL reduction from baseline at week 4, week 8, and week 12. Secondary endpoints: corneometry increase, erythema index change, SS-10 questionnaire score, and global dermatologist assessment at each visit.
Visit schedule: Baseline (day 0), week 2 (safety check only — no efficacy measurements), week 4, week 8, week 12. Photography at baseline, week 4, and week 12. Blood draw is not required for cosmetic claims, but a dermatologist safety assessment at every visit is non-negotiable for NMPA dossier purposes.
Product use: twice daily application, morning and evening, to full face. No other leave-on products permitted during the study period — this is the instruction that subjects most frequently violate, and it’s the most common reason for data exclusion. We now require CRO partners to include a product diary and a prohibited product list in the subject briefing pack.
Statistical analysis: ANCOVA with baseline as covariate for continuous endpoints, Wilcoxon signed-rank for ordinal questionnaire data. Pre-specify your primary endpoint and your significance threshold (p<0.05 is standard) before the study starts. Post-hoc endpoint switching is a red flag in any regulatory review.
The ICH Stability Guidelines framework, while primarily designed for pharmaceuticals, is increasingly referenced by NMPA reviewers when assessing cosmetic efficacy study methodology. Worth knowing.
One thing we haven’t fully solved: running a single study that satisfies EU, FDA, and NMPA simultaneously without a parallel-arm design that inflates cost and complexity. Our current approach — a core 60-subject EU/China study with a US label claim review layered on top — works, but it’s not elegant.
Formulation Notes for Brand Partners #
What market? What are you expecting on-pack? Those are the first two questions we ask when a sensitive skin brief comes in, because the answer changes everything about the formulation strategy.
If you’re targeting EU with a “dermatologically tested, suitable for sensitive skin” claim, we’re building around a fragrance-free, low-irritant base — typically a ceramide-phospholipid emulsion at pH 5.2–5.8, preserved with phenoxyethanol at ≤0.8% or a glycol-based system. The formula has to survive the PIF review, which means every ingredient with a known sensitization profile needs a justification note.
For China registration, the formulation documentation burden is higher. Every raw material needs a supplier safety data sheet, and if you’re using any ingredient on the NMPA restricted list — even at sub-threshold concentrations — you need a safety assessment addendum. We’ve had projects delayed by 10 weeks because a botanical extract supplier couldn’t provide the required heavy metal testing certificate.
For acid exfoliation and sensitive skin crossover products, the pH management is critical. Drop below pH 4.0 and you’re in a different regulatory conversation in the EU. Most brands don’t realize this until we tell them.
Budget for the study early. A 12-week CRO study for a single SKU, properly designed for multi-market use, runs $22,000–$45,000 USD depending on the CRO, geography, and endpoint battery. That’s before translation, regulatory filing fees, or dossier preparation. It’s not a small line item, and it’s not optional if you want credible claims.
Frequently Asked Questions #
Q: We want to put “clinically tested for sensitive skin” on pack for EU — what’s the minimum study we actually need?
A minimum credible study for that claim in the EU is 50 subjects with confirmed sensitive skin, 8 weeks duration, with at least one instrumental endpoint (TEWL or corneometry) plus a validated self-assessment scale. Open-label, 20-person studies technically satisfy the “clinically tested” wording but won’t hold up if a competitor or regulator challenges the claim under EU 655/2013. We recommend 60 subjects and a double-blind design if the claim is going on the front panel.
Q: Can we use the same study data for both EU and China registration?
Sometimes, but not always. NMPA requires a RIPT with at least 30 subjects as a standalone safety document — that’s separate from your efficacy study. If your EU study was conducted in Europe with a predominantly Caucasian cohort, NMPA reviewers may request a supplementary study with Chinese subjects, particularly for products making skin tone or complexion-related claims. Budget for this possibility from the start.
Q: Our formula has 0.5% niacinamide and 2% panthenol — do we need a full sensitization study?
At those concentrations, no — both are well-characterized ingredients with established safety profiles. What you do need is a documented safety assessment in your PIF (EU) or registration dossier (China) confirming the concentrations are within accepted limits. The SCCS Scientific Opinion database is your reference for EU. For NMPA, the 2021 Cosmetic Ingredient Safety Technical Guidelines apply.
Q: How long does a 12-week study actually take from brief to final report?
Add 6–8 weeks for CRO selection, protocol finalization, and ethics approval. Then 12 weeks of study execution. Then 4–6 weeks for data analysis and report writing. You’re looking at 22–26 weeks minimum from brief to final report in hand. Brands that plan to use study data for a launch dossier need to start this process at least 7 months before their target launch date.
Q: We’ve seen “hypoallergenic” on competitor products — is that a regulated claim?
In the EU and US, “hypoallergenic” is not a legally defined term — there’s no standardized test required to use it. That said, the FTC has historically scrutinized hypoallergenic claims, and EU claim substantiation rules mean you still need evidence that the product is less likely to cause allergic reactions than a comparator. In practice, we back hypoallergenic claims with a HRIPT (Human Repeat Insult Patch Test) on at least 50 subjects with zero sensitization reactions. That’s the defensible standard.
Have a product concept in mind? Contact our formulation team to request a complimentary brief review.
© 2026 Mastracare.com. All rights reserved.
Unauthorized reproduction or distribution is prohibited.