Restricted · PACE NLP

Pierre's project pack. Enter name + PIN.

That combination doesn't match. Try again.
DS301 NLP project · private preview
Nadia — NLP-specialist persona
Nadia · NLP specialist "The work" — what got submitted

PACE · Extended Findings · operational memo built on the same dataset as the academic report · LGP+KKR-framed

PureGym Extended Findings — Beyond the Rubric

Audience: Leonard Green & Partners + KKR (PE owners), Pinnacle Topco / Pinnacle Bidco (UK group + bond issuer), Clive Chesser (CEO from Nov 2024), Humphrey Cobbold (Chairman). Treat as a consultant memo, not an academic report — every recommendation tagged OpEx (recurring £/year) vs CapEx (one-off £) and ranked by effort × leverage.

Scope. Extends the CAM_DS_301 academic submission (report.md) with operational findings that fell outside the 800-1000 word limit. All findings derive from the same dataset: 27,666 PureGym customer reviews (Trustpilot 15,815 + Google 11,851), plus 43 PureGym Glassdoor staff reviews, plus the FY2024 Companies House filing (Pure Gym Limited, reg 06690189), plus a Perplexity Sonar Deep Research industry brief (with documented errata — see § Caveats).

Frame. PureGym is a mature 410-corporate-gym + 23-franchise UK operator (433 total) with £416m turnover, £187m reported EBITDA (45% margin), £123.5m adjusted EBITDA (29.7% margin), 1.50m UK members, ARPM £22.64, +13% YoY revenue, +7% YoY membership. LGP + KKR exit thesis: EBITDA margin expansion + multiple re-rating, not new-store CapEx. Every recommendation in this memo is filtered through that lens — recurring opex savings and retention lifts beat one-off CapEx unless the CapEx triggers a margin ratchet.


TL;DR — what to do Monday morning

#ActionEffortLeverageOpEx / CapExVerdict
1Re-rank Trustpilot reply queue by anger / sadness emotion so angry reviews get answered before joyful ones1 unit5/5OpEx (~0 — workflow change to existing reply ops)Ship this week
2Surface "we serve who serves you" cohort — 1,177 shift-worker reviews, 696 are 4★+, 681 tag joy. Active campaign + targeted retention offer2 units4/5OpEx (marketing budget reallocation)Q3 marketing slot
3Seasonal HVAC pre-emptive ops — May–Sep AC complaints 3× baseline; Nov–Dec hot-water complaints 5× baseline. Pre-summer service contract + winter boiler audit2 units4/5OpEx (service contract)Annual cycle
4Set per-site music-volume target with manager accountability — Glassdoor null result confirms music is per-site, not corporate. Ownership is club managers, not HQ1 unit3/5OpEx (~0 — policy memo)Ship next month
5Revisit the top-10 worst clubs as a programme — already drafted in v3_11_club_briefs.md (London Stratford 81, Leicester Walnut 59, Enfield 48, Swiss Cottage 37, Birmingham 34, Bermondsey 33, Seven Sisters 33, Hayes 33, Hammersmith, Hounslow). Sequenced 90-day plans per site3 units4/5Mixed (cleaning OpEx + locker / equipment CapEx)Q3 ops plan
6Speculative pilots — warm-shower temperature A/B, AC double-doors, capacity-cap queueing app1 unit (writing only)unknownn/a — design proposalsDiscussion item

§ 1 — Reply latency: the highest-leverage Trustpilot intervention

The data

PureGym replies to 97% of Trustpilot reviews (15,356 of 15,815). Reply rate is not the problem. Reply prioritisation is.

EmotionCountMedian reply (hours)Mean reply (hours)
Joy9,96098.8136.5
Surprise9987.5122.7
Love32899.0126.7
Fear482122.6163.3
Sadness1,606124.1173.1
Anger2,881130.1173.1
PureGym replies to angry customers 31% slower than to joyful ones. This is the inverse of best practice.

Latency band breakdown for the 2,881 anger-tagged Trustpilot reviews:

BandAnger count% of anger
<1 hour60.2%
1–4 hours240.8%
4–24 hours1535.3%
24–72 hours49417.1%
3–7 days1,10138.2%
1–4 weeks1,10338.3%
Only 6.3% of angry reviews are answered within 24 hours. 38.3% are still unanswered after a week.

Industry anchor

A 2026 support-response benchmark study found that organisations responding to customer messages within 1 hour achieve 71% retention vs 48% for those responding within 24 hours — a 23-percentage-point retention gap explained purely by reply speed ([Unthread 2026](https://unthread.io/blog/internal-support-response-time-service-desk/)). Top retailers respond within 2 hours; the retail-sector benchmark median is 17 hours. PureGym's 130-hour anger median is 7.6× the retail median.

The 2026 Google Business Profile benchmarks survey shows that "fast response is the strongest predictor of star-rating recovery after a 1- or 2-star review" ([WebFX 2026](https://www.webfx.com/blog/seo/google-business-profile-benchmarks/)).

The recommendation

Re-rank the Trustpilot reply queue by emotion before assignment to a customer-service agent. Anger and sadness move to the front. Joy and surprise can wait. The combined_phase9.parquet already has is_anger / is_sadness flags from the BERT emotion classifier; the same model can run inline at review-ingest time.

Cost. OpEx ~zero. The reply ops team already exists; this changes the *order* of work, not the *amount*. One sprint to wire the emotion classifier into the existing Trustpilot intake pipeline.

Leverage. Anger reviews are 18% of the Trustpilot negative cohort. If even half of them shift from "ignored for a week" to "answered within 24h", at industry retention multipliers, the implied churn reduction is material. Without a fabricated £ number: this is the single most concentrated improvement available from the existing data.

Owner. Customer service ops (corporate). Not a per-site fix.


§ 2 — Shift-worker cohort: an underserved positive segment

The data

Filtering all 27,666 reviews for shift-worker keywords (nurse, police, officer, paramedic, NHS, night shift, after my shift, finished work, 24/7, 5am, 6am, etc.) returns 1,177 matches (4.3% of corpus). The sentiment skew is striking:

ScoreCount% of cohort
5★46239%
4★23420%
3★17315%
2★1039%
1★20517%
59% are 4–5 star. vs ~30% positive baseline across the full negative-filtered analysis. 681 of the 1,177 tag "joy" as primary emotion (58%).

Caveat from validation. A Claude Sonnet 4.6 zero-shot pass on a 200-row stratified sample of these matches confirms shift-worker identity in only 6 / 200 (3%, "yes"); 154 / 200 (77%) are "unclear" — the writer praises 24/7 access without naming a profession; 40 / 200 (20%) are explicit "no". The keyword filter is a 24/7-praise filter, not a verified-shift-worker filter. See § 7.5 for methodology. The campaign opportunity is no smaller for the correction, but the framing must lead with 24/7-as-flexibility-promise as the dominant data signal, with shift workers as the visible but smaller proof cohort.

Top keyword hits: - 24/7 access: 798 mentions ← the dominant signal - Security / cleaner / warehouse / bouncer: 254 - After-my-shift / finished work / before work: 102 - Police / officer: 27 - NHS / hospital / midwife / doctor: 15 - Night shift / shift work: 13

Verbatim positives (sample)

"Open 24 hours so I can always go after doing a night shift!" — Bletchley, 5★
"Don's 6am BOOTCAMP class on Monday's are the best!!! He's very encouraging and pushes you to be the best YOU" — Elkridge, 5★
"Good selection of equipment, not too busy (at the times I go) and accessible 24/7" — Inverness Inshes, 4★
"Brought for my son less than £6pw an excellent membership can use anytime as its open 24/7" — Weston Super Mare, 5★
"Lovely gym 24/7 access classes included with membership also free parking" — Newcastle Longbenton, 5★
The full 1,177-row CSV is at v3/output/v3_ext_shiftworker_matches.csv.

The recommendation

A targeted "We serve the people who serve you" campaign — discount or perk for verified NHS / police / fire / ambulance / armed-forces members. The data shows this cohort is already disproportionately satisfied; the marketing move converts a passive strength into an explicit acquisition channel and a defensive retention moat.

Channel design (proposed, not measured): 1. Verification via existing platforms (Blue Light Card, Defence Discount Service) — ~zero infrastructure cost. 2. Headline benefit: dedicated 5–7am "shift-end" perks (towels, free coffee, priority lockers) at sites with relevant catchments. 3. Storytelling content: 5 verbatim quotes (above) become the campaign's first three creatives. 4. Ad campaign sequenced for autumn / January push, when shift workers' New Year resolutions intersect with PureGym's January acquisition peak.

Cost. OpEx (marketing reallocation, not net new). The verification platforms already exist and are cheap-or-free for partners.

Leverage. The Service-Profit Chain literature (Heskett 1994; Bernhardt, Donthu & Kennett 2000) supports the upstream link from staff-and-niche-cohort satisfaction to retention. PureGym is the most-mentioned 24/7 budget gym in this dataset — defending that position with a campaign is cheaper than out-spending The Gym Group on generic acquisition.

Owner. Marketing (corporate) + ops (site-level execution).


§ 3 — Seasonality: AC summer, heating winter, both predictable

AC complaints (negative reviews mentioning aircon / cooling / "too hot" / stuffy / sweat)

MonthNegative reviewsAC mentionsRate
Jan554325.8%
Feb632447.0%
Mar713314.3%
Apr704304.3%
May2602810.8%
Jun3715214.0%
Jul4816613.7%
Aug3664512.3%
Sep3144514.3%
Oct613447.2%
Nov253218.3%
Dec564315.5%
Summer (May–Sep) AC complaint rate = 13.0% vs spring (Mar–Apr) baseline 4.3% — 3× lift.

Heating + cold-shower complaints

MonthRate
Feb1.1% (lowest)
Mar1.7%
Apr1.4%
May2.7%
Jun3.2%
Jul2.1%
Aug3.3%
Sep2.5%
Oct3.1%
Nov4.7%
Dec5.5%
Jan2.3%
Winter (Nov–Dec) cold-shower rate = 5.1% vs summer (Feb) baseline 1.1% — 4.6× lift.

Temperature mentions

Identical pattern: November–December peak (4.0%, 4.3%) vs spring trough (March 0.8%, April 0.6%).

The recommendation

Run an annual pre-emptive HVAC service cycle keyed to the data. - April pre-summer audit — every site's AC inspected before May. The chains that catch the issue in April reduce the May–Sep complaint surge. - October pre-winter boiler / hot-water audit — every site's hot-water system inspected before November. Cold-shower complaints in December are the clearest preventable churn driver in the dataset.

The existing v3_ctx_weather_* and v3_ext_weather_* artefacts already plot weather × negativity; this section closes the loop with the operational action.

Cost. OpEx (recurring service contract). The Perplexity panel review correctly flagged that the deep-research HVAC cost numbers were anchored on a 2,500 sqft boutique-studio assumption rather than PureGym's actual 10,000–20,000 sqft warehouse format — so absolute £ numbers must come from real PureGym facilities ops, not from this report. The directional recommendation (pre-emptive service > reactive repair) holds regardless.

Leverage. Removing one season-of-complaints from each year flattens the negative-review trajectory documented in the academic report's "Things Are Getting Worse" section.

Owner. Facilities ops (corporate, with site-level execution).


§ 4 — Music volume: a per-site finding, not a corporate one

The Glassdoor null result

The 43 PureGym Glassdoor staff reviews contain zero mentions of music, volume, loud, playlist, sound system, or speaker. The hypothesis that music volume is a corporate-mandated policy (à la some retail chains) is not supported by the data we have.

This re-frames the 43 Google "loud music" complaints (already documented in report.md) from a chain-wide policy lever into a per-site manager accountability lever.

The recommendation

Set a target dB range (e.g. 65–75 dB at the gym floor centre) and make it a club-manager monthly compliance metric. No CapEx — every PA system already supports volume control. The intervention is purely policy + manager dashboard.

If the company wants harder evidence, a 2-week dB measurement pass at the 10 most-complained sites is the next data step. Cost: a £30 SPL meter and ~10 site visits.

Leverage. 43 reviews is a small but concentrated thread; cleaning it up removes a recurring Google-review topic from the 11 BERTopic clusters identified in the academic report.

Owner. Club managers (per-site), accountable through standard ops review.


§ 5 — Top-10 worst clubs: an operational triage programme

The 10 clubs and their concentration

#ClubNegative reviewsLondon?Recurring theme cluster
1London Stratford81yesHygiene + named-staff issues + WiFi/ventilation broken
2Leicester Walnut Street59noTemperature/HVAC + equipment + showers + rule enforcement
3London Enfield48yesStaff attitude + equipment disorganisation + overcrowding
4London Swiss Cottage37yesMaintenance/odours + overcrowding + missing equipment
5Birmingham City Centre34noOutdated equipment + flickering lights + lockers
6London Bermondsey33yesHygiene + overcrowding + temperature/ventilation
7London Seven Sisters33yesInsufficient lockers + frequent unannounced closures
8London Hayes33yesToilets/showers/water/equipment maintenance
9New Barnet32suburbanOvercrowding + cleanliness + class noise + AC
10London Canary Wharf32yesMaintenance + ventilation + alleged staff harassment
422 negative reviews concentrated in 10 of 410 corporate sites — 2.4% of the network generating ~7% of the negative-review volume in this corpus. 8 of 10 are London / Greater London. The geographic concentration is itself the finding: a London-focused 90-day rapid-improvement programme covers 80% of the worst-club issue.

What fixing each typical issue would avoid paying for

Drawing on the recurring themes across the 10 briefs (v3_11_club_briefs.md), framed in the way Russell taught: list what you'd avoid paying for, hand the pricing to the owner.

Recurring issueOpEx avoidance (recurring)CapEx avoidance (one-off)Owner
Hygiene / cleanlinessNegative-review-driven retention loss; refund / goodwill credits; viral 1★ Google velocityNone (cleaning is OpEx-only)Regional ops + cleaning contract review
Equipment maintenanceRepeated repair callouts; member-LTV impact when 2-3 machines down at oncePhased equipment refresh (delays the inevitable, doesn't avoid)Facilities ops
Overcrowding at city-centre sitesRefund pressure when peak-hour members can't use machinesPossible vestibule / capacity-cap CapEx (see Pilot 2)Site-level ops
HVAC / ventilation / hot-waterReactive repair callouts at peak failure; December cold-shower churn (§3)Pre-emptive service contract (OpEx) usually beats reactive replacement (CapEx)Facilities ops
Locker theft / insufficient lockersGoodwill credits to theft victims; recurring 1★ reviews at affected sitesLocker hardware refresh at Seven Sisters / StratfordSite-level ops + capital plan
Named-staff issues (e.g. Stratford)HR cost of complaint resolution; brand damage if unaddressedTraining programmes (modest OpEx)HR + regional manager
Frequent unannounced closures (Seven Sisters)Pro-rata refund pressure; cancellationsCommunication system / ops processRegional ops
Recommendation. A sequenced 90-day-per-site programme, sequenced by complaint volume × London-density × OpEx-only-fixability. London Stratford first (81 reviews, manager-issue flagged, mostly hygiene/staff = pure OpEx). Birmingham and Leicester last (out-of-London, equipment-refresh CapEx-heavy = budget approval needed). The Qwen briefs at v3_11_club_briefs.md already provide the per-site this-week / next-month / next-quarter sequencing.

Owner. Regional ops manager (programme lead) + facilities ops (HVAC + locker work) + HR (named-staff escalations).


§ 6 — Speculative pilots (Tier 4: design proposals, not findings)

Full experiment specs at [v3/output/pilot_designs.md](v3/output/pilot_designs.md) — hypothesis / sites / duration / measurement / kill criteria / risk per pilot.

The academic report and §1–§5 above are grounded in dataset evidence. The proposals below are design ideas to test, not analyses with conclusions. Each is presented as an experiment design, not a recommendation.

6.1 Warm-shower temperature A/B

Hypothesis. Increasing shower water temperature 2–3°C (within safe limits) shortens average shower duration enough to offset the additional gas/electric cost, *and* reduces the cold-shower complaint cluster in the December review set.

Experiment design. Pick 4 matched sites by member count + complaint volume. Increase shower temp at 2 sites, hold the other 2 as control. Measure: shower duration (existing flow-meter data, if instrumented), gas/electric cost per member, December review complaint rate.

This memo will not estimate £ savings or hours. Pierre's working rule: cost / time / size claims fabricated by an LLM are always wrong unless quoted from real measurement. The experiment is the source of truth.

Risk. Scalding liability, member preference variance.

6.2 AC double-doors / capacity caps in summer

Hypothesis. The summer AC complaint surge (§3, 13% rate vs 4% baseline) is driven partly by door-open heat ingress at peak hours plus over-capacity heat generation. A door vestibule + a capacity cap at peak (e.g. 100 members on the floor max during 17:00–19:00 weekdays in July–August) reduces both.

Experiment design. Single-site pilot at a high-summer-complaint London site. Install temporary vestibule curtain + capacity-cap notice. Measure: AC complaint rate vs prior summer + control site, dB-of-complaints emotional intensity, cancellation rate during the pilot.

Cost. CapEx (vestibule install) + OpEx (capacity-cap signage and staff). Both small per site.

Risk. Member backlash on capacity caps. Mitigate: app-based reservation system.

6.3 Disney-style queueing app

Hypothesis. A "tell us what equipment you want for your workout, we'll give you a 10-minute window" reservation app reduces overcrowding-driven complaints (currently #3 BERTopic cluster in the Google reviews).

This is the most speculative item in the memo. No evidence for or against from the dataset. Worth one product-discovery sprint to validate user demand.

6.4 Buddy-pairing for similar-routine members

Hypothesis. Members of similar weight / routine paired into 30-minute slots create community-formation effects (Heskett service-profit-chain literature, Perplexity research §retention drivers).

Status. Left field. Park as a Phase 3 product idea. Mentioned for completeness.


§ 7 — Caveats: what the deep research got wrong

The Perplexity Sonar Deep Research output (puregym_deep_research.md) was reviewed by a 5-expert panel (panel_review_perplexity.md). Errors found:

1. Leonard Green acquisition date wrong. Report says Nov 2013 citing CCMP. CCMP did acquire PureGym in 2013; LGP acquired it from CCMP later (~2017). The Companies House FY2024 filing names LGP + KKR as current investors; the date discrepancy doesn't change the exit-thesis framing but does undermine specific transaction-multiple math.

2. Gym floor size assumption wrong. Report uses 2,000–3,000 sqft (boutique studio); PureGym sites are 10,000–20,000+ sqft warehouse format. Any HVAC, cleaning, or per-sqft-OpEx number from the research is inflated against this assumption — disregard absolute £ figures, retain only directional ratios.

3. Year-1 churn 40% / Year-2+ retention 85% comes from a single non-peer-reviewed source (benfit.co.uk, a blog aggregator). Useful as a sense-check, not as a citable benchmark. The Gym Group's own 2024 / 2025 annual reports are the authoritative source if the actual number matters to a recommendation.

4. ROI range "265–905%" methodologically unsound — divides low benefits by high costs and high benefits by low costs. Same-percentile gives 453–525%, still high but more defensible. Sensitivity analysis missing.

5. r=0.60 music-negativity correlation needs caveats — Pearson vs Spearman distinction not stated; review-length confound not controlled; causation vs co-occurrence not separated. Treat as suggestive, not confirmatory.

6. Milliman 1980s music research cited from a commercial audio-equipment blog. Original sources are Milliman 1982 (J Marketing) and 1986 (J Consumer Research). The directional finding (slow tempo → longer dwell) is robust; the specific 25% number should be re-cited from primary source if used in a board pack.

The PureGym FY2024 financials in this memo come directly from Companies House (Pure Gym Limited, reg 06690189, filed 23/05/2025, audited by KPMG LLP Nottingham) and are not subject to the panel-review caveats above.


§ 7.5 — Methodology robustness checks

The panel review of the Perplexity industry research flagged "no sensitivity analysis" as a methodological gap. This section closes the most material parts of that gap for findings new to this memo.

Shift-worker keyword filter — Sonnet 4.6 validation pass (§2)

The 1,177-row keyword-filtered set in §2 was validated by a Claude Sonnet 4.6 zero-shot classification on a 200-row stratified sample (100 score≥4, 100 score<4). Prompt: "Does the writer plausibly identify as a shift worker — nurse, police, paramedic, firefighter, NHS staff, security, night cleaner, warehouse, or someone whose work hours fall outside standard 9-5? answer yes / no / unclear."

VerdictCount% of 200
yes63.0%
unclear15477.0%
no4020.0%
Implication. The keyword filter is a 24/7-praise filter, not a verified-shift-worker filter. Of the 1,177 matches, the projected verified-yes count is ~36 (3% × 1,177). The 77% unclear band is genuinely indeterminate — many may still be shift workers but the review doesn't say so. The headline §2 finding is therefore reframed: 24/7-flexibility appreciation is the dominant signal, with shift workers as the visible but small proof cluster. The "we serve who serves you" campaign is still actionable, but on a smaller verified base than the unfiltered keyword count suggested.

Methodology + raw verdicts: v3/output/v3_ext_shiftworker_validate.{py,json,log}. Cost: 37k input tokens + 0.8k output tokens (Sonnet 4.6 standard pricing).

Emotion classifier OOD risk (§1, §2)

The bhadresh-savani/bert-base-uncased-emotion classifier used for the anger / sadness / joy / fear labels is Twitter-trained. The academic appendix already documents that ~1,486 1- and 2-star reviews are mis-labelled "joy" — a classic out-of-distribution failure on understated British complaint prose. Reply-latency-by-emotion (§1) and emotion-by-cohort (§2) both inherit this risk: - If 30% of "joy" labels are actually anger/sadness in the OOD prose (the appendix's churn_signal merge implies roughly that scale), the anger-median-130h vs joy-median-98h gap shrinks but does not disappear. The "PureGym replies fastest to joy and slowest to anger" inversion is robust to that level of mis-classification because the absolute ordering is preserved across both populations. - The §2 1,177 "joy" tag is more sensitive — if 30% of those are actually mis-labelled, the headline drops to ~480 verified joy. Still substantial. The campaign pitch is unchanged.

r=0.60 music-negativity correlation (panel review caveat)

The panel review flagged the r=0.60 finding as needing "Pearson vs Spearman, control for review length, causation vs co-occurrence" caveats. None of the recommendations in this memo (§4) lean on the magnitude of that correlation; the music-volume hypothesis is supported on the count side (43 Google reviews mentioning loud music) and falsified on the policy side (Glassdoor null result). Sensitivity to the r=0.60 number itself is therefore not material to any §4 action.


§ 8 — What this memo deliberately does not say

- No £ savings figures per recommendation. Per Pierre's working rule (and the LGP+KKR audience's likely read), an LLM-fabricated cost number is worse than no number. Anchor any £ calculation in a real measurement (FY2025 facilities-ops data, real CAC from PureGym's own marketing analytics, real shower-duration measurement at a pilot site).

- No claim about The Gym Group beyond what their public filings show. The Gym Group's £244.9m FY2025 revenue, 23% Adjusted EBITDA Less Normalised Rent margin, 27% mature-site ROIC, £21.60 ARPM, and 4% YoY membership growth are filing-direct ([The Gym Group 2025 results](https://www.tggplc.com/investors)). PureGym's £22.64 ARPM is +5% above this — the price-power gap is real and worth defending.

- No CapEx-led growth recommendation. The literature (Huff gravity, retail cannibalization) suggests urban-market saturation is a real risk above ~5,000 UK clubs. PureGym's expansion programme (433 → 460+ targeted) faces decreasing-returns headwinds. Retention-led EBITDA expansion at the 433 existing sites is the LGP+KKR exit-friendly play. Every recommendation in §1–§5 above is retention-flavoured for that reason.


§ 9 — Cross-references

- Academic submission: report.md (995 words, Falcon→Qwen + T4→A100 drift fixed, Sonnet shootout in appendix) — Canvas 3354 due 2026-04-27. - Deployed site (PIN-gated): [pace-study.pages.dev](https://pace-study.pages.dev/) — 6 pages currently (Report, Audio deck, Cribsheet, Walkthrough, Row inspector, Top 20 tips). This memo proposes adding a 7th: Extended findings. - Companion site: [pace-compass.pages.dev](https://pace-compass.pages.dev/) (alive, distinct project). - Business model calculator: v3/business-model.html — sliders for # gyms, churn rate, ARPM, CAC, retention investment, churn reduction. Constants: 410 base gyms, 29.7% EBITDA margin, 10× valuation multiple, £2m CapEx per gym. Source for the LGP+KKR exit math. - Per-club briefs: v3/output/v3_11_club_briefs.md — top-10 worst-club action briefs (Qwen2.5-72B), reviewed for §5 above. - Industry research: v3/output/puregym_deep_research.md (use with §7 caveats). - Panel review of research: v3/output/panel_review_perplexity.md. - PureGym FY2024 real numbers: PUREGYM_FY2024_REAL_NUMBERS.md (Companies House primary). - Glassdoor staff data: v3/output/glassdoor_staff_reviews_full.csv (43 reviews, 2023–2024 span, used for §4 null-result on music). - Shift-worker filtered set: v3/output/v3_ext_shiftworker_matches.csv (1,177 reviews, generated for §2).


§ 10 — Sources cited in this memo (web)

- [Unthread 2026 — Support Response Time Benchmarks](https://unthread.io/blog/internal-support-response-time-service-desk/) — 1h reply = 71% retention; 24h reply = 48%. - [WebFX 2026 — Google Business Profile Benchmarks](https://www.webfx.com/blog/seo/google-business-profile-benchmarks/) — fast response is the strongest predictor of star-rating recovery. - [Email Meter — Industry Standard SLA Response Times](https://www.emailmeter.com/blog/understanding-industry-standard-sla-response-times) — retail median 17h email response; financial services 14h. - [Spidya 2026 — Enterprise Services SLA Benchmark](https://spidya.com/en/blog/it-service-management/enterprise-services-sla-benchmark-2026-key-metrics-for-success) — 95% compliance industry target. - Companies House — Pure Gym Limited (reg 06690189), Annual Report & Financial Statements year ended 31 December 2024.


*Document prepared 2026-04-25 for the PACE-NLP-PROJECT extended findings deployment. Source data and analysis scripts at pace-nlp-project/v3/output/v3_ext_*.csv and pace-nlp-project/v3/output/combined_phase9.parquet.*