Japan’s annual reports.Read in English. With span-cited receipts.
Yūhō are Japan’s annual securities reports: the equivalent of US 10-Ks, ~88,000 pages filed each year by listed companies. YuhoLens reads them in English with every claim, currency, margin, and segment linked back to a page and span in the source. Open weights, GGUF on AMD silicon.
Machine translation loses the meaning. Professional translation takes weeks and costs thousands.
「急激な為替変動は営業利益率に重大な影響を及ぼす可能性がある。」
“Sudden foreign-exchange shaking may give a serious feeling of influence to operating profit ratio.”
Example: a fluent Japanese risk sentence rendered by raw machine translation as “Sudden foreign-exchange shaking may give a serious feeling of influence to operating profit ratio” - the meaning is mangled.
Beat 03The lens
The lens.
YuhoLens reads the source. Translates with context. Refuses when the source doesn't say so.
Prolonged yen weakness materially compresses operating margin in the electronic-components segment.¹
[evidence insufficient]claim about FY25 guidance was not span-grounded; refused.
「急激な為替変動は営業利益率に重大な影響を及ぼす可能性がある」, p.23 §2.1
Example output: “Prolonged yen weakness materially compresses operating margin in the electronic-components segment,” cited to page 23, section 2.1 of the source. A second claim about FY25 guidance had no matching span, so it is replaced with “[evidence insufficient]” rather than asserted.
A four-stage pipeline. Span-grounded. Refuses when uncertain.
Section-split → translate-with-context → citation-grounder → judge. Every claim ties to a verbatim Japanese span; sentences without grounding are replaced with [evidence insufficient].
Step 01 / Ingest
Paste any EDINET row or ticker.
Pull a row from EDINET-Bench, or upload your own filing. The pipeline runs section-split and span-grounding in one query.
$↵
Step 02 / Fetch
We fetch the source.
Section-split, regex-bounded, page-aligned. Every claim will trace back to a specific span.
PAGE000
DOWNLOADING · EDINET · ROW 0%
Step 03 / Read
Read it in English. With receipts.
Span-cited memo. Hover any number to see the original Japanese and page reference.
Operatingmargincompressed3.4%YoY¹onyenweakness².
Example sentence: “Operating margin compressed 3.4% year-over-year on yen weakness,” with the first claim cited to 営業利益率 on page 23 section 2.1 and the second to 為替予約 on page 24.
, end of pass · paper out
§ 03The receipt領収書
Open weights. Open eval. Every row maps to a script in the public repo.
The whole pipeline, corpus build, SFT, ORPO, KG-2 eval, GGUF export, reproduces in one MI300X-day. ~$80 of compute. No private data, no held-out tricks; click any row to open the script that produced it.
RECEIPT · 7 ROWS
01
BF16 weightsMIT · HuggingFace
02
GGUF Q3–Q8 quantsFive sizes · 7.18–14.03 GiB
03
KG-2 eval scripts50-prompt set · graders
04
DPO + ORPO logsFull training run history
§ 04Hardware物理
Trained on AMD Instinct MI300X. 192 GB HBM3. ROCm 7.0.
Full-parameter SFT of a 14B model at sequence length 8,192 needs ~140 GB peak VRAM. The MI300X has 192 GB of HBM3 in a single accelerator, an 80 GB H100 cannot fit this run. We trained on a single MI300X for 23 days at ~$3.50/hour, then exported six GGUF quantizations so the same model fits on consumer 8 GB laptops.
192 GB
HBM3 in one accelerator
Largest single-GPU memory in production. An 80 GB H100 cannot fit this run.
ROCm 7.0
Full-stack open
Same toolchain in dev and prod. PyTorch, FlashAttention, vLLM, all upstream.
5.3 TB/s
HBM3 bandwidth
Why long-context Japanese filings stream through SFT without OOM at seq_len 8 192.
Same weights, six sizes, 7.18 → 14.03 GiB
Five GGUF quantizations ship with the model. Click any bar for size delta against the Q4_K_M baseline.
10.06 tok/s on an 8 GB consumer laptop (Q3_K_M)
Q3_K_M
size7.18 GiB
vs Q4_K_M−18.5% vs Q4_K_M
Smallest fit, 8 GB VRAM laptops
Q4_K_M
size8.81 GiB
vs Q4_K_Mbaseline
Default · best size/quality trade
Q5_K_M
size9.94 GiB
vs Q4_K_M+12.8% vs Q4_K_M
Tighter rounding · ~12 GB VRAM
Q6_K
size11.46 GiB
vs Q4_K_M+30.1% vs Q4_K_M
Near-lossless · 12–16 GB VRAM
Q8_0
size14.03 GiB
vs Q4_K_M+59.3% vs Q4_K_M
Reference quant · 16 GB VRAM
Train on MI300X. Run on a Macbook. Same weights.
§ 05Get it開示
Open weights. Open eval. Open ledger.
BF16 weights for the lab, GGUF Q4–Q8 for the laptop, and the full eval pipeline for the auditor. MIT-licensed today.