What common novel-format issues does it fix?

Smart line break (re-merge broken lines), paragraph indent, chapter title formatting, full-width → half-width, Trad/Simp conversion, ad/downloader watermark cleanup, line-end digit stripping, adjacent duplicate-line removal. Every step is an independent toggle inside the three collapsible config sections on the right.

What's the difference between "Chapter Split" and "Chapter Reorder" buttons?

"Chapter Split" only breaks inline-written chapter titles onto their own lines — a lightweight pre-processing pass that leaves everything else untouched. "Chapter Reorder" sorts the entire novel by the numeric chapter index extracted from each title (第十二章, Chapter 5, etc.), useful for downloads with scrambled chapter order.

How do "Filter Words" and the "Filter Threshold" work together?

Filter Words accepts comma-separated keywords (e.g. channel,downloader,group) — every line containing any of them is removed. To avoid wiping legitimate text, set Filter Threshold to N → lines longer than N characters are exempt (kept even if they match a keyword). 0 disables the exemption. This lets you clean short watermark lines while preserving long body paragraphs.

What's the "Protected Dictionary" and is it related to the Chinese Converter?

Yes — both tools share the same protected dictionary (same localStorage key). It defines terms that should NOT be converted during Trad/Simp transformation. It only applies when Trad/Simp conversion is actually enabled in Advanced Settings (set to t2s or s2t). When set to "None", the panel shows a "rules inactive" hint.

Is anything uploaded to a server?

No. The entire pipeline runs in your browser — files never leave your machine.

Novel Processor

Novel Processor is a one-stop cleanup tool built for web novels, e-books, and other long-form text. It auto-repairs formatting issues, strips ads, normalizes chapter titles, reorders by chapter index, and optionally does Trad/Simp conversion — turning chaotic novel text into something clean and readable.

Novel Processor interface

What problems does it solve?

Typical pain points with downloaded novels:

Messy layout: paragraphs not separated, chapters running together, scattered blank lines
Ad watermarks: downloader stamps, group-promotion lines, "split-volume reading" markers
Inconsistent format: mixed Trad/Simp, full-width vs half-width chaos
Broken line wrapping: sentences split mid-line, paragraphs broken at the wrong spots
Chapter ordering: scrambled tables of contents, chapter titles without line breaks

Page layout

Two columns:

Left: file upload + text input + three primary action buttons + result card
Right: three-section configuration Collapse + Protected Dictionary panel

Left column: input & actions

File upload

Drag-and-drop or click; supports TXT, MD, and other text formats
Multi-file mode by default — drop an entire novel collection at once
In Single-File Mode (toggle in Advanced Settings), uploading a new file replaces the current one

Three primary action buttons

Button	Behavior
Start Process	Run the full pipeline: applies every checked option from the right-side config in order
Chapter Split	Just one thing — split inline-concatenated chapter titles onto their own lines, nothing else
Chapter Reorder	Extract chapter index from each title (第十二章, Chapter 5, 第三卷第四章, …) and re-sort the whole book. Untitled chapters go last

When done, the result is auto-copied to clipboard and shown in the result card.

Right column: three-section configuration

1. Typesetting (expanded by default)

Smart Line Break (on): detect paragraph boundaries via Chinese punctuation, pure-numeric lines, and special starters; re-merge broken sentences
- Sub-toggle Paragraph Indent (visible only when Smart Line Break is on, default on): add \t to each paragraph head — gives book-style typography
Paragraph Split (off by default): break long paragraphs into shorter ones via sentence detection (compromise English NLP + Chinese punctuation rules) — better for mobile reading

2. Content Cleaning (collapsed by default)

Chapter Title Formatting (on): recognize "Chapter X" / "第X章" patterns and normalize them
Merge Duplicate Chapter Titles: when the same chapter title appears more than once (including back-to-back repeats and near-duplicates differing only in full/half-width punctuation), collapse them into one, keeping the more complete title
Strip Line-End Digits (off): remove trailing digits only on lines ≥ 10 chars (avoids stripping years from short titles)
Trim Spaces (on): strip leading/trailing whitespace per line
Remove Adjacent Duplicates (off): drop only consecutive duplicate lines (not full-text dedup — preserves common short dialogue like "嗯" / "好")
Special Start Text: enter the novel title or a recurring header word — matching lines are forced into their own paragraph (prevents incorrect merging)
Filter Words + Filter Threshold:
- Filter Words: comma-separated keywords (channel,downloader) — every matching line gets dropped
- Filter Threshold: lines longer than N chars are exempt (kept even if they match) — protects body paragraphs. 0 disables the exemption.

3. Advanced Settings (collapsed by default)

Trad/Simp Conversion Segmented:
- None → skip this step
- Trad → Simp → uses tw → cn
- Simp → Trad → uses cn → tw
Single-File Mode: one file at a time
Auto-Export (visible only in single-file mode): skip preview and download directly

Protected Dictionary (shared with Chinese Converter)

Bottom-right of the page:

Master toggle: enable / disable all rules
Shows current s2t / t2s rule counts
Manage Rules button: opens the drawer to add/edit/delete rules, plus batch import/export
Inactive hint: if Trad/Simp conversion is set to "None", the panel shows a "rules currently inactive" hint

The dictionary is shared with the Chinese Converter tool (same localStorage key) — edit in one place, applies in both. Only participates in processing when Trad/Simp conversion is actually enabled in Advanced Settings.

Full pipeline (Start Process button)

The complete pipeline triggered by the main button:

Trad/Simp conversion (if enabled): applies the appropriate protected rules
Normalize newlines + strip novel artifacts: unify \r\n → \n, remove common downloader watermarks / 分卷阅读 markers / horizontal-rule lines
Full-width → half-width: normalize Latin letters, digits, and punctuation
Chapter mark formatting: replace colons in 第X章： with spaces, compress redundant spaces around chapter markers
Chapter split (if enabled): break inline-written chapter titles
Keyword filtering (if filled): drop lines matching keywords (subject to threshold exemption)
Line-end digit stripping (if enabled): strip trailing digits on long lines
Paragraph splitting (if enabled): break long paragraphs
Smart line break / paragraph indent: re-merge broken sentences, add paragraph indent, collapse extra blank lines

Result card

After processing:

Auto-copied to clipboard
Copy / Export / Edit in place: same as Chinese Converter
Result → Source: pipe the result back into the input for further processing (rare)

With "Auto-Export" enabled, the preview is skipped and the file downloads immediately.

Tips

First-time use

Test on a small slice (1-2 chapters) first; tune the config before processing the full book
Start filter words with 1-2 obvious watermark terms; expand the list after verifying

Power moves

Filter Threshold is the key trick: set it to ~50 (or the average paragraph length) to surgically delete short watermark lines while keeping long body paragraphs
Run Chapter Reorder standalone: when you only want to fix chapter order, click Chapter Reorder directly — don't run the full Start Process pipeline
Pre-edit Protected Dictionary: for novels with rare Trad/Simp-mixed proper names, add them to the dictionary first, then run Start Process

When it shines

Format-broken text from pirated novel sites
Novel collections harvested by scrapers / auto-downloaders
Long-form text from OCR / PDF copying
Pre-processing before importing into Kindle / Moon+ Reader / Apple Books

Technical note

For deeply-obfuscated watermarks (scrambled-character promotional content), combine Filter Words with Filter Threshold as a first pass. Extremely obfuscated patterns are best handled by the Text Toolbox with a custom regex, then fed back here for layout.

Runs entirely in your browser — no data is uploaded.

#Novel Processor

#What problems does it solve?

#Page layout

#Left column: input & actions

#File upload

#Three primary action buttons

#Right column: three-section configuration

#1. Typesetting (expanded by default)

#2. Content Cleaning (collapsed by default)

#3. Advanced Settings (collapsed by default)

#Protected Dictionary (shared with Chinese Converter)

#Full pipeline (Start Process button)

#Result card

#Tips

#First-time use

#Power moves

#When it shines

#Technical note