Question 1

Is my text uploaded?

Accepted Answer

No. Chunking runs entirely in your browser in JavaScript — your document never leaves your device. The page only sends an anonymous usage counter (the tool name and the input size), never the content.

Question 2

How does it keep chunks coherent?

Accepted Answer

It splits on double line breaks (paragraphs) and keeps fenced code blocks whole. Sentence and hard splits are used only as a last resort for blocks larger than the chunk size.

Question 3

Is there a size limit?

Accepted Answer

Only your device memory. With no server you can chunk documents of many megabytes; large inputs are processed without freezing the page.

Question 4

What chunk size should I use?

Accepted Answer

Set it below your model context window, leaving room for the reply — for example 8,000 to 12,000 characters per chunk for a typical chat model.

Semantic Chunker — Split Long Text for LLMs Locally, Without Cutting Mid-Thought

Long text or code to split

How it works

Why chunk text for LLMs?

FAQ

Related Tools

Token Counter

PII & Secret Stripper

LLM Markdown Converter