LLM setup · Chronicle Keeper docs

Chronicle Keeper transcribes your audio on-device, but you choose which language model does the writing. Run one locally for free, or bring a cloud API key.

Your keys stay on your machine.

API keys are stored locally and LLM calls go straight from your computer to the provider — never through any Chronicle Keeper server. There isn't one.

Which should I pick?

Option	Cost	Privacy	Best for
Ollama (local)	Free	Stays on your device	Privacy, no bills, working offline
Anthropic Claude	Pay per use	Sent to Anthropic	Highest-quality recaps
OpenAI-compatible	Pay per use (Groq has a free tier)	Sent to provider	Speed, choice, existing keys

Not sure? Start with Ollama if you value privacy and zero cost, or Anthropic if you want the best writing and don't mind paying cents per session.

Option A — Ollama (local & free)

Ollama runs an open language model entirely on your own computer. No account, no key, no bill, and it works offline once the model is downloaded. It's the most private option.

Install Ollama

Download it for your OS from ollama.com/download and run the installer. On macOS and Windows it runs in the background automatically.

Pull a model

Open a terminal and download a model. A small, capable starting point:

ollama pull gemma4:e2b

Models we'd recommend for session summaries, roughly smallest to best:

Model	RAM	Good for
`gemma4:e2b`	~8 GB	Lightweight machines; quick, decent summaries
`gemma4:e4b`	~16 GB	The sweet spot — strong multilingual summaries (incl. German), still local
`qwen3:30b` (A3B Instruct)	~16 GB+	Faster and more detailed than gemma4 thanks to its mixture-of-experts design

Browse more in the model library. For summarizing, prefer an Instruct model over a Thinking / reasoning one — thinking models are slower and can leak their reasoning into the summary.

Make sure it's running

Ollama serves an API at http://localhost:11434. If the app can't reach it, start it manually:
```
ollama serve
```
Point Chronicle Keeper at it

In Settings → LLM, choose Ollama, confirm the base URL is http://localhost:11434, and pick the model you pulled. Hit Test to confirm the connection.

How big a model can I run?

Rough rule: a model needs about as many GB of RAM as it has billions of parameters. A 7–8B model wants ~8 GB free; pick a 3–4B model on lighter machines. Mixture-of-experts models (like gemma4:e4b or qwen3:30b) are an exception — they're large on disk but only activate a fraction at a time, so they run fast even on a 16 GB laptop. Ollama's official docs cover hardware in detail: Ollama FAQ →

Long sessions & speed

A full 3–4 hour session is a lot of text. Chronicle Keeper automatically sizes the model's context window to fit your whole transcript, so the summary always reflects the entire session — not just the first part. You don't need to configure anything.

Local summaries take a few minutes — that's normal.

On a typical laptop, reading a long transcript and writing a summary can take 3–5 minutes with a local model. The app isn't frozen; it's working. Cloud providers are much faster (seconds) if you're in a hurry.

Two advanced settings let you tune the local behaviour if needed (most people never touch them):

Context window cap (ollama_num_ctx_max) — the largest window the app will request. The default fits a long session; lower it only if a big model spills onto your CPU and crawls.
Timeout (ollama_timeout_seconds) — how long to wait before giving up. The default (several minutes) covers long local runs; raise it on slower machines or for very long sessions.

Option B — a cloud API key

Cloud models often write the best recaps. You'll create an account with a provider, generate an API key (a long secret string), and paste it into Chronicle Keeper's Settings. The key is stored locally on your machine.

Treat your API key like a password.

Don't share it or commit it anywhere public. Most providers let you revoke and regenerate keys if one leaks. Cloud usage is billed per token, so set a spending limit in the provider's dashboard.

Anthropic (Claude)

Built-in via the native Messages API. Great default for quality.

Create an account
Sign up at console.anthropic.com and add billing credit.
Generate a key
Go to Settings → API Keys → Create Key, name it (e.g. "Chronicle Keeper"), and copy the sk-ant-… string. It's shown only once.
Paste it in
In Chronicle Keeper Settings → LLM, choose Anthropic, paste the key, pick a model (e.g. a current Claude), and hit Test.

OpenAI-compatible providers

One built-in client covers OpenAI and a family of compatible services. Pick whichever you already have an account with — the setup is the same: create a key, paste it, select the model.

OpenAI

Paid

The original GPT models.

Key prefix: sk-…

platform.openai.com/api-keys →

Groq

Free tier

Very fast inference; generous free tier to start.

Key prefix: gsk_…

console.groq.com/keys →

DeepSeek

Paid

Strong, low-cost models.

Key prefix: sk-…

platform.deepseek.com →

Mistral

Paid

European models, good multilingual quality.

From La Plateforme.

console.mistral.ai →

Together

Paid

Hosts many open models behind one key.

Key prefix: …

api.together.xyz →

Perplexity

Paid

Sonar models.

Key prefix: pplx-…

perplexity.ai/settings/api →

Google Gemini

Free tier

Use Gemini's OpenAI-compatible endpoint.

Key from AI Studio.

aistudio.google.com →

MiniMax

Paid

Also covered by the compatible client.

From the MiniMax console.

minimax.io →

General steps for any of them

Sign up & add billing
Create an account on the provider's site (links above) and add a payment method or claim the free tier.
Create an API key
Find the API keys section, create a new key, and copy it. Keys are usually shown only once — store it somewhere safe.
Add it in Settings
In Chronicle Keeper, open Settings → LLM providers, pick the provider, paste the key, choose a model, and click Test to verify it works.

Cohere?

Cohere support is deferred for now. Any of the OpenAI-compatible providers above will work in the meantime.

Test before you transcribe

Whichever route you choose, use the Test / Ping button next to the provider in Settings. A green result means the key and model are good and you're ready to summarize. If it fails, see Troubleshooting → LLM.

Setting up your LLM

Which should I pick?

Option A — Ollama (local & free)

Install Ollama

Pull a model

Make sure it's running

Point Chronicle Keeper at it

Long sessions & speed

Option B — a cloud API key

Anthropic (Claude)

Create an account

Generate a key

Paste it in

OpenAI-compatible providers

OpenAI

Groq

DeepSeek

Mistral

Together

Perplexity

Google Gemini

MiniMax

General steps for any of them

Sign up & add billing

Create an API key

Add it in Settings

Test before you transcribe