Setting up your LLM
Chronicle Keeper transcribes your audio on-device, but you choose which language model does the writing. Run one locally for free, or bring a cloud API key.
API keys are stored locally and LLM calls go straight from your computer to the provider — never through any Chronicle Keeper server. There isn't one.
Which should I pick?
| Option | Cost | Privacy | Best for |
|---|---|---|---|
| Ollama (local) | Free | Stays on your device | Privacy, no bills, working offline |
| Anthropic Claude | Pay per use | Sent to Anthropic | Highest-quality recaps |
| OpenAI-compatible | Pay per use (Groq has a free tier) | Sent to provider | Speed, choice, existing keys |
Not sure? Start with Ollama if you value privacy and zero cost, or Anthropic if you want the best writing and don't mind paying cents per session.
Option A — Ollama (local & free)
Ollama runs an open language model entirely on your own computer. No account, no key, no bill, and it works offline once the model is downloaded. It's the most private option.
-
Install Ollama
Download it for your OS from ollama.com/download and run the installer. On macOS and Windows it runs in the background automatically.
-
Pull a model
Open a terminal and download a model. A small, capable starting point:
ollama pull gemma4:e2bModels we'd recommend for session summaries, roughly smallest to best:
Model RAM Good for gemma4:e2b~8 GB Lightweight machines; quick, decent summaries gemma4:e4b~16 GB The sweet spot — strong multilingual summaries (incl. German), still local qwen3:30b(A3B Instruct)~16 GB+ Faster and more detailed than gemma4 thanks to its mixture-of-experts design Browse more in the model library. For summarizing, prefer an Instruct model over a Thinking / reasoning one — thinking models are slower and can leak their reasoning into the summary.
-
Make sure it's running
Ollama serves an API at
http://localhost:11434. If the app can't reach it, start it manually:ollama serve -
Point Chronicle Keeper at it
In Settings → LLM, choose Ollama, confirm the base URL is
http://localhost:11434, and pick the model you pulled. Hit Test to confirm the connection.
Rough rule: a model needs about as many GB of RAM as it has billions of parameters. A 7–8B model wants ~8 GB free; pick a 3–4B model on lighter machines. Mixture-of-experts models (like gemma4:e4b or qwen3:30b) are an exception — they're large on disk but only activate a fraction at a time, so they run fast even on a 16 GB laptop. Ollama's official docs cover hardware in detail: Ollama FAQ →
Long sessions & speed
A full 3–4 hour session is a lot of text. Chronicle Keeper automatically sizes the model's context window to fit your whole transcript, so the summary always reflects the entire session — not just the first part. You don't need to configure anything.
On a typical laptop, reading a long transcript and writing a summary can take 3–5 minutes with a local model. The app isn't frozen; it's working. Cloud providers are much faster (seconds) if you're in a hurry.
Two advanced settings let you tune the local behaviour if needed (most people never touch them):
- Context window cap (
ollama_num_ctx_max) — the largest window the app will request. The default fits a long session; lower it only if a big model spills onto your CPU and crawls. - Timeout (
ollama_timeout_seconds) — how long to wait before giving up. The default (several minutes) covers long local runs; raise it on slower machines or for very long sessions.
Option B — a cloud API key
Cloud models often write the best recaps. You'll create an account with a provider, generate an API key (a long secret string), and paste it into Chronicle Keeper's Settings. The key is stored locally on your machine.
Don't share it or commit it anywhere public. Most providers let you revoke and regenerate keys if one leaks. Cloud usage is billed per token, so set a spending limit in the provider's dashboard.
Anthropic (Claude)
Built-in via the native Messages API. Great default for quality.
Create an account
Sign up at console.anthropic.com and add billing credit.
Generate a key
Go to Settings → API Keys → Create Key, name it (e.g. "Chronicle Keeper"), and copy the
sk-ant-…string. It's shown only once.Paste it in
In Chronicle Keeper Settings → LLM, choose Anthropic, paste the key, pick a model (e.g. a current Claude), and hit Test.
OpenAI-compatible providers
One built-in client covers OpenAI and a family of compatible services. Pick whichever you already have an account with — the setup is the same: create a key, paste it, select the model.
Groq
Free tierVery fast inference; generous free tier to start.
Key prefix: gsk_…
Google Gemini
Free tierUse Gemini's OpenAI-compatible endpoint.
Key from AI Studio.
aistudio.google.com →General steps for any of them
Sign up & add billing
Create an account on the provider's site (links above) and add a payment method or claim the free tier.
Create an API key
Find the API keys section, create a new key, and copy it. Keys are usually shown only once — store it somewhere safe.
Add it in Settings
In Chronicle Keeper, open Settings → LLM providers, pick the provider, paste the key, choose a model, and click Test to verify it works.
Cohere support is deferred for now. Any of the OpenAI-compatible providers above will work in the meantime.
Test before you transcribe
Whichever route you choose, use the Test / Ping button next to the provider in Settings. A green result means the key and model are good and you're ready to summarize. If it fails, see Troubleshooting → LLM.