Chronicle KeeperDocs
Docs / LLM setup

Setting up your LLM

Chronicle Keeper transcribes your audio on-device, but you choose which language model does the writing. Run one locally for free, or bring a cloud API key.

Your keys stay on your machine.

API keys are stored locally and LLM calls go straight from your computer to the provider — never through any Chronicle Keeper server. There isn't one.

Which should I pick?

OptionCostPrivacyBest for
Ollama (local)FreeStays on your devicePrivacy, no bills, working offline
Anthropic ClaudePay per useSent to AnthropicHighest-quality recaps
OpenAI-compatiblePay per use (Groq has a free tier)Sent to providerSpeed, choice, existing keys

Not sure? Start with Ollama if you value privacy and zero cost, or Anthropic if you want the best writing and don't mind paying cents per session.


Option A — Ollama (local & free)

Ollama runs an open language model entirely on your own computer. No account, no key, no bill, and it works offline once the model is downloaded. It's the most private option.

  1. Install Ollama

    Download it for your OS from ollama.com/download and run the installer. On macOS and Windows it runs in the background automatically.

  2. Pull a model

    Open a terminal and download a model. A small, capable starting point:

    ollama pull gemma4:e2b

    Models we'd recommend for session summaries, roughly smallest to best:

    ModelRAMGood for
    gemma4:e2b~8 GBLightweight machines; quick, decent summaries
    gemma4:e4b~16 GBThe sweet spot — strong multilingual summaries (incl. German), still local
    qwen3:30b (A3B Instruct)~16 GB+Faster and more detailed than gemma4 thanks to its mixture-of-experts design

    Browse more in the model library. For summarizing, prefer an Instruct model over a Thinking / reasoning one — thinking models are slower and can leak their reasoning into the summary.

  3. Make sure it's running

    Ollama serves an API at http://localhost:11434. If the app can't reach it, start it manually:

    ollama serve
  4. Point Chronicle Keeper at it

    In Settings → LLM, choose Ollama, confirm the base URL is http://localhost:11434, and pick the model you pulled. Hit Test to confirm the connection.

How big a model can I run?

Rough rule: a model needs about as many GB of RAM as it has billions of parameters. A 7–8B model wants ~8 GB free; pick a 3–4B model on lighter machines. Mixture-of-experts models (like gemma4:e4b or qwen3:30b) are an exception — they're large on disk but only activate a fraction at a time, so they run fast even on a 16 GB laptop. Ollama's official docs cover hardware in detail: Ollama FAQ →

Long sessions & speed

A full 3–4 hour session is a lot of text. Chronicle Keeper automatically sizes the model's context window to fit your whole transcript, so the summary always reflects the entire session — not just the first part. You don't need to configure anything.

Local summaries take a few minutes — that's normal.

On a typical laptop, reading a long transcript and writing a summary can take 3–5 minutes with a local model. The app isn't frozen; it's working. Cloud providers are much faster (seconds) if you're in a hurry.

Two advanced settings let you tune the local behaviour if needed (most people never touch them):

  • Context window cap (ollama_num_ctx_max) — the largest window the app will request. The default fits a long session; lower it only if a big model spills onto your CPU and crawls.
  • Timeout (ollama_timeout_seconds) — how long to wait before giving up. The default (several minutes) covers long local runs; raise it on slower machines or for very long sessions.

Option B — a cloud API key

Cloud models often write the best recaps. You'll create an account with a provider, generate an API key (a long secret string), and paste it into Chronicle Keeper's Settings. The key is stored locally on your machine.

Treat your API key like a password.

Don't share it or commit it anywhere public. Most providers let you revoke and regenerate keys if one leaks. Cloud usage is billed per token, so set a spending limit in the provider's dashboard.

Anthropic (Claude)

Built-in via the native Messages API. Great default for quality.

  1. Create an account

    Sign up at console.anthropic.com and add billing credit.

  2. Generate a key

    Go to Settings → API Keys → Create Key, name it (e.g. "Chronicle Keeper"), and copy the sk-ant-… string. It's shown only once.

  3. Paste it in

    In Chronicle Keeper Settings → LLM, choose Anthropic, paste the key, pick a model (e.g. a current Claude), and hit Test.

OpenAI-compatible providers

One built-in client covers OpenAI and a family of compatible services. Pick whichever you already have an account with — the setup is the same: create a key, paste it, select the model.

AI

OpenAI

Paid

The original GPT models.

Key prefix: sk-…

platform.openai.com/api-keys →
Gq

Groq

Free tier

Very fast inference; generous free tier to start.

Key prefix: gsk_…

console.groq.com/keys →
Ds

DeepSeek

Paid

Strong, low-cost models.

Key prefix: sk-…

platform.deepseek.com →
Mi

Mistral

Paid

European models, good multilingual quality.

From La Plateforme.

console.mistral.ai →
Tg

Together

Paid

Hosts many open models behind one key.

Key prefix:

api.together.xyz →
Px

Perplexity

Paid

Sonar models.

Key prefix: pplx-…

perplexity.ai/settings/api →
Ge

Google Gemini

Free tier

Use Gemini's OpenAI-compatible endpoint.

Key from AI Studio.

aistudio.google.com →
Mx

MiniMax

Paid

Also covered by the compatible client.

From the MiniMax console.

minimax.io →

General steps for any of them

  1. Sign up & add billing

    Create an account on the provider's site (links above) and add a payment method or claim the free tier.

  2. Create an API key

    Find the API keys section, create a new key, and copy it. Keys are usually shown only once — store it somewhere safe.

  3. Add it in Settings

    In Chronicle Keeper, open Settings → LLM providers, pick the provider, paste the key, choose a model, and click Test to verify it works.

Cohere?

Cohere support is deferred for now. Any of the OpenAI-compatible providers above will work in the meantime.

Test before you transcribe

Whichever route you choose, use the Test / Ping button next to the provider in Settings. A green result means the key and model are good and you're ready to summarize. If it fails, see Troubleshooting → LLM.