A multi-phase research pipeline that turns a topic into a long-form narrated audio episode — with chapters, notes, a cheat sheet, and a full source audit you can verify line-by-line. Built for the way you actually want to consume hard topics: while walking, driving, or doing the dishes.
educast.cc is the public face of an end-to-end research-to-audio pipeline. You hand it a subject and a few knobs — how dense, how long, how many modules, which language — and it goes off and does the work an analyst would do: maps the territory, gathers facts, cross-checks them against independent sources, plans a narrative, writes it, narrates it, and ships it as a self-contained shareable page.
Every share page carries the artifact and the audit trail. You can listen to the episode, read the notes, glance at the cheat sheet, and inspect every source the writer was allowed to draw from — including which facts were independently corroborated, by how many sources, and with what verdict.
Three numbers (density, target length, and module count) plus a language hint shape everything downstream.
From those, the pipeline computes its own internal budgets — how deep to recurse, how many parallel searches to fan out, how many facts to verify, how many sections to write, how long each section should be. Higher density means deeper recursion, broader fan-out, more verification calls, and a tighter fact floor per subtopic. Longer length means more sections at roughly 4,000 characters apiece, with each section getting a proportional slice of the verified fact pool.
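The knob-to-budget mapping can be sketched as a pure function. The constants below (the depth formula, the 4,000-character section size, the multipliers) are illustrative guesses consistent with the description above, not educast.cc's actual numbers.

```python
def compute_budgets(density: int, target_chars: int, max_depth: int = 5) -> dict:
    """Map the user-facing knobs to internal research budgets.

    density:      1-10, the "how dense" knob
    target_chars: target script length in characters
    """
    depth = min(max_depth, 1 + density // 2)       # higher density -> deeper recursion
    breadth = 2 + density                          # parallel searches at the root
    facts_per_subtopic = 3 + density               # fact floor per subtopic
    verification_budget = density * 10             # independent re-checks to run
    sections = max(1, round(target_chars / 4000))  # ~4,000 chars per section
    return {
        "depth": depth,
        "breadth": breadth,
        "facts_per_subtopic": facts_per_subtopic,
        "verification_budget": verification_budget,
        "sections": sections,
    }
```

Keeping this a pure function of the knobs means a run's budgets are reproducible from its inputs alone.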
The flow below is the real shape of what happens between your prompt and the published episode. Every step is implemented; the bullet points under each are what that step actually does, in order.
• If you bullet-point any topics in the prompt, an LLM extracts them as mandatory topics — coverage is enforced later in the writing step.
• In course mode, an outline pass produces N module titles + descriptions + summaries from your title and module count.
• Density (1–10) is converted into concrete budgets: research depth, research breadth, max subtopics, a fact demand per subtopic, and a verification budget.
• An LLM generates 8–10 broad search queries from the prompt and any mandatory topics.
• All queries fan out in parallel against a web search API; failed queries are logged and the rest of the pipeline keeps going.
• The result set is folded into a structured knowledge graph: subtopics, entities, initial facts (each tagged with importance and source indices), and typed relationships between entities.
• Per subtopic: an LLM does a gap analysis — what facts are still missing? — and emits targeted queries.
• Pages are fetched (up to 25 per subtopic), parsed, and facts are extracted with source attribution.
• Recursion is controlled by your depth knob (1–5). Each level halves the breadth, so the tree narrows as it deepens.
• Subtopics are researched in parallel under a concurrency limiter so the pipeline doesn’t hammer either the search API or your wallet.
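The recursion-with-halving-breadth scheme, run under a shared concurrency limiter, can be sketched with `asyncio`. The tree-expansion step here is a stand-in for the real gap-analysis-and-search call.

```python
import asyncio

async def research_subtopic(topic: str, depth: int, breadth: int,
                            sem: asyncio.Semaphore) -> list:
    """Recursively research a subtopic; each level halves the breadth."""
    async with sem:  # the limiter caps concurrent search-API calls tree-wide
        # Stand-in for: gap analysis -> targeted queries -> fetched subtopics.
        children = [f"{topic}/{i}" for i in range(breadth)]
    if depth <= 1 or breadth <= 1:
        return children
    results = await asyncio.gather(*[
        research_subtopic(c, depth - 1, breadth // 2, sem) for c in children
    ])
    return children + [t for sub in results for t in sub]
```

Because every recursive call shares one semaphore, total in-flight requests stay bounded no matter how wide the tree gets, which is the "API and wallet" protection described above.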
• Facts are ranked by priority (importance × provisional credibility) and the top N (your verification budget) are selected.
• For each, a fresh independent search is run — not the same query that surfaced it the first time.
• An LLM reads the new sources and issues a verdict: confirmed, partially_confirmed, contradicted, or insufficient_evidence, with an adjusted confidence score in [0, 1].
• The verdict, the adjusted score, and the corroborating source IDs are baked into the fact record. Contradicted facts are marked, not silently dropped.
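The selection-and-verdict steps can be sketched as two small functions; the dict keys and the `raise` policy are assumptions for illustration.

```python
VERDICTS = {"confirmed", "partially_confirmed",
            "contradicted", "insufficient_evidence"}

def select_for_verification(facts: list, budget: int) -> list:
    """Rank by importance x provisional credibility; take the top N."""
    ranked = sorted(facts,
                    key=lambda f: f["importance"] * f["credibility"],
                    reverse=True)
    return ranked[:budget]

def apply_verdict(fact: dict, verdict: str, confidence: float,
                  corroborating_ids: list) -> dict:
    """Bake the verdict into the fact record.

    Contradicted facts are kept and marked, never silently dropped.
    """
    if verdict not in VERDICTS or not 0.0 <= confidence <= 1.0:
        raise ValueError("bad verdict or confidence")
    return {**fact,
            "verdict": verdict,
            "confidence": confidence,
            "corroborated_by": corroborating_ids}
```

Returning a new record rather than mutating in place keeps the pre-verification state around for the audit trail.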
• Facts are allocated to subtopics by a weighted score: roughly importance × 0.5 + sources × 0.3 + verifications × 0.2, with a floor so no discovered subtopic gets starved.
• A research-enhanced writing prompt is assembled — verified facts inline, citations attached, mandatory topics flagged.
• A section outline is generated with roughly one section per ~4,000 characters of target length.
• Each section is written against the brief, the outline, and a deny-list of fact claims used in earlier sections so the same anecdote doesn’t turn up twice.
• Mandatory topics are explicitly required in the writing prompt; sections target their allocated character count to within roughly ±10%.
• Style is deadpan and direct, with banned-pattern enforcement — an audited list of overused tics that the writer is told not to use.
• The script is saved to disk after each section, so a crash mid-way only loses the in-flight section, not the whole run.
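The weighted allocation score uses the weights stated above (importance 0.5, source count 0.3, verification count 0.2); the proportional split with a floor is an illustrative sketch of how those scores might drive per-subtopic fact counts.

```python
def allocation_score(fact: dict) -> float:
    """Weighted score: importance x 0.5 + sources x 0.3 + verifications x 0.2."""
    return (fact["importance"] * 0.5
            + len(fact["source_ids"]) * 0.3
            + fact["verifications"] * 0.2)

def allocate_counts(scores: dict, total: int, floor: int = 2) -> dict:
    """Split a total fact budget across subtopics proportionally to their
    aggregate scores, guaranteeing each subtopic at least `floor` facts."""
    counts = {s: floor for s in scores}
    remaining = total - floor * len(scores)
    score_sum = sum(scores.values()) or 1.0
    for s, sc in scores.items():
        counts[s] += int(remaining * sc / score_sum)
    return counts
```

The floor is the "no starved subtopic" guarantee: even a weakly scored subtopic that the research phase discovered still gets enough facts to support a section.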
• SSML breaks, prosody slowdowns, and emphasis tags are inserted for comic timing and weight.
• Inline tone cues like [deadpan], [slowly], and [pause] are added for the voice persona to interpret, then stripped before TTS.
• The script is split at chapter boundaries; each chapter is narrated as its own MP3.
• Provider is chosen per voice: Gemini, ElevenLabs, or Cartesia. The narrator gets a persona prompt for character.
• Chunks are quality-checked for length drift and tail-truncation before being concatenated. Sample rates are normalized via ffmpeg if providers disagree.
• A peaks waveform is decoded and saved alongside the MP3 for the player UI.
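A cheap version of the length-drift check: compare each chunk's audio duration against what its text length predicts. The speaking rate (15 chars/sec) and tolerance (±30%) are illustrative assumptions, not the pipeline's real thresholds.

```python
def chunk_ok(text_chars: int, audio_seconds: float,
             chars_per_second: float = 15.0, tolerance: float = 0.3) -> bool:
    """Flag TTS chunks whose duration drifts too far from what the text
    length predicts; a large shortfall usually means tail truncation."""
    expected = text_chars / chars_per_second
    return abs(audio_seconds - expected) <= tolerance * expected
```

A chunk that fails this check gets re-synthesized before concatenation, which is much cheaper than discovering a clipped sentence in the published episode.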
• Notes: structured markdown — one section per chapter, 5–10 bullets each, key terms bolded, with a closing “Key Takeaways” block.
• Cheat sheet: a compressed mental model — the hook, the entities, a 4–6 beat narrative spine, typed connections between entities, and the most quotable anchor facts.
• The MP3, peaks file, notes, cheat sheet, and source audit are uploaded to Cloudflare R2.
• A vanity slug is reserved on educast.cc and the share is reachable at a clean URL.
• The landing page is fully self-contained — player, waveform, chapter nav, offline cache, tabs — no accounts, no tracking, no app to install.
The verification phase is the part that earns the “deep research” in the name. A fact that surfaces once in one source is treated as provisional. A fact that survives an independent re-check — with the LLM looking at a different set of sources than the ones that originally produced it — gets a higher confidence score and goes into the script with corroboration count attached.
Every share page exposes this audit trail: for each fact, its verdict, its adjusted confidence score, and the sources that corroborated it.
This isn’t a guarantee of truth — nothing on the open web is — but it’s a defensible epistemic floor. You can listen, then verify, then disagree, and the receipts are right there in the same page.
The writer detects language hints in the prompt itself. Say “in Hungarian”, “auf Deutsch”, “in Lithuanian”, or anything similar, and the entire script is written in that language. Supported languages are bounded by the underlying LLM (which speaks 100+) and by the TTS provider you pick — Gemini and ElevenLabs both ship multilingual voices. There is no hardcoded language list to fall off the edge of.
• Narrated MP3 with a precise peaks waveform, variable speed, chapter markers, and offline caching for road trips and flights.
• Structured markdown notes the audio was written from — readable as a standalone briefing.
• A compressed mental model: the hook, the entities, the spine of the argument, and the anchor facts.
• Every source the research touched, with a credibility audit trail you can verify yourself.
educast.cc is built by Ben Racz — engineer working on AI agent systems and tools that respect attention. Reach out, give feedback, or follow the work at benracz.com.