Senior PM at Amazon · Side-project builder · AI + Education

A multimodal AI agent for parents and teachers.

An AI system that turns a single educational moment into two outputs: instant retrieval-backed guidance for the parent, and a personalized narrated story for the child. Currently deployed at Apple Montessori in New Jersey.

3–5s
RAG retrieval guidance
60–90s
Full narrated lesson
6 layers
End-to-end pipeline
Runtime Architecture

Fast guidance first. Full lesson second.

Two user-speed lanes instead of one long blocking workflow.

Fast lane
3–5s

Retrieval-backed parent guidance returned while the full lesson is still generating.

Parent input Structured context Postgres + pgvector Guidance card
Async lane
60–90s

Narrated, illustrated story generation with durable state, moderation gates, and retry recovery.

Job queue Gemini planning ElevenLabs voice Imagen art SwiftUI delivery
Why retrieval for the fast path

Generation is too slow for the "my kid is melting down right now" use case. Retrieval over a curated corpus returns grounded help in seconds, not minutes.

Handling partial failures

TTS or image generation can fail mid-pipeline. Jobs persist state at each stage so the worker can resume from the last checkpoint instead of restarting.

Moderation as a pipeline stage

Content safety runs as a gate between generation and delivery — not a post-hoc filter. Stories that fail safety checks never reach TTS.

System Architecture

End-to-end production stack.

Six layers from client to infrastructure, each handling a specific concern.

Client

SwiftUI with auth gating and async job UX

Handles session readiness, request shaping, polling, and smoothed progress so the experience feels responsive while generation runs off-request.

SwiftUI · async/await · progress polling
Edge + Identity

Supabase auth, edge ingress, and storage

Edge Functions own low-latency ingress, user validation, and secure handoff into the async runtime.

Supabase Auth · Edge Functions · Storage
Retrieval

Postgres + pgvector for grounded guidance

Generates embeddings, runs RPC similarity search over a curated parenting corpus, and returns structured, retrieval-backed help.

Gemini embeddings · Postgres · pgvector · RPC
Async Runtime

FastAPI worker with durable job state

Queued jobs with progress updates, moderation gates, graceful degradation, and retry handling across generation stages.

FastAPI · task queue · job state machine
Generation

Multi-modal story and lesson generation

Model outputs become typed intermediate artifacts, then turn into narration and visuals while preserving the educational lesson and age fit.

Gemini · ElevenLabs · Imagen
Delivery + Infra

Cloud deploy, push notifications, asset CDN

Separates fast edge interactions from long-running compute. Assets are stored and pushed to the mobile client on completion.

Railway · APNs · Supabase Storage
Product Screens

What the product looks like in use.

A fast guidance lane up front, then a replayable library of narrated lessons.

Story creation screen
Create. Parents choose the format: SOS guidance, academic story, or character-building lesson.
Ask EE guidance screen
Guide. The fast lane returns grounded parenting help while the full story generates.
Story library screen
Deliver. Narrated, illustrated lessons families can replay anytime.