Voice Journal
AI-Powered Voice-to-Text Journaling App with Transcription, Summaries & Weekly Insights
Deliverables / Skills Utilized
π The Problem
Journaling is powerful, but typing on a phone kills the flow. Thoughts come fastest when you're walking, commuting, or lying in bedβmoments when a keyboard is the last thing you want.
Voice memos solve the capture problem but create a new one: recordings pile up, unsearchable and unreviewed. You end up with hundreds of audio files and zero actionable insights.
π‘ The Solution
A mobile-first voice journal that turns speech into structured, searchable entries through an AI pipeline: Record β Transcribe β Summarize β Tag β Analyze.
Every entry gets an AI-generated summary, mood tag, and topic classificationβmaking weeks of thoughts instantly searchable and reviewable through weekly insight reports.



π οΈ Technical Implementation
ποΈ Recording Engine
- Built on Expo AV for cross-platform audio capture
- Pause/resume support for natural, unstructured thoughts
- Background recording with lock-screen controls
- Configurable audio quality and format settings
π§ AI Processing Pipeline
- Step 1 β Transcription: OpenAI Whisper API converts audio to text with high accuracy
- Step 2 β Summarization: GPT-4o-mini condenses transcripts into concise journal entries
- Step 3 β Auto-Tagging: AI extracts topics, people, and themes from content
- Step 4 β Mood Detection: Sentiment analysis classifies emotional tone per entry
πΎ SQLite Schema & Migrations
- Normalized schema:
entries,tags,entry_tags,weekly_summaries - Version-tracked migrations for safe schema evolution
- Full-text search index on transcripts and summaries
- Efficient pagination for large journal histories
π Privacy & Security
- Biometric authentication (Face ID / fingerprint) on app launch
- All data stored locally on-device via SQLite
- Audio files encrypted at rest
- Zero cloud dependencyβno account required
π― Key Features
- ποΈ Voice Recording: Tap-to-record with pause/resume and waveform visualization
- π AI Transcription: Whisper-powered speech-to-text with punctuation and formatting
- π Smart Summaries: GPT-4o-mini distills rambling thoughts into clear entries
- π·οΈ Auto-Tagging: Topics, people, and themes extracted automatically
- π Mood Detection: Emotional tone tracked across entries for pattern recognition
- π Full-Text Search: Find any thought by keyword across all entries
- π Weekly Insights: AI-generated analytics on mood trends, top topics, and journaling streaks
- π΄ Offline-First: Works without internet; AI processing queues until connected



π Technical Highlights
| Metric | Detail |
|---|---|
| π§ AI Pipeline Steps | 4 (Transcribe β Summarize β Tag β Mood) |
| πΎ Database Tables | 4 (entries, tags, entry_tags, weekly_summaries) |
| π Search Latency | < 100ms full-text across all entries |
| π Auth Method | Biometric (Face ID / Fingerprint) |
| π΄ Offline Support | Full β queues AI processing for sync |
π± Cross-Platform: iOS & Android




π― Why This Matters
This project demonstrates:
- π± Mobile Development: Production React Native with Expo, handling audio, background tasks, and native APIs
- π€ AI Integration: Multi-step pipeline chaining Whisper and GPT-4o-mini for structured output
- πΎ Local-First Architecture: SQLite with migrations, FTS indexing, and encrypted storage
- π¨ Custom Design System: Purpose-built UI components for a journaling experience
- π Privacy Engineering: Zero-cloud architecture with biometric auth and on-device encryption
π§ Stack
- React Native / Expo: Cross-platform mobile framework
- TypeScript: Type-safe development across the entire codebase
- SQLite: Local-first storage with full-text search
- OpenAI Whisper: Speech-to-text transcription API
- GPT-4o-mini: Summarization, tagging, and mood analysis
- Expo AV: Audio recording and playback engine