
Clariox
React 18TypeScriptFastAPIPythonOpenAI GPT-4Azure Speech ServicesDeepgramPostgreSQL (Neon)Cloudflare R2StripeGoogle Cloud Run
Clariox is an AI-powered speech coaching platform that helps professionals sharpen their communication skills through simulated meetings, real-time feedback, and detailed speech analytics. I led the project from initial proof of concept through MVP and into production release, owning product vision, system architecture, and hands-on engineering.
Role: CEO, CTO & Lead Architect
Stage: PoC → MVP → Production
Leadership & Architecture
- Founded and led the product from concept validation to production-ready MVP, making all critical technical and product decisions
- Designed the full-stack architecture: async Python backend (FastAPI) with a React/TypeScript SPA, connected via REST APIs and WebSocket channels for real-time coaching sessions
- Architected a multi-provider AI pipeline integrating OpenAI (GPT-4 for coaching, Whisper for transcription), Azure Cognitive Services (pronunciation & prosody assessment), and Deepgram (real-time STT), with a factory pattern enabling seamless provider switching
- Designed an event-driven WebSocket system powering three distinct real-time session types: live coaching dialogue, meeting simulation with AI-generated topics, and comprehensive speech workshops with pronunciation scoring
- Built a modular service layer with clean separation of concerns: transcription, text-to-speech, speech metrics, grammar analysis, and AI coaching, each independently testable and replaceable
- Implemented scalable storage architecture using Cloudflare R2 with chunked video uploads and presigned URLs for secure playback
- Established the data model and user isolation strategy on PostgreSQL (Neon), ensuring all data is scoped per user via Clerk JWT authentication
Key Technical Achievements
- Real-time speech analytics: live WPM, filler word detection, pause analysis, and volume tracking streamed to the frontend during practice sessions
- AI-powered feedback engine: automated scoring for clarity, structure, grammar, pronunciation accuracy, fluency, and completeness with actionable improvement suggestions
- Subscription billing: full Stripe integration with free/pro tiers, webhook-driven status management, and customer portal
- Production deployment on Google Cloud Run with CI/CD pipeline