Voice AI Expert
Upwork

Remoto
•1 month ago
•No application
About
Voice AI Expert – Help Us Build Voice Command Feature About the Project We’re a healthcare SaaS team developing a secure, cloud-based platform that improves operational efficiency across hospitals and labs. Our next goal is to integrate voice command capabilities to allow users to perform key actions by speaking naturally and receive a voice confirmation in response. We’re looking for an expert in speech or voice AI who can help us evaluate technologies, design the architecture, and implement the first version of this feature with our existing engineering team. Engagement Details - Type: Part-time / 20 hrs per week What We Need Help With - Evaluate and recommend the best technologies for wake word, Speech-to-Text (STT), Text-to-Speech (TTS), and intent recognition. - Help us design and build a working voice command flow (voice → command → confirmation). - Work closely with our existing React Native frontend and FastAPI backend developers. - Advise on accuracy, latency, and integration best practices for production readiness. What We’re Looking For - Proven experience in Speech-to-Text / Voice AI / NLP. - Strong understanding of tools or APIs such as Whisper, Google STT, AWS Transcribe, or Picovoice Porcupine. - Ability to guide, mentor, and work hands-on during implementation. - Strong communication and collaboration skills with technical teams. Nice to Have: - Experience with healthcare or regulated SaaS systems. - Experience with FastAPI and AWS. - Familiarity with AI/LLM-driven command understanding. What You’ll Deliver - A functional voice command feature connected to our existing mobile and backend systems. - Real-time speech-to-action flow (voice → system command → confirmation). - Clear documentation and recommendations for production rollout. - Guidance for scaling, accuracy improvements, and future AI integrations.





