Principal Speech Scientist
Zyoin Group
Job Description
Location: Hyderabad - Work From Office (5 Days)
Experience: 7-12 Years
Domain: Speech AI, NLP, Machine Learning, Deep Learning

Role Overview
We are looking for a highly experienced Principal Speech Scientist to lead innovation in speech and language technologies. This is a senior individual contributor role with strong technical leadership responsibilities, focused on defining research direction, mentoring teams, and building production-ready speech AI solutions at scale.

Key Responsibilities

Research Strategy & Technical Leadership
- Define and execute the long-term technical vision and roadmap for speech science research
- Drive innovation by translating state-of-the-art academic research into scalable production solutions
- Provide technical leadership and guide teams on best practices in model development and experimentation

Speech & Language Model Development
- Design and develop advanced speech systems including ASR, TTS, speaker verification, diarization, and speech enhancement
- Build high-performance models for speech translation and natural language understanding
- Develop scalable model pipelines and optimize model performance

Cross-Functional Collaboration
- Work closely with product, engineering, and design teams to align research initiatives with business goals
- Influence product direction through applied research and technical insights
- Contribute to architectural decisions related to AI-driven features

Mentorship & Thought Leadership
- Mentor scientists and engineers, fostering technical excellence and a culture of innovation
- Represent the organization at conferences and research forums
- Contribute to publications, patents, and knowledge sharing within the AI research community

Experimentation & Model Optimization
- Establish best practices for experimentation, evaluation metrics, and data pipeline development
- Lead initiatives for model optimization, scalability, and deployment readiness
- Support strategic decisions related to technology adoption and partnerships

Required Skills & Qualifications
- PhD or Master's degree in Speech Processing, Computational Linguistics, Electrical Engineering, Computer Science, or related field
- 7+ years of industry experience in speech and audio processing
- Strong expertise in ASR, TTS, speaker diarization, speech enhancement, or speech translation
- Strong programming skills in Python with experience in PyTorch or TensorFlow
- Experience with large-scale model training and distributed systems
- Ability to convert complex business problems into structured research solutions
- Strong communication skills and experience mentoring technical teams

Nice to Have
- Experience with large language models and multimodal AI architectures
- Experience optimizing models for edge or on-device deployment
- Research publications in leading speech or AI conferences
- Patent contributions in speech or audio technology